Corpora: brill tagger

Adam Przepiórkowski adamp at ipipan.waw.pl
Thu Mar 14 15:50:49 UTC 2002


> I was wondering if anyone new what tagset Eric Brill's Transformation-based learning Tagger used, or whether it has it's own tagset.

The tagger can be trained on any a corpus with any tagset, but the
pre-trained version was trained on WSJ, as far as I remember, and so
it assumes the UPenn Treebank tagset:

http://www.comp.leeds.ac.uk/amalgam/tagsets/upenn.html

--
	Adam P.



More information about the Corpora mailing list