Corpora: brill tagger
Adam Przepiórkowski
adamp at ipipan.waw.pl
Thu Mar 14 15:50:49 UTC 2002
> I was wondering if anyone knew what tagset Eric Brill's Transformation-based learning Tagger used, or whether it has its own tagset.
The tagger can be trained on any corpus with any tagset, but the
pre-trained version was trained on WSJ, as far as I remember, and so
it assumes the UPenn Treebank tagset:
http://www.comp.leeds.ac.uk/amalgam/tagsets/upenn.html
--
Adam P.