[Corpora-List] Open source HMM POS tagger

Fatemeh Torabi Asr torabiasr at gmail.com
Mon Jan 7 10:37:41 UTC 2013


Dear all,

I'm looking for an open source efficient HMM POS tagger to run it for
something like an artificial language. I would like it to be configurable
for different sizes of N-grams, taking the list of possible tags and a
dictionary (small tagged corpus) and then could be trained on a large
corpus of un-annotated text.
I also wonder if any of the existing *HMM-based* POS taggers consider word
features (not only the word content but instead a feature vector of the
observable properties of the word in the un-labled text, e.g., some
semantic features attached to the word frame). So, it would be great if an
state-of-the-art HMM tagger implementation is already available considering
such a representation of the states.

Best,
Fatemeh
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20130107/f673684d/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list