I know lingPipe provides a HMM pos tagger which is open source.<div><br></div><div>Best,</div><div><br></div><div>Lushan<br><br><div class="gmail_quote">On Mon, Jan 7, 2013 at 5:37 AM, Fatemeh Torabi Asr <span dir="ltr"><<a href="mailto:torabiasr@gmail.com" target="_blank">torabiasr@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Dear all,<div><br></div><div>I'm looking for an open source efficient HMM POS tagger to run it for something like an artificial language. I would like it to be configurable for different sizes of N-grams, taking the list of possible tags and a dictionary (small tagged corpus) and then could be trained on a large corpus of un-annotated text. </div>
<div>I also wonder if any of the existing *HMM-based* POS taggers consider word features (not only the word content but instead a feature vector of the observable properties of the word in the un-labled text, e.g., some semantic features attached to the word frame). So, it would be great if an state-of-the-art HMM tagger implementation is already available considering such a representation of the states.</div>
<div><div><br></div>Best,<br>Fatemeh<div><br></div><div><span style="font-size:x-small"><font face="garamond, serif"><font color="#3333FF"><br></font></font></span></div><div><font color="#3333FF" face="garamond, serif"><span style="font-size:x-small"><br>
</span></font></div>
</div>
<br>_______________________________________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
<br></blockquote></div><br></div>