<div dir="ltr"><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">Hi Matias,</div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif"><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">
Yatbaz and his colleagues are working on Unsupervised POS induction and achieves state-of-the-art scores over ~16 languages. This paper is submitted to ACL '14. I am not sure how big your corpora will be but you may want to try it. </div>
<div class="gmail_default" style="font-family:arial,helvetica,sans-serif"><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif"><a href="https://github.com/ai-ku/upos_2014">https://github.com/ai-ku/upos_2014</a><br>
</div>
</div><div class="gmail_extra"><br><br><div class="gmail_quote">On Fri, Feb 14, 2014 at 10:58 AM, Matías Guzmán Naranjo <span dir="ltr"><<a href="mailto:mortem.dei@gmail.com" target="_blank">mortem.dei@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Thanks Grzegorz, I'll take a look. I don't really need the tagger to tell me whether a particular word is a verb or a noun, but to tell me which words appear to belong to the same grammatical class, whatever that class might be. I need to analyze examples taken from corpora of languages I don't know, so a bit of initial help would make things easier<br>
</div><div class="gmail_extra"><br><br><div class="gmail_quote">2014-02-14 9:52 GMT+01:00 Grzegorz Chrupała <span dir="ltr"><<a href="mailto:G.A.Chrupala@uvt.nl" target="_blank">G.A.Chrupala@uvt.nl</a>></span>:<div class="">
<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi Matías,<br>
<br>
I think fully unsupervised POS tagging isn't yet quite good enough to<br>
be useful for end users. But it depends on what exactly you need.<br>
<br>
Have a look at the papers from the shared task at the 2012 Workshop on<br>
Inducing Linguistic Structure:<br>
<a href="http://wiki.cs.ox.ac.uk/InducingLinguisticStructure/SharedTask" target="_blank">http://wiki.cs.ox.ac.uk/InducingLinguisticStructure/SharedTask</a><br>
--<br>
Grzegorz<br>
<br>
On Fri, Feb 14, 2014 at 12:16 AM, Matías Guzmán Naranjo<br>
<<a href="mailto:mortem.dei@gmail.com" target="_blank">mortem.dei@gmail.com</a>> wrote:<br>
> Dear all,<br>
><br>
> I would like to hear your opinions on which is/are the best unsupervised pos<br>
> tagger/s (preferably foss). I'm working with corpora for many different<br>
> languages for which there are no specific taggers developed and need to get<br>
> a basic idea about parts of speech.<br>
><br>
> Thanks!<br>
><br>
> Matías<br>
><br>
> _______________________________________________<br>
> UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>
> Corpora mailing list<br>
> <a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br>
> <a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
><br>
</blockquote></div></div><br></div>
<br>_______________________________________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
<br></blockquote></div><br></div>