[Corpora] [Corpora-List] Language identification tool

Valerio Basile v.basile at rug.nl
Fri Nov 14 17:21:26 UTC 2014


> is any of you aware of a language identification tool that covers at
least the EU official languages.
> Preferably a stand alone application.

I'd like to throw in TextCat:

  http://odur.let.rug.nl/~vannoord/TextCat/

It's a Perl script, and it supports 76 languages, the complete list is on
the website.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20141114/952b68c4/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list