[Corpora-List] Application for lemmatising corpora
Oliver Strunk
strunk at ub.edu
Thu Mar 22 23:25:44 UTC 2007
Maybe the TreeTagger from IMS Stuttgart?
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html
It is available for Linux and Windows; the output includes POS tags and
lemmas and can easily be converted.
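
As a rough illustration of the conversion, here is a minimal Python sketch. It assumes the tagger was run so that each output line holds the token, its POS tag and its lemma separated by tabs (the layout I would expect with the -token and -lemma options, but do check your own output), and that words missing from the lexicon get the placeholder lemma "<unknown>". The function name and the sample sentence are made up for the example.

```python
def lemmatise(tagger_output: str) -> str:
    """Turn tagger lines (token<TAB>POS<TAB>lemma) into a lemmatised string."""
    lemmas = []
    for line in tagger_output.splitlines():
        parts = line.split("\t")
        if len(parts) != 3:
            # Skip blank lines or anything that is not a three-column row.
            continue
        token, _pos, lemma = parts
        # Assumption: unknown words carry the lemma "<unknown>";
        # fall back to the surface form in that case.
        lemmas.append(token if lemma == "<unknown>" else lemma)
    return " ".join(lemmas)


# Hypothetical sample in the assumed three-column layout.
sample = (
    "The\tDT\tthe\n"
    "cats\tNNS\tcat\n"
    "were\tVBD\tbe\n"
    "sleeping\tVVG\tsleep\n"
    ".\tSENT\t."
)

if __name__ == "__main__":
    print(lemmatise(sample))
```

The result is one lemma per token joined by spaces, which should be enough as plain-text input for most corpus tools; keeping the surface form for unknown words is a choice you may want to change.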
Oliver Strunk
LADA - University of Barcelona
From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of Hunter, Duncan
Sent: Thursday, March 22, 2007 11:45 PM
To: corpora at uib.no
Subject: [Corpora-List] Application for lemmatising corpora
Hi All,
Could anybody suggest a small, downloadable and free application for
lemmatising texts? For various reasons I need the texts I am examining to be
in lemmatised form before analysis with corpus tools. It's a small
collection of texts, a few hundred shortish (article-sized) ones in text
format.
I've had some trouble with the software I'm using at the moment. It tends to
'stick' when given a formidable lemma list to process (I'm using Yasumasa
Someya's fairly lengthy one).
All the best,
Duncan Hunter