[Corpora-List] Application for lemmatising corpora

jasper holmes jasper.holmes at gmail.com
Fri Mar 23 09:58:28 UTC 2007


You could try WMatrix: http://www.comp.lancs.ac.uk/ucrel/wmatrix/
You need to register for a username (one-month free trial), and then
everything is done online. It does tagging and lemmatising, plus some
analysis (frequencies, concordances, key words).

Jasper
http://go.warwick.ac.uk/BAWE


On 3/22/07, Oliver Strunk <strunk at ub.edu> wrote:
>
> Maybe the TreeTagger from IMS Stuttgart?
>
> http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html
>
> It is available for Linux and Windows; the output includes POS tags and
> lemmas, and can easily be converted.
>
> Oliver Strunk
>
> LADA – University of Barcelona
>
> From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
> Behalf Of Hunter, Duncan
> Sent: Thursday, March 22, 2007 11:45 PM
> To: corpora at uib.no
> Subject: [Corpora-List] Application for lemmatising corpora
>
> Hi All,
>
> Could anybody suggest a small, downloadable and free application for
> lemmatising texts? For various reasons I need the texts I am examining to be
> in lemmatised form before analysis with corpus tools. It's a small
> collection of texts, a few hundred shortish (article-sized) ones in text
> format.
>
> I've had some trouble with the software I'm using at the moment. It tends to
> 'stick' when given a formidable lemma list to process (I'm using Yasumasa
> Someya's fairly lengthy one).
>
> All the best,
>
> Duncan Hunter
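
On the point quoted above that TreeTagger output "can easily be converted": TreeTagger writes one token per line as tab-separated word form, POS tag and lemma, and marks lemmas it cannot find as <unknown>. A minimal sketch in Python of turning such output into lemmatised running text, assuming that three-column layout; the function name and the sample lines are made up for illustration, and falling back to the word form for unknown lemmas is just one possible choice:

```python
def lemmatised_text(tagged_lines):
    """Join the lemma column of TreeTagger-style lines (word<TAB>POS<TAB>lemma)."""
    lemmas = []
    for line in tagged_lines:
        parts = line.rstrip("\n").split("\t")
        if len(parts) != 3:
            # Skip blank lines and anything that is not a three-column token line.
            continue
        word, _pos, lemma = parts
        # Assumption: keep the original word form when no lemma was found.
        lemmas.append(word if lemma == "<unknown>" else lemma)
    return " ".join(lemmas)


# Hypothetical sample of tagger output, for illustration only.
sample = [
    "The\tDT\tthe",
    "cats\tNNS\tcat",
    "were\tVBD\tbe",
    "sleeping\tVVG\tsleep",
    "Zorblax\tNP\t<unknown>",
]
print(lemmatised_text(sample))
```

For a real file, the same function can be fed an open file object, since it only iterates over lines.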


