[Corpora-List] POS Tagger for German / Java

Ciar án Ó Duibhín ciaran at oduibhin.freeserve.co.uk
Wed Jan 10 13:49:00 UTC 2007


Michael,
I cannot judge how good it is, but you might look at the Stuttgart Tree
Tagger
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagg
er.html which has been trained for German (among other languages).
I think it is written in C, but I'm not sure of that.
Ciarán Ó Duibhín.

----- Original Message -----
From: "Michael Sonntag" <sonntag_michael at hotmail.com>
To: <CORPORA at UIB.NO>
Sent: Tuesday, January 09, 2007 7:51 PM
Subject: [Corpora-List] POS Tagger for German / Java


> Hi all,
>
> I am currently working on a system for toponym recognition in natural
german
> (web-based) text documents, as my master thesis.
> The system uses a POS tagger for extracting good NE candidates for a
> gazetteer.
>
> Now, here my question arises
> 1. Do you know of any good POS tagger for German language, best
Java-based?
> (I need only the NE-tagged tokens.)
> 2. I used tnt, but that one is based on perl/C, and it is not easy to
> integrate into my java framework.
> 3. I also used qtag. But it comes only with a, for my task too small data
> base (lexicon and matrix).
>
> So, is there any POS tagger out there that is easy to use and up for the
> task?
>
> Cheers & thx for listening in, yours
> Mike Sonntag
>
> _________________________________________________________________
> Sie suchen E-Mails, Dokumente oder Fotos? Die neue MSN Suche Toolbar mit
> Windows-Desktopsuche liefert in sekundenschnelle Ergebnisse. Jetzt neu!
> http://desktop.msn.de/ Jetzt gratis downloaden!
>
>
>



More information about the Corpora mailing list