[Corpora-List] Natural Language Toolkit: NLTK-Lite version 0.6.5 released
Diana Maynard
d.maynard at dcs.shef.ac.uk
Tue Jul 11 11:00:47 UTC 2006
A German specialisation of the GATE tokeniser does come with the
distribution, though someone may well have an improved version.
Regards
Diana
Hamish Cunningham wrote:
> Markus,
>
> You might try the unicode-based tokeniser included with GATE
> (http://gate.ac.uk), or ask on the user list for a German
> specialisation of it.
>
> Best