[Corpora-List] NLTK 0.9.8 released

Steven Bird sb at csse.unimelb.edu.au
Tue Feb 24 23:01:35 UTC 2009


NLTK 0.9.8 has been released.

  http://www.nltk.org/download

This version contains a new off-the-shelf tokenizer, POS tagger, and
named-entity tagger.  A new metrics package includes inter-annotator
agreement scores and various distance and word association measures
(Tom Lippincott and Joel Nothman).  There's a new collocations package
(Joel Nothman).  There are many improvements to the WordNet package
and browser (Steven Bethard, Jordan Boyd-Graber, Paul Bone), and to
the semantics and inference packages (Dan Garrette).  The NLTK corpus
collection now includes the PE08 Parser Evaluation data, and the CoNLL
2007 Basque and Catalan Dependency Treebanks.  We have added an
interface for dependency treebanks.  Many chapters of the book have
been revised in response to feedback from readers.  For full details
see the ChangeLog
[http://nltk.googlecode.com/svn/trunk/nltk/ChangeLog].  NB some method
names have been changed for consistency and simplicity.  Use of old
names will generate deprecation warnings that indicate the correct
name to use.

-Steven Bird, Edward Loper and Ewan Klein

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list