[Corpora-List] ANN: First public release of Morphix-NLP

Zhang Le ejoy at xinhuanet.com
Tue Nov 11 15:58:29 UTC 2003


Hi all,
  I'm pleased to announce that the first public release of Morphix-NLP
  Live CD is now available for download.

What is Morphix-NLP?
====================
   Morphix-NLP is a Live CD Linux distribution with a rich collection of
   Natural Language Processing (NLP) applications. Though the field of
   NLP has undergone decades of intensive research, software designed in
   the NLP community are often scattered around the net and are not
   known by the larger computer user community. Consequently, most NLP
   software can not be found in mainstream distributions even years
   after the first public release.

   The purpose of this CD is twofold:
     * In the first place, it tries to break the software acquisition and
       installation barrier facing many researchers and students in the
       NLP community by providing most NLP related software on a single
       Live CD.
     * In the second place, the CD can be used to promote Natural
       Language Processing among average computer users. Simply plugging
       the CD into cd-drive and watching some NLP applications in action,
       most users will get some knowledge of Natural Language Processing
       and what NLP can do.

System Requirements
===================
x86 machine with more than 96 MB RAM plus a bootable CD-ROM (VMware is
ok).


What is included on this CD?
============================
A broad range of NLP software are included for performing common NLP tasks
including:

Tokenizers:
    Qtoken, MXTERMINATOR, Chinese word segmenters...

POS Taggers:
    Brill's TBL Tagger, MXPOST, fnTBL tagger, QTag, Tree-Tagger,
    Memory-based Tagger...

Parsers:
    Collins' Parser, Link Parser, LoPar...

Language Modeling Tools:
    CMU SLM toolkit, Trigger Toolkit, Ngram Statistics Package...

Speech Software:
    Festival Speech Synthesis System

Develpment Tools:
    SVM-light, Maxent, SNoW, TiMBL, fnTBL

Other software:
    WordNet Browser 2.0, Word Concordance program (antconc), unaccent,
    and many other software...

All software are well tested and documented.  More software will be
included in next release.

The compressed ISO image is only 448MB (with kernel 2.4, XFACE, gimp1.3,
gcc3.2, XFree86-4...), leaving plenty room for future extension. One can
easily add extra personal data (demo software, corpus...) on the CD
before burning it.

Where to get it?
================
Current location of the CD is:
http://www.nlplab.cn/zhangle/morphix-nlp/

Online Manual:
http://www.nlplab.cn/zhangle/morphix-nlp/manual/

ISO Image:
http://f4f.ivyol.com/morphix-nlp/morphix-nlp-1.1.iso
http://f4f.ivyol.com/morphix-nlp/morphix-nlp-1.1.iso.md5

Comments, suggestions and bug reports are always welcome :-)

Have fun!

--
Zhang Le
Natural Language Processing Lab
Northeastern University, P.R.China
http://www.nlplab.cn/zhangle/



More information about the Corpora mailing list