[Corpora-List] ANN: First public release of Morphix-NLP
Zhang Le
ejoy at xinhuanet.com
Tue Nov 11 15:58:29 UTC 2003
Hi all,
I'm pleased to announce that the first public release of Morphix-NLP
Live CD is now available for download.
What is Morphix-NLP?
====================
Morphix-NLP is a Live CD Linux distribution with a rich collection of
Natural Language Processing (NLP) applications. Though the field of
NLP has undergone decades of intensive research, software designed in
the NLP community are often scattered around the net and are not
known by the larger computer user community. Consequently, most NLP
software can not be found in mainstream distributions even years
after the first public release.
The purpose of this CD is twofold:
* In the first place, it tries to break the software acquisition and
installation barrier facing many researchers and students in the
NLP community by providing most NLP related software on a single
Live CD.
* In the second place, the CD can be used to promote Natural
Language Processing among average computer users. Simply plugging
the CD into cd-drive and watching some NLP applications in action,
most users will get some knowledge of Natural Language Processing
and what NLP can do.
System Requirements
===================
x86 machine with more than 96 MB RAM plus a bootable CD-ROM (VMware is
ok).
What is included on this CD?
============================
A broad range of NLP software are included for performing common NLP tasks
including:
Tokenizers:
Qtoken, MXTERMINATOR, Chinese word segmenters...
POS Taggers:
Brill's TBL Tagger, MXPOST, fnTBL tagger, QTag, Tree-Tagger,
Memory-based Tagger...
Parsers:
Collins' Parser, Link Parser, LoPar...
Language Modeling Tools:
CMU SLM toolkit, Trigger Toolkit, Ngram Statistics Package...
Speech Software:
Festival Speech Synthesis System
Develpment Tools:
SVM-light, Maxent, SNoW, TiMBL, fnTBL
Other software:
WordNet Browser 2.0, Word Concordance program (antconc), unaccent,
and many other software...
All software are well tested and documented. More software will be
included in next release.
The compressed ISO image is only 448MB (with kernel 2.4, XFACE, gimp1.3,
gcc3.2, XFree86-4...), leaving plenty room for future extension. One can
easily add extra personal data (demo software, corpus...) on the CD
before burning it.
Where to get it?
================
Current location of the CD is:
http://www.nlplab.cn/zhangle/morphix-nlp/
Online Manual:
http://www.nlplab.cn/zhangle/morphix-nlp/manual/
ISO Image:
http://f4f.ivyol.com/morphix-nlp/morphix-nlp-1.1.iso
http://f4f.ivyol.com/morphix-nlp/morphix-nlp-1.1.iso.md5
Comments, suggestions and bug reports are always welcome :-)
Have fun!
--
Zhang Le
Natural Language Processing Lab
Northeastern University, P.R.China
http://www.nlplab.cn/zhangle/
More information about the Corpora
mailing list