21.4878, Software: NooJ: Finite-State Language Processing
Fri Dec 3 22:16:11 UTC 2010
LINGUIST List: Vol-21-4878. Fri Dec 03 2010. ISSN: 1068 - 4875.
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Monica Macaulay, U of Wisconsin-Madison
Eric Raimy, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.
Editor for this issue: Susanne Vejdemo <susanne at linguistlist.org>
Date: 03-Dec-2010
From: Chris Humphrey [chumphrey at c-s-p.org]
Subject: NooJ: Finite-State Language Processing
-------------------------Message 1 ----------------------------------
Date: Fri, 03 Dec 2010 17:15:14
NooJ is both a corpus processing tool and a linguistic development
environment. It allows linguists to formalize several levels of linguistic
phenomena: orthography and spelling; lexicons for simple words, multiword
units and frozen expressions; inflectional, derivational and productive
morphology; and local, structural and transformational syntax. For each
of these levels, NooJ provides one or more formal tools specifically
designed to facilitate the description of the phenomenon, as well as
parsing tools designed to be as computationally efficient as possible.
This approach distinguishes NooJ from most computational linguistic
tools, which provide a single formalism intended to describe everything.
As a corpus processing tool, NooJ allows users to apply sophisticated
linguistic queries to large corpora in order to build indices and
concordances, annotate texts automatically, perform statistical
analyses, etc.
NooJ is freely available, and linguistic modules can already be downloaded
for Acadian, Arabic, Armenian, Bulgarian, Catalan, Chinese, Croatian,
English, French, German, Greek, Hebrew, Hungarian, Italian, Polish,
Portuguese, Spanish and Turkish.
Linguistic Field(s): Morphology
                     Text/Corpus Linguistics
Subject Language(s): Armenian (hye)
                     Bulgarian (bul)
                     Chinese, Mandarin (cmn)
                     Catalan-Valencian-Balear (cat)
                     English (eng)
                     French (fra)
                     German, Standard (deu)
                     Greek (ell)
                     Hebrew (heb)
                     Hungarian (hun)
                     Italian (ita)
                     Portuguese (por)
                     Polish (pol)
                     Spanish (spa)
                     Turkish (tur)
                     Croatian (hrv)