21.2091, FYI: New Version: English-Universal Networking Lang Dictionary
linguist at LINGUISTLIST.ORG
linguist at LINGUISTLIST.ORG
Wed May 5 13:57:31 UTC 2010
LINGUIST List: Vol-21-2091. Wed May 05 2010. ISSN: 1068 - 4875.
Subject: 21.2091, FYI: New Version: English-Universal Networking Lang Dictionary
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Monica Macaulay, U of Wisconsin-Madison
Eric Raimy, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.
Editor for this issue: Danielle St. Jean <danielle at linguistlist.org>
================================================================
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.
===========================Directory==============================
1)
Date: 20-Apr-2010
From: Ronaldo Martins < r.martins at undlfoundation.org >
Subject: New Version: English-Universal Networking Lang Dictionary
-------------------------Message 1 ----------------------------------
Date: Wed, 05 May 2010 09:55:56
From: Ronaldo Martins [r.martins at undlfoundation.org]
Subject: New Version: English-Universal Networking Lang Dictionary
E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=21-2091.html&submissionid=2628214&topicid=6&msgnumber=1
The Universal Networking Digital Language (UNDL) Foundation has
released a new version of the English-Universal Networking Language
(UNL) dictionary. The English-UNL dictionary is a bidirectional
(ENG>UNL, UNL>ENG) machine-tractable lexical database comprising
more than 200,000 mappings between English and UNL. It brings
extensive information about lexical items of English, including
morphological structure, inflectional paradigms and subcategorization
frames, as well as semantic information about UNL entries. The
dictionary is available under an Attribution Share Alike (CC-BY-SA)
Creative Commons license at the UNLarium
(http://www.unlweb.net/unlarium).
In order to browse and to export the dictionary, users are expected to
sign in to the UNLweb (with their own account or as guests):
http://www.unlweb.net/user/index.php?page=login
Users can then go to the UNLarium, where they will find the option
Dictionary and, inside it, the English-UNL dictionary, along with others
that are still being provided.
How the English-UNL dictionary was created:
The English-UNL dictionary was mainly derived from a word list
extracted from the English WordNet 3.0, which was automatically
analyzed and humanly revised for lexical categories, lexical structure
(roots, affixes), part of speech, number (singular, plural, singulare
tantum, plurale tantum, invariant), valence, transitivity, inflectional
paradigms (for nouns and verbs) and subcategorization frames
(according to the X-bar theory). English entries were mapped onto
entries of the UNL dictionary (i.e., Universal Words [UWs]) and may be
freely exported in two different formats: generative, containing only
base forms and the corresponding generation (inflectional and
composition) rules; and enumerative, containing word forms and lexical
features. A sample of entries is presented below.
base form
[foot] {2883} "100284665" (POS=NOU, MOR=STE, LST=WRD,
NUM=SNG, INF=M1, FLX(PLR:="feet";)) ;
word forms
[foot] {2883} "100284665" (POS=NOU, MOR=WFO, LST=WRD,
NUM=SNG, INF=M1);
[feet] {2883} "100284665" (POS=NOU, MOR=WFO, LST=WRD,
NUM=PLR, INF=M1);
The English-UNL dictionary is generated in real time according to the
specifications and to the tagset described at the UNLwiki
(http://www.unlweb.net/wiki). As an ongoing project and a dynamic
database, the dictionary is subject to permanent augmentation and
improvement, and reports on problems and other contributions are
mostly welcome.
For further information, please contact:
Ronaldo Martins (r.martins at undlfoundation.org)
Language Resources Manager
UNDL Foundation
48, route de Chancy
CH-1213 - Geneva - Switzerland
+41 22 879 8090
What is UNL?
The UNL is an artificial language that has been used for several
different tasks in natural language processing, such as machine
translation, multilingual document generation, summarization,
information retrieval and semantic reasoning. It has been originally
proposed by the Institute of Advanced Studies of the United Nations
University, in Tokyo, and has been currently promoted by the UNDL
Foundation, in Geneva, Switzerland, under a mandate of the United
Nations. [read more about UNL in http://www.unlweb.net]
The UNDL Foundation:
The UNDL Foundation (http://www.undlfoundation.org) is a non-profit
organization based in Geneva, Switzerland, which has received from
the United Nations the mandate for implementing the Universal
Networking Language (UNL). The UNL Programme is a collaborative
effort to create natural language resources and technology to reduce
language barriers and strengthen cross-cultural communication in the
framework of the United Nations. Participation in the Programme is free
and open to individuals and institutions, either as researchers or as
developers. Special funds are available for some languages.
Linguistic Field(s): Computational Linguistics
-----------------------------------------------------------
LINGUIST List: Vol-21-2091
More information about the LINGUIST
mailing list