Corpora: ELRA News
Valerie Mapelli
mapelli at elda.fr
Tue Jul 25 10:17:24 UTC 2000
[ We apologise for the duplicate posting of this announcement ]
___________________________________________________________
ELRA
European Language Resources Association
ELRA News
___________________________________________________________
*** ELRA NEW RESOURCES ***
As a means of extending the Language Resources (LRs) offered
in the ELRA catalogue to include more Asian LRs, ELRA has
reached an agreement for mutual collaboration with the Japan
Electronic Dictionary Research (EDR) Institute. Through this
agreement, ELRA is now able to distribute the EDR Electronic
Dictionary.
The EDR Electronic Dictionary is a result of combining the
information of conventional paper-based Japanese and English
dictionaries, thesauri and corpora. The words treated in the
dictionary include basic or commonly used words and technical
terms from the field of information processing. The EDR Electronic
Dictionary is subdivided into 8 distinct subdictionaries. Each
subdictionary shares the same basic design, including the record
number, headword information, co-occurrence constituent information,
syntactic information, semantic information, co-occurrence
situation information, and management information. A basic
descriptive format makes use of a portion of SGML (Standard
Generalized Mark-up Language), though it is not SGML.
The 8 subdictionaries are given below:
o The Japanese Word Dictionary (ref. ELRA-L0036), composed
of 260,000 Japanese word records arranged alphabetically
according to the Japanese syllabary.
o The English Word Dictionary (ref. ELRA-L0037), composed
of 190,000 English word records arranged alphabetically.
o The Concept Dictionary (ref. ELRA-L0038),which provides
400,000 concepts that are made reference to in the Japanese
and English Word Dictionaries (ref. ELRA-L0036 and L0037),
the Japanese-English and English-Japanese Bilingual Dictionaries
(ref. ELRA-M0023 and M0024) as well as in the Japanese and
English Co-occurrence Dictionaries (ref. ELRA-L0039 and L0040).
The Concept Dictionary is composed of three separate dictionaries:
- the Headconcept Dictionary gives a description of each
concept in words
- the Concept Classification Dictionary contains a classification
of concepts that have a super-sub relation
- the Concept Description Dictionary provides all other
information regarding the relation between concepts.
o The Japanese-English Bilingual Dictionary (ref. ELRA-M0023),
composed of 230,000 word records arranged alphabetically
according to the Japanese syllabary.
o The English-Japanese Bilingual Dictionary (ref. ELRA-M0024),
composed of 160,000 bilingual word records arranged alphabetically
according to the headword.
o The Japanese Co-occurrence Dictionary (ref. ELRA-L0039),
composed of 900,000 headphrase notations arranged according
to the Japanese syllabary. A Japanese corpus is appended to
this dictionary.
o The English Co-occurrence Dictionary (ref. ELRA-L0040),
composed of 460,000 alphabetically arranged of headphrases.
An English corpus is appended to this Dictionary.
o The Technical Terms Dictionary (Information processing)
(ref. ELRA-L0041) contains 80,000 technical terms in English
and 120,000 technical terms in Japanese from the field of information
processing.
=====================================
Contacts:
ELRA/ELDA Tel: +33 (0)1 43 13 33 33
55-57 rue Brillat-Savarin Fax: +33 (0)1 43 13 33 30
F-75013 Paris, France E-mail: mapelli at elda.fr
EDR - Japan Electronic Dictionary Research Institute, Ltd.
Akishima Building 4-34-7, Fax +81 (0)3 5951 3966
Ikebukuro, Toshima-Ku E-mail thoth at edr.co.jp
171-0014 Tokyo, Japan http://www.iijnet.or.jp/edr/
Visit the online catalogue on the ELDA Web site:
http://www.elda.fr
=====================================
More information about the Corpora
mailing list