20.3020, Software: ELRA: Distribution Agreement signed for BioLexicon
linguist at LINGUISTLIST.ORG
linguist at LINGUISTLIST.ORG
Wed Sep 9 02:17:28 UTC 2009
LINGUIST List: Vol-20-3020. Tue Sep 08 2009. ISSN: 1068 - 4875.
Subject: 20.3020, Software: ELRA: Distribution Agreement signed for BioLexicon
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Randall Eggert, U of Utah
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.
Editor for this issue: Hannah Morales <hannah at linguistlist.org>
================================================================
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
===========================Directory==============================
1)
Date: 08-Sep-2009
From: Helene Mazo < mazo at elda.org >
Subject: ELRA: Distribution Agreement signed for BioLexicon
-------------------------Message 1 ----------------------------------
Date: Tue, 08 Sep 2009 22:14:47
From: Helene Mazo [mazo at elda.org]
Subject: ELRA: Distribution Agreement signed for BioLexicon
E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=20-3020.html&submissionid=229367&topicid=13&msgnumber=1
ELRA together with the European Bioinformatics Institute (EBI, Hinxton,
UK), Istituto di Linguistica Computazionale-Consiglio Nazionale Ricerche
(ILC-CNR, Pisa, Italy), and the National Centre for Text Mining (NaCTeM,
University of Manchester, UK) has signed a Language Resources distribution
agreement for a large-scale English language terminological resource in the
biomedical domain: BioLexicon.
Biological terminology is a frequent cause of analysis errors when
processing literature written in the biology domain, due largely to the
high degree of variation in term forms, to the frequent mismatches between
labels of controlled vocabularies and ontologies on the one hand and the
forms actually occurring in text on the other, and to the lack of detailed
formal information on the linguistic behavior of domain terms. For example,
'retro-regulate' is a terminological verb often used in molecular biology
but it is not included in conventional dictionaries.
BioLexicon is a linguistic resource for the biology domain, tailored to
cope with these problems. It contains information on:
- terminological nouns, including nominalised verbs and proper names (e.g.
gene names)
- terminological adjectives
- terminological adverbs
- terminological verbs
- general English words frequently used in the biology domain
Existing information on terms was integrated, augmented, complemented and
linked, through processing of massive amounts of biomedical text, to yield
inter alia over 2.2M entries, and information on over 1.8M variants and on
over 2M synonymy relations. Moreover, extensive information is provided on
how verbs and nominalised verbs in the domain behave at both syntactic and
semantic levels, supporting thus applications aiming at discovery of
relations and events involving biological entities in text. This
comprehensive coverage of biological terms makes BioLexicon a unique
linguistic resource within the domain. It is primarily intended to support
text mining and information retrieval in the biomedical domain, however its
standards-based structure and rich content make it a valuable resource for
many other kinds of application.
On behalf of ELRA, ELDA will act as the distribution agency by
incorporating the BioLexicon in the ELRA Language Resources catalogue. With
these resources, ELRA is willing to extend the current catalogue, by
offering specialized resources and thus allow a better coverage of the
language. For more information on BioLexicon (catalogue reference:
ELRA-S0373): http://catalog.elra.info/product_info.php?products_id=1113
For more information on the ELRA catalogue, please contact Valérie Mapelli,
mapelli at elda.org
For more information on ELRA & ELDA, please contact Khalid Choukri,
choukri at elda.org; Hélène Mazo, mazo at elda.org
ELDA
55-57, rue Brillat Savarin
75013 Paris (France)
Tel.: +33 1 43 13 33 33
Fax: +33 1 43 13 33 30
ELRA is a non-profit making organisation founded by the European Commission
in 1995, with the mission of providing a clearing house for language
resources and promoting Human Language Technologies (HLT). To find out more
about ELRA, please visit http://www.elra.info.
ELDA is ELRA's operational body. ELDA identifies, collects, markets, and
distributes language resources, along with the dissemination of general
information in the field of HLT. ELDA also participates in some evaluation
projects and campaigns, has considerable knowledge and skills in HLT
applications and has participated in many French, European and
international projects. To find out more about ELDA, please visit
http://www.elda.org.
To find out more about BioLexicon partners, please visit:
EBI: http://www.ebi.ac.uk
ILC-CNR: http://www.ilc.cnr.it
NaCTeM: http://www.nactem.ac.uk
Linguistic Field(s): Computational Linguistics
Lexicography
Semantics
-----------------------------------------------------------
LINGUIST List: Vol-20-3020
More information about the LINGUIST
mailing list