[Corpora-List] PR ELRA: Distribution Agreement signed for BioLexicon
info at elda.org
Thu Sep 3 14:35:34 UTC 2009
Press Release - Immediate
Paris, France, 3 September 2009
Distribution Agreement signed for BioLexicon
ELRA together with the European Bioinformatics Institute (EBI, Hinxton,
UK), Istituto di Linguistica Computazionale-Consiglio Nazionale Ricerche
(ILC-CNR, Pisa, Italy), and the National Centre for Text Mining (NaCTeM,
University of Manchester, UK) has signed a Language Resources
distribution agreement for a large-scale English language terminological
resource in the biomedical domain: *BioLexicon.*
Biological terminology is a frequent cause of analysis errors when
processing literature written in the biology domain, due largely to the
high degree of variation in term forms, to the frequent mismatches
between labels of controlled vocabularies and ontologies on the one hand
and the forms actually occurring in text on the other, and to the lack
of detailed formal information on the linguistic behaviour of domain
terms. For example, "retro-regulate" is a terminological verb often
used in molecular biology but it is not included in conventional
dictionaries. BioLexicon is a linguistic resource for the biology
domain, tailored to cope with these problems. It contains information on:
- terminological nouns, including nominalised verbs and proper names
(e.g., gene names)
- terminological adjectives
- terminological adverbs
- terminological verbs
- general English words frequently used in the biology domain
Existing information on terms was integrated, augmented, complemented
and linked, through processing of massive amounts of biomedical text, to
yield inter alia over 2.2M entries, and information on over 1.8M
variants and on over 2M synonymy relations. Moreover, extensive
information is provided on how verbs and nominalised verbs in the domain
behave at both syntactic and semantic levels, thus supporting
applications aiming at the discovery of relations and events involving
biological entities in text.
This comprehensive coverage of biological terms makes BioLexicon a
unique linguistic resource within the domain. It is primarily intended
to support text mining and information retrieval in the biomedical
domain; however, its standards-based structure and rich content make it a
valuable resource for many other kinds of application.
On behalf of ELRA, ELDA will act as the distribution agency, by
incorporating the BioLexicon in the ELRA Language Resources catalogue.
With these resources, ELRA wishes to extend the current catalogue
by offering specialised resources, thus allowing better coverage of
the language.
For more information on BioLexicon (catalogue reference: ELRA-S0373):
http://catalog.elra.info/product_info.php?products_id=1113
For more information on the ELRA catalogue, please contact:
Valérie Mapelli, mapelli at elda.org
For more information on ELRA & ELDA, please contact:
Khalid Choukri, choukri at elda.org
Hélène Mazo, mazo at elda.org
ELDA
55-57, rue Brillat Savarin
75013 Paris (France)
Tel.: +33 1 43 13 33 33
Fax: +33 1 43 13 33 30
*** About ELRA ***
The European Language Resources Association (ELRA) is a non-profit-making
organisation founded by the European Commission in 1995, with the
mission of providing a clearing house for language resources and
promoting Human Language Technologies (HLT).
To find out more about ELRA, please visit our web site: http://www.elra.info
*** About ELDA ***
The Evaluation and Language resources Distribution Agency (ELDA) is
ELRA's operational body. ELDA identifies, collects, markets, and
distributes language resources, and disseminates general
information in the field of HLT. ELDA also participates in some
evaluation projects and campaigns, has considerable knowledge and skills
in HLT applications and has participated in many French, European and
international projects.
To find out more about ELDA, please visit our web site: http://www.elda.org
*** About the partners: EBI, ILC-CNR and NaCTeM ***
To find out more about BioLexicon partners, please visit the following
websites:
EBI: http://www.ebi.ac.uk
ILC-CNR: http://www.ilc.cnr.it
NaCTeM: http://www.nactem.ac.uk