Soft: BioYaTeA release announcement

Thierry Hamon thierry.hamon at UNIV-PARIS13.FR
Sat Apr 20 17:41:51 UTC 2013


Date: Wed, 17 Apr 2013 16:42:34 +0200
From: wiktoria <wiktoria.golik at jouy.inra.fr>
Message-ID: <516EB4DA.9010005 at jouy.inra.fr>
X-url: http://search.cpan.org/~bibliome/Lingua-BioYaTeA/
X-url: http://search.cpan.org/~thhamon/Lingua-YaTeA/

Dear colleagues,

We are pleased to announce the release of the open source term extractor
*BioYaTeA.*
It is now available on CPAN: 
http://search.cpan.org/~bibliome/Lingua-BioYaTeA/ 

Description:
BioYaTeA is a version of the YaTeA term extractor
(http://search.cpan.org/~thhamon/Lingua-YaTeA/ ) that has been adapted
for term extraction in the biology domain.  The extracted terms contain
noun and adjective phrases. The method is based on a morpho-syntactic
analysis and shallow parsing.  The innovative aspect of BioYaTeA is its
ability to handle participles and prepositional phrases and to filter
out irrelevant terms.  BioYaTeA has been shown to improve the accuracy
of information extraction in the BioNL-ST'11 BB Shared Task
(http://2013.bionlp-st.org/supporting-resources).

BioYaTeA takes as input POS-tagged text in TreeTagger or GeniaTagger
format.
The output is in the XML-BioYaTeA format. BioYaTeA also computes the
term position, its syntactic structure and its subterms.

For further details on the linguistic principles and the evaluation of
BioYaTeA, please consult the following paper, which will soon be
available online:

Golik Wiktoria, Robert Bossy, Ratkovic Zorana and Nédellec Claire. (To
appear in 2013). Improving Term Extraction with Linguistic Analysis in
the Biomedical Domain. Proceedings of the 14th International Conference
on Intelligent Text Processing and Computational Linguistics
(CICLing'13), Special Issue of the Journal Research in Computing
Science, ISSN 1870-4069, www.micai.org/rcs, 24-30 March, Samos, Greece,
2013.

BioYaTeA e-mail contact: wiktoria.golik at jouy.inra.fr,
robert.bossy at jouy.inra.fr

YaTeA e-mail contact: thierry.hamon at univ-paris13.fr

Best regards,

Wiktoria Golik
Bibliome group (INRA-MIG)

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version       : 
Archives                 : http://listserv.linguistlist.org/archives/ln.html
                                http://liste.cines.fr/info/ln

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  : http://www.atala.org/
-------------------------------------------------------------------------



More information about the Ln mailing list