Ressources: BBN Named Entity annotation of 15 million word OANC now available

Thierry Hamon thierry.hamon at UNIV-PARIS13.FR
Wed Nov 10 19:57:17 UTC 2010


Date: Tue, 9 Nov 2010 12:03:22 -0500
From: Nancy Ide <ide at cs.vassar.edu>
Message-Id: <B7DE9E2E-C24B-4620-AF64-3281F0088E4D at cs.vassar.edu>
X-url: http://www.anc.org
X-url: http://www.aclweb.org/anthology-new/N/N04/N04-1043.pdf
X-url: http://www.anc.org:8080/ANC2Go
X-url: http://www.anc.org/contribute.html

 *******************************************************************
     BBN Named Entity annotation of the 15 million word Open ANC
 *******************************************************************
                          http://www.anc.org

The American National Corpus (ANC) project has received a contribution
of named entity annotation for the entire 15 million words of the Open
American National Corpus, which is now freely available for download
from the ANC website. The annotations were automatically produced by
the BBN named entity tagger (see
http://www.aclweb.org/anthology-new/N/N04/N04-1043.pdf) and
contributed by Sameer Pradhan. The download contains the OANC texts,
respecting the OANC directory structure, with inline annotations in an
XML-like format.

The ANC project is in the process of generating a version of these
annotations in standoff GrAF format so that they may be combined with
other OANC annotations using the ANC2Go web application
http://www.anc.org:8080/ANC2Go) or the stand-alone ANCTool.

The ANC welcomes contributions of both annotations and texts, which we
release for free download by the community from our website. ANC,
OANC, and MASC data and annotations are or will be also distributed
through the Linguistic Data Consortium. To contribute, send email to
anc at anc.org or consult http://www.anc.org/contribute.html.

 =======================================================================
THE ANC PROJECT IS COMMITTED TO OPEN DATA FOR LANGUAGE RESEARCH,
DEVELOPMENT, AND EDUCATION. ALL CONTRIBUTIONS OF BOTH DATA AND
ANNOTATIONS SHOULD BE UNENCUMBERED BY LICENSING RESTRICTIONS. ALL
CONTRIBUTIONS ARE MADE FREELY AVAILABLE FOR USE BY THE COMMUNITY.
 =======================================================================

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version       : 
Archives                 : http://listserv.linguistlist.org/archives/ln.html
                                http://liste.cines.fr/info/ln

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  : http://www.atala.org/
-------------------------------------------------------------------------



More information about the Ln mailing list