[Corpora-List] Annotation of internal structure of named entities

Michał Marcińczuk marcinczuk at gmail.com
Thu Jan 13 13:38:20 UTC 2011


Hi Grzegorz,

I can suggest two corpora for Polish (both are under development).
The first one is being developed on Wrocław University of Technology
and can be accessed here:
http://nlp.pwr.wroc.pl/inforex/index.php?corpus=7&page=browse
In this corpus all nested mentions are annotated. In case of person names
first names, surnames and given names are annotated.
The corpus will be published under the CC license.

The other corpus is NKJP http://nkjp.pl/.

Regards,
  Michał Marcińczuk


--
www.czuk.eu


2011/1/13 Grzegorz Chrupała <pitekus at gmail.com>

> Hi all,
>
> Does anyone know of a dataset where the internal structure of named
> entities is annotated? What I have in mind is for example parts of
> person names such as title, first name, initial, surname etc. I would
> be interested mostly in English, German, Spanish or French resources
> but other languages could also be OK.
>
> Best,
> --
> Grzegorz Chrupała
> Saarland University
> FR 7.4 Spoken Language Systems
> Building C7 1, Room 0.04
> 66041 Saarbrücken
> +49 681 302 58126
> gchrupala at lsv.uni-saarland.de
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20110113/f23fa686/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list