[Corpora-List] Available: IcePaHC 0.5, 632,000 words

Anton Karl Ingason anton.karl.ingason at gmail.com
Tue Jul 12 10:58:45 UTC 2011


IcePaHC 0.5, the latest version of the Icelandic Parsed Historical Corpus,
is now available for download:

http://linguist.is/icelandic_treebank/Download

- 632,000 words in total, drawn from every century from the 12th through the
20th
- Annotated for phrase structure, part-of-speech tagged, and lemmatized
- LGPL license: You are free to copy, modify and redistribute the corpus for
research and/or profit

Joel C. Wallenberg (joel.wallenberg at gmail.com)
Anton Karl Ingason (anton.karl.ingason at gmail.com)
Einar Freyr Sigurðsson (einarfs at gmail.com)
Eiríkur Rögnvaldsson (eirikur at hi.is)
University of Iceland

The project is funded by the following grants:

Icelandic Research Fund (RANNÍS), grant nr. 090662011, "Viable Language
Technology beyond English – Icelandic as a test case".

U.S. National Science Foundation (NSF) International Research Fellowship
Program (IRFP), grant #OISE-0853114, "Evolution of Language Systems: a
comparative study of grammatical change in Icelandic and English".