22.2774, FYI: Available: IcePaHC 0.5 (Icelandic Corpus)
linguist at LINGUISTLIST.ORG
linguist at LINGUISTLIST.ORG
Wed Jul 6 17:20:10 UTC 2011
LINGUIST List: Vol-22-2774. Wed Jul 06 2011. ISSN: 1068 - 4875.
Subject: 22.2774, FYI: Available: IcePaHC 0.5 (Icelandic Corpus)
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Veronika Drake, U of Wisconsin-Madison
Monica Macaulay, U of Wisconsin-Madison
Rajiv Rao, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.
Editor for this issue: Brent Miller <brent at linguistlist.org>
================================================================
Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
http://multitree.linguistlist.org/
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.
===========================Directory==============================
1)
Date: 05-Jul-2011
From: Joel Wallenberg [joel.wallenberg at gmail.com]
Subject: Available: IcePaHC 0.5 (Icelandic Corpus)
-------------------------Message 1 ----------------------------------
Date: Wed, 06 Jul 2011 13:19:12
From: Joel Wallenberg [joel.wallenberg at gmail.com]
Subject: Available: IcePaHC 0.5 (Icelandic Corpus)
E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=22-2774.html&submissionid=4525449&topicid=6&msgnumber=1
IcePaHC 0.5, the latest version of the Icelandic Parsed Historical Corpus,
is now available for download:
http://linguist.is/icelandic_treebank/Download
- 632.000 words total, from every century between the 12th and the 20th
centuries inclusive
- Annotated for phrase structure, part-of-speech-tagged and lemmatized
- LGPL license: You are free to copy, modify and redistribute the corpus
for research and/or profit
Joel C. Wallenberg (joel.wallenberg at gmail.com)
Anton Karl Ingason (anton.karl.ingason at gmail.com)
Einar Freyr Sigurðsson (einarfs at gmail.com)
Eiríkur Rögnvaldsson (eirikur at hi.is)
University of Iceland
The project is funded by the following grants:
Icelandic Research Fund (RANNÍS), grant nr. 090662011,''Viable Language
Technology beyond English - Icelandic as a test case''.
U.S. National Science Foundation (NSF) International Research Fellowship
Program (IRFP), grant #OISE-0853114, ''Evolution of Language Systems: a
comparative study of grammatical change in Icelandic and English''.
Linguistic Field(s): Computational Linguistics
Historical Linguistics
Text/Corpus Linguistics
-----------------------------------------------------------
LINGUIST List: Vol-22-2774
----------------------------------------------------------
Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
http://multitree.linguistlist.org/
More information about the LINGUIST
mailing list