22.2774, FYI: Available: IcePaHC 0.5 (Icelandic Corpus)

linguist at LINGUISTLIST.ORG linguist at LINGUISTLIST.ORG
Wed Jul 6 17:20:10 UTC 2011


LINGUIST List: Vol-22-2774. Wed Jul 06 2011. ISSN: 1068 - 4875.

Subject: 22.2774, FYI: Available: IcePaHC 0.5 (Icelandic Corpus)

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Veronika Drake, U of Wisconsin-Madison  
Monica Macaulay, U of Wisconsin-Madison  
Rajiv Rao, U of Wisconsin-Madison  
Joseph Salmons, U of Wisconsin-Madison  
Anja Wanner, U of Wisconsin-Madison  
       <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Brent Miller <brent at linguistlist.org>
================================================================  
Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/
					
					
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.

===========================Directory==============================  

1)
Date: 05-Jul-2011
From: Joel Wallenberg [joel.wallenberg at gmail.com]
Subject: Available: IcePaHC 0.5 (Icelandic Corpus)
 

	
-------------------------Message 1 ---------------------------------- 
Date: Wed, 06 Jul 2011 13:19:12
From: Joel Wallenberg [joel.wallenberg at gmail.com]
Subject: Available: IcePaHC 0.5 (Icelandic Corpus)

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=22-2774.html&submissionid=4525449&topicid=6&msgnumber=1
  


IcePaHC 0.5, the latest version of the Icelandic Parsed Historical Corpus,
is now available for download:

http://linguist.is/icelandic_treebank/Download

- 632.000 words total, from every century between the 12th and the 20th
centuries inclusive
- Annotated for phrase structure, part-of-speech-tagged and lemmatized
- LGPL license: You are free to copy, modify and redistribute the corpus
for research and/or profit

Joel C. Wallenberg (joel.wallenberg at gmail.com)
Anton Karl Ingason (anton.karl.ingason at gmail.com)
Einar Freyr Sigurðsson (einarfs at gmail.com)
Eiríkur Rögnvaldsson (eirikur at hi.is)
University of Iceland

The project is funded by the following grants:

Icelandic Research Fund (RANNÍS), grant nr. 090662011,''Viable Language
Technology beyond English - Icelandic as a test case''.

U.S. National Science Foundation (NSF) International Research Fellowship
Program (IRFP), grant #OISE-0853114, ''Evolution of Language Systems: a
comparative study of grammatical change in Icelandic and English''. 



Linguistic Field(s): Computational Linguistics
                     Historical Linguistics
                     Text/Corpus Linguistics





 







-----------------------------------------------------------
LINGUIST List: Vol-22-2774	
----------------------------------------------------------
Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/
					
					

	



More information about the LINGUIST mailing list