25.4260, Software: Universal Dependencies, Version 1
The LINGUIST List via LINGUIST
linguist at listserv.linguistlist.org
Tue Oct 28 02:07:28 UTC 2014
LINGUIST List: Vol-25-4260. Mon Oct 27 2014. ISSN: 1069 - 4875.
Subject: 25.4260, Software: Universal Dependencies, Version 1
Moderators: Damir Cavar, Indiana U <damir at linguistlist.org>
Malgorzata E. Cavar, Indiana U <gosia at linguistlist.org>
Reviews: reviews at linguistlist.org
Anthony Aristar <aristar at linguistlist.org>
Helen Aristar-Dry <hdry at linguistlist.org>
Sara Couture, Indiana U <sara at linguistlist.org>
Homepage: http://linguistlist.org
Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!
USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21
For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.
Editor for this issue: Damir Cavar <damir at linguistlist.org>
================================================================
Date: Mon, 27 Oct 2014 22:07:10
From: Joakim Nivre [joakim.nivre at lingfil.uu.se]
Subject: Universal Dependencies, Version 1
E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=25-4260.html&submissionid=35964517&topicid=13&msgnumber=1
We are happy to announce the release of the annotation guidelines for
Universal Dependencies at http://universaldependencies.github.io/docs/.
Universal Dependencies is a project that seeks to develop cross-linguistically
consistent treebank annotation for many languages with the goal of
facilitating multilingual parser development, cross-lingual learning, and
parsing research from a language typology perspective. The annotation scheme
is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008,
2014), Google universal part-of-speech tags (Petrov et al., 2012), and the
Interset interlingua for morphosyntactic tagsets (Zeman, 2008). The general
philosophy is to provide a universal inventory of categories and guidelines to
facilitate consistent annotation of similar constructions across languages,
while allowing language-specific extensions when necessary.
We intend to treat version 1 as stable for at least the next year, but we may
subsequently make further revisions based on experiences using it to treebank
a range of languages. Our goal is to make a first release of data sets with
language-specific documentation by January 1, 2015. If you are interested in
contributing to this effort, please get in touch.
Jinho Choi, Marie-Catherine de Marneffe, Tim Dozat, Filip Ginter, Yoav
Goldberg, Jan Hajic, Christopher Manning, Ryan McDonald, Joakim Nivre, Slav
Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Dan Zeman
Linguistic Field(s): Computational Linguistics
----------------------------------------------------------
LINGUIST List: Vol-25-4260
----------------------------------------------------------
More information about the LINGUIST
mailing list