25.4260, Software: Universal Dependencies, Version 1

The LINGUIST List via LINGUIST linguist at listserv.linguistlist.org
Tue Oct 28 02:07:28 UTC 2014

LINGUIST List: Vol-25-4260. Mon Oct 27 2014. ISSN: 1069-4875.

Subject: 25.4260, Software: Universal Dependencies, Version 1

Moderators: Damir Cavar, Indiana U <damir at linguistlist.org>
            Malgorzata E. Cavar, Indiana U <gosia at linguistlist.org>

Reviews: reviews at linguistlist.org
Anthony Aristar <aristar at linguistlist.org>
Helen Aristar-Dry <hdry at linguistlist.org>
Sara Couture, Indiana U <sara at linguistlist.org>

Homepage: http://linguistlist.org

Editor for this issue: Damir Cavar <damir at linguistlist.org>

Date: Mon, 27 Oct 2014 22:07:10
From: Joakim Nivre [joakim.nivre at lingfil.uu.se]
Subject: Universal Dependencies, Version 1

We are happy to announce the release of the annotation guidelines for
Universal Dependencies at http://universaldependencies.github.io/docs/.
Universal Dependencies is a project that seeks to develop cross-linguistically
consistent treebank annotation for many languages with the goal of
facilitating multilingual parser development, cross-lingual learning, and
parsing research from a language typology perspective. The annotation scheme
is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008,
2014), Google universal part-of-speech tags (Petrov et al., 2012), and the
Interset interlingua for morphosyntactic tagsets (Zeman, 2008). The general
philosophy is to provide a universal inventory of categories and guidelines to
facilitate consistent annotation of similar constructions across languages,
while allowing language-specific extensions when necessary.
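
As a rough illustration of how the three layers of the scheme (part-of-speech
tags, morphological features, and dependency relations) can sit together on
each word, here is a minimal Python sketch. The class name, field names, and
helper functions are hypothetical, and the tags and relation labels in the
example sentence are chosen for illustration only; they are not quoted from
the guidelines, which remain the authoritative source for the inventory.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Token:
    """One word of a dependency-annotated sentence (illustrative layout only)."""
    index: int                 # 1-based position in the sentence
    form: str                  # surface word form
    upos: str                  # universal part-of-speech tag
    feats: Dict[str, str] = field(default_factory=dict)  # morphological features
    head: int = 0              # index of the governing token, 0 for the root
    deprel: str = "root"       # dependency relation to the head


def children(sentence: List[Token], head_index: int) -> List[Token]:
    """Return the tokens directly governed by the token at head_index."""
    return [tok for tok in sentence if tok.head == head_index]


def root(sentence: List[Token]) -> Token:
    """Return the single token whose head is 0."""
    roots = [tok for tok in sentence if tok.head == 0]
    if len(roots) != 1:
        raise ValueError("expected exactly one root token")
    return roots[0]


# Hypothetical annotation of "The dog barks"; the labels are examples,
# not an excerpt from the released guidelines or data.
sentence = [
    Token(1, "The", "DET", {"Definite": "Def"}, head=2, deprel="det"),
    Token(2, "dog", "NOUN", {"Number": "Sing"}, head=3, deprel="nsubj"),
    Token(3, "barks", "VERB", {"Number": "Sing", "Person": "3"}, head=0, deprel="root"),
]
```

Keeping the head as an index into the same sentence means the tree can be
walked with simple list scans, as in root(sentence) or children(sentence, 3).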

We intend to treat version 1 as stable for at least the next year, but we may
subsequently make further revisions based on experience using it to annotate
treebanks for a range of languages. Our goal is to make a first release of
data sets with language-specific documentation by January 1, 2015. If you are
interested in contributing to this effort, please get in touch.

Jinho Choi, Marie-Catherine de Marneffe, Tim Dozat, Filip Ginter, Yoav
Goldberg, Jan Hajic, Christopher Manning, Ryan McDonald, Joakim Nivre, Slav
Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Dan Zeman

Linguistic Field(s): Computational Linguistics
