20.2019, FYI: New Releases of SMULTRON and TreeAligner

Sun May 31 00:29:56 UTC 2009

LINGUIST List: Vol-20-2019. Sat May 30 2009. ISSN: 1068 - 4875.

Subject: 20.2019, FYI: New Releases of SMULTRON and TreeAligner

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Randall Eggert, U of Utah  
       <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Catherine Adams <catherin at linguistlist.org>


Date: 28-May-2009
From: Torsten Marek < marek at ifi.uzh.ch >
Subject: New Releases of SMULTRON and TreeAligner

-------------------------Message 1 ---------------------------------- 
Date: Sat, 30 May 2009 20:28:34
From: Torsten Marek [marek at ifi.uzh.ch]
Subject: New Releases of SMULTRON and TreeAligner

The Parallel Treebank Group at the Institute of Computational Linguistics
at the University of Zürich is proud to announce the availability of new
releases of SMULTRON, an aligned parallel treebank, and the TreeAligner, a
tool for annotating, browsing and querying parallel treebanks.


SMULTRON (Stockholm MULtilingual TReebank) is a parallel treebank which
contains around 1000 sentences in English, German and Swedish. The
sentences have been PoS-tagged and annotated with phrase structure trees.
The trees have been aligned across languages at the sentence, phrase and word
level. Additionally, the German and Swedish monolingual treebanks contain
lemma information. The SMULTRON corpus is freely available for research
purposes; please see the registration page: http://www.cl.uzh.ch/kitt/smultron/

New in version 1.1:
 * new German-Swedish alignments
 * various annotation errors fixed
 * compatibility updates for the new TreeAligner

TreeAligner v1.1

The TreeAligner is a graphical tool for creating aligned parallel treebanks
by drawing alignment links between phrases. The monolingual treebanks must
currently be encoded in TIGER-XML. 
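
For readers who have not worked with TIGER-XML before, the following is a
minimal sketch of reading such a file with Python's standard library. It is
not part of the TreeAligner; the sample sentence, its ids and labels, and the
helper name read_sentences are invented for illustration, and it assumes the
usual TIGER-XML layout of <s>, <t>, <nt> and <edge> elements.

```python
import xml.etree.ElementTree as ET

# Hypothetical one-sentence sample in the general shape of TIGER-XML;
# the ids, words and labels are invented for illustration.
SAMPLE = """<corpus id="demo">
  <body>
    <s id="s1">
      <graph root="s1_500">
        <terminals>
          <t id="s1_1" word="She" pos="PRP"/>
          <t id="s1_2" word="sleeps" pos="VBZ"/>
        </terminals>
        <nonterminals>
          <nt id="s1_500" cat="S">
            <edge label="SBJ" idref="s1_1"/>
            <edge label="HD" idref="s1_2"/>
          </nt>
        </nonterminals>
      </graph>
    </s>
  </body>
</corpus>"""


def read_sentences(xml_text):
    """Return, per sentence, its id, (word, pos) tokens and phrase nodes."""
    root = ET.fromstring(xml_text)
    sentences = []
    for s in root.iter("s"):
        # Terminal nodes carry the word form and the PoS tag as attributes.
        tokens = [(t.get("word"), t.get("pos")) for t in s.iter("t")]
        # Each nonterminal maps to its category and the ids of its children.
        phrases = {
            nt.get("id"): (nt.get("cat"),
                           [e.get("idref") for e in nt.findall("edge")])
            for nt in s.iter("nt")
        }
        sentences.append({"id": s.get("id"),
                          "tokens": tokens,
                          "phrases": phrases})
    return sentences


for sent in read_sentences(SAMPLE):
    print(sent["id"], sent["tokens"])
```

The node ids collected here (s1_1, s1_500 and so on) are the kind of
identifiers that an alignment between two treebanks would need to refer to.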

The TreeAligner also allows querying the aligned treebanks, using an
extended version of the TIGER corpus query language. 

Easy installers for the TreeAligner are available for Windows and Ubuntu:

The source code is available as a package or from public repositories:

All code is licensed under the GPLv2.

New in version 1.1:
 * improved annotation workflow & tree interactivity
 * corpus information display (feature values etc.)
 * automatic alignment suggestions (experimental feature)
 * monolingual queries for parallel treebanks
 * much faster query evaluation engine
 * query language extensions for restricted universal quantification
 * improved tree layout algorithm
 * sampler from the SMULTRON corpus included in the installers

If you are interested, please join us on the TreeAligner mailing list:

Linguistic Field(s): Computational Linguistics
                     Text/Corpus Linguistics



