19.166, Software: Release of Parallel Treebank Corpus and Tool

LINGUIST Network linguist at LINGUISTLIST.ORG
Tue Jan 15 15:54:36 UTC 2008


LINGUIST List: Vol-19-166. Tue Jan 15 2008. ISSN: 1068-4875.

Subject: 19.166, Software: Release of Parallel Treebank Corpus and Tool

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Randall Eggert, U of Utah  
         <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Hannah Morales <hannah at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.

===========================Directory==============================  

1)
Date: 15-Jan-2008
From: Martin Volk < volk at ling.su.se >
Subject: Release of Parallel Treebank Corpus and Tool

 

	
-------------------------Message 1 ---------------------------------- 
Date: Tue, 15 Jan 2008 10:53:39
From: Martin Volk [volk at ling.su.se]
Subject: Release of Parallel Treebank Corpus and Tool


The Computational Linguistics Group at the Department of Linguistics at
Stockholm University makes available an aligned parallel treebank (called
SMULTRON) and an accompanying alignment and query tool (called the
Stockholm TreeAligner).

SMULTRON (Stockholm MULtilingual TReebank) is a parallel treebank that
contains around 1000 sentences in English, German and Swedish. The
sentences have been PoS-tagged and annotated with phrase structure trees.
The trees have been aligned across languages at the sentence, phrase and
word levels. Additionally, the German and Swedish monolingual treebanks
contain lemma information.

SMULTRON is freely available for research purposes from
http://www.ling.su.se/DaLi/research/smultron/index.htm

The Stockholm TreeAligner allows the user to view alignment links across
two parallel trees. It also allows the user to create and modify such links
between corresponding nodes or words in two treebanks.

The Stockholm TreeAligner displays trees from input files in TigerXML
format with node labels, edge labels, and crossing branches, making it
useful for browsing TigerXML files.
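
For readers who have not worked with TigerXML before, the following is a
minimal sketch of reading such a file with Python's standard library. It is
not part of the Stockholm TreeAligner. The sample sentence, its IDs and its
tags are made up for illustration and are not taken from SMULTRON, and the
function names read_sentences and bracketed are hypothetical. The sketch
relies only on the usual TigerXML layout: terminals as <t> elements,
nonterminals as <nt> elements, and labelled <edge> children pointing to
daughter nodes.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the TigerXML layout (illustrative, not from SMULTRON).
SAMPLE = """<corpus id="demo">
  <body>
    <s id="s1">
      <graph root="s1_501">
        <terminals>
          <t id="s1_1" word="The" pos="DT"/>
          <t id="s1_2" word="dog" pos="NN"/>
          <t id="s1_3" word="barks" pos="VBZ"/>
        </terminals>
        <nonterminals>
          <nt id="s1_500" cat="NP">
            <edge label="--" idref="s1_1"/>
            <edge label="HD" idref="s1_2"/>
          </nt>
          <nt id="s1_501" cat="S">
            <edge label="SB" idref="s1_500"/>
            <edge label="HD" idref="s1_3"/>
          </nt>
        </nonterminals>
      </graph>
    </s>
  </body>
</corpus>"""


def read_sentences(xml_text):
    """Return one dict per <s> element: root ID, terminals, nonterminals."""
    corpus = ET.fromstring(xml_text)
    sentences = []
    for s in corpus.iter("s"):
        graph = s.find("graph")
        # Terminal nodes: ID -> (word form, part-of-speech tag).
        terminals = {
            t.get("id"): (t.get("word"), t.get("pos")) for t in graph.iter("t")
        }
        # Nonterminal nodes: ID -> category plus (edge label, daughter ID) pairs.
        nonterminals = {
            nt.get("id"): {
                "cat": nt.get("cat"),
                "edges": [(e.get("label"), e.get("idref")) for e in nt.findall("edge")],
            }
            for nt in graph.iter("nt")
        }
        sentences.append(
            {
                "id": s.get("id"),
                "root": graph.get("root"),
                "terminals": terminals,
                "nonterminals": nonterminals,
            }
        )
    return sentences


def bracketed(node_id, sent):
    """Render the subtree under node_id as a bracketed string.

    Daughters are emitted in edge order, so for trees with crossing
    branches the output does not necessarily follow surface word order.
    """
    if node_id in sent["terminals"]:
        word, pos = sent["terminals"][node_id]
        return f"({pos} {word})"
    node = sent["nonterminals"][node_id]
    children = " ".join(bracketed(child, sent) for _label, child in node["edges"])
    return f"({node['cat']} {children})"


if __name__ == "__main__":
    for sent in read_sentences(SAMPLE):
        print(sent["id"], bracketed(sent["root"], sent))
```

To try this on a real treebank file, read the file contents and pass them to
read_sentences in place of SAMPLE. Edge labels are kept in the parsed
structure but are not shown in the bracketed output.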

Moreover, the Stockholm TreeAligner supports queries over parallel
treebanks. Its query language is inspired by the TIGERSearch query language
and additionally allows alignment queries. Search results are highlighted
in a graphical display.

The Stockholm TreeAligner is free software and can be downloaded from
http://www.ling.su.se/dali/downloads/treealigner/index.htm

Linguistic Field(s): Computational Linguistics
                     Syntax
                     Text/Corpus Linguistics




