[Corpora-List] summary of replies to "software for (manual) syntactic annotation of corpora?"

Adam Przepiorkowski adamp at ipipan.waw.pl
Sat Aug 1 17:34:47 UTC 2009


Two weeks ago I asked about tools for the manual annotation of corpora
with constituency information (original e-mail below).

The following tools were mentioned in reply:

* EasyRef: http://atoll.inria.fr/easyrefpub/ ("is a prototype of web
service to handle (view, edit, import, export, bug reports) syntactic
annotations (for the French EASy and PASSAGE formats)")

* GATE: http://gate.ac.uk/ ("has many (but not all) of the features
you are looking for"),

* MMAX:
http://www.eml-research.de/english/research/nlp/download/mmax.php,
http://mmax2.sourceforge.net/ (recommended by a couple of people)

* SACODEYL Annotator:
http://www.um.es/sacodeyl/en/pages/software.htm#annotator ("is perfect
for manual annotation. It's desktop at the moment (exciteing plans for
the future, though).  It's open source, TEI - XML")

* Synpathy:
http://www.mpi.nl/research/research-projects/language-archiving-technology/tools
("free and open", but "closed" in terms of development and maintenance)

* TrEd: http://ufal.mff.cuni.cz/~pajas/tred/ ("is a highly customizable
open source toolkit for manual and/or semi automatic annotation,
usable both for constituency and dependency trees")

* WordFreak: http://wordfreak.sourceforge.net/

Many thanks to everybody who answered my query: Eric de la Clergerie,
Ciprian Gerstenberger, Rob Malouf, Petr Pajas, Pascual Pérez-Paredes,
Peter Wittenburg, Magdalena Wolska, Amir Zeldes.

Adam P.

P.S. If I get more replies within the next couple of weeks, I'll send
another summary.


Adam Przepiorkowski <adamp at ipipan.waw.pl>:

> I am looking for freely available (preferably open-source) software
> for the manual annotation of corpora at the syntactic (constituency)
> level.  This should be annotation from scratch, not discrimination
> between results of a parser.
>
> Some desirable features:
>
> - web interface (work over the Internet)
>
> - partial annotation possible (e.g., only identification of NPs or
>   only named entities, not necessarily full rooted trees)
>
> - management of annotators, incl. controlling inter-annotator
>   agreement
>
> I'll send a summary of any replies I'll get off the list.
>
> Adam P.
>
> -- 
> Adam Przepiórkowski                   ˈadam ˌpʃɛpjurˈkɔfskʲi
> http://nlp.ipipan.waw.pl/  ___  Linguistic Engineering Group
> http://korpus.pl/  _____________   IPI PAN  Corpus of Polish
> http://nkjp.pl/  _________________ National Corpus of Polish
>

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list