[Corpora-List] summary of replies to "software for (manual) syntactic annotation of corpora?"
Adam Przepiorkowski
adamp at ipipan.waw.pl
Sat Aug 1 17:34:47 UTC 2009
Two weeks ago I asked about tools for the manual annotation of corpora
with constituency information (original e-mail below).
The following tools were mentioned in reply:
* EasyRef: http://atoll.inria.fr/easyrefpub/ ("is a prototype of web
service to handle (view, edit, import, export, bug reports) syntactic
annotations (for the French EASy and PASSAGE formats)")
* GATE: http://gate.ac.uk/ ("has many (but not all) of the features
you are looking for")
* MMAX:
http://www.eml-research.de/english/research/nlp/download/mmax.php,
http://mmax2.sourceforge.net/ (recommended by a couple of people)
* SACODEYL Annotator:
http://www.um.es/sacodeyl/en/pages/software.htm#annotator ("is perfect
for manual annotation. It's desktop at the moment (exciting plans for
the future, though). It's open source, TEI - XML")
* Synpathy:
http://www.mpi.nl/research/research-projects/language-archiving-technology/tools
("free and open", but "closed" in terms of development and maintenance)
* TrEd: http://ufal.mff.cuni.cz/~pajas/tred/ ("is a highly customizable
open source toolkit for manual and/or semi automatic annotation,
usable both for constituency and dependency trees")
* WordFreak: http://wordfreak.sourceforge.net/
Many thanks to everybody who answered my query: Eric de la Clergerie,
Ciprian Gerstenberger, Rob Malouf, Petr Pajas, Pascual Pérez-Paredes,
Peter Wittenburg, Magdalena Wolska, Amir Zeldes.
Adam P.
P.S. If I get more replies within the next couple of weeks, I'll send
another summary.
Adam Przepiorkowski <adamp at ipipan.waw.pl>:
> I am looking for freely available (preferably open-source) software
> for the manual annotation of corpora at the syntactic (constituency)
> level. This should be annotation from scratch, not discrimination
> between results of a parser.
>
> Some desirable features:
>
> - web interface (work over the Internet)
>
> - partial annotation possible (e.g., only identification of NPs or
> only named entities, not necessarily full rooted trees)
>
> - management of annotators, incl. controlling inter-annotator
> agreement
>
> I'll send a summary of any replies I get off the list.
>
> Adam P.
>
> --
> Adam Przepiórkowski ˈadam ˌpʃɛpjurˈkɔfskʲi
> http://nlp.ipipan.waw.pl/ ___ Linguistic Engineering Group
> http://korpus.pl/ _____________ IPI PAN Corpus of Polish
> http://nkjp.pl/ _________________ National Corpus of Polish
>