[Corpora-List] sentence detector and phrase chunker returning absolute positions in text

Adam Radziszewski kocikikut at gmail.com
Mon Jul 19 08:18:46 UTC 2010


Dear Wiebke,

> I have checked OpenNLP, Gate, LingPipe and MontyLingua but did not find

I doubt that GATE has a ready-made option in the GUI that outputs only
these positions. However, its XML format is based on such character
positions:
http://gate.ac.uk/sale/tao/splitch5.html#sec:corpora:schemas
What is more, if you write your own plugin or just use the Java API,
you can easily iterate over Annotation objects and fetch their start
and end offsets.
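
To illustrate what "annotations with absolute character positions" means,
here is a minimal sketch in plain Java. It deliberately does not use GATE:
the Span class and the sentenceSpans method are hypothetical names made up
for this example, and java.text.BreakIterator is swapped in as a stand-in
sentence detector. In GATE itself, the corresponding values would come from
an Annotation's getStartNode().getOffset() and getEndNode().getOffset().

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class SentenceOffsets {

    /** Hypothetical stand-in for an annotation: a type plus absolute character offsets. */
    static final class Span {
        final String type;
        final int start; // offset of the first character (inclusive)
        final int end;   // offset just past the last character (exclusive)

        Span(String type, int start, int end) {
            this.type = type;
            this.start = start;
            this.end = end;
        }
    }

    /** Splits text into sentences and returns their offsets in the original string. */
    static List<Span> sentenceSpans(String text) {
        List<Span> spans = new ArrayList<>();
        // Assumption: English text; BreakIterator is not GATE's sentence splitter.
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.ENGLISH);
        it.setText(text);
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            // BreakIterator keeps trailing whitespace with the sentence; trim it off.
            int trimmedEnd = end;
            while (trimmedEnd > start && Character.isWhitespace(text.charAt(trimmedEnd - 1))) {
                trimmedEnd--;
            }
            if (trimmedEnd > start) {
                spans.add(new Span("Sentence", start, trimmedEnd));
            }
        }
        return spans;
    }

    public static void main(String[] args) {
        String text = "First sentence. Second one.";
        // Iterate over the spans and print offsets plus the covered text.
        for (Span s : sentenceSpans(text)) {
            System.out.println(s.type + " [" + s.start + ", " + s.end + ") "
                    + text.substring(s.start, s.end));
        }
    }
}
```

Because the offsets refer to the unmodified input string, the covered text
can always be recovered with text.substring(start, end), which is the same
property that makes offset-based (standoff) annotation convenient.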

Best,
Adam Radziszewski

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


