[Corpora-List] Re : sentence detector and phrase chunker returning absolute positions in text
Slim Mesfar
mesfarslim at yahoo.fr
Sat Jul 24 12:11:13 UTC 2010
Dear Wiebke,
You can use the linguistic development environment NooJ (cf. www.nooj4nlp.net)
and its command-line program noojapply.exe.
The "noojapply.exe" command-line program allows users to apply dictionaries
and grammars to texts and corpora automatically ...
For further information about how to use or integrate it, you can download
the NooJ manual or visit the users' forum at
http://groups.yahoo.com/group/nooj-info
I hope this helps,
Best,
Dr. Slim Mesfar
________________________________
From: Wiebke Wagner <wagner at lifebiosystems.com>
To: Corpora at uib.no
Sent: Mon 19 July 2010, 7:58:12
Subject: [Corpora-List] sentence detector and phrase chunker returning absolute
positions in text
Dear all,
I am looking for a tool that performs sentence detection, part-of-speech
tagging and phrase-chunking. My problem is that most of these tools
return annotated text. What I need, however, is the absolute positions
in text of the sentence boundaries and of the chunks. For example,
consider the following sentences:
"This is a sentence. And here is another one."
I would need the information that the 19th and the 44th characters
in the text are sentence boundaries. For the chunks, the
position and the length of the chunk would be ideal.
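
To make the desired output concrete, here is a minimal sketch in Python. It is
not taken from any of the tools mentioned in this thread; the function names
and the naive splitting rule (".", "!" or "?" followed by whitespace or end of
text) are assumptions made purely for illustration. It reports the 1-based
positions of the sentence boundaries and a (position, length) pair per
sentence, the same shape that would be useful for chunks.

```python
import re

# Naive boundary rule (an assumption for illustration only):
# sentence-final punctuation followed by whitespace or end of text.
BOUNDARY = re.compile(r"[.!?](?=\s|$)")


def sentence_boundaries(text):
    """Return the 1-based character positions of sentence-final punctuation."""
    return [m.start() + 1 for m in BOUNDARY.finditer(text)]


def sentence_spans(text):
    """Return (start, length) pairs for each sentence, with 0-based start."""
    spans = []
    start = 0
    for m in BOUNDARY.finditer(text):
        end = m.end()
        spans.append((start, end - start))
        # Skip the whitespace between sentences so the next span
        # begins at the first character of the next sentence.
        start = end
        while start < len(text) and text[start].isspace():
            start += 1
    return spans


if __name__ == "__main__":
    text = "This is a sentence. And here is another one."
    print(sentence_boundaries(text))  # [19, 44]
    print(sentence_spans(text))       # [(0, 19), (20, 24)]
```

A real sentence detector would also have to handle abbreviations, decimal
numbers and quotations, which this rule does not.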
I have checked OpenNLP, GATE, LingPipe and MontyLingua but did not find
any information about such an output (at least not for sentences AND
chunks).
Is anyone aware of such a tool?
Best,
Wiebke Wagner
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora