<html><head><style type="text/css"><!-- DIV {margin:0px;} --></style></head><body><div style="font-family:times new roman,new york,times,serif;font-size:12pt"><div>Dear Wiebke,<br><br><span>You can use the linguistic development environment NooJ (cf. <a target="_blank" href="http://www.nooj4nlp.net">www.nooj4nlp.net</a>) and its command-line program noojapply.exe.</span><br>The "noojapply.exe" command-line program allows users to apply to texts and corpora dictionaries and grammars automatically ... <br><span>For further information about how to use/integrate it, you can download the NooJ's manual or visit the users' forum at <a target="_blank" href="http://groups.yahoo.com/group/nooj-info">http://groups.yahoo.com/group/nooj-info</a> </span><br><br>I hope this could help you,<br><br>Best,<br><br>Dr. Slim Mesfar<br></div><div style="font-family: times new roman,new york,times,serif; font-size: 12pt;"><br><div style="font-family:
arial,helvetica,sans-serif; font-size: 13px;"><font face="Tahoma" size="2"><hr size="1"><b><span style="font-weight: bold;">De :</span></b> Wiebke Wagner <wagner@lifebiosystems.com><br><b><span style="font-weight: bold;">À :</span></b> Corpora@uib.no<br><b><span style="font-weight: bold;">Envoyé le :</span></b> Lun 19 juillet 2010, 7h 58min 12s<br><b><span style="font-weight: bold;">Objet :</span></b> [Corpora-List] sentence detector and phrase chunker returning absolute positions in text<br></font><br>Dear all,<br><br>I am looking for a tool that performs sentece detection, part-of-speech<br>tagging and phrase-chunking. My problem is that most of these tools<br>return annotated text. What I need, however, is the absolute positions<br>in text of the sentece boundaries and of the chunks. For example,<br>consider the following sentences:<br><br>"This is a sentence. And here is another one."<br><br>I would need the information that the 19th
and respectivly the 44th<br>character in the text is a sentence boundary. For the chunks, the<br>position and the length of the chunk would be ideal.<br>I have checked OpenNLP, Gate, LingPipe and MontyLingua but did not find<br>any information about such an output (at leas not for sentences AND<br>chunks).<br>Is anyone aware of such a tool? <br><br>Best,<br>Wiebke Wagner<br><br><br><br>_______________________________________________<br>Corpora mailing list<br><a ymailto="mailto:Corpora@uib.no" href="mailto:Corpora@uib.no">Corpora@uib.no</a><br><a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br></div></div>
</div><br>
</body></html>