<br><br>
<div class="gmail_quote">
<blockquote style="BORDER-LEFT: #ccc 1px solid; MARGIN: 0px 0px 0px 0.8ex; PADDING-LEFT: 1ex" class="gmail_quote">
<div>Yes Norton, </div>
<div> </div>
<div>GATE is really amazing and will help you definitely what you are looking for.</div>
<div>Moreover if you find any problem then you can always ask to GATE members by registering yourself to their mailing list which is on website already suggested by Sabine.<br><br>-- <br>Regards<br><br>Saman Hina,<br>PhD Student,<br>
Language Research Group,<br>School of Computing,<br>University of Leeds, UK<br></div></blockquote>
<div> </div>
<blockquote style="BORDER-LEFT: #ccc 1px solid; MARGIN: 0px 0px 0px 0.8ex; PADDING-LEFT: 1ex" class="gmail_quote">
<div><br> </div>
<div>
<div></div>
<div class="h5">
<div class="gmail_quote">On Wed, Jul 28, 2010 at 6:58 PM, Sabine Bartsch <span dir="ltr"><<a href="mailto:bartsch@linglit.tu-darmstadt.de" target="_blank">bartsch@linglit.tu-darmstadt.de</a>></span> wrote:<br>
<blockquote style="BORDER-LEFT: #ccc 1px solid; MARGIN: 0px 0px 0px 0.8ex; PADDING-LEFT: 1ex" class="gmail_quote">Hi Norton,<br><br>you might want to look into GATE (<a href="http://gate.ac.uk/" target="_blank">http://gate.ac.uk/</a>). It is a mature and well-documented processing framework integrating lots of different tools and plugins. The default ANNIE processing application already comprises some of the things you're looking for and is preconfigured to work out of the box.<br>
For further options, look at the CREOLE plugins. You'll find that the OpenNLP Tools as well as LingPipe and other tools serving your purposes are already available in GATE.<br><br>Hope this helps,<br>Sabine<br><br><br>
On 28.07.2010 17:06, Norton Roman wrote:<br>
<blockquote style="BORDER-LEFT: #ccc 1px solid; MARGIN: 0px 0px 0px 0.8ex; PADDING-LEFT: 1ex" class="gmail_quote">Hello everybody.<br><br>Do you by any chance have ever come across some sort of text unit<br>splitting tool? I's looking for something to help me out with defining<br>
units (e.g. clauses, sentences or anything the user comes up with) in a<br>source text (it might be by adding some rules or by manually annotating<br>the units).<br><br>Thanks in advance<br><br>Norton<br><br><br><br>_______________________________________________<br>
Corpora mailing list<br><a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br><a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br></blockquote><br>
-- <br>Dr. Sabine Bartsch<br>Technische Universität Darmstadt<br>Institut für Sprach- und Literaturwissenschaft<br>Hochschulstrasse 1 64289 Darmstadt<br>Fon: +49-6151-16 4570 Fax: +49-6151-16 3694<br><a href="http://www.linglit.tu-darmstadt.de/index.php?id=bartsch" target="_blank">http://www.linglit.tu-darmstadt.de/index.php?id=bartsch</a><br>
<br>_______________________________________________<br>Corpora mailing list<br><a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br><a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
<br></blockquote></div><br><br clear="all"><br></div></div><font color="#888888">-- <br>Regards<br><br>Saman Hina,<br>PhD Student,<br>Language Research Group,<br>School of Computing,<br>University of Leeds, UK<br></font></blockquote>
</div>