Norton,<br><br>You might want to use an ensemble of tools--for example, LingPipe for splitting sentences, Yamcha for splitting base phrases, and your own tool for other arbitrary units that your user comes up with.<br><br>
Kevin<br><br><div class="gmail_quote">On Wed, Jul 28, 2010 at 9:06 AM, Norton Roman <span dir="ltr"><<a href="mailto:nortontr@gmail.com">nortontr@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
Hello everybody.<br><br>Do you by any chance have ever come across some sort of text unit splitting tool? I's looking for something to help me out with defining units (e.g. clauses, sentences or anything the user comes up with) in a source text (it might be by adding some rules or by manually annotating the units).<br>
<br>Thanks in advance<br><font color="#888888"><br>Norton<br>
</font><br>_______________________________________________<br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
<br></blockquote></div><br><br clear="all"><br>-- <br>Kevin Bretonnel Cohen, PhD<br>Biomedical Text Mining Group Lead, Center for Computational Pharmacology, U. Colorado School of Medicine<br>and<br>Lead Artificial Intelligence Engineer, The MITRE Corporation, Human Language Technology Division<br>
303-916-2417 (cell) 303-377-9194 (home)<br><a href="http://compbio.ucdenver.edu/Hunter_lab/Cohen">http://compbio.ucdenver.edu/Hunter_lab/Cohen</a><br><br>