[Corpora-List] Sentence boundary detection

Scott Piao Scott.Piao at manchester.ac.uk
Fri Jul 20 16:54:54 UTC 2007


Hi Kelly,

I put my sentence breaker at the site:
http://text0.mib.man.ac.uk:8080/scottpiao/sent_detector

It has performed with very high precisions, including in a commercial 
context. It is for English, I am not sure if it works on Spanish. You 
can try on the
demo website.

Best

Scott Piao
----------------------------------
Text Mining Group
School of Computer Science
University of Manchester
UK


Quoting Kelly Vincent <kpvincent at hotmail.com>:

> I am interested in what the current state-of-the-art is in sentence boundary
> detection and (to a lesser degree) tokenization. I have been able to locate
> several articles, but very few that are quite recent. I would appreciate any
> pointers to particularly important papers or to available tools, as well as
> the community's thoughts on the topic.
>
> We are building a Spanish corpus so I am particularly interested in these
> topics from the Spanish perspective, though not confined to that.
>
> Regards,
> Kelly Vincent
> Software Engineer
> MetaMetrics, Inc.
>
> _________________________________________________________________
> Local listings, incredible imagery, and driving directions - all in one
> place! http://maps.live.com/?wip=69&FORM=MGAC01
>
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>




_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list