[Corpora-List] Sentence boundary detection

Kelly Vincent kpvincent at hotmail.com
Fri Jul 20 14:11:23 UTC 2007


I am interested in what the current state-of-the-art is in sentence boundary 
detection and (to a lesser degree) tokenization. I have been able to locate 
several articles, but very few that are quite recent. I would appreciate any 
pointers to particularly important papers or to available tools, as well as 
the community's thoughts on the topic.

We are building a Spanish corpus so I am particularly interested in these 
topics from the Spanish perspective, though not confined to that.

Regards,
Kelly Vincent
Software Engineer
MetaMetrics, Inc.

_________________________________________________________________
Local listings, incredible imagery, and driving directions - all in one 
place! http://maps.live.com/?wip=69&FORM=MGAC01


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list