[Corpora-List] Sentence splitter
Joerg Schuster
js at cis.uni-muenchen.de
Mon Mar 17 12:12:19 UTC 2003
I have also asked for sentencizers very recently. Here is a summary:
+-----------------+------------+--------------------------------------------------------+-------------+
|Name/Nickname |Author |Web Site |Comment |
+-----------------+------------+--------------------------------------------------------+-------------+
|ave |Ave Wrigley |http://search.cpan.org/author/TGROSE/HTML-Summary-0.017/|perl module |
| | | | |
+-----------------+------------+--------------------------------------------------------+-------------+
|mxterminator |Adwait |http://www.cis.upenn.edu/~adwait/statnlp.html |java, |
| |Ratnaparkhi | |probabilistic|
+-----------------+------------+--------------------------------------------------------+-------------+
|satz |David |http://elib.cs.berkeley.edu/src/satz/ |written in c,|
| |D. Palmer | |has to be |
| | | |trained |
+-----------------+------------+--------------------------------------------------------+-------------+
|sentence.cgi |? |http://misshoover.si.umich.edu/~zzheng/sentence/ |cgi script |
+-----------------+------------+--------------------------------------------------------+-------------+
|shlomo |Shlomo Yona |http://search.cpan.org/author/SHLOMOY/ |perl module |
| | |Lingua-EN-Sentence-0.25/lib/Lingua/EN/Sentence.pm | |
+-----------------+------------+--------------------------------------------------------+-------------+
|ttt |? |http://www.ltg.ed.ac.uk/software/ttt/index.html |Seems to be |
| | | |available |
| | | |only for |
| | | |SPARC |
| | | |machines |
+-----------------+------------+--------------------------------------------------------+-------------+
You can test the programs ave, mxterminator and shlomo here:
http://www.cis.uni-muenchen.de/~js/sentencize
If you do non-trivial tests, please let me know the results.
Jörg Schuster
More information about the Corpora
mailing list