Corpora: plain text

David Grant drg199 at ecs.soton.ac.uk
Thu Dec 6 16:49:05 UTC 2001


Hi,

I'm looking for plain text, tokenized, english text, with which to test a tagger.  Does anyone know where i could find some.

by tokenized i mean all words and punctuation must be separated by atleast one space.

ie

hello how are you ? I am fine .

cheers

David Grant
drg199 at ecs.soton.ac.uk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20011206/1e1b1a32/attachment.htm>


More information about the Corpora mailing list