Corpora: plain text
David Grant
drg199 at ecs.soton.ac.uk
Thu Dec 6 16:49:05 UTC 2001
Hi,
I'm looking for plain-text, tokenized English text with which to test a tagger. Does anyone know where I could find some?
By tokenized I mean that all words and punctuation marks must be separated by at least one space,
i.e.
hello how are you ? I am fine .
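To illustrate the format, here is a minimal sketch in Python that puts untokenized text into that shape. It assumes a simple regular-expression rule: runs of word characters (with optional internal apostrophes) are tokens, and every other non-space character is its own token. The function name tokenize is hypothetical, and a real tokenizer would likely treat abbreviations, hyphens and numbers differently.

```python
import re

def tokenize(text):
    """Return text with words and punctuation separated by single spaces."""
    # A token is either a run of word characters (optionally containing
    # internal apostrophes, e.g. "don't") or one non-space, non-word character.
    tokens = re.findall(r"\w+(?:'\w+)*|[^\w\s]", text)
    return " ".join(tokens)

if __name__ == "__main__":
    print(tokenize("hello how are you? I am fine."))
```

Text that is already tokenized in this sense should pass through unchanged apart from collapsed whitespace.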
Cheers,
David Grant
drg199 at ecs.soton.ac.uk