[Corpora-List] corpus of textbooks; "Just download the PDF's and convert to text"
Michele Filannino
michele.filannino at cs.manchester.ac.uk
Fri Oct 12 11:18:42 UTC 2012
Maybe you will find it useful:
http://en.wikipedia.org/wiki/Pdftotext
Bye,
Michele Filannino.
CDT PhD student in Computer Science
Room IT301 - IT Building
The University of Manchester
filannim at cs.manchester.ac.uk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20121012/aaf6f810/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list