[Corpora-List] Converting PDFs in Arabic to txt/xml for further corpus analysis (fwd)

Maximilian Haeussler max at soe.ucsc.edu
Sat Sep 13 00:35:42 UTC 2014


On a related note: can someone recommend a layout/text flow engine for
tesseract?

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list