[Corpora-List] plain text and .caj files (Mike Scott)

Lei Lei leileileo at gmail.com
Fri Nov 12 00:55:12 UTC 2010


Re: plain text and .caj files (Mike Scott) 
Does anyone know how to extract the plain Chinese text from .caj text 
files? I understand these are similar in conception to .PDFs.
Thanks -- Mike Scott, Aston University
---------------------------
Hi, Mike,
Yes, the .caj format is similar to PDF.
If you want to extract the Chinese characters (text) from .caj files, one quick-and-dirty method is to first print the .caj files to PDF with a virtual printer, and then extract the Chinese characters from the resulting PDFs.
You can also use the "choose text" button in CAJViewer to select the text you want and then copy and paste it out, but I prefer the first method.
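For the second half of the print-to-PDF route, a minimal Python sketch might look like the following. It is only an illustration: the choice of the third-party pdfminer.six library, the helper names chinese_runs and pdf_to_text, the script name, and the Unicode ranges treated as "Chinese" are all assumptions, not part of CAJViewer or of the method described above. It also assumes the printed PDF carries a real text layer; if the pages are only images, little or nothing will come out.

```python
import re
import sys

# CJK Unified Ideographs (basic block plus Extension A). Which ranges count
# as "Chinese text" is an assumption; widen it if punctuation is needed.
CJK_RUN = re.compile(r"[\u3400-\u4dbf\u4e00-\u9fff]+")


def chinese_runs(text):
    """Return the runs of consecutive Chinese characters found in text."""
    return CJK_RUN.findall(text)


def pdf_to_text(pdf_path):
    """Extract raw text from a PDF using pdfminer.six (third-party)."""
    # Imported lazily so chinese_runs() works without the library installed.
    from pdfminer.high_level import extract_text
    return extract_text(pdf_path)


if __name__ == "__main__":
    # Hypothetical usage: python caj_pdf_text.py printed_from_caj.pdf
    if len(sys.argv) > 1:
        raw = pdf_to_text(sys.argv[1])
        print("\n".join(chinese_runs(raw)))
```

Keeping the filtering step separate from the PDF step means the same chinese_runs helper can be applied to text pasted out of CAJViewer as well.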
Good luck!
Lei


2010-11-12 



Lei Lei
Associate Professor
School of Foreign Languages
Huazhong University of Science and Technology

Email: leileicn at 126.com
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
