[Corpora-List] .doc to .txt converter

Tristan Miller miller at ukp.informatik.tu-darmstadt.de
Fri Oct 26 15:19:35 UTC 2012


Dear Sara,

On 26/10/12 02:03 PM, Sara Berlanda wrote:
> can anybody advise me about a tool which can convert 3000 
> Word files (.doc) into 3000 .txt files at once? The
> tool should run on Windows 7 platform.

LibreOffice <http://www.libreoffice.org/> can do this when invoked from
the command line with appropriate parameters.

The following works for me on GNU/Linux, though the call should be
similar or identical on Windows 7:

libreoffice --headless --convert-to txt:text *.doc

Regards,
Tristan

-- 
Tristan Miller, Doctoral Researcher
Ubiquitous Knowledge Processing Lab (UKP-TUDA)
Department of Computer Science, Technische Universität Darmstadt
Tel: +49 6151 16 6166 | Web: http://www.ukp.tu-darmstadt.de/



-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 259 bytes
Desc: OpenPGP digital signature
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20121026/7ba0584a/attachment-0001.sig>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list