<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">Dear Sara,<br>
<br>
Over the past few years I have converged on this solution:<br>
<br>
$ abiword -t txt *.doc<br>
<br>
This gives a plain-text file with the extension .txt for each .doc
file, e.g. TextFileName.txt for TextFileName.doc. There are also
options to keep or change the files' character encoding, should you
want or need to do that.<br>
<br>
Like Tristan's solution, this is a command-line one. The $ in the
command stands for the command-line 'prompt' and is not typed.<br>
<br>
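If the shell does not expand *.doc itself (cmd.exe on Windows leaves
wildcards to the program), the same conversion can be driven one file
at a time. What follows is only a minimal Python sketch under stated
assumptions: abiword is on the PATH and accepts "-t txt" as in the
command above, and the names build_commands, convert_all and the
folder word_files are hypothetical:<br>
<br>
<pre>
import subprocess
from pathlib import Path


def build_commands(folder, converter="abiword"):
    """Return one command (as an argument list) per .doc file in folder."""
    doc_files = sorted(Path(folder).glob("*.doc"))
    # Same options as the one-liner above, but one file per call.
    # Assumption: the converter is reachable under this name on the PATH.
    return [[converter, "-t", "txt", str(doc)] for doc in doc_files]


def convert_all(folder, converter="abiword"):
    """Run the converter on every .doc file; return the files that failed."""
    failed = []
    for command in build_commands(folder, converter):
        result = subprocess.run(command)
        if result.returncode != 0:
            # Remember the input file so it can be checked by hand later.
            failed.append(command[-1])
    return failed
</pre>
<br>
Calling convert_all("word_files") would then run the converter once
per file and return the names of any files it could not convert.<br>
<br>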
Google seems to tell me there are plenty of Windows versions of
AbiWord. If I had to do this right now, I would certainly compare the
output of both LibreOffice and AbiWord.<br>
<br>
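One rough way to make that comparison is to convert the same files
with both tools into two separate folders and score how similar each
pair of .txt files is. The sketch below is an illustration only: the
function names are hypothetical, it assumes both outputs can be read
as UTF-8, and it uses Python's difflib to compare line by line:<br>
<br>
<pre>
import difflib
from pathlib import Path


def read_lines(path):
    # Assumption: the text is UTF-8; errors="replace" keeps going if not.
    text = Path(path).read_text(encoding="utf-8", errors="replace")
    return text.splitlines()


def similarity(path_a, path_b):
    """Return a ratio between 0.0 and 1.0; 1.0 means identical lines."""
    matcher = difflib.SequenceMatcher(None, read_lines(path_a), read_lines(path_b))
    return matcher.ratio()


def compare_folders(folder_a, folder_b):
    """Map each .txt name found in both folders to its similarity ratio."""
    scores = {}
    for file_a in sorted(Path(folder_a).glob("*.txt")):
        file_b = Path(folder_b) / file_a.name
        if file_b.exists():
            scores[file_a.name] = similarity(file_a, file_b)
    return scores
</pre>
<br>
With two hypothetical folders, compare_folders("txt_libreoffice",
"txt_abiword") would return a dictionary of file names and ratios;
the pairs with the lowest ratios are the ones worth opening by hand.<br>
<br>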
As it is, I do not recall how one would go about doing this in a GUI
environment, unless there is an actual batch-conversion option in one
of the programs' menus.<br>
<br>
Seems you have several options. Do let us know which worked best
;0)<br>
<br>
Regards,<br>
<br>
Martin<br>
<br>
<br>
On 10/26/2012 05:19 PM, Tristan Miller wrote:<br>
</div>
<blockquote
cite="mid:508AAA07.4060303@ukp.informatik.tu-darmstadt.de"
type="cite">
<pre wrap="">Dear Sara,
On 26/10/12 02:03 PM, Sara Berlanda wrote:
</pre>
<blockquote type="cite">
<pre wrap="">can anybody advise me about a tool which can convert 3000
Word files (.doc) into 3000 .txt files at once? The
tool should run on Windows 7 platform.
</pre>
</blockquote>
<pre wrap="">
LibreOffice <a class="moz-txt-link-rfc2396E" href="http://www.libreoffice.org/">&lt;http://www.libreoffice.org/&gt;</a> can do this when invoked from
the command line with appropriate parameters.

The following works for me on GNU/Linux, though the call should be
similar or identical on Windows 7:

libreoffice --headless --convert-to txt:text *.doc

Regards,

Tristan
</pre>
</blockquote>
<br>
</body>
</html>