[Corpora-List] Tools for batch conversion Word to UTF-8.

Josep M. Fontana josepm.fontana at upf.edu
Fri Feb 10 07:55:43 UTC 2012


Great. Thanks Andrew.

JM
> Hi Josep,
>
> The script itself is not specific to any version of Word (I tested it in Word 2007, FWIW). The important question is whether the *version of Word you are using* is compatible with all your different docs. Fortunately, if you have an up-to-date version of Word, it will happily open docs created by that version or by any earlier version.
>
> HOWEVER, if you have both "doc" and "docx" formats, you will need to modify the string in the macro that specifies what files you want (e.g. to "*.doc*") to catch both forms of the file extension.
>
> best
>
> Andrew.
>
> -----Original Message-----
> From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Josep M. Fontana
> Sent: 09 February 2012 20:19
> To: Adam Radziszewski
> Cc: corpora at uib.no
> Subject: Re: [Corpora-List] Tools for batch conversion Word to UTF-8.
>
> Thanks to everybody who responded. We will certainly try all the options, since none of the tools we have tried so far did an acceptable job.
>
> Adam: if you can send me the script, I'd really appreciate it. We've checked the documentation for PyUNO and it is not all that clear, so your scripts might give us a good head start.
>
> Andrew: does your macro work with all versions of Word? We have documents created with different Word versions.
>
> JM
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
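
Andrew's point about widening the file pattern from "*.doc" to "*.doc*" can be illustrated outside Word. The following is only a sketch in Python, not the macro discussed in this thread: the function name select_word_files and the sample file names are made up for illustration, and the matching is done with Python's standard fnmatch module rather than with Word's own file lookup.

```python
from fnmatch import fnmatchcase


def select_word_files(names, pattern="*.doc*"):
    """Return the names matching a shell-style wildcard pattern, ignoring case.

    Hypothetical helper for illustration; it is not part of any Word macro.
    """
    return [n for n in names if fnmatchcase(n.lower(), pattern.lower())]


# Sample file names, invented for this example.
names = ["a.doc", "b.docx", "c.DOCX", "notes.txt", "d.docm"]

# The narrow pattern picks up only the old binary format.
only_doc = select_word_files(names, "*.doc")

# The wider pattern picks up .doc and .docx, and anything else
# whose extension starts with "doc" (for example .docm).
both = select_word_files(names, "*.doc*")

print(only_doc)
print(both)
```

Note that the wider pattern is deliberately loose: it also catches extensions such as ".docm", so a folder holding other "doc"-prefixed files may need a stricter filter.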




