Converting doc transcripts to CHAT; Cyrillic alphabet; inconvenient special symbols

Leonid Spektor spektor at andrew.cmu.edu
Wed Mar 25 04:29:12 UTC 2009


Susanna,

    If cut and paste doesn't work, try saving your Windows .doc documents
as plain text from MS Word, using the "File->Save As..." menu:

1. Add an "@UTF8" line at the top of the file. It should be on the very
first line, by itself.

2. Select the "File->Save As..." menu. In the "Save As" dialog, choose
"Plain Text (.txt)" from the "Format:" or "Save as type" pop-up menu and
press the "Save" button.

3. You should then see the "File Conversion" dialog box. Click on the
"Other encoding:" box, select the "Unicode UTF-8" encoding, and press OK.

4. Next, change the extension of the file to ".cha" and open it with CLAN.
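
For files that have already been exported as plain text in some other
encoding, the "@UTF8" header, the UTF-8 encoding, and the ".cha" extension
could also be handled by a script. The following is only a minimal Python
sketch, not part of CLAN: the function name text_to_cha and the default
source encoding cp1251 (a common Windows code page for Cyrillic) are
assumptions for illustration, so set the encoding to whatever your export
actually used.

```python
from pathlib import Path


def text_to_cha(src, source_encoding="cp1251"):
    """Re-encode a plain-text export as UTF-8 and save it next to the
    original with a .cha extension.

    `source_encoding` is an assumption; pass the encoding your file
    was really saved in (for example "utf-16" or "utf-8").
    """
    src = Path(src)
    text = src.read_bytes().decode(source_encoding)

    # Drop a leading byte-order mark if one survived decoding.
    text = text.lstrip("\ufeff")

    # Normalise Windows and old Mac line endings to "\n".
    text = text.replace("\r\n", "\n").replace("\r", "\n")

    # "@UTF8" must be on the very first line, by itself.
    if not text.startswith("@UTF8"):
        text = "@UTF8\n" + text

    dst = src.with_suffix(".cha")
    dst.write_bytes(text.encode("utf-8"))
    return dst
```

Calling text_to_cha("sample.txt") would write "sample.cha" in the same
folder. The script only changes the encoding, the first line, and the
extension; it does not check that the transcript is otherwise valid CHAT.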

If the file doesn't look the same as it does in MS Word, then please send us
one sample doc file that you are trying to convert, and we'll try to work
out a procedure for the conversion. You can send the sample file directly
to me instead of chibolts at googlegroups.com.

Leonid.


On 24-03-09 17:51, "bartsch" <bartsch at zas.gwz-berlin.de> wrote:

> 
> Dear all,
> 
> On behalf of a colleague working with a large corpus of Bulgarian
> transcripts, I'd like to ask for help with regard to the following issues:
> 
> 1. Is there any efficient way to painlessly convert transcripts originally
> written with Word (.doc) to CHAT transcripts? Which kind of corrections
> might be necessary after converting?
> 
> 2. We attempted to produce CHAT transcripts by copying & pasting from Word
> files to CHAT files. One problem was that the Cyrillic letters got lost in
> the copy & paste action. We are wondering whether this has something to do
> with the computer being used. I didn't have this problem with my PC (Windows
> Vista), while my colleague has a Mac (OS X 10.4.11).
> 
> 3. Another problem observed when copying & pasting was the generation of
> unwanted symbols in the CHAT file.
> 
> Any suggestions would be greatly appreciated.
> 
> Thanks and kind regards,
> Susanna
> 


