Text converter

Danko Sipka sipkadan at hum.amu.edu.pl
Wed Jan 17 09:47:37 UTC 1996


In the anonymous text archive

ftp.amu.edu.pl/pub/Serbo-Croat (alias: math.amu.edu.pl/pub/Serbo-Croat)

there is the file convert.zip containing convert.exe particularly
useful for Windows users who want to work with Slavic texts
from different Internet archives.

You can also approach this archive from my Web page:

http://www.amu.edu.pl/~sipkadan/ja.htm

The other files in the Serbo-Croatian archive can also
be of interest. See 00-index.txt for the contents.

These web pages might be interesing:

http://www.amu.edu.pl/~sipkadan/vuk.html (about Vuk Karadzic)
http://www.amu.edu.pl/~sipkadan/mig.html (Minimal Information Grammar)

Some of you might want to subscribe to the discussion list
about Serbian terminology. Send:

subscribe ST-L Yourname Yoursurname

to listserv at math.amu.edu.pl

Here is what you can use convert.exe for.

USING CONVERT WITH SLAVIC TEXT ARCHIVES
=======================================

This program converts text files between different standards.
You need Windows to run it. You can use this program to create
your own conversion table, by changing generic conversion table
called convert.cpc to fit your needs.

There are also several conversion tables particularly useful for
those Windows users interested in using Slavic text from the
net, while not wasting their time and nerves on figuring out
how to convert them. Here are the contents and what you can
do with it. Let me add that code page 1251 is the one used in
Windows (you only need to choose ttf ending in Cyr or CE to see
Slavic Cyrillic or Latin fonts), 852 is the DOS code page containing
Latin characters for Slavic languages, and 866 the DOS code page with
Cyrillic characters.

CONVERT  EXE
  - program Convert (you need Windows to run it)
CONV  EXE
  - DOS version of CONVERT  EXE
CONVERT  DOC
  - instructions on how to use the program
CONVERT  TXT
  - this file
SLW-SCW  CPC
  - Serbian Latin -> Serbian Cyrillic
MORK-WFW CPC
  - Files from Prof. Moerk's Serbo-Croatian Text Corpus -> cp 1251
RCW-KO7  CPC
  - cp 1251 Russian Cyrillic -> KOI 7 Latin
FOW-RCW  CPC
  - Files from George Fowler's Russian Corpus -> cp 1251 Russian Cyrillic
KO7-RCW  CPC
  - KOI 7 Latin -> cp 1251 Russian Cyrillic
852-1250 CPC
  - cp 852 -> cp 1250
ANSI-437 CPC
  - ansi standard -> cp 437
437-ANSI CPC
  - cp 437 - > ansii standard
CONVERT  CPC
  - generic conversion table
850-ANSI CPC
  - cp 850 -> ansi standard
1251-866 CPC
  - cp 1251 -> cp 866
1250-852 CPC
  - cp 1251 -> cp 852
866-1251 CPC
  - cp 866 -> cp 1251
(this one is useful for the files from ftp.funet.fi
 /pub/culture/russian  archives. First you convert it
from Unix to DOS, and after that you use 866-1251.cpc)
README   TXT
  - Info on free downlodable dictionaries (Russian, Polish,
    Croatian, Serbian...)


-------------------------------------------------------------------------
Dr. Danko Sipka
Slavic Department, Adam Mickiewicz University, Poznan
Slavic Department, University of Wroclaw, Wroclaw
Institute of Philosophy and Sociology, Polish Academy of Sciences, Warsaw
Translation Experts Ltd, Polish Division, Poznan
-------------------------------------------------------------------------
e-mail:     sipkadan at hum.amu.edu.pl or sipkadan at plpuam11.amu.edu.pl
www:        http://www.amu.edu.pl/~sipkadan/ja.htm
phone/fax:  ++48-61-535-143
mail:       Strzelecka 50 m. 6, PL-61-846 Poznan, Poland
-------------------------------------------------------------------------



More information about the SEELANG mailing list