<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">

<HTML><HEAD>

<META content="text/html; charset=iso-8859-1" http-equiv=Content-Type>

<META name=GENERATOR content="MSHTML 8.00.6001.18876">

<STYLE></STYLE>

</HEAD>

<BODY bgColor=#ffffff>

<DIV><FONT size=2 face=Arial>Dear Corpora members, when someone asked for 

transcribed or transliterated texts Corpora members advised him English texts in 

the usual English orthography, just like in Gutenberg collection. May be I don't 

understand English well enough since I have been studying it for 55 years only, 

but I understood that the person has the same problem as I had, have and is 

going to have, i.e. the texts in tanscription. One can call it a Phonetic 

Corpora, as I do. As far as I understand there is not any reliable programme to 

transcribe texts by a computer. Anyway, all our texts were transcribed by hand. 

And then checked and rechecked. Computer transcribe texts with many mistakes. We 

found it for English, Russian, German and many more of the 168 languages we put 

into our Phonetic Corpora. Now, it looks that our Corpora allow us to compare 

all world fed into it with the help of some 9 phonetic features. Thus, we could 

tell Russian from Belorussian, and the rest Slavonic languages, basing just on 

the Sound Picture of it. In fact, the sound picture of a language is its 

Phonetic Corpus. However, if the Corpora Members really know the transcribed 

corpora of any language, I for one, would be very much interested. Please, let 

me know by sending me a message to <A 

href="mailto:yutamb@mail.ru">yutamb@mail.ru</A>  Be WELL, remain yours most 

sincerely and phonetically Yuri Tambovtsev, Novosibirsk, Russia 

</FONT></DIV></BODY></HTML>