[Corpora-List] Density of Language Taxa

Yuri Tambovtsev yutamb at mail.cis.ru
Tue May 9 12:32:34 UTC 2006


Dear Corpora colleagues, please comment on the following: 
Yuri Tambovtsev, Novosibirsk Pedagog. University, Russia. 
yutamb at mail.ru 
Dispersion of the Uralic language taxon from a typological viewpoint.
   The goal of this research was to compute the similarity of the 
distribution of 8 consonantal groups (labial, front, palatal, back, 
sonorant, occlusive, fricative and voiced) in the speech sound chains 
of different world languages. The value of the coefficient of variance 
was chosen as the measure of similarity. Let us analyse the values in 
some language taxa: groups, families and super-families. 
The Value of the Mean of the Coefficient of Variance (V%). 
Ugric group (5 languages) - V%= 27.66%  
Volgaic group (4) - V% = 17.90%
Baltic-Finnic group (7) = 23.24%
Finno-Ugric family (20) = 23.91%
Samoyedic family (4) = 16.30%
Uralic super-family (24) = 28.31%. 
The value of the mean of the coefficient of variance of the Ugric 
group (27.66%) is really great. We can compare it to the analogical 
means of the groups of the Indo-European family: Baltic (2 languages) 
- 9.08%; Iranian (8 languages) - 11.69%; Slavonic (12 languages) -
15.78%; Indic -20.40%; Germanic (6 languages) - 24.51%.
It is possibele to explain the great value of dispersion of
the Ugric group by the fact that the structure of the Hungarian 
speech sound chain is too different from those of Mansi and Hanty.
The fact that the value of the mean of the coefficient of variance in the 
Samoyedic language taxon may tell us that the languages of the 
Samoyedic origin are more typologically similar, than those of Indic 
or Germanic origin. If we unite the Finno-Ugric languages (23.91%) 
and the Samoyedic languages (16.30%) into one language taxon, 
called Uralic, then the dispersion increases to 28.31%, which is much 
greater than those of the Finno-Ugric and Samoyedic families taken 
separately. It means that typologically these two parts are quite 
different. This is why, one should be cautious to unite them. They 
seem quite different from the point of view of the distribution of the 
consonants in their speech chains. Usually, genetically related 
languages have similar speech sound chains, that is, they are 
typologically close. Basing on the typological data, it is possible to 
suppose that Finno-Ugric and Samoyedic languages have gone into 
different directions and this distance is rather great. 
I'd like to hear comments of colleagues concerning the distances 
between the languages inside the language groups, families and super-
families based on the typological data. I wish I could co-operate with 
the linguists who may be interested in my method. It is possible to 
study the density and dispersion of the language taxa of American 
Indian language taxa or the taxa of the Aboriginal languages of 
Australia, etc., etc. Looking forward to hearing from you soon to  
yutamb at mail.ru  Remain yours sincerely Yuri Tambovtsev, Novosibirsk 
Pedagog. University, Novosibirsk, Russia. 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20060509/c17c2bfe/attachment.htm>


More information about the Corpora mailing list