<div dir="ltr"><div>Dear Sebastian,</div><div><br></div><div>The Spanish text-based statistics are misleading. The high frequency of /e/, /a/ and /o/ is due to common inflectional suffixes (for gendered forms: masculine o, feminine a and general e; and for verbs: first person o, and other-persons a or e). If you want to consider "token" frequency, then the most appropriate place to look might be the lexicon, counting roots rather than tokens in texts, because texts show an overwhelming skew from function words and common inflectional suffixes. (Of course you might argue that there is some underlying reason why certain vowels are more likely to appear in grammaticalized forms.) For comparison, you would find that /i/ is much more common in Italian, simply because it is a second-person singular verb suffix and a masculine plural noun suffix, while /u/ does not serve any similarly frequent role, so it would be as infrequent as in Spanish. Yet Spanish and Italian have broadly similar phonology. (One further issue in Spanish is that &lt;i&gt; and &lt;u&gt; are used as semi-vowels in diphthongs, so you'd have to decide whether you're counting those or not. 
Their frequencies would be lower if you exclude that context.)</div><div><br></div><div>Daniel<br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Sep 14, 2020 at 3:14 AM Mark Donohue <<a href="mailto:mhdonohue@gmail.com" target="_blank">mhdonohue@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">Checking my database of 6950 languages/varieties, I get the following figures (interpreting your question somewhat, keeping the categories [HIGH], [LOW], [FRONT], [BACK] distinct):</font></span></div><div dir="ltr"><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>Taking the vowel set to be limited to [</span></font><span style="color:rgb(0,0,0)">ieɛæaɐɑɔou</span><span style="color:rgb(0,0,0)">]</span></font></div><div dir="ltr"><font face="arial, sans-serif" color="#000000"><span>Front vowels: 15,127 occurrences</span></font></div><div dir="ltr"><font face="arial, 
sans-serif"><font color="#000000"><span>(i, e, </span></font><span style="color:rgb(0,0,0)">ɛ, </span><span style="color:rgb(0,0,0)">æ</span><span style="color:rgb(0,0,0)">)</span></font></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">Back vowels: 14,762 occurrences</font></span></div><div><font face="arial, sans-serif"><span style="color:rgb(0,0,0)">(</span><span style="color:rgb(0,0,0)">ɑ, ɔ, o, u</span><span style="color:rgb(0,0,0)">)</span></font></div><div><font face="arial, sans-serif" color="#000000"><span>High vowels: 13,252 occurrences</span></font></div><div><font face="arial, sans-serif" color="#000000"><span>(i, u)</span></font></div><div><font face="arial, sans-serif" color="#000000"><span>Mid vowels: 15,768 occurrences</span></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>(</span></font><span style="color:rgb(0,0,0)">e, ɛ, ɔ, o</span><span style="color:rgb(0,0,0)">)</span></font></div><div><font face="arial, sans-serif" color="#000000"><span>Low vowels: 7,779 occurrences</span></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>(</span></font><span style="color:rgb(0,0,0)">æ, a, ɐ, ɑ</span><span style="color:rgb(0,0,0)">)</span></font></div><div dir="ltr"><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><font face="arial, sans-serif" color="#000000"><span>I, for one, don't find this very helpful.</span></font></div><div><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>Sebastian's questions are more easily answered if we look at i</span></font><span style="color:rgb(0,0,0)">ndividual frequencies:</span></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>(note: when there is no more explicit information, and no contrast between </span></font><span style="color:rgb(0,0,0)">e </span><font color="#000000">and</font><span 
style="color:rgb(0,0,0)"> ɛ</span><span style="color:rgb(0,0,0)">, or </span><span style="color:rgb(0,0,0)">o</span><span style="color:rgb(0,0,0)"> and </span><span style="color:rgb(0,0,0)">ɔ</span><span style="color:rgb(0,0,0)">, they are counted as [</span><span style="color:rgb(0,0,0)">ɛ, ɔ</span><span style="color:rgb(0,0,0)">].)</span></font></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">i 6,766 97%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">e 1,846 27%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">ɛ 6,007 86%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">æ 508 7%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">a 6,688 96%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">ɐ 95 1%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">ɑ 361 5%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">ɔ 5,787 83%</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">o 2,128 31%</font></span></div><div><font face="arial, sans-serif"><span style="color:rgb(0,0,0)">u 6,486 93%</span><font color="#000000"><span><br></span></font></font></div><div dir="ltr"><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><font face="arial, sans-serif" color="#000000"><span>And we really should do it separately for different types of vowel systems.</span></font></div><div><font face="arial, sans-serif" color="#000000"><span>For example, there are 2,037 languages in the database with 5 contrasts in quality (in short vowels)</span></font></div><div><font face="arial, sans-serif" color="#000000"><span>The most common is </span></font></div><div><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><table 
style="color:rgb(0,0,0);border-collapse:collapse;width:180pt" width="240" cellspacing="0" cellpadding="0" border="0"><colgroup><col style="width:30pt" width="40" span="6"></colgroup><tbody><tr style="height:16pt" height="21"><td style="padding:0px;vertical-align:bottom;border:medium none;white-space:nowrap;height:16pt;width:30pt" width="40" height="21" align="left"><font face="arial, sans-serif">i</font></td><td style="padding:0px;vertical-align:bottom;border:medium none;white-space:nowrap;width:30pt" width="40" align="left"><font face="arial, sans-serif">ɛ</font></td><td style="padding:0px;vertical-align:bottom;border:medium none;white-space:nowrap;width:30pt" width="40" align="left"><font face="arial, sans-serif">a</font></td><td style="padding:0px;vertical-align:bottom;border:medium none;white-space:nowrap;width:30pt" width="40" align="left"><font face="arial, sans-serif">ɔ</font></td><td style="padding:0px;vertical-align:bottom;border:medium none;white-space:nowrap;width:30pt" width="40" align="left"><font face="arial, sans-serif">u</font></td><td style="padding:0px;vertical-align:bottom;border:medium none;white-space:nowrap;width:30pt" width="40"><font face="arial, sans-serif">1785 lgs</font></td></tr></tbody></table></div><div><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><font face="arial, sans-serif" color="#000000"><span>All balanced in terms of front/back, and high/mid.</span></font></div><div><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><font face="arial, sans-serif" color="#000000"><span>Of the remaining 252, we start to see asymmetries of the sort that Sebastian is asking about: looking at the languages that are missing just one of the vowels above, we have</span></font></div><div><font face="arial, sans-serif" color="#000000"><span><br></span></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>missing i: 2 lgs (add </span></font>ə or 
ɨ)</font></div><div><font face="arial, sans-serif"><font color="#000000"><span>missing </span></font><span style="color:rgb(0,0,0)">ɛ: 26 lgs (12 with </span><span style="color:rgb(0,0,0)">ɨ, 5 with </span><span style="color:rgb(0,0,0)">ə, 2 with </span>æ, 2 with y, one with ɐ and one with ɯ<span style="color:rgb(0,0,0)">)</span></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>missing a: 15 lgs (9 with </span></font><span style="color:rgb(0,0,0)">æ, 5 with </span><span style="color:rgb(0,0,0)">ɑ, 1 with </span><span style="color:rgb(0,0,0)">ɐ</span><span style="color:rgb(0,0,0)">)</span></font></div><div><font face="arial, sans-serif">missing <span style="color:rgb(0,0,0)">ɔ: 62 lgs (4 with o, 23 with </span><span style="color:rgb(0,0,0)">ɨ, 24 with </span><span style="color:rgb(0,0,0)">ə, 5 with </span>ɒ, 4 with ɤ, 3 with <span style="color:rgb(0,0,0)">æ, 3 with y, 3 with </span>œ, 1 with ʌ and 1 with ø<span style="color:rgb(0,0,0)">)</span></font></div><div></div><div><font face="arial, sans-serif"><font color="#000000"><span>missing u: 28 lgs (11 with </span></font>ɯ, 11 with <span style="color:rgb(0,0,0)">ɨ, 2 with </span>ʉ, 1 each with <span style="color:rgb(0,0,0)">ɤ, </span>ɪ, <span style="color:rgb(0,0,0)">ə and o</span><span style="color:rgb(0,0,0)">)</span></font></div><div><font face="arial, sans-serif"><br></font></div><div><font face="arial, sans-serif"><font color="#000000"><span>If we look at missing-2-of the i-</span></font><span style="color:rgb(0,0,0)">ɛ-a-</span><span style="color:rgb(0,0,0)">ɔ-u set, of which there are 76 languages, we find that the most common pattern involves </span><span style="color:rgb(0,0,0)">missing </span><span style="color:rgb(0,0,0)">ɔ and u: 30 languages (14 have o and </span><span style="color:rgb(0,0,0)">ɨ, 6 have o and </span><span style="color:rgb(0,0,0)">ɯ, 5 have o and </span><span style="color:rgb(0,0,0)">ə, and a variety of minority patterns</span><span 
style="color:rgb(0,0,0)">)</span></font></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif"><br></font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">If we look at 3-vowel systems, i-a-u is the most common pattern, but i-a-o is pretty frequent as well, and dominant in some parts of the world (see Ross and Donohue 2011).</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif"><br></font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">The point is that we need to look at these things in terms of systems. It's clear that losing/substituting a canonical back vowel is more common than losing/substituting a front vowel, and that losing/substituting a mid vowel is more common than losing/substituting a high vowel. "Losing/substituting" the low vowel pretty much always means that the vowel is more explicitly front or back, but still low, so losing a low vowel from the system isn't really a thing that languages do. (We can note that there are 16 three-vowel systems with no high vowels, but only 12 with no low vowels, generally with a schwa.)</font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif"><br></font></span></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">-Mark</font></span></div><div><font face="arial, sans-serif"><br></font></div><div><span style="color:rgb(0,0,0)"><font face="arial, sans-serif">Ross, Bill, and Mark Donohue. 2011. The many origins of diversity and complexity in phonology. 
<i>Linguistic Typology</i> 15: 251-265.</font></span></div><div><span style="color:rgb(0,0,0);font-family:-webkit-standard;font-size:medium"><br></span></div><div dir="ltr"><div style="color:rgb(0,0,0)"><font face="arial, sans-serif"><br></font></div></div></div></div><font face="arial, sans-serif"><br></font><div class="gmail_quote"><div dir="ltr" class="gmail_attr"><font face="arial, sans-serif">On Mon, 14 Sep 2020 at 18:18, Sebastian Nordhoff &lt;<a href="mailto:sebastian.nordhoff@glottotopia.de" target="_blank">sebastian.nordhoff@glottotopia.de</a>&gt; wrote:<br></font></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><font face="arial, sans-serif">Dear list members,<br>
do we have any information about the cross-linguistic validity of the <br>
following hypotheses?<br>
<br>
1) front vowels like /i/, /e/ are more frequent than back vowels like <br>
/u/, /o/<br>
<br>
2) high vowels like /i/, /u/ are more frequent than mid vowels like <br>
/e/, /o/.<br>
<br>
3) "corner vowels" /a/, /i/, /u/ are more frequent than anything else.<br>
<br>
I am interested in information about types (phonemic inventory) as well <br>
as tokens (counts in texts).<br>
<br>
Best wishes and thank you for your time<br>
Sebastian<br>
<br>
<br>
<br>
<br>
<br>
<br>
_______________________________________________<br>
Lingtyp mailing list<br>
<a href="mailto:Lingtyp@listserv.linguistlist.org" target="_blank">Lingtyp@listserv.linguistlist.org</a><br>
<a href="http://listserv.linguistlist.org/mailman/listinfo/lingtyp" rel="noreferrer" target="_blank">http://listserv.linguistlist.org/mailman/listinfo/lingtyp</a></font><br>
</blockquote></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div>
</blockquote></div>
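<div>A minimal sketch, not part of the original messages: it recomputes the percentages and the class totals from the per-vowel counts quoted in Mark's message above. All names in it (COUNTS, CLASSES, share, class_sum, mismatches) are illustrative assumptions, and the only data used are the figures given in the thread.</div>

```python
# Per-vowel counts as quoted in the thread: the number of varieties whose
# inventory contains the vowel, out of 6950 varieties in the database.
TOTAL_VARIETIES = 6950
COUNTS = {
    "i": 6766, "e": 1846, "ɛ": 6007, "æ": 508, "a": 6688,
    "ɐ": 95, "ɑ": 361, "ɔ": 5787, "o": 2128, "u": 6486,
}

# Class membership and class totals, also as quoted in the thread.
CLASSES = {
    "front": (["i", "e", "ɛ", "æ"], 15127),
    "back": (["ɑ", "ɔ", "o", "u"], 14762),
    "high": (["i", "u"], 13252),
    "mid": (["e", "ɛ", "ɔ", "o"], 15768),
    "low": (["æ", "a", "ɐ", "ɑ"], 7779),
}


def share(vowel):
    """Percentage of varieties containing the vowel, rounded to an integer."""
    return round(100 * COUNTS[vowel] / TOTAL_VARIETIES)


def class_sum(members):
    """Sum of the per-vowel counts for the vowels in one class."""
    return sum(COUNTS[v] for v in members)


def mismatches():
    """Classes whose quoted total differs from the sum of their members.

    Maps class name to (recomputed sum, quoted total).
    """
    result = {}
    for name, (members, quoted) in CLASSES.items():
        recomputed = class_sum(members)
        if recomputed != quoted:
            result[name] = (recomputed, quoted)
    return result


if __name__ == "__main__":
    for vowel, count in COUNTS.items():
        print(vowel, count, f"{share(vowel)}%")
    print(mismatches())
```

<div>Under these assumptions the recomputed percentages agree with the quoted ones, and the front, back, high and mid totals equal the sums of their members. The low-vowel members sum to 7,652 rather than the quoted 7,779; the thread does not say where the difference comes from.</div>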