<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<title></title>
</head>
<body>
<div name="messageBodySection">
<div dir="auto">Dear Matías,<br>
<br>
I see your point, and my excuses for misunderstanding how substitution/transposition works.<br>
What could be a solution is to weigh the measures differently - for example 0.5 point for each transposition, 1 point for each substitution, and so on. </div>
</div>
<div name="messageSignatureSection"><br>
Regards,
<div dir="auto">Ian</div>
</div>
<div name="messageReplySection">On 9 May 2021, 4:58 PM +0800, Matías GN <mortem.dei@gmail.com>, wrote:<br>
<blockquote type="cite" style="border-left-color: grey; border-left-width: thin; border-left-style: solid; margin: 5px 5px;padding-left: 10px;">
<div dir="ltr">
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
but I would ask: isn't word order one of the most important part of syntax?</blockquote>
<div><br>
</div>
<div>what you'd be saying is that English' with everything identical to English but SOV order would be more similar to Japanese than to English.</div>
<div>If that's what you want then that's fine.<br>
</div>
<div><br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div>So for example, if we compare two languages who use the same words, but one in the order of SVO, and another in the order of OVS, the distance will be 2 (two transpositions).</div>
</blockquote>
<div><br>
</div>
<div>No. You get two substitutions, not transpositions. To get two transpositions you'd need:
<i>abcd</i> and <i>badc</i>.</div>
<div><br>
</div>
<div><br>
</div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">El dom, 9 de may. de 2021 a la(s) 10:43, JOO, Ian [Student] (<a href="mailto:ian.joo@connect.polyu.hk">ian.joo@connect.polyu.hk</a>) escribió:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div>
<div name="messageBodySection">
<div dir="auto">Dear Matías,<br>
<br>
that is a good point, but I would ask: isn't word order one of the most important part of syntax?<br>
So for example, if we compare two languages who use the same words, but one in the order of SVO, and another in the order of OVS, the distance will be 2 (two transpositions).<br>
If we compare two languages that both have the same SVO structure, but happen to use different words for V and O, the distance will be 2 (two substitutions).<br>
In this situation, can we really say that the two SVO languages are much more similar to each other than the SVO and OVS languages are? I think that is debatable.</div>
</div>
<div name="messageSignatureSection"><br>
Regards,
<div dir="auto">Ian</div>
</div>
<div name="messageReplySection">On 9 May 2021, 4:36 PM +0800, Matías GN <<a href="mailto:mortem.dei@gmail.com" target="_blank">mortem.dei@gmail.com</a>>, wrote:<br>
<blockquote type="cite" style="border-left:thin solid grey;margin:5px;padding-left:10px">
<div dir="ltr">
<div>Dear Ian,</div>
<div><br>
</div>
<div>using edit distances makes little sense because edit distances are highly sensitive to ordering.</div>
<div>They are, after all, used for finding optimal alignments.<br>
</div>
<div>The distance between <i>abcde</i> and <i>edcba</i> is 4, which is the same as the distance between
<i>abcde</i> and <i>afglk</i>, but clearly <i>abcde</i> and <i>edcba</i> are <i>much</i> more similar than
<i>abcde</i> is to <i>afglk</i> for obvious reasons.</div>
<div>Whatever you end up measuring will be heavily biased towards word order.</div>
<div><br>
</div>
<div>Bes,<br>
</div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">El dom, 9 de may. de 2021 a la(s) 10:23, JOO, Ian [Student] (<a href="mailto:ian.joo@connect.polyu.hk" target="_blank">ian.joo@connect.polyu.hk</a>) escribió:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div>
<div name="messageBodySection">
<div dir="auto">Dear Sandra,<br>
<br>
I think those are good points.<br>
I agree that there could be a translation bias, and the idea is to elicit the most natural, preferred style of utterance from the speaker. As Alex François recommended, it may be better to use a dialogue rather than isolated sentences, to elicit the most natural
way of speaking and avoid translation bias. And I agree that the number of 50 sentences is small and it should be expanded.<br>
As for the standardization of glosses, I try to make it into the most uniform way possible, for example all copulas being glossed as COP rather than 'be', based on Leipzig glossing rules and other common practices.<br>
As for example 1, I forgot to mention that there is another counting measure, substitution - so NOM is "substituted" into COP, because they are in the same position, so that counts as 1. I'm not 100% sure what counts as the same position, I just let R calculate
that, so I should have a closer look into it.<br>
</div>
</div>
<div name="messageSignatureSection"><br>
Regards,
<div dir="auto">Ian</div>
</div>
<div name="messageReplySection">
<div dir="auto">On 9 May 2021, 3:25 PM +0800, Sandra Auderset <<a href="mailto:sandrauderset@gmail.com" target="_blank">sandrauderset@gmail.com</a>>, wrote:</div>
<blockquote style="border-left:thin solid rgb(26,188,156);margin:5px;padding-left:10px">
<img src="https://read-receipts.canarymail.io:8100/track/751F2D81C98B64244A1B80D2AD97DBC4_D79E3658BBA3D07A60FC38740E7A46A2.png" width="0px" height="0px">Hi Ian,<br>
<br>
Following up on Hartmut, Yunfan and others, I have some questions:<br>
• What do you do with variation? I’m not familiar with the languages you work on, but ’Tense Future’ in German could be translated as “Ich werde morgen gehen” or “Ich gehe morgen”. The latter would be more frequent in spoken language, but you might get the
former because of translation bias. Would you include both? You say that your method accounts for the choice of the speaker, but again I wonder if this isn’t just translation bias.<br>
• You say that this method has the advantage of including more frequently observed features. I wonder how you know whether that’s the case or not? Do you mean in spoken or written language? As Yunfan pointed out, with 50 sentences you might easily miss some
common features.<br>
• How do you standardize the glosses? For example, how do you decide whether something should be glossed as ‘be’ or copula? That seems important to me, since glossing is very subjective and you might inadvertendly bias the whole calculation. Especially since
you already wrote up the conclusion.<br>
• Lastly, I find it odd that Example 2) is calculated as having distance 1. To me, there are two differences: presence/absence of nominative and the presence/absence of a copula. How do you determine that the copula is in the same slot as the nominative for
calculation?<br>
<br>
Best,<br>
Sandra<br>
<br>
<br>
<span style="font-family:Helvetica">—</span><a style="font-family:Tahoma;font-size:12px" href="https://sauderset.github.io/" target="_blank"><strong>Sandra Auderset</strong></a><span style="font-family:Tahoma;font-size:12px">PhD Candidate | [she/her]</span><span style="font-family:Tahoma;font-size:12px">Department
of Linguistic and Cultural Evolution</span><span style="font-family:Tahoma;font-size:12px">MPI for Evolutionary Anthropology</span><span style="font-family:Tahoma;font-size:12px">&</span><span style="font-family:Tahoma;font-size:12px">Department of Linguistics</span><span style="font-family:Tahoma;font-size:12px">University
of California Santa Barbara</span><br>
<blockquote style="border-left:thin solid rgb(230,126,34);margin:5px;padding-left:10px">
On Saturday, May 08, 2021 at 19:16, Hartmut Haberland <<a href="https://mailto:hartmut@ruc.dk" target="_blank">hartmut@ruc.dk</a>> wrote:<br>
<span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">Dear Ian,</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">I have a few comments.</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">I
was wondering about</span>
<table style="width:197pt;border-collapse:collapse" width="0" cellspacing="0" cellpadding="0" border="0">
<tbody>
<tr style="height:14pt">
<td style="width:96pt;padding:0cm 3.5pt;height:14pt" width="128" nowrap="nowrap">
<p class="MsoNormal"><span style="font-family:"Courier New";color:black">Genitive</span></p>
</td>
<td style="width:101pt;padding:0cm 3.5pt;height:14pt" width="135" nowrap="nowrap">
<p class="MsoNormal"><span style="font-family:"Courier New";color:black">Alienable</span></p>
</td>
</tr>
<tr style="height:14pt">
<td style="width:96pt;padding:0cm 3.5pt;height:14pt" width="128" nowrap="nowrap">
<p class="MsoNormal"><span style="font-family:"Courier New";color:black">Genitive</span></p>
</td>
<td style="width:101pt;padding:0cm 3.5pt;height:14pt" width="135" nowrap="nowrap">
<p class="MsoNormal"><span style="font-family:"Courier New";color:black">Inalienable</span></p>
</td>
</tr>
</tbody>
</table>
<span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">Is it a good idea to use ‘genitive’? Would ‘possessive’ not be better?</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">Also I wonder about
languages like Finnish which express contrast between definiteness and indefiniteness by word order:</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">Auto on kadulla. ‘</span><u style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">The
car</u><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt"> is in the street.’</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">Kadulla on auto. ‘There is </span><u style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">a
car</u><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt"> in the street.’ (-ulla is inessive case.)</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">Also think of Italian</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">La
macchina è rotta.</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">È rotta la machina.</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">both ‘The car is broken’, but are answers to
different questions (Where is your car?, Why are you late?, resp.); same (SV vs. VS) in Greek. How would you get these results?</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt">Best, Hartmut</span><span style="color:rgb(31,73,125);font-family:Calibri,sans-serif;font-size:11pt"> </span><strong style="font-family:Calibri,sans-serif;font-size:11pt">Fra:</strong> <span style="font-family:Calibri,sans-serif;font-size:11pt">Lingtyp
<<a href="mailto:lingtyp-bounces@listserv.linguistlist.org" target="_blank">lingtyp-bounces@listserv.linguistlist.org</a>> </span><strong style="font-family:Calibri,sans-serif;font-size:11pt">På vegne af</strong><span style="font-family:Calibri,sans-serif;font-size:11pt"> JOO,
Ian [Student]</span><strong style="font-family:Calibri,sans-serif;font-size:11pt">Sendt:</strong><span style="font-family:Calibri,sans-serif;font-size:11pt"> 8. maj 2021 15:08</span><strong style="font-family:Calibri,sans-serif;font-size:11pt">Til:</strong><span style="font-family:Calibri,sans-serif;font-size:11pt"> LINGTYP
<<a href="mailto:lingtyp@listserv.linguistlist.org" target="_blank">lingtyp@listserv.linguistlist.org</a>></span><strong style="font-family:Calibri,sans-serif;font-size:11pt">Emne:</strong><span style="font-family:Calibri,sans-serif;font-size:11pt"> [Lingtyp]
A list of 50 basic sentences</span> <br>
Dear all,<br>
<br>
I am trying to make a list of 50 basic sentential meanings.<br>
The goal is to make parallel corpora of different languages based on this list of sentences.<br>
Each sentence on the list serves to check whether a language has a given grammatical feature, and if so, in what form the language expresses it.<br>
When creating each sentence, I tried to limit its vocabulary to basic words that are found in most languages, avoiding culture-specific words.<br>
I would appreciate it if you could have a look at the attached file and advise what I should add/remove/modify.<br>
<br>
From Hong Kong,<br>
Ian<br>
<img style="max-width: 100%; height: auto;" src="https://www.polyu.edu.hk/emaildisclaimer/PolyU_Email_Signature.jpg"><br>
<em>Disclaimer:</em><em style="color:black">This message (including any attachments) contains confidential information intended for a specific individual and purpose. If you are not the intended recipient, you should delete this message and notify the sender
and The Hong Kong Polytechnic University (the University) immediately. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited and may be unlawful.</em><em>The University specifically denies
any responsibility for the accuracy or quality of information obtained through University E-mail Facilities. Any views and opinions expressed are only those of the author(s) and do not necessarily represent those of the University and the University accepts
no liability whatsoever for any losses or damages incurred or caused to any party as a result of the use of such information.</em>_______________________________________________<br>
Lingtyp mailing list<br>
<a href="mailto:Lingtyp@listserv.linguistlist.org" target="_blank">Lingtyp@listserv.linguistlist.org</a><br>
<a href="http://listserv.linguistlist.org/mailman/listinfo/lingtyp" target="_blank">http://listserv.linguistlist.org/mailman/listinfo/lingtyp</a></blockquote>
</blockquote>
</div>
<img alt="" src="https://www.polyu.edu.hk/emaildisclaimer/PolyU_Email_Signature.jpg">
<p><br>
<em><font size="3" face="Times New Roman">Disclaimer:</font></em></p>
<p></p>
<p style="margin-left:0.5in"><i><font size="3" face="Times New Roman" color="black"><span>This message (including any attachments) contains confidential information intended for a specific individual and purpose. If you are not the intended recipient, you should
delete this message and notify the sender and The Hong Kong Polytechnic University (the University) immediately. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited and may be unlawful.</span></font></i></p>
<p style="margin-left:0.5in"><i><span><font size="3" face="Times New Roman">The University specifically denies any responsibility for the accuracy or quality of information obtained through University E-mail Facilities. Any views and opinions expressed are
only those of the author(s) and do not necessarily represent those of the University and the University accepts no liability whatsoever for any losses or damages incurred or caused to any party as a result of the use of such information.</font></span></i></p>
</div>
_______________________________________________<br>
Lingtyp mailing list<br>
<a href="mailto:Lingtyp@listserv.linguistlist.org" target="_blank">Lingtyp@listserv.linguistlist.org</a><br>
<a href="http://listserv.linguistlist.org/mailman/listinfo/lingtyp" rel="noreferrer" target="_blank">http://listserv.linguistlist.org/mailman/listinfo/lingtyp</a><br>
</blockquote>
</div>
</blockquote>
</div>
<img alt="" src="https://www.polyu.edu.hk/emaildisclaimer/PolyU_Email_Signature.jpg">
<p><br>
<em><font size="3" face="Times New Roman">Disclaimer:</font></em></p>
<p></p>
<p style="margin-left:0.5in"><i><font size="3" face="Times New Roman" color="black"><span>This message (including any attachments) contains confidential information intended for a specific individual and purpose. If you are not the intended recipient, you should
delete this message and notify the sender and The Hong Kong Polytechnic University (the University) immediately. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited and may be unlawful.</span></font></i></p>
<p style="margin-left:0.5in"><i><span><font size="3" face="Times New Roman">The University specifically denies any responsibility for the accuracy or quality of information obtained through University E-mail Facilities. Any views and opinions expressed are
only those of the author(s) and do not necessarily represent those of the University and the University accepts no liability whatsoever for any losses or damages incurred or caused to any party as a result of the use of such information.</font></span></i></p>
</div>
</blockquote>
</div>
</blockquote>
</div>
<style type="text/css">
<!--
p
{margin-top:12px;
margin-bottom:12px}
div.WordSection1
{}
--> </style><img alt="" src="https://www.polyu.edu.hk/emaildisclaimer/PolyU_Email_Signature.jpg">
<p><br>
<em><font face="Times New Roman" size="3">Disclaimer:</font></em></p>
<p></p>
<p style="margin-left: 0.5in"><i><font color="black" face="Times New Roman" size="3"><span>This message (including any attachments) contains confidential information intended for a specific individual and purpose. If you are not the intended recipient, you
should delete this message and notify the sender and The Hong Kong Polytechnic University (the University) immediately. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited and may be unlawful.</span></font></i></p>
<p style="margin-left: 0.5in"><i><span><font face="Times New Roman" size="3">The University specifically denies any responsibility for the accuracy or quality of information obtained through University E-mail Facilities. Any views and opinions expressed are
only those of the author(s) and do not necessarily represent those of the University and the University accepts no liability whatsoever for any losses or damages incurred or caused to any party as a result of the use of such information.</font></span></i></p>
</body>
</html>