<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=gb2312">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:Helvetica;
panose-1:0 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:"MS Gothic";
panose-1:2 11 6 9 7 2 5 8 2 4;}
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Verdana;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:"PingFang TC";
panose-1:2 11 4 0 0 0 0 0 0 0;}
@font-face
{font-family:"\@MS Gothic";
panose-1:2 11 6 9 7 2 5 8 2 4;}
@font-face
{font-family:"\@PingFang TC";}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:10.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
span.apple-converted-space
{mso-style-name:apple-converted-space;}
span.EmailStyle20
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-US" link="blue" vlink="purple" style="word-wrap:break-word;-webkit-nbsp-mode: space;line-break:after-white-space">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt">Dear all – Matthew’s post made me think of a possible interpretation of Randy’s point about induction where previously there was none in my mind, i.e., I failed to understand why Randi was invoking induction
and AI. Now, on the off chance that that interpretation is ballpark correct, I’d like to add the following to Matthew’s response:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">Computational linguists are currently very much debating, and empirically researching, how much neural networks really know about language. This is not merely a matter of how this knowledge is represented
– i.e., in the form of probability vectors instead of rules. Rather, it appears that this knowledge is often rather superficial. Transformers are easily thrown into making mistakes that human speakers, and even human learners, don’t make. See for example here,
and works cited therein:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><a href="http://www.acsu.buffalo.edu/~rchaves/lookatthat.pdf">http://www.acsu.buffalo.edu/~rchaves/lookatthat.pdf</a><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">Note this work is not at all coming from a rationalist/innatist bent. Rather, the point is that it seems premature to consider contemporary deep learning networks adequate representations of human linguistic
knowledge. I’m persuaded that we may eventually (and perhaps sooner rather than later) see the evolution of artificial systems that are able to acquire such knowledge. But we don’t yet seem to be able to predict how exactly such systems will work.
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">I’m willing to bet that they will rely on statistical learning, mind you. As do we humans. If this is where Randy was going by invoking induction, we are not in disagreement up to that point.
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">However, it seems to me that once artificial systems have evolved to the point where they can no longer be tricked into non-human-like generalizations by manipulations such as those used in the Chaves & Richter
paper, their linguistic knowledge may well be adequately <i>described</i> in terms of traditional grammar rules, even if this knowledge is not actually internally represented in such rules. Except perhaps those will be rules that are stated in probabilistic
rather than categorical form. Which may be a format linguists and psychologists will one day also apply to the description of human linguistic knowledge.
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">Best -- Juergen<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:Helvetica;color:black">Juergen Bohnemeyer (He/Him)<br>
Professor, Department of Linguistics<br>
University at Buffalo <br>
<br>
Office: 642 Baldy Hall, UB North Campus<br>
Mailing address: 609 Baldy Hall, Buffalo, NY 14260 <br>
Phone: (716) 645 0127 <br>
Fax: (716) 645 3825<br>
Email: </span><span style="font-size:11.0pt"><a href="mailto:jb77@buffalo.edu" title="mailto:jb77@buffalo.edu"><span style="font-size:9.0pt;font-family:Helvetica;color:#0078D4">jb77@buffalo.edu</span></a></span><span style="font-size:9.0pt;font-family:Helvetica;color:black"><br>
Web: </span><span style="font-size:11.0pt"><a href="http://www.acsu.buffalo.edu/~jb77/" title="http://www.acsu.buffalo.edu/~jb77/"><span style="font-size:9.0pt;font-family:Helvetica;color:#0563C1">http://www.acsu.buffalo.edu/~jb77/</span></a></span><span style="font-size:9.0pt;font-family:Helvetica;color:black"> <br>
<br>
</span><span style="font-size:11.0pt;color:black">Office hours Tu/Th 3:30-4:30pm in 642 Baldy or via Zoom (Meeting ID 585 520 2411; Passcode Hoorheh) </span><span style="font-size:9.0pt;font-family:Helvetica;color:black"><br>
<br>
There’s A Crack In Everything - That’s How The Light Gets In <br>
(Leonard Cohen) </span><span style="font-size:11.0pt"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">-- <o:p></o:p></span></p>
</div>
</div>
</div>
<p class="MsoNormal"><span lang="RU" style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="RU" style="font-size:11.0pt"><o:p> </o:p></span></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal" style="margin-bottom:12.0pt"><b><span style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black">Lingtyp <lingtyp-bounces@listserv.linguistlist.org> on behalf of Matthew Dryer <dryer@buffalo.edu><br>
<b>Date: </b>Saturday, December 17, 2022 at 9:44 PM<br>
<b>To: </b>Randy J. LaPolla <randy.lapolla@gmail.com>, Adam Singerman <adamsingerman@gmail.com><br>
<b>Cc: </b>lingtyp@listserv.linguistlist.org <lingtyp@listserv.linguistlist.org><br>
<b>Subject: </b>Re: [Lingtyp] spectrograms in linguistic description and for language comparison<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><span style="font-size:12.0pt">Randy,</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt">(A belated response, but the best of reasons for Lingtyp: I have been too busy at ALT to read my email the past few days.)</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt">Randy, you don’t want to use this experience with computers identifying sentences to support your case. The reason that the symbolic approach in AI didn’t work is that it proved impossible to identify sentences
without the system having access to the same world knowledge, the same knowledge of context and the same knowledge of the addressee that people use in interpreting sentences. In other words, the problem was that it is not possible to interpret sentences in
context without all that. In order for the symbolic approach to work, the system would have to have all the knowledge of the world and the context that real people have.</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt">The alternative that replaced it used induction on interpreted data to identify transitional probabilities to identify words. But what the means is that that the system only identifies what the words are;
it cannot interpret what sentences mean in context. I.e. the system doesn’t do what real people do. That’s not an argument for induction.</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:12.0pt">Matthew</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal" style="margin-bottom:12.0pt"><b><span style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black">Lingtyp <lingtyp-bounces@listserv.linguistlist.org> on behalf of Randy J. LaPolla <randy.lapolla@gmail.com><br>
<b>Date: </b>Wednesday, December 14, 2022 at 12:49 PM<br>
<b>To: </b>Adam Singerman <adamsingerman@gmail.com><br>
<b>Cc: </b>lingtyp@listserv.linguistlist.org <lingtyp@listserv.linguistlist.org><br>
<b>Subject: </b>Re: [Lingtyp] spectrograms in linguistic description and for language comparison</span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt">PS: One thing I forgot to mention about induction:</span><o:p></o:p></p>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">If you know about the history of AI, in the 70’s and 80’s the main paradigm was symbolic AI, which is rule-based. They worked very hard for many years to get the systems to parse even simple sentences, but
when it was clear to all that the rule-based approach was a failure, they experimented with a purely inductive approach, and the difference in output convinced them right away this was the way to go. After their experiments, Jeff Dean, the head of Google’s
Brain Lab, famously said, “We don’t need grammar”. Google translate is as good as it is now because of this switch to an inductive method. Of course those of us doing fieldwork will not have the large database Google has or the speed of the machines, but the
principle is the same: induction can get you there.</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">Randy</span><o:p></o:p></p>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt;color:black">——</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt;color:black">Professor Randy J. LaPolla</span><span style="font-family:"MS Gothic";color:black">(</span><span style="font-family:"PingFang TC",sans-serif;color:black">罗</span><span style="font-family:"MS Gothic";color:black">仁地</span><span style="font-family:"Verdana",sans-serif;color:black">)</span><span style="font-size:10.5pt;color:black">,
PhD FAHA </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt;color:black">Center for Language Sciences</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt;color:black">Institute for Advanced Studies in Humanities and Social Sciences</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt;color:black">Beijing Normal University at Zhuhai</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt;color:black">A302, Muduo Building, #18 Jinfeng Road, Zhuhai City, Guangdong, China</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt;color:black"><a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Frandylapolla.info%2F&data=05%7C01%7Cjb77%40buffalo.edu%7Cee38548507844ee9c51808dae0a1a736%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638069282736362779%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000%7C%7C%7C&sdata=YNygoF0AktpENrVnDO%2Fus1x1mLnKubpqW6fqJpUd%2Fpw%3D&reserved=0">https://randylapolla.info</a></span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:12.0pt;font-family:"Times New Roman",serif;color:black">ORCID ID:<span class="apple-converted-space"> </span></span><span style="font-size:12.0pt;font-family:"Times New Roman",serif;color:#494A4C;background:white"><a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Forcid.org%2F0000-0002-6100-6196&data=05%7C01%7Cjb77%40buffalo.edu%7Cee38548507844ee9c51808dae0a1a736%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638069282736362779%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000%7C%7C%7C&sdata=2FzMRwbkFUd1PP8odOv90IwfyGHVSfmzvxaaqTyGj5w%3D&reserved=0">https://orcid.org/0000-0002-6100-6196</a> </span><span style="font-size:11.0pt;color:black">
</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif;color:black">邮编</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">:</span><span style="font-size:11.5pt;color:black">519000<br>
</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">广</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif;color:black">东</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">省珠海市唐家湾</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif;color:black">镇</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">金</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif;color:black">凤</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">路</span><span style="font-size:11.5pt;color:black">18</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">号木</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif;color:black">铎</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">楼</span><span style="font-size:11.5pt;color:black">A302<br>
</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">北京</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif;color:black">师</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">范大学珠海校区</span><span style="font-size:11.5pt;color:black"><br>
</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">人文和社会科学高等研究院</span><span style="font-size:11.5pt;color:black"><br>
</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif;color:black">语</span><span style="font-size:11.5pt;font-family:"MS Gothic";color:black">言科学研究中心</span><span style="font-size:11.5pt;color:black"> </span><o:p></o:p></p>
</div>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
</div>
<p class="MsoNormal" style="margin-bottom:12.0pt"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"><br>
<br>
<br>
</span><o:p></o:p></p>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">On 14 Dec 2022, at 11:15 PM, Randy J. LaPolla <<a href="mailto:randy.lapolla@gmail.com">randy.lapolla@gmail.com</a>> wrote:</span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">Dear Adam,</span><o:p></o:p></p>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">Sorry to just be getting back to you on this.</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">We have very different conceptions of language and goals in doing linguistics (your interest in "(control vs raising structures, pied-piping, islands, gaps in inflectional paradigms, etc)” very much reflects
this, as these are not things that concern me). This is what my blog post is about, how different the choices can be. Saying I am “wrong” implies there is only one right way to do linguistics. Not to be rude, but as Popper said, "Whenever a theory appears
to you as the only possible one, take this as a sign that you have neither understood the theory nor the problem which it was intended to solve."</span><span style="font-size:12.0pt;font-family:"Times New Roman",serif"> -Karl Popper,
</span><i><span style="font-size:12.0pt;font-family:"Times New Roman",serif;mso-fareast-language:ZH-CN">Objective Knowledge: An Evolutionary Approach</span></i><span style="font-size:12.0pt;font-family:"Times New Roman",serif"> (<span style="color:black;background:white">Clarendon
Press,</span> 1972, p. 266)</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">All of our theories and methodologies are subjective attempts to achieve some goal, actually heuristics. As there are different ways of looking at the same phenomenon and for different purposes, there is no
right and wrong, only more or less useful ways of analysing the phenomena relative to some purpose, and also depend on our assumptions, definition of language, etc., as I discuss in the blog. Y. R. Chao and J. R. Firth both argued that there are different
ways to analyse language depending on the language and your purposes, including for example not using a phoneme type of representation of the phonology, but some different type. This is not just linguistics, but for example, physics as well, where light can
be understood as a wave or a particle depending on your purposes. And our understanding of the Universe has gone through many major changes, each one thought to be “right” at the time, but later overthrown by a later theory. </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">You assume that we can somehow "fully represent a given language’s grammatical possibilities”. As language is a complex adaptive system that is constantly changing, and is human behaviour and so not a finite
thing, I don’t think that it will ever be possible to "fully represent a given language’s grammatical possibilities”. One problem I see with modern linguistics is not acknowledging the tremendous diversity of usages within a single language, as the search
has been for universals and for a single tight system, which even Charles Hocket (1967) said was a “wild goose chase”. That is just for one language, never mind trying to do that for all languages, which has led to linguistics missing so much of the diversity
between languages.</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">The beauty of working inductively is that you are only responsible for what is in your data. You don’t have to make broad generalisations about the language that in many cases turn out to be problematic, you
just say this is what is and is not in my data. Of course the more data you have the stronger the generalisations you can make. I did not rule out using some stimuli such as the MPI sets, as these set up contexts that the speaker can talk about, but asking
people to translate word lists or sentences will not give you useful data. What you will get back are the categories of the working language. But again, this is part of the problem. Too many linguists think that words are translatable and mean the same thing
in different languages. This is easily shown to be false. Not only is the prototype of the cognitive category represented by the word different for different cultures (even different speakers), but the extension (the use of the word for different objects or
situations) is also different. This is true of every word in the languages. Humboldt knew this, and argued against Aristotle’s view that all people have the same object in mind even if the word is different. Humboldt said no, even if we both look at the same
horse we are seeing different things, as our cognitive categories are different.</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">All the best,</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">Randy</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt">——</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt">Professor Randy J. LaPolla</span><span style="font-family:"MS Gothic"">(</span><span style="font-family:"PingFang TC",sans-serif">罗</span><span style="font-family:"MS Gothic"">仁地</span><span style="font-family:"Verdana",sans-serif">)</span><span style="font-size:10.5pt">,
PhD FAHA </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt">Center for Language Sciences</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt">Institute for Advanced Studies in Humanities and Social Sciences</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt">Beijing Normal University at Zhuhai</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt">A302, Muduo Building, #18 Jinfeng Road, Zhuhai City, Guangdong, China</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10.5pt"><a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Frandylapolla.info%2F&data=05%7C01%7Cjb77%40buffalo.edu%7Cee38548507844ee9c51808dae0a1a736%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638069282736362779%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000%7C%7C%7C&sdata=YNygoF0AktpENrVnDO%2Fus1x1mLnKubpqW6fqJpUd%2Fpw%3D&reserved=0">https://randylapolla.info</a></span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:12.0pt;font-family:"Times New Roman",serif">ORCID ID:<span class="apple-converted-space"> </span><span style="color:#494A4C;background:white"><a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Forcid.org%2F0000-0002-6100-6196&data=05%7C01%7Cjb77%40buffalo.edu%7Cee38548507844ee9c51808dae0a1a736%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638069282736519012%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000%7C%7C%7C&sdata=yHwyN4xOcSMU98FwOOyrg9SJlsoMBI2GcuJ1n55kseE%3D&reserved=0">https://orcid.org/0000-0002-6100-6196</a> </span></span><span style="font-size:11.0pt">
</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif">邮编</span><span style="font-size:11.5pt;font-family:"MS Gothic"">:</span><span style="font-size:11.5pt">519000<br>
</span><span style="font-size:11.5pt;font-family:"MS Gothic"">广</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif">东</span><span style="font-size:11.5pt;font-family:"MS Gothic"">省珠海市唐家湾</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif">镇</span><span style="font-size:11.5pt;font-family:"MS Gothic"">金</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif">凤</span><span style="font-size:11.5pt;font-family:"MS Gothic"">路</span><span style="font-size:11.5pt">18</span><span style="font-size:11.5pt;font-family:"MS Gothic"">号木</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif">铎</span><span style="font-size:11.5pt;font-family:"MS Gothic"">楼</span><span style="font-size:11.5pt">A302<br>
</span><span style="font-size:11.5pt;font-family:"MS Gothic"">北京</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif">师</span><span style="font-size:11.5pt;font-family:"MS Gothic"">范大学珠海校区</span><span style="font-size:11.5pt"><br>
</span><span style="font-size:11.5pt;font-family:"MS Gothic"">人文和社会科学高等研究院</span><span style="font-size:11.5pt"><br>
</span><span style="font-size:11.5pt;font-family:"PingFang TC",sans-serif">语</span><span style="font-size:11.5pt;font-family:"MS Gothic"">言科学研究中心</span><span style="font-size:11.5pt"> </span><o:p></o:p></p>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"><br>
<br>
<br>
</span><o:p></o:p></p>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">On 11 Dec 2022, at 11:00 AM, Adam Singerman <<a href="mailto:adamsingerman@gmail.com">adamsingerman@gmail.com</a>> wrote:</span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt">I think Randy is wrong (sorry if this comes across as blunt) and so I<br>
am writing, on a Saturday night no less, to voice a different view.<br>
<br>
Working inductively from a corpus is great, but no corpus is ever<br>
going to be large enough to fully represent a given language's<br>
grammatical possibilities. If we limit ourselves to working<br>
inductively from corpora then many basic questions about the languages<br>
we research will go unanswered. From a corpus of natural data we<br>
simply cannot know whether a given pattern is missing because the<br>
corpus is finite (i.e., it's just a statistical accident that the<br>
pattern isn't attested) or whether there's a genuine reason why the<br>
pattern is not showing up (i.e., its non-attestation is principled).<br>
<br>
When I am writing up my research on Tuparí I always prioritize<br>
non-elicited data (texts, in-person conversation, WhatsApp chats). But<br>
interpreting and analyzing the non-elicited data requires making<br>
reference to acceptability judgments. The prefix (e)tareman- is a<br>
negative polarity item, and it always co-occurs with (and inside the<br>
scope of) a negator morpheme. But the only way I can make this point<br>
is by showing that speakers invariably reject tokens of (e)tareman-<br>
without a licensing negator. Those rejected examples are by definition<br>
not going to be present in any corpus of naturalistic speech, but they<br>
tell me something crucial about what the structure of Tuparí does and<br>
does not allow. If I limit myself to inductively working from a<br>
corpus, fundamental facts about the prefix (e)tareman- and about<br>
negation in Tuparí more broadly will be missed.<br>
<br>
A lot of recent scholarship has made major strides towards improving<br>
the methodology of collecting and interpreting acceptability<br>
judgments. The formal semanticists who work on understudied languages<br>
(here I am thinking of Judith Tonhauser, Lisa Matthewson, Ryan<br>
Bochnak, Amy Rose Deal, Scott AnderBois) are extremely careful about<br>
teasing apart utterances that are rejected because of some<br>
morphosyntactic ill-formedness (i.e., ungrammaticality) versus ones<br>
that are rejected because of semantic or pragmatic oddity. The<br>
important point is that such teasing apart can be done, and the<br>
descriptions and analyses that result from this work are richer than<br>
what would result from a methodology that uses corpus examination or<br>
elicitation only.<br>
<br>
One more example from Tuparí: this language has an obligatory<br>
witnessed/non-witnessed evidential distinction, but the deictic<br>
orientation of the distinction (to the speaker or to the addressee) is<br>
determined via clause type. There is a nuanced set of interactions<br>
between the evidential morphology and the clause-typing morphology,<br>
and it would have been impossible for me to figure out the basics of<br>
those interactions without relying primarily on conversational data<br>
and discourse context. But I still needed to get some acceptability<br>
judgments to ensure that the picture I'd arrived at wasn't overly<br>
biased by the limitations of my corpus. Finding speakers who were<br>
willing to work with me on those judgments wasn't always easy; a fair<br>
amount of metalinguistic awareness was needed. But it was worth it!<br>
The generalizations that I was able to publish were much more solid<br>
than if I had worked exclusively from corpus data. And the methodology<br>
I learned from the Tonhauser/Matthewson/etc crowd was fundamental to<br>
this work.<br>
<br>
The call to work inductively from corpora would have the practical<br>
effect of making certain topics totally inaccessible for research<br>
(control vs raising structures, pied-piping, islands, gaps in<br>
inflectional paradigms, etc) even though large scale acceptability<br>
tasks have shown that these phenomena are "real," i.e., they're not<br>
just in the minds of linguists who are using introspection. Randy's<br>
point that "no other science allows the scientist to make up his or<br>
her own data, and so this is something linguists should give up" is a<br>
straw man argument now that many experimentalist syntacticians use<br>
large-scale acceptability judgments on platforms like Mechanical Turk<br>
to get at speakers' judgments. I think we do a disservice to our<br>
students and to junior scholars if we tell them that the only real<br>
stuff to be studied will be in the corpora that we assemble. Even the<br>
best corpora are finite, whereas L1 speakers' knowledge of their<br>
language is infinitely productive.<br>
<br>
— Adam<br>
_______________________________________________<br>
Lingtyp mailing list<br>
<a href="mailto:Lingtyp@listserv.linguistlist.org">Lingtyp@listserv.linguistlist.org</a><br>
<a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flingtyp&data=05%7C01%7Cjb77%40buffalo.edu%7Cee38548507844ee9c51808dae0a1a736%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638069282736519012%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000%7C%7C%7C&sdata=3t0QO%2F9xKj5YczewvjCLGAN9zGb719qe%2F2IuRsX%2B8ec%3D&reserved=0">https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp</a></span><o:p></o:p></p>
</div>
</div>
</blockquote>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
</div>
</div>
</blockquote>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
</div>
</body>
</html>