<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
  </head>
  <body>
    Dear all,<br>
    <br>
    Thanks to everyone for this super-interesting discussion! I think
    that we really all agree that wellformedness judgements are crucial
    to grammatical description, and they have always been – but as we
    move out from highly frequent inflectional pattern (Christian
    Lehmann's apt example was English <i>*goed</i> vs. <i>went</i>) to
    less frequent patterns of syntax, the judgements get more difficult.
    It may be that many of the judgement data that we find in the
    literature are problematic, but the key role of wellformedness
    judgements as such is not in question, I think. (This is like
    elsewhere in cognition: We can <span style="font-size:11.0pt">"distinguish
      cats from dogs from other critters and things with great
      confidence", as Juergen notes.)</span><br>
    <br>
    In Randy's recent blogpost (<a class="moz-txt-link-freetext" href="https://dlc.hypotheses.org/2825">https://dlc.hypotheses.org/2825</a>), he
    says that asking for acceptability judgements is really <span
      style="letter-spacing: -0.1px;" class=""></span>"testing<span
      style="letter-spacing: -0.1px;" class=""> </span>the<span
      style="letter-spacing: -0.1px;" class=""> </span>person’s<span
      style="letter-spacing: -0.1px;" class=""> </span>ability<span
      style="letter-spacing: -0.1px;" class=""> </span>to<span
      style="letter-spacing: -0.1px;" class=""> </span>imagine<span
      style="letter-spacing: -0.1px;" class=""> </span>a<span
      style="letter-spacing: -0.1px;" class=""> </span>context<span
      style="letter-spacing: -0.1px;" class=""> </span>in<span
      style="letter-spacing: -0.1px;" class=""> </span>which<span
      style="letter-spacing: -0.1px;" class=""> </span>the<span
      style="letter-spacing: -0.1px;" class=""> </span>sentence<span
      style="letter-spacing: -0.1px;" class=""> </span>makes<span
      style="letter-spacing: -0.1px;" class=""> </span>sense" – which I
    think is exactly right (and providing more context makes the task
    easier)! But crucially, many expressions constructed by the linguist
    do not make sense in *any* context (they are grammatically
    ill-formed), and many speakers recognize this immediately. And I
    agree with Neil Myler that it's not the data that are invented, but
    the "experimental stimuli", which is a routine procedure in other
    fields such as psychology or behavioural economics.<br>
    <br>
    I don't think we should ideologize this very basic aspect of
    language description. Adam Singerman mentioned "the
    Tonhauser/Matthewson/etc crowd", and unfortunately, Davis et al.
    (2014) wrote a paper saying that hypothesis-testing is somehow
    characteristic of "C-linguists" as opposed to "D-linguists", which I
    think was very odd (I think they confused descriptive with
    comparative hypothesis-testing, as I noted in my commentary:
    <a class="moz-txt-link-freetext" href="https://muse.jhu.edu/article/563097">https://muse.jhu.edu/article/563097</a>). This odd distinction between
    "C-" and "D-" linguists was originally made by Levinson & Evans
    (2010), and Randy LaPolla, too ideologizes the discussion by putting
    a "structuralist" label on it. I'm not sue that these labels are
    helpful (I think that "we are all structuralists":
    <a class="moz-txt-link-freetext" href="https://dlc.hypotheses.org/2356">https://dlc.hypotheses.org/2356</a>).<br>
    <br>
    Maybe there is one difference in interpreting acceptability
    judgements that still needs further discussion, also in usage-based
    linguistics: Do such judgements tell us about the mental grammars of
    the speakers, or do they merely tell us about the social
    acceptability of linguistic expressions? I think that it's the
    latter (as I said in this blogpost:
    <a class="moz-txt-link-freetext" href="https://dlc.hypotheses.org/2433">https://dlc.hypotheses.org/2433</a>), whereas many linguists seem to
    jumpt to conclusions about mental representations very quickly.<br>
    <br>
    Best,<br>
    Martin<br>
    <br>
    ****************<br>
    Reference<br>
    <br>
    <div class="csl-bib-body" style="line-height: 1.35; margin-left:
      2em; text-indent:-2em;">
      <div class="csl-entry">Davis, Henry & Gillon, Carrie &
        Matthewson, Lisa. 2014. How to investigate linguistic diversity:
        Lessons from the Pacific Northwest. <i>Language</i> 90(4).
        e180–e226.</div>
      <span class="Z3988"
title="url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&rfr_id=info%3Asid%2Fzotero.org%3A2&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=How%20to%20investigate%20linguistic%20diversity%3A%20Lessons%20from%20the%20Pacific%20Northwest&rft.jtitle=Language&rft.volume=90&rft.issue=4&rft.aufirst=Henry&rft.aulast=Davis&rft.au=Henry%20Davis&rft.au=Carrie%20Gillon&rft.au=Lisa%20Matthewson&rft.date=2014&rft.pages=e180-e226&rft.spage=e180&rft.epage=e226"></span></div>
    <div class="csl-bib-body" style="line-height: 1.35; margin-left:
      2em; text-indent:-2em;">
      <div class="csl-entry">Levinson, Stephen C. & Evans, Nicholas.
        2010. Time for a sea-change in linguistics: Response to comments
        on ‘The myth of language universals.’ <i>Lingua</i> 120(12).
        2733–2758.<br>
        <br>
      </div>
      <span class="Z3988"
title="url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&rfr_id=info%3Asid%2Fzotero.org%3A2&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Time%20for%20a%20sea-change%20in%20linguistics%3A%20Response%20to%20comments%20on%20%E2%80%98The%20myth%20of%20language%20universals%E2%80%99&rft.jtitle=Lingua&rft.volume=120&rft.issue=12&rft.aufirst=Stephen%20C.&rft.aulast=Levinson&rft.au=Stephen%20C.%20Levinson&rft.au=Nicholas%20Evans&rft.date=2010&rft.pages=2733%E2%80%932758&rft.spage=2733&rft.epage=2758"></span></div>
    <div class="moz-cite-prefix">Am 11.12.22 um 06:07 schrieb Juergen
      Bohnemeyer:<br>
    </div>
    <blockquote type="cite"
cite="mid:SJ0PR15MB4696E37FC59EFE92BF804357DD1E9@SJ0PR15MB4696.namprd15.prod.outlook.com">
      <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
      <meta name="Generator" content="Microsoft Word 15 (filtered
        medium)">
      <style>@font-face
        {font-family:Helvetica;
        panose-1:0 0 0 0 0 0 0 0 0 0;}@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        font-size:10.0pt;
        font-family:"Calibri",sans-serif;}a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}span.EmailStyle18
        {mso-style-type:personal-reply;
        font-family:"Calibri",sans-serif;
        color:windowtext;}.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}div.WordSection1
        {page:WordSection1;}</style>
      <div class="WordSection1">
        <p class="MsoNormal"><span style="font-size:11.0pt">Dear all --
            I agree with Adam and Neil. I’m attaching rough proofs of my
            chapter in the upcoming Handbook of Cognitive Semantics,
            which is in production with Brill. The chapter surveys
            sources of data for *<b>empirical</b>* research in
            linguistics (with special emphasis on semantic research; but
            I argue that the sources of evidence we have at our disposal
            are fundamentally the same across languages). It discusses
            what we can and cannot get out of corpora and spontaneous
            observation, attempts a typology of elicitation techniques,
            and proposes best practices for their implementation.
            <o:p></o:p></span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt">(The text is
            also largely identical with Ch5 of my book
            <i>Semantic Research</i>, which is under contract with CUP
            and hopefully will see the light of day in 2023 or 2024 at
            the latest. That book, on which I’ve been laboring for a
            decade (much of it in collaboration with David Wilkins), is
            a stab at a textbook-cum-handbook for semantic research as
            an empirical science.)<o:p></o:p></span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt">Best --
            Juergen<o:p></o:p></span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
        <div>
          <div>
            <div>
              <p class="MsoNormal"><span
                  style="font-size:9.0pt;font-family:Helvetica;color:black">Juergen
                  Bohnemeyer (He/Him)<br>
                  Professor, Department of Linguistics<br>
                  University at Buffalo <br>
                  <br>
                  Office: 642 Baldy Hall, UB North Campus<br>
                  Mailing address: 609 Baldy Hall, Buffalo, NY 14260 <br>
                  Phone: (716) 645 0127 <br>
                  Fax: (716) 645 3825<br>
                  Email: </span><span style="font-size:11.0pt"><a
                    href="mailto:jb77@buffalo.edu"
                    title="mailto:jb77@buffalo.edu"
                    moz-do-not-send="true"><span
                      style="font-size:9.0pt;font-family:Helvetica;color:#0078D4">jb77@buffalo.edu</span></a></span><span
style="font-size:9.0pt;font-family:Helvetica;color:black"><br>
                  Web: </span><span style="font-size:11.0pt"><a
                    href="http://www.acsu.buffalo.edu/~jb77/"
                    title="http://www.acsu.buffalo.edu/~jb77/"
                    moz-do-not-send="true"><span
                      style="font-size:9.0pt;font-family:Helvetica;color:#0563C1">http://www.acsu.buffalo.edu/~jb77/</span></a></span><span
style="font-size:9.0pt;font-family:Helvetica;color:black"> <br>
                  <br>
                </span><span style="font-size:11.0pt;color:black">Office
                  hours Tu/Th 3:30-4:30pm in 642 Baldy or via Zoom
                  (Meeting ID 585 520 2411; Passcode Hoorheh) </span><span
style="font-size:9.0pt;font-family:Helvetica;color:black"><br>
                  <br>
                  There’s A Crack In Everything - That’s How The Light
                  Gets In <br>
                  (Leonard Cohen)  </span><span style="font-size:11.0pt"><o:p></o:p></span></p>
              <p class="MsoNormal"><span style="font-size:11.0pt">-- <o:p></o:p></span></p>
            </div>
          </div>
        </div>
        <p class="MsoNormal"><span style="font-size:11.0pt" lang="RU"><o:p> </o:p></span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt" lang="RU"><o:p> </o:p></span></p>
        <div style="border:none;border-top:solid #B5C4DF
          1.0pt;padding:3.0pt 0in 0in 0in">
          <p class="MsoNormal" style="margin-bottom:12.0pt"><b><span
                style="font-size:12.0pt;color:black">From:
              </span></b><span style="font-size:12.0pt;color:black">Lingtyp
              <a class="moz-txt-link-rfc2396E" href="mailto:lingtyp-bounces@listserv.linguistlist.org"><lingtyp-bounces@listserv.linguistlist.org></a> on
              behalf of Neil Myler <a class="moz-txt-link-rfc2396E" href="mailto:myler@bu.edu"><myler@bu.edu></a><br>
              <b>Date: </b>Saturday, December 10, 2022 at 10:21 PM<br>
              <b>To: </b><a class="moz-txt-link-abbreviated" href="mailto:LINGTYP@listserv.linguistlist.org">LINGTYP@listserv.linguistlist.org</a>
              <a class="moz-txt-link-rfc2396E" href="mailto:LINGTYP@listserv.linguistlist.org"><LINGTYP@listserv.linguistlist.org></a><br>
              <b>Subject: </b>Re: [Lingtyp] spectrograms in linguistic
              description and for language comparison<o:p></o:p></span></p>
        </div>
        <div>
          <p class="MsoNormal"><span style="font-size:11.0pt">I agree
              with everything here, with one addendum: it's a strawman
              even if you do ignore more formal judgment experiments. 
              The examples are invented, but each data point is a
              *pairing* of an example and a judgment. Since the
              judgments aren't invented (except in cases of misconduct),
              it's wrong to say that the data are.<o:p></o:p></span></p>
          <div>
            <p class="MsoNormal"><span style="font-size:11.0pt">Neil<o:p></o:p></span></p>
            <div>
              <div>
                <p class="MsoNormal"><span style="font-size:11.0pt">On
                    Sat, Dec 10, 2022, 10:05 PM Adam Singerman <<a
                      href="mailto:adamsingerman@gmail.com"
                      target="_blank" moz-do-not-send="true"
                      class="moz-txt-link-freetext">adamsingerman@gmail.com</a>>
                    wrote:<o:p></o:p></span></p>
              </div>
              <blockquote style="border:none;border-left:solid #CCCCCC
                1.0pt;padding:0in 0in 0in
                6.0pt;margin-left:4.8pt;margin-right:0in">
                <p class="MsoNormal"><span style="font-size:11.0pt">I
                    think Randy is wrong (sorry if this comes across as
                    blunt) and so I<br>
                    am writing, on a Saturday night no less, to voice a
                    different view.<br>
                    <br>
                    Working inductively from a corpus is great, but no
                    corpus is ever<br>
                    going to be large enough to fully represent a given
                    language's<br>
                    grammatical possibilities. If we limit ourselves to
                    working<br>
                    inductively from corpora then many basic questions
                    about the languages<br>
                    we research will go unanswered. From a corpus of
                    natural data we<br>
                    simply cannot know whether a given pattern is
                    missing because the<br>
                    corpus is finite (i.e., it's just a statistical
                    accident that the<br>
                    pattern isn't attested) or whether there's a genuine
                    reason why the<br>
                    pattern is not showing up (i.e., its non-attestation
                    is principled).<br>
                    <br>
                    When I am writing up my research on Tuparí I always
                    prioritize<br>
                    non-elicited data (texts, in-person conversation,
                    WhatsApp chats). But<br>
                    interpreting and analyzing the non-elicited data
                    requires making<br>
                    reference to acceptability judgments. The prefix
                    (e)tareman- is a<br>
                    negative polarity item, and it always co-occurs with
                    (and inside the<br>
                    scope of) a negator morpheme. But the only way I can
                    make this point<br>
                    is by showing that speakers invariably reject tokens
                    of (e)tareman-<br>
                    without a licensing negator. Those rejected examples
                    are by definition<br>
                    not going to be present in any corpus of
                    naturalistic speech, but they<br>
                    tell me something crucial about what the structure
                    of Tuparí does and<br>
                    does not allow. If I limit myself to inductively
                    working from a<br>
                    corpus, fundamental facts about the prefix
                    (e)tareman- and about<br>
                    negation in Tuparí more broadly will be missed.<br>
                    <br>
                    A lot of recent scholarship has made major strides
                    towards improving<br>
                    the methodology of collecting and interpreting
                    acceptability<br>
                    judgments. The formal semanticists who work on
                    understudied languages<br>
                    (here I am thinking of Judith Tonhauser, Lisa
                    Matthewson, Ryan<br>
                    Bochnak, Amy Rose Deal, Scott AnderBois) are
                    extremely careful about<br>
                    teasing apart utterances that are rejected because
                    of some<br>
                    morphosyntactic ill-formedness (i.e.,
                    ungrammaticality) versus ones<br>
                    that are rejected because of semantic or pragmatic
                    oddity. The<br>
                    important point is that such teasing apart can be
                    done, and the<br>
                    descriptions and analyses that result from this work
                    are richer than<br>
                    what would result from a methodology that uses
                    corpus examination or<br>
                    elicitation only.<br>
                    <br>
                    One more example from Tuparí: this language has an
                    obligatory<br>
                    witnessed/non-witnessed evidential distinction, but
                    the deictic<br>
                    orientation of the distinction (to the speaker or to
                    the addressee) is<br>
                    determined via clause type. There is a nuanced set
                    of interactions<br>
                    between the evidential morphology and the
                    clause-typing morphology,<br>
                    and it would have been impossible for me to figure
                    out the basics of<br>
                    those interactions without relying primarily on
                    conversational data<br>
                    and discourse context. But I still needed to get
                    some acceptability<br>
                    judgments to ensure that the picture I'd arrived at
                    wasn't overly<br>
                    biased by the limitations of my corpus. Finding
                    speakers who were<br>
                    willing to work with me on those judgments wasn't
                    always easy; a fair<br>
                    amount of metalinguistic awareness was needed. But
                    it was worth it!<br>
                    The generalizations that I was able to publish were
                    much more solid<br>
                    than if I had worked exclusively from corpus data.
                    And the methodology<br>
                    I learned from the Tonhauser/Matthewson/etc crowd
                    was fundamental to<br>
                    this work.<br>
                    <br>
                    The call to work inductively from corpora would have
                    the practical<br>
                    effect of making certain topics totally inaccessible
                    for research<br>
                    (control vs raising structures, pied-piping,
                    islands, gaps in<br>
                    inflectional paradigms, etc) even though large scale
                    acceptability<br>
                    tasks have shown that these phenomena are "real,"
                    i.e., they're not<br>
                    just in the minds of linguists who are using
                    introspection. Randy's<br>
                    point that "no other science allows the scientist to
                    make up his or<br>
                    her own data, and so this is something linguists
                    should give up" is a<br>
                    straw man argument now that many experimentalist
                    syntacticians use<br>
                    large-scale acceptability judgments on platforms
                    like Mechanical Turk<br>
                    to get at speakers' judgments. I think we do a
                    disservice to our<br>
                    students and to junior scholars if we tell them that
                    the only real<br>
                    stuff to be studied will be in the corpora that we
                    assemble. Even the<br>
                    best corpora are finite, whereas L1 speakers'
                    knowledge of their<br>
                    language is infinitely productive.<br>
                    <br>
                    — Adam<br>
                    _______________________________________________<br>
                    Lingtyp mailing list<br>
                    <a href="mailto:Lingtyp@listserv.linguistlist.org"
                      target="_blank" moz-do-not-send="true"
                      class="moz-txt-link-freetext">Lingtyp@listserv.linguistlist.org</a><br>
                    <a
href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flingtyp&data=05%7C01%7Cjb77%40buffalo.edu%7C243aea832f624794e4da08dadb26ac49%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638063256872055833%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=RktZzmPhH9YRxywpcxrcF8eeMu3t0v5J%2FxNjCPJcx6s%3D&reserved=0"
                      target="_blank" moz-do-not-send="true">https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp</a><o:p></o:p></span></p>
              </blockquote>
            </div>
          </div>
        </div>
      </div>
      <br>
      <fieldset class="moz-mime-attachment-header"></fieldset>
      <pre class="moz-quote-pre" wrap="">_______________________________________________
Lingtyp mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Lingtyp@listserv.linguistlist.org">Lingtyp@listserv.linguistlist.org</a>
<a class="moz-txt-link-freetext" href="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp">https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp</a>
</pre>
    </blockquote>
    <br>
    <pre class="moz-signature" cols="72">-- 
Martin Haspelmath
Max Planck Institute for Evolutionary Anthropology
Deutscher Platz 6
D-04103 Leipzig
<a class="moz-txt-link-freetext" href="https://www.eva.mpg.de/linguistic-and-cultural-evolution/staff/martin-haspelmath/">https://www.eva.mpg.de/linguistic-and-cultural-evolution/staff/martin-haspelmath/</a></pre>
  </body>
</html>