<!DOCTYPE html>
<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
  </head>
  <body>
    Dear all,<br>
    <br>
    Thanks to Juegen Bohnemeyer for this thoughtful contribution. <br>
    <br>
    A couple of remarks.<br>
    <br>
    About the phylogenies we used in Jäger & Wahle 2021: We actually
    did not use the world tree. The reason for this choice was the fact
    that branch lengths are an important aspect of the phylogenetic
    comparative method. Branch lengths above the family level in the
    world tree (the one from
    <a class="moz-txt-link-freetext" href="https://www.nature.com/articles/sdata2018189">https://www.nature.com/articles/sdata2018189</a>) are so obviously too
    short that this seriously undermines the method. (Btw, the
    phylogenetic comparative method is fairly robust regarding errors in
    the topology of the phylogenies, but branch lengths are important).
    <br>
    <br>
    Instead of using the world tree, we inferred a posterior sample of
    phylogenies for each Glottolog family. So like Dunn et al., we used
    separate trees for different families. The main technical innovation
    - besides the fact that we used dozens of families instead of four -
    was that we tested using the same Markov Chain parameters across
    families, where Dunn et al. fitted a separate model for each family.
    Our's is effectively an implementation of Maslova's ideas.<br>
    <br>
    In the extreme case where each family contains just one language in
    a sample, the Jäger & Wahle method is actually equivalent to
    stratified sampling where one language is sampled from each family.
    So our method is not so much an alternative to stratified sampling
    but an extension that allows to use all languages for which you have
    data.<br>
    <br>
    Regarding the phylogenies derived from ASJP: Juergen, you say,
    "Notably, Jäger’s method takes into account systematic sound
    correspondences, so to my uninitiated eye, it looks like an
    unassailable computational implementation of the
    historic-comparative method." I am afraid you are doing me too much
    honor here. My method takes recurring sound correspondences into
    account, but it is still a far cry from a full implementation of the
    historic-comparative method. However, in Jäger 2018
    (<a class="moz-txt-link-freetext" href="https://www.nature.com/articles/sdata2018189">https://www.nature.com/articles/sdata2018189</a>), I ran a lot of tests
    demonstrating that the quality of the phylogenies obtained this way
    is fairly adequate (using Glottolog as gold standard) despite the
    method's conceptual shortcomings.<br>
    <br>
    I agree with Juergen that there is no consensus approach for the
    phylogenetic comparative method. I am not sure though that such a
    consensus is possible or even desirable. A linguistic result - say,
    the universal correlation between adposition-noun order and
    verb-object order - is trustworthy if several different methods
    confirm it. These methods may include stratified sampling with
    different sampling protocols, family bias estimation, phylogenetic
    comparative methods with different trees and different priors,
    perhaps others. If some approaches confirm it and others do not,
    that is a reason to dig deeper.<br>
    <br>
    Best, Gerhard<br>
    <br>
    <br>
    <br>
    <div class="moz-cite-prefix">On 11/6/23 00:22, Juergen Bohnemeyer
      wrote:<br>
    </div>
    <blockquote type="cite"
cite="mid:SJ0PR15MB46966D0B2BE32910582F1528DDA4A@SJ0PR15MB4696.namprd15.prod.outlook.com">
      <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
      <meta name="Generator"
        content="Microsoft Word 15 (filtered medium)">
      <style>@font-face
        {font-family:Helvetica;
        panose-1:0 0 0 0 0 0 0 0 0 0;}@font-face
        {font-family:Wingdings;
        panose-1:5 0 0 0 0 0 0 0 0 0;}@font-face
        {font-family:SimSun;
        panose-1:2 1 6 0 3 1 1 1 1 1;}@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}@font-face
        {font-family:"CMU Serif";
        panose-1:2 0 6 3 0 0 0 0 0 0;}@font-face
        {font-family:"Times New Roman \(Body CS\)";
        panose-1:2 11 6 4 2 2 2 2 2 4;}@font-face
        {font-family:"\@SimSun";
        panose-1:2 1 6 0 3 1 1 1 1 1;}p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        font-size:10.0pt;
        font-family:"Calibri",sans-serif;}a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
        {mso-style-priority:34;
        margin-top:0in;
        margin-right:0in;
        margin-bottom:0in;
        margin-left:.5in;
        font-size:10.0pt;
        font-family:"Calibri",sans-serif;}span.EmailStyle19
        {mso-style-type:personal-reply;
        font-family:"CMU Serif";
        color:windowtext;
        font-weight:normal;
        font-style:normal;}.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;
        mso-ligatures:none;}div.WordSection1
        {page:WordSection1;}ol
        {margin-bottom:0in;}ul
        {margin-bottom:0in;}</style>
      <div class="WordSection1">
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">Dear
            all – Two points:<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">First,
            this thread has served to alleviate somewhat my concerns
            about the review processes used by the generic science
            journals – mostly through offline one-on-one communications
            I’ve received. Thanks!<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">Secondly,
            I’d like to, maybe gently push back, or may be just
            elaborate, on the contrast Simon Greenhill draws between
            phylogenetic inference and the “overly simplistic
            correlation methods”, as he calls them. In the process, I’d
            like to try to clarify what in my mind are the strengths and
            weaknesses of each (type of) method. And I’ll pose a couple
            of challenge questions to the detractors of
            phylogenetic/coevolutionary testing and one for their
            proponents, and I hope all of these questions won’t go
            unanswered </span>
          <span
style="font-size:12.0pt;font-family:"Apple Color Emoji"">😊</span><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">Before
            I do that, let me flag my interest in this: I’m currently in
            Leipzig spending a sabbatical with Russell Gray’s Department
            of Linguistic and Cultural Evolution (</span><span
style="font-size:12.0pt;font-family:"Cambria Math",serif;mso-fareast-language:ZH-CN">≈
          </span><span
            style="font-size:12.0pt;font-family:"CMU Serif"">the
            Grambank team), with the principal objective of figuring out
            how to capitalize the most on Grambank for my project. The
            project tracks the typological distribution of functional
            expressions and attempts to account for it (the
            distribution) in terms of an evolutionary process that makes
            grammaticalization sensitive to functional pressures. Since
            coming here, I’ve been struggling with the question whether
            I should adopt stratified sampling, a phylogenetic method
            (I’m not going to use the term ‘co-evolution’ here, because
            I’m at this stage of the project just looking at the
            emergence of individual functional expressions), or both. So
            I’m airing my thoughts very much in hopes of eliciting
            feedback.</span><span
style="font-size:12.0pt;font-family:"Cambria Math",serif;mso-fareast-language:ZH-CN"><o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">So.
            Let’s start with a very basic reminder: We cannot observe
            causality, we can only infer it, as David Hume taught us
            (i.e., humanity) nearly three centuries ago. The difference
            between the methods at issue is one between synchronic
            inference and diachronic inference.
            <o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">Now,
            before I go any further, a big disclaimer: I think I
            understand at the most abstract level the ideas underlying
            phylogenetic inference. But I don’t understand the
            statistical “sausage making” involved. Meaning I have many
            basic questions about the algorithms used for phylogenetic
            inference.
            <o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">With
            that said: Imagine an ideal world in which we had perfect
            and reliable knowledge about the complete phylogeny of every
            extant human language and every language that ever existed.
            It seems indisputable to me that in this world, we would
            test hypotheses about causal relations between two
            linguistic features not merely by looking at their
            synchronic distributions, but also by looking at whether the
            two features tend to co-evolve across phylogenies or whether
            they appear to have merely accidentally “travelled together”
            through time in a few families.
            <o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">So
            this would be my first question to the detractors of
            phylogenetic/dynamic (Maslova’s term)/co-evolutionary tests
            for typological generalizations: do you disagree that if we
            had that perfect knowledge, we would as a matter of course
            take into account the phylogenetic perspective?<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">Now,
            in the real world, our knowledge of phylogenies is itself
            largely inferred from synchronic distributions. Crucially,
            though, these inferences are based on data that may overlap
            with the typological patterns of interest, but is largely
            independent of it. To make this more concrete:<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">Jäger
              & Wahle (2021) use the world tree inferred from
              <a href="https://asjp.clld.org/" moz-do-not-send="true">ASJP</a>
              cognate data presented in Jäger (2018). Notably, Jäger’s
              method takes into account systematic sound
              correspondences, so to my uninitiated eye, it looks like
              an unassailable computational implementation of the
              historic-comparative method. Of course, the data the
              analysis is based on is another matter.<o:p></o:p></span></li>
        </ul>
        <p class="MsoListParagraph"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">Verkerk
              et al. (ms.), which Martin cites in the post that started
              the thread, rely on Glottolog, which Jäger & Wahle
              also used to validate their inferred world tree. In my
              understanding, Glottolog is sourced by a compilation of
              the best available evidence for phylogenetic relations,
              principally curated by its lead author, Harald
              Hammarström.<o:p></o:p></span></li>
        </ul>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">Two
            more points to keep in mind here:<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">Stratified
              sampling of course also takes into account phylogenetic
              and areal information. So the two types of approaches have
              really more in common than Simon gives them credit for in
              his reply, I think. However, stratified sampling uses
              phylogenetic and areal information in first approximation
              only “negatively”, as it were: by systematically removing
              observations from the analysis that are suspected to be
              “tainted” by areal or genealogical dependencies. <o:p></o:p></span></li>
        </ul>
        <p class="MsoListParagraph"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">The
              distance of our real world from that ideal world in which
              we have perfect knowledge of phylogenetic and areal
              relations is in my (again, uninitiated, in the sense that
              I’m not a historical linguist) mind pretty darn great.
              However, the phylogenetic statistics algorithms use
              “forests” of alternative phylogenetic trees weighted for
              the confidence the field (or the analysts) has/have in
              them, factored into the tests as Bayesian priors. In this
              sense, the use of phylogenetic information is more
              sophisticated in the dynamic tests than in anything based
              on stratified sampling that I have seen.<o:p></o:p></span></li>
        </ul>
        <p class="MsoListParagraph"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">So
            with all this assembled, here’s my semi-informed take on the
            pros and cons of each approach:<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">Stratified
              sampling (I’m going to start with the method that has been
              practiced the longest)<o:p></o:p></span></li>
        </ul>
        <p class="MsoListParagraph"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <ul style="margin-top:0in" type="circle">
            <li class="MsoListParagraph"
              style="margin-left:0in;mso-list:l0 level2 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Pro<o:p></o:p></span></li>
            <ul style="margin-top:0in" type="square">
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Technologically
                  and conceptually simple, easy to implement by anybody
                  with the most basic level of training in inferential
                  statistics, and transparent for the analyst to track
                  and understand the effects of their decisions.<o:p></o:p></span></li>
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Statistically
                  conservative, in the specific sense that it minimizes
                  the odds of false positives, i.e., of support for
                  generalizations for which the evidence isn’t really
                  there. (One might think that Dunn et al. 2011 present
                  counterevidence to this. But the apparent false
                  positives pointed out by Dunn et al. turn out to be
                  true positives after all, now that Jäger & Wahle
                  and Verkerk et al. have replicated their analysis with
                  a larger sample.)<o:p></o:p></span></li>
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Validity of
                  testing depends only on one key assumption – the big
                  one: independence of observations. (Detractors of
                  stratified sampling might however argue that it is
                  impossible to ensure that this assumption is warranted
                  in any decent-sized sample.)<o:p></o:p></span></li>
            </ul>
            <li class="MsoListParagraph"
              style="margin-left:0in;mso-list:l0 level2 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Con<o:p></o:p></span></li>
            <ul style="margin-top:0in" type="square">
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">There has
                  never been a consensus on an optimal sampling
                  algorithm that balances minimization of areal and
                  genealogical biases with retaining enough statistical
                  power to detect the patterns of interest. With such a
                  consensus absent after 45 years of some of the
                  smartest minds in the field applying themselves to the
                  problem, maybe there is no optimal solution to be had?<o:p></o:p></span></li>
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Based on
                  information reduction (exclusion of observations);
                  does not take into account phylogenetic information to
                  the fullest measure of what can be concluded from it.<o:p></o:p></span></li>
            </ul>
          </ul>
        </ul>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">Dynamic/phylogenetic/co-evolutionary
              inference<o:p></o:p></span></li>
        </ul>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <ul style="margin-top:0in" type="circle">
            <li class="MsoListParagraph"
              style="margin-left:0in;mso-list:l0 level2 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Pro<o:p></o:p></span></li>
            <ul style="margin-top:0in" type="square">
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Utilizes the
                  best available phylogenetic information “for all it’s
                  worth”, directly examining whether features tend to
                  coevolve across lineages or only in particular
                  lineages. (The same cannot currently be said about
                  areal information. However, Verkerk et al. (ms.) use
                  mixed models regression – which is feasible based on
                  Grambank thanks to its relatively low percentage of
                  missing observations – to factor in possible areal
                  effects.)<o:p></o:p></span></li>
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Does not
                  depend on the assumption of independence of
                  observation.<o:p></o:p></span></li>
            </ul>
          </ul>
        </ul>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <ul style="margin-top:0in" type="circle">
            <li class="MsoListParagraph"
              style="margin-left:0in;mso-list:l0 level2 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Con<o:p></o:p></span></li>
            <ul style="margin-top:0in" type="square">
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Very steep
                  training demands both on the conceptual and on the
                  technological side. Currently only a handful of
                  specialists are able to perform such analyses. In
                  addition, the computational complexities are so
                  demanding as to render the analyses effectively opaque
                  for all but those specialists, raising questions about
                  the validation of such tests. (Gerhard Jäger’s
                  comparison to carbon dating in archeology is apt here
                  as well, I think: while phylogenetic analysis will
                  likely become more standard in the future, it may be
                  the case that typologists will always have to rely on
                  specialists for actually performing such tests, just
                  like archeologists rely on physicists or lab
                  technicians for performing carbon dating. I think </span>
                <span
style="font-size:12.0pt;font-family:"Apple Color Emoji"">😉</span><span
style="font-size:12.0pt;font-family:"CMU Serif"">)<o:p></o:p></span></li>
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Reliability
                  of phylogenetic inference seems to depend on the size
                  of the phylogenies involved: the smaller the family,
                  the higher the uncertainty in the absence of actual
                  historic data.<o:p></o:p></span></li>
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">As with
                  stratified sampling, there does not yet appear to be a
                  consensus approach.
                </span><span
style="font-size:12.0pt;font-family:"CMU Serif"" lang="DE">Jäger
                  & Wahle (2021) and Verkerk et al.
                </span><span
style="font-size:12.0pt;font-family:"CMU Serif"">(ms.) use
                  different algorithms (implemented in different
                  software packages), make different assumptions, and
                  their tests are conceptually quite distinct.
                  <o:p></o:p></span></li>
              <li class="MsoListParagraph"
                style="margin-left:0in;mso-list:l0 level3 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Outcomes
                  depend on many assumptions which are surrounded by
                  varying degrees of uncertainty.
                  <o:p></o:p></span></li>
            </ul>
          </ul>
        </ul>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">So
            at long last, here are my three challenge questions:<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <ul style="margin-top:0in" type="disc">
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">For
              the detractors of phylogenetic inference:
              <o:p></o:p></span></li>
          <ul style="margin-top:0in" type="circle">
            <li class="MsoListParagraph"
              style="margin-left:0in;mso-list:l0 level2 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">(Repeated
                from above to make sure this doesn’t get lost) Do you
                (dis)agree that if we had complete and robust knowledge
                of the phylogenies of the extant languages, we would
                perform phylogenetic tests as a matter of course when
                attempting to validate typological generalizations?<o:p></o:p></span></li>
            <li class="MsoListParagraph"
              style="margin-left:0in;mso-list:l0 level2 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Do you
                dispute that since phylogenetic testing of typological
                generalizations has become possible, typology as a field
                cannot simply ignore it and move on without it? <o:p></o:p></span></li>
          </ul>
          <li class="MsoListParagraph"
            style="margin-left:0in;mso-list:l0 level1 lfo1"><span
              style="font-size:12.0pt;font-family:"CMU Serif"">For
              the proponents of phylogenetic inference:
              <o:p></o:p></span></li>
          <ul style="margin-top:0in" type="circle">
            <li class="MsoListParagraph"
              style="margin-left:0in;mso-list:l0 level2 lfo1"><span
style="font-size:12.0pt;font-family:"CMU Serif"">Do you
                dispute that stratified sampling is statistically
                conservative and has the advantage of transparency and
                robustness vis-à-vis the underlying assumptions, so it
                is likely not a method that typology will simply move
                beyond and disregard in the future?<o:p></o:p></span></li>
          </ul>
        </ul>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif"">And
            there it. Apologies for the overlong post! – Cheers –
            Juergen<o:p></o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""><o:p> </o:p></span></p>
        <div>
          <div>
            <p class="MsoNormal"><span
style="font-size:9.0pt;font-family:Helvetica;color:black">Juergen
                Bohnemeyer (He/Him)<br>
                Professor, Department of Linguistics<br>
                University at Buffalo <br>
                <br>
                Office: 642 Baldy Hall, UB North Campus<br>
                Mailing address: 609 Baldy Hall, Buffalo, NY 14260 <br>
                Phone: (716) 645 0127 <br>
                Fax: (716) 645 3825<br>
                Email: </span><span style="font-size:11.0pt"><a
                  href="mailto:jb77@buffalo.edu"
                  title="mailto:jb77@buffalo.edu" moz-do-not-send="true"><span
style="font-size:9.0pt;font-family:Helvetica;color:#0078D4">jb77@buffalo.edu</span></a></span><span
style="font-size:9.0pt;font-family:Helvetica;color:black"><br>
                Web: </span><span style="font-size:11.0pt"><a
                  href="http://www.acsu.buffalo.edu/~jb77/"
                  title="http://www.acsu.buffalo.edu/~jb77/"
                  moz-do-not-send="true"><span
style="font-size:9.0pt;font-family:Helvetica;color:#0563C1">http://www.acsu.buffalo.edu/~jb77/</span></a></span><span
style="font-size:9.0pt;font-family:Helvetica;color:black"> <br>
                <br>
              </span><span style="font-size:11.0pt;color:black">Office
                hours Tu/Th 3:30-4:30pm in 642 Baldy or via Zoom
                (Meeting ID 585 520 2411; Passcode Hoorheh) </span><span
style="font-size:9.0pt;font-family:Helvetica;color:black"><br>
                <br>
                There’s A Crack In Everything - That’s How The Light
                Gets In <br>
                (Leonard Cohen)  </span><span style="font-size:11.0pt"><o:p></o:p></span></p>
            <p class="MsoNormal"><span style="font-size:11.0pt">-- <o:p></o:p></span></p>
          </div>
        </div>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""
            lang="DE"><o:p> </o:p></span></p>
        <p class="MsoNormal"><span
            style="font-size:12.0pt;font-family:"CMU Serif""
            lang="DE"><o:p> </o:p></span></p>
        <div id="mail-editor-reference-message-container">
          <div>
            <div
style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
              <p class="MsoNormal"
style="mso-margin-top-alt:0in;margin-right:0in;margin-bottom:12.0pt;margin-left:.5in">
                <b><span style="font-size:12.0pt;color:black">From: </span></b><span
                  style="font-size:12.0pt;color:black">Lingtyp
                  <a class="moz-txt-link-rfc2396E" href="mailto:lingtyp-bounces@listserv.linguistlist.org"><lingtyp-bounces@listserv.linguistlist.org></a> on
                  behalf of Simon Greenhill <a class="moz-txt-link-rfc2396E" href="mailto:simon@simon.net.nz"><simon@simon.net.nz></a><br>
                  <b>Date: </b>Saturday, November 4, 2023 at 13:39<br>
                  <b>To: </b><a class="moz-txt-link-abbreviated" href="mailto:lingtyp@listserv.linguistlist.org">lingtyp@listserv.linguistlist.org</a>
                  <a class="moz-txt-link-rfc2396E" href="mailto:lingtyp@listserv.linguistlist.org"><lingtyp@listserv.linguistlist.org></a><br>
                  <b>Cc: </b>Dunn Michael
                  <a class="moz-txt-link-rfc2396E" href="mailto:michael.dunn@lingfil.uu.se"><michael.dunn@lingfil.uu.se></a>, Russell Gray
                  <a class="moz-txt-link-rfc2396E" href="mailto:russell_gray@eva.mpg.de"><russell_gray@eva.mpg.de></a><br>
                  <b>Subject: </b>[Lingtyp] Greenbergian word order
                  universals: confirmed after all<o:p></o:p></span></p>
            </div>
            <div>
              <p class="MsoNormal" style="margin-left:.5in"><span
                  style="font-size:11.0pt">Colleagues, Martin, everyone
                  else<br>
                  <br>
                  Thank you for sharing your perspectives on our 2011
                  paper. It's nice to see this still be discussed more
                  than a decade later. However, I would like to express
                  my concerns and disagreements with some of the points
                  you've raised.<br>
                  <br>
                  I'm very proud of the Dunn et al. paper for a number
                  of reasons. I'll name three.<br>
                  <br>
                  First, the paper showed that the overly simplistic
                  correlation methods that had been used to make
                  sweeping global claims were problematic. We need
                  better tools to tackle these questions, and the tools
                  we applied were one part of a better toolkit.
                  <br>
                  <br>
                  Second, it highlighted the need to understand language
                  systems in a diachronic manner. We cannot decouple
                  language typology from language history, instead we
                  need to understand how these are entangled.<br>
                  <br>
                  Third, it emphasised the way that particular
                  configurations of languages can be arrived at via
                  different routes in different families at different
                  times. This enables a much richer understanding of how
                  these particular generalisations have arisen.<br>
                  <br>
                  Have Jäger and Wähle disproved any of that? no. Maybe
                  these were not completely novel insights (Maslova’s
                  work has been mentioned which touches on a few of
                  these issues too, for example), but these ideas did
                  appear to crystallise in this paper.<br>
                  <br>
                  While it's certainly important to revisit and
                  reevaluate research findings to ensure accuracy, it is
                  crucial to approach these discussions with an
                  understanding of the scientific process. Scientific
                  paradigms evolve over time, and different studies may
                  yield varying results due to changes in methodologies,
                  data sources, and sample sizes. This doesn't
                  necessarily imply that the initial research was flawed
                  or that the authors were neglectful. In particular,
                  the tools, the data, and our understanding of how
                  languages change are substantially further advanced
                  than they were a decade ago (or, I know that *my*
                  understanding of these things is more advanced now, at
                  least). And these other papers that you mention -- and
                  many other studies -- have built upon the work we did
                  in 2011. <br>
                  <br>
                  Furthermore, I would like to caution against drawing
                  overly broad conclusions about the quality of research
                  published in high-prestige journals. The peer-review
                  process in such journals is rigorous, and while they
                  may occasionally feature sensationalist claims, this
                  doesn't diminish the overall value they contribute to
                  the scientific community. For the record, of the
                  handful of papers I've had in these journals *all*
                  have been reviewed by people I would infer to be
                  linguists based on the comments and issues they
                  raised. We did not send these papers to these journals
                  to avoid linguistic reviewers but, frankly, I've had
                  better reviews at these journals than at prominent
                  linguistics journals (and by "better" I mean more
                  rigorous, more thorough, and more critical).<br>
                  <br>
                  Finally, linguistic typology is an ongoing and
                  evolving field trying to tackle very difficult
                  problems. We need all the tools and approaches we can
                  get to solve these problems across all the levels that
                  languages operate on (from detailed language internal
                  analyses to high-level global analyses). Rather than
                  looking back and gate-keeping what is 'real’ typology
                  published in ‘real’ linguistics journals, we should
                  shift our focus forward. Typology can be a welcoming
                  and diverse community that embraces a wide range of
                  approaches, analyses, and styles. Let's look outward
                  to foster connections with other fields and
                  disciplines.
                  <br>
                  <br>
                  After all, why shouldn't linguistic typology work be
                  everywhere in science? it's certainly interesting
                  enough.
                  <br>
                  <br>
                  Simon<br>
                  <br>
                  Dr. Simon J. Greenhill<br>
                  <br>
                  Associate Professor <br>
                  <br>
                  Te Kura Mātauranga Koiora | School of Biological
                  Sciences<br>
                  Te Whare Wānanga o Tāmaki Makaurau | University of
                  Auckland<br>
                  <br>
                  Abteilung für Sprach- und Kulturevolution | Department
                  of Linguistic and Cultural Evolution<br>
                  Max-Planck-Institut für Evolutionäre Anthropologie |
                  Max Planck Institute for Evolutionary Anthropology<br>
                  <br>
                  _______________________________________________<br>
                  Lingtyp mailing list<br>
                  <a class="moz-txt-link-abbreviated" href="mailto:Lingtyp@listserv.linguistlist.org">Lingtyp@listserv.linguistlist.org</a><br>
                  <a
href="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp"
                    moz-do-not-send="true">https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flingtyp&data=05%7C01%7Cjb77%40buffalo.edu%7Cbe1181a9879e434ccc0308dbdd3319c4%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638346983806627973%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=OgjDO%2FGow%2FfFmxOIvmrWNNq1nwcsZaLzI5OMnoGjBXg%3D&reserved=0</a><o:p></o:p></span></p>
            </div>
          </div>
        </div>
      </div>
      <br>
      <fieldset class="moz-mime-attachment-header"></fieldset>
      <pre class="moz-quote-pre" wrap="">_______________________________________________
Lingtyp mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Lingtyp@listserv.linguistlist.org">Lingtyp@listserv.linguistlist.org</a>
<a class="moz-txt-link-freetext" href="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp">https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp</a>
</pre>
    </blockquote>
    <br>
    <pre class="moz-signature" cols="72">-- 
Prof. Dr. Gerhard Jäger
Universität Tübingen
Seminar für Sprachwissenschaft
Tel.: +49-7071-29-77302
<a class="moz-txt-link-freetext" href="http://www.sfs.uni-tuebingen.de/~gjaeger/">http://www.sfs.uni-tuebingen.de/~gjaeger/</a></pre>
  </body>
</html>