[Lingtyp] Concerns about U.S. policies and linguistic archives

Emily M. Bender ebender at uw.edu
Tue Feb 4 16:25:22 UTC 2025


Well, for one thing, our government isn't asking advice at this point --
they are issuing edicts.

For another thing, this is yet another reason that linguists should be
proactively LOUD about the unsuitability for LLMs (for just about anything).

Emily


On Tue, Feb 4, 2025 at 8:23 AM Stela Manova via Lingtyp <
lingtyp at listserv.linguistlist.org> wrote:

> In my message, I do not advertise LLMs as true archives. I only point out
> that much data are stored somewhere already; of course, we do not know in
> what format and order. Then, I do not think that when your government
> decides which data should be preserved (or backed up), they will ask
> linguists, which thus again leads us to LLMs / computer scientists.
> Stela
>
> ------------------------------
> *From:* Lingtyp on behalf of William Croft via Lingtyp
> *Sent:* Tuesday, February 4, 2025 5:01 PM
> *To:* Juergen Bohnemeyer
> *Cc:* lingtyp at listserv.linguistlist.org
> *Subject:* Re: [Lingtyp] Concerns about U.S. policies and linguistic
> archives
>
> I would add that if Musk/Trump are successful in their efforts (they
> aren’t stopped by the courts or the people), then red states such as Texas
> may follow their lead. Also, federal funding pervades government budgets at
> all levels (state, local etc.), as everyone here is about to discover. That
> gives the federal government leverage over state and local goverment
> activities and budget allocations.
>
> Bill
>
> On Feb 4, 2025, at 7:33 AM, Juergen Bohnemeyer via Lingtyp <
> lingtyp at listserv.linguistlist.org> wrote:
>
> Dear Jocelyn – Indeed, we are once again finding ourselves in
> “interesting”, read unprecedented and disturbing, times. Now, I may not be
> in the best position to respond to your query, but any immediate concern
> for the safety of language archives would only relate to things that are
> under the control of the federal government, such as the Library of
> Congress or the National Endowment for the Humanities. And as far as I
> know, these have not been archiving data and records from endangered
> languages.
>
> I do, however, worry about the Smithsonian Institution in this regard.
> Other than the Smithsonian, the language archive that comes immediately to
> mind, AILLA at UT, is not under the purview of the federal government.
>
> In any event, beyond the current situation, it seems indeed vitally
> important to connect the world’s digital language archives and create a
> system of mirrors in order to effectively decentralize the data and thereby
> make it less vulnerable to threats on any one site or even country. It’s my
> understanding that the people in charge of the archives are well aware of
> this and have begun to take steps. But it’s a long-haul project, based on
> my very incomplete understanding.
>
> Best – Juergen
>
> Juergen Bohnemeyer (He/Him)
> Professor, Department of Linguistics
> University at Buffalo
>
> Office: 642 Baldy Hall, UB North Campus
> Mailing address: 609 Baldy Hall, Buffalo, NY 14260
> Phone: (716) 645 0127
> Fax: (716) 645 3825
> Email: *jb77 at buffalo.edu <jb77 at buffalo.edu>*
> Web: *http://www.acsu.buffalo.edu/~jb77/
> <https://urldefense.com/v3/__http://www.acsu.buffalo.edu/*jb77/__;fg!!K-Hz7m0Vt54!lZlQBu-ZCcJBmwwcV31AfzXekzrXaQEwQaPqlmajisvNUOG3CMZIRvoOhlu6rT0G0NN9jTggHRp5r9LUgMC-b9YZs2JLRTe3-Q$>*
>
>
> Office hours Tu/Th 3:30-4:30pm in 642 Baldy or via Zoom (Meeting ID 585
> 520 2411; Passcode Hoorheh)
>
> There’s A Crack In Everything - That’s How The Light Gets In
> (Leonard Cohen)
> --
>
>
>
> *From: *Lingtyp <*lingtyp-bounces at listserv.linguistlist.org
> <lingtyp-bounces at listserv.linguistlist.org>*> on behalf of Jocelyn Aznar
> via Lingtyp <*lingtyp at listserv.linguistlist.org
> <lingtyp at listserv.linguistlist.org>*>
> *Date: *Tuesday, February 4, 2025 at 05:02
> *To: **lingtyp at listserv.linguistlist.org
> <lingtyp at listserv.linguistlist.org>* <*lingtyp at listserv.linguistlist.org
> <lingtyp at listserv.linguistlist.org>*>
> *Subject: *[Lingtyp] Concerns about U.S. policies and linguistic archives
> Dear colleagues,
>
> I know this list is primarily meant for discussing ideas and
> observations related to linguistic typology, rather than politics.
> However, current U.S. policies regarding scientific data have led me to
> wonder whether these policies might affect the fields of linguistics and
> humanities.
>
> When I heard about data related to ecology and the environment being
> discarded, I immediately worried the same could happen to linguistic
> archives and datasets. But maybe it is just me, dear colleagues working
> in the US, what do you think? Could this happen as well to archives
> related to linguistics and humanities?
>
> I believe that if we address this issue proactively, we’ll be better
> placed to preserve more data should the need arise. For instance, we
> could check whether the existing infrastructure outside of the US, ELAR,
> HumaNum/Ortolang, Pangloss, Paradisec, etc, would be able to handle or
> help to face such a crisis? or whether we should consider setting up
> some sort of emergency server so that researchers can transfer data at
> risk of being lost?
>
> One possible strategy would be to prepare a brief manual (probably as a
> webpage), after discussing with each institution of course, describing
> which archives outside the U.S. could accept data from an archive from
> the US, in which format, what kind of data would be accepted, etc. Then,
> if needed, U.S based researchers could formulate a plan to safeguard
> their data. By doing that, we could also identify gaps in current
> coverage and, if necessary, establish an emergency archive or server to
> fill those gaps.
>
> Best regards,
> Jocelyn Aznar
>
> ¹ I’m of course also concerned about data from other fields, though I
> feel more competent discussing linguistic data. Still, if we build an
> infrastructure for linguistic data from the U.S., it might be possible
> to scale it up for other disciplines as well.
> _______________________________________________
> Lingtyp mailing list
> *Lingtyp at listserv.linguistlist.org <Lingtyp at listserv.linguistlist.org>*
> *https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flingtyp&data=05%7C02%7Cjb77%40buffalo.edu%7C8ad63c8d02e04bd681a208dd4502fde3%7C96464a8af8ed40b199e25f6b50a20250%7C0%7C0%7C638742601354965905%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=DJ%2Fl9ABlxi%2BjR%2B8C9PDqzDpGS5vkWcUnOZy6OWubBuI%3D&reserved=0
> <https://urldefense.com/v3/__https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp__;!!K-Hz7m0Vt54!lZlQBu-ZCcJBmwwcV31AfzXekzrXaQEwQaPqlmajisvNUOG3CMZIRvoOhlu6rT0G0NN9jTggHRp5r9LUgMC-b9YZs2KKINT-RA$>*
> _______________________________________________
> Lingtyp mailing list
> *Lingtyp at listserv.linguistlist.org <Lingtyp at listserv.linguistlist.org>*
> *https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp
> <https://urldefense.com/v3/__https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp__;!!K-Hz7m0Vt54!lZlQBu-ZCcJBmwwcV31AfzXekzrXaQEwQaPqlmajisvNUOG3CMZIRvoOhlu6rT0G0NN9jTggHRp5r9LUgMC-b9YZs2KKINT-RA$>*
>
>
> _______________________________________________
> Lingtyp mailing list
> Lingtyp at listserv.linguistlist.org
>
> https://urldefense.com/v3/__https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp__;!!K-Hz7m0Vt54!lZlQBu-ZCcJBmwwcV31AfzXekzrXaQEwQaPqlmajisvNUOG3CMZIRvoOhlu6rT0G0NN9jTggHRp5r9LUgMC-b9YZs2KKINT-RA$
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20250204/2d8f83d6/attachment.htm>


More information about the Lingtyp mailing list