[Linganth] Recommendations for tools transcribing and analyzing large amounts of data

Baiocchi, Maria Lis baiocchiml at pitt.edu
Thu Apr 9 22:57:51 UTC 2026


Dear Dominika,
I've used Pinpoint and Express Scribe for transcription and they work quite well. My colleagues tell me Otter.ai and TurboScribe also work well, though I have not tried those.
Good luck!
Maria Lis

________________________________
From: Linganth <linganth-bounces at listserv.linguistlist.org> on behalf of linganth-request at listserv.linguistlist.org <linganth-request at listserv.linguistlist.org>
Sent: Thursday, April 9, 2026 17:30
To: linganth at listserv.linguistlist.org <linganth at listserv.linguistlist.org>
Subject: Linganth Digest, Vol 139, Issue 13

Send Linganth mailing list submissions to
        linganth at listserv.linguistlist.org

To subscribe or unsubscribe via the World Wide Web, visit
        https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flinganth&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665573105%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=li6oKixEj0TAfoCxIdeFOrfBFhKCkC5eWQ1QHioosqQ%3D&reserved=0<https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth>
or, via email, send a message with subject or body 'help' to
        linganth-request at listserv.linguistlist.org

You can reach the person managing the list at
        linganth-owner at listserv.linguistlist.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Linganth digest..."


Today's Topics:

   1. Re: Recommendations for tools transcribing and analyzing
      large amounts of data (Kathe Managan)
   2. Re: [EXT] Re: Recommendations for tools transcribing and
      analyzing large amounts of data (Dominika Baran, Ph.D.)


----------------------------------------------------------------------

Message: 1
Date: Thu, 9 Apr 2026 20:01:20 +0000
From: Kathe Managan <kathe.managan at louisiana.edu>
To: "Dominika Baran, Ph.D." <dominika.baran at duke.edu>, "Linguistic
        Anthropology Discussion Group (LINGANTH at listserv.linguistlist.org)"
        <linganth at listserv.linguistlist.org>
Subject: Re: [Linganth] Recommendations for tools transcribing and
        analyzing large amounts of data
Message-ID:
        <PH0PR22MB253408F24F85527553B721E6FD582 at PH0PR22MB2534.namprd22.prod.outlook.com>

Content-Type: text/plain; charset="windows-1252"

Hi Dominika,

I recently switched to Trint for a big project. It works in 40 different languages and has good accuracy. It also has robust privacy and security features.

Best,
Kathe

Get Outlook for iOS<https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Faka.ms%2Fo0ukef&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665607199%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=681XD9iMoFpTo7YZmXATdDvz0xL2R9Ch%2BY46HNalk0c%3D&reserved=0<https://aka.ms/o0ukef>>
________________________________
From: Linganth <linganth-bounces at listserv.linguistlist.org> on behalf of Dominika Baran, Ph.D. <dominika.baran at duke.edu>
Sent: Thursday, April 9, 2026 2:01:58 PM
To: Linguistic Anthropology Discussion Group (LINGANTH at listserv.linguistlist.org) <linganth at listserv.linguistlist.org>
Subject: [Linganth] Recommendations for tools transcribing and analyzing large amounts of data

CAUTION: This email originated from outside of UL Lafayette. Do not click links or open attachments unless you recognize the sender and know the content is safe.

Dear Colleagues,

I am looking for recommendations of your favorite tool(s), at the moment, for processing large amounts of recorded spoken & written conversational data (informal interviews, free conversations), for both transcription and coding & analysis.

I have about 100 hours of digitally recorded conversations, including those among multiple speakers, with lots of simultaneous speech, two conversations going on at once, overlap, and code-switching (mostly bilingual, occasionally trilingual). I also have 13 years of written group chat conversations, which don?t need transcribing but it is over 300,000 words. I am looking for suggestions for software, online or otherwise, for both transcription (which is tricky because of the multilingual and overlapping conversations) and, more importantly, organization, coding, and analysis. It has been a while since I have dealt with THIS much data and I am sure there is a lot out there that I don?t know about - all and any suggestions of what has worked for folks are very much appreciated!

Best,
Dominika



Dominika M. Baran

Associate Professor

English Department

Duke University

Allen Building 303

Durham, NC 27708



Pronouns: she/her/hers


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://nam12.safelinks.protection.outlook.com/?url=http%3A%2F%2Flistserv.linguistlist.org%2Fpipermail%2Flinganth%2Fattachments%2F20260409%2F3b6b460c%2Fattachment-0001.htm&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665627854%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=7TAC3RgjnMfO8%2B2ZtfItQOC0r%2FRDFcgh8AOi%2FxCMA7s%3D&reserved=0<http://listserv.linguistlist.org/pipermail/linganth/attachments/20260409/3b6b460c/attachment-0001.htm>>

------------------------------

Message: 2
Date: Thu, 9 Apr 2026 20:30:47 +0000
From: "Dominika Baran, Ph.D." <dominika.baran at duke.edu>
To: "Roth-Gordon, Jen - (jenrothg)" <jenrothg at arizona.edu>,
        "linganth at listserv.linguistlist.org"
        <linganth at listserv.linguistlist.org>
Subject: Re: [Linganth] [EXT] Re: Recommendations for tools
        transcribing and analyzing large amounts of data
Message-ID:
        <DM6PR05MB4572F5003B7AFC4D7F41AE48ED582 at DM6PR05MB4572.namprd05.prod.outlook.com>

Content-Type: text/plain; charset="utf-8"

Hi Jen,

Yes!! That?s exactly what happened here! Who knew in 2013 that? etc.

Best,
Dominika


Dominika M. Baran
Associate Professor
English Department
Duke University
Allen Building 303
Durham, NC 27708

Pronouns: she/her/hers

From: Linganth <linganth-bounces at listserv.linguistlist.org> on behalf of Roth-Gordon, Jen - (jenrothg) <jenrothg at arizona.edu>
Date: Thursday, April 9, 2026 at 3:58?PM
To: linganth at listserv.linguistlist.org <linganth at listserv.linguistlist.org>
Subject: Re: [Linganth] [EXT] Re: Recommendations for tools transcribing and analyzing large amounts of data

Re: "I'm curious, how do you end up with so much data without first thinking
about how you will handle it?"

Funny! I think many of us find ourselves swimming in data and notes that would fill rooms if printed out (especially for long-term projects). While overwhelming, that would be my definition of a successful research project!

Sending solidarity (in lieu of concrete tech suggestions),
jen

Jennifer Roth-Gordon
Associate Professor Emerita
School of Anthropology
University of Arizona
Tucson, AZ 85721-0030



________________________________
From: Linganth <linganth-bounces at listserv.linguistlist.org> on behalf of Jocelyn Aznar <contact at jocelynaznar.eu>
Sent: Thursday, April 9, 2026 9:28 PM
To: linganth at listserv.linguistlist.org <linganth at listserv.linguistlist.org>
Subject: [EXT] Re: [Linganth] Recommendations for tools transcribing and analyzing large amounts of data

External Email

Hi everyone,

I'm curious, how do you end up with so much data without first thinking
about how you will handle it?

As you are within an English department, I assume you work with English?
Do you have some budget? What kind of annotation do you need? which
format? how do you do your analysis? using CSV files? XML? should the
data be reusable by other researchers? meant for being archived? FAIR? etc.

Using online AI tools is probably not ethical, as you have no way to
know what will the companies do with the data and what the people you
recorded said... If you have a recent computer, some budget or access to
University servers, you can use for instance Whisper and a model from
Mistral (like the 7B) to do some annotations automatically. With
languages like English, French and co, it works quite well. But that
requires some scripting.

Best,
Jocelyn

Le 09/04/2026 ? 21:13, Nathan Straub ??? a ?crit :
> Hi Dominika,
>
> I use Vook.ai (an AI-based subscription service) for rapid automatic
> transcription of English. (It also does Spanish, French, Italian,
> Portuguese, and German.) You would likely have to sort out overlaps and
> speaker labels on you own after that.
>
> For field recordings, I liked using SIL's Saymore software, because it
> provided a place to store recordings and break up a recording into short
> breath groups and listen again and again with slow speech and type up
> rough transcriptions, and then I could port the vernacular and free
> translation lines into FLEx.
>
> Which languages are you working with?
>
> Nathan
>
> We are sent into this world for some end.  It is our duty to discover by
> close study what this end is & when we once discover it to pursue it
> with unconquerable perseverance.
> JQA at age 12 to his brother Charles (June 1778)
>
> On Thu, Apr 9, 2026, 12:02 Dominika Baran, Ph.D.
> <dominika.baran at duke.edu <mailto:dominika.baran at duke.edu>> wrote:
>
> Dear Colleagues,
>
> I am looking for recommendations of your favorite tool(s), at the
> moment, for processing large amounts of recorded spoken & written
> conversational data (informal interviews, free conversations), for
> both transcription and coding & analysis.
>
> I have about 100 hours of digitally recorded conversations,
> including those among multiple speakers, with lots of simultaneous
> speech, two conversations going on at once, overlap, and code-
> switching (mostly bilingual, occasionally trilingual). I also have
> 13 years of written group chat conversations, which don?t need
> transcribing but it is over 300,000 words. I am looking for
> suggestions for software, online or otherwise, for both
> transcription (which is tricky because of the multilingual and
> overlapping conversations) and, more importantly, organization,
> coding, and analysis. It has been a while since I have dealt with
> THIS much data and I am sure there is a lot out there that I don?t
> know about - all and any suggestions of what has worked for folks
> are very much appreciated!
>
> Best,
> Dominika
>
>
> Dominika M. Baran
>
> Associate Professor
>
> English Department
>
> Duke University
>
> Allen Building 303
>
> Durham, NC 27708
>
> Pronouns: she/her/hers
>
> _______________________________________________
> Linganth mailing list
> Linganth at listserv.linguistlist.org
> <mailto:Linganth at listserv.linguistlist.org>
> https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flinganth&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665647797%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=pZokr%2F%2F%2FGm4ssQ2iXnl%2BhuxsdcyTdy2XFkk3SCpKv14%3D&reserved=0<https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth>
> <https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flinganth&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665667017%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=%2BhDZUWuz7TJ7CFTGmRKTwEzqDOJHenIxFdd9U6hcCmQ%3D&reserved=0<https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth>>
>
>
> _______________________________________________
> Linganth mailing list
> Linganth at listserv.linguistlist.org
> https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flinganth&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665689775%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=Lt0fec2B5rEbpIJMAZCR03GRHq3S10Skk45%2FLIId8eA%3D&reserved=0<https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth>

_______________________________________________
Linganth mailing list
Linganth at listserv.linguistlist.org
https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flinganth&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665709432%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=AkSlX%2FJmy4Sj9ZHqrSZaBQI0rUC12CtHwrs6vauOY5o%3D&reserved=0<https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://nam12.safelinks.protection.outlook.com/?url=http%3A%2F%2Flistserv.linguistlist.org%2Fpipermail%2Flinganth%2Fattachments%2F20260409%2Fc64fc84c%2Fattachment.htm&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665729224%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=%2FXbROUoNPbDYZChD0KXm%2BqIIlHRa2EX41EeIISL0QhE%3D&reserved=0<http://listserv.linguistlist.org/pipermail/linganth/attachments/20260409/c64fc84c/attachment.htm>>

------------------------------

Subject: Digest Footer

_______________________________________________
Linganth mailing list
Linganth at listserv.linguistlist.org
https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Flistserv.linguistlist.org%2Fcgi-bin%2Fmailman%2Flistinfo%2Flinganth&data=05%7C02%7Cbaiocchiml%40pitt.edu%7C1eb154ffe5b547886dd608de9676e943%7C9ef9f489e0a04eeb87cc3a526112fd0d%7C1%7C0%7C639113634665747995%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=RKv5NpxaUWW5WJppCr3te8AE5dRvHuHFXgAasBJggiM%3D&reserved=0<https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth>


------------------------------

End of Linganth Digest, Vol 139, Issue 13
*****************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/linganth/attachments/20260409/2dc1e646/attachment-0001.htm>


More information about the Linganth mailing list