<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Hi Jen, </div>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Yes!! That’s exactly what happened here! Who knew in 2013 that… etc. </div>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Best,</div>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Dominika </div>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="direction: ltr; font-family: Aptos, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div id="ms-outlook-mobile-signature">
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);">Dominika M. Baran</span></p>
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);">Associate Professor</span></p>
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);">English Department</span></p>
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);">Duke University </span></p>
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);">Allen Building 303</span></p>
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);">Durham, NC 27708</span></p>
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);"> </span></p>
<p class="MsoNormal" style="line-height: 1.2; margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="font-family: "Times New Roman", serif; font-size: 10pt; color: rgb(32, 56, 100);">Pronouns: she/her/hers</span></p>
<p class="MsoNormal" style="margin: 0in; font-family: Calibri, sans-serif; font-size: 11pt;">
<span style="color: rgb(47, 85, 151);"> </span></p>
</div>
<div id="mail-editor-reference-message-container">
<div class="ms-outlook-mobile-reference-message skipProofing" style="direction: ltr;">
</div>
<div class="ms-outlook-mobile-reference-message skipProofing" style="text-align: left; padding: 3pt 0in 0in; border-width: 1pt medium medium; border-style: solid none none; border-color: rgb(181, 196, 223) currentcolor currentcolor; font-family: Aptos; font-size: 12pt; color: black;">
<b>From: </b>Linganth <linganth-bounces@listserv.linguistlist.org> on behalf of Roth-Gordon, Jen - (jenrothg) <jenrothg@arizona.edu><br>
<b>Date: </b>Thursday, April 9, 2026 at 3:58 PM<br>
<b>To: </b>linganth@listserv.linguistlist.org <linganth@listserv.linguistlist.org><br>
<b>Subject: </b>Re: [Linganth] [EXT] Re: Recommendations for tools transcribing and analyzing large amounts of data<br>
<br>
</div>
<div class="elementToProof" style="font-family: Aptos, Aptos_EmbeddedFont, Aptos_MSFontService, Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Re: "I'm curious, how do you end up with so much data without first thinking<br>
about how you will handle it?"</div>
<div class="elementToProof" style="direction: ltr; font-family: Aptos, Aptos_EmbeddedFont, Aptos_MSFontService, Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div class="elementToProof" style="font-family: Aptos, Aptos_EmbeddedFont, Aptos_MSFontService, Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Funny! I think many of us find ourselves swimming in data and notes that would fill rooms if printed out (especially for long-term projects). While overwhelming, that would be my definition of a successful research project!</div>
<div class="elementToProof" style="direction: ltr; font-family: Aptos, Aptos_EmbeddedFont, Aptos_MSFontService, Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div class="elementToProof" style="font-family: Aptos, Aptos_EmbeddedFont, Aptos_MSFontService, Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Sending solidarity (in lieu of concrete tech suggestions),</div>
<div class="elementToProof" style="font-family: Aptos, Aptos_EmbeddedFont, Aptos_MSFontService, Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
jen</div>
<div id="Signature" class="elementToProof">
<div class="elementToProof" style="font-size: 10pt;"> <br>
Jennifer Roth-Gordon<br>
Associate Professor Emerita<br>
School of Anthropology<br>
University of Arizona<br>
Tucson, AZ 85721-0030</div>
<div class="elementToProof" style="direction: ltr; font-size: 10pt;"><br>
</div>
<div class="elementToProof" style="direction: ltr; font-size: 16px; color: rgb(64, 54, 53);">
<span style="background-color: rgb(255, 255, 255);"><br>
</span><span style="font-family: Aptos, Aptos_EmbeddedFont, Aptos_MSFontService, Calibri, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);"><br>
</span></div>
</div>
<hr style="display: inline-block; width: 98%;">
<div id="divRplyFwdMsg">
<div style="direction: ltr; font-family: Calibri, sans-serif; font-size: 11pt; color: rgb(0, 0, 0);">
<b>From:</b> Linganth <linganth-bounces@listserv.linguistlist.org> on behalf of Jocelyn Aznar <contact@jocelynaznar.eu><br>
<b>Sent:</b> Thursday, April 9, 2026 9:28 PM<br>
<b>To:</b> linganth@listserv.linguistlist.org <linganth@listserv.linguistlist.org><br>
<b>Subject:</b> [EXT] Re: [Linganth] Recommendations for tools transcribing and analyzing large amounts of data</div>
<div style="direction: ltr;"> </div>
</div>
<div class="ms-outlook-mobile-reference-message skipProofing">External Email<br>
<br>
Hi everyone,<br>
<br>
I'm curious, how do you end up with so much data without first thinking<br>
about how you will handle it?<br>
<br>
As you are within an English department, I assume you work with English?<br>
Do you have some budget? What kind of annotation do you need? which<br>
format? how do you do your analysis? using CSV files? XML? should the<br>
data be reusable by other researchers? meant for being archived? FAIR? etc.<br>
<br>
Using online AI tools is probably not ethical, as you have no way to<br>
know what will the companies do with the data and what the people you<br>
recorded said... If you have a recent computer, some budget or access to<br>
University servers, you can use for instance Whisper and a model from<br>
Mistral (like the 7B) to do some annotations automatically. With<br>
languages like English, French and co, it works quite well. But that<br>
requires some scripting.<br>
<br>
Best,<br>
Jocelyn<br>
<br>
Le 09/04/2026 à 21:13, Nathan Straub 曹內森 a écrit :<br>
> Hi Dominika,<br>
><br>
> I use Vook.ai (an AI-based subscription service) for rapid automatic<br>
> transcription of English. (It also does Spanish, French, Italian,<br>
> Portuguese, and German.) You would likely have to sort out overlaps and<br>
> speaker labels on you own after that.<br>
><br>
> For field recordings, I liked using SIL's Saymore software, because it<br>
> provided a place to store recordings and break up a recording into short<br>
> breath groups and listen again and again with slow speech and type up<br>
> rough transcriptions, and then I could port the vernacular and free<br>
> translation lines into FLEx.<br>
><br>
> Which languages are you working with?<br>
><br>
> Nathan<br>
><br>
> We are sent into this world for some end. It is our duty to discover by<br>
> close study what this end is & when we once discover it to pursue it<br>
> with unconquerable perseverance.<br>
> JQA at age 12 to his brother Charles (June 1778)<br>
><br>
> On Thu, Apr 9, 2026, 12:02 Dominika Baran, Ph.D.<br>
> <dominika.baran@duke.edu <mailto:dominika.baran@duke.edu>> wrote:<br>
><br>
> Dear Colleagues,<br>
><br>
> I am looking for recommendations of your favorite tool(s), at the<br>
> moment, for processing large amounts of recorded spoken & written<br>
> conversational data (informal interviews, free conversations), for<br>
> both transcription and coding & analysis.<br>
><br>
> I have about 100 hours of digitally recorded conversations,<br>
> including those among multiple speakers, with lots of simultaneous<br>
> speech, two conversations going on at once, overlap, and code-<br>
> switching (mostly bilingual, occasionally trilingual). I also have<br>
> 13 years of written group chat conversations, which don’t need<br>
> transcribing but it is over 300,000 words. I am looking for<br>
> suggestions for software, online or otherwise, for both<br>
> transcription (which is tricky because of the multilingual and<br>
> overlapping conversations) and, more importantly, organization,<br>
> coding, and analysis. It has been a while since I have dealt with<br>
> THIS much data and I am sure there is a lot out there that I don’t<br>
> know about - all and any suggestions of what has worked for folks<br>
> are very much appreciated!<br>
><br>
> Best,<br>
> Dominika<br>
><br>
><br>
> Dominika M. Baran<br>
><br>
> Associate Professor<br>
><br>
> English Department<br>
><br>
> Duke University<br>
><br>
> Allen Building 303<br>
><br>
> Durham, NC 27708<br>
><br>
> Pronouns: she/her/hers<br>
><br>
> _______________________________________________<br>
> Linganth mailing list<br>
> Linganth@listserv.linguistlist.org<br>
> <mailto:Linganth@listserv.linguistlist.org><br>
> <a href="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" id="OWAdd63cae6-e10e-7a2b-724a-c20382269579" class="OWAAutoLink" originalsrc="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" data-auth="NotApplicable" data-outlook-id="8f57e28c-6904-4d68-8518-da5b7d282217">
https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth</a><br>
> <<a href="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" id="OWAc5022b7d-4202-615b-940f-4d745f6ff29e" class="OWAAutoLink" originalsrc="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" data-auth="NotApplicable" data-outlook-id="b7c07b7f-2526-46cd-92c1-e6a43f564127">https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth</a>><br>
><br>
><br>
> _______________________________________________<br>
> Linganth mailing list<br>
> Linganth@listserv.linguistlist.org<br>
> <a href="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" id="OWA43663e37-26ff-6530-0702-716de95b562c" class="OWAAutoLink" originalsrc="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" data-auth="NotApplicable" data-outlook-id="cc4e5cce-c71e-423c-bc4a-a3c791c10fd9">
https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth</a><br>
<br>
_______________________________________________<br>
Linganth mailing list<br>
Linganth@listserv.linguistlist.org<br>
<a href="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" id="OWA424422c2-720b-6812-4aa8-aad2878ddc74" class="OWAAutoLink" originalsrc="https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth" data-auth="NotApplicable" data-outlook-id="201ec168-fc05-42c7-b302-5f266eac4890">https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/linganth</a><br>
<br>
</div>
</div>
</body>
</html>