[Lingtyp] Running an R code to analyze Phonotacticon

Ian Joo ian_joo at nucba.ac.jp
Tue May 23 14:49:09 UTC 2023


Dear all,

apparently I cannot send large files to the mailing list. I will try here again without my thesis draft, which you can find in this link instead: ianjoo.github.io/Thesis.pdf <http://ianjoo.github.io/Thesis.pdf>
Sorry for the cross-posting.

Regards,
Ian



> 22/5/2023 오후 5:38, Ian Joo <ian_joo at nucba.ac.jp> 작성:
> 
> Dear all,
> 
> Since several have already kindly replied to offer me help, I thought it would be more pratical to send my data and R script to everyone here. Please find attached the R Markdown script, the data files (Phonotacticon and PanPhon), and also a draft of my thesis for your reference. Again, thank you for your kindness.
> 
> Regards,
> Ian
> 
> 
> 
>> 2023. 5. 22. 오후 4:30, Ian Joo <ian_joo at nucba.ac.jp> 작성:
>> Dear typologists,
>> 
>> for my doctoral project, I am compiling and analyzing a database called Phonotacticon, a cross-linguistic database of basic phonotactic information.
>> I have collected more than 350 lects in my database as of now, the goal being around 450.
>> For my thesis, I have written an R script to analyze the phonological distances between Eurasian lects based on Phonotacticon. Running the code worked fine until 200 languages or so, albeit with several hours of running time. But now, as the size of the lects has grown (and the distances between each pair of lects have also grown exponentially), my 2020 model Macbook Pro with 16gb RAM cannot run the code anymore without crashing in the middle.
>> Perhaps it’s the hardware limit of my Macbook, or maybe I have written the code in an inefficient way. Anyway, I need to run the code somehow to finish revising my thesis. I tried using virtual machines like Google Pro+ with 32gb RAM, but the code crashed there too.
>> In case where any of you are using a high-end computer better than mine and you are also experienced with R, I was wondering if I can send you my R script and data so that you can run it on your computer and send me the results, or better yet, see if anything is wrong with my R script so that I can fix it to run it on my own computer.
>> I would much appreciate your help direly needed as this point.
>> 
>> From Netherlands,
>> Ian

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20230523/086cf05c/attachment-0004.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Phonotacticon.Rmd
Type: application/octet-stream
Size: 33700 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20230523/086cf05c/attachment-0001.obj>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20230523/086cf05c/attachment-0005.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Phonotacticon.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 452323 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20230523/086cf05c/attachment-0001.xlsx>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20230523/086cf05c/attachment-0006.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: PanPhonPhonotacticon1_0.csv
Type: text/csv
Size: 2318806 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20230523/086cf05c/attachment-0001.csv>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20230523/086cf05c/attachment-0007.htm>


More information about the Lingtyp mailing list