<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">Elnaz,<div class=""><br class=""></div><div class="">I am not familiar with Adobe Acrobat DC, so I will defer to Brian's email.<div class=""><br class=""></div><div class="">The problem with the university upload of the txt files to the university database is something that people who support this process need to clarify. It is possible that they expect some BOM character at the beginning of the txt file to explicitly indicate the text encoding. It would be best if you tell them that those text files are UTF-8 text encoding and let them say what they believe is missing in those files for them to get the upload right. Normally newer text editors can automatically detect the text file encoding and adjust their display accordingly.<br class=""><div class="">
<br class=""><br class="">Leonid.
</div>
<div><br class=""><blockquote type="cite" class=""><div class="">On Oct 28, 2021, at 15:08, Brian Macwhinney <<a href="mailto:macw@andrew.cmu.edu" class="">macw@andrew.cmu.edu</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><meta http-equiv="Content-Type" content="text/html; charset=utf-8" class=""><div style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">When I convert files to PDF using Adobe Acrobat DC, all the characters look fine.<div class="">I just did this for one file.  Perhaps batch doesn’t work well?</div><div class=""><br class=""></div><div class="">Why are your computer people uploading in text format? They should just be uploading in CHAT format.</div><div class=""><br class=""><div class="">
<meta charset="UTF-8" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: normal; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;" class=""><span style="font-size: 17px;" class="">— Brian MacWhinney</span><br style="font-size: 17px;" class=""><span style="font-size: 17px;" class="">Teresa Heinz Professor of Cognitive Psychology, </span><br style="font-size: 17px;" class=""><span style="font-size: 17px;" class="">Computational Linguistics, </span><br style="font-size: 17px;" class=""><span style="font-size: 17px;" class="">and Modern Languages, CMU</span></div></div></div></div></div></div></div></div></div></div></div></div></div></div>
</div>
<div class=""><br class=""><blockquote type="cite" class=""><div class="">On Oct 28, 2021, at 3:03 PM, Elnaz Kia <<a href="mailto:ek325@nau.edu" class="">ek325@nau.edu</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class="">Dear Leonid and Brian,<div class=""><br class=""></div><div class="">Thank you both so much for your detailed responses.</div><div class=""><br class=""></div><div class=""><a class="gmail_plusreply docs-creator" id="plusReplyChip-0" href="mailto:spektor@andrew.cmu.edu" tabindex="-1">@Leonid Spektor</a> you are right. I just realized that the txt files that I created using the CHSTRING and REN commands on my computer were correct. However, when the database crew at the university upload the txt files to the university database, the txt files do not show the correct characters. Do you have any solutions for this? </div><div class=""><br class=""></div><div class="">Another issue is with the pdf versions of the mentioned files. Even on my computer when I convert the correctly converted txt files to pdf, it does not show the characters correctly. Are there any solutions for this problem? Note: I create the pdf files in batch using the Adobe Acrobat DC Create PDFs Tool.</div><div class=""><br class=""></div><div class="">Many thanks for taking the time and answering my questions!</div><div class=""><br class=""></div><div class="">Best,</div><div class="">Elnaz</div><div class=""><br class=""></div><div class="">Also, </div><div class=""><br class=""></div><div class="">Also<br clear="all" class=""><div class=""><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr" class=""><div dir="ltr" class=""><div dir="ltr" class=""><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><span style="font-size:19.0pt;font-family:"Brush Script MT";color:#00386c" class="">Elnaz
Kia, Ph.D</span><span style="font-size:18.0pt;font-family:"Brush Script MT";color:#00386c" class="">.</span><span style="font-size:12.0pt;font-family:"Brush Script MT";color:#00386c" class=""> </span><span style="font-size:12.0pt;font-family:"Times New Roman",serif;color:#00386c" class="">(she,
her, hers)</span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><i class=""><span style="font-size: 12pt; font-family: "Times New Roman", serif;" class=""><a href="https://l2trec.utah.edu/about/staff-directory.php" target="_blank" class="docs-creator">Post-Doctoral Research
Associate</a></span></i></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><a href="https://l2trec.utah.edu/" target="_blank" class="docs-creator"><span style="font-size:12.0pt;font-family:"Times New Roman",serif;color:#0d0d0d" class="">Second Language Teaching and Research
Center (L2TReC)</span></a><span class=""><span style="font-size:12.0pt;color:#0d0d0d" class=""></span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><span class=""><span style="font-size: 12pt; font-family: "Times New Roman", serif;" class="">University of Utah</span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><a href="https://www.linkedin.com/in/elnaz-kia?lipi=urn%3Ali%3Apage%3Ad_flagship3_profile_view_base_contact_details%3B%2FResGVF2SMGMAxhocdPnRw%3D%3D" target="_blank" class="docs-creator"><span style="font-size:12.0pt;font-family:"Times New Roman",serif;color:#0d0d0d" class="">LinkedIn</span></a><span class=""><span style="color:#0d0d0d" class=""></span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><a href="https://elnazkia.weebly.com/" target="_blank" class="docs-creator"><span style="font-size:12.0pt;font-family:"Times New Roman",serif" class="">Personal
Website</span></a><span class=""><span style="color:#0d0d0d" class=""></span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><img src="https://docs.google.com/uc?export=download&id=1-im6hnX4Kj15S9S-Fy7_gMFj4Nn-RGP_&revid=0B56lCp579fCjaXhEQlhaSndZdjgxWDRrRk8wMWFHN0wydGFZPQ" class=""></p></div></div></div></div></div><br class=""></div></div><br class=""><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Oct 28, 2021 at 11:50 AM Leonid Spektor <<a href="mailto:spektor@andrew.cmu.edu" class="">spektor@andrew.cmu.edu</a>> wrote:<br class=""></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div style="overflow-wrap: break-word;" class="">Elnaz,<div class=""><br class=""></div><div class=""><span style="white-space:pre-wrap" class="">     </span>I just want to add more specific information to what Brian wrote. Your text editor needs to be able to display Unicode UTF-8 encoded characters. If you open the .txt file with CLAN, then you will see that characters from .txt file are displayed correctly.</div><div class=""><br class=""></div><div class="">If characters in your .txt file are not displayed correctly in CLAN, then please make sure that you have the latest version of CLAN. Otherwise, please email your sample file that show this problem to me directly for further testing.</div><div class=""><br class=""></div><div class="">Copying the line from .cha transcript that you have in your email below to a test file on my computer and then running the CHSTRING and REN command produces correct result on my computer.<br class=""><div class="">
<br class=""><br class="">Leonid.
</div>
<div class=""><br class=""><blockquote type="cite" class=""><div class="">On Oct 28, 2021, at 14:06, Brian Macwhinney <<a href="mailto:macw@andrew.cmu.edu" target="_blank" class="">macw@andrew.cmu.edu</a>> wrote:</div><br class=""><div class=""><div style="overflow-wrap: break-word;" class="">Dear Elnaz,<div class="">     In order to view non-Roman characters such as é, as well as diacritics, CLAN relies on use of the Arial Unicode font which supports not only special European characters, but also Chinese, Sinhalese etc. because it is all of Unicode.  If you convert a CHAT file viewed in Unicode to .txt format, you are going to see what you are seeing now unless your editor for the .txt format allows you to load in a Unicode font.</div><div class=""> <br class=""><div class="">
<div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div dir="auto" style="letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><div style="font-family:Helvetica;font-size:12px;font-style:normal;font-variant-caps:normal;font-weight:normal;letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none" class=""><span style="font-size:17px" class="">— Brian MacWhinney</span><br style="font-size:17px" class=""><span style="font-size:17px" class="">Teresa Heinz Professor of Cognitive Psychology, </span><br style="font-size:17px" class=""><span style="font-size:17px" class="">Computational Linguistics, </span><br style="font-size:17px" class=""><span style="font-size:17px" class="">and Modern Languages, CMU</span></div></div></div></div></div></div></div></div></div></div></div></div></div></div>
</div>
<div class=""><br class=""><blockquote type="cite" class=""><div class="">On Oct 28, 2021, at 1:27 PM, Elnaz Kia <<a href="mailto:ek325@nau.edu" target="_blank" class="">ek325@nau.edu</a>> wrote:</div><br class=""><div class=""><div dir="ltr" class="">Hi Everyone,<div class=""><br class=""></div><div class="">I have a question about accents in Spanish. So, here is what I have on a .cha transcript file:</div><div class=""><br class=""></div><div class="">*STU:     yo [:: _] me gusta ver(lo) <b class=""><font color="#ff0000" class="">él</font></b> [:: _] porque él es muy bien<br class="">   [:: bueno] en el xxx eh porque es el goleador .</div><div class=""><br class=""></div><div class="">And when I convert the file to a text. this happens:</div><pre style="white-space:pre-wrap" class=""><font class="">*STU:       yo [:: _] me gusta ver(lo)</font><b class=""><font class=""> </font><font color="#ff0000" class="">él</font></b><font color="#ff0000" class=""> </font><span class="">[:: _] porque Ã©l es muy bien </span></pre><div class=""><span style="white-space:pre-wrap" class="">   [:: bueno] en el xxx eh porque es el goleador .</span></div><div class=""><br class=""></div><div class=""><font class=""><span style="white-space:pre-wrap" class="">And this is how I convert .cha files to .txt files:</span></font></div><div class=""><font class=""><span style="white-space:pre-wrap" class=""><br class=""></span></font></div><div class=""><font face="times new roman, serif" color="#ff0000" class="">chstring<span style="font-size:11pt" class=""> </span></font><span style="color:red;font-size:11pt;font-family:"Times New Roman",serif" class="">+re +cbullets.cut *.cha</span></div>
<div class=""><span style="color:red;font-family:"Times New Roman",serif;font-size:11pt" class="">ren -f +re *.chstr.cex *.txt</span> <br class=""></div><div class=""><br class=""></div><div class="">My question is, how can I avoid this problem?</div><div class=""><br class=""></div><div class="">Thanks,</div><div class="">Elnaz</div><div class=""><br class=""></div><div class=""><br class=""></div><div class=""><br class=""></div><div class=""><br class=""></div><div class=""><div class=""><div dir="ltr" class=""><div dir="ltr" class=""><div dir="ltr" class=""><div dir="ltr" class=""><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><span style="font-size:19pt;font-family:"Brush Script MT";color:rgb(0,56,108)" class="">Elnaz
Kia, Ph.D</span><span style="font-size:18pt;font-family:"Brush Script MT";color:rgb(0,56,108)" class="">.</span><span style="font-size:12pt;font-family:"Brush Script MT";color:rgb(0,56,108)" class=""> </span><span style="font-size:12pt;font-family:"Times New Roman",serif;color:rgb(0,56,108)" class="">(she,
her, hers)</span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><i class=""><span style="font-size:12pt;font-family:"Times New Roman",serif" class=""><a href="https://l2trec.utah.edu/about/staff-directory.php" target="_blank" class="">Post-Doctoral Research
Associate</a></span></i></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><a href="https://l2trec.utah.edu/" target="_blank" class=""><span style="font-size:12pt;font-family:"Times New Roman",serif;color:rgb(13,13,13)" class="">Second Language Teaching and Research
Center (L2TReC)</span></a><span class=""><span style="font-size:12pt;color:rgb(13,13,13)" class=""></span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><span class=""><span style="font-size:12pt;font-family:"Times New Roman",serif" class="">University of Utah</span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><a href="https://www.linkedin.com/in/elnaz-kia?lipi=urn%3Ali%3Apage%3Ad_flagship3_profile_view_base_contact_details%3B%2FResGVF2SMGMAxhocdPnRw%3D%3D" target="_blank" class=""><span style="font-size:12pt;font-family:"Times New Roman",serif;color:rgb(13,13,13)" class="">LinkedIn</span></a><span class=""><span style="color:rgb(13,13,13)" class=""></span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><a href="https://elnazkia.weebly.com/" target="_blank" class=""><span style="font-size:12pt;font-family:"Times New Roman",serif" class="">Personal
Website</span></a><span class=""><span style="color:rgb(13,13,13)" class=""></span></span></p><p class="MsoNormal" style="margin-bottom:0in;line-height:normal"><img src="https://docs.google.com/uc?export=download&id=1-im6hnX4Kj15S9S-Fy7_gMFj4Nn-RGP_&revid=0B56lCp579fCjaXhEQlhaSndZdjgxWDRrRk8wMWFHN0wydGFZPQ" class=""></p></div></div></div></div></div></div></div><div class=""><br class=""></div>
-- <br class="">
You received this message because you are subscribed to the Google Groups "chibolts" group.<br class="">
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com" target="_blank" class="">chibolts+unsubscribe@googlegroups.com</a>.<br class="">
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/CAOwOJYkD%3DarGj%2B7sDYxHzyZikHs_Jd3wHJmCALbCfJCWe69T1A%40mail.gmail.com?utm_medium=email&utm_source=footer" target="_blank" class="">https://groups.google.com/d/msgid/chibolts/CAOwOJYkD%3DarGj%2B7sDYxHzyZikHs_Jd3wHJmCALbCfJCWe69T1A%40mail.gmail.com</a>.<br class="">
</div></blockquote></div><br class=""></div></div><div class=""><br class=""></div>
-- <br class="">
You received this message because you are subscribed to the Google Groups "chibolts" group.<br class="">
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com" target="_blank" class="">chibolts+unsubscribe@googlegroups.com</a>.<br class="">
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/EE73C6A2-849E-4FED-906C-7ED78BAD803E%40andrew.cmu.edu?utm_medium=email&utm_source=footer" target="_blank" class="">https://groups.google.com/d/msgid/chibolts/EE73C6A2-849E-4FED-906C-7ED78BAD803E%40andrew.cmu.edu</a>.<br class="">
</div></blockquote></div><br class=""></div></div></blockquote></div><div class=""><br class="webkit-block-placeholder"></div>
-- <br class="">
You received this message because you are subscribed to the Google Groups "chibolts" group.<br class="">
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com" class="">chibolts+unsubscribe@googlegroups.com</a>.<br class="">
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/CAOwOJYnvJvV%3DDbDHN8if%3Dikg-4YOyi_e0p5Bj9VkAMONvKVXeg%40mail.gmail.com?utm_medium=email&utm_source=footer" class="">https://groups.google.com/d/msgid/chibolts/CAOwOJYnvJvV%3DDbDHN8if%3Dikg-4YOyi_e0p5Bj9VkAMONvKVXeg%40mail.gmail.com</a>.<br class="">
</div></blockquote></div><br class=""></div></div></div></blockquote></div><br class=""></div></div></body></html>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "chibolts" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/8E5C16EE-1F46-4A67-BD05-ED236B195E01%40andrew.cmu.edu?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/8E5C16EE-1F46-4A67-BD05-ED236B195E01%40andrew.cmu.edu</a>.<br />