<div dir="ltr"><div>Hello,</div><div><br></div><div>First of all, Sorry to pick up this conversation for so long ago! </div><div><br></div><div>I am also trying to use the "flo" command to extract "clean" text from .cha files, and it works very well except one small thing -- it seems to automatically add line wraps to break long lines exceeding a certain length to several lines. </div><div><br></div><div>For example, for a file (060002c.cha) in the MacWhinney database, I run </div><div>flo +cr +t* 060002c.cha<br></div><div><br></div><div>and for a long line in the original .cha file </div><div>"</div><div>*MAR: no (.) it's not Mr Munsters (.) it's only the Munsters (.) what if the monsters won't be on anymore and xxx will be with other movie (.) what if it's at with the other program . <br></div><div>"</div><div><br></div><div>I got three lines </div><div>"<br></div><div>no it's not Mr Munsters it's only the Munsters what if the monsters won't<br> be on anymore and will be with other movie what if it's at with the other<br> program.<br></div><div>" <br></div><div>I am just wondering if there is any command/option/switch within Clan to avoid this and still keep them on the same line? I tried "LONGTIER", but it did not work. <br></div><div><br></div><div>Many thanks!</div><div><br></div><div>Sincerely,</div><div>Xiaowei</div><div><br></div><div><p style="color:rgb(0,0,0);margin:0in 0in 0.0001pt;font-family:Calibri,sans-serif;font-size:11pt"><span style="border:0px;font-style:inherit;font-variant:inherit;font-weight:inherit;font-stretch:inherit;font-size:10pt;line-height:inherit;font-family:Arial,sans-serif;font-kerning:inherit;font-feature-settings:inherit;margin:0px;padding:0px;vertical-align:baseline;color:rgb(31,73,125)"><b>Xiaowei Zhao, Ph.D.</b></span></p><p style="color:rgb(0,0,0);margin:0in 0in 0.0001pt;font-family:Calibri,sans-serif;font-size:11pt"><span style="border:0px;font-style:inherit;font-variant:inherit;font-weight:inherit;font-stretch:inherit;font-size:10pt;line-height:inherit;font-family:Arial,sans-serif;font-kerning:inherit;font-feature-settings:inherit;margin:0px;padding:0px;vertical-align:baseline;color:rgb(31,73,125)">Professor of Psychology</span></p><p style="color:rgb(0,0,0);margin:0in 0in 0.0001pt;font-family:Calibri,sans-serif;font-size:11pt"><span style="border:0px;font-style:inherit;font-variant:inherit;font-weight:inherit;font-stretch:inherit;font-size:10pt;line-height:inherit;font-family:Arial,sans-serif;font-kerning:inherit;font-feature-settings:inherit;margin:0px;padding:0px;vertical-align:baseline;color:rgb(31,73,125)"><br></span></p><p style="color:rgb(0,0,0);line-height:16.8667px;margin:0in 0in 0.0001pt;font-family:Calibri,sans-serif;font-size:11pt"><span style="border:0px;font-style:inherit;font-variant:inherit;font-weight:inherit;font-stretch:inherit;font-size:10pt;line-height:inherit;font-family:Arial,sans-serif;font-kerning:inherit;font-feature-settings:inherit;margin:0px;padding:0px;vertical-align:baseline;color:rgb(0,75,135)"><b>Emmanuel College</b></span></p><p style="color:rgb(0,0,0);margin:0in 0in 0.0001pt;font-family:Calibri,sans-serif;font-size:11pt"><span style="border:0px;font-style:inherit;font-variant:inherit;font-weight:inherit;font-stretch:inherit;font-size:10pt;line-height:inherit;font-family:Arial,sans-serif;font-kerning:inherit;font-feature-settings:inherit;margin:0px;padding:0px;vertical-align:baseline;color:rgb(31,73,125)">400 The Fenway | Boston | MA 02115</span></p><p style="color:rgb(0,0,0);line-height:29.3333px;margin:0in 0in 0.0001pt;font-family:Calibri,sans-serif;font-size:11pt"><span style="border:0px;font-style:inherit;font-variant:inherit;font-weight:inherit;font-stretch:inherit;font-size:10pt;line-height:inherit;font-family:Arial,sans-serif;font-kerning:inherit;font-feature-settings:inherit;margin:0px;padding:0px;vertical-align:baseline;color:rgb(5,99,193)"><a href="http://www.emmanuel.edu/" style="border:0px;font:inherit;margin:0px;padding:0px;vertical-align:baseline;color:rgb(5,99,193)">www.emmanuel.edu</a></span></p></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Feb 6, 2024 at 4:39 PM Leonid Spektor <<a href="mailto:spektor@andrew.cmu.edu">spektor@andrew.cmu.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div>Command <font color="#ff2600">flo +ca +t* *.cha</font> should work.<br id="m_1841540442675779114lineBreakAtBeginningOfMessage"><div>
<br><br style="color:rgb(0,0,0);font-family:Arial;font-size:16px;font-style:normal;font-variant-caps:normal;font-weight:400;letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none"><span style="color:rgb(0,0,0);font-family:Arial;font-size:16px;font-style:normal;font-variant-caps:normal;font-weight:400;letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none;float:none;display:inline">Leonid.</span>
</div>
<div><br><blockquote type="cite"><div>On Feb 6, 2024, at 16:14, Snigdha Khanna <<a href="mailto:snkhanna@iu.edu" target="_blank">snkhanna@iu.edu</a>> wrote:</div><br><div>I want to remove all annotations like the gestures and errors. Hence, I would like to use the txt format of just the transcribed text without annotations.<div><br></div><div>Any idea how to do that?</div><div><br></div><div><br></div><div class="gmail_quote"><div dir="auto" class="gmail_attr">On Tuesday, February 6, 2024 at 4:10:32 PM UTC-5 macw wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">CLAN’s FLO program does most of this. Alternatively, you could grab all the <w> tags from the XML version of the database.
<br>
<br>What kind of NLP do you want to use? You could apply Universal Dependencies directly.
<br>
<br>— Brian MacWhinney
<br>Teresa Heinz Professor of Cognitive Psychology,
<br>Language Technologies and Modern Languages, CMU
<br>
<br>> On Feb 6, 2024, at 3:08 PM, Snigdha Khanna <<a rel="nofollow">snkh...@iu.edu</a>> wrote:
<br>>
<br>> Hello!
<br>>
<br>> I am trying to extract "clean" text from annotated transcripts that I have. Is there any way to use CLAN to export a txt file format, or a simpler method to remove annotations from the transcripts, so that I can parse it using NLP?
<br>>
<br>> Any help is appreciated!
<br>>
<br>> Thanks,
<br>> Snigdha
<br>>
<br>> --
<br>> You received this message because you are subscribed to the Google Groups "chibolts" group.
<br>> To unsubscribe from this group and stop receiving emails from it, send an email to <a rel="nofollow">chibolts+u...@googlegroups.com</a>.
<br>> To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/237e8996-63ba-4476-859f-4b1e6841ab3an%40googlegroups.com" rel="nofollow" target="_blank">https://groups.google.com/d/msgid/chibolts/237e8996-63ba-4476-859f-4b1e6841ab3an%40googlegroups.com</a>.
<br>
<br></blockquote></div><div><br></div>
-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com" target="_blank">chibolts+unsubscribe@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/cb3c67ac-e21e-492a-8710-3f1ef74cda6dn%40googlegroups.com?utm_medium=email&utm_source=footer" target="_blank">https://groups.google.com/d/msgid/chibolts/cb3c67ac-e21e-492a-8710-3f1ef74cda6dn%40googlegroups.com</a>.<br>
</div></blockquote></div><br></div>
<p></p>
-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com" target="_blank">chibolts+unsubscribe@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/7256CB6D-33FE-461B-9A0E-F479DDCC69C7%40andrew.cmu.edu?utm_medium=email&utm_source=footer" target="_blank">https://groups.google.com/d/msgid/chibolts/7256CB6D-33FE-461B-9A0E-F479DDCC69C7%40andrew.cmu.edu</a>.<br>
</blockquote></div></div>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "chibolts" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/CANVosvX1Q%2BjGDL0WxZKTr2CjtAZeUAPn7%2Bz6gb6X061c%3Du_4-A%40mail.gmail.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/CANVosvX1Q%2BjGDL0WxZKTr2CjtAZeUAPn7%2Bz6gb6X061c%3Du_4-A%40mail.gmail.com</a>.<br />