<div dir="ltr">Hi Leonid, <div>  Thank you - the commands in step 1 identified a set of corrupted files. I'm very grateful. It will take me some time to restore those files to the right directories, but I suspect that will solve the problem. </div><div><br></div><div>Amanda <br><br>On Tuesday, March 21, 2017 at 1:48:02 PM UTC-5, Spektor, Leonid: CMU wrote:<blockquote class="gmail_quote" style="margin: 0;margin-left: 0.8ex;border-left: 1px #ccc solid;padding-left: 1ex;">
  
    
  
  <div bgcolor="#FFFFFF" text="#000000">
    <p>Amanda,</p>
    <p>    I want you to try two things. <br>
    </p>
    <p>1. Please set the working directory to your 0-15 months folder
      (under CHILDES by Age Folder) and run the command "dir -r *.cha".
      At the end of the output in the "CLAN Output" window you will see
      how many files CLAN has found. If the number is 340, then for some
      reason (perhaps a bad file extension, a bad directory name, or
      file protection) CLAN can't see the other files as .cha files. In
      that case, run the command "dir -r -n *.cha" and you will see the
      files that CLAN doesn't recognize as .cha files.</p>
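    <p>As a cross-check outside CLAN, the same count can be approximated
      with a short script. This is only a sketch, not part of CLAN: the
      function name split_by_extension is hypothetical, and it assumes a
      "recognized" file is simply one whose extension is exactly ".cha"
      in lower case. It would not detect file protection problems or
      damaged file contents.</p>

```python
from pathlib import Path


def split_by_extension(root, ext=".cha"):
    """Walk `root` recursively and split files by extension.

    Returns (matching, others):
      matching - files whose final extension is exactly `ext`
      others   - every other file (e.g. 'x.CHA', 'x.cha.txt', '.DS_Store')
    Assumption: an exact, case-sensitive extension match is what counts.
    """
    matching, others = [], []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue  # skip directories
        if path.suffix == ext:
            matching.append(path)
        else:
            others.append(path)
    return matching, others
```

    <p>Calling split_by_extension on the 0-15 months folder and printing
      the length of the first list gives a number to compare against the
      count that "dir -r *.cha" reports; the second list shows candidates
      for misnamed files.</p>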
    <p>2. If the "dir -r *.cha" command finds 372 files, then the problem
      might be with the FLO command or the "+re" option. Please get data
      from our server at URL
      <a href="http://childes.talkbank.org/data/Eng-NA/Braunwald.zip" target="_blank" rel="nofollow" onmousedown="this.href='http://www.google.com/url?q\x3dhttp%3A%2F%2Fchildes.talkbank.org%2Fdata%2FEng-NA%2FBraunwald.zip\x26sa\x3dD\x26sntz\x3d1\x26usg\x3dAFQjCNHHgnIhDOY2Ye63b4LM73dFDRWcdw';return true;" onclick="this.href='http://www.google.com/url?q\x3dhttp%3A%2F%2Fchildes.talkbank.org%2Fdata%2FEng-NA%2FBraunwald.zip\x26sa\x3dD\x26sntz\x3d1\x26usg\x3dAFQjCNHHgnIhDOY2Ye63b4LM73dFDRWcdw';return true;">"http://childes.talkbank.org/<wbr>data/Eng-NA/Braunwald.zip"</a>. Unzip it
      and in CLAN set the working directory to the unzipped Braunwald
      directory. Set the output to an empty TEMP directory and run the
      command "FLO *.cha -t% +d +r1 +re +ffin". On my Mac and my Windows
      10 PC I get 900 .fin.cex files in the TEMP directory. If you get
      the same number, then something is wrong with the files in your
      0-15 months set. If you get a different number, make sure you have
      the latest CLAN. You might also reboot your computer and run the
      same command again. <br>
    </p>
    <p>If you still get fewer than 900 files in the TEMP directory, then
      please email me directly the full contents of the CLAN Output
      window after you run the "FLO *.cha -t% +d +r1 +re +ffin" command,
      and tell me whether you are using a Mac or a PC.</p>
    <p>If you get 900 files in TEMP, but you still can't figure out why
      in step 1 you get 340 files, then zip and email your 0-15 months
      directory to me and I will see if I can figure out what is wrong.<br>
    </p>
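    <p>When the counts disagree, it can help to list exactly which input
      files have no output file. The sketch below is one hedged way to do
      that outside CLAN. The function name find_missing_outputs is
      hypothetical, and the code assumes that an input file "name.cha"
      produces "name.fin.cex" in the output directory, as the ".fin.cex"
      naming in this thread suggests. Files with the same name in
      different subfolders are not distinguished.</p>

```python
from pathlib import Path


def find_missing_outputs(input_root, output_dir,
                         in_ext=".cha", out_ext=".fin.cex"):
    """List input files that have no matching output file.

    Assumption (not confirmed by CLAN documentation here): the output
    for 'name.cha' is named 'name.fin.cex' and sits directly in
    `output_dir`.
    """
    # Every input file found recursively under the working directory.
    inputs = sorted(
        p for p in Path(input_root).rglob("*" + in_ext) if p.is_file()
    )
    # Names of the files actually present in the output directory.
    produced = {p.name for p in Path(output_dir).iterdir() if p.is_file()}
    # Keep the inputs whose expected output name is absent.
    return [p for p in inputs if p.stem + out_ext not in produced]
```

    <p>Running find_missing_outputs on the 0-15 months folder and the
      TEMP folder would, under the naming assumption above, return the
      inputs that FLO skipped.</p>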
    <pre cols="72">Leonid.

</pre>
    <div>On 21-03-17 13:23, Brian MacWhinney
      wrote:<br>
    </div>
    <blockquote type="cite">
      
      
      
      
      
      <div>
        <p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri">Dear Amanda,</span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri"> </span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri">  From what you
            write, the problem occurs during your use of FLO.  For us
            (Leonid or me) to replicate the problem, we would need the
            complete collection of 340 files for this 0-15 months
            period.  It could be that some particular file is causing
            the problem, but it could also be the case that you are
            running up against a machine limitation or a CLAN
            limitation.  In any case, we would need to receive the
            collection that triggers the problem, along with the command
            you are using to replicate the problem.  You could send this
            to me or, better, Leonid (<a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">spe...@andrew.cmu.edu</a>) as a
            zipped email attachment, preserving the folder structure you
            are using.  Before sending to us,  please make sure that
            this problem is replicable on your side.  You might also
            want to test on a second computer.  Also please make sure
            you are using a current version of CLAN.</span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri"> </span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri">--Brian</span></p>
        <p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri"> </span></p>
        <div style="border:none;border-top:solid #b5c4df 1.0pt;padding:3.0pt 0in 0in 0in">
          <p class="MsoNormal"><b><span style="font-family:Calibri;color:black">From: </span>
            </b><span style="font-family:Calibri;color:black"><a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">&lt;chi...@googlegroups.com&gt;</a>
              on behalf of Amanda Owen Van Horne
              <a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">&lt;aj...@gmail.com&gt;</a><br>
              <b>Reply-To: </b><a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">"chi...@googlegroups.com"</a>
              <a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">&lt;chi...@googlegroups.com&gt;</a><br>
              <b>Date: </b>Wednesday, March 22, 2017 at 1:10 AM<br>
              <b>To: </b><a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">"chi...@googlegroups.com"</a>
              <a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">&lt;chi...@googlegroups.com&gt;</a><br>
              <b>Subject: </b>Large scale combining CHILDES files</span></p>
        </div>
        <div>
          <p class="MsoNormal"> </p>
        </div>
        <div>
          <p class="MsoNormal">Hi,  </p>
          <div>
            <p class="MsoNormal"> </p>
          </div>
          <div>
            <p class="MsoNormal">I'm trying to combine all available
              English/non clinical CHILDES files based on the target
              child's age.  I've organized my files (by hand) into
              folders binned by month based on the child's age reported
              in the header information and now I would like to strip
              CHILDES codes from the speaker tier and output all of
              those files into a TEMP folder, then I will use this TEMP
              folder to create a single file of only adult/only child
              speakers.  The trouble I am running into is that as the number
              of files I am working with gets larger, CLAN seems to skip
              files. When I run for 0-12 months I get the (expected) 192
              files following FLO.  When I run for 0-15 months I get 340
              files in the TEMP folder, when I should be getting 372. 
              This dropping of files continues and becomes more
              problematic as we move to broader and broader age ranges. 
              It's hard to track down individual files that might be
              contributing because so many files are involved.  Can
              anyone provide any guidance? </p>
          </div>
          <div>
            <p class="MsoNormal"> </p>
          </div>
          <div>
            <p class="MsoNormal">Amanda </p>
          </div>
          <div>
            <p class="MsoNormal"> </p>
          </div>
          <div>
            <p class="MsoNormal">working directory: CHILDES by Age
              Folder</p>
          </div>
          <div>
            <p class="MsoNormal">output directory: TEMP</p>
          </div>
          <div>
            <p class="MsoNormal"> </p>
          </div>
          <div>
            <p class="MsoNormal">FLO *.cha -t% +d +r1 +re +ffin</p>
          </div>
          <div>
            <ul type="disc">
              <li class="MsoNormal">
                FLO -- command to strip codes from main tier </li>
              <li class="MsoNormal">
                *.cha -- apply to all files in working directory </li>
              <li class="MsoNormal">
                -t% - get rid of non-speaker related tiers like mor and
                spa </li>
              <li class="MsoNormal">
                +d - output in chat format </li>
              <li class="MsoNormal">
                +r1 - if something is in () remove () and keep content
                (e.g., (be)cause = because)
                </li>
              <li class="MsoNormal">
                +re - works recursively through subfolders </li>
              <li class="MsoNormal">
                +ffin - output to a file with the code .fin before .cex
                </li>
            </ul>
          </div>
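          <p>To make the +r1 description above concrete, here is a small
            sketch of the transformation it describes. It is not CLAN's
            implementation; the function name expand_parentheses is
            hypothetical, and the pattern only mirrors the "(be)cause =
            because" example by removing parentheses around a run of
            letters. Other parenthesized material is left untouched.</p>

```python
import re

# A parenthesized run of letters or apostrophes, e.g. "(be)" in "(be)cause".
# Assumption: only such letter runs are affected; anything else in
# parentheses is left as it is.
_PAREN_LETTERS = re.compile(r"\(([A-Za-z']+)\)")


def expand_parentheses(utterance):
    """Drop the parentheses but keep the letters inside them."""
    return _PAREN_LETTERS.sub(r"\1", utterance)
```

          <p>For example, expand_parentheses("(be)cause") returns
            "because".</p>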
          <div>
            <p>output (TEMP) will fill with *.fin.cex
              files (one per original file) </p>
            <p>then change your working directory to
              the TEMP folder and reset your output directory to someplace
              memorable.</p>
            <p>KWAL *.cex -t*CHI +d +r1 +x>0w +u +f</p>
            <ul type="disc">
              <li class="MsoNormal">
                KWAL - keyword analysis; with no keyword specified, it
                outputs all content </li>
              <li class="MsoNormal">
                *.cex  - all files in working directory </li>
              <li class="MsoNormal">
                -t*CHI - exclude the child's tier, leaving only adult speakers </li>
              <li class="MsoNormal">
                +d  - in chat format </li>
              <li class="MsoNormal">
                +r1 - if something is in () remove () and keep content
                (e.g., (be)cause = because)
                </li>
              <li class="MsoNormal">
                +x>0w - only lines with 1 or more words; no empty
                utterances or utterances that only have info on other
                tiers
                </li>
              <li class="MsoNormal">
                +u - combine all output into one file </li>
              <li class="MsoNormal">
                +f - print to file (not to the screen)  </li>
            </ul>
            <p>Final output from these two processes
              will end with *.fin.kwal.cex (a single combined file) </p>
          </div>
          <div>
            <p class="MsoNormal"> </p>
          </div>
          <div>
            <p class="MsoNormal"><br clear="all">
              </p>
            <div>
              <div>
                <p class="MsoNormal">Amanda J. Owen Van Horne, PhD
                  CCC-SLP</p>
              </div>
              <div>
                <p class="MsoNormal">Associate Professor</p>
              </div>
              <div>
                <p class="MsoNormal">University of Iowa</p>
              </div>
              <div>
                <p class="MsoNormal" style="margin-bottom:12.0pt"><a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">amanda-owe...@uiowa.edu</a><wbr> <br>
                  <br>
                  <br>
                  </p>
              </div>
            </div>
          </div>
        </div>
        <p class="MsoNormal">-- <br>
          You received this message because you are subscribed to the
          Google Groups "chibolts" group.<br>
          To unsubscribe from this group and stop receiving emails from
          it, send an email to
          <a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">chibolts+u...@<wbr>googlegroups.com</a>.<br>
          To post to this group, send email to <a href="javascript:" target="_blank" gdf-obfuscated-mailto="tbq_HCxrBwAJ" rel="nofollow" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">chib...@googlegroups.com</a>.<br>
          To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/CA%2BUfwo47syFFvAc9T-F9m%3DxNhRt8FxmOPBEK9okjaP3iBG%2BTdQ%40mail.gmail.com?utm_medium=email&utm_source=footer" target="_blank" rel="nofollow" onmousedown="this.href='https://groups.google.com/d/msgid/chibolts/CA%2BUfwo47syFFvAc9T-F9m%3DxNhRt8FxmOPBEK9okjaP3iBG%2BTdQ%40mail.gmail.com?utm_medium\x3demail\x26utm_source\x3dfooter';return true;" onclick="this.href='https://groups.google.com/d/msgid/chibolts/CA%2BUfwo47syFFvAc9T-F9m%3DxNhRt8FxmOPBEK9okjaP3iBG%2BTdQ%40mail.gmail.com?utm_medium\x3demail\x26utm_source\x3dfooter';return true;">https://groups.google.com/d/<wbr>msgid/chibolts/CA%<wbr>2BUfwo47syFFvAc9T-F9m%<wbr>3DxNhRt8FxmOPBEK9okjaP3iBG%<wbr>2BTdQ%40mail.gmail.com</a>.<br>
          For more options, visit <a href="https://groups.google.com/d/optout" target="_blank" rel="nofollow" onmousedown="this.href='https://groups.google.com/d/optout';return true;" onclick="this.href='https://groups.google.com/d/optout';return true;">https://groups.google.com/d/<wbr>optout</a>.<br>
          <br>
          </p>
      </div>
    </blockquote>
    <br>
  </div>

</blockquote></div></div>

<p></p>
