<html><head><meta http-equiv="Content-Type" content="text/html charset=windows-1252"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;">I think I see how this might be happening.  There are several sentences in the training corpus that erroneously have this double subject marking.  Once those are fixed, this type of marking should decrease.  Also, we plan to do some more retraining later this month.<div><br></div><div>—Brian</div><div><br><div><div>On Jul 3, 2014, at 2:55 PM, Zhuo (Cindy) Chen <<a href="mailto:czcindy426@gmail.com">czcindy426@gmail.com</a>> wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div dir="ltr">Dear CHIBOLTS,<div><br></div><div>I found that you updated Manchester corpus. I believe it's because you finished working on the problem of double subject.</div><div><br></div><div>I had a random look and found in Anne05b, you have this utterance where everything before the verb "think" is tagged as subject.</div><div><br></div><div><pre style="font-size: 14px; line-height: 18px; font-family: CAFont, 'Arial Unicode MS', Courier, serif;">*MOT:    what's that „ do you think ?
%mor:   pro:wh|what~cop|be&3S pro:dem|that end|end v|do pro|you v|think ?
%gra:   1|2|SUBJ 2|7|SUBJ 3|7|SUBJ 4|7|SUBJ 5|7|SUBJ 6|7|SUBJ 7|0|ROOT 8|7|PUNCT</pre><div><br>On Tuesday, June 17, 2014 10:35:55 AM UTC-4, Spektor, Leonid: CMU wrote:<blockquote class="gmail_quote" style="margin: 0;margin-left: 0.8ex;border-left: 1px #ccc solid;padding-left: 1ex;"><div style="word-wrap:break-word">Zhou Chen,<div><br></div><div><span style="white-space:pre">   </span>The KWAL command that Brian mentions is a brand new feature and at this time it only works with Mac version of CLAN. I will update Windows PC and Unix versions later today and you will need to download latest version of CLAN in order to use this command.<br><div>
<div style="font-family: 'Lucida Grande'; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px;"><br>Leonid.</div><div style="font-family: 'Lucida Grande'; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px;"><br></div><br>

</div>
<br><div><div>On Jun 17, 2014, at 10:13, Brian MacWhinney <<a href="javascript:" target="_blank" gdf-obfuscated-mailto="1LbBq7YkNxYJ" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">ma...@cmu.edu</a>> wrote:</div><br><blockquote type="cite"><div style="word-wrap:break-word">Dear Zhou Chen,<div>    I did some more checking on the double subject problem.  It appears that this arises because our major training corpus includes a lot of adult speech and adults are not using these verb-verb constructions in the same way as the children.  I can resolve this by working on fixing the relevant sentences in the Eve-Train training corpus.  I can spot the problems using this command:</div><div><br></div><div><div style="margin:0px;font-family:'Arial Unicode MS'">kwal +t%mor +t%grt +d7 +sSUBJ +s@|v *.cha</div></div><div style="margin:0px;font-family:'Arial Unicode MS'"><br></div><div style="margin:0px;font-family:'Arial Unicode MS'">However, there are about 80 sentences to fix and this will take some time.  However, eventually I should be able to fix this problem.  You can also use that command to spot the problems yourself.  However, you would need to change +t%grt to +t%gra</div><div style="margin:0px;font-family:'Arial Unicode MS'"><br></div><div style="margin:0px;font-family:'Arial Unicode MS'">— Brian MacWhinney</div></div><div><br></div>

-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="javascript:" target="_blank" gdf-obfuscated-mailto="1LbBq7YkNxYJ" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">chibolts+u...@<wbr>googlegroups.com</a>.<br>
To post to this group, send email to <a href="javascript:" target="_blank" gdf-obfuscated-mailto="1LbBq7YkNxYJ" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">chib...@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/38FB1649-202A-4135-8773-50571745D1DD%40cmu.edu?utm_medium=email&utm_source=footer" target="_blank" onmousedown="this.href='https://groups.google.com/d/msgid/chibolts/38FB1649-202A-4135-8773-50571745D1DD%40cmu.edu?utm_medium\75email\46utm_source\75footer';return true;" onclick="this.href='https://groups.google.com/d/msgid/chibolts/38FB1649-202A-4135-8773-50571745D1DD%40cmu.edu?utm_medium\75email\46utm_source\75footer';return true;">https://groups.google.com/d/<wbr>msgid/chibolts/38FB1649-202A-<wbr>4135-8773-50571745D1DD%40cmu.<wbr>edu</a>.<br>
For more options, visit <a href="https://groups.google.com/d/optout" target="_blank" onmousedown="this.href='https://groups.google.com/d/optout';return true;" onclick="this.href='https://groups.google.com/d/optout';return true;">https://groups.google.com/d/<wbr>optout</a>.<br>
</blockquote></div><br></div></div></blockquote></div></div></div><div><br class="webkit-block-placeholder"></div>

-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br>
To post to this group, send email to <a href="mailto:chibolts@googlegroups.com">chibolts@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/dfae7d73-b195-4cd2-806c-54320f248e92%40googlegroups.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/dfae7d73-b195-4cd2-806c-54320f248e92%40googlegroups.com</a>.<br>
For more options, visit <a href="https://groups.google.com/d/optout">https://groups.google.com/d/optout</a>.<br>
</blockquote></div><br></div></body></html>

<p></p>

-- <br />
You received this message because you are subscribed to the Google Groups "chibolts" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href="mailto:chibolts@googlegroups.com">chibolts@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/F24B1260-9CFB-4902-922B-9AF724225A71%40cmu.edu?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/F24B1260-9CFB-4902-922B-9AF724225A71%40cmu.edu</a>.<br />
For more options, visit <a href="https://groups.google.com/d/optout">https://groups.google.com/d/optout</a>.<br />