<div dir="ltr">Thank you Leonid,<br><br>Is there anyway to create an "in-house" depfile.cut that will allow uses of # for non-separable prefixes like:<br>*CHI: re#bueno.<br>I tried tweaking it adding *_#_* to the *: line but I can't get it to work. (This requires at least one character before and after the #, right?)<br>(I need to do a first pass morphemicizing on the main line for reasons internal to the project I am working on...)<br>Thanks<br>Bruno<br><br>On Thursday, April 3, 2014 3:09:56 PM UTC-4, Spektor, Leonid: CMU wrote:<blockquote class="gmail_quote" style="margin: 0;margin-left: 0.8ex;border-left: 1px #ccc solid;padding-left: 1ex;"><div style="word-wrap:break-word">Bruno,<div><br></div><div><span style="white-space:pre"> </span>The *_*# in depfile.cut is for words that end with ‘#’ character only. In languages like Hebrew prefixes can be separate from the stem word and they are marked with ‘#’ sign at the end. For example prefixes like “ha#” and “ba#”. If you have ‘#’ character in the middle or the beginning of the the word, then CHECK will complain.<br><div>
<span style="border-collapse:separate;border-spacing:0px"><span style="border-collapse:separate;color:rgb(0,0,0);font-family:'Lucida Grande';font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:normal;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px"><div style="word-wrap:break-word"><span style="border-collapse:separate;color:rgb(0,0,0);font-family:'Lucida Grande';font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:normal;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px"><div style="word-wrap:break-word"><span style="border-collapse:separate;color:rgb(0,0,0);font-family:Helvetica;font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:normal;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px"><span style="border-collapse:separate;color:rgb(0,0,0);font-family:Helvetica;font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:normal;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px"><div style="word-wrap:break-word"><span style="border-collapse:separate;color:rgb(0,0,0);font-family:Helvetica;font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:normal;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px"><div style="word-wrap:break-word"><div><br>Leonid.</div><div><br></div></div></span></div></span></span></div></span></div></span></span><br>
</div>
<br><div><div>On Apr 3, 2014, at 14:25, Bruno Estigarribia <<a href="javascript:" target="_blank" gdf-obfuscated-mailto="5HX2uOZmG_QJ" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">brun...@gmail.com</a>> wrote:</div><br><blockquote type="cite"><div dir="ltr">Hello,<br><br>I have a .cha file where I have marked prefixes using #. CHECK doesn't like this (error message: "Illegal character(s) '#' found.(48)").<br> know, because Brian has said this to me before, that I should be morphologizing directly on the %mor tier. I understand this recommendation (it is repeated several times in section 6 of the CHAT manual). However, when I look at the depfile, the option for using # is still there:<br>*: * , ,, [x _*] [- _*] [+ _*] [^ *] *~_* *_*# *-_* <br>[and it goes on...]<br>So why is CHECK choking on it? <br>Thanks<br>Bruno<br></div><div><br></div>
-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="javascript:" target="_blank" gdf-obfuscated-mailto="5HX2uOZmG_QJ" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">chibolts+u...@<wbr>googlegroups.com</a>.<br>
To post to this group, send email to <a href="javascript:" target="_blank" gdf-obfuscated-mailto="5HX2uOZmG_QJ" onmousedown="this.href='javascript:';return true;" onclick="this.href='javascript:';return true;">chib...@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/eb1a1fb4-d31f-41ac-a9c9-a4174a992f97%40googlegroups.com?utm_medium=email&utm_source=footer" target="_blank" onmousedown="this.href='https://groups.google.com/d/msgid/chibolts/eb1a1fb4-d31f-41ac-a9c9-a4174a992f97%40googlegroups.com?utm_medium\75email\46utm_source\75footer';return true;" onclick="this.href='https://groups.google.com/d/msgid/chibolts/eb1a1fb4-d31f-41ac-a9c9-a4174a992f97%40googlegroups.com?utm_medium\75email\46utm_source\75footer';return true;">https://groups.google.com/d/<wbr>msgid/chibolts/eb1a1fb4-d31f-<wbr>41ac-a9c9-a4174a992f97%<wbr>40googlegroups.com</a>.<br>
For more options, visit <a href="https://groups.google.com/d/optout" target="_blank" onmousedown="this.href='https://groups.google.com/d/optout';return true;" onclick="this.href='https://groups.google.com/d/optout';return true;">https://groups.google.com/d/<wbr>optout</a>.<br>
</blockquote></div><br></div></div></blockquote></div>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "chibolts" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href="mailto:chibolts@googlegroups.com">chibolts@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/1cb9fa42-9f92-426c-ae3e-956c27325aa6%40googlegroups.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/1cb9fa42-9f92-426c-ae3e-956c27325aa6%40googlegroups.com</a>.<br />
For more options, visit <a href="https://groups.google.com/d/optout">https://groups.google.com/d/optout</a>.<br />