<div dir="ltr"><p><span style="font-family:'Calibri',sans-serif;color:black">Hi, I have encountered a problem with Chinese data. CLAN does not appear to segment Chinese sentences into word tokens correctly, and part-of-speech tagging is affected as well. Attached is the CLAN output from running the mlu and freq commands on a test file without a mor tier (TestFileOutput), and on the same test file with a mor tier added (TestFileMor). Does anyone have any ideas on how to resolve this? Thanks.</span></p></div>



