Some problems with extracting error-free utterances and verbs from CHAT files
zlmailhouse at 163.com
Thu Jun 28 12:48:18 EDT 2018
I have run into some problems extracting utterances and verbs from CHAT files.
First, I have tagged the ungrammatical utterances of *CHI with [*], [* aux],
or [* wh]. I now want to count the utterances that contain none of these
tags ([*], [* aux], [* wh]) and that also do not contain www or yyy. I
tried the following command, but it did not work:
trim -s"[*_ ]" +1
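
To make the first goal concrete, here is a minimal Python sketch of the count I am after. It is not a CLAN command, only an illustration of the filtering logic. It assumes that the aim is to exclude utterances containing any [* ...] code, www, or yyy, that main tiers start with "*CHI:", and that continuation lines start with a tab. The function names are hypothetical.

```python
import re

# Matches CHAT error codes such as [*], [* aux], [* wh]
ERROR_CODE = re.compile(r"\[\*[^\]]*\]")
# Matches the special forms www and yyy as whole words
SPECIAL_FORM = re.compile(r"\b(?:www|yyy)\b")


def is_clean(utterance):
    """True if the utterance has no error code and no www/yyy."""
    return not ERROR_CODE.search(utterance) and not SPECIAL_FORM.search(utterance)


def speaker_utterances(lines, speaker="*CHI:"):
    """Yield each main-tier utterance of `speaker`.

    Assumption: continuation lines of a tier start with a tab.
    """
    current = None  # utterance being accumulated, or None
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("\t") and current is not None:
            current += " " + line.strip()  # join continuation line
            continue
        if current is not None:
            yield current
        current = line[len(speaker):].strip() if line.startswith(speaker) else None
    if current is not None:
        yield current


def count_clean_utterances(lines, speaker="*CHI:"):
    """Count the speaker's utterances that pass is_clean()."""
    return sum(1 for u in speaker_utterances(lines, speaker) if is_clean(u))
```

With a file opened as `open(path, encoding="utf-8")`, `count_clean_utterances(f)` would give the number of *CHI utterances free of those tags, under the assumptions stated above.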
Second, I would like to extract all the verbs of *CHI (copulas, modals,
and auxiliaries as well as regular verbs) in the file. I found that on
the %mor tier, "walking" is coded not as a verb but as "PART|". In
that case, I guess I also need to include "PART|", right? What would be
a comprehensive command for extracting all the verbs mentioned above?
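
To illustrate the second goal, here is a minimal Python sketch (again, not a CLAN command) that collects verb-like items from the %mor tiers that follow *CHI utterances. The tag names in VERB_TAGS (v, cop, aux, mod, part) are an assumption about how the %mor tier is coded and may need adjusting for a given corpus. The function name is hypothetical, and tags are compared case-insensitively so that "PART|" and "part|" are treated alike.

```python
# Assumed part-of-speech tags that count as "verbs" for this purpose
VERB_TAGS = {"v", "cop", "aux", "mod", "part"}


def extract_verbs(lines, speaker="*CHI:", verb_tags=VERB_TAGS):
    """Return %mor items of `speaker` whose main POS tag is in `verb_tags`."""
    verbs = []
    in_speaker = False  # inside an utterance block of `speaker`
    on_mor = False      # currently reading that block's %mor tier
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("*"):
            in_speaker = line.startswith(speaker)
            on_mor = False
            continue
        if line.startswith("%"):
            on_mor = in_speaker and line.startswith("%mor:")
            content = line[len("%mor:"):] if on_mor else ""
        elif line.startswith("\t") and on_mor:
            content = line  # continuation of the %mor tier
        else:
            on_mor = False  # header line or main-tier continuation
            continue
        for token in content.split():
            # Clitic groups such as pro|it~cop|be&3S are joined with "~"
            for item in token.split("~"):
                pos, sep, _ = item.partition("|")
                # Compare only the main category, e.g. "v" in "v:aux"
                if sep and pos.split(":")[0].lower() in verb_tags:
                    verbs.append(item)
    return verbs
```

Dropping "part" from the tag set would exclude participles such as "walking", so whether to keep it depends on what should count as a verb in the analysis.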