bilingual
Ying
yl5834 at gmail.com
Mon Dec 30 23:46:16 UTC 2013
Dear Leonid,
I want to get MLU, the number of different words, and also TTR from some
Mandarin(Putonghua)-English bilingual narrative data. Also I added some
word and utterance level codes and want to summarize the codes. For
example, for the following sample (I am attaching the transcript after
running the commands),
*CHI: [- zho] 我 去 了 一 个 <一个> [/] park at s yesterday at s. [+ CS]
*CHI: It is a very big one.
*EXA: Nice.
*CHI: My mom say [* tense] “We will come from time to time”. [+ GE]
Note:
The precode [- zho] is for Mandarin/Putonghua, as [- yue] is for Cantonese
[+ CS] is an utterance level code, indicating code-switched sentences
[+ GE] is an utterance level code, indicating sentences with grammatical
errors
[* tense] is a word level code, indicating a tense error
Here are the commands I used:
mor +s"[- zho]" sample_English.cha +1
post sample_English.cha +1
mor -s"[- zho]" sample_English.cha +1
post sample_English.cha +1
Esc_L
freq +s"[% *]" *.cha
Questions I have:
(1) May I ignore an error and move to the next one when I run CHECK?
(2) for code-switched words within an utterance, I don't care for mor info
such as noun or verb. But I do want to calculate MLU and TTR. If I go with
park at s but don't bother to make park at s$n, will CLAN give me the correct
results?
(3) I can get codes [zho], [CS], and [GE] calculated using FREQ, but not [*
tense]. How may I count the occurance of [* tense]. Moreover, can I know
whether it is the same verb (e.g., say) coded [* tense]?
Thank you very much!
Happy New Year!
Sincerely,
Ying
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To post to this group, send email to chibolts at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/6de36ec6-5c23-41d7-a0b5-06f5b35a5ce2%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20131230/2c99e832/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: sample_English.cha
Type: application/octet-stream
Size: 762 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20131230/2c99e832/attachment-0001.obj>
More information about the Chibolts
mailing list