within-word coding in Mandarin transcripts

Brian Macwhinney macw at andrew.cmu.edu
Mon Oct 25 16:51:00 UTC 2021


Dear Peitzu,
     I am not recommending replacing the main line with thhe %ort line, but rather adding the %ort line with Pinyin to support disfluency coding. There is software that can do automatic recoding of Hanzi to Pinyin.  I could discuss that with you separately, if you decide to go that way.

— Brian MacWhinney
Teresa Heinz Professor of Cognitive Psychology, 
Computational Linguistics, 
and Modern Languages, CMU

> On Oct 25, 2021, at 12:39 PM, Pei-Tzu Tsai <peitzu.tsai at sjsu.edu> wrote:
> 
> Thank you, Brian. I'm guessing the first option would then prevent flucalc from identifying the disfluencies since the coding is replaced. Please advise if there is a way around it. If we go with converting all samples to pinyin entirely, using the segmenter/translater command that runs in the terminal, is there any way we can at the same time convert the characters from the main tier to the %ort tier? 
> Peitzu
> 
> 
> On Saturday, October 23, 2021 at 9:14:10 AM UTC-7 macw wrote:
> Dear Peitzu, 
> There are two ways to do this. If you just want to do this occasionally for one or two words, you can use the form 
> ↫g↫gou3 [: 狗] 
> then MOR will ignore the ↫g↫gou3 and only make use of 狗. 
> Alternatively, if you want to study phonology systematically, you can create a complete %pho tier. 
> Your example also makes the important point that the systematic coding of disfluencies faces some challenges in Chinese and other languages with whole-character coding. I see that you are using disfluency coding for the repetition of the initial consonant. If you wanted to study this extensively, it might almost be best to create a secondary %ort line that would be the basis of a complete Pinyin transcription. 
> 
> — Brian MacWhinney 
> 
> > On Oct 23, 2021, at 1:01 AM, Pei-Tzu Tsai <peitz... at sjsu.edu <applewebdata://3730A1B0-67BE-4E33-9EC4-F7A89172854A>> wrote: 
> > 
> > Hi, 
> > Is there any recommended way of coding Mandarin transcripts in Chinese characters at the sound level, while still running Mor successfully? For example, initial sound repetition in 狗 (↫g↫gou3) 看 到 了. We tried replacing the character with pinyin but Mor doesn't recognize it. 
> > Thanks, 
> > Peitzu 
> > 
> > -- 
> > You received this message because you are subscribed to the Google Groups "chibolts" group. 
> > To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+u... at googlegroups.com <applewebdata://3730A1B0-67BE-4E33-9EC4-F7A89172854A>. 
> > To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/836ba77e-f988-4a61-bf1c-27df26fdfc9en%40googlegroups.com <https://groups.google.com/d/msgid/chibolts/836ba77e-f988-4a61-bf1c-27df26fdfc9en%40googlegroups.com>. 
> 
> 
> -- 
> You received this message because you are subscribed to the Google Groups "chibolts" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com <mailto:chibolts+unsubscribe at googlegroups.com>.
> To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/67cd16d0-9ef5-47b5-a369-8a276e5c1123n%40googlegroups.com <https://groups.google.com/d/msgid/chibolts/67cd16d0-9ef5-47b5-a369-8a276e5c1123n%40googlegroups.com?utm_medium=email&utm_source=footer>.

-- 
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/C26AF2ED-F1EB-4040-943C-3384089A56F5%40andrew.cmu.edu.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20211025/08508d27/attachment.htm>


More information about the Chibolts mailing list