within-word coding in Mandarin transcripts

Pei-Tzu Tsai peitzu.tsai at sjsu.edu
Tue Oct 26 16:50:31 UTC 2021


Expanding FLUCALC to other languages would certainly help speed up fluency 
research across languages. I'd happy to continue the discussion on the side 
about fluency data set in Mandarin to support the modification of 
FLUCALC. For now, we'll rely on FREQ to get part of the analysis done. 
Thanks for all the input. 
Peitzu

On Monday, October 25, 2021 at 12:05:09 PM UTC-7 macw wrote:

> Correct. Moreover we have no comparison fluency data set for other 
> languages, although there might be some eventually for Dutch. Supporting 
> fluency analysis for Chinese would be a great thing for fluency research, 
> but it would be a big project, hopefully supported by grants from China.
>
> — Brian 
>
> On Oct 25, 2021, at 2:54 PM, Leonid Spektor <spe... at andrew.cmu.edu> wrote:
>
> I have to add that FLUCALC will not work correctly no matter what option 
> you choose, because it was designed to work for English language only.
>
>
> Leonid. 
>
> On Oct 25, 2021, at 12:51, Brian Macwhinney <ma... at andrew.cmu.edu> wrote:
>
> Dear Peitzu,
>      I am not recommending replacing the main line with thhe %ort line, 
> but rather adding the %ort line with Pinyin to support disfluency coding. 
> There is software that can do automatic recoding of Hanzi to Pinyin.  I 
> could discuss that with you separately, if you decide to go that way.
>
> — Brian MacWhinney
> Teresa Heinz Professor of Cognitive Psychology, 
> Computational Linguistics, 
> and Modern Languages, CMU
>
> On Oct 25, 2021, at 12:39 PM, Pei-Tzu Tsai <peitz... at sjsu.edu> wrote:
>
> Thank you, Brian. I'm guessing the first option would then prevent flucalc 
> from identifying the disfluencies since the coding is replaced. Please 
> advise if there is a way around it. If we go with converting all samples to 
> pinyin entirely, using the segmenter/translater command that runs in the 
> terminal, is there any way we can at the same time convert the characters 
> from the main tier to the %ort tier? 
> Peitzu
>
>
> On Saturday, October 23, 2021 at 9:14:10 AM UTC-7 macw wrote:
>
>> Dear Peitzu, 
>> There are two ways to do this. If you just want to do this occasionally 
>> for one or two words, you can use the form 
>> ↫g↫gou3 [: 狗] 
>> then MOR will ignore the ↫g↫gou3 and only make use of 狗. 
>> Alternatively, if you want to study phonology systematically, you can 
>> create a complete %pho tier. 
>> Your example also makes the important point that the systematic coding of 
>> disfluencies faces some challenges in Chinese and other languages with 
>> whole-character coding. I see that you are using disfluency coding for the 
>> repetition of the initial consonant. If you wanted to study this 
>> extensively, it might almost be best to create a secondary %ort line that 
>> would be the basis of a complete Pinyin transcription. 
>>
>> — Brian MacWhinney 
>>
>> > On Oct 23, 2021, at 1:01 AM, Pei-Tzu Tsai <peitz... at sjsu.edu> wrote: 
>> > 
>> > Hi, 
>> > Is there any recommended way of coding Mandarin transcripts in Chinese 
>> characters at the sound level, while still running Mor successfully? For 
>> example, initial sound repetition in 狗 (↫g↫gou3) 看 到 了. We tried replacing 
>> the character with pinyin but Mor doesn't recognize it. 
>> > Thanks, 
>> > Peitzu 
>> > 
>> > -- 
>> > You received this message because you are subscribed to the Google 
>> Groups "chibolts" group. 
>> > To unsubscribe from this group and stop receiving emails from it, send 
>> an email to chibolts+u... at googlegroups.com. 
>> > To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/chibolts/836ba77e-f988-4a61-bf1c-27df26fdfc9en%40googlegroups.com. 
>>
>>
>>
> -- 
> You received this message because you are subscribed to the Google Groups 
> "chibolts" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to chibolts+u... at googlegroups.com.
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/chibolts/67cd16d0-9ef5-47b5-a369-8a276e5c1123n%40googlegroups.com 
> <https://groups.google.com/d/msgid/chibolts/67cd16d0-9ef5-47b5-a369-8a276e5c1123n%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>
>
>
> -- 
> You received this message because you are subscribed to the Google Groups 
> "chibolts" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to chibolts+u... at googlegroups.com.
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/chibolts/C26AF2ED-F1EB-4040-943C-3384089A56F5%40andrew.cmu.edu 
> <https://groups.google.com/d/msgid/chibolts/C26AF2ED-F1EB-4040-943C-3384089A56F5%40andrew.cmu.edu?utm_medium=email&utm_source=footer>
> .
>
>
>
> -- 
> You received this message because you are subscribed to the Google Groups 
> "chibolts" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to chibolts+u... at googlegroups.com.
>
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/chibolts/CFC86A33-7EA8-4D5C-A712-AC4322DDCC88%40andrew.cmu.edu 
> <https://groups.google.com/d/msgid/chibolts/CFC86A33-7EA8-4D5C-A712-AC4322DDCC88%40andrew.cmu.edu?utm_medium=email&utm_source=footer>
> .
>
>
>

-- 
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/d386d230-3c79-4922-85db-796e940e370cn%40googlegroups.com.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20211026/e74b3010/attachment.htm>


More information about the Chibolts mailing list