Question about converting SALT files to CHAT format

Ying Lu ying.lu at utexas.edu
Mon Nov 5 14:48:14 UTC 2012


Dear Leonid,


Thank you very much for your reply!

For problem (a), I understand the errors are due to the unrecognized
initial characters of the tiers (e.g., $, C). But I don't know why that
happened. I suspect there is a compatibility issue here, i.e., SALTIN can't
recognize some symbols generated by the SALT 2012 student version. I will
try other SALT versions later and see if that works.
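
A quick way to check for such a hidden character might be a small script
along the following lines. This is only a sketch: the function name
describe_leading_bytes is made up for illustration, and it assumes, as
Leonid's reply suggests, that a valid SALT file should begin with '$'.

```python
import codecs


def describe_leading_bytes(data: bytes) -> str:
    """Describe what sits at the very start of a SALT transcript.

    Assumption: line 1 of a valid SALT file starts with '$'.
    """
    if not data:
        return "empty file"
    if data.startswith(codecs.BOM_UTF8):
        return "UTF-8 byte order mark"
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "UTF-16 byte order mark"
    first = data[0]
    if first == ord("$"):
        return "ok"
    if first < 0x20 or first == 0x7F:
        return f"control character 0x{first:02x}"
    return f"unexpected byte 0x{first:02x}"


# Example use on the real file (read in binary mode so nothing is decoded):
#     with open("Tran1.slt", "rb") as f:
#         print(describe_leading_bytes(f.read(4)))
```

If this reports anything other than "ok", re-saving the file as plain text
without a byte order mark would be the first thing I would try.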

As for problem (b), I got a reply from Dr. MacWhinney last night and was
able to get the comparable MLU. Thanks a lot again for directing me to use
POST!!!
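
For reference, the count of 11 can be reproduced by hand from the
disambiguated %mor tier. The sketch below is not CLAN's actual MLU rule. It
rests on assumptions chosen only because they match this one utterance:
items without "|" (punctuation) and items starting with "0" (omitted words)
are skipped, each remaining item counts as one morpheme, each "-" suffix
adds one more, and "&" fusional markers add nothing. The function name
count_morphemes is made up for illustration.

```python
def count_morphemes(mor_tier: str) -> int:
    """Count morphemes on one disambiguated %mor tier (assumed rule, not CLAN's)."""
    total = 0
    for item in mor_tier.split():
        if "|" not in item:
            continue  # punctuation such as "."
        if item.startswith("0"):
            continue  # omitted word such as 0prep|at
        stem_and_affixes = item.split("|", 1)[1]
        # One morpheme for the stem, plus one per "-" suffix (e.g. look-PROG).
        total += 1 + stem_and_affixes.count("-")
    return total


mor = ("det|the n|dog aux|be&PAST&13S part|look-PROG prep|in det|the "
       "n|beehive coord|and 0prep|at det|the n|mole .")
utterances = [mor]
morphemes = sum(count_morphemes(u) for u in utterances)
mlu = morphemes / len(utterances)
print(f"utterances = {len(utterances)}, morphemes = {morphemes}, MLU = {mlu:.3f}")
# prints: utterances = 1, morphemes = 11, MLU = 11.000
```

This only works on a tier that POST has already disambiguated; it does not
handle the "^" alternatives in the raw MOR output.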

Sincerely,
Ying Lu


On Mon, Nov 5, 2012 at 12:05 AM, Leonid Spektor <spektor at andrew.cmu.edu> wrote:

> Ying Lu,
>
> (a)
> The first error "** ERROR: unknown symbol <> found on 1 line." is probably
> because the first character on line 1 is not the '$' character. It might be
> some special control character that SALT or your text editor has inserted
> and that CLAN doesn't recognize. The remaining errors occur because SALTIN
> does not recognize the first line and, as a result, can't identify the
> speaker name code.
>
> I have created a test file from your example below and here is the result
> from SALTIN that I get:
>
> @Begin
> @Languages: eng
> @Participants: CHI Target_Child, INV Investigator
> @ID: eng|change_me_later|CHI|||||Target_Child|||
> @ID: eng|change_me_later|INV|||||Investigator|||
> @Time Start: 00:00
> @Comment: Language: English
> *CHI: there is a boy whose name is tom .
> *CHI: he likes swimming .
> %tim: 00:05
> @End
>
> It would be very helpful if you could send the "Tran1.slt" file that you
> used directly to me.
>
>
> (b)
> The difference in MLU results is due to the %mor tier not being
> disambiguated by the POST command. If you run the POST command, you will
> get this CHAT file:
>
> *CHI:    the dog was [% ip] looking [% mv] in the beehive and 0at [% po]
> the mole [% u] .
> %mor: det|the n|dog aux|be&PAST&13S part|look-PROG prep|in det|the
> n|beehive coord|and 0prep|at det|the n|mole .
>
> and this MLU result:
>
> MLU for Speaker: *CHI:
>   MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme
> counts):
> Number of: utterances = 1, morphemes = 11
> Ratio of morphemes over utterances = 11.000
> Standard deviation = 0.000
>
>
> Leonid.
>
>
>
>  On Nov 4, 2012, at 20:33 , Ying Lu wrote:
>
> Dear Chibolts group,
>
> I am trying to convert some SALT files and compare the MLU calculated by
> SALT with that calculated by CLAN. I have encountered some problems:
>
> (a) I am using SALT 2012 Student Version.
>
> The original SALT file is (I am using a short sample here):
>
> $ Child, Examiner
> + Language: English
> - 0:00
>
> C There is a boy whose name is Tom.
> C He likes swimming.
>
> - 0:05
>
> But I got the following message when converting a SALT file into CHAT
> format using the command line "SALTIN Tran1.slt":
> > SALTIN Tran1.slt
> saltin Tran1.slt
> Sun Nov 04 18:39:34 2012
> saltin (07-Sep-2012)
> ****************************************
> From file <Tran1.slt>
> ** ERROR: unknown symbol <> found on 1 line.
> ** ERROR: unknown symbol <c> found on 5 line.
> ** ERROR: unknown symbol <c> found on 7 line.
> Output file <Tran1.sltin.cha>
>
> The output file looks like this:
>
> @Begin
> @Languages:    eng
> @Time Start:    00:00
> @Time Start:    there is a boy whose name is tom .he likes swimming .00:05
> @End
>
> I don't understand why the errors would occur.
>
> (b) Also, the MLU calculated by SALT is often different from that
> calculated by CLAN. For example,
> C The dog was look/ing in the beehive and *at the mole. (in SALT)
>
> When I converted this to CHAT format and ran the MOR and MLU commands,
> the result was:
> *CHI:    The dog was [% ip] looking [% mv]
>     in the beehive and 0at [% po] the mole [% u] .
> %mor:    det|the v|dog^n|dog v:cop|be&PAST&13S^aux|be&PAST&13S
>     adj:part|look-PROG^n:gerund|look-PROG^part|look-PROG
>     prep|in^adv:loc|in^pfx|in det|the n|beehive coord|and 0prep|at
>     det|the n|mole .
>
> The MLU from SALT is 11, but CLAN reports a ratio of morphemes over
> utterances of 13 (Number of: utterances = 1, morphemes = 13).
>
> Apparently, the MLU result by SALT is what I want. But how could I get the
> same result in CLAN?
>
> Thank you very much for your attention! I look forward to your help!
> Sincerely,
> Ying Lu
>
> --
> You received this message because you are subscribed to the Google Groups
> "chibolts" group.
> To post to this group, send email to chibolts at googlegroups.com.
> To unsubscribe from this group, send email to
> chibolts+unsubscribe at googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msg/chibolts/-/z7qY4rboSlAJ.
> For more options, visit https://groups.google.com/groups/opt_out.
>


