Frequency of verb forms by verb type
Leonid Spektor
spektor at andrew.cmu.edu
Wed Oct 9 02:18:59 UTC 2013
Kevin,
You are absolutely right that CHAT is not CLAN specific format. But, being a plain text format makes it prone to have a lot of extraneous data in between. For example if someone wants to look at a speaker tier or %mor tier only you would have to filter out the rest. If you are looking for %mor tier of just one particular speaker only, then it becomes even more complicated. Using CLAN to filter unneeded data is the easiest solution. After that CHAT is just a plain text.
For those who do not want to use CLAN at all and still have an easy way to parse the data we have XML-CHAT on our server. Just look for "XML" in "Database" section on our web server's home page.
Leonid.
On Oct 8, 2013, at 17:54 , Kevin Donnelly wrote:
> Hi Leonid
>
> ::::On Tuesday 08 October 2013 Leonid Spektor said::::
>> Your suggestion to use Unix commands or any other non-CLAN commands
>> on CHAT data is not a good idea, unless you take extra precautions,
>> because CHAT format allows long tiers to wrap-around. Non-CLAN commands
>> will work often, but they will fail on wrapped-around tiers and will leave
>> out the part of tier that is wrapped. If you really want to use non-CLAN
>> commands, then you should first run CLAN's LONGTIER command to remove all
>> the tier wrapping and to make sure that the whole tier is on one line.
>
> Sure, good point - you need to use LONGTIER or something like Sed to
> straighten the lines (and in fact that might usefully be added to CLAN as an
> option). But I think it's important to remember that one of the CHAT format's
> strengths is that it is a vanilla plain text file, and therefore does not
> actually require CLAN programs to analyse - using other programs is eminently
> possible. It's maybe worth highlighting this because there was a recent
> exchange on an R (stats language) list where the OP seemed to be under the
> impression that CHAT files could ONLY be handled by CLAN - in fact, R will
> consume them with no problem at all, just as it will consume any other text
> file.
>
> --
> Pob hwyl / Best wishes
>
> Kevin Donnelly
> kevindonnelly.org.uk
> bangortalk.org.uk
>
> --
> You received this message because you are subscribed to the Google Groups "chibolts" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
> To post to this group, send email to chibolts at googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/201310082254.25791.kevin%40dotmon.com.
> For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To post to this group, send email to chibolts at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/435D74B4-10CF-4C5D-9E4F-EE1F2A21078B%40andrew.cmu.edu.
For more options, visit https://groups.google.com/groups/opt_out.
More information about the Chibolts
mailing list