extract plain text from CHAT ?

Leonid Spektor spektor at andrew.cmu.edu
Thu Jul 31 15:32:53 UTC 2008


Frank,

    You can use kwal program to extract any tiers using +/-t option and/or
+/-s to extract only certain codes. I am using sample.cha file that comes
with every CLAN and is located in clan/lib/samples folder. Here is the
command to extract only *CHI tiers:

kwal +t*chi +t%spa sample.cha +d +d3

>From file <sample.cha>
*CHI:    yeah . [+ Q]
%spa:    $RES:sel:ve $DES:tes:ve
*CHI:    yeah . [+ Q]
%spa:    $RES:sel:in $DES:tes:non
*CHI:    Mommy .
%spa:    $RFA:sel:non $DES:sel:non $INI:sel:non
*CHI:    what's that ? [+ I]
%spa:    $IMI:sel:ve $CON:sel:in
*CHI:    neat chalk chalk .
%spa:    $CON:sel:in $RES:tes:in
*CHI:    xxx .  [+ V]

Here is the command to extract only *CHI tiers that and to exclude any $CON
codes from data:

kwal +t*chi +t%spa -s$con* sample.cha +d +d3

>From file <sample.cha>
*CHI:    yeah . 
%spa:    $res:sel:ve $des:tes:ve
*CHI:    yeah . 
%spa:    $res:sel:in $des:tes:non
*CHI:    mommy .
%spa:    $rfa:sel:non $des:sel:non $ini:sel:non
*CHI:    what's that ?
%spa:    $imi:sel:ve
*CHI:    neat chalk chalk .
%spa:    $res:tes:in
*CHI:    . 


This output is a legal CHAT format and can be used as an input to any of
CLAN programs. Also, the output is in a UTF8 Unicode plain text encoding and
can be opened by any text editor that can decode UTF8 encoded text files.

Leonid.



On 31-07-08 08:17, "Frank Binder" <fbinder at eva.mpg.de> wrote:

> 
> Dear chibolts,
> 
> there is this simple question that for some reason nobody asks, but I
> am feeling lucky today ...
> 
> 
> Do you know of any (CLAN?) tool that allows to extract data from CHAT
> files, such as the main tier contents, and export it to "plain"
> (unicode) text?
> 
> 
> That is, I am looking for a tool that removes the CHAT from the CHAT.
> Ideally this would support CLAN's shared options - such as +R +S +T etc.
> -  to select speakers and include or exclude certain annotations and
> symbols/punctuation. Although this would probably be a one-way ticket,
> it seems needed sometimes.
> 
> Also, if there's no such tool, any suggestions or experience on how to
> do it?
> 
> Thanks in advance and best regards,
> 
> Frank
> 
> 
> 
> > 
> 



--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups "chibolts" group.
To post to this group, send email to chibolts at googlegroups.com
To unsubscribe from this group, send email to chibolts+unsubscribe at googlegroups.com
For more options, visit this group at http://groups.google.com/group/chibolts?hl=en
-~----------~----~----~----~------~----~------~--~---



More information about the Chibolts mailing list