KWAL not finding words that FREQ has found...

Alcock, Katie k.j.alcock at lancaster.ac.uk
Mon Dec 5 09:31:56 UTC 2016


Thanks, I'll try that. I have a new version and since my FREQ output is of lemmas I don’t know what the forms were on the speaker tier but I can fix them so they are the right format for the %mor tier.

Katie

From: <chibolts at googlegroups.com> on behalf of Leonid Spektor <spektor at andrew.cmu.edu>
Reply-To: "chibolts at googlegroups.com" <chibolts at googlegroups.com>
Date: Thursday, 1 December 2016 at 18:03
To: "chibolts at googlegroups.com" <chibolts at googlegroups.com>
Subject: Re: KWAL not finding words that FREQ has found...


Katie,

    The include files like the "Kwal words to find.txt" that you are using have to have the same format as the +s option does. So, if you are looking on %mor tier, then new format is:

m;mboat
m;givin

If you have an older version of CLAN, then format is:

@r-mboat
@r-givin

If you are looking for those words on speaker tier, then you need to add @u to the end of each word in the list:

mboat at u
givin at u

or for more general search do:

*mboat*
*givin*

Leonid.


On 01-12-16 12:14, Alcock, Katie wrote:
The issue is not so much how they are marked but that KWAL isn’t finding them.
Is there a way to make KWAL include these forms that I haven’t found? It is doing this on all the other UK corpora that I’ve tried so far.
I’m going in alphabetical order and it’s done it with Belfast, Cruttenden, Fletcher and Howe!

Thanks

katie

From: Brian MacWhinney <macw at cmu.edu><mailto:macw at cmu.edu>
Date: Thursday, 1 December 2016 at 16:38
To: "chibolts at googlegroups.com"<mailto:chibolts at googlegroups.com> <chibolts at googlegroups.com><mailto:chibolts at googlegroups.com>, Katie Alcock <k.j.alcock at lancaster.ac.uk><mailto:k.j.alcock at lancaster.ac.uk>
Subject: Re: KWAL not finding words that FREQ has found...

Dear Katie,
The Gathburn corpus marks various strange forms with the @u special form marker.  You are looking only at the %mor tier.  If you look at the main tier for forms like “mboat” you will see that they are coded as “mboat at u”.   So, these are not spelling errors, but the actual forms that were produced.  Without having the audio for this corpus, it is difficult to go much further.

-- Brian MacWhinney

From: ChiBolts <chibolts at googlegroups.com><mailto:chibolts at googlegroups.com> on behalf of "Alcock, Katie" <k.j.alcock at lancaster.ac.uk><mailto:k.j.alcock at lancaster.ac.uk>
Reply-To: ChiBolts <chibolts at googlegroups.com><mailto:chibolts at googlegroups.com>
Date: Thursday, December 1, 2016 at 7:51 AM
To: ChiBolts <chibolts at googlegroups.com><mailto:chibolts at googlegroups.com>
Subject: KWAL not finding words that FREQ has found...

I’ve just been running this command on multiple files (I’m going to give the examples from the Gathburn corpus where I’m working only on the three year olds)

freq +o +d2 +f   @  +u +k  +t%mor -t* +s@"r*,o%" +t*GER: +t*KAT: +t*VID: +t*JAS: +t*ALL: +t*UNK:

It gives me a  nice output that appears to be all the lemmas from all the individual child tiers plus the child tier ALL and the one UNK.

It includes obviously some idiosyncratic words such as bektim which GER says and I can find this in one of the files by searching.

However if I run KWAL with this list of idiosyncratic and/or words I want to check:

kwal  @ +t*GER: +t*KAT: +t*VID: +t*JAS: +t*ALL: +t*UNK:  +t%mor +f +s@"Kwal words to find.txt"


cazbare

cenj

dida

givin

hawi

kit

ma

mboat

sith

tikdik


(I suspect givin and mboat are spelling errors, “dida” may also be, and I’m trying to find out if “ma” here is being used for “mother” and if “kit” is being used for “kat”)

then ONLY finds “kit”. This is the same whether I run it on the mor tier or the main speaker tier.

Any thoughts? If I run KWAL on each word individually it does the same – only the time I run it for Kit does it come up with something .

Incidentally the FREQ command also tells me that one of the lemmas is “genmod”.

Thanks

Katie

--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com<mailto:chibolts+unsubscribe at googlegroups.com>.
To post to this group, send email to chibolts at googlegroups.com<mailto:chibolts at googlegroups.com>.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/1B6302315CC46C48962C4C1856B583D28E6916E5%40EX-0-MB2.lancs.local<https://groups.google.com/d/msgid/chibolts/1B6302315CC46C48962C4C1856B583D28E6916E5%40EX-0-MB2.lancs.local?utm_medium=email&utm_source=footer>.
For more options, visit https://groups.google.com/d/optout.



--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com<mailto:chibolts+unsubscribe at googlegroups.com>.
To post to this group, send email to chibolts at googlegroups.com<mailto:chibolts at googlegroups.com>.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/32B95BFF-E8D0-4757-93D0-EB6131495ED7%40lancaster.ac.uk<https://groups.google.com/d/msgid/chibolts/32B95BFF-E8D0-4757-93D0-EB6131495ED7%40lancaster.ac.uk?utm_medium=email&utm_source=footer>.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com<mailto:chibolts+unsubscribe at googlegroups.com>.
To post to this group, send email to chibolts at googlegroups.com<mailto:chibolts at googlegroups.com>.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/e6c93196-d60b-2653-3ff5-5c3fff864418%40andrew.cmu.edu<https://groups.google.com/d/msgid/chibolts/e6c93196-d60b-2653-3ff5-5c3fff864418%40andrew.cmu.edu?utm_medium=email&utm_source=footer>.
For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To post to this group, send email to chibolts at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/75CD999F-5D2E-40B9-8A4C-703A73983017%40lancaster.ac.uk.
For more options, visit https://groups.google.com/d/optout.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20161205/605a3810/attachment.htm>


More information about the Chibolts mailing list