Freq on French MOR tiers
Florence Chenu
Florence.Chenu at univ-lyon2.fr
Mon Jun 21 19:38:08 UTC 2010
Dear Leonid,
I am beginning to have fun with the new feature, but I can't figure out how
to get rid of the replacement word preceding the [: ...] code marker (in my
results, I sometimes have v|verb at replaced_word).
And your help text is sometimes puzzling to me:
1) What does a + or a - do? ("followed by - or + and/or the following...")
2) What does " word -find "word"" mean?
3) In your example:
" +t%mor -t* +s"@r*,|adv,o%"
find all stems of all "adv" and erase all other markers"
how come you don't use the "-" as in the first example (+t%mor -t*
+s"@r-*,|adv,o-%") ???
Thanks,
Florence.
-----Original Message-----
From: chibolts at googlegroups.com [mailto:chibolts at googlegroups.com] On behalf
of Leonid Spektor
Sent: Saturday, June 19, 2010 16:07
To: chibolts at googlegroups.com
Subject: Re: Freq on French MOR tiers
Florence,
I think you are using an older version of CLAN. I just tried your
command below with the latest version of CLAN on the data that has both
v|...-... and v|...&... elements and I did not get any results with "&" in
them.
But I would recommend that you switch to the new way of searching for
items on the %mor tier. The new method is specifically designed for searching
the %mor tier and provides a more precise match. You can type "freq +s@" in
the Commands window to get more information and a few examples of this new feature.
For example, your command below would look like this:
freq +s"@|-v*,r-*,o-%" +t*sbj *.cha
This command looks for items that have the part of speech "v*", indicated by
"|-v*", and any stem, indicated by "r-*". The "o-%" part instructs the program
to exclude all other parts of each item from the output. "o-%" acts the same
way as "-%%" and "&%%" in your command below.
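To make the intended result concrete, here is a rough Python sketch of what the output amounts to: keep the part of speech and the stem of each verb item, drop everything after the first "&" or "-", and count. This is only an illustration, not how CLAN implements the search. The function names and the sample items (including "v|tomber-PP") are made up for the example, and it assumes simple items of the form pos|stem with no prefixes, compounds, clitics, or hyphenated stems.

```python
from collections import Counter
import re


def pos_and_stem(item):
    """Split a %mor item such as 'v|voir&PRES&3S' into (pos, stem)."""
    pos, _, rest = item.partition("|")
    # Cut the stem at the first fusional (&) or suffix (-) marker.
    stem = re.split(r"[&-]", rest, maxsplit=1)[0]
    return pos, stem


def count_verbs(items):
    """Count items whose part of speech starts with 'v', keyed by pos|stem."""
    counts = Counter()
    for item in items:
        pos, stem = pos_and_stem(item)
        if pos.startswith("v"):
            counts[f"{pos}|{stem}"] += 1
    return counts


# Hypothetical sample items, not taken from any real transcript.
sample = ["v|voir&PRES&3S", "v|voir&INF", "v|tomber-PP", "n|chat", "v|tenir"]

for key, n in sorted(count_verbs(sample).items()):
    print(n, key)
```

With this reduction, all inflected forms of "voir" collapse into a single "v|voir" entry, which is the behaviour the "o-%" part asks for.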
You will need to get the latest version of CLAN to try this new
command.
Leonid.
On Jun 18, 2010, at 08:55, Florence Chenu wrote:
> Hi Leonid,
>
> I would like to get a list of verbs in a series of files. I tried that
> command:
>
> freq +t%mor -t* +s"v*|*-%%" +s"v*|*&%%" *.cha +t*sbj +u
>
>
> In the result window, I get things like:
>
> 67 v|taper
> 1 v|taquiner
> 7 v|tenir
> 8 v|terminer
> 5 v|tirer
> 36 v|tomber
> 1 v|tondre
> 12 v|toucher
>
> Which I totally expect
>
> but I also get things like :
>
> 1 v|voir&COND&3S
> 2 v|voir&FUT&2S
> 3 v|voir&IMPF&12S
> 9 v|voir&IMPF&3S
> 73 v|voir&INF
> 34 v|voir&PRES&12S
> 10 v|voir&PRES&3P
> 31 v|voir&PRES&3S
> 1 v|voir&SUBJV:PRES&3S
>
> and I wonder why ????
>
> Any tips ?
>
> Thanks,
> Florence.
>
>
> --
> You received this message because you are subscribed to the Google Groups
"chibolts" group.
> To post to this group, send email to chibolts at googlegroups.com.
> To unsubscribe from this group, send email to
chibolts+unsubscribe at googlegroups.com.
> For more options, visit this group at
http://groups.google.com/group/chibolts?hl=en.
>
>