Lemmatization on French mor tiers
Nicola Phillips
nicolakphillips at gmail.com
Thu Nov 23 23:59:11 UTC 2023
Hello,
I was reading a chapter by Treffers-Daller today (citation below) and
wondered if a solution had ever been found for the problem described as
follows:
"the French mor tier distinguishes different subcategories of verb forms
(infinitival, participial, progressive and other forms) in the information
to the left of the pipe separator (|) which separates the syntactic
category information from the word itself.... This means that FREQ counts
these different forms of trouver as different types rather than as
different tokens of the type trouver, which results in inflated indices of
lexical richness. This problem also exists, but to a lesser extent, for
other syntactic categories such as pronouns. Using switches such as s”*-%
%”, which tell CLAN to ignore form variants, does not solve the problem,
because these switches only look at information after the pipe separator"
Thanks so much for your input!
Nicky Phillips
Treffers-Daller, J. (2009). Language Dominance and Lexical Diversity: How
Bilinguals and L2 Learners Differ in their Knowledge and Use of French
Lexical and Functional Items. In B. Richards, M. H. Daller, D. D. Malvern,
P. Meara, J. Milton, & J. Treffers-Daller (Eds.), Vocabulary Studies in
First and Second Language Acquisition: The Interface Between Theory and
Application (pp. 74-90). Palgrave Macmillan UK.
https://doi.org/10.1057/9780230242258_5
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/4784d587-912e-4467-8a18-33af28cc1f73n%40googlegroups.com.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20231123/1590c74b/attachment.htm>
More information about the Chibolts
mailing list