scoping of a clause delimiter
brielle.stark
brielle.stark at gmail.com
Fri Jan 10 16:47:36 UTC 2020
I should also say that we typically code C-units and not T-units, and we're
hoping to not have to code two types of utterances (e.g. not teach raters
how to do both T- and C-unit delineation).
Brie
On Friday, January 10, 2020 at 11:46:49 AM UTC-5, brielle.stark wrote:
>
> I just asked a similar question, I believe, e.g. how to count certain
> clauses. Perhaps we're on the same train of thought.
>
> For instance, we're using the Peer Conflict Resolution task (
> https://pubs.asha.org/doi/full/10.1044/1058-0360%282007/022%29), and
> their main outcomes measures are (1) mean length of T-unit, (2) clausal
> density (based on number of independent, nominative, relative and adverbial
> clauses) and (3) overall subordinate clause use. It's really quite
> difficult to train raters to code each clause type within an already
> heavily coded sample (e.g. we're doing typical word-level codes,
> utterance-level codes as well as correct information unit [e] codes), so I
> didn't know if there was something in the %gra dependencies that we could
> extract to get this information without hand-coding? Perhaps this is a
> similar answer to what Monika is looking for?
>
> Best,
>
> Brie
>
>
> On Thursday, May 9, 2019 at 8:11:13 AM UTC-4, Monika Bader wrote:
>>
>> Dear Brian,
>> thanks for responding so quickly! We are working with written texts
>> produced by young foreign language learners and want to code both for
>> clauses and T-units/C-units to get ratios such as clauses per T-unit (a
>> measure for which scoping is not crucial). We are relatively new to CLAN
>> and had a tutorial with Victoria Johansson from Lund University, where they
>> used CLAN for a big project on language development. They used the
>> following strategy to encode T-units and clauses: clauses were placed on
>> separate chat lines, while T-units were separated using @EndTurn.
>> Center-embedded clauses (which there are many of in Swedish) were repeated
>> on a separate subordinate tier, named %ces.
>> After reading the CHAT/CLAN manuals, we realized that an alternative
>> option would be to place T-units on separate chat lines, and use [^c] to
>> code for clauses. We have been since grappling with the possible
>> consequences of choosing one strategy over the other, so if you have some
>> thoughts around this, we would be happy to get your advice. We want later
>> to share the corpus with our students for further analyses (and further
>> coding), so we are trying to think carefully about the different options.So
>> far we are leaning towards the [^c] strategy. Though T-units have been used
>> in the literature to measure development, we are also interested in
>> conducting a more detailed analysis, and investigating what kind of clauses
>> and structures are inside those T-units to better understand what our
>> learners can do and which clauses and structures they rely on. We are also
>> interested in investigating the extent to which they use what LGSWE calls
>> syntactic nonclausal structures (which seem to be quite common in our data
>> set). We have the impression that a lot of this information can be encoded
>> by modifying [^c]. It seems to us that there might be issues related to
>> scoping if one then wanted to limit the search to features related to
>> subtypes of [^c] (for instance, number of errors/or certain types of errors
>> in certain types of clauses, such as relative clauses). It is possible that
>> as you suggest these are better handled by relying on the %gra line, though
>> I think we would have to conduct an extensive manual error analysis if we
>> wanted to get a relatively accurate %gra line.
>>
>> Best,
>> Monika
>>
>> onsdag 8. mai 2019 16.01.54 UTC+2 skrev macw følgende:
>>>
>>> Dear Monika,
>>> I am not familiar with work that calculates MLU based on clauses and
>>> I am not sure why one would want to use such a measure. The major point of
>>> MLU is to consider the extent to which speakers compose more complex
>>> sentences and the act of breaking up sentences into clauses would actually
>>> remove the thing that it is trying to measure.
>>> As you say, this system of clause marking is definitely not going to
>>> work well for center embedding. You could get the scope of the embedded
>>> clause, but then the main clause would be broken up. But perhaps that is
>>> interesting in itself.
>>> I am curious why you are using this type of analysis. What exactly
>>> are you interested in measuring. It seems to me that, if you have a
>>> relatively accurate %gra line for an utterance, then that could be more
>>> useful that hand-done clause marking.
>>>
>>> --Brian MacWhinney
>>>
>>> On May 8, 2019, at 7:48 AM, 'Monika Bader' via chibolts <
>>> chib... at googlegroups.com> wrote:
>>>
>>> Hi,
>>> we are trying to decide on the best way to code for clauses. The manual
>>> suggests using a clause delimiter, and we quite like this option
>>> (especially the possibility of creating user defined codes). However, we
>>> are somewhat worried about the scoping of the symbol. We understand that
>>> for some analyses, such as MLU/MLT based on clauses, this is not a crucial
>>> issue, but we do believe that for some other analyses one would need the
>>> right scoping (if we are not mistaken). For instance, calculating words per
>>> error free clauses (or any other clause code one uses). In examples such as
>>>
>>> The book [that you buyed yesterday] [^c err] has disappeared [^c]
>>>
>>> [^c err] would scope over "the book" as well, which we wouldn't
>>> necessarily want to include. In some languages these kinds of
>>> nested/center-embedded clauses are more common than in other. The manual
>>> says that "it is not necessary to mark the scope", but is it possible? or
>>> is there any other way to deal with cases such as these?
>>>
>>> We appreciate any suggestions!
>>>
>>> Best,
>>> Monika
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "chibolts" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to chib... at googlegroups.com.
>>> To post to this group, send email to chib... at googlegroups.com.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/chibolts/9212b6f1-ad4c-4183-b277-45711661af3c%40googlegroups.com
>>> <https://groups.google.com/d/msgid/chibolts/9212b6f1-ad4c-4183-b277-45711661af3c%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>>
>>>
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/09e7d5bf-faa7-465a-b637-24d561681d2f%40googlegroups.com.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20200110/43b7055b/attachment.htm>
More information about the Chibolts
mailing list