Precodes and continuation markers

Brian MacWhinney macw at cmu.edu
Sun Dec 11 17:52:28 UTC 2011


Kevin,

     This is a helpful observation.  For words like "ticket", you can add a feature to their lexical entry 
in MOR that marks them as borrowings.  In that case, you would code:

*MAR:   pero el ticket es mil dólares.

That coding seems intuitively right.  Then you would be have to modify the Spanish lexicon to include a feature such
as {bor eng} for the relevant words.  To get this feature to appear in the MOR output, you would the following line to the output.cut file:

bor:

I think this is the right way to do this.  For consistent management, you can spot all such recent English borrowings in the current Spanish MOR and we can pull them out into a file called, for example, n-bor.cut.

You could do the same for English. 

-- Brian

On Dec 11, 2011, at 8:32 AM, Kevin Donnelly wrote:

> Hi Brian
> 
> ::::On Thursday 08 December 2011 Brian MacWhinney said::::
>> I assume that you are using "indeterminates" to refer to word at s:eng&spa.
>> But, even with that assumption in place, I don't quite follow.  I guess I
>> would need a concrete example indicating what you want to track.
> 
> I'm having no problem with tracking our data - I was simply pointing out that 
> "just looking for utterances with @s" may not give you the full picture.
> 
> For example, these two utterances are "mixed" in the sense that they both use 
> "English" words, and just counting @s will tell you that.
> 
> *CAR:	new at s:eng money at s:eng que le llaman .
> They call it "new money".
> 
> *MAR:	pero el ticket at s:eng&spa es mil dólares +... 
> But the ticket is a thousand dollars
> 
> However, in the second the word "ticket" also appears in a Spanish dictionary, 
> and this is reflected in the tag, but would be missed if you only counted @s. 
> 
> Does this mean that the two utterances are "different" in some way?  Maybe, 
> maybe not (though some recent work does seem to show that codeswitches may be 
> more frequent in the presence of "triggers" - cognates like "ticket").  At any 
> rate, it's probably prudent not to ignore data by collapsing it prematurely.
> 
> -- 
> Pob hwyl / Best wishes
> 
> Kevin Donnelly
> kevindonnelly.org.uk
> 
> -- 
> You received this message because you are subscribed to the Google Groups "chibolts" group.
> To post to this group, send email to chibolts at googlegroups.com.
> To unsubscribe from this group, send email to chibolts+unsubscribe at googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/chibolts?hl=en.
> 
> 

-- 
You received this message because you are subscribed to the Google Groups "chibolts" group.
To post to this group, send email to chibolts at googlegroups.com.
To unsubscribe from this group, send email to chibolts+unsubscribe at googlegroups.com.
For more options, visit this group at http://groups.google.com/group/chibolts?hl=en.



More information about the Chibolts mailing list