Precodes and continuation markers

Kevin Donnelly kevin at dotmon.com
Mon Dec 12 22:32:44 UTC 2011


Hi Brian

::::On Sunday 11 December 2011 Brian MacWhinney said::::
>      This is a helpful observation.  For words like "ticket", you can add a
> feature to their lexical entry in MOR that marks them as borrowings.  In
> that case, you would code:
> *MAR:   pero el ticket es mil dólares.
> 
> That coding seems intuitively right.  Then you would be have to modify the
> Spanish lexicon to include a feature such as {bor eng} for the relevant
> words.  To get this feature to appear in the MOR output, you would the
> following line to the output.cut file:
> bor:
> 
> I think this is the right way to do this.  For consistent management, you
> can spot all such recent English borrowings in the current Spanish MOR and
> we can pull them out into a file called, for example, n-bor.cut.

Thanks for this - I'll put it in my set of notes on CLAN.  This approach 
wouldn't apply directly to us, though, since we're not using MOR.  Using a 
database means we can pick out sets of words directly from a monolithic 
dictionary (ie no need to segment it into word-types) in familiar grid format, 
and inject whatever tags we like into the output file.

-- 
Pob hwyl / Best wishes

Kevin Donnelly
kevindonnelly.org.uk

-- 
You received this message because you are subscribed to the Google Groups "chibolts" group.
To post to this group, send email to chibolts at googlegroups.com.
To unsubscribe from this group, send email to chibolts+unsubscribe at googlegroups.com.
For more options, visit this group at http://groups.google.com/group/chibolts?hl=en.



More information about the Chibolts mailing list