build a font for your endangered language...

William J Poser wjposer at LDC.UPENN.EDU
Sat May 17 02:48:15 UTC 2008


Heather,

Unicode does not encode n with a slash through it as a single codepoint,
but it does have the slash as a combining character. In fact, it has two:
U+0337 COMBINING SHORT SOLIDUS OVERLAY
U+0338 COMBINING LONG SOLIDUS OVERLAY
(The forward slash is called "solidus" in Unicode-speak.)

So, you can get a lower-case n overlaid by a long slash by entering
first the n and then U+0338. They are two separate codepoints, but
Unicode-aware software with a suitable font will render them as the
single glyph you want.
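
As a rough illustration of the two-codepoint sequence, here is a minimal
Python sketch. It assumes only the standard unicodedata module; the
variable names are mine, for illustration. It builds the sequence, lists
the codepoints in it, and checks that NFC normalization does not merge
them, since there is no precomposed n-with-slash to merge them into.

```python
import unicodedata

# Base letter followed by U+0338 COMBINING LONG SOLIDUS OVERLAY.
n_slash = "n" + "\u0338"

# Looks like one letter on screen (font permitting) but is two codepoints.
print(n_slash)
print(len(n_slash))

# List each codepoint with its official Unicode name.
for ch in n_slash:
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")

# NFC composes base + mark only when a precomposed character exists.
# None exists for n + U+0338, so the sequence is left unchanged.
composed = unicodedata.normalize("NFC", n_slash)
print(composed == n_slash)
```

Whether the slash is drawn neatly over the n still depends on the font
and the rendering software, not on the text itself.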

Because this is treated as a sequence of two "characters" in Unicode,
you may have to do something special to get your sort order the
way you want it. Such things are a little easier if you can get what you
want as a single codepoint, but the Unicode Consortium is reluctant
to add single codepoints for things that can be composed from existing
combining characters.
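
One possible way to handle the sorting, sketched in Python: group each
base letter with its combining marks, then rank the resulting letters
against an explicit alphabet. The alphabet below is purely hypothetical
(I have assumed the slashed n is its own letter, placed right after plain
n); substitute whatever order your language actually uses. The function
names are also just for illustration.

```python
import unicodedata

N_SLASH = "n\u0338"  # n + COMBINING LONG SOLIDUS OVERLAY

def tokenize(word):
    """Split a word into letters, keeping combining marks with their base."""
    letters = []
    for ch in unicodedata.normalize("NFD", word):
        if letters and unicodedata.combining(ch):
            letters[-1] += ch  # attach the mark to the preceding base letter
        else:
            letters.append(ch)
    return letters

# Hypothetical alphabet: slashed n is a separate letter right after n.
ALPHABET = list("abcdefghijklmn") + [N_SLASH] + list("opqrstuvwxyz")
RANK = {letter: i for i, letter in enumerate(ALPHABET)}

def sort_key(word):
    """Map each letter to its rank; unknown letters sort after the alphabet."""
    return [RANK.get(letter, len(ALPHABET)) for letter in tokenize(word.lower())]

words = ["oa", "n\u0338a", "nz", "no", "ma"]
sorted_words = sorted(words, key=sort_key)
print(sorted_words)
```

Here every word beginning with plain n comes before any word beginning
with the slashed n, which in turn comes before the words beginning with o.
Software with real collation support can do the same job with a tailored
collation instead of a hand-written key.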

Bill


