encodings (was Re: [Lexicog] sounds animals make)

William J Poser billposer at ALUM.MIT.EDU
Tue May 25 14:27:58 UTC 2004


Dear Hayim,

There are codepoints for the dotted letters that you require:

	s 0x1e63
	S 0x1e62
	t 0x1e6d
	T 0x1e6c
	h 0x1e25
	H 0x1e24

and for h with the little subscript arc:

	h 0x1e2b
	H 0x1e2a

However, Unicode does not provide directly for all possible combinations
of base character and diacritic, so it provides many diacritics as
separate codepoints. There is a subscript dot diacritic. The Unicode name
is "Combining Dot Below", and the hexadecimal code is 0x0323.
Such diacritics immediately follow the base character, so you can
also get dotted-s with the sequence 0x0073 0x0323. By using the
diacritic in this way you can represent combinations of a letter with
the subscript dot for which there is no separate codepoint.
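
As a rough illustration (not something you need in order to type the
letters), here is a small Python sketch using the standard unicodedata
module. It shows that the single codepoint and the two-codepoint sequence
are different strings that normalize to one another, and that a
combination with no codepoint of its own simply stays a sequence. The
choice of "g" as the second base letter is my own example, on the
assumption that Unicode has no precomposed g with dot below.

```python
import unicodedata

precomposed = "\u1e63"       # LATIN SMALL LETTER S WITH DOT BELOW
sequence = "\u0073\u0323"    # s followed by COMBINING DOT BELOW

# The two spellings are different codepoint sequences...
print(precomposed == sequence)                                 # False
# ...but normalization converts between them.
print(unicodedata.normalize("NFC", sequence) == precomposed)   # True
print(unicodedata.normalize("NFD", precomposed) == sequence)   # True

# A base letter with no precomposed dotted form stays as two codepoints
# even after composition (NFC).
dotted_g = "g\u0323"
nfc_g = unicodedata.normalize("NFC", dotted_g)
print(len(nfc_g))                                              # 2
```

If you compare or sort strings that may mix the two spellings, it is
usually safest to normalize them all to one form first.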

The subscript arc is also available as a separate diacritic. It is
called "Combining Breve Below" and has the code 0x032e.
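
The same check works for the arc. The sketch below, again only an
illustration with Python's standard unicodedata module, looks up the
name of the diacritic and composes it with h and H to see which single
codepoints result. The variable names are mine.

```python
import unicodedata

BREVE_BELOW = "\u032e"
print(unicodedata.name(BREVE_BELOW))   # COMBINING BREVE BELOW

results = []
for base in ("h", "H"):
    # Compose base letter + diacritic into a single codepoint, if one exists.
    composed = unicodedata.normalize("NFC", base + BREVE_BELOW)
    results.append(f"U+{ord(composed):04X}")
    print(base, results[-1], unicodedata.name(composed))
```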

Best,
Bill

--
Bill Poser, Linguistics, University of Pennsylvania
http://www.ling.upenn.edu/~wjposer/ billposer at alum.mit.edu

