SignWriting List September 27, 2004 Dear SW List: Here is a wonderful article on the web by Joe Clark. I do not know Joe. You will notice he mentions the SGN code for sign languages, and says there is no way to specify a sign language at this time, but he just doesn't realize that more has been done...But he mentions the coding of regional dialects below...Or go to his web page: http://www.joeclark.org/book/sashay/serialization/AppendixB.html ---------------------- Trebuchet MSAUTHOR’S NOTE – You’re reading the HTML version of a chapter from the book 0000,3333,9999Building Accessible Websites (ISBN 0-7357-1150-X). 0000,3333,9999Copyright © 0000,3333,9999Joe Clark, 2002 (0000,3333,9999about the author). All rights reserved. ¶ 0000,3333,9999Back to Contents Lucida GrandeUnder the Web Content Accessibility Guidelines, you are required to specify changes in the “natural” or human language used in documents. You do this by adding the Andale Monolang="CourierlanguagecodeAndale Mono"Lucida Grande attribute to virtually any tag (like Andale Mono<

<

Lucida Grande, Andale Mono<<Lucida Grande, Andale Mono<<Lucida Grande, or Andale Mono<CourierxAndale Mono><CourierxAndale Mono>Lucida Grande. Also, in order to specify a change in language, you must already have declared the default, base, or original language, which you do by adding Andale Monolang="CourierlanguagecodeAndale Mono"Lucida Grande to the Andale Mono<Lucida Grande or (preferably) Andale Mono<Lucida Grande tags, like so: • Andale Monobody lang="en"Lucida GrandeAndale Monohtml lang="fr-ca"Lucida Grande So just what GeorgiaareLucida Grande those language codes? They’re two-letter abbreviations, optionally followed by a hyphen and some other qualifier. In the second example above, French is specified Andale Mono(fr)Lucida Grande, but of the Canadian variety (Andale MonocaLucida Grande). The exact specification is GeorgiaISOLucida Grande 639-1, “Codes for the Representation of Names of Languages,” whose homepage resides at the Library of Congress: lcweb.loc.gov/standards/iso639-2/. (Yes, that URL says “iso639-2”; you have to hunt around at the site to find the 639-1 section, which is a bit outdated.) Note that the companion standard, GeorgiaISOLucida Grande 639-2, provides GeorgiathreeLucida Grande-letter codes for languages – and for a vastly wider range of languages, at that. Online, however, we must stick with the two-letter codes. At least, this is my interpretation. A page at the 5555,1A1A,8B8BWorld Wide Web Consortium Internationalization site tells us: VerdanaAccording to RFC 3066, for languages with both a two-letter and a three-letter code, the two-letter code must be used. This also solves the problem of those languages that have two different three-letter codes, because all of them also have a two-letter code. Lucida GrandeSo this “solves the problem,” does it? I don’t see a lot of problems that are actually “solved” here. The RFC (request for comment) mentioned in this citation merely refers back to GeorgiaISOLucida Grande 639-1 and tells us, in effect, that the only three-letter language codes we may use are those that do not have a two-language code. But there are somewhat complex rules in place governing when a three-letter code may be coined without creating a corresponding two-letter code. From an accessibility perspective, this restriction will eventually have to be lifted. Textual media are not the only kind available on the Web, and as more and more video becomes available, more and more sign languages will be available, and all sign-language names exist in the three-character specification (under Andale MonosgnLucida Grande). It is technically impossible to specify a sign language on a Website as the standards currently exist. My recommendation? Damn the torpedoes! If you have to specify a language with a three-letter code because you cannot find a two-letter code, do it. Such a practice appears to be permitted anyway and is the only one that makes sense. Let’s start with the two-letter codes. Now, hundreds of languages have been defined, and I’m not going to list every single one of them here because the super-obscure language codes have no practical value to my audience. (It’s nice to know that Faroese has its own language code, but how many readers of a book on Web accessibility will have cause to design Websites in Faroese? And won’t such designers already know that Faroese’s language code is Andale MonofoLucida Grande?) Besides, the GeorgiaISOLucida Grande 639-1 specs are all online and provide all the codes for you. I have not found a truly reliable source for the Top Ten languages used online (after English – the Top Eleven, really). I have synthesized various lists into the following somewhat longer compilation – not quite Top Forty, but close. Trebuchet MSVery-widely-used languages online Lucida GrandeJapanese Andale MonojaLucida Grande German Andale MonodeLucida Grande Chinese Andale MonozhLucida Grande French Andale MonofrLucida Grande Spanish Andale MonoesLucida Grande Italian Andale MonoitLucida Grande Dutch Andale MononlLucida Grande Portuguese Andale MonoptLucida Grande Finnish Andale MonofiLucida Grande Swedish Andale MonosvLucida Grande Norwegian Andale MononoLucida Grande Danish Andale MonodaLucida Grande Korean Andale MonokoLucida Grande Polish Andale MonoplLucida Grande Russian Andale MonoruLucida Grande Hebrew Andale MonoheLucida Grande Hungarian Andale MonohuLucida Grande Greek Andale MonoelLucida Grande Turkish Andale MonotrLucida Grande Czech Andale MonocsLucida Grande Thai Andale MonothLucida Grande Arabic Andale MonoarLucida Grande Icelandic Andale MonoisLucida Grande Trebuchet MSConfusable codes Lucida GrandeNote that country codes and language codes are often Georgiajust different enoughLucida Grande to get you into trouble if you’re not eagle-eyed. • Japan is Andale MonojpLucida Grande, but Japanese is Andale MonojaLucida Grande. • China is Andale MonocnLucida Grande, but Chinese is Andale MonozhLucida Grande. • The Netherlands is Andale MononlLucida Grande, and so is Dutch. • Sweden is Andale MonoseLucida Grande, but Swedish is Andale MonosvLucida Grande. • Denmark is Andale MonodkLucida Grande, but Danish is Andale MonodaLucida Grande. • Greece is Andale MonogrLucida Grande, but Greek is Andale MonoelLucida Grande. Trebuchet MSDialects Lucida GrandeSome dialect names are standardized under GeorgiaISOLucida Grande 639-1, while others, usually of a more fanciful nature (Cockney, Newfoundland, GeorgiajoualLucida Grande) are not. Both types are permitted; it is up to the browser or device to interpret the codes correctly. It is possible and legal, for example, to specify all these variants of English: • Andale MonoenLucida Grande (English: No specified variant) • Andale Monoen-usLucida Grande (United States English) • Andale Monoen-au-tasLucida Grande (Tasmanian English, Australia) • Andale Monoen-inLucida Grande (Indian English) • Andale Monoen-uk-Cockney-Rhyming-SlangLucida Grande You must not assume, however, that browsers or devices will be able to understand or represent anything beyond the first dash. In rather more important cases, like the two variations of Norwegian, Bokmål and Nynorsk, enough social importance is given to the dialects that they have their own codes. • Andale MononoLucida Grande (Norwegian: No specified variant) • Andale MononbLucida Grande (Norwegian Bokmål) • Andale MononnLucida Grande (Norwegian Nynorsk) Authors writing in Norwegian will likely know which dialect they are using and can cite it appropriately. Authors who merely quote Norwegian text or make some other casual use of it may not know which is which; that’s what the generic Andale MononoLucida Grande tag is for. If you’re wondering about Chinese (no doubt you are), Mandarin and Cantonese are not the only recognized dialects, but all of them are subsumed under Andale MonozhLucida Grande. You must use dialect codes for Mandarin (Andale Monozh-guoyuLucida Grande) and Cantonese (Andale Monozh-yueLucida Grande) if you wish to differentiate them. (The distinction is nearly meaningless on Websites that do not use voice given that the two dialects use the same writing system.) There is no difference in language code between Traditional and Simplified Chinese; arguably there should be. Take my word for this as a linguist and an accessibility GeorgiaobsessifLucida Grande: This stuff is more detailed and pedantic than trainspotting, and almost as addictive to susceptible personalities. Just keep in mind that dinner-party guests are never really as interested in this topic as we are. Trebuchet MS0000,3333,9999PreviousLucida Grande   ¶   Trebuchet MS0000,3333,9999ContentsLucida Grande   ¶   Trebuchet MS0000,3333,9999NextLucida Grande