Fixing a fundamental flaw in Binary SIgnWriting

Adam Frost icemandeaf at GMAIL.COM
Tue Jun 1 21:34:52 UTC 2010


I know that this will seem small for most people as it won't really  
affect people unless they are in the coding level of things, but I  
have to agree that this change is best even when it may be hard for  
some to change with. (Like my poor blog that I have not been keeping  
up even though it is on my list. HA!) I am sure it will all work out  
for the best. Thanks for giving a heads up. ;-)

Adam

On Jun 1, 2010, at 2:00 PM, Valerie Sutton wrote:

> SignWriting List
> June 1, 2010
>
> Hello Steve -
> You are such a generous and brilliant programmer, and we are  
> grateful for all your hard work. This sounds like a good change  
> towards possible Unicode acceptance someday and I think it is  
> wonderful!
>
> For all the programmers out there, stability with the symbols and  
> coding is coming soon - This is apparently a necessary change, and  
> it is Steve who has chosen to do this, and I am following his lead  
> willingly ;-)
>
> And thank you, Steve, for your caring and sharing with everyone -  
> you really create magic for us daily... There have been positive  
> responses on my Facebook page about the new SignWriting Wiki today -  
> I was really pleased to see it -
>
> SignWriting Wiki
> http://www.signbank.org/wiki
>
> Val ;-)
>
> ------
>
>
> On Jun 1, 2010, at 1:49 PM, Steve Slevinski wrote:
>
>> Hi List,
>>
>> This is a technical discussion.  Nothing is going to change  
>> regarding the writing system.  The change is only data related.
>>
>> Back in 2008, I made a poor design choice for Binary SignWriting.    
>> I needed to define what was a character for the encoding model.  I  
>> decided that each symbol should be a character.  Some others  
>> (Stuart Thiessen, Michael Everson, members of the WLDC, ...)  
>> thought that each BaseSymbol should be a character with an  
>> individual symbol being defined as a BaseSymbol character with one  
>> or two modifying characters.
>>
>> Encoding with symbol characters seemed the better choice, rather  
>> than using 3 times the amount of data to say the same thing.  I was  
>> wrong.  My choice made searching by BaseSymbol much more  
>> difficult.  I was forced to pre-process the data before I could  
>> search.  This was wasted effort.  I realized the error of my ways  
>> when I was reading a discussion of searching with Unicode.
>>
>> I need to fix my poor design choice and reencode the ISWA 2010 with  
>> BaseSymbol characters and modifiers.  I then need to refactor the  
>> character encoding model.  This should be a quick fix I'll have  
>> ready by Friday, but it changes BSW once again.  Hopefully for the  
>> last time.
>>
>> On the bright side, this makes it easier for inclusion in Unicode.   
>> With my previous encoding, I required an entire Unicode plane of  
>> 65,000 characters.  With the new encoding, I only need 1,280  
>> characters.  This is a much better number.
>> Years ago, Michael Everson worked with Unicode for the tentative  
>> acceptance of SignWriting into the standard.  If you look at the  
>> Unicode roadmap for the Supplementary Multilingual Plane, you'll  
>> see that Sutton SignWriting has 4 rows set aside awaiting a  
>> proposal.  These 4 rows represent 1024 characters.  With the new  
>> encoding, I can create a proposal that requires 5 rows.  Much more  
>> reasonable that an entire plane.
>> http://www.unicode.org/roadmaps/smp/
>>
>> Sorry to any and all programmers / users this will inconvenience,  
>> but it is a needed change.
>>
>> Regards,
>> -Steve
>>
>



More information about the Sw-l mailing list