Corpora: Re: Unicode

LITTLECHILD Peter peter.littlechild at swift.com
Tue Dec 12 12:43:24 UTC 2000


"Mcenery, Tony" wrote:
> So while I think Unicode is the way for corpus work to go in the future,
> treading that path with non-alphabetic writing systems at this moment in time
> is somewhat difficult.

I had a quick look at Unicode as a way of solving the problem of accented characters in Welsh. My first impressions were that I
would have to throw away my favourite editors and have all my Perl and Balise programs re-written by rocket scientists.

But maybe that's too pessimistic..

<from>
<name>peter littlechild</name>
<section>publishing tools and technology</section>
<dept>user documentation</dept>
<firm>s.w.i.f.t. sc</firm>
</from>



More information about the Corpora mailing list