Corpora: a particular type of sloppiness
Geoffrey Williams
geoffrey.williams at wanadoo.fr
Fri Apr 20 09:18:08 UTC 2001
Just to add my penny's worth.
I suppose one of the problems of sloppiness comes from our past experience
with email. Like a lot of people I started out on a UNIX workstation, so
when writing in French I could not use diacritics, even if I had wanted to.
As the use of email spread to the administration and the arts department we
all got used to unreadable emails. MIME changed this, although only
gradually as some of us remained on outdated workstations long after the
arts department had been equipped with new computers. The result is that
some of us remain nervous as to what we are sending, and therefore tend to
remain in basic ASCII. I am not sure that our failure to readapt after what
was a pragmatic, and not a sloppy choice, counts as sloppiness.
Diacritics remain a problem. In teaching corpus linguistics, one of my
first problems is explaining the inherent problem of "unstable" characters
as the ease of use of current technology has masked the underlying markup.
In such cases a little bit of knowledge of computing prehistory does help.
best
Geoffrey
PS In French I admit to my incompetence being more a problem than
sloppiness. Laziness for a two-finger typist is also a factor, especially
when using upper case characters.
****************************************************************************
*************************************
Geoffrey Clive Williams
Langues Etrangères Appliquées
Université de Bretagne Sud
4 rue Jean Zay
56000 LORIENT
Geoffrey.Williams at univ-ubs.fr
http://www.univ-ubs.fr/crellic
****************************************************************************
***************************************
More information about the Corpora
mailing list