[Corpora-List] Encoding of apostrophes and quotes
Eric Atwell
eric at comp.leeds.ac.uk
Fri Jun 30 07:48:56 UTC 2006
I think it is quite reasonable for the Unicode standard to give apostrophe
and single-quote a single encoding, if in practice many people can't
understand the difference and use either character interchangeably.
I see this as analogous to a word which can have more than one function,
e.g. "to" can function as preposition or infinitival marker,
or "one" has four possible tags in the ICE/TOSCA part-of-speech tagset
corresponding to four separable functions.
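
To make the point concrete, here is a minimal Python sketch, not anything from a particular corpus toolkit. It only assumes the standard unicodedata module; the helper names describe and fold_quotes are hypothetical, invented for illustration. It shows that the character's name does not say whether a given mark is acting as an apostrophe or as a closing quote, much as a word form alone does not say which tag applies.

```python
import unicodedata

# Two code points commonly typed for apostrophes and single quotes.
ASCII_APOSTROPHE = "\u0027"    # named APOSTROPHE, also typed as a single quote
RIGHT_SINGLE_QUOTE = "\u2019"  # used for closing quotes and for apostrophes


def describe(text):
    """Return (character, Unicode name) for each of the two marks found in text."""
    return [
        (ch, unicodedata.name(ch))
        for ch in text
        if ch in (ASCII_APOSTROPHE, RIGHT_SINGLE_QUOTE)
    ]


def fold_quotes(text):
    """Hypothetical normalisation: map the typographic mark onto the ASCII one."""
    return text.replace(RIGHT_SINGLE_QUOTE, ASCII_APOSTROPHE)


if __name__ == "__main__":
    # Same code point, two functions: apostrophe in "don't", then closing quote.
    sample = "\u2018don\u2019t\u2019"
    for ch, name in describe(sample):
        print(repr(ch), name)
    print(fold_quotes(sample))
```

Folding of this kind discards nothing that the encoding actually distinguished; deciding which function a given mark has is left to later analysis, as with an ambiguous word.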
Eric Atwell
Senior Lecturer, Language research group, School of Computing,
Faculty of Engineering, University of Leeds, LEEDS LS2 9JT, England
TEL: +44-113-3435430 FAX: +44-113-3435468 http://www.comp.leeds.ac.uk/eric