Corpora: Diacritics
Steven Bird
sb at unagi.cis.upenn.edu
Fri Apr 20 13:32:34 UTC 2001
"Chris Gledhill" <cjg6 at st-andrews.ac.uk> wrote:
> the UNIbet system. As I remember, UNIbet used standard ASCII characters to
> represent the main IPA symbols. Does anybody use it nowadays in corpus work?
Note that there is also SAMPA, XSAMPA, SAMPROSA and CSLU's Worldbet.
See the following for pointers:
http://www.ldc.upenn.edu/annotation/#SAMPA
Steven Bird
--
Steven.Bird at ldc.upenn.edu http://www.ldc.upenn.edu/sb
Assoc Director, LDC; Adj Assoc Prof, CIS & Linguistics
Linguistic Data Consortium, University of Pennsylvania
3615 Market St, Suite 200, Philadelphia, PA 19104-2608
More information about the Corpora
mailing list