Corpora: Diacritics

Steven Bird sb at unagi.cis.upenn.edu
Fri Apr 20 13:32:34 UTC 2001


"Chris Gledhill" <cjg6 at st-andrews.ac.uk> wrote:
> the UNIbet system. As I remember, UNIbet used standard ASCII characters to
> represent the main IPA symbols. Does anybody use it nowadays in corpus work?

Note that there are also SAMPA, X-SAMPA, SAMPROSA and CSLU's Worldbet.
See the following for pointers:

  http://www.ldc.upenn.edu/annotation/#SAMPA

Steven Bird

--
Steven.Bird at ldc.upenn.edu  http://www.ldc.upenn.edu/sb
Assoc Director, LDC; Adj Assoc Prof, CIS & Linguistics
Linguistic Data Consortium, University of Pennsylvania
3615 Market St, Suite 200, Philadelphia, PA 19104-2608


