ELL: recent minority language papers

Jeff ALLEN jeff at elda.fr
Sun May 16 18:25:29 UTC 1999



Dear Endangered Languages subscribers:

Here are a few recent papers on the topic of developing
new language technology systems for minority languages:


1.      Christopher Hogan.  "OCR for Minority Languages",
        in Proceedings of the 1999 Symposium on Document
        Image Understanding Technology, Annapolis, Maryland,
        April 1999, pp. 235-244.

        Abstract:
        In this paper I discuss the difficulties encountered
        when applying Optical Character Recognition (OCR)
        to minority languages. In particular, I explore the
        case of developing OCR for Haitian Creole (HC), a
        vernacular, minority language. Although HC is written
        with a variant of the Roman alphabet, no OCR
        device has ever been developed specifically with HC
        in mind, with the result that recognition can be
        fairly poor. I present a technique for post-processing
        OCR output that is independent of the OCR device
        being used, and demonstrate that it can improve OCR
        recognition for HC.

2.      HOGAN, Christopher and Jeffrey ALLEN.  (submitted)
        Phonemic and Orthographic realizations of 'r' and 'w'
        in Haitian Creole.  Paper to be presented at the
        International Congress of Phonetic Sciences
        (ICPhS 99), San Francisco, 1-7 August, 1999.

        Abstract:
        This paper presents a synchronic perspective on the
        phonemic status of the orthographic forms 'r' and 'w'
        that appear in Haitian Creole (HC) texts. Other HC
        language researchers have postulated two phonemes
        (i.e., /r/ and /w/) conditioned by roundness/labialization.
        Such evidence is contradicted by written corpora. Our
        analyses take into account the variation in HC found
        in written and spoken corpora.  From this work, we
        aim to determine the status and distribution of the
        related phonemes and phonetic realizations in HC.
        Our findings have considerable bearing on speech
        recognition and speech synthesis systems that are
        currently under development for HC and other
        languages.


3.      ALLEN, Jeffrey and Christopher HOGAN.
        (accepted for presentation)  Le 'r' et le 'w' en
        créole haïtien: 1, 2 ou 3 phonemes?  To be presented at
        the 9e Colloque du Comité International des Etudes
        Créoles.  Held at the Université de Provence,
        Aix-en-Provence, France, 24-29 June 1999.

        I'm still working on the abstract and final version
        of the paper.

4.      ALLEN, Jeffrey.  (accepted for presentation)  La
        standardisation du créole haïtien par l'intermédiaire
        de la linguistique computationnelle.  To be presented
        at the Round Table on Creole Language Standardization
        Issues at the 9e Colloque du Comité International
        des Etudes Créoles.  Held at the Université de
        Provence, Aix-en-Provence, France, 24-29 June 1999.

        I'm still working on the abstract and final version
        of the paper.

				 
=================================================
Jeff ALLEN - Technical Manager/Directeur Technique
European Language Resources Association (ELRA)  &
European Language resources - Distribution Agency (ELDA)
(Agence Europe'enne de Distribution des Ressources Linguistiques)
55, rue Brillat-Savarin
75013   Paris   FRANCE
Tel: (+33) 1.43.13.33.33 - Fax: (+33) 1.43.13.33.30
mailto:jeff at elda.fr
http://www.icp.grenet.fr/ELRA/home.html
----
Endangered-Languages-L Forum: endangered-languages-l at carmen.murdoch.edu.au
Web pages http://carmen.murdoch.edu.au/lists/endangered-languages-l/
Subscribe/unsubscribe and other commands: majordomo at carmen.murdoch.edu.au
----



