<html><body><a href="http://infoling.org" target="_blank"><img
src="http://infoling.org/img/infoling.png" alt="Logo image by Hay
Kranen / CC-BY" width="255" height="50" align="left" border="0" /></a>
<br /><br /><br />
<br /><font style="font-size:80%"><table border="0" bordercolor="#FFF"
width="500px"><tr><td width="130"><a style="text-decoration:none"
target="_blank" href="http://infoling.org/revista/" title="Ir a
Infoling Revista"><img style="vertical-align:-30%;width:130px"
alt="Infoling Revista"
src="http://delicious.uab.es/infoling2/img/infolingIR.png"></a></td><td
width="80"><a style="text-decoration:none"
href="http://www.facebook.com/infoling" target="_blank"><img
border="0" src="http://infoling.org/img/facebook-icon.png"
alt="Infoling at Facebook"
style="vertical-align:-30%;height:16px;width:16px" />
Facebook</a></td><td width="180"><a style="text-decoration:none"
href="http://www.twitter.com/infoling" target="_blank"><img border="0"
src="http://infoling.org/img/t_small-b.png" alt="Infoling at Twitter"
style="vertical-align:-30%;height:16px;width:16px" />
Twitter</a></td></tr></table></font>
<br />Moderador/a: Carlos Subirats (U. Autónoma Barcelona), Mar Cruz
(U. Barcelona)
<br />Editoras: Paloma Garrido (U. Rey Juan Carlos), Laura Romero (UB)
<br />Programación, desarrollo: Marc Ortega (UAB)
<br />Directoras/es de reseñas: Alexandra Álvarez (U. Los Andes,
Venezuela), Yvette Bürki (U. Bern, Suiza), María Luisa Calero (U.
Córdoba, España), Luis Cortés (U. Almería)
<br />Asesoras/es: Isabel Verdaguer (UB), Gerd Wotjak (U. Leipzig,
Alemania)
<br />Asesor legal: Daniel Birba
<br />Colaboradoras/es: Julia Bernd (Cause Data Collective, EE.UU),
Miroslava Cruz (U. Autónoma del Estado de Morelos. México), Matthias
Raab (UB), Antonio Ríos (UAB)
<br />
<br />Con el patrocinio de:
<br /><table border="0" bordercolor="#FFFFFF"
width="200px"><tr><!--<td><a href="http://www.fundacioncomillas.es/"
target="_blank"><img
src="http://www.infoling.org/img/logo-comillas.png" alt="Fundación
Comillas" width="85" height="49" align="left" border="0"
/></a></td>--><td><a
href="http://www.arcomuralla.com/Arco/Shop/default.asp"
target="_blank"><img style="border:0;margin-top:10px"
src="http://www.infoling.org/img/logoarco.jpg" alt="Arco Libros"
width="62" height="34" align="left" border="0"
/></a></td></tr></table><br /><hr /><font style="font-size:90%">
<br /><b>Infoling 9.51 (2013)</b><br />ISSN: 1576-3404 </font>
<br /><font style="font-size:90%">© Infoling 1996-2012. Reservados
todos los derechos</font>
<br />
<br /><hr /><b>Recursos lingüísticos: </b><br />GlobalPhone Spanish
(Latin American) Pronunciation Dictionary<br /><b>URL:</b> <a
href="http://catalog.elra.info/product_info.php?products_id=1209"
target="_blank">http://catalog.elra.info/product_info.php?products_id=1209</a><br
/><b>Información de:</b> Infoling List
<infoling@infoling.org><br />Compartir: <a
href="http://api.addthis.com/oexchange/0.8/forward/facebook/offer?url=http://www.infoling.org/informacion/RecursoL117.html&pubid=ra-4def7f4a7565a706"
target="_blank" rel="nofollow"><img border="0"
src="http://infoling.org/img/facebook-icon.png" alt="Send to Facebook"
title="Send to Facebook"
style="vertical-align:-30%;height:16px;width:16px"
/></a> <a
href="http://api.addthis.com/oexchange/0.8/forward/twitter/offer?url=http://www.infoling.org/informacion/RecursoL117.html&template=@infoling%20GlobalPhone%20Spanish%20(Latin%20American)%20Pronunciation%20Dictionary%20{{url}}&pubid=ra-4def7f4a7565a706&shortener=bitly&bitly.login=infoling&bitly.apiKey=R_60e1d6b1cb688030e7759b835f63d0c0"
target="_blank" rel="nofollow"><img border="0"
src="http://infoling.org/img/t_small-b.png" alt="Tweet this"
title="Tweet this" style="vertical-align:-30%;height:16px;width:16px"
/></a><hr /><p><a
href="http://infoling.org/english/search/recursos/ID/117"
target="_blank">View with English headings</a></p><hr /><br
/><b>Descripción</b><br /><p> The GlobalPhone pronunciation
dictionaries, created within the framework of the multilingual speech
and language corpus GlobalPhone, were developed in collaboration with
the Karlsruhe Institute of Technology (KIT).<br /><br />The
GlobalPhone pronunciation dictionaries contain the pronunciations of
all word forms found in the transcription data of the GlobalPhone
speech & text database. The pronunciation dictionaries are currently
available in 15 languages: Arabic (29230 entries/27059 words),
Bulgarian (20193 entries), Croatian (23497 entries/20628 words), Czech
(33049 entries/32942 words), French (36837 entries/20710 words),
German (48979 entries/46035 words), Hausa (42662 entries/42079 words),
Japanese (18094 entries), Polish (36484 entries), Portuguese
(Brazilian) (54146 entries/54130 words), Russian (28818 entries/27667
words), Spanish (Latin American) (43264 entries/33960 words), Swedish
(about 25000 entries), Turkish (31330 entries/31087 words), and
Vietnamese (38504 entries/29974 words). Other 3 languages will also be
released: Chinese-Mandarin, Korean and Thai.<br /><br />1) Dictionary
Encoding:<br />The pronunciation dictionary entries consist of full
word forms and are either given in the original script of that
language, mostly in UTF-8 encoding (Bulgarian, Croatian, Czech,
French, Polish, Russian, Spanish, Thai) corresponding to the trl-files
of the GlobalPhone transcriptions or in Romanized script (Arabic,
German, Hausa, Japanese, Korean, Mandarin, Portuguese, Swedish,
Turkish, Vietnamese) corresponding to the rmn-files of the GlobalPhone
transcriptions, respectively. In the latter case the documentation
mostly provides a mapping from the Romanized to the original
script.<br /><br />2) Dictionary Phone set:<br />The phone sets for
each language were derived individually from the literature following
best practices for automatic speech processing. Each phone set is
explained and described in the documentation using the international
standards of the International Phonetic Alphabet (IPA). For most
languages a mapping to the language independent GlobalPhone naming
conventions (indicated by “M_”) is provided for the purpose of
data sharing across languages to build multilingual acoustic
models.<br /><br />3) Dictionary Generation:<br />Whenever the
grapheme-to-phoneme relationship allowed, the dictionaries were
created semi-automatically in a rule-based fashion using a set of
grapheme-to-phoneme mapping rules. The number of rules highly depends
on the language. After the automatic creation process, all
dictionaries were manually cross-checked by native speakers,
correcting potential errors of the automatic pronunciation generation
process. Most of the dictionaries have been applied to large
vocabulary speech recognition. In many cases the GlobalPhone
dictionaries were compared to straight-forward grapheme-based speech
recognition and to alternative sources, such as Wiktionary and usually
demonstrated to be superior in terms of quality, coverage, and
accuracy.<br /><br />4) Format:<br />The format of the dictionaries is
the same across languages and is straight-forward. Each line consists
of one word form and its pronunciation separated by blank. The
pronunciation consists of a concatenation of phone symbols separated
by blanks. Both, words and their pronunciations are given in
tcl-script list format, i.e. enclosed in “{}”, since phones can
carry tags, indicating the tone and length of a vowel, or the word
boundary tag “WB”, indicating the boundary of a dictionary unit.
The WB tag can for example be included as a standard question in the
decision tree questions for capturing crossword models in
context-dependent modeling. Pronunciation variants are indicated by ()
with n = 2, 3, 4,… indicating the number of variants per word. The
order in which variants occur in the dictionary is not necessarily
related to their frequency in the corpus.<br />{word} {{w WB} o r {d
WB}}<br /><br />5) Documentation: The pronunciation dictionaries for
each language are complemented by a documentation that describes the
format of the dictionary, the phone set including its mapping to the
International Phonetic Alphabet (IPA), and the frequency distribution
of the phones in the dictionary. Most of the pronunciation
dictionaries have been successfully applied to large vocabulary speech
recognition and references to publications are given when
available.<br /><br />Academic non-member prices: 700.00 EUR</p><br
/><b>Área temática:</b> Fonética, Fonología, Lingüística
computacional, Lingüística de corpus<br /><br /><b>Información en
la web de Infoling:</b><br /> <a
href="http://www.infoling.org/informacion/RecursoL117.html"
target="_blank">
http://www.infoling.org/informacion/RecursoL117.html</a></body></html>