Corpora: ELRA New Resources

Valerie Mapelli mapelli at elda.fr
Tue Aug 8 15:45:43 UTC 2000


[ We apologise for the duplicate posting of this announcement ]
___________________________________________________________
				ELRA
		European Language Resources Association
			       ELRA News
___________________________________________________________

		     *** ELRA NEW RESOURCES ***

We are happy to announce a new resource available via ELRA:

_______________________________________
ELRA-S0085 BABEL Bulgarian Database
_______________________________________

The BABEL Database is a speech database that was produced
by a research consortium funded by the European Union
under the COPERNICUS programme (COPERNICUS Project
1304). The project began in March 1995 and was completed
in December 1998. The objective was to create a database of
languages of Central and Eastern Europe in parallel to the
EUROM1 databases produced by the SAM Project (funded by
the ESPRIT programme).

The BABEL consortium included six partners from Central
and Eastern Europe (who had the major responsibility of
planning and carrying out the recording and labelling) and six
from Western Europe (whose role was mainly to advise and in
some cases to act as host to BABEL researchers). The five
databases collected within the project concern the Bulgarian,
Estonian, Hungarian, Polish, and Romanian languages.

The Bulgarian database consists of the basic "common" set which is:

- Many Talker Set: 30 males, 30 females; each to read twice
the five blocks of numbers (each of which  contains 10 numbers),
3 connected passages and one “filler” passage.
- Few Talker Set: 5 males, 5 females, selected from the above
group: each to read 5 times the blocks of  numbers, 15 connected
passages and 2 “filler” passages, and 5 repetitions of the lists of
monosyllables.
- Very Few Talker Set: 1 male, 1 female, selected from Few
Talker set: each to read blocks of monosyllables in carrier sentences
and five repetitions of the context words.

And the extension part: semi-spontaneous answers to questions:
the answers were recorded by the 10 Few Talker Set speakers.

The other languages will be available soon.

=====================================
For further information, please contact:

     ELRA/ELDA	               Tel  +33 01 43 13 33 33
     55-57 rue Brillat-Savarin         Fax  +33 01 43 13 33 30
     F-75013 Paris, France           E-mail  mapelli at elda.fr

or visit the online catalogue on our Web site:

     http://www.icp.grenet.fr/ELRA/home.html
     or http://www.elda.fr
=====================================



More information about the Corpora mailing list