12.1216, FYI: ELRA News, Comp Ling Fall School

The LINGUIST Network linguist at linguistlist.org
Wed May 2 21:18:03 UTC 2001


LINGUIST List:  Vol-12-1216. Wed May 2 2001. ISSN: 1068-4875.

Subject: 12.1216, FYI: ELRA News, Comp Ling Fall School

Moderators: Anthony Aristar, Wayne State U. <aristar at linguistlist.org>
            Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
            Andrew Carnie, U. of Arizona <carnie at linguistlist.org>

Reviews (reviews at linguistlist.org):
	Simin Karimi, U. of Arizona
	Terence Langendoen, U. of Arizona

Editors (linguist at linguistlist.org):
	Karen Milligan, WSU 		Naomi Ogasawara, EMU
	Lydia Grebenyova, EMU		Jody Huellmantel, WSU
	James Yuells, WSU		Michael Appleby, EMU
	Marie Klopfenstein, WSU		Ljuba Veselinova, Stockholm U.
		Heather Taylor-Loring, EMU		

Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
          Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>

Home Page:  http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.



Editor for this issue: Lydia Grebenyova <lydia at linguistlist.org>

=================================Directory=================================

1)
Date:  Mon, 30 Apr 2001 16:05:41 +0200
From:  Magali Duclaux <duclaux at elda.fr>
Subject:  European Lang Resources Association (ELRA) News

2)
Date:  Wed, 2 May 2001 21:06:51 +0100 (WET DST)
From:  Miriam Butt <mutt at callisto.sprachwiss.uni-konstanz.de>
Subject:  Computational Linguistics Fall School at Konstanz

-------------------------------- Message 1 -------------------------------

Date:  Mon, 30 Apr 2001 16:05:41 +0200
From:  Magali Duclaux <duclaux at elda.fr>
Subject:  European Lang Resources Association (ELRA) News

ELRA
European Language Resources Association
ELRA News

We are happy to announce new resources available via ELRA in the framework
of the Aurora project.

  *** AURORA PROJECT DATABASE ***

[ELRA is releasing 5 databases made within the ETSI STQ-AURORA DSR working
group.]

AURORA PROJECT DATABASE 2.0

The Aurora project is releasing a revised version of the Noisy TI digits
database to follow on the work of ETSI. This CD set replaces the previous
set (version 1.0 consisted of 2 CDs, while version 2.0 consists of 4 CDs).
This database is intended for the evaluation of front-end feature
extraction algorithms in background noise, but it may also be used more
widely by speech researchers to evaluate and compare the performance of
noise-robust speech recognition algorithms.

Compared to version 1.0 the changes are as follows:

1) The files are restored to the energy level of the original speech
in the TI digits database.
2) One of the noise types added to the speech (the babble noise) has
been changed.
3) There is an additional test set in which the noises are mismatched
to those used in the training set.
4) There is a convolutional distortion test.
5) There is a clean training set.

The CD-ROM will be used for the next round of ETSI Aurora standards
evaluation.

AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR FINNISH DATABASE

This database is a subset of the SpeechDat-Car database in the Finnish
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected Finnish digits
spoken in the following driving conditions inside a car:

1.      0 km/hr with the car engine on
2.      40-60 km/hr with the car windows closed
3.      40-60 km/hr with the car windows open
4.      100-120 km/hr with no music in the background
5.      100-120 km/hr with music in the background

The database also contains the software needed to run simulations
using Entropic's HTK, which has been adopted as the "standard"
HMM recogniser for the Aurora standard evaluation.

AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR DANISH DATABASE

The Aurora project was originally set up to establish a world wide standard
for the feature extraction software which forms the core of the front-end
of a DSR (Distributed Speech Recognition) system. ETSI formally adopted
this activity as work items 007 and 008. The two work items within ETSI are:
-       ETSI DES/STQ WI007: Distributed Speech Recognition - Front-End
Feature Extraction Algorithm & Compression Algorithm
-       ETSI DES/STQ WI008: Distributed Speech Recognition - Advanced
Feature Extraction Algorithm.

This database is a subset of the SpeechDat-Car database in the Danish
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected Danish digits
spoken in the following noise and driving conditions inside a car:

1.      High speed good road
2.      Low speed rough road
3.      Stopped with motor running
4.      Town traffic

AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR GERMAN DATABASE

The Aurora project was originally set up to establish a world wide standard
for the feature extraction software which forms the core of the front-end
of a DSR (Distributed Speech Recognition) system. ETSI formally adopted
this activity as work items 007 and 008. The two work items within ETSI are:
-       ETSI DES/STQ WI007: Distributed Speech Recognition - Front-End
Feature Extraction Algorithm & Compression Algorithm
-       ETSI DES/STQ WI008: Distributed Speech Recognition - Advanced
Feature Extraction Algorithm.

This database is a subset of the SpeechDat-Car database in the German
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected German digits
spoken in the following noise and driving conditions inside a car:

1.      High speed good road
2.      Low speed rough road
3.      Stopped with motor running
4.      Town traffic

AURORA PROJECT DATABASE 3.0 - SUBSET OF SPEECHDAT-CAR SPANISH DATABASE

The Aurora project was originally set up to establish a world wide standard
for the feature extraction software which forms the core of the front-end
of a DSR (Distributed Speech Recognition) system. ETSI formally adopted
this activity as work items 007 and 008. The two work items within ETSI are:
-       ETSI DES/STQ WI007: Distributed Speech Recognition - Front-End
Feature Extraction Algorithm & Compression Algorithm
-       ETSI DES/STQ WI008: Distributed Speech Recognition - Advanced
Feature Extraction Algorithm.

This database is a subset of the SpeechDat-Car database in the Spanish
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected Spanish digits
spoken in the following noise and driving conditions inside a car:

1.      Quiet environment: stopped with motor running.
2.      Low noise: town traffic and low speed rough road.
3.      High noise: high speed good road.

=====================================
For further information, please contact:
ELRA/ELDA                      Tel:    +33 01 43 13 33 33
55-57 rue Brillat-Savarin      Fax:    +33 01 43 13 33 30
F-75013 Paris, France          E-mail: mapelli at elda.fr
or visit the online catalogue on our Web site:
http://www.icp.grenet.fr/ELRA/home.html
or http://www.elda.fr
=====================================



-------------------------------- Message 2 -------------------------------

Date:  Wed, 2 May 2001 21:06:51 +0100 (WET DST)
From:  Miriam Butt <mutt at callisto.sprachwiss.uni-konstanz.de>
Subject:  Computational Linguistics Fall School at Konstanz

REMINDER:

The early registration deadline for the Computational Linguistics Fall
School at the University of Konstanz is

			     JUNE 1, 2001


The Fall School will be held September 10-21, 2001.

Detailed information and registration forms can be found at:

       http://ling.uni-konstanz.de/pages/conferences/dgfs-cl00/

The registration rates go up after June 1.


Courses

     Stefan Müller (DFKI, Saarbruecken), Jonas Kuhn (IMS Stuttgart)
     Grammar Development in Constraint-Based Formalisms: HPSG and LFG

     Henning Reetz (Konstanz)
     From the Speech Signal to the Word

     Tibor Kiss (Bochum)
     PERL for Linguists

     Sabine Schulte im Walde, Heike Zinsmeister (IMS Stuttgart)
     Statistical Grammar Development and Corpus-Linguistic Information
     Extraction


Invited Talks:

      Mary Dalrymple (Xerox PARC)
      Louisa Sadler (Essex)


---------------------------------------------------------------------------
LINGUIST List: Vol-12-1216


