11.1315, FYI: Lang Resources/ELRA, Australian Ling Society
The LINGUIST Network
linguist at linguistlist.org
Tue Jun 13 17:07:18 UTC 2000
LINGUIST List: Vol-11-1315. Tue Jun 13 2000. ISSN: 1068-4875.
Subject: 11.1315, FYI: Lang Resources/ELRA, Australian Ling Society
Moderators: Anthony Rodrigues Aristar, Wayne State U.<aristar at linguistlist.org>
Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
Andrew Carnie, U. of Arizona <carnie at linguistlist.org>
Reviews: Andrew Carnie: U. of Arizona <carnie at linguistlist.org>
Associate Editors: Ljuba Veselinova, Stockholm U. <ljuba at linguistlist.org>
Scott Fults, E. Michigan U. <scott at linguistlist.org>
Jody Huellmantel, Wayne State U. <jody at linguistlist.org>
Karen Milligan, Wayne State U. <karen at linguistlist.org>
Assistant Editors: Lydia Grebenyova, E. Michigan U. <lydia at linguistlist.org>
Naomi Ogasawara, E. Michigan U. <naomi at linguistlist.org>
James Yuells, Wayne State U. <james at linguistlist.org>
Software development: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
Sudheendra Adiga, Wayne State U. <sudhi at linguistlist.org>
Qian Liao, E. Michigan U. <qian at linguistlist.org>
Home Page: http://linguistlist.org/
The LINGUIST List is funded jointly by Eastern Michigan University,
Wayne State University, and donations from subscribers and publishers.
Editor for this issue: Lydia Grebenyova <lydia at linguistlist.org>
=================================Directory=================================
1)
Date: June 13, 2000 16:20:12 +0200
From: Valerie Mapelli <mapelli at elda.fr>
Subject: Portuguese Corpus/Lexicon - ELRA News
2)
Date: June 13, 2000 16:20:16 +0200
From: Valerie Mapelli <mapelli at elda.fr>
Subject: AURORA Project Database - ELRA News
3)
Date: Mon, 12 Jun 2000 13:20:30 +0800
From: John Henderson <john.henderson at uwa.edu.au>
Subject: Australian Linguistic Society 1999 - Conf Proceedings
-------------------------------- Message 1 -------------------------------
Date: June 13, 2000 16:20:12 +0200
From: Valerie Mapelli <mapelli at elda.fr>
Subject: Portuguese Corpus/Lexicon - ELRA News
___________________________________________________________
ELRA
European Language Resources Association
ELRA News
___________________________________________________________
*** ELRA NEW RESOURCES ***
We are happy to announce new resources available via ELRA:
ELRA-W0024 PAROLE Portuguese Corpus
ELRA-L0035 PAROLE Portuguese Lexicon
A description of each database is given below.
_______________________________________
ELRA-W0024 PAROLE Portuguese Corpus
_______________________________________
The parole Portuguese corpus contains approximately 3 million
running words of European Portuguese distributed by Medium,
as follows:
- Newspaper: about 65%, covering the period 1996-1997 of 3 titles;
- Book: about 20%, concerning 12 titles from 3 editing houses;
- Periodical: about 5%, concerning 7 weekly issues of 1 title, 1996;
- Miscellaneous: about 10%, concerning several files distributed by 8 titles.
The corpus was classified and encoded according to the common
core parole encoding standard. The file format of this corpus is SGML.
A subcorpus of the PAROLE Portuguese Corpus, which reproduces
approximately the whole Corpus distribution by Medium
(Newspaper: about 65%, Book: ab. 20%, Periodical: ab. 5%,
Miscellaneous: ab. 10%) is also available.
It has about 250,000 words morpho-syntactically tagged accordingly
to the parole common tagset and morpho-syntactic annotation standards.
Disambiguation was manually checked.
_______________________________________
ELRA-L0035 PAROLE Portuguese Lexicon
_______________________________________
The PAROLE Portuguese Lexicon is constituted by 20 thousand
entries morpho-syntactically and syntactically encoded, accordingly
to the parole common encoding standards. The data is in SGML format.
=====================================
For further information, please contact:
ELRA/ELDA Tel +33 01 43 13 33 33
55-57 rue Brillat-Savarin Fax +33 01 43 13 33 30
F-75013 Paris, France E-mail mapelli at elda.fr
or visit the online catalogue on our Web site:
http://www.icp.grenet.fr/ELRA/home.html
or http://www.elda.fr
=====================================
-------------------------------- Message 2 -------------------------------
Date: June 13, 2000 16:20:16 +0200
From: Valerie Mapelli <mapelli at elda.fr>
Subject: AURORA Project Database - ELRA News
___________________________________________________________
ELRA
European Language Resources Association
ELRA News
___________________________________________________________
*** AURORA Project Database ***
ELRA is releasing two databases made within the ETSI STQ-AURORA DSR
working group.
_______________________________________
AURORA Project Database 2.0
_______________________________________
The Aurora project is releasing a revised version of the Noisy TI digits
database to follow on the work of ETSI. This CD set is a replacement for
the previous set (version 1.0 consisted of 2 CDs while version 2.0 now
consists of 4 CDs) .
This database is intended for the evaluation of algorithms for front-end
feature extraction algorithms in background noise but may also be used
more widely by speech researchers to evaluate and compare the performance
of noise robust speech recognition algorithms.
Compared to version 1.0 the changes are as follows:
1) The files are restored to the energy level of the original speech
in the TI digits database.
2) One of the noise types added to the speech has been changed (the babble
one)
3) There is an additional test sets where the noises are mismatched to
those used in the training set
4) There is a convolutional distortion test.
5) There is a clean training set
The CD ROM will be used for the next round of ETSI Aurora standards
evaluation.
_______________________________________
AURORA Project Database 3.0 - Subset of SpeechDat-Car
Finnish database
_______________________________________
This database is a subset of the SpeechDat-Car database in Finnish
language which has been collected as part of the European Union
funded SpeechDat-Car project. It contains isolated and connected
Finnish digits spoken in the following driving conditions inside a car:
1. 0 km/hr with the car engine on
2. 40-60 km/hr with the car windows closed
3. 40-60 km/hr with the car windows open
4. 100-120km/hr with no music in the background
5. 100-120km/hr with music in the background
The database also contains the software needed to run simulations
using the Entropic's HTK, which has been adopted as the "standard"
HMM recogniser for the Aurora standard evaluation.
=====================================
For further information, please contact:
ELRA/ELDA Tel +33 01 43 13 33 33
55-57 rue Brillat-Savarin Fax +33 01 43 13 33 30
F-75013 Paris, France E-mail mapelli at elda.fr
or visit the online catalogue on our Web site:
http://www.icp.grenet.fr/ELRA/home.html
or http://www.elda.fr
=====================================
-------------------------------- Message 3 -------------------------------
Date: Mon, 12 Jun 2000 13:20:30 +0800
From: John Henderson <john.henderson at uwa.edu.au>
Subject: Australian Linguistic Society 1999 - Conf Proceedings
The Proceedings of the 1999 Australian Linguistic Society Conference are
published at
http://www.arts.uwa.edu.au/LingWWW/als99/proceedings. All papers are
available in pdf format.
Contents:
The Lexicon and Quantity Implicatures
Keith Allan
A Preliminary Analysis of Lebanese Arabic Intonation
Dana Chahal
An Acoustic-Phonetic Descriptive Analysis of Pitch Realisations in
Kagoshima Japanese
Shunichi Ishihara
Constraints on the Pre-auxiliary Position in Warlpiri and the Nature of the
Auxiliary
Mary Laughren
Thematic Role Hierarchies and Role Engagement
Tom Mylne
Suffix Coherence and Stress in Australian Languages
Rob Pensalfini
"Just do it ...!" Discourse strategies for 'getting the message across' in
a factory production team
Maria Stubbe
False witness: when historical texts fail
Nicholas Thieberger
Set Marking Tags - 'and stuff'
Joanne Winter and Catrin Norrby
_______________________________
Department of Linguistics,
University of Western Australia
WA 6907
Ph. (08) 9380 2870 (direct)
(Int'l 61 8 9380 2870)
Fax (08) 9380 1154
(Int'l 61 8 9380 2870)
---------------------------------------------------------------------------
LINGUIST List: Vol-11-1315
More information about the LINGUIST
mailing list