12.1373, FYI: ELRA News, Syntactic Database/Spanish
The LINGUIST Network
linguist at linguistlist.org
Fri May 18 21:24:41 UTC 2001
LINGUIST List: Vol-12-1373. Fri May 18 2001. ISSN: 1068-4875.
Subject: 12.1373, FYI: ELRA News, Syntactic Database/Spanish
Moderators: Anthony Aristar, Wayne State U.<aristar at linguistlist.org>
Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
Andrew Carnie, U. of Arizona <carnie at linguistlist.org>
Reviews (reviews at linguistlist.org):
Simin Karimi, U. of Arizona
Terence Langendoen, U. of Arizona
Editors (linguist at linguistlist.org):
Karen Milligan, WSU Naomi Ogasawara, EMU
Lydia Grebenyova, EMU Jody Huellmantel, WSU
James Yuells, WSU Michael Appleby, EMU
Marie Klopfenstein, WSU Ljuba Veselinova, Stockholm U.
Heather Taylor-Loring, EMU Dina Kapetangianni, EMU
Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>
Home Page: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Lydia Grebenyova <lydia at linguistlist.org>
=================================Directory=================================
1)
Date: Mon, 14 May 2001 10:23:22 +0200
From: Magali Duclaux <duclaux at elda.fr>
Subject: European Language Resources Association (ELRA) News
2)
Date: Tue, 15 May 2001 09:21:22 +0200 (MET DST)
From: Paula Santalla del Rio <fempsr at usc.es>
Subject: Syntactic Database of Current Spanish (SDB, release 3.5.1)
-------------------------------- Message 1 -------------------------------
Date: Mon, 14 May 2001 10:23:22 +0200
From: Magali Duclaux <duclaux at elda.fr>
Subject: European Language Resources Association (ELRA) News
***************************************************************************
ELRA
European Language Resources Association
ELRA News
****************************************************************************
We are happy to announce a new resource available via ELRA:
ELRA S0106 Dutch SpeechDat(II) MDB-250
A description of this database is given below.
The Dutch SpeechDat(II) MDB-250 comprises 250 Dutch speakers (125 males,
125 females) recorded over the Dutch mobile telephone network. The
recordings were made at SPEX, the Netherlands, and the recording
application was developed and run with Show 'N Tel. This database is
partitioned into 5 CDs The speech databases made within the SpeechDat(II)
project were validated by SPEX to assess their compliance with the
SpeechDat format and content specifications.
Speech samples are stored as sequences of 8-bit 8 kHz A-law. Each prompted
utterance is stored in a separate file. Each signal file is accompanied by
an ASCII SAM label file which contains the relevant descriptive information.
The following items were recorded:
8 application words (2 optional); 2 isolated digits; 1 sequence of 10
isolated digits; 3 connected digits: 1 telephone number (1-10 digits), 1
credit card number (1-16 digits), 1 digit PIN code (6 digits); 3 dates: 1
spontaneous date, 1 date, 1 relative date expression;
1 embedded application word; 3 spelled words: 1 forename (spontaneous), 1
city name, 1 word; 1 currency money amount; 1 natural number; 6 directory
assistance names: 1 forename (spontaneous), 1 city of birth, 1 most
frequent city, 1 city name, 1 company name, 1 forename surname; 2 yes/no
questions: 1 predominantly "yes" question, 1 predominantly "no" question; 9
phonetically rich sentences; 2 time phrases: 1 time of day (spontaneous), 1
time phrase; 4 phonetically rich words.
The following age distribution has been obtained: 5 speakers are under 16,
90 are between 16 and 30,
89 between 31 and 45, 56 between 46 and 60, and 10 are over 60. The lexicon
was created following
the guidelines in SD1.3.1 v4.3.
=====================================
For further information, please contact:
ELRA/ELDA Tel +33 01 43 13 33 33
55-57 rue Brillat-Savarin Fax +33 01 43 13 33 30
F-75013 Paris, France E-mail mapelli at elda.fr
or visit the online catalogue on our Web site:
http://www.icp.grenet.fr/ELRA/home.html
or http://www.elda.fr
=====================================
-------------------------------- Message 2 -------------------------------
Date: Tue, 15 May 2001 09:21:22 +0200 (MET DST)
From: Paula Santalla del Rio <fempsr at usc.es>
Subject: Syntactic Database of Current Spanish (SDB, release 3.5.1)
_________________________________________________________
Syntactic Database of Current Spanish (SDB, release 3.5.1)
__________________________________________________________
The research group on Spanish Syntax of the University of
Santiago de Compostela ( http://www.sintx.usc.es ) makes
available for the interested researchers the Syntactic
Spanish Database (SDB), the result of the work carried out
by the group throughout the last ten years. The data can be
looked up in
http://www.bds.usc.es/busquedas.html
Developed with the financial support of the Dirección Xeral
de Educación y Ordenación Universitaria of the Xunta de
Galicia and of the Dirección General de Investigación
Científica y Técnica of the Ministerio de Educación, SDB is
the result of the manual analysis, having in mind
constitutional and functional analysis principles, of the
syntactic characteristics of the almost 160.000 clauses
contained in the contemporary part of the Hispanic Texts
Archive of the University of Santiago, constituted by
approximately million and a half words of texts taken from
all the Hispanic countries and including oral samples as
well as novels, press and theater, all of them published
between 1980 and 1990.
In SDB the primary unit of description has been the clause
and the analysis has been encoded so as to make obvious the
organization of the syntactic functions around the verb form
functioning as the predicate of the clause. For every clause
in the database, the syntactic characteristics considered
relevant in this first stage of development have been
included. In first place, we have recorded general
information about the clauses: clause type, clause function,
voice, modality, verb inflection, syntactic functions found
in the clause and order of them. Next, we have recorded
detailed information about each of the syntactic functions
found in the clause: type of structural unit, determination,
animation, countability, preposition introducing the
syntactic function, etc. Current search possibilities (as
well as other not yet fully available but already on
schedule) are conceived from an internal point of view, that
is, they are devised so as to show the internal structure of
clauses organized around verb forms in the corpus: syntactic
schemes and subschemes of each verb documented in it, verbs
documented with one scheme or subscheme, verbs requiring one
preposition, etc.
These are, however, a web page and a search interface still
under development. For this reason, on the one hand, we
apologize for the fact that certain search options on the
menu are not yet ready, and on the other, we will be very
grateful for any suggestion about search possibilities that
you miss in the system and to which we have not given
priority in this first phase of distribution of results.
Currently, with the financial support of the Secretaría
Xeral de Investigación e Desenvolvemento of the Xunta de
Galicia, SDB has started a second phase of development in
which we will analyse in depth the syntactic and semantic
characteristics of the approximately 160.000 clauses
constituting the corpus under study.
- ----------------------------------------------------------------------
- ----------------------------------------------------------------------
- ----------------------------------------------------------------------
María Paula Santalla del Río
Dpto. de Lingua Española
Facultade de Filoloxía, Universidade de Santiago de Compostela
Avda. Burgo das Nacións, s/n,
Santiago de Compostela 15782
Tfno: (+34) 981 575340/563100, ext. 11908
Fax: (+34) 981 574646
- ---------------------------------------------------------------------
- ---------------------------------------------------------------------
- ---------------------------------------------------------------------
---------------------------------------------------------------------------
LINGUIST List: Vol-12-1373
More information about the LINGUIST
mailing list