27.5020, FYI: Corpus of Spoken Spanish Available Online: ESLORA

The LINGUIST List via LINGUIST linguist at listserv.linguistlist.org
Thu Dec 8 13:06:22 EST 2016


LINGUIST List: Vol-27-5020. Thu Dec 08 2016. ISSN: 1069 - 4875.

Subject: 27.5020, FYI: Corpus of Spoken Spanish Available Online: ESLORA

Moderators: linguist at linguistlist.org (Damir Cavar, Malgorzata E. Cavar)
Reviews: reviews at linguistlist.org (Helen Aristar-Dry, Robert Coté,
                                   Michael Czerniakowski)
Homepage: http://linguistlist.org

*****************    LINGUIST List Support    *****************
                       Fund Drive 2016
                   25 years of LINGUIST List!
Please support the LL editors and operation with a donation at:
           http://funddrive.linguistlist.org/donate/

Editor for this issue: Yue Chen <yue at linguistlist.org>
================================================================


Date: Thu, 08 Dec 2016 13:06:13
From: Victoria Vázquez [victoria.vazquez at usc.es]
Subject: Corpus of Spoken Spanish Available Online: ESLORA

 
ESLORA is a corpus of Spanish made up of semi-structured interviews and
spontaneous conversations recorded in Galicia between 2007 and 2015,
orthographically transcribed and linked to audio files. The transcriptions
have been POS and morphologically tagged and lemmatized to facilitate the
retrieval of lexical and grammatical information.

The search engine enables the users to:

- run queries that combine social variables (age, gender, level of education
and role of the speaker) with lexical and grammatical categories;
- have direct access to the sound fragments that match the search results;
- download the search results in TSV format.

The application can be accessed at http://galvan.usc.es/eslora. The multiple
functions of the search engine are fully described in the User Guide
(galvan.usc.es/eslora/guide_description). 

To date, the material available on the internet comprises 479.840 tokens,
which correspond to 36 interviews.

The ESLORA corpus has been compiled by the Spanish Grammar Research Group at
the University of Santiago de Compostela through the ESLORA and ESLORA2
projects funded by the Ministry of Economy and Competitivity (FFI2010-17417 y
FFI2014-52287-P).

ESLORA: Corpus para el estudio del español oral <http://galvan.usc.es/eslora>,
version 1.0, November 2016, ISSN: 1988-1541.

The ESLORA project team
 



Linguistic Field(s): Text/Corpus Linguistics

Subject Language(s): Spanish (spa)





 



------------------------------------------------------------------------------

*****************    LINGUIST List Support    *****************
                       Fund Drive 2016
Please support the LL editors and operation with a donation at:
            http://funddrive.linguistlist.org/donate/

        Thank you very much for your support of LINGUIST!
 


----------------------------------------------------------
LINGUIST List: Vol-27-5020	
----------------------------------------------------------






More information about the LINGUIST mailing list