29.1701, Software: Corpus of Regional African American Language
The LINGUIST List
linguist at listserv.linguistlist.org
Fri Apr 20 17:25:36 UTC 2018
LINGUIST List: Vol-29-1701. Fri Apr 20 2018. ISSN: 1069 - 4875.
Subject: 29.1701, Software: Corpus of Regional African American Language
Moderators: linguist at linguistlist.org (Damir Cavar, Malgorzata E. Cavar)
Reviews: reviews at linguistlist.org (Helen Aristar-Dry, Robert Coté,
Michael Czerniakowski)
Homepage: http://linguistlist.org
Please support the LL editors and operation with a donation at:
http://funddrive.linguistlist.org/donate/
Editor for this issue: Kenneth Steimel <ken at linguistlist.org>
================================================================
Date: Fri, 20 Apr 2018 13:25:27
From: Charlie Farrington [crf at uoregon.edu]
Subject: Corpus of Regional African American Language
The Corpus of Regional African American Language (CORAAL, version 2018.04.06)
is available at:
https://oraal.uoregon.edu/coraal
CORAAL is the first public corpus of African American Language (AAL). The
corpus features sociolinguistic interviews from regional varieties of AAL and
includes audio recordings along with orthographic transcription time-aligned
at the utterance level. All recordings have been anonymized and are available
in high-quality uncompressed (.wav) format. Transcripts are available in
three formats: Praat TextGrid (.TextGrid) files, ELAN (.eaf) files, and
plain text (.txt) files with tab-delimited fields.
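
As a rough illustration of working with the tab-delimited plain text
transcripts, the Python sketch below reads such a file into rows and computes
utterance durations. It assumes each file begins with a header row; the column
names used here (Spkr, StTime, Content, EnTime), the speaker labels, and the
sample lines are hypothetical stand-ins, not taken from the corpus, so check
the CORAAL documentation for the actual layout before relying on them.

```python
import csv
import io


def read_transcript(handle):
    """Read a tab-delimited transcript with a header row into a list of dicts."""
    # QUOTE_NONE keeps quotation marks in the transcribed speech as literal text.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return list(reader)


def utterance_durations(rows, start_col, end_col):
    """Return the duration in seconds (end minus start) of each utterance row."""
    return [float(row[end_col]) - float(row[start_col]) for row in rows]


# Hypothetical sample data; column names and contents are assumptions.
sample = (
    "Spkr\tStTime\tContent\tEnTime\n"
    "int_01\t0.50\tHow are you?\t1.75\n"
    "spk_01\t2.00\tI'm fine.\t3.25\n"
)

rows = read_transcript(io.StringIO(sample))
durations = utterance_durations(rows, "StTime", "EnTime")
```

For a file on disk, the same function can be given an open file handle, for
example open(path, encoding="utf-8", newline=""), where path points at a
downloaded .txt transcript.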
CORAAL is a long-term corpus-building project organized into several
components. The first two components of CORAAL focus on AAL in Washington DC,
the nation’s capital, a city with a long-standing African American majority,
and the site of much early research on AAL (e.g. Fasold 1972). The first
supplemental component of CORAAL, CORAAL:PRV, released in April 2018, makes
available data for a sample of speakers from a rural community in central
North Carolina. Together, these include data from over 100 sociolinguistic
interviews from speakers born between 1891 and 2005 and comprise approximately
a million words of transcribed conversational speech.
CORAAL is available for free, public use for research purposes under the
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International
License.
For additional questions, please contact the CORAAL development team directly
via email: corpusofregionalAAL at gmail.com
Linguistic Field(s): Sociolinguistics; Text/Corpus Linguistics
Subject Language(s): English (eng)
------------------------------------------------------------------------------