29.1701, Software: Corpus of Regional African American Language

The LINGUIST List linguist at listserv.linguistlist.org
Fri Apr 20 17:25:36 UTC 2018


LINGUIST List: Vol-29-1701. Fri Apr 20 2018. ISSN: 1069 - 4875.

Subject: 29.1701, Software: Corpus of Regional African American Language

Moderators: linguist at linguistlist.org (Damir Cavar, Malgorzata E. Cavar)
Reviews: reviews at linguistlist.org (Helen Aristar-Dry, Robert Coté,
                                   Michael Czerniakowski)
Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           http://funddrive.linguistlist.org/donate/

Editor for this issue: Kenneth Steimel <ken at linguistlist.org>
================================================================


Date: Fri, 20 Apr 2018 13:25:27
From: Charlie Farrington [crf at uoregon.edu]
Subject: Corpus of Regional African American Language

 
The Corpus of Regional African American Language (CORAAL, version 2018.04.06)
is available at:

https://oraal.uoregon.edu/coraal

CORAAL is the first public corpus of African American Language (AAL). The
corpus features sociolinguistic interviews from regional varieties of AAL and
includes audio recordings along with orthographic transcription time-aligned
at the utterance level. All recordings have been anonymized and are available
in high-quality uncompressed (.wav) format, and transcripts are available in
three formats, Praat TextGrid (.TextGrid) files, ELAN (.eaf) files, and as
plain text (.txt) files with tab-delimited fields.

CORAAL is a long-term corpus-building project conceived of in terms of several
components. The first two components of CORAAL focus on AAL in Washington DC,
the nation’s capital, a city with a long-standing African American majority,
and the site of much early research on AAL (e.g. Fasold 1972). The first
supplemental component of CORAAL, CORAAL:PRV, released in April 2018, makes
available data for a sample of speakers from a rural community in central
North Carolina. Together, these include data from over 100 sociolinguistic
interviews from speakers born between 1891 and 2005 and comprise approximately
a million words of transcribed conversational speech. 

CORAAL is available for free, public use for research purposes. It is
available under the Creative Commons Attribution-NonCommercial ShareAlike 4.0
International License.

For additional questions, please contact the CORAAL development team directly
via email: corpusofregionalAAL at gmail.com


Linguistic Field(s): Sociolinguistics
                     Text/Corpus Linguistics

Subject Language(s): English (eng)



------------------------------------------------------------------------------

*****************    LINGUIST List Support    *****************
Please support the LL editors and operation with a donation at:

              The IU Foundation Crowd Funding site:
       https://iufoundation.fundly.com/the-linguist-list

               The LINGUIST List FundDrive Page:
            http://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-29-1701	
----------------------------------------------------------






More information about the LINGUIST mailing list