26.221, FYI: New Website for Searching and Downloading the Arabic Learner Corpus

The LINGUIST List via LINGUIST linguist at listserv.linguistlist.org
Wed Jan 14 20:19:11 UTC 2015


LINGUIST List: Vol-26-221. Wed Jan 14 2015. ISSN: 1069 - 4875.

Subject: 26.221, FYI: New Website for Searching and Downloading the Arabic Learner Corpus

Moderators: Damir Cavar, Indiana U <damir at linguistlist.org>
            Malgorzata E. Cavar, Indiana U <gosia at linguistlist.org>

Reviews: reviews at linguistlist.org
Anthony Aristar <aristar at linguistlist.org>
Helen Aristar-Dry <hdry at linguistlist.org>
Sara Couture, Indiana U <sara at linguistlist.org>

Homepage: http://linguistlist.org

Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!

USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21

For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.

Editor for this issue: Uliana Kazagasheva <uliana at linguistlist.org>
================================================================


Date: Wed, 14 Jan 2015 15:18:08
From: Abdullah Alfaifi [afaifi at hotmail.com]
Subject: New Website for Searching and Downloading the Arabic Learner Corpus

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=26-221.html&submissionid=35999177&topicid=6&msgnumber=1
 
Dear LINGUIST List members,

We are pleased to announce the new website for searching the Arabic Learner
Corpus (ALC):
www.alcsearch.com

The website provides a simple search function (what you type is what you get);
however, it enables researchers to search the entire corpus or a subset data
using 26 determinants, 12 of them about the text author (e.g., age, gender,
nationality, mother tongue, nativeness, etc.), while 14 are about the text
itself (e.g., genre, timing, references use, mode, length, etc.).

As an additional function, the website also enables its users to download the
corpus or a subset of the data in different files formats: txt, xml, pdf (for
the hand-written texts) and mp3 (for the audio recordings).
The user can switch between Arabic and English interfaces. Also the user's
guide is in both Arabic and English.

ALC contains a collection of written essays and spoken recordings categorised
under two topics: A vacation trip (narrative) and My study interest
(discussion) by learners of Arabic in Saudi Arabia in 2012 and 2013. The
corpus includes 282,732 words, 29,627 types, and 1,585 materials. It was
produced by 942 students from 67 nationalities, and 66 different L1
backgrounds studying at pre-university and university levels. The average
length of a text is 178 words. Version 2.0 of the ALC contains raw data which
includes three parts: transcriptions of hand writing (76%), writing done on
computer (17%), and transcriptions of audio recordings (7%). 

For more information about the ALC please refer to the main website:
www.ArabicLearnerCorpus.com

For searching the corpus:
www.ALCsearch.com

User's Guide (English):
http://www.alcsearch.com/ALCfiles/ALC_User_Guides/User_Guide_En.pdf

User's Guide (Arabic):
http://www.alcsearch.com/ALCfiles/ALC_User_Guides/User_Guide_Ar.pdf
 



Linguistic Field(s): Text/Corpus Linguistics

Subject Language(s): Arabic, Standard (arb)





 






----------------------------------------------------------
LINGUIST List: Vol-26-221	
----------------------------------------------------------







More information about the LINGUIST mailing list