[Corpora-List] the most user-friendly online corpora (monolingual and parrallel)

Linda Bawcom linda.bawcom at sbcglobal.net
Wed Nov 11 00:30:11 UTC 2009


Dear Xiaotian Guo,

Mark has been very modest regarding the BYU corpora. As one of those 43,000 users Mark is talking about, I can attest to its user-friendliness and usefulness. I have referred to it (COCA)  frequently for my doctoral thesis and when teaching. ( I even have it as a shortcut on my screen for quick access in class). The only thing you need to be a bit careful of with COCA is when comparing a lexical item's frequency by genre,  the "spoken" mainly comes from the news (so Mark's  BNC interface is perhaps a bit more reliable there).

Kindest regards,
Linda




________________________________
From: Mark Davies <Mark_Davies at byu.edu>
To: Xiaotian Guo <garlickfred at gmail.com>; Corpora list <corpora at uib.no>
Sent: Tue, November 10, 2009 7:55:40 AM
Subject: Re: [Corpora-List] the most user-friendly online corpora (monolingual and parrallel)

Dear Xiaotian Guo,

You might look at the corpora from:

http://corpus.byu.edu

(English corpora = BYU-BNC, Corpus of Contemporary American English (COCA), TIME Corpus, etc).

You asked about who is using what. In terms of usage of these three corpora, in the past month there were:

43,000 distinct users for the Corpus of Contemporary American English
21,000 for BYU-BNC
4,000 for TIME

I believe that BYU-BNC is the most widely-used online interface for the BNC, but I can't prove this; I don't have full usage data from the creators of other BNC interfaces. COCA and the BNC were being used at about the same rate until 8-9 months ago, whereas COCA is now used about twice as much as the BNC.

>> corpora available online such as the Bank of English, the British National Corpus, the American National Corpus

I don't believe that the ANC is available via an online interface, and the full Bank of English online costs about $1150 per year (via Wordbanks Online: http://www.collinslanguage.com/wordbanks/subscribe/default.aspx).

I hope this helps.

Mark Davies

============================================
Mark Davies
Professor of (Corpus) Linguistics
Brigham Young University
(phone) 801-422-9168 / (fax) 801-422-0906

http://davies-linguistics.byu.edu

** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
============================================ 

From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Xiaotian Guo
Sent: Tuesday, November 10, 2009 6:38 AM
To: Corpora list
Subject: [Corpora-List] the most user-friendly online corpora (monolingual and parrallel)

Dear corpora Colleagues

I am teaching a group of students in SOAS Translation Technology course which involves the use of corpora for human translation as a Computer-Aided Translation means. I am aware there are many many monolingual and parallel corpora available online such as the Bank of English, the British National Corpus, the American National Corpus and a number of parallel corpora between various language pairs. But I feel different people may have strong preferences to particular corpora for their own use due to the corpora's user-friendliness (or availability perhaps). Could anyone be kind enough to let me know your favourites and perhaps why? 

My students are native speakers of Chinese, Japanese, Korean, Arabic and Persian.

If anyone is teaching a similar course which involves the use of corpora for human translation, please feel free to share your experiences with me. I would be very grateful.

All the best

Xiaotian Guo
SOAS
New Vision Language Centre



_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20091110/28be3e34/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list