[Corpora-List] Corpus with Grammatically Ill-formed Sentences

Christian Hadiwinoto chrhad at comp.nus.edu.sg
Fri Oct 4 13:59:31 UTC 2013


Hi,

Just to add for your information, the NUCLE corpus also contains proposed
correction (gold-standard annotation) by English teachers.

Thanks.

Regards,

Christian Hadiwinoto
Postgraduate Student
School of Computing
National University of Singapore

-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of
Christian Hadiwinoto
Sent: Friday, 4 October, 2013 2:17 PM
To: CORPORA at UIB.NO
Subject: Re: [Corpora-List] Corpus with Grammatically Ill-formed Sentences

Hi,

You may also try the NUCLE corpus (NUS Corpus of Learner English) is a
collection of about 1,400 English learner essays written by the National
University of Singapore students. It is freely available for research
purpose. See the following reference:

Dahlmeier, Daniel, & Ng, Hwee Tou, & Wu, Siew Mei (2013). Building a Large
Annotated Corpus of Learner English: The NUS Corpus of Learner English.
Proceedings of the 8th Workshop on Innovative Use of NLP for Building
Educational Applications (BEA 2013). Atlanta, Georgia, USA, pages 22-31.

PDF: http://www.comp.nus.edu.sg/~nght/pubs/bea2013_nucle.pdf

Regards,

Christian Hadiwinoto
Postgraduate Student
School of Computing
National University of Singapore

-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of
Eric Atwell
Sent: Monday, September 30, 2013 6:50 PM
To: Jey Han Lau
Cc: corpora at uib.no
Subject: Re: [Corpora-List] Corpus with Grammatically Ill-formed Sentences

Hi Jey,

you could try our Arabic Learner Corpus
  http://www.comp.leeds.ac.uk/scayga/alc/

... or if for some reason you don't want to work with an Arabic corpus,
Sylviane Granger and her research group have compiled a list of other
Learner corpora around the world:
http://www.uclouvain.be/en-cecl-lcworld.html

  Eric Atwell, School of Computing, Leeds University



On Mon, 30 Sep 2013, Jey Han Lau wrote:

> Hi all,
>
> Quick question, does anyone know of a corpus that contain abundant and 
> real (i.e. not synthetic) sentences with grammatical errors (probably 
> something like a second language learner's corpus)?
>
> Cheers,
> Jey Han
>

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list