[Corpora-List] CfP: NLP-TEA-2 Shared Task on Chinese Grammatical Error Diagnosis

Lung-Hao Lee lunghaolee at gmail.com
Mon Jan 5 09:26:04 UTC 2015


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The 2nd Workshop on Natural Language Processing
Techniques for Educational Applications (*NLP-TEA-2*)
with Shared Task on Chinese Grammatical Error Diagnosis (*CGED*)

*July 31, 2015* at *Beijing, China *
in conjunction with ACL-IJCNLP 2015

NLP-TEA-2: https://sites.google.com/site/nlptea2cged/
CGED: https://sites.google.com/site/nlptea2cged/shared-task
ACL-IJCNLP 2015: http://acl2015.org/

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

*Shared Task Description*

The goal of this shared task is developing the computer-assisted tools to
diagnose several kinds of grammatical errors, i.e., *redundant word,
missing word, word disorder, and word selection*. The input sentence
contains one of defined error types. The developed tool should indicate
which kind of error type is embedded in the given sentence and its occurred
positions. If the input contains no grammatical errors, the tool should
return: *sid, correct*. The output format should be a quadruple of*sid,
start_off, end_off, error_type* , if the input contains a grammatical
error.  In this output format, sid means the unique sentence identifier,
start_off and end_off represent the positions of starting and ending
character where a grammatical error occurs, in which each character or
punctuation occupies 1 for counting positions, and error_type should be one
of defined errors. Examples are shown as follows:
    *Example 1*:
        Input: (sid=B2-0880)    他是我的以前的室友
        output:  B2-0880, 4, 4, Redundant
        (Notes: "的" is a redundant character in this input sentence)
    *Example 2*:
        Input: (sid=A2-0017)    那電影是機器人的故事
        Output:  A2-0017, 2, 2, Missing
        (Notes: There is a missing word between "那" and "電影")
    *Example 3*:
        Input: (sid=A2-0017)    那部電影是機器人的故事
        Output:  A2-0017, correct
    *Example 4*:
        Input: (sid=B1-1193)    吳先生是修理腳踏車的拿手
        Output:  B1-1193, 11, 12, Selection
       (Notes: "拿手" is wrong word. One of correct words should be "好手")
    *Example 5*:
        Input: (sid=B2-2292)    所以我不會讓失望她
        Output:  B2-2292, Disorder
       (Notes: "失望她" is a word ordering error. The correct order should be
"她失望")

*Important Dates*

   - Registration Open: *January 5, 2015*
   - Release of Training Data: *February* *2, 2015*
   - Registration Close: *March 9, 2015*
   - Dry Run (Format Validation) Data Released: *April 13, 2015*
   - Release of Test Data: *April* *27**, 2015*
   - Testing Results Submission Due: *April 29**, 2015*
   - Release of Evaluation Results: *May 1**, 2015*
   - Technical Report Submission Due: *May 21, 2015*
   - Report Reviews Returned: *June 4**, 2015*
   - Camera-Ready Due: *June 21, 2015*
   - Workshop Date: *July 31, 2015*

*Workshop Co-organizers*

   - Hsin-Hsi Chen (National Taiwan University)
   - Yuen-Hsien Tseng (National Taiwan Normal University)
   - Yuji Matsumoto (Nara Institute of Science and Technology)
   - Lung Hsiang Wong (Nanyang Technological University)

*Shared Task Co-organizers*

   - Lung-Hao Lee (National Taiwan University)
   - Liang-Chih Yu (Yuan Ze University)
   - Li-Ping Chang (National Taiwan Normal University)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20150105/c3264eb9/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list