17.807, FYI: Third SIGHAN Chinese Language Processing Bakeoff

linguist at LINGUISTLIST.ORG
Thu Mar 16 19:28:58 UTC 2006


LINGUIST List: Vol-17-807. Thu Mar 16 2006. ISSN: 1068 - 4875.

Subject: 17.807, FYI: Third SIGHAN Chinese Language Processing Bakeoff

Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews (reviews at linguistlist.org) 
        Sheila Dooley, U of Arizona  
        Terry Langendoen, U of Arizona  

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Svetlana Aksenova <svetlana at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.


===========================Directory==============================  

1)
Date: 15-Mar-2006
From: Gina-Anne Levow < levow at cs.uchicago.edu >
Subject: Third SIGHAN Chinese Language Processing Bakeoff 

	
-------------------------Message 1 ---------------------------------- 
Date: Thu, 16 Mar 2006 14:26:41
From: Gina-Anne Levow < levow at cs.uchicago.edu >
Subject: Third SIGHAN Chinese Language Processing Bakeoff 
 


Call for Participation

The Third International Chinese Language Processing Bakeoff
Description and Important Dates

1. Introduction

This is the official announcement for the Third International Chinese
Language Processing Bakeoff, sponsored by the Special Interest Group for
Chinese Language Processing (SIGHAN) of the Association for Computational
Linguistics. The bakeoff will occur over the late spring of 2006 and the
results will be presented at the 5th SIGHAN Workshop, to be held at
ACL-COLING 2006 in Sydney, Australia, July 22-23, 2006.

The first bakeoff, held in 2003 and presented at the 2nd SIGHAN Workshop at
ACL 2003 in Sapporo, has become the pre-eminent measure for Chinese word
segmentation evaluation and has been cited in numerous papers. The second
bakeoff, held in 2005 and presented at the 4th SIGHAN Workshop at IJCNLP-05
on Jeju Island, Korea, demonstrated further progress in this task. In a
change from the first two evaluations, the third bakeoff will augment the
classic Word Segmentation task with a new Named Entity Recognition task.
Corpora from the following organizations will be available for use:

- Beijing University, China
- CKIP, Academia Sinica, Taiwan
- City University of Hong Kong, Hong Kong SAR
- Linguistic Data Consortium, United States
- Microsoft Research, China
- University of Pennsylvania and University of Colorado, Boulder, United States

The full details of the segmentation and named entity tagging tasks will be
made available through the registration site, which will open March 15, 2006.

Participants are required to submit a short paper describing their system
and analyzing their performance, and present a summary at the workshop. The
reports will be published in the SIGHAN workshop proceedings.

The language of the workshop is English. Papers must be submitted and
presented in English. Note that unlike the workshop proper, there will not
be a peer review process on the bakeoff reports.

2. Important Dates

2006-03-15            Registration Open
2006-04-17            Training data made available
2006-05-15            Testing data made available
2006-05-17            Test results due back to organizers
2006-05-19            Results privately reported to participants
2006-06-02            Final reports due from participants

3. Contact Information

The bakeoff is being organized by Gina-Anne Levow of the University of Chicago
and Olivia Oi Yee Kwong of City University of Hong Kong.

The web page for the competition is:

http://sighan.cs.uchicago.edu/bakeoff2006/

Questions on the bakeoff should be addressed to Gina-Anne Levow,
levow at cs.uchicago.edu 



Linguistic Field(s): Computational Linguistics
                     Morphology
                     Text/Corpus Linguistics





 



