Corpora: ACL'2000: 2nd Chinese Language Processing Workshop-Last Call
Priscilla Rasmussen
rasmusse at cs.rutgers.edu
Fri Jun 30 22:27:08 UTC 2000
What's new in this call for papers?
. The submission deadline is extended to July 10, 2000.
. Please email us two POSTSCRIPT files: one includes the title
and the content, and the other includes only the ID page.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
LAST CALL FOR PAPERS
The second Chinese Language Processing Workshop
Sponsored by SIGLEX, SIGDAT and SIGPARSE.
October 2000
Hong Kong University of Science and Technology
In conjunction with ACL-2000
Growing interest in Chinese Language Processing is leading to the development
of resources such as annotated corpora and automatic segmenters, part-of-speech
taggers and parsers. The first Asian ACL provides an ideal opportunity to
bring together influential researchers from Taiwan, Singapore, Hong Kong, and
Beijing, as well as Chinese language researchers in the rest of the world, to
discuss issues that are specific to the processing of Chinese. A critical tool
for developing Chinese language processing tools is the availability of
annotated corpora. The greater the consensus we have around guidelines for
corpus annotation of part-of-speech tags, syntactic bracketing and other areas,
the more useful this corpora will be.
We welcome submissions that address the following topics on Chinese
language processing:
. word segmentation
. POS tagging
. phrase identification
. parsing
. grammar development
. lexicon acquisition
. corpus development
We invite workshop participants to take advantage of two bracketed corpora:
. The first one, the Chinese Penn Treebank, was developed at University of
Pennsylvania, USA. It includes 100-thousand words from Xinhua News.
The corpus has been released via LDC at UPenn.
For more information and the release announcement, please check the
website "http://www.ldc.upenn.edu/ctb/".
. The second one, developed by CKIP, Academia Sinica in Taiwan, contains
more than 30,000 sentences. A trial version of 1,000 sentences is now
available for downloading by the public at
http://godel.iis.sinica.edu.tw/CKIP/trees1000.htm
The release of the complete treebank is being reviewed by Academia Sinica.
Preliminary arrangements have been made for the treebank to be licensed
through ROCLING. Please check their website
(http://rocling.iis.sinica.edu.tw/ROCLING) for announcements.
The workshop will be held either on Oct 7 or Oct 8. For the latest updates
of the workshop, please check "http://www.ldc.upenn.edu/ctb/clp00.html".
If you have any questions concerning the workshop or the Treebank,
please email us at chinese at linc.cis.upenn.edu.
Submissions:
Submissions are limited to original, unpublished work. Papers may not exceed
3200 words (exclusive of title page and references). Papers outside the
specified length are subject to be rejected without review. The paper should
be written in English.
The style files for submission are the same as the ones for ACL regular
papers, which can be downloaded from
http://www.cs.ust.hk/acl2000/fcfp.html. The reviewing of papers will be blind.
Hence the title page and paper should not include the authors' names and
affiliations. Furthermore, self-references that reveal the author's identity
(e.g., "We previously showed (Smith, 1991) ...") should be avoided. Instead,
use citations such as "Smith previously showed (Smith, 1991) ...".
As reviewing will be blind, a separate title page and identification page
will be required. The title page should include the following information:
Title:
Paper ID Code: just put "00"
Topic Area: one or two general topic areas
Keywords: Up to 5 keywords specifying subject area
Which Session: just put "G"
Word Count, excluding title page and references:
Under Consideration for other Conferences (specify):
Abstract: short summary (up to 5 lines)
The identification page should contain all of the information in the title
page, but in addition must include the authors' names, affiliations, and email
addresses. The format for the identification page should be as follows:
Title:
Paper ID Code: just put "00"
Authors' names, affiliations, and email addresses
Topic Area: one or two general topic areas
Keywords: Up to 5 keywords specifying subject area
Which Session: just put "G"
Word Count, excluding title page and references:
Under Consideration for other Conferences (specify):
Abstract: short summary (up to 5 lines)
Submissions must be received by July 10, 2000. Six (6) paper copies
(printed on both sides of the page if possible) including the title page plus
two (2) identification pages should be submitted to the following address:
CLP-2000 submission
c/o Fei Xia
Institute for Research in Cognitive Science (IRCS)
3401 Walnut St. Suite 400A
Philadelphia, PA 19104, USA
Also, a postscript version of the paper (EXCLUDING the ID page) must be
emailed to chinese at linc.cis.upenn.edu on or before July 10. Please also email
us an ID page in a separate email. Once the submission is received,
a paper ID will be assigned to the paper and this ID number will be emailed to
the authors. The authors should include the ID number in the subject lines
in subsequent email exchanges.
Important Dates:
July 10, 2000 Submission of full-length paper
August 10, 2000 Acceptance notice
September 5, 2000 Camera-ready paper due
October 7 or 8 Conference date
Organizational committee:
Martha Palmer - UPenn
Mitch Marcus - UPenn
Fei Xia - UPenn
Aravind Joshi - UPenn
Program committee:
Keh-Jian Chen - Academia Sinica, Taiwan
Shengli Feng - U of Kansas, USA
Laurie Gerber - Systran, USA
Gan Kok Wee - Hong Kong Science and Technology University
Chang-Ning Huang - Microsoft Research, China
Chu-ren Huang - Academia Sinica, Taiwan
Wanying Jin - New Mexicon State University, USA
Tan Chew Lim - National University of Singapore
Kim-Teng Lua - National University of Singapore
John Kovarik - Department of Defense, USA
K.L. Kwok - Queens College, USA
Mary Ellen Okurowski - Department of Defense, USA
Fuji Ren - Hiroshima City University, Japan
Richard Sproat - AT&T Research Lab, USA
Bangalore Srinivas - AT&T Research Lab, USA
Keh-Yih Su - Behavior Design Corporation, Taiwan
Maosong Sun - Tsinghua University, China
Benjamin K Tsou - Hong Kong City Univeristy
Amy Weinberg - U. of Maryland, USA
Ralph Weischedel - BBN, USA
Andi Wu - Mircrosoft, USA
Dekai Wu - Hong Kong Science and Technology University
Nianwen Xue - U of Delaware, USA
Jin Yang - Systran, USA
Shiwen Yu - Peking University, China
Chunfa Yuan - Tsinghua University, China
Dong Zhendong - Hownet designer, China
Joe Zhou - Intel China Research Center, China
Qiang Zhou - Tsinghua University, China
More information about the Corpora
mailing list