Corpora: CFP: NTCIR WORKSHOP 2
Noriko Kando
Noriko.Kando at nii.ac.jp
Tue May 30 13:12:41 UTC 2000
apology for multiple copies...
-----
CALL FOR PARTICIPATION
NTCIR Workshop 2
Evaluation of Chinese & Japanese Text Retrieval
and Text Summarization
May 2000 - Feb 2001
http://www.rd.nacsis.ac.jp/~ntcadm/workshop/cfp2-en.html
enquiries: ntcadm at rd.nacsis.ac.jp
An evaluation workshop in Chinese and Japanese text retrieval
and text summarization will be held from May 2000 to February
2001. Participation is invited from anyone interested in
Chinese and/or Japanese text retrieval and English-Chinese
and English-Japanese cross-lingual information retrieval from
large-scale collections and text summarization of Japanese texts.
WORKSHOP OBJECTIVES
- To encourage research in information retrieval, cross-lingual
information retrieval and text summarization by providing reusable
test collections.
- To provide a forum for research groups interested in comparing
results and exchanging ideas or opinions in an informal atmosphere.
- To improve the quality of the test collections based on the
feedback from participants.
DESCRIPTION OF THE COLLECTION (DATA)
- CHINESE IR TASKS (Chinese and English-Chinese IR)
- Chinese news documents will be used for the Chinese IR Tasks.
Details will be announce in June.
- JAPANESE & ENGLISH IR TASKS (Japanese, English and
English-Japanese IR)
- Training set: NTCIR-1 CD, more than 330,000 author abstracts
of conference papers; more than half are Japanese-English
paired (document alignments); alignments are known and
usable for training;
- Test set: NTCIR-1 and NTCIR-2.
NTCIR-2 consists of two document subfiles;
(1) ca.300,000 extended summaries of the research reports;
about 25% are Japanese-English paired.
(2) ca.100,000 author abstracts of conference papers;
more than half are Japanese-English paired; the
alignments are not announced before result submission
- Segmented Japanese texts are available for both Japanese
documents and topics in the NTCIR-1&2; each sentence is
segmented into terms and term components (similar to
phrases and words); use of this data is optional.
- TEXT SUMMARIZATION TASK
- Vairous types of articles in the Japanese newspapers.
WORKSHOP SCHEDULE
- By June 30, 2000: Submit application.
- NTCIR-1 are available to those who have returned required forms.
- For Chinese IR & Text Summarization: application deadline may
vary. announce later.
- August 10, 2000: NTCIR-2 CD (new documents and fifty topics)
will be distributed to the participants of Japanese & English
IR tasks.
- September 18, 2000: Search results submission (Japanese & English IR)
- January 10, 2001: Results of Relevance Assessments for the new
topics will be distributed to the participants
- February 21-23, 2001: Workshop meeting at NII, Tokyo, Japan.
Day 1: Open to public, Days 2-3: Active participants only
TASK DESCRIPTION
Below, is a brief summary of the tasks envisaged for the Workshop. A
participant will conduct one or more of the tasks or subtasks below.
Participation in only one subtask (for example Japanese monolingual IR
(J-J task)) is available:
- Chinese Information Retrieval Task: Chinese monolingual IR;
English-Chinese cross-lingual IR; to investigate the search
effectiveness of systems that search a static set of Chinese
documents using new Chinese and/or English topics.
- Japanese & English Information Retrieval Task: Japanese and/or
English monolingual IR; cross-lingual IR of single language document
and mixed-language documents of English and Japanese by Japanese
and/or English topics; to investigate the search effectiveness of
systems that search a static set of documents
- Text Summarization task (Japanese description only): automatic
text summarization of Japanese texts; the aim is (1) to collect
qualified text data for summarization in Japanese. we will have
newspaper articles summarized by hand, and make them available
for research purpose$B!
(B (2) to evaluate text summarization systems;
an extrinsic evaluation, task based evaluation.
TYPES OF PATICIPATION
- A. FULL: Submit retrieval results and describe the system. The
correspondence between the group name and the group ID will
be announced.
- B. ANONYMOUS: Submit retrieval results. The details of the system
may not be reported. The correspondence between the group name
and the group ID is not announced. This category is mainly for
the participants from the companies who have troubles to report
the details.
The list of the participating groups is made public but the evaluation
results will be announced using the group IDs only. Whichever of the
types of participation, every participating group must submit (1) a
paper for the workshop proceedings, (2) a system description form
which describes your system, and (3) bibliographic references and a
copy of all your papers using NTCIR test collections.
APPLICATIONS
Online application is available at:
http://www.rd.nacsis.ac.jp/~ntcadm/workshop/application2/app2-en.html
For the text version of application form, please complete and return
it via e-mail, fax, or postal mail to;
ATTN: Noriko Kando
NTCIR Project
National Institute of Informatics (NII)
2-1-2 Hitotsubashi, Chiyoda-ku,Tokyo 101-8430, Japan
email: ntcadm at rd.nacsis.ac.jp
fax: +81-3-3556-1916 phone: +81-3-4212-2529
TRAVEL SUPPORT
Financial support to attend the NTCIR Workshop meeting will be
available for the limited number of active oversea participants who
will present material at the workshop meeting in February, 2001, and
who are not receiving other funding to attend the NTCIR Workshop
meeting. Priority will be given to younger researchers. The detail
will be announced later.
ENQUIRIES
- Please send email to Noriko Kando, project manager, at
kando at nii.ac.jp, or to NTCIR Project administrators (
ntcadm at rd.nacsis.ac.jp).
- About "Chinese IR Task", please send email to the Task Chairs,
Hsin-Hsi Chen (hh_chen at csie.ntu.edu.twi) or Kuang-Hua Chen
(khchen at ccms.ntu.edu.tw ).
- About "Text Summarization Task", please send email to the Task
Chairs, Manabu Okumura (oku at pi.titech.ac.jp) or Takahiro Fukushima
(fukusima at res.otemon.ac.jp).
NOTES
- The first day of the Workshop meeting will be open forum of the
researchers who are interested in the topics. The second and third
days will be open only to the active participating groups that have
submited results and selected people from organizing agencies.
- The proceedings will be published online as well as printed-form.
- Dissemination of the research results using the NTCIR collections
other than in the Workshop's Proceedings is welcome. However, the
conditions of participation preclude specific advertising claims
based on the results using the Collection or the Workshop.
- International participants are welcome. Announcements will be in
English and Japanese, and English and Chinese for Chinese IR Task.
- The official language for the proceedings papers and presentation
at the Workshop meeting in February, 2001 is English.
- An evaluation of Korean text retrieval is organized by Prof Sung
Hyon Myaeng, Korea (shmyaeng at chungnam.ac.kr). We keep close
relationship each other.
For more information, please visit;
http://www.rd.nacsis.ac.jp/~ntcadm/workshop/cfp2-en.html
=====================================================================
More information about the Corpora
mailing list