[Corpora-List] IJCNLP-04 Newsletter No.5
Eiko Yamamoto
eiko at crl.go.jp
Fri Jan 9 09:35:26 UTC 2004
*********** IJCNLP-04 Newsletter No.5 (9th of Jan. 2004)***************
The 1st International Joint Conference on Natural Language Processing
organized by the Asia Federation of NLP associations (AFNLP)
Website:
http://www.rcl.cityu.edu.hk/ijcnlp04
http://www.colips.org/conference/ijcnlp04/ (mirror site in Singapore)
http://www.isi.edu/natural-language/ijcnlp04 (mirror site at USC)
http://www.cipsc.org.cn/IJCNLP-04/
************************************************************************
[Date]
Main Conference: March 22-24, 2004
Workshops/symposium: March 25-26, 2004
[Venue]
Resort Golden Palm http://www.resortgp.com.cn/
Sanya, Hainan island, China
***Land's End - Hainan lies so far out to sea that ancient people,
who believed the earth was square, thought it was where
the land ends***
http://www.regenttour.com/chinaplanner/hainan/
[Sponsoring Organizations]
Association for Natural Language Processing of Japan (ANLP), Tokyo
Association for Computational Linguistics (ACL), Philadelphia
Association for Computational Linguistics and Chinese Language
Processing (ACLCLP), Taipei
Korea NLP society, Seoul
Chinese Information Processing Society of China (CIPSC), Beijing
(more to be added)
************************************************************************
This issue contains
[1] Conference Registration
[2] List of accepted papers
[3] Call for Papers: IJCNLP-04 Interactive Poster/Demo Sessions
Paper submission deadline: January 15, 2004
[4] Workshop on Named Entity Recognition for NLP Applications
Paper submission deadline: January 16, 2004 *** Extended ***
************************************************************************
[1] Conference Registration
The registration site for non-PRC participants is now open. Participants
can register for the main conference (with tutorials and workshops) and/or
the satellite symposium at
http://www.rcl.cityu.edu.hk/ijcnlp04/registration.htm. Accommodation can
also be booked simultaneously. PRC participants should register via the
local organising committee (please refer to
http://www.cipsc.org.cn/IJCNLP-04/). Enquiries on registration can be
directed to ijcnlp04.enquiry at cityu.edu.hk.
Note to authors of accepted papers:
At least one author from each accepted paper should register for the
conference by 24 January 2004 to avoid removal of the paper from the
program. Each author can register for one paper only.
************************************************************************
[2] List of accepted papers
To authors of accepted papers: If you find any errors in titles,
names, etc., please let us know.
(Please send an e-mail to isahara at crl.go.jp.)
[Oral Presentation]
A Three Level Cache-based Adaptive Chinese Language Model
Junlin Zhang, Weimin Qu, Le Sun, Lin Du, Yufang Sun
An Enhanced Semantic Indexing Implementation for Conceptual
Information Retrieval
Eric Jiang
Information Flow Analysis with Chinese Text
Paulo Cheong, Dawei Song, Peter Bruza, Kam-Fai Wong
Chinese New Word Identification Based on Character Parsing Model
Yao Meng, Hao Yu, Fumihito Nishino
A Study of Semi-Discrete Matrix Decomposition for LSI in Automated
Text Categorization
XiaoLong Wang, Yi Guan, Qiang Wang
Dit4Dah: Predictive Pruning For Morse Code Text Entry: Towards An
Entry System For the Seriously Impaired
Kumiko Tanaka-Ishii, Ian Frank
Capturing Long Distance Dependency in Language Modeling: An Empirical
Study
Jianfeng Gao, Hisami Suzuki
Automatic Genre Detection of Web Documents
Chul Su Lim, Kong Joo Lee, Gil Chang Kim
Statistical Substring Reduction in Linear Time
Lv Xueqiang, Zhang Le
Improve Noun Phrase Coreference Resolution by Matching Strings
Xiaofeng Yang, Jian Su, Guodong Zhou, Chew Lim Tan
SVM-based Biological Named Entity Recognition using Minimum
Edit-Distance Feature Boosted by Virtual Examples
Eunji Yi, Gary Geunbae Lee, Soo-Jun Park
Acquiring Bilingual Named Entity Translations from Content-aligned
Corpora
Tadashi Kumano, Hideki Kashioka, Hideki Tanaka, Takahiro Fukusima
BBS Based Hot Topic Retrieval Using Back-Propagation Neural Network
Lan You, Jiayin Ge, Yongping Du, Xuanjing Huang, Lide Wu
High Speed Unknown Word Prediction Using Support Vector Machine For
Chinese Text-to-Speech Systems
Juhong Ha, Yu Zheng, Gary Geunbae Lee
You don't have to think twice if you carefully tokenize
Stefan Klatt
Semantic roles and the beauty of trees
Rik De Busser, Marie-Francine Moens
The Automatic Acquisition of Verb Subcategorisations and their Impact
on the Performance of an HPSG Parser
Alex Chengyu Fang, John Carroll
Data-Oriented Parsing and the Penn Chinese Treebank
Mary Hearne, Andy Way
A Novel Pattern Learning Method for Open Domain Question Answering
Xuanjing Du, Xin Li Huang, Lide Wu, Yongping Du
FML-Based SCF Predefinition Learning for Chinese Verbs
Xiwu Han, Tiejun Zhao, Muyun Yang
Influence of Disambiguation on Cross-Language Information Retrieval
In-Su Kang, Seung-Hoon Na, Jong-Hyeok Lee
Natural Language Database Access using Semi-Automatically Constructed
Translation Knowledge
In-Su Kang, Jae-Hak J. Bae, Jong-Hyeok Lee
Visual Semantics and Ontology of Eventive Verbs
Minhua Ma, Paul Mc Kevitt
Word Folding: Taking the Snapshot of Words Instead of the Whole
Jin-Dong Kim, Jun'ichi Tsujii
A Novel Approach to Improve Word Translations Extraction from Non
Parallel, Comparable Corpora
Yun-Chuang Chiao, Jean-David Sta, Pierre Zweigenbaum
Automatic Learning of Parallel Dependency Treelet Pairs
Yuan Ding, Martha Palmer
Flexible Margin Selection for Reranking with Full Pairwise Samples
Libin Shen, Aravind K. Joshi
Discriminative Reranking for Machine Translation
Libin Shen, Anoop Sarkar, Franz Josef Och
Example-based Machine Translation without Saying Inferable Predicate
Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka, Hideki Tanaka
Zero Pronoun Resolution based on Automatically Constructed Case Frames
and Structural Preference of Antecedents
Daisuke Kawahara, Sadao Kurohashi
Chinese Chunk Identification Using SVMs plus Sigmoid
Yong-mei Tan, Tian-shun Yao, Qing Chen, Jing-bo Zhu
Corpus-oriented Grammar Development for Acquiring a Head-driven Phrase
Structure Grammar from the Penn Treebank
Yusuke Miyao, Takashi Ninomiya, Jun'ichi Tsujii
Specification Retrieval -- How to Find Attribute-Value Information on
the Web?
Minoru Yoshida, Hiroshi Nakagawa
Detection of Incorrect Case Assignments in Automatically Generated
Paraphrases of Japanese Sentences
Atsushi Fujita, Kentaro Inui, Yuji Matsumoto
Acquiring Hyponymy Relations from Web Documents
Keiji Shinzato, Kentaro Torisawa
Improving Word Sense Disambiguation by Pseudo Samples
Wang Xiaojie, Yuji Matsumoto
Chinese Named Entity Recognition Based on Multilevel Linguistic
Features
Honglei Guo, Jianmin Jiang, Gang Hu, Tong Zhang
Systematic Construction of Hierarchical Classifier in SVM-based Text
Categorization
Yongwook Yoon, Changki Lee, Gary Geunbae Lee
Syntactic Analysis of Long Sentences based on S-clauses
Mi-Young Kim, Jong-Hyeok Lee
The Role of Semantic Information in Question Classification
Xin Li, Dan Roth, Kevin Small
Spoken versus Written Queries for Mobile Information Access: an
Experiment with Mandarin Chinese
Heather Du, Fabio Crestani
Mining Biomedical Abstracts: What's in a Term?
Goran Nenadic, Irena Spasic, Sophia Ananiadou
Learning Cross-document Structural Relationships using Both Labeled
and Unlabeled Data
Zhu Zhang, Dragomir Radev
Implementing the Syntax of Japanese Numeral Classifiers
Emily M. Bender, Melanie Siegel
Phoneme-based Transliteration of Foreign Names in Cross Language
Information Retrieval
Wei Gao, Kam-Fai Wong, Wai Lam
Adding Syntax to Dynamic Programming for Aligning Comparable Texts
Dragomir R. Radev, Siwei Shen
Concept-based Sense Disambiguation for Korean Nouns
You-Jin Chung, Jong-Hyeok Lee
Categorizing Unknown Text Patterns for Information Extraction Using a
Search Result Mining Approach
Chien-Chung Huang, Shui-Lung Chuang, Lee-Feng Chien
Causal Relation Extraction Using Cue Phrases and Lexical Pair
Probabilities
Du-Seong Chang, Key-Sun Choi
Annotation of Gene Products in the Literature with Gene Ontology Terms
using Syntactic Dependencies
Jung-jae Kim, Jong C. Park
The Use of SVM for Chinese New Word Identification
Hongqiao Li, Chang-Ning Huang, Jianfeng Gao
A re-examination of IR techniques in QA system
Yi Chang, Hongbo Xu, Shuo Bai
Window-based Method for Information Retrieval
Qianli Jin, Jun Zhao, Bo Xu
Bilingual Sentence Alignment Based on Punctuation Statistics and
Lexicons
Thomas C. Chuang, Jian-Cheng Wu, Tracy Lin, Wen-Chie Shei, Jason
S. Chang
Iterative CKY parsing for Probabilistic Context-Free Grammars
Yoshimasa Tsuruoka, Jun'ichi Tsujii
Feature Selection and Machine Learning for Pronominalization
Ji-Eun Roh, Jong-Hyeok Lee
Comparing Entropies within the Chinese language
Benjamin K Tsou, Tom B Y Lai, Ka-po Chow
Unsupervised Event Extraction from Biomedical Literature using
Co-occurrence Information and Basic Patterns
Hong-woo Chun, Young-sook Hwang, Hae-chang Rim
Bilingual Chunk Alignment Based on Interactional Matching and
Probabilistic Latent Semantic Indexing
Feifan Liu, Qianli Jin, Jun Zhao, Bo Xu
An Example-based Study on Chinese Word Segmentation Using Critical
Fragments
Qinan Hu, Haihua Pan, Chunyu Kit
Unsupervised Segmentation of Chinese Corpus using Accessor Variety
Haodi Feng, Kang Chen, Chunyu Kit, Xiaotie Deng
A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity
Shaojun Zhao, Dekang Lin
An Interactive Proofreading System for Inappropriately Selected Words
on using Predictive Text Entry
Hideya Iwasaki, Kumiko Tanaka-Ishii
Acquiring Selectional Preferences in A Thai Lexical Database
Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich,
Hitoshi Isahara
Chinese Unknown Word Identification Using Class-based LM
Guohong Fu, Kang Kwong Luke
Practical Translation Pattern Acquisition from Combined Language
Resources
Mihoko Kitamura, Yuji Matsumoto
Harmonic Mean Weight through Hybrid Collaborative Filtering and
Content based Filtering in Recommender System
Kyung-Yong Jung, Jung-Hyun Lee
------
[Poster Presentation]
Chinese Treebanks and Grammar Extraction
Keh-Jiann Chen, Yu-Ming Hsieh
Recognition of HTML Table Structure
Hidetaka Masuda, Shuichi Tsukamoto, Hiroshi Nakagawa
Improving Back-Transliteration by Combining Information Sources
Slaven Bilac, Hozumi Tanaka
Automatic Method of Extracting Foreign Words from Korean Corpora
Ok-Keum Kim, Tetsuya Ishikawa, Sang-Yool Lee, Jong-Hyeok Lee
Headword Percolation in a Multi-Parser Architecture for Natural
Language Understanding
Helen Meng, Po-Chui Luk
Robust Speaker Identification System Based on Wavelet Transform and
Gaussian Mixture Model
Wan-Chen Chen, Ching-Tang Hsieh, Eugene Lai
A Graph Grammar Approach to Map between Dependency Trees and
Topological Models
Bernd Bohnet
Word-Spacing System with Statistics Extracted from the Processed
Training Data
Mi-Young Kang, Sung-ja Choi, Hyuk-chul Kwon
Word Sense Disambiguation using Heterogeneous Language Resources
Kiyoaki Shirai, Takayuki Tamagaki
Improving PinYin to Chinese Conversion with a Whole Sentence Maximum
Entropy Model
Zhang Le, Yao Tian-shun
Improving Quality of the Web Corpus
Youichi Sekiguchi, Kazuhide Yamamoto
Tagging Complex NEs with Maxent Models: Layered Structures versus
Extended Tagset
Xiong Deyi, Yu Hongkui, Liu Qun
Deep Analysis of Modern Greek
Valia Kordoni, Julia Neu
Using a Smoothing Maximum Entropy Model for Chinese Nominal Entity
Tagging
Jinying Chen, Nianwen Xue, Martha Palmer
Making Use of furigana
Gary Kacmarcik
Building a parallel bilingual syntactically annotated corpus
Martin Cmejrek, Jan Curin, Jiri Havelka, Vladislav Kubon
Processing Metonymic Expressions for the Matching of a QA System
Yoji Kiyota, Sadao Kurohashi, Fuyuko Kido
A Collaborative Ability Measurement for Co-Training
Dan Shen, Jie Zhang, Jian Su, Guodong Zhou, Chew-Lim Tan
Using a Paraphraser to Improve Machine Translation Evaluation
Andrew Finch, Yasuhiro Akiba, Eiichiro Sumita
Deterministic dependency structure analyzer for Chinese
Yuchang Cheng, Masayuki Asahara, Yuji Matsumoto
A Comparative Study on the Use of Labeled and Unlabeled Data for Large
Margin Classifiers
Hiroya Takamura, Manabu Okumura
Detecting sentence boundaries in Japanese speech transcriptions using
a morphological analyzer
Sachie Tajima, Hidetsugu Nanba, Manabu Okumura
Improving Relevance Feedback in the Language Modeling Approach:
Maximum a Posteriori Probability Criterion and Three-component
Mixture Model
Seung-Hoon Na, In-Su Kang, Jong-Hyeok Lee
A Persistent Feature-Object Database for Intelligent Text Archive
Systems
Takashi Ninomiya, Jun'ichi Tsujii, Yusuke Miyao
An English-Hindi Statistical Machine Translation System
Raghavendra Udupa, Tanveer A Faruquie
Fast Reinforcement Learning of Dialogue Policies using Linear Function
Approximation
Matthias Denecke, Kohji Dohsaka, Mikio Nakano
Collecting Evaluative Expressions for Opinion Extraction
Nozomi Kobayashi, Kentaro Inui, Yuji Matsumoto, Kenji Tateishi,
Toshikazu Fukushima
Mining Table Information on the Internet
Sung-won Jung, Hyuk-chul Kwon
Learning to Filter Junk E-Mail from Positive and Unlabeled Examples
Karl-Michael Schneider
The Hinoki Treebank: A Treebank for Text Understanding
Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara,
Shigeko Nariyama, Eric Nichols, Akira Ohtani, Takaaki Tanaka, Shigeaki
Amano
Selecting Prosody Parameters for Unit Selection Based Chinese TTS
Minghui Dong, Kim-Teng Lua
Parsing Mixed Constructions in a Type Feature Structure Grammar
Jong-Bok Kim, Jaehyung Yang
How Effective is Query Expansion for Finding Novel Information?
Min Zhang, Shaoping Ma
N-fold Templated Piped Correction
Dekai Wu, Grace Ngai, Marine Carpuat
User Adaptation in MT-mediated Communication
Kentaro Ogura, Yoshihiro Hayashi, Saeko Nomura, Toru Ishida
************************************************************************
[3] Call for Papers: IJCNLP-04 Interactive Poster/Demo Sessions
You may have a very cool demo but no time to package it into a full-size
paper for the main conference, or a late-breaking report without a
finished solid evaluation, or you may simply prefer to present
your work in an interactive style. In any of these cases, this session
offers an exciting channel to show your demos and ideas and get valuable
feedback at the same time.
The special session for interactive posters and demonstrations
at IJCNLP-04 welcomes poster/demo presentations with
novel ideas and profound applications, as well as work that is best
presented and discussed in an interactive style. This session will
provide a forum for academic and technical exchange. Presentations
from both academia and industry are welcome. The topics of
interest cover the same areas as the main conference
(http://www-tsujii.is.s.u-tokyo.ac.jp/ijc-nlp04/submission.html).
Authors should submit an original paper (no more than 4 pages in
IJCNLP format) that describes the research problem and the
novel methods. Presentations with demos should include an outline of
the system design and enough detail to allow evaluation of
technical soundness and usefulness. The equipment required for the
demonstration must be provided. The Interactive Poster/Demo papers
will be included in separate proceedings published in parallel with the
main conference proceedings.
Each submission will be blind reviewed by three reviewers. Reviewing
will be managed by an international program committee.
Presentation Style of the Interactive Poster/Demo Sessions
The Interactive Poster/Demo Sessions will run in the afternoons of
March 22 and March 23. Each presentation will receive a booth with a
1.8m x 0.6m desk and a 1.8m x 2.1m panel. A network connection and
electrical outlets will be supplied to each booth.
Submission Information
Submissions should follow the format of IJCNLP proceedings and
should not exceed four (4) pages, including references. Since the
reviewing will be blind, the paper should not include the authors'
names and affiliations. Furthermore, self-references that reveal the
author's identity should be avoided.
Submission Procedure
All papers must be submitted electronically via email to the
address given below. The paper must be sent as an attached PDF or PS
file. Please use the first author's surname to name the file. The
Subject field should be "IJCNLP-04 Poster/Demo submission". Please
include the name, affiliation and email address of the contact person
in the body of your email.
The Important Dates
Paper submission deadline: January 15, 2004
Notification of acceptance: February 15, 2004
Camera-ready papers due: February 28, 2004
Please submit papers to: mingzhou at microsoft.com
Program Committee for Interactive Poster/Demo sessions
Chair: Ming Zhou, Microsoft Research Asia
PC members:
Masaaki Nagata, NTT
Takenobu Tokunaga, Tokyo Institute of Technology
Genichiro Kikui, ATR
Sadao Kurohashi, The University of Tokyo
Donghong Ji, Kent Ridge Digital Labs
Jian-Yun Nie, Univ. of Montreal
Dekang Lin, Univ. of Alberta
Hsin-Hsi Chen, National Taiwan University
Lee-Feng Chien, Academia Sinica
Kam-Fai Wong, Chinese University of Hong Kong
Gary Geunbae Lee, POSTECH
Jong-Hyeok Lee, POSTECH
Maosong Sun, Tsinghua University
Jun Zhao, Institute of Automation, Chinese Academy of Sciences
Tiejun Zhao, Harbin Institute of Technology
Qun Liu, Chinese Academy of Sciences
Haifeng Wang, Toshiba R&D Centre
Kui-Lam Kwok, City University of New York
Dan Moldovan, University of Texas at Dallas
Chin-Yew Lin, ISI/USC
Tilman Becker, DFKI
Rens Bod, University of Amsterdam
Harry Bunt, Tilburg University
Christian Boitet, Universite Joseph Fourier
Michael Zock, LIMSI
************************************************************************
[4] Workshop on Named Entity Recognition for NLP Applications
*** SUBMISSION DEADLINE NOW EXTENDED TO 16 JANUARY 2004 ***
Workshop website: http://personal.cityu.edu.hk/~rlolivia/W4_NE.htm
Named Entities (NEs) make up a considerable proportion of natural
language text and remain an important area in natural language
processing (NLP). The recognition of proper names as unknown words
has long been an issue in word segmentation and part-of-speech tagging,
especially for non-alphabetic Asian languages and interlingual NLP
involving these languages. Named entities constitute significant pieces
of data in information extraction. Proper transliteration of named
entities, especially proper names, is critical for the intelligibility
and accuracy of machine translation output. This workshop aims to bring
researchers together to discuss the issues and advances in NE
recognition and extraction, and how NEs can be handled most
cost-effectively in a variety of NLP applications.
Papers are invited for original and unpublished research on all aspects
of NE recognition and extraction, including but not limited to:
- Symbolic and statistical models for NE recognition
- NE recognition systems
- Translation of NEs across multiple languages
- Resources (lexicons, grammars) for NE extraction
- NE recognition as a subtask in NLP applications
- Evaluation of NE processing in NLP applications
Submission Method
Papers should be written in English and may not exceed 7 pages
(including references, and using 11pt or 12pt for the main text).
Simultaneous submission to other conferences or workshops must be
clearly indicated on the identification page (see below).
However, a paper accepted for presentation at this workshop
must not be presented, or have been presented, at any other meeting
with publicly available published proceedings.
We strongly recommend the use of the LaTeX style files or MS-Word
document template for IJCNLP-04.
These style files can be downloaded from
http://www-tsujii.is.s.u-tokyo.ac.jp/ijc-nlp04/submission.html.
As reviewing will be blind, self-references that reveal the author's
identity (e.g., "We previously showed (Smith, 1991) ...") should be
avoided in the submission.
Instead, use references like "Smith previously showed (Smith, 1991) ...".
Please include, on a separate identification page, the following
information: title, name(s) of author(s), affiliation(s), email
address(es), up to 5 keywords, whether the paper is under consideration
for other conferences, and a short summary of the paper.
Please submit your paper electronically to rlolivia at cityu.edu.hk by 16
January 2004. Acceptable file formats are Portable Document Format
(.pdf), PostScript (.ps), and MS Word (.doc), with all non-ASCII fonts
embedded.
Important Dates
Submission Deadline: 16 January 2004 *** Extended ***
Notification of Acceptance: 31 January 2004
Camera-Ready Paper Due: 10 February 2004
Workshop Date: 26 March 2004
Program Committee for Workshop on Named Entity Recognition for NLP Applications
Chair: Benjamin Tsou (City University of Hong Kong)
PC members:
Roberto Basili (University of Rome Tor Vergata, Rome)
Ralph Grishman (New York University, New York)
Kevin Humphreys (Microsoft, Redmond)
Hideki Isozaki (NTT Communication Science Labs, Kyoto)
Gary Geunbae Lee (POSTECH, Pohang)
Masaaki Nagata (NTT Cyber Space Laboratories, Kanagawa)
Hwee Tou Ng (National University of Singapore, Singapore)
Thierry Poibeau (Institut National des Langues et Civilisations
Orientales, Paris)
Manabu Sassano (Fujitsu Laboratories Ltd., Kawasaki)
Satoshi Sekine (New York University, New York)
Rou Song (Beijing Language and Culture University, Beijing)
Kiyotaka Uchimoto (Communications Research Laboratory, Kyoto)
Takehito Utsuro (Kyoto University, Kyoto)
Jingbo Zhu (Northeastern University, Shenyang)
************************************************************************