Corpora: KDD-2000 Workshop on Text Mining

David Elworthy davidelw at microsoft.com
Wed Mar 29 12:41:03 UTC 2000


(With the usual apologies if you receive multiple announcements...)

Call For Papers: KDD-2000 Workshop on Text Mining
=================================================
To be held at KDD-2000, Sixth ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining, August 20-23, 2000, Boston, MA, USA

Invited speakers:
Ronen Feldman, Instinct Software, Israel
David Lewis, AT&T Research, USA

More information at: http://www.cs.cmu.edu/~dunja/WshKDD2000.html

Workshop Description
--------------------
The growing importance of electronic media for storing and exchanging text
documents has led to a growing interest in tools and approaches for dealing
with the unstructured or semi-structured information contained in those
documents. In addition to well-organized and maintained text databases, one
of the important sources of textual information is the World Wide Web, which
is expected to continue to grow in both the number of users and the amount
of information available.

Methods developed for mining structured and unstructured data sets as well
as text learning and natural language processing techniques are essential
for analysis of textual data. While many approaches to text processing are
based on statistics and thus only weakly dependent on the language the data
is written in, those that involve deeper linguistic processing are typically
aimed at English texts. Furthermore, an important step towards exploiting
information from texts is automated information extraction from large
document sets and building more or less domain-specific knowledge bases.

This raises interesting and important questions about the scalability of
the approaches developed and their applicability to a variety of document
formats and languages.

Topics of interest
------------------
The objective of this workshop is to enable presentation and exchange of
ideas on various aspects of Text Mining. Our desire is to facilitate
communication among researchers and practitioners from related and
complementary research areas, who are working on similar problems but with
possibly different focuses and problem-solving approaches. More precisely,
we invite papers from the following four areas:

Text Mining (or Text Learning) (TM)
Information Retrieval (IR)
Natural Language Processing (NLP)
Information Extraction (IE).

Particular topics of interest for the workshop include but are not limited
to:
 text mining & information retrieval
 text mining & natural language processing
 text mining & web mining
 text representation
 text categorization
 text segmentation
 information extraction
 scalability of developed approaches
 performance evaluation measures
 feature selection
 multilingual approaches to text mining
 influence of domain and domain-specific text mining
 innovative applications of text mining.

The workshop will consist of invited talks, presentation of refereed papers,
and discussions. We hope that the program will stimulate future
collaboration among researchers on text mining problems.

Submission Guidelines
---------------------
Submissions should be sent by May 15, 2000, in electronic form as a PDF or
PostScript file to Dunja.Mladenic at cs.cmu.edu, with the subject line
"KDD-2000 workshop submission paper".

Each submission should indicate which of the four areas listed above it best
fits into (TM, IR, NLP, IE). The length and formatting of the submissions
should follow the KDD-2000 recommendations (max. 20 pages, 12-point font,
1-inch margin on each side).

Submitted papers will be reviewed by referees from the Program Committee.
Accepted papers will be published in the working notes provided by ACM. The
authors will be notified about the acceptance or rejection of their papers
by June 15, 2000. Camera-ready versions of the papers are due July 15, 2000.


