Call: DMNLP'14, Workshop on Interactions between Data Mining and Natural Language Processing at ECML/PKDD

Thierry Hamon hamon at LIMSI.FR
Sun Apr 27 07:06:53 UTC 2014

Date: Sat, 26 Apr 2014 11:04:51 +0200
From: Yannick Toussaint <Yannick.Toussaint at>
Message-Id: <ECCEDC55-E52D-4E28-B272-597FD7591207 at>

[Apologies for multiple postings]

                 CALL FOR PAPERS

     Workshop on Interactions between Data Mining
        and Natural Language Processing

             ECML/PKDD 2014 Workshop
      September 15-19, 2014, Nancy, France



On the one hand, in the field of Natural Language Processing (NLP),
numerical Machine Learning methods (e.g., SVM, CRF) have been
intensively explored and applied. Despite the good results obtained by
the numerical methods, one major drawback is that they do not provide a
human-readable model. A promising direction is the integration of
symbolic knowledge.

On the other hand, research in Data Mining has progressed significantly
in recent decades, through the development of advanced algorithms and
techniques to extract knowledge from data in different forms. In
particular, for two decades Pattern Mining has been one of the most
active fields in Knowledge Discovery.

Recently, a new field has emerged that draws on both domains: Data
Mining and NLP. The objective of DMNLP is thus to provide a forum to
discuss how Data Mining can be useful for NLP tasks by providing
symbolic knowledge, and also how NLP can enhance data mining approaches
by providing richer and/or more complex information to mine and by
integrating linguistic knowledge directly into the mining process.

The workshop aims to bring together researchers from both communities
in order to stimulate discussion about the cross-fertilization of these
two research fields. The idea is to discuss future directions and new
challenges emerging from the cross-fertilization of Data Mining and NLP
and, at the same time, to initiate collaborations between researchers
from both communities.


The workshop promotes work in which the following two dimensions are
combined symbiotically. The first dimension is Data Mining, for
instance Pattern Mining (itemsets, sequences, trees, graphs, association
rules), classification (decision trees, FCA, ...), and inductive logic
programming. The second dimension is NLP, for example question answering
systems, translation, information extraction, linguistic analysis
(lexical analysis, terminology, syntax, semantics, discourse,
stylistics), classification, knowledge extraction/ontology building from
texts, information retrieval, corpus annotation, social/opinion mining.

A non-exhaustive list of topics that fit the scope of the workshop:

- Pattern discovery for NLP
- Constraint-based pattern mining in text
- Data mining query languages for expressing NLP tasks
- Data representation (sequences, trees, graphs) for NLP
- Modeling of text for data mining
- Relationships between data mining and NLP
- Modeling and visualizing data mining results on text
- Integrating NLP characteristics in data mining
- Data mining approaches for linguistic knowledge building
- Knowledge discovery for linguistic analysis (e.g. stylistic,
- Linguistically-informed text representations for Data Mining


Our main goal is to stimulate discussions, collaborations and the
sharing of experiences.
To that end, we invite three types of submission:

* unpublished works (max 16 pages, double submissions allowed)
* short papers and vision statements (max 8 pages)
* recently published works (special oral-only track, no page limits)

For more details on submission, see the DMNLP webpage at


* Deadline for submissions: Friday, June 20, 2014
* Author notification: Friday, July 11, 2014
* Final version: Tuesday, July 22, 2014
* Workshop date: Monday, September 15 or Friday, September 19, 2014


* Peggy Cellier, INSA Rennes, IRISA, Rennes, France
* Thierry Charnois, Université de Paris 13, LIPN, France
* Andreas Hotho, University of Kassel, Germany
* Stan Matwin, Dalhousie University, Canada
* Marie-Francine Moens, Katholieke Universiteit, Leuven, Belgium
* Yannick Toussaint, INRIA, LORIA, Vandoeuvre-les-Nancy, France


* Martin Atzmueller, University of Kassel, Germany
* Delphine Battistelli, MoDyCo-Université Paris Ouest, France
* Yves Bestgen, Université catholique de Louvain, Belgium
* Philipp Cimiano, University of Bielefeld, Germany
* Bruno Cremilleux, Université de Caen, France
* Beatrice Daille, LINA, France
* Yves Lepage, Waseda University, Japan
* Francois Jacquenet, Laboratoire Hubert Curien, Saint-Etienne, France
* Jiri Klema, Czech Technical University, Prague, Czech Republic
* Amedeo Napoli, LORIA Nancy, France
* Adeline Nazarenko, Université de Paris 13, LIPN, France
* Claire Nédellec, Institut National de Recherche Agronomique, France
* Pascal Poncelet, LIRMM Montpellier, France
* Maria Teresa Pazienza, University of Roma "Tor Vergata", Italy
* Stephen Poteet, Boeing, USA
* Solen Quiniou, LINA-Université de Nantes, France
* Mathieu Roche, TETIS, Montpellier, France
* Arnaud Soulet, Université François Rabelais, Tours, France
* Steffen Staab, University of Koblenz-Landau, Germany
* Koichi Takeuchi, Okayama University, Japan
* Isabelle Tellier, Lattice, Paris, France
* Johanna Völker, University of Mannheim, Germany
* Xifeng Yan, University of California, Santa Barbara, USA
* Pierre Zweigenbaum, LIMSI-CNRS, Paris, France
