[Corpora-List] CFP: LRE journal - Special Issue on Analysis of short texts on the Web
prosso at dsic.upv.es
Fri Nov 19 18:27:07 UTC 2010
Language Resources and Evaluation Journal
Special issue on Analysis of short texts on the Web
CALL FOR PAPERS
The huge volume of information available on the Web is continuously
growing. There is great interest in analyzing this information in
order to fulfil specific user needs. The challenges that researchers
must deal with when analyzing the content of Web pages are related to
the fact that quite often they are written in natural language, and
very often without any specific helpful structure. In other words, it
is a problem of processing almost pure raw data, often just short
texts which make the task quite challenging. In fact, short texts
typically contain a small number of words whose absolute frequency is
relatively low in comparison with their frequency in long documents.
This makes tasks such as text categorization harder.
The exponential growth in the number of Web documents furnishes
abundant proof of the necessity of analyzing short texts. For
instance, digital libraries and Web-based repositories of scientific
and technical information provide free access only to abstracts and
not to the full texts of the documents. News, document titles,
snippets, FAQs, chats, abstracts etc. are some examples of the high
volume of short texts available on the Web.
With the so-called Web 2.0, the largest communication and
collaborative platform, new short texts are created on a daily basis as
on-line evaluations of commercial products, blog posts or comments
in social networks. Twitter, for instance, is a new successful social
network technology of the Web 2.0 genre which is used by millions of
people and thousands of companies to publish very short messages with
the purpose of sharing experiences and/or opinions about a product or
service. Due to the huge amount of information available in social
media, there is a clear need for mining useful information from these
messages in order to discover knowledge about the collective thinking
of the crowds. Tweet analysis is considered to be potentially very
important because comments, opinions, suggestions and complaints can
be used to define new marketing strategies or to obtain information on
companies' reputation.
In recent years there has been considerable interest from the
computational linguistics community in the efficient analysis of short
texts. In fact, several tracks have been organized within the
different evaluation campaigns: TREC (blog and Web tracks),
CLEF (Web people search laboratory), NTCIR (opinion analysis pilot
task), INEX (ad-hoc passage retrieval task), ROMIP (track on news
clustering), and FIRE (ad-hoc task on retrieval from technical forums
and mailing lists).
This special issue aims to collect state-of-the-art contributions to
the development and use of techniques for the analysis of short texts
on the Web, with special emphasis on resources of the collaborative
platform of the Web 2.0. Thus, we welcome contributions that include,
but are not limited to, resources of short texts such as blog
posts, tweets, text messages, etc., as well as innovative techniques
using linguistic resources for improved understanding of mono- or
multilingual short texts.
TOPICS OF INTEREST
We are particularly interested in articles showing the benefits of
using such resources and techniques that include, but are not limited to,
the following topics:
* Categorization of short texts
* Cross-lingual short text mining on the Web
* Analysis of weblogs, tweets, text messages and snippets
* Knowledge discovery from Web 2.0
* Opinion mining in social media
* Enterprise 2.0 and market analysis
* Automatic generation of collaborative linguistic resources
* Evaluation of techniques and short text resources
IMPORTANT DATES
* Submission deadline (abstract): March 15, 2011
* Submission deadline (full paper): March 31, 2011
* First-round reviews due: May 31, 2011
* Revised versions due: July 15, 2011
* Second-round reviews due: September 15, 2011
* Final versions due: October 31, 2011
* Special issue publication: sometime in 2012
PROGRAM COMMITTEE
Eneko Agirre, University of the Basque Country
Mikhail Alexandrov, Autonomous University of Barcelona
Enrique Alfonseca, Google Zurich
Yassine Benajiba, Philips Research North America
Andrew Borthwick, Intelius
Pavel Braslavski, Yandex
Paul Clough, University of Sheffield
José Carlos Cortizo, BrainSins
Alexander Gelbukh, National Polytechnic Institute
Alfio Massimiliano Gliozzo, IBM Watson
Julio Gonzalo, UNED
Chu-Ren Huang, The Hong Kong Polytechnic University
Hitoshi Isahara, Toyohashi University of Technology
Jaap Kamps, University of Amsterdam
Pavel Makagonov, Mixtec Technological University
Prasenjit Majumder, DAIICT Gandhinagar
Antonia Martí, University of Barcelona
Patricio Martínez, University of Alicante
Rada Mihalcea, University of North Texas
Mandar Mitra, Indian Statistical Institute
Manuel Montes y Gómez, INAOE Puebla
Roberto Navigli, University of Rome La Sapienza
Boris Novikov, St. Petersburg University
Ted Pedersen, University of Minnesota
Marco Pennacchiotti, Yahoo! Labs Santa Clara
Efstathios Stamatatos, University of the Aegean
Benno Stein, Bauhaus-Universität Weimar
José Antonio Troyano, University of Seville
Dan Tufiș, Romanian Academy
Jan Wiebe, University of Pittsburgh
Xiaofang Zhou, University of Queensland
Xiaoyan Zhu, Tsinghua University Beijing
GUEST EDITORS
Paolo Rosso, Universidad Politécnica de Valencia, Spain
Marcelo Errecalde, Universidad Nacional de San Luis, Argentina
David Pinto, Benemérita Universidad Autónoma de Puebla, Mexico
SUBMISSION INFORMATION
Please follow the submission instructions available from the LRE
webpage at http://chum.edmgr.com/
For the submission of the abstract and additional information, please
contact David Pinto (dpinto at cs.buap.mx)
---
Paolo Rosso
Head of Natural Language Engineering Lab.
Dpto. Sistemas Informáticos y Computación
Universidad Politécnica Valencia
Spain
URL: http://www.dsic.upv.es/~prosso
email: prosso [at] dsic.upv.es
fax: +34 963877359
tel: +34 963877007 ext. 73571