[Corpora-List] CFP - ACL/IJCNLP-2009 Workshop - The People's Web meets NLP: Collaboratively Constructed Semantic Resources

Torsten Zesch zesch at tk.informatik.tu-darmstadt.de
Thu Dec 18 14:32:10 UTC 2008


ACL/IJCNLP-2009 Workshop

"The People's Web meets NLP:
Collaboratively Constructed Semantic Resources"

Co-located with Joint conference of the 47th Annual Meeting of the
Association for Computational Linguistics and the 4th International
Joint Conference on Natural Language Processing of the Asian
Federation of Natural Language Processing

Singapore
6-7 August 2009
http://www.ukp.tu-darmstadt.de/acl-ijcnlp-2009-workshop/


INTRODUCTION

In recent years, online resources collaboratively constructed by ordinary
users on the Web have considerably influenced the NLP community. In many
works, they have been used as a substitute for conventional semantic
resources and as semantically structured corpora with great success.
While conventional resources such as WordNet are developed by trained
linguists [1], online semantic resources can now be automatically
extracted from the content collaboratively created by the users [2].
Thereby, the knowledge acquisition bottlenecks and coverage problems
pertinent to conventional lexical semantic resources can be overcome.

The resource that has gained the greatest popularity in this respect
so far is Wikipedia. However, other resources recently discovered in
NLP, such as folksonomies, the multilingual collaboratively
constructed dictionary Wiktionary, or Q&A sites like WikiAnswers or
Yahoo! Answers are also very promising. Moreover, new wiki-based
platforms such as Citizendium or Knol have recently emerged that
offer features distinct from Wikipedia and are of high potential
in terms of their use in NLP.

The benefits of using Web-based resources come along with new
challenges, such as the interoperability with existing resources and
the quality of the knowledge represented. As collaboratively created
resources lack editorial control, they are typically incomplete. For
the interoperability with conventional resources, the mappings have
to be investigated. The quality of collaboratively constructed
resources is questioned in many cases, and the information extraction
remains a complicated task due to the incompleteness and semi-
structuredness of the content. Therefore, the research community has
begun to develop and provide tools for accessing collaboratively
constructed resources [2,5].

The above listed challenges actually present a chance for NLP
techniques to improve the quality of Web-based semantic resources.
Researchers have therefore proposed techniques for link prediction [3]
or information extraction [4] that can be used to guide the "crowds"
to construct resources that are better suited for being used in NLP
in return.

[1] Christiane Fellbaum
    WordNet An Electronic Lexical Database.
    MIT press, 1998.
[2] Torsten Zesch, Christof Müller and Iryna Gurevych
    Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
    Proceedings of the Conference on Language Resources and Evaluation
    (LREC), 2008.
    http://www.ukp.tu-darmstadt.de/software/jwpl/
    http://www.ukp.tu-darmstadt.de/software/jwktl/
[3] Rada Mihalcea and Andras Csomai
    Wikify!: Linking Documents to Encyclopedic Knowledge.
    Proceedings of the Sixteenth ACM Conference on Information and
    Knowledge Management, CIKM 2007.
[4] Daniel S. Weld et al.
    Intelligence in Wikipedia.
    Twenty-Third Conference on Artificial Intelligence (AAAI), 2008.
[5] Kotaro Nakayama et al.
    Wikipedia Mining - Wikipedia as a Corpus for Knowledge Extraction.
    Proceedings of the Annual Wikipedia Conference (Wikimania), 2008.
    http://wikipedia-lab.org/en/index.php


TOPICS

The workshop will bring together researchers from both worlds: those
using collaboratively created resources in NLP applications such as
information retrieval, named entity recognition, or keyword extraction,
and those using NLP applications for improving the resources or
extracting different types of semantic information from them. Hopefully,
this will turn into a feedback loop, where NLP techniques improved by
collaboratively constructed resources are used to improve the resources
in exchange.

Specific topics include but are not limited to:
 * Different types of collaboratively constructed resources, such as
   wiki-based platforms, Q&A sites or folksonomies;
 * Using collaboratively constructed resources in NLP such as
   information retrieval, text categorization, information
   extraction, etc.;
 * Analyzing the properties of collaboratively constructed resources
   related to their use in NLP;
 * Interoperability of collaboratively constructed resources with
   conventional semantic resources and between themselves;
 * Converting unstructured information into structured lexical
   semantic information; tools for mining social and collaborative
   content;
 * Quality issues with respect to collaboratively constructed resources.

We also encourage the submission of short papers describing publicly
available tools for accessing or analyzing collaboratively created
resources. During the breaks, tables can be provided for demonstrations.


INVITED SPEAKER

Rada Mihalcea, University of North Texas


SUBMISSION INFORMATION

Full paper submissions should follow the two-column format of ACL-IJCNLP
2009 proceedings without exceeding eight (8) pages of content plus one
extra page for references.  Short paper submissions should also follow
the two-column format of ACL-IJCNLP 2009 proceedings, and should not
exceed four (4) pages, including references.

Submission will be electronic using a submission software that will be
available later at the conference website. All accepted papers will be
presented orally and published in the workshop proceedings.


IMPORTANT DATES

Paper submission deadline (full and short): May 1, 2009
Notification of acceptance of papers:       June 1, 2009
Camera-ready copy of papers due:            June 7, 2009
ACL-IJCNLP 2009 Workshops:                  Aug 6-7, 2009


ORGANIZERS

Iryna Gurevych
Torsten Zesch

Ubiquitous Knowledge Processing Lab
Technical University of Darmstadt, Germany


PROGRAM COMMITTEE

Delphine Bernhard   Technische Universiät Darmstadt
Paul Buitelaar      DFKI Saarbruecken
Razvan Bunescu      University of Texas at Austin
Pablo Castells      Universidad Autónonoma de Madrid
Philipp Cimiano     Karlsruhe University
Irene Cramer        Dortmund University of Technology
Andras Csomai       Google Inc.
Ernesto De Luca     University of Magdeburg
Roxana Girju        University of Illinois at Urbana-Champaign
Andreas Hotho       University of Kassel
Graeme Hirst        University of Toronto
Ed Hovy             University of Southern California
Jussi Karlgren      Swedish Institute of Computer Science
Boris Katz          Massachusetts Institute of Technology
Adam Kilgarriff     Lexical Computing Ltd
Chin-Yew Lin        Microsoft Research
James Martin        University of Colorado Boulder
Olena Medelyan      University of Waikato
David Milne         University of Waikato
Saif Mohammad       University of Maryland
Dan Moldovan        University of Texas at Dallas
Kotaro Nakayama     University of Tokyo
Ani Nenkova         University of Pennsylvania
Guenter Neumann     DFKI Saarbruecken
Maarten de Rijke    University of Amsterdam
Magnus Sahlgren     Swedish Institute of Computer Science
Manfred Stede       Potsdam University
Benno Stein         Bauhaus University Weimar
Tonio Wandmacher    University of Osnabrueck
Rene Witte          Concordia University Montreal
Hans-Peter Zorn     European Media Lab, Heidelberg

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list