<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><br>*********************************************************************************************************************<br><br>               FINAL CALL FOR PAPERS <br><span class="Apple-tab-span" style="white-space: pre; ">        </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><br><span class="Apple-tab-span" style="white-space: pre; ">       </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span>AND<br><span class="Apple-tab-span" style="white-space: pre; ">    </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><br><span class="Apple-tab-span" style="white-space: pre; ">       </span>*** SHARED TASK -- CALL FOR PARTICIPATION ***<br><br>                  ACL Workshop on<br>        Distributional Semantics and Compositionality (DiSCo'2011)<br><span class="Apple-tab-span" style="white-space: pre; ">  </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><a href="http://disco2011.fzi.de/">http://disco2011.fzi.de/</a><br><br><span class="Apple-tab-span" style="white-space: pre; ">        </span><span class="Apple-tab-span" style="white-space: pre; "> </span>June 24, 2011, Portland, Oregon, USA<div><font class="Apple-style-span" color="#5435DC" face="Times" size="4"><span class="Apple-style-span" style="font-size: 15px; line-height: 24px; "><font class="Apple-style-span" color="#000000" face="Helvetica"><span class="Apple-style-span" style="line-height: normal; font-size: medium; "><br></span></font></span></font><div><br>Test data release:<span class="Apple-tab-span" style="white-space: pre; ">      </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span>March 31, 2011<br>Regular paper submission deadline:<span class="Apple-tab-span" style="white-space: pre; ">       </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span>April 1, 2011<br>Test data submission and system description deadline:<span class="Apple-tab-span" style="white-space: pre; ">     </span>April 8, 2011<br>Notification of acceptance:<span class="Apple-tab-span" style="white-space: pre; ">       </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span>Apr 25, 2011<br>Camera-ready deadline:<span class="Apple-tab-span" style="white-space: pre; ">     </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span><span class="Apple-tab-span" style="white-space: pre; "> </span>May 06, 2011<br><font class="Apple-style-span" color="#5435DC" face="Times" size="4"><span class="Apple-style-span" style="font-size: 15px; line-height: 24px; "><br></span></font>**********************************************************************************************************************<br><br></div><div><span class="Apple-style-span" style="font-family: Times; color: rgb(84, 53, 220); font-size: 15px; line-height: 24px; ">*** We are pleased to announce </span><span class="Apple-style-span" style="font-family: Times; color: rgb(84, 53, 220); font-size: 15px; line-height: 24px; "><b>Dominic Widdows</b></span><span class="Apple-style-span" style="font-family: Times; color: rgb(84, 53, 220); font-size: 15px; line-height: 24px; "> as the invited speaker at DiSCo'2011 ***</span><br><br>** Introduction **<br><br>Any NLP system that does semantic processing relies on the assumption of semantic compositionality: the meaning of a phrase is determined by the meanings of its parts and their combination. However, this assumption does not hold for lexicalized phrases such as idiomatic expressions, which causes pain points not only for semantic, but also for syntactic processing, see (Sag et al. 2001). In particular, while distributional methods in semantics have proved to be very efficient in tackling a wide range of tasks in natural language processing, e.g., document retrieval, clustering and classification, question answering, query expansion, word similarity, synonym extraction, relation extraction, textual advertisement matching in search engines, etc. (see Turney and Pantel 2010 for a detailed overview), they are still strongly limited by being inherently word-based. While dictionaries and other lexical resources contain multiword entries, these are expensive to obtain, not available for all languages to a sufficient extent, the definition of a multiword varies across resources and non-compositional phrases are merely a subclass of multiwords. The workshop brings together researchers that are interested in extracting non-compositional phrases from large corpora by applying distributional models that assign a graded compositionality score to a phrase as well as researchers interested in expressing compositional meaning with such models. This score denotes the extent to which the compositionality assumption holds for a given expression. The latter can be used, for example, to decide whether the phrase should be treated as a single unit in applications. We emphasize that the focus is on automatically acquiring semantic compositionality. Approaches that employ prefabricated lists of non-compositional phrases should consider a different venue.<br><br>This event consists of a main session and a shared task.<br><br>References:<br>Ivan A Sag, Timothy Baldwin, Francis Bond, Ann Copestake, Dan Flickinger (2001): Multi-word Expressions: A Pain in the Neck for NLP. In Proc. of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2002), Mexico City, Mexico<br><br>Turney, P. and P. Pantel. (2010). From Frequency to Meaning: Vector Space Models of Semantics. Journal of Artificial Intelligence Research, 37, 141-188.<br><br>** Call for Papers **<br><br>For the main session, we invite submission of papers on the topic of automatically acquiring a model for semantic compositionality. This includes, but is not limited to:<br><br><span class="Apple-tab-span" style="white-space: pre; ">     </span>• Models of Distributional Similarity<br><span class="Apple-tab-span" style="white-space: pre; ">  </span>• Graph-based models over word spaces<br><span class="Apple-tab-span" style="white-space: pre; ">  </span>• Vector-space models for distributional semantics<br><span class="Apple-tab-span" style="white-space: pre; ">     </span>• Applications of semantic compositionality<br><span class="Apple-tab-span" style="white-space: pre; ">    </span>• Evaluation of semantic compositionality<br><br>Authors are invited to submit papers on original, unpublished work in the topic area of this workshop. In addition to long papers presenting completed work, we also invite short papers and demos:<br><br>- Long papers should present completed work and should not exceed 8 pages plus 1 page of references<br><br>- Short papers/demos can present work in progress or the description of a system, and should not exceed 4 pages plus 1 page of references.<br><br>As reviewing will be blind, please ensure that papers are anonymous. The papers should not include the authors' names and affiliations or any references to web sites, project names etc. revealing the authors' identity.<br><br><br>** Shared Task **<br><br>The organizers extracted candidate phrases from two large-scale freely available web-corpora, UkWaC and DeWaC (cf. <a href="http://wacky.sslmit.unibo.it/">http://wacky.sslmit.unibo.it/</a>), containing respectively English and German POS tagged text. These data have been manually evaluated for compositionality with Amazon Turk. Workers were presented a sentence with a bolded target phrase and were asked to score how literal the phrase was between 0 and 10. 4-5 different, randomly sampled sentences from the WaCKy corpora for UK English and German were presented to 4 workers each.<br><br>Phrases consist of two lemmas and come in three grammatical relations:<br><br><span class="Apple-tab-span" style="white-space: pre; "> </span>• ADJ_NN: adjective modifying a noun<br><span class="Apple-tab-span" style="white-space: pre; ">   </span>• V_SUBJ: noun as a subject of a verb<br><span class="Apple-tab-span" style="white-space: pre; ">  </span>• V_OBJ: noun as an object of a verb<br><br>Phrases were extracted semi-automatically. The relations were assigned by patterns and manually checked for validity. Phrases were selected in a way as to balance the data set while controlling for frequency. The complete data was split into 40% training, 10% validation and 50% test. <br><br>More details on the data set as well as the download link to the training and validation data are available from the workshop's website (<a href="http://disco2011.fzi.de/">http://disco2011.fzi.de/</a>) <br><br>Participants of the task are free to choose whatever method and data resources they will use in their submission. Prefabricated lists of multiwords are not allowed. Since the data set is derived from the WaCkY corpora, participants are strongly encouraged to use these freely available text collections to build their models of compositionality, thus ensuring the highest possible comparability of results. Furthermore, since the WaCkY corpora are provided already POS-tagged and lemmatized, the workload on the participants' side is considerably reduced. This information (POS tags and lemmatization) may or may not be used by the participants. If needed, additional linguistic annotations or processing may also be added to the corpora. For obtaining the WaCky corpora, please email us (< disco2011workshop @ <a href="http://gmail.com">gmail.com</a> >) for instructions to minimize load on the WaCky organizers. Of course, you can also directly contact the WaCky community at <a href="http://wacky.sslmit.unibo.it/doku.php?id=start">http://wacky.sslmit.unibo.it/doku.php?id=start</a>.<br><br>Participants further submit a 4 page system description for publication in the workshop volume.<br><br>** Program Committee **<br><br><span class="Apple-tab-span" style="white-space: pre; "> </span>• Enrique Alfonseca, Google Research, Switzerland<br><span class="Apple-tab-span" style="white-space: pre; ">      </span>• Tim Baldwin, University of Melbourne, Australia<br><span class="Apple-tab-span" style="white-space: pre; ">      </span>• Marco Baroni, University of Trento, Italy<br><span class="Apple-tab-span" style="white-space: pre; ">    </span>• Paul Buitelaar, National University of Ireland, Ireland<br><span class="Apple-tab-span" style="white-space: pre; ">      </span>• Chris Brockett, Microsoft Research, Redmond, US<br><span class="Apple-tab-span" style="white-space: pre; ">      </span>• Tim van de Cruys, INRIA, France<br><span class="Apple-tab-span" style="white-space: pre; ">      </span>• Stefan Evert, University of Osnabrück, Germany<br><span class="Apple-tab-span" style="white-space: pre; ">       </span>• Antske Fokkens, Saarland University, Germany<br><span class="Apple-tab-span" style="white-space: pre; "> </span>• Silvana Hartmann, TU Darmstadt, Germany<br><span class="Apple-tab-span" style="white-space: pre; ">      </span>• Alfio Massimiliano Gliozzo, IBM, Hawthorne, NY, USA<br><span class="Apple-tab-span" style="white-space: pre; ">  </span>• Mirella Lapata, University of Edinburgh, UK<br><span class="Apple-tab-span" style="white-space: pre; ">  </span>• Ted Pedersen, University of Minnesota, Duluth, USA<br><span class="Apple-tab-span" style="white-space: pre; ">   </span>• Yves Peirsman, Stanford University, USA</div><div><span class="Apple-tab-span" style="white-space: pre; "> </span>• Sebastian Rudolph, Karlsruhe Institute of Technology, Germany</div><div><span class="Apple-tab-span" style="white-space: pre; ">   </span>• Peter D. Turney, National Research Council Canada, Canada<br><span class="Apple-tab-span" style="white-space: pre; ">    </span>• Magnus Sahlgren, Gavagai, Sweden<br><span class="Apple-tab-span" style="white-space: pre; ">     </span>• Serge Sharoff, University of Leeds, UK<br><span class="Apple-tab-span" style="white-space: pre; ">       </span>• Anders Søgaard, University of Copenhagen, Denmark<br><span class="Apple-tab-span" style="white-space: pre; ">    </span>• Daniel Sonntag, German Research Center for AI, Germany<br><span class="Apple-tab-span" style="white-space: pre; ">       </span>• Diana McCarthy, Lexical Computing Ltd., UK<br><span class="Apple-tab-span" style="white-space: pre; ">   </span>• Dominic Widdows, Google, USA<br><br><br>Workshop Chairs:<br><br><span class="Apple-tab-span" style="white-space: pre; "> </span>• Chris Biemann, San Francisco, USA<br><span class="Apple-tab-span" style="white-space: pre; ">    </span>• Eugenie Giesbrecht, FZI Research Center for Information Technology at the University of Karlsruhe, Germany<br><span class="Apple-tab-span" style="white-space: pre; ">   </span>• Emiliano Guevara, Institute for Linguistics and Scandinavian Studies, University of Oslo, Norway<br><br>Contact email: < disco2011workshop @ <a href="http://gmail.com">gmail.com</a> ><br></div></div></body></html>