From hamon at LIMSI.FR Wed Dec 4 12:58:54 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 13:58:54 +0100
Subject: Appel: DISCOURS n°14
Message-ID:

Date: Mon, 2 Dec 2013 17:09:23 +0100
From: "laure.sarda at ens.fr"
Message-ID:
X-url: http://discours.revues.org/index208.html

********************************************
*Call for papers for DISCOURS, Issue 14*

Coordination: Laure SARDA & Denis VIGIER

We are inviting submissions for the next issue of DISCOURS, to appear in June 2014.

DISCOURS is an international and interdisciplinary e-journal, which publishes two issues a year, and occasional special issues.

The journal focuses on the following topics: discourse structure, cohesion, co-reference, linearization, indexation, information structure, word order, segmentation markers, integration markers, discourse relations, cognitive processes involved in discourse comprehension and production, and other related topics. It is intended as a forum for exchanging and comparing data, analyses and opinions for all linguists, psycholinguists and computational linguists working in fields involving the description, comprehension, formalization and processing of text organization.

Papers (in English or French) should be sent to discours at revues.org before December 15, 2013. Full instructions can be found on the website (http://discours.revues.org/224)

*Call for publications, DISCOURS, issue 14*

Coordination: Laure SARDA & Denis VIGIER

We invite submissions for the next issue of DISCOURS, to appear in June 2014.

DISCOURS is an international and interdisciplinary electronic journal that publishes two issues a year. In addition, the journal accepts proposals for thematic issues.

The journal's editorial line centres on the following themes: discourse structuring, cohesion, co-reference, linearization, indexation, information structure, word order, segmentation markers, integration markers, discourse relations, the cognitive processes at work in the comprehension and production of texts, and other related themes. It is a place for exchanging and comparing data, analyses and theories for the community of linguists, psycholinguists and computer scientists working more generally on the description, comprehension, formalization and computational processing of text organization.

Send your articles, in English or French, to discours at revues.org by December 15, 2013 (see the submission guidelines at http://discours.revues.org/index208.html)

-------------------------------------------------------------------------
Message distributed by the Langage Naturel list
Information, subscription: http://www.atala.org/article.php3?id_article=48
English version:
Archives: http://listserv.linguistlist.org/archives/ln.html
http://liste.cines.fr/info/ln

The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues)
Information and membership: http://www.atala.org/

ATALA accepts no responsibility for the content of messages distributed on the LN list
-------------------------------------------------------------------------

From hamon at LIMSI.FR Wed Dec 4 12:56:04 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 13:56:04 +0100
Subject: These: Laurence Longo, Vers des moteurs de recherche intelligents
Message-ID:

Date: Sat, 30 Nov 2013 19:51:20 +0100
From: Laurence Longo
Message-ID:
X-url: http://www.unistra.fr/plans/centre/72/index.html

Hello,

I am pleased to invite you to the defence of my thesis, entitled: "Vers des moteurs de recherche 'intelligents' : un outil de détection automatique de thèmes.
Méthode basée sur l'identification automatique des chaînes de référence."

It will take place on Thursday, 12 December 2013, from 2:30 pm in the thesis room (salle des thèses) of the Nouveau Patio, Campus Universitaire Esplanade, 20a rue René Descartes, 67084 Strasbourg (map: http://www.unistra.fr/plans/centre/72/index.html).

The defence will be followed by the traditional thesis reception (from 6:30 pm, in room 0-01), to which you are warmly invited.

Kind regards,
Laurence Longo

--------------------

Jury members:
-------------
Ms Catherine Schnedecker - Professor, LiLPa, Université de Strasbourg - Supervisor
Ms Amalia Todirascu - Associate Professor (Maître de conférences), LiLPa, Université de Strasbourg - Co-supervisor
Mr Yves Bestgen - Professor, Université catholique de Louvain - Reviewer (rapporteur)
Mr Denis Maurel - Professor, Université François Rabelais - Reviewer (rapporteur)
Ms Agnès Tutin - Professor, Université Stendhal Grenoble - Examiner
Mr Frédéric Landragin - CNRS Research Fellow (Chargé de recherche), Université Paris 3 - Examiner

Summary:
--------
This thesis falls within the field of Natural Language Processing and aims to optimize document classification in search engines. The work focuses on developing a tool for the automatic detection of document topics (ATDS-fr). Using little knowledge, the hybrid method adopted combines statistical techniques of topic segmentation with linguistic methods that identify cohesion markers. Among these, reference chains - sequences of referential expressions referring to the same discourse entity (e.g. Paul ... he ... this man) - received particular attention, because they are an important textual cue for topic detection (i.e. they mark topic introduction, maintenance and change). Thus, on the basis of a study of reference chains carried out on a corpus drawn from various textual genres (newspaper articles, public reports, European laws, editorials, a novel), we developed RefGen, a module for the automatic identification of reference chains, which was evaluated using current coreference metrics.

Keywords:
---------
Automatic topic detection, reference chains, natural language processing, lexical semantics, coreference, textual genres, topic segmentation, linguistic markers, cohesion, corpus linguistics

Abstract:
---------
This thesis in the field of Natural Language Processing aims at optimizing document classification in search engines. This work focuses on the development of a tool that automatically detects document topics (ATDS-fr). Using little knowledge, the hybrid method combines statistical techniques for topic segmentation with linguistic methods that identify cohesive markers. Among them, reference chains - sequences of referential expressions referring to the same entity (e.g. Paul ... he ... this man) - have been given special attention as they are important topic markers (i.e. they are markers of topic introduction, maintenance and change). Thus, from a study of reference chains extracted from a corpus composed of various textual genres (newspapers, public reports, European laws, editorials and a novel), we developed RefGen, an automatic reference chain identification module, which was evaluated according to current coreference metrics.
Keywords:
---------
Topic detection, reference chains, natural language processing, lexical semantics, coreference, textual genre, topic segmentation, linguistic markers, cohesion, corpus linguistics

From hamon at LIMSI.FR Wed Dec 4 13:02:08 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:02:08 +0100
Subject: Appel: ESWC 2014, Call for Challenge, Linked Open Data-enabled Recommender Systems
Message-ID:

Date: Tue, 3 Dec 2013 09:24:36 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20131203082456.E1E05F983E at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/RecSys

** apologies for cross-posting **

=== Call for Challenge: Linked Open Data-enabled Recommender Systems ===

Challenge Website: http://challenges.2014.eswc-conferences.org/RecSys
Call Web page: http://2014.eswc-conferences.org/important-dates/call-RecSys

MOTIVATION AND OBJECTIVES

People increasingly need advanced tools that go beyond those implementing the canonical search paradigm for seeking relevant information. A new search paradigm is emerging, where the user perspective is completely reversed: from finding to being found. Recommender systems may help to support this new perspective, because they have the effect of pushing relevant objects, selected from a large space of possible options, to potentially interested users. To achieve this result, recommendation techniques generally rely on data referring to three kinds of objects: users, items and their relations.

Recent developments in the Semantic Web community offer novel strategies to represent data about users, items and their relations that might improve the current state of the art of recommender systems, in order to move towards a new generation of recommender systems which fully understand the items they deal with.

More and more semantic data are published following the Linked Data principles, which make it possible to set up links between objects in different data sources and so connect information in a single global data space: the Web of Data. Today, the Web of Data includes different types of knowledge represented in a homogeneous form: sedimentary knowledge (encyclopedic, cultural, linguistic, common-sense) and real-time knowledge (news, data streams, ...). These data might be useful to interlink diverse information about users, items, and their relations and implement reasoning mechanisms that can support and improve the recommendation process.

The primary goal of this challenge is twofold. On the one hand, we want to create a link between the Semantic Web and the Recommender Systems communities. On the other hand, we aim to show how Linked Open Data and semantic technologies can boost the creation of a new breed of knowledge-enabled and content-based recommender systems.

TARGET AUDIENCE

The target audience comprises all communities, both academic and industrial, that are interested in personalized information access, with a particular emphasis on Linked Open Data. During the last ACM RecSys conference more than 60% of participants were from industry. This clearly attests to the current interest in recommender systems for industrial applications ready to be released on the market.
TASKS

Task 1: Rating prediction in cold-start situations

This task deals with the rating prediction problem, in which a system is requested to estimate the value of unknown numeric scores (a.k.a. ratings) that a target user would assign to available items, indicating whether she likes or dislikes them. In order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims at predicting ratings in cold-start situations, that is, predicting ratings for users who have only a few past ratings, and predicting ratings of items that have been rated by only a few users.

The dataset to use in the task - DBbook - relates to the book domain. It contains explicit numeric ratings assigned by users to books. For each book we provide the corresponding DBpedia URI. Participants will have to exploit the provided ratings as training sets, and will have to estimate unknown ratings in a non-provided evaluation set.

Recommendation approaches will be evaluated on the evaluation set by means of a metric that measures the difference between real and estimated ratings, namely the Root Mean Square Error (RMSE).

Task 2: Top-N recommendation from binary user feedback

This task deals with the top-N recommendation problem, in which a system is requested to find and recommend a limited set of N items that best match a user profile, rather than correctly predicting the ratings for all available items. As in Task 1, in order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims at generating ranked lists of items for which no graded ratings are available, but only binary ones. In this case too, the DBbook dataset will be provided.

In this task, the accuracy of recommendation approaches will be evaluated on an evaluation set using the F-measure.
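
The two metrics named in Tasks 1 and 2 above can be illustrated with a minimal Python sketch. This is not the organisers' evaluation code (the call says Java classes and a Web Service will be provided); the function names, the 1-5 rating scale, the book identifiers and the use of the balanced F-measure for a single user are all assumptions made for illustration. The official definitions are the ones published on the challenge website.

```python
import math

def rmse(actual, predicted):
    """Root Mean Square Error between two equal-length rating lists."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("need two non-empty lists of equal length")
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def f_measure_at_n(ranked_items, relevant_items, n):
    """Balanced F-measure of the top-n recommended items for one user."""
    top_n = ranked_items[:n]
    hits = sum(1 for item in top_n if item in relevant_items)
    if hits == 0:
        return 0.0
    precision = hits / len(top_n)
    recall = hits / len(relevant_items)
    return 2 * precision * recall / (precision + recall)

# Hypothetical toy data: ratings on an assumed 1-5 scale, made-up book ids.
print(rmse([4, 2, 5], [3, 2, 4]))
print(f_measure_at_n(["b1", "b7", "b3", "b9"], {"b1", "b3", "b4"}, n=3))
```

In the toy call, two of the three recommended books are relevant and two of the three relevant books are recommended, so precision and recall are both 2/3. How scores are averaged over users, and which N is used, is left open here because the call does not say.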
Task 3: Diversity

A very interesting aspect of content-based recommender systems, and hence of LOD-enabled ones, is that they make it possible to evaluate the diversity of recommended items in a straightforward way. This is a very popular topic in content-based recommender systems, which usually suffer from over-specialization.

In this task, the evaluation will be made by considering a combination of both the accuracy of the recommendation list and the diversity of the items belonging to it. For this task too, the DBbook dataset will be used. Given the domain of books, diversity with respect to the two properties dbpedia-owl:author and skos:subject will be considered.

EVALUATION DATASET AND EVALUATION METRICS

DBbook dataset

This dataset relies on user data and preferences retrieved from the Web. The books available in the dataset have been mapped to their corresponding DBpedia URIs. The mapping contains 8170 DBpedia URIs. These mappings can be used to extract semantic features from DBpedia or other LOD repositories to be exploited by the recommendation approaches proposed in the challenge.

The dataset is split into a training set and an evaluation set. In the former, user ratings are provided to train a system, while in the latter, ratings have been removed and will be used in the final evaluation step. Participants will generate the ratings (Task 1) or the ranking (Task 2 and Task 3) for the test set, and their results will be compared and evaluated with respect to actual user ratings (hidden evaluation data).

The two sets will be provided at the challenge's website, in the following plain text files:

* lodrecsys2014-DBbook-training-ratings.txt: Contains tuples that may be used to build the recommendation approaches.
* lodrecsys2014-DBbook-test-ratings.txt: Contains tuples that will be used to evaluate the recommendation approaches.

Together with the two previous files, a further one will be provided containing the mapping to DBpedia.
* lodrecsys2014-DBbook-mappings.txt: Contains tuples associated with the books in DBbook that will be used to extract and exploit semantic features from DBpedia and other LOD repositories. Recommendation approaches will be evaluated with the metrics requested in each task. We provide a number of Java classes that compute the different metrics and a Web Service to test intermediate results. A description of the metrics to compute and further details regarding the evaluation will be available at the challenge website. JUDGING AND PRIZES After a first round of reviews, the Program Committee and the chairs will select a number of submissions that will have to satisfy the challenge's requirements, and will have to be presented at the conference. Submissions accepted for presentation will receive constructive reviews from the Program Committee, and will be included in post-proceedings. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC'14, and will be invited to submit a paper to a dedicated Semantic Web Journal special issue. For each task we will select: * the best performing tool, given to the paper which will get the highest score in the evaluation * the most original approach, selected by the Challenge Program Committee with the reviewing process HOW TO PARTICIPATE The following information has to be provided: * Abstract: no more than 200 words. * Description: It should contain the details of the system, including why the system is innovative, how it uses Semantic Web, which features or functions the system provides, what design choices were made, and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. 
Papers must be submitted in PDF format, following the style of the Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length. * Result evaluation: For the three tasks previously described, a Web-accessible service will be provided in order to evaluate the produced results. All the details about the format and the service URL will be provided on the website. All submissions should be provided via EasyChair https://www.easychair.org/conferences/?conf=eswc2014-challenges MAILING LIST We invite the potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge. https://lists.sti2.org/mailman/listinfo/eswc2014-recsys-challenge IMPORTANT DATES * March 7, 2014, 23:59 (Hawaii time): Abstract Submission * March 14, 2014, 23:59 (Hawaii time): Submission * April 9, 2014, 23:59 (Hawaii time): Notification of acceptance * May 27-29, 2014: Challenge days CHALLENGE CHAIRS * Tommaso Di Noia (Polytechnic University of Bari, IT) * Ivan Cantador (Universidad Autonoma de Madrid, ES) EVALUATION COORDINATOR * Vito Claudio Ostuni (Polytechnic University of Bari, IT) PROGRAM COMMITTEE (to be completed) * Oscar Corcho (Universidad Politecnica de Madrid, ES) * Marco de Gemmis (University of Bari Aldo Moro, IT) * Frank Hopfgartner (Technische Universitat Berlin, DE) * Andreas Hotho (Universitat Wurzburg, DE) * Dietmar Jannach (TU Dortmund University, DE) * Pasquale Lops (University of Bari Aldo Moro, IT) * Valentina Maccatrozzo (Delft University of Technology, NL) * Francesco Ricci (Free University of Bozen-Bolzano, IT) * Giovanni Semeraro (University of Bari Aldo Moro, IT) * David Vallet (NICTA, AU) * Manolis Wallace (University of Peloponnese, GR) * Markus Zanker (Alpen-Adria-Universitaet Klagenfurt, AT) ESWC CHALLENGE COORDINATOR * Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR) From hamon at LIMSI.FR Wed Dec 4 12:57:53 
2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 13:57:53 +0100
Subject: Stage: Prediction automatique de relations d'implication entre verbes, IRIT
Message-ID:

Date: Mon, 02 Dec 2013 13:48:03 +0100
From: Benamara Zitoune Farah
Message-ID: <529C8183.5030300 at irit.fr>

Automatic prediction of entailment relations between verbs

Supervision: Marta Abrusan (marta.abrusan at irit.fr), Stergos Afantenos (stergos.afantenos at irit.fr), Farah Benamara (farah.benamara at irit.fr)
Location: IRIT, Université Paul Sabatier
Funding: internship stipend
Duration: 5 months

Semantic understanding of text is one of the major challenges of natural language processing (NLP). This task is essential for a great many applications, such as automatic generation of texts and conversations, automatic summarization, automatic paraphrasing of a text, and information retrieval. The aim of this internship is to contribute to this research by focusing on the prediction of relations between verbs.

Predicting a relation between verbs consists in determining, for a pair of verbs (v1,v2), with or without an associated context, the type of semantic relation that links them, cf. Chklovski and Pantel (2004), Tremper and Frank (2013). These relations can be of various kinds: synonymy, antonymy, causality, entailment, etc. In this internship, we will focus on entailment relations. For example, the verbs "se balader" (to go for a stroll) and "bouger" (to move) are linked by an entailment relation, which will allow a computer to answer a query such as "Did Pierre go for a stroll?", given that the knowledge base states that Pierre moved.

To identify these relations, the intern will have to use the most suitable machine learning methods. The goal of these methods is to learn a function f : X --> Y, where X represents a set of features over pairs of verbs and Y is a Boolean indicating whether or not an entailment relation holds between these verbs.

Various methods will be explored. We will start with supervised methods, which generalize from observations made on a corpus of data. However, this approach presupposes that the number of training instances is sufficiently large, which is unfortunately not always the case for many tasks where the human cost of annotating instances is high. The second step will therefore be to explore semi-supervised learning methods in order to reduce this cost.

References

Chklovski, T. and P. Pantel (2004): VerbOcean: Mining the Web for Fine-Grained Semantic Verb Relations. In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, pages 33-40, Barcelona, Spain.

Tremper, G. and A. Frank (2013): A Discriminative Analysis of Fine-Grained Semantic Relations including Presupposition: Annotation and Classification. In: Dialogue and Discourse, 4 (2), Special Issue: Beyond Semantics. The Challenge of Annotating Pragmatic and Discourse Phenomena, edited by S. Dipper, H. Zinsmeister and B. Webber, 282-322.
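
The supervised setting described in the internship announcement above, a function f : X --> Y from features over a verb pair to a Boolean, can be made concrete with a small Python sketch. Everything specific in it is an assumption for illustration: the announcement names neither the features nor the learning algorithm, so the directional co-occurrence scores are invented toy values and the perceptron stands in for whichever method the intern would actually choose.

```python
# Toy supervised learner for f : X --> Y on verb pairs.
# Features, scores and labels below are invented for illustration only.

def features(v1, v2, cooccurrence):
    """Map a verb pair to a feature vector (hypothetical features)."""
    return [
        1.0,                              # bias term
        cooccurrence.get((v1, v2), 0.0),  # assumed score for v1 then v2
        cooccurrence.get((v2, v1), 0.0),  # assumed score for v2 then v1
    ]

def predict(weights, x):
    """Return True if the model predicts an entailment relation."""
    return sum(w * xi for w, xi in zip(weights, x)) > 0

def train_perceptron(examples, epochs=20):
    """Learn weights from (feature_vector, label) pairs with the perceptron rule."""
    weights = [0.0] * len(examples[0][0])
    for _ in range(epochs):
        for x, y in examples:
            if predict(weights, x) != y:
                sign = 1.0 if y else -1.0
                weights = [w + sign * xi for w, xi in zip(weights, x)]
    return weights

# Made-up directional scores and gold labels (True = v1 entails v2).
cooc = {
    ("stroll", "move"): 0.875, ("move", "stroll"): 0.25,
    ("snore", "sleep"): 0.75, ("sleep", "snore"): 0.375,
    ("eat", "sing"): 0.125, ("sing", "eat"): 0.25,
}
labelled = [
    ("stroll", "move", True), ("snore", "sleep", True),
    ("move", "stroll", False), ("sleep", "snore", False),
    ("eat", "sing", False),
]
examples = [(features(v1, v2, cooc), y) for v1, v2, y in labelled]
weights = train_perceptron(examples)
print(predict(weights, features("stroll", "move", cooc)))
```

With only five hand-made examples the sketch merely fits its own training data. That is the limitation the announcement points to when it notes that supervised methods need many annotated instances, and why semi-supervised methods are planned as a second step.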
From hamon at LIMSI.FR Wed Dec 4 13:04:50 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:04:50 +0100
Subject: Appel: ESWC 2014, Call for Challenge, Semantic Publishing
Message-ID:

Date: Tue, 3 Dec 2013 09:39:17 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20131203083937.F044EDDAAB at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemPub

** apologies for cross-posting **

==== Call for Challenge: Semantic Publishing ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemPub
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemPub

MOTIVATION AND OBJECTIVES

Scholarly publishing is increasingly enabling a new wave of applications that better support researchers in disseminating, exploiting and evaluating their results. The potential of publishing scientific papers enriched with semantic information is huge and raises interesting and challenging issues. Semantic Web technologies play a central role in this context, as they can help publishers to make scientific results available in an open format the whole research community can benefit from.

The Semantic Publishing Challenge 2014 is intended to be the first in a series of events at ESWC for producing and exploiting semantic publishing data. The main focus this year is on extracting information and using this information to assess the quality of scientific productions.

Linked open datasets about scientific production exist - e.g. DBLP - but they usually cover basic bibliographic information, which is not sufficient to assess quality. Quality-related information, such as the number of journal articles that cite a given publication, the number of times a workshop has been run by the same chairs and received many submissions, or the actual function of a citation, is often hidden and not yet available as LOD.

The main goal of the Challenge is to build high-quality LOD that contains such information. Building on this year's outcomes, we plan to investigate the final publication and exploitation side in the next editions.

TARGET AUDIENCE

The Challenge is open to everyone from industry and academia.

TASKS

We ask challengers to automatically annotate a set of multi-format and multi-source input documents and to produce a Linked Open Dataset that fully describes these documents, their context, and relevant parts of their content. The evaluation will consist of evaluating a set of queries against the produced dataset to assess its correctness and completeness.

The input dataset will be split in two parts: a training/testing part and an evaluation part, which will be disclosed a few days before the submission deadline. Participants will be asked to run their tool on the evaluation dataset and to produce the final Linked Open Dataset. Further details about the organization of the Challenge will be provided.

The Challenge will include two tasks:

Task 1: Extraction and assessment of workshop proceedings information

Participants are required to extract information from a set of HTML tables of contents, partly including microformat and RDFa annotations but not necessarily being valid HTML, of selected computer science workshop proceedings published with the CEUR-WS.org open access service. The extracted information is expected to answer queries about the quality of these workshops, for instance by measuring their growth, longevity, connection with other events, distribution of papers and authors.

Task 2: Extraction and characterization of citations

Participants are required to extract information about the citations in scientific journals and their relevance. Input documents are in XML JATS and TaxPub, an official extension of JATS customized for taxonomic treatments, and are selected from the PubMedCentral Open Access Subset and the Pensoft Biodiversity Data Journal and ZooKeys archive. The extracted information is expected to be used for assessing the value of citations, for instance by considering their position in the paper, their co-location with other citations or their purpose.

EVALUATION

Participants will be requested to submit the LOD that their tool produces from the evaluation dataset, as well as a paper that describes their approach. They will also be given a set of queries in natural language form and will be asked to translate those queries into a SPARQL form that works on their LOD. The results of the queries on the produced LOD will be compared with the expected output, and precision and recall will be measured to identify the best performing approach. Separately, the most original approach will be selected by the Program Committee.

Further details about the evaluation will be provided on the challenge wiki.

FEEDBACK AND DISCUSSION

A discussion group is open for participants to ask questions and to receive updates about the challenge (see link at bottom). Participants are invited to subscribe to this group as soon as possible and to communicate their intention to participate. They are also invited to use this channel to discuss problems in the input dataset and to suggest changes.
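
The comparison described under EVALUATION above, query results on the produced LOD checked against the expected output, can be sketched in Python as follows. This only illustrates precision and recall over result rows treated as sets. The row format, the workshop identifiers and the function name are hypothetical, and the call does not say how the organisers match rows or aggregate scores over queries.

```python
def precision_recall(produced_rows, expected_rows):
    """Compare two collections of query result rows as sets of tuples."""
    produced = set(produced_rows)
    expected = set(expected_rows)
    correct = produced & expected
    precision = len(correct) / len(produced) if produced else 0.0
    recall = len(correct) / len(expected) if expected else 0.0
    return precision, recall

# Hypothetical rows for one query: (workshop id, year) pairs.
expected = [("WS-A", 2011), ("WS-A", 2012), ("WS-B", 2012), ("WS-C", 2013)]
produced = [("WS-A", 2011), ("WS-A", 2012), ("WS-B", 2013)]

p, r = precision_recall(produced, expected)
print(f"precision={p:.2f} recall={r:.2f}")  # prints "precision=0.67 recall=0.50"
```

Two of the three produced rows are correct and two of the four expected rows are found, which gives the two scores. Treating rows as a set ignores duplicates and ordering, which is itself an assumption.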
JUDGING AND PRIZES After a first round of review, the Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work. Submissions accepted for presentation will receive constructive reviews from the Program Committee, they will be included in the Springer LNCS post-proceedings of ESWC, and they will have a presentation slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a revised and extended paper to a dedicated Semantic Web Journal special issue. Four winners will be selected. For each of the two tasks we will select: * best performing tool, given to the paper which will get the highest score in the evaluation * most original approach, selected by the Challenge Committee with the reviewing process HOW TO PARTICIPATE Participants are required to submit: * Abstract: no more than 200 words. * Description: It should explain the details of the automated annotation system, including why the system is innovative, how it uses Semantic Web technology, what features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. An outlook towards how the data could be consumed is appreciated but not strictly required. Papers must be submitted in PDF format, following the style of the Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length. * The Linked Open Dataset produced by their tool on the evaluation dataset (as a file or as a URL, in Turtle or RDF/XML). 
* A set of SPARQL queries that work on that LOD and correspond to the natural language queries provided as input Participants will also be asked to submit their tool (source and/or binaries, or a link these can be downloaded from, or a web service URL) for verification purposes. Further submission instructions will be published on the challenge wiki. All submissions should be provided via EasyChair https://www.easychair.org/conferences/?conf=eswc2014-challenges MAILING LIST We invite the potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge. https://lists.sti2.org/mailman/listinfo/eswc2014-sempub-challenge IMPORTANT DATES * December 3, 2013: Publication of the full description of tasks, rules and queries; publication of the training/testing dataset * January 15, 2014, 23:59 (Hawaii time): Deadline for making remarks to the training/testing dataset * January 20, 2014: Publication of the final training/testing dataset * March 7, 2014, 23:59 (Hawaii time): Abstract Submission * March 11, 2014: Publication of the evaluation dataset * March 14, 2014, 23:59 (Hawaii time): Submission * April 9, 2014, 23:59 (Hawaii time): Notification of acceptance * May 27-29, 2014: Challenge days CHALLENGE CHAIRS * Angelo Di Iorio (Department of Computer Science and Engineering, University of Bologna, IT) * Christoph Lange (Enterprise Information Systems, University of Bonn / Fraunhofer IAIS, DE) PROGRAM COMMITTEE Soren Auer (University of Bonn / Fraunhofer IAIS, DE) (supervisor) Sarven Capadisli (University of Leipzig, DE) Alexander Constantin (University of Manchester, UK) Alexander Garcia Castro (Florida State University, US) Leyla Jael Garcia Castro (Bundeswehr University of Munich, DE) Aidan Hogan (DERI Galway, IR) Evangelos Milios (Dalhousie University, CA) Lyubomir Penev (Pensoft Publishers, BG) Robert Stevens (University of Manchester, UK) Jun Zhao (Lancaster University, UK) We are inviting further 
members.

ESWC CHALLENGE COORDINATOR

* Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)

From hamon at LIMSI.FR Wed Dec 4 13:10:55 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:10:55 +0100
Subject: Appel: ESWC 2014, Call for Challenge, Concept-Level Sentiment Analysis
Message-ID:

Date: Tue, 3 Dec 2013 12:17:24 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20131203111744.59CD6DBB48 at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemSA

==== Call for Challenge: Concept-Level Sentiment Analysis ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemSA
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemSA

MOTIVATION AND OBJECTIVES

Mining opinions and sentiments from natural language is an extremely difficult task as it involves a deep understanding of most of the explicit and implicit, regular and irregular, syntactical and semantic rules proper to a language. Existing approaches mainly rely on parts of text in which opinions and sentiments are explicitly expressed, such as polarity terms, affect words and their co-occurrence frequencies. However, opinions and sentiments are often conveyed implicitly through latent semantics, which makes purely syntactical approaches ineffective.

To this end, concept-level sentiment analysis aims to go beyond a mere word-level analysis of text and provide novel approaches to opinion mining and sentiment analysis that allow a more efficient passage from (unstructured) textual information to (structured) machine-processable data, in potentially any domain.

Concept-level sentiment analysis focuses on a semantic analysis of text through the use of web ontologies or semantic networks, which allow the aggregation of conceptual and affective information associated with natural language opinions.
By relying on large semantic knowledge bases, concept-level sentiment analysis steps away from blind use of keywords and word co-occurrence count, but rather relies on the implicit features associated with natural language concepts. This Challenge focuses on the introduction, presentation, and discussion of novel approaches to concept-level sentiment analysis. Participants will have to design a concept-level opinion-mining engine that exploits common-sense knowledge bases, e.g., SenticNet, and/or Linked Data and Semantic Web ontologies, e.g., DBPedia, to perform multi-domain sentiment analysis. The main motivation for the Challenge, in particular, is to go beyond a mere word-level analysis of natural language text and provide novel concept-level tools and techniques that allow a more efficient passage from (unstructured) natural language to (structured) machine-processable data, in potentially any domain. Systems must have a semantics flavor (e.g., by making use of Linked Data or known semantic networks within their core functionalities) and authors need to show how the introduction of semantics can be used to obtain valuable information, functionality or performance. Existing natural language processing methods or statistical approaches can be used too as long as the semantics plays a main role within the core approach (engines based merely on syntax/word-count will be excluded from the competition). TARGET AUDIENCE The Challenge is open to everyone from industry and academia. TASKS The Concept-Level Sentiment Analysis Challenge is defined in terms of different tasks. The first task is elementary whereas the others are more advanced. The input units of each task are sentences. Sentences are assumed to be in grammatically correct American English and have to be processed according to the input format specified at http://sentic.net/challenge/sentence. Elementary Task: Polarity Detection The main goal of the task is polarity detection. 
The proposed systems will be assessed according to precision, recall and F-measure of detected binary polarity values (1=positive; 0=negative) for each input sentence of the evaluation dataset, following the same format as in http://sentic.net/challenge/task0. The problem of subjectivity detection is not addressed within this Challenge, hence participants can assume that there will be no neutral sentences. Participants are encouraged to use the Sentic API or further develop and apply sentic computing tools.

Advanced Task #1: Aspect-Based Sentiment Analysis

The output of this task will be a set of aspects of the reviewed product and a binary polarity value associated with each of these aspects, in the format specified at http://sentic.net/challenge/task1. So, for example, while the Elementary Task expects an overall polarity (positive or negative) for a review about a mobile phone, this task requires a set of aspects (such as ‘speaker’, ‘touchscreen’, ‘camera’, etc.) and a polarity value (positive OR negative) associated with each of these aspects. Systems will be assessed according to both aspect extraction and aspect polarity detection.

Advanced Task #2: Semantic Parsing

As suggested by the title, the Challenge focuses on sentiment analysis at concept level. This means that the proposed systems are not supposed to work at word/syntax level but rather with concepts/semantics. Hence, this task will evaluate the capability of the proposed systems to deconstruct natural language text into concepts, following the same format as in http://sentic.net/challenge/task2. SenticNet will be taken as a reference to test the efficiency of the proposed parsers, but extracted concepts won't necessarily have to match SenticNet concepts. The proposed systems, for example, are supposed to be able to extract a multi-word expression like ‘buy christmas present’ from sentences such as “Today I bought a lot of very nice Christmas presents”.
The number of extracted concepts per sentence will be assessed through precision, recall and F-measure against the evaluation dataset.

Advanced Task #3: Topic Spotting

Input sentences will be about four different domains, namely: books, DVDs, electronics, and kitchen appliances. This task focuses on the automatic classification of sentences into one of these domains, in the format specified at http://sentic.net/challenge/task3. All sentences are assumed to belong to only one of the above-mentioned domains. The proposed systems are supposed to exploit the extracted concepts to infer which domain each sentence belongs to. Classification accuracy will be evaluated in terms of precision, recall and F-measure against the evaluation dataset.

EVALUATION DATASET

Systems will be evaluated against a testing dataset which will be revealed and released after the first round of evaluation during the Conference. The dataset will be made public on the challenge website. Participants are encouraged to train and/or test their own systems using the Blitzer Dataset. The testing dataset will be constructed in the same way and from the same sources as the Blitzer dataset.

EVALUATION

The evaluation will be performed by the members of the Program Committee. For systems that can be tuned with different parameters, please indicate a range of up to 4 sets of settings. Settings with the best F-measures will be considered for judgment. For each system, reviewers will give a numerical score in the range 1-10 and details motivating their choice. The scores will be given to the following aspects:
1. Use of common-sense knowledge and semantics;
2. Precision, recall, and F-measure with respect to the selected task;
3. Computational time;
4. Innovative nature of the approach.

JUDGING AND PRIZES

After a first round of review, the Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work.
Submissions accepted for presentation will be included in post-proceedings and will receive constructive reviews from the Program Committee. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a paper to a dedicated Semantic Web Journal special issue.

For the Concept-Level Sentiment Analysis Challenge there will be two awards for each task:
* Quantitative: the system with the highest average score in items 1-3 above;
* Innovative: the system with the highest score in item 4 above.

There will be a board of judges at the conference who will evaluate the systems again in more detail. The judges will then meet in private to discuss the entries and to determine the winners. The same system may compete for both awards.

HOW TO PARTICIPATE

The following information has to be provided:
* Abstract: no more than 200 words.
* Description: It should contain the details of the system, including why the system is innovative, how it uses the Semantic Web, which features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. Papers must be submitted in PDF format, following the style of Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length.
* Web Access: The application can either be accessible via the web or downloadable. If the application is not publicly accessible, a password must be provided. A short set of instructions on how to use the application should be provided as well.

All submissions should be provided via EasyChair https://www.easychair.org/conferences/?conf=eswc2014-challenges

Please share comments and questions with the challenge mailing list.
The organizers will assist you for any potential issues that could be raised. MAILING LIST We invite the potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge. https://lists.sti2.org/mailman/listinfo/eswc2014-semsa-challenge IMPORTANT DATES * March 7, 2014, 23:59 (Hawaii time): Abstract Submission * March 14, 2014, 23:59 (Hawaii time): Submission * April 9, 2014, 23:59 (Hawaii time): Notification of acceptance * May 27-29, 2014: Challenge days CHALLENGE CHAIRS * Erik Cambria (National University of Singapore, SG) * Diego Reforgiato Recupero (CNR STLAB Laboratory, IT) PROGRAM COMMITTEE * Newton Howard (MIT Media Laboratory, US) * ChengXiang Zhai (University of Illinois at Urbana-Champaign, US) * Rada Mihalcea (University of North Texas, US) * Ping Chen (University of Houston-Downtown, US) * Yongzheng Zhang (LinkedIn Inc., US) * Giuseppe Di Fabbrizio (Amazon Inc., US) * Rui Xia (Nanjing University of Science and Technology, CN) * Rafal Rzepka (Hokkaido University, JP) * Amir Hussain (University of Stirling, UK) * Alexander Gelbukh (National Polytechnic Institute, MX) * Bjoern Schuller, (Technical University of Munich, DE) * Amitava Das (Samsung Research India, IN) * Dipankar Das (National Institute of Technology, IN) * Carlo Strapparava (Fondazione Bruno Kessler, IT) * Stefano Squartini (Marche Polytechnic University, IT) * Cristina Bosco (University of Torino, IT) * Paolo Rosso (Technical University of Valencia, ES) ESWC CHALLENGE COORDINATOR * Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR) ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des 
Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN
-------------------------------------------------------------------------

From hamon at LIMSI.FR Wed Dec 4 13:25:22 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:25:22 +0100
Subject: Appel: RECITAL 2014
Message-ID:
Date: Tue, 03 Dec 2013 17:37:52 +0100
From: Nuria Gala
Message-ID: <529E08E0.4010502 at lif.univ-mrs.fr>

RECITAL 2014: First call for papers
-----------------------------------------------------------------
RECITAL 2014
16th Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (meeting of student researchers in computer science for natural language processing).
Faculté Saint Charles, Aix Marseille Université (Marseille), 1 to 4 July 2014.

Important dates
-------------------------
- Submission deadline: Saturday 5 April 2014
- Notification to authors: Saturday 3 May 2014
- Final version: Saturday 17 May 2014

Overview
-----------------
RECITAL 2014, the annual conference for young researchers associated with TALN, will take place in Marseille from 1 to 4 July 2014. RECITAL gives young researchers in natural language processing (NLP) the opportunity to present their work and compare their approaches. It is open only to students (master's and doctoral) and to young researchers who obtained their doctorate less than one year ago. Building on the success of previous years, we encourage the submission of work that is still preliminary, thesis projects, and work from the first months of research (state of the art, initial directions, etc.). The primary objective of RECITAL is to support the work of young NLP researchers and to ease their integration into our community.
To that end, we aim for:
- instructive reviews: authors must be able to understand any mistakes they have made so that they can correct them and improve the quality of their work;
- positive reviews: there is never any need to discourage a young researcher; the watchwords should be to encourage and to guide;
- direct exchange: reviews will be sent to authors signed, and therefore not anonymous. Authors (or, better, reviewers) are free to discuss them informally at the conference.

The RECITAL best paper award will be presented at the closing ceremony.

Main topics
--------------------------
Submissions may address the usual NLP topics:
- Analysis and generation in the following areas:
  + Phonetics
  + Phonology
  + Morphology
  + Syntax
  + Semantics
  + Discourse
- Development of linguistic resources for NLP:
  + Databases containing morphological, syntactic, semantic and/or phonological information
  + Grammars
  + Lexicons
  + Ontologies
  + Corpus linguistics
- NLP applications:
  + Sentiment or opinion analysis
  + Automatic categorization or classification
  + Word sense disambiguation
  + Human-machine dialogue in natural language
  + Computer-assisted teaching
  + Automatic indexing
  + Information retrieval and extraction
  + Automatic summarization
  + Anaphora resolution
  + Question answering systems
  + Machine translation
  + Semantic Web
- Approaches:
  + Formal linguistics intended to support automatic processing
  + Symbolic
  + Logical
  + Statistical
  + Based on machine learning

This list is not exhaustive, and the program committee will judge whether a proposed paper fits the conference.
Selection criteria
----------------------------
Authors must be students or recent PhD graduates who defended their thesis less than one year ago. Papers co-authored with established researchers (which includes thesis supervisors) must be submitted to TALN and not to RECITAL. Authors are invited to submit original research that has not been previously published. Submissions will be reviewed by at least two specialists in the field. The following will be considered in particular:
- The correctness of the scientific and technical content
- The positioning of the work within international research
- The organization and clarity of the presentation
- The relevance to the conference topics

Selected papers will be published in the conference proceedings. Depending on the program committee's decision, presentations will be given either orally or as posters.

Submission procedure
---------------------------------
Papers should be written in French by French speakers, and in English by those who are not proficient in French. Papers must be 8 to 14 pages long. A LaTeX style file, a Word template and a LibreOffice template will be available on the conference website (www.taln2014.org).
Contact: nuria.gala(at)lif.univ-mrs.fr and/or klim.peshkov(at)lpl-aix.fr

From hamon at LIMSI.FR Wed Dec 4 13:21:25 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:21:25 +0100
Subject: Seminaire: L'informatique : des biens communs ?, Centre d'Alembert, Orsay, 18 decembre 2013
Message-ID:
Date: Tue, 3 Dec 2013 15:48:42 +0100
From: Centre d'Alembert
Message-Id:
X-url: http://www.centre-dalembert.u-psud.fr/

2013-2014 seminar of the Centre d'Alembert (http://www.centre-dalembert.u-psud.fr/)
"Les sciences comme biens communs" (The sciences as commons)

Session of Wednesday 18 December 2013
(NB: there will be no session on 4 December, contrary to what was announced in the program we sent on 5 November 2013)

"L'INFORMATIQUE : DES BIENS COMMUNS ?" (Computing: a commons?) (provisional title)

Speakers: Jean-Pierre ARCHAMBAULT and Gilles DOWEK

Further information about the session will be sent to you later.

-----
Detailed program and contact:
Web: http://www.centre-dalembert.u-psud.fr
Email
: centre.dalembert at u-psud.fr

Sessions are held at the Faculté des Sciences d'Orsay, colloquium building (338), conference room (1st floor, on the left), from 13:45 to 15:45 (free admission)
Access: http://goo.gl/maps/cXezZ

From hamon at LIMSI.FR Wed Dec 4 13:26:50 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:26:50 +0100
Subject: Appel: CNL 2014
Message-ID:
Date: Tue, 3 Dec 2013 20:46:53 +0100
From: Camilo Thorne
Message-ID:
X-url: http://attempto.ifi.uzh.ch/site/cnl2014/

Announcement and Call for Papers

*** FOURTH WORKSHOP ON CONTROLLED NATURAL LANGUAGE (CNL 2014) ***

Co-located with COLING 2014
20-22 August 2014
Galway, Ireland
http://attempto.ifi.uzh.ch/site/cnl2014/

Proceedings in Springer LNCS.
---------------------------------------------------------------------
This workshop on controlled natural language (CNL) has a broad scope and embraces all approaches that are based on natural language and apply restrictions on vocabulary, grammar, and/or semantics. This includes (but is certainly not limited to) approaches that have been called simplified language, plain language, formalized language, processable language, fragments of language, phraseologies, conceptual authoring, language generation, and guided natural language interfaces.
Some CNLs are designed to improve communication among humans, especially for non-native speakers of the respective natural language. In other cases, the restrictions on the language are supposed to make it easier for computers to analyze such texts in order to improve computer-aided, semi-automatic, or automatic translations into other languages. A third group of CNL has the goal to enable reliable automated reasoning and formal knowledge representation from seemingly natural texts. All these types of CNL are covered by this workshop. Topics ------ Possible topics for CNL 2014 include: - CNL for knowledge representation - CNL for query interfaces - CNL for specifications - CNL for business rules - CNL for dialogue systems - CNL for machine translation - CNL for improved understandability of texts - CNL for natural language generation - design of CNLs - CNL applications - CNL evaluation - usability and acceptance of CNL - CNL grammars and lexica - multilingual CNLs - reasoning in CNL - spoken CNL - CNL in the context of the Semantic Web and Linked Open Data - CNL in the government - CNL in industry - CNL use cases - theoretical properties of CNL Important Dates --------------- Submission deadline: 14 March 2014 Notification of acceptance: 12 May 2014 Deadline for revised papers: 9 June 2014 Workshop: 20-22 August 2014 Submissions and Proceedings --------------------------- We invite researchers to submit papers with novel contributions in the area of CNL. These research papers should be formatted according to the Springer LNCS format and should not exceed 10 pages. Papers should be submitted via the EasyChair conference system: http://www.easychair.org/conferences/?conf=cnl2014 Accepted papers will be included in the workshop proceedings to be published by Springer in their LNCS/LNAI series and indexed in all major citation databases including ISI Web of Science and Scopus. Venue ----- The workshop takes place in Ireland at INSIGHT at NUI Galway (formerly DERI). 
Organization Committee ---------------------- - Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland) - Kaarel Kaljurand (University of Zurich, Switzerland) - Tobias Kuhn (ETH Zurich, Switzerland) Program Committee ----------------- - Krasimir Angelov (Chalmers University, Sweden) - Johan Bos (University of Groningen, Netherlands) - Paul Buitelaar (INSIGHT at NUI Galway, formerly DERI, Ireland) - Olga Caprotti (University of Helsinki, Finland) - Rogan Creswick (Galois, USA) - Danica Damljanovic (University of Sheffield, UK) - Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland) - Ronald Denaux (iSOCO, Spain) - Vania Dimitrova (University of Leeds, UK) - Ramona Enache (Chalmers University, Sweden) - Esra Erdem (Sabanci University, Turkey) - Sébastien Ferré (University Rennes 1, France) - Norbert E. Fuchs (University of Zurich, Switzerland) - Normunds Grūzītis (University of Latvia) - Siegfried Handschuh (INSIGHT at NUI Galway, formerly DERI, Ireland) - Stefan Höfler (University of Zurich, Switzerland) - Kaarel Kaljurand (University of Zurich, Switzerland) - Peter Koepke (University of Bonn, Germany) - Tobias Kuhn (ETH Zurich, Switzerland) - Hans Leiß (University of Munich, Germany) - Reinhard Muskens (Tilburg University, Netherlands) - Adegboyega Ojo (INSIGHT at NUI Galway, formerly DERI, Ireland) - Gordon Pace (University of Malta) - Richard Power (The Open University, UK) - Laurette Pretorius (University of South Africa) - Allan Ramsay (University of Manchester, UK) - Mike Rosner (University of Malta) - Aarne Ranta (University of Gothenburg, Sweden) - Rolf Schwitter (Macquarie University, Australia) - Geoff Sutcliffe (University of Miami, USA) - Irina Temnikova (University of Wolverhampton, UK) - Allan Third (The Open University, UK) - Camilo Thorne (Free University of Bozen-Bolzano, Italy) - Silvie Spreeuwenberg (LibRT, Netherlands) - Uta Schwertel (imc, Germany) - Jeroen Van Grondelle (Be Informed, Netherlands) - Yorick Wilks (University of 
Sheffield, UK) - Adam Wyner (University of Aberdeen, UK) ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN ------------------------------------------------------------------------- From hamon at LIMSI.FR Wed Dec 4 13:27:43 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 4 Dec 2013 14:27:43 +0100 Subject: Job: Postdoc position in WSD, LIMSI-CNRS Message-ID: Date: Wed, 4 Dec 2013 11:19:06 +0100 From: Marianna Apidianaki Message-Id: <1EFA7D3B-766B-4504-90F2-3F188CEA6D13 at limsi.fr> X-url: http://transread.limsi.fr X-url: http://www.limsi.fr/Scientifique/tlp/mt/ [apologies for cross-posting] ****************************************************************** POSTDOC POSITION IN WORD SENSE DISAMBIGUATION AT LIMSI-CNRS, Orsay (Paris area), FRANCE ****************************************************************** A postdoctoral position in Word Sense Disambiguation (WSD) is available at the LIMSI-CNRS lab. The position is funded by the national (ANR) project TransRead (http://transread.limsi.fr) aimed at developing new multilingual text processing applications and facilitating the reading of multilingual documents for foreign language readers and learners. The postdoctoral researcher will work on both data-driven and knowledge-based Word Sense Disambiguation methods exploiting information in parallel corpora and large-scale multilingual resources (such as BabelNet and Wiktionary). 
The disambiguation output will serve to facilitate foreign language reading by providing semantic annotations in different languages. Research results will also be exploited in other applications developed in the group, such as Machine Translation. Requirements: - Ph.D. (or completed by February 2014) in Computer Science, Natural Language Processing, Computational Linguistics - Solid programming skills - Experience with statistical approaches to semantics, cross-lingual methods for NLP or Statistical Machine Translation - Strong publication record - Fluent English The successful candidate will join a dynamic research team working on various aspects of Statistical Machine Translation, Word Sense Disambiguation and paraphrasing. For information regarding our activities, see http://www.limsi.fr/Scientifique/tlp/mt/ Duration: 12 months Application deadline: Open until filled Interested candidates should send their CV and the names and contact information of 2 referees to Marianna Apidianaki (marianna at limsi.fr). Informal inquiries can be sent to the same address. About the LIMSI-CNRS The LIMSI-CNRS lab is situated at Orsay, a green area 25 km south of Paris. A suburban train connects Orsay to Paris city center. Detailed information about the LIMSI lab can be found at http://www.limsi.fr From hamon at LIMSI.FR Wed Dec 4 13:23:19 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 4 Dec 2013 14:23:19 +0100 Subject: These: Assaf Urieli, Analyse syntaxique robuste du fran=?ISO-8859-1?Q?=E7ais?= Message-ID: Date: Tue, 3 Dec 2013 16:34:19 +0100 From: Assaf Urieli Message-ID: Bonjour, J'ai le plaisir de vous inviter à la soutenance de ma thèse intitulée "Analyse syntaxique robuste du français : concilier méthodes statistiques et connaissances linguistiques dans l'outil Talismane". La soutenance aura lieu le mardi 17 décembre 2013 à 14h00 dans la salle D31 de la Maison de la Recherche de l'Université de Toulouse le Mirail. 
La Maison de la Recherche se trouve juste à côté de la bibliothèque universitaire. Un plan se trouve à l'adresse suivante : http://www.univ-tlse2.fr/accueil/infos/pratique/adresses-et-acces/plan-du-campus-avec-l-index-des-ufr-services-et-salles-de-cours-6337.kjsp Vous êtes également conviés au pot de thèse qui suivra la soutenance. Bien cordialement, Assaf Urieli ------------------------------------------------------------------------ Le jury de soutenance sera composé de : M. Alexis Nasr, Professeur, LIF, Marseille - Rapporteur M. Eric Wehrli, Professeur, LATL, Genève - Rapporteur Mme Marie Candito, Maître de Conférences, ALPAGE, Paris Diderot - Examinatrice M. Nabil Hathout, Directeur de Recherche, CLLE, Toulouse - Examinateur M. Ludovic Tanguy, Maître de Conférences HDR, CLLE, Toulouse - Directeur de thèse Titre en français : Analyse syntaxique robuste du français : concilier méthodes statistiques et connaissances linguistiques dans l'outil Talismane Résumé de la thèse : Dans cette thèse, nous explorons l’analyse syntaxique statistique robuste du français. Notre principal souci est de trouver des méthodes qui permettent au linguiste d’injecter des connaissances et/ou des ressources linguistiques dans un moteur statistique afin d’améliorer les résultats à la fois globalement et pour certains phénomènes spécifiques. D’abord, nous décrivons le schéma d’annotation en dépendances du français, et les algorithmes capables de produire cette annotation, en particulier le parsing par transitions. Après avoir exploré les algorithmes d’apprentissage automatique supervisé pour les problèmes de classification en TAL, nous présentons l’analyseur syntaxique Talismane, développé dans le cadre de cette thèse, qui comprend quatre modules statistiques – le découpage en phrases, la segmentation en mots, l’étiquettage morpho-syntaxique et le parsing – ainsi que les diverses ressources linguistiques utilisées par les modèles de base. 
Nos premières expériences ont permis d’identifier la meilleure configuration d’apprentissage parmi les nombreuses configurations possibles. Ensuite, nous explorons les améliorations apportées par le principe de recherche par faisceau (beam search) et la propagation du faisceau d'un module à un autre. Finalement, nous présentons une série d’expériences dont le but est de corriger des erreurs linguistiques spécifiques au moyen de descripteurs ciblés pour l'apprentissage. Une de nos innovations est l’introduction des règles qui imposent ou interdisent certaines décisions locales, permettant ainsi de contourner le modèle statistique, et nous explorons l’utilisation de règles pour les erreurs que les descripteurs n’ont pas pu corriger. Finalement, nous explorons l’utilisation de ressources linguistiques à large couverture, au travers d'une expérience d'apprentissage semi-supervisé avec une ressource lexicale produite par une analyse sémantique distributionnelle. Titre en anglais : Robust French syntax analysis: reconciling statistical methods and linguistic knowledge in the Talismane toolkit Résumé en anglais : In this thesis we explore robust statistical syntax analysis for French. Our main concern is to explore methods whereby the linguist can inject linguistic knowledge and/or resources into the robust statistical engine in order to improve results both globally and for specific phenomena. We first explore the dependency annotation schema for French, concentrating on certain phenomena. Next, we look into the various algorithms capable of producing this annotation, and in particular on the transition-based parsing algorithm used in the rest of this thesis. 
After exploring supervised machine learning algorithms for NLP classification problems, we present the Talismane toolkit for syntax analysis, built within the framework of this thesis, including four statistical modules - sentence boundary detection, tokenisation, pos-tagging and parsing - as well as the various linguistic resources used for the baseline model, including corpora, lexicons and feature sets. Our first experiments enabled us to identify the best baseline among various machine learning configurations. We then look into improvements made possible by a beam search and by beam propagation from one module to the next. Finally, we present a series of experiments aimed at correcting errors related to specific linguistic phenomena, using targeted features. One of our innovations is the introduction of rules that can impose or prohibit certain decisions locally, thus bypassing the statistical model. We explore the usage of rules for errors that the features are unable to correct. Finally, we look into the enhancement of targeted features by large scale linguistic resources, and in particular a semi-supervised approach using a lexical resource constructed using distributional semantic techniques. 
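
For readers unfamiliar with the beam search mentioned in the abstract above, here is a minimal sketch of the general technique applied to a toy tagging problem. It is not Talismane's implementation or API: the names `beam_search`, `toy_scores` and `forbid`, the tag set and all probabilities are invented for illustration. The optional `forbid` callback is only a rough stand-in for the abstract's idea of rules that prohibit certain local decisions.

```python
import math
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# Hypothetical signatures for illustration only; not Talismane's API.
ScoreFn = Callable[[Sequence[str], int, List[str]], Dict[str, float]]
RuleFn = Callable[[Sequence[str], int, str], bool]


def beam_search(
    tokens: Sequence[str],
    score_fn: ScoreFn,
    beam_width: int = 2,
    forbid: Optional[RuleFn] = None,
) -> List[Tuple[float, List[str]]]:
    """Return up to beam_width (log-probability, tags) hypotheses, best first."""
    beam: List[Tuple[float, List[str]]] = [(0.0, [])]
    for i in range(len(tokens)):
        candidates: List[Tuple[float, List[str]]] = []
        for log_prob, history in beam:
            for tag, prob in score_fn(tokens, i, history).items():
                if forbid is not None and forbid(tokens, i, tag):
                    continue  # a rule prohibits this local decision
                if prob <= 0.0:
                    continue  # skip impossible decisions (log undefined)
                candidates.append((log_prob + math.log(prob), history + [tag]))
        # Keep only the beam_width most probable partial analyses.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beam = candidates[:beam_width]
    return beam


def toy_scores(tokens: Sequence[str], i: int, history: List[str]) -> Dict[str, float]:
    """Made-up local probabilities for the two-word input 'la porte'."""
    if tokens[i] == "la":
        return {"DET": 0.45, "PRO": 0.55}
    if history and history[-1] == "DET":
        return {"NOUN": 0.9, "VERB": 0.1}
    return {"NOUN": 0.4, "VERB": 0.6}


sentence = ["la", "porte"]
greedy = beam_search(sentence, toy_scores, beam_width=1)
wider = beam_search(sentence, toy_scores, beam_width=2)
with_rule = beam_search(
    sentence, toy_scores, beam_width=1,
    forbid=lambda toks, i, tag: toks[i] == "la" and tag == "PRO",
)
print(greedy[0][1], wider[0][1], with_rule[0][1])
```

With these invented numbers, a beam of width 1 commits early to the locally best tag and ends on a less probable sequence, while a beam of width 2 keeps the alternative alive long enough to recover the more probable one. The rule reaches the same result at width 1 by removing the misleading option before scoring.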
------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN ------------------------------------------------------------------------- From hamon at LIMSI.FR Wed Dec 4 13:17:51 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 4 Dec 2013 14:17:51 +0100 Subject: Appel: 19th Conference on Formal Grammar Message-ID: Date: Tue, 3 Dec 2013 15:34:11 +0100 (CET) From: Frank Richter Message-ID: X-url: http://fg.phil.hhu.de/ FIRST CALL FOR PAPERS FG-2014: The 19th Conference on Formal Grammar Tuebingen, Germany, August 16-17 2014 http://fg.phil.hhu.de/ Collocated with the European Summer School in Logic, Language and Information Background FG-2014 is the 19th conference on Formal Grammar, to be held in conjunction with the European Summer School in Logic, Language and Information, which takes place in 2014 in Tuebingen, Germany. Previous Formal Grammar meetings were held in Barcelona (1995), Prague (1996), Aix-en-Provence (1997), Saarbruecken (1998), Utrecht (1999), Helsinki (2001), Trento (2002), Vienna (2003), Nancy (2004), Edinburgh (2005), Malaga (2006), Dublin (2007), Hamburg (2008), Bordeaux (2009), Copenhagen (2010), Ljubljana (2011), Opole (2012) and Duesseldorf (2013). Aims and Scope FG provides a forum for the presentation of new and original research on formal grammar, mathematical linguistics and the application of formal and mathematical methods to the study of natural language. 
Themes of interest include, but are not limited to,

* formal and computational phonology, morphology, syntax, semantics and pragmatics;
* model-theoretic and proof-theoretic methods in linguistics;
* logical aspects of linguistic structure;
* constraint-based and resource-sensitive approaches to grammar;
* learnability of formal grammar;
* integration of stochastic and symbolic models of grammar;
* foundational, methodological and architectural issues in grammar and linguistics;
* mathematical foundations of statistical approaches to linguistic analysis.

Previous conferences in this series have welcomed papers from a wide variety of frameworks.

Submission Details

We invite **electronic** submissions of original, 16-page papers (including references and possible technical appendices). Authors are encouraged to use the Springer-Verlag LNCS style: http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0

The submission deadline is **February 23 2014**. Papers must be submitted electronically at EasyChair, https://www.easychair.org. A submission URL for FG-2014 will be provided soon.

Papers should report original work which was not presented in other conferences. However, simultaneous submission is allowed, provided that the authors indicate other conferences to which the work was submitted in a footnote. Note that accepted papers can only be presented in one of the venues.

Submissions will be reviewed anonymously by at least three reviewers. Accepted papers will be published as a volume in the Springer LNCS series, under the FoLLI subline, either separately or jointly with the papers from FG-2015, depending on the number of accepted papers.
Important Dates

* February 23, 2014: Deadline for paper submission
* April 20, 2014: Notification of acceptance
* June 1, 2014: Camera ready copies due
* August 16-17, 2014: Conference dates

Standing committee

* Glyn Morrill (Universitat Politècnica de Catalunya, Spain)
* Reinhard Muskens (Tilburg University, The Netherlands)
* Rainer Osswald (Heinrich-Heine-Universitaet Duesseldorf, Germany)
* Frank Richter (Heinrich-Heine-Universitaet Duesseldorf, Germany)

From hamon at LIMSI.FR Wed Dec 4 13:16:57 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:16:57 +0100
Subject: Ressources: ELRA - Language Resources Catalogue - Update
Message-ID:

Date: Tue, 03 Dec 2013 14:12:06 +0100
From: ELRA ELDA Information
Message-ID: <529DD8A6.4060109 at elda.org>

[Our apologies if you have received multiple copies of this announcement.]

We are happy to announce that 2 new Pronunciation Dictionaries from the GlobalPhone database (Chinese-Mandarin and Korean) are now available in our catalogue.

The GlobalPhone Pronunciation Dictionaries:

GlobalPhone is a multilingual speech and text database collected at Karlsruhe University, Germany. The GlobalPhone pronunciation dictionaries contain the pronunciations of all word forms found in the transcription data of the GlobalPhone speech & text database.
The pronunciation dictionaries are currently available in 17 languages: Arabic (29230 entries/27059 words), Bulgarian (20193 entries), Croatian (23497 entries/20628 words), Czech (33049 entries/32942 words), French (36837 entries/20710 words), German (48979 entries/46035 words), Hausa (42662 entries/42079 words), Japanese (18094 entries), Polish (36484 entries), Portuguese (Brazilian) (54146 entries/54130 words), Russian (28818 entries/27667 words), Spanish (Latin American) (43264 entries/33960 words), Swedish (about 25000 entries), Turkish (31330 entries/31087 words), Vietnamese (38504 entries/29974 words), Chinese-Mandarin (73388 pronunciations), and Korean (3500 syllables).

Special prices are offered for a combined purchase of several GlobalPhone languages.

Available GlobalPhone Pronunciation Dictionaries are listed below (follow the links for further details):

ELRA-S0340 GlobalPhone French Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1197

ELRA-S0341 GlobalPhone German Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1198

ELRA-S0348 GlobalPhone Japanese Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1199

ELRA-S0350 GlobalPhone Arabic Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1200

ELRA-S0351 GlobalPhone Bulgarian Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1201

ELRA-S0352 GlobalPhone Czech Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1202

ELRA-S0353 GlobalPhone Hausa Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1203

ELRA-S0354 GlobalPhone Polish Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1204

ELRA-S0355 GlobalPhone Portuguese (Brazilian) Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1205

ELRA-S0356 GlobalPhone Swedish Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1206

ELRA-S0358 GlobalPhone Croatian Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1207

ELRA-S0359 GlobalPhone Russian Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1208

ELRA-S0360 GlobalPhone Spanish (Latin American) Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1209

ELRA-S0361 GlobalPhone Turkish Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1210

ELRA-S0362 GlobalPhone Vietnamese Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1211

*** NEW ***

ELRA-S0363 GlobalPhone Chinese-Mandarin Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1212

ELRA-S0364 GlobalPhone Korean Pronunciation Dictionary
For more information, see: http://catalog.elra.info/product_info.php?products_id=1213

For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli at elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html

From hamon at LIMSI.FR Wed Dec 4 13:13:22 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:13:22 +0100
Subject: Appel: Special issue of BMC Bioinformatics on BioNLP Shared Task
Message-ID:

Date: Tue, 03 Dec 2013 14:03:35 +0100
From: Claire Nedellec
Message-ID: <529DD6A7.9040804 at jouy.inra.fr>
X-url: http://2013.bionlp-st.org/bmc-bioinformatics-special-issue

====================================================================
CALL FOR PAPERS
Special issue of BMC Bioinformatics on BioNLP Shared Task 2013
====================================================================

We are pleased to announce a special issue of BMC Bioinformatics on the BioNLP Shared Task (BioNLP-ST) 2013. BMC Bioinformatics is a premier open-access journal for BioNLP research with an Impact Factor of 3.02 and indexed by PubMed, Scopus, ACM and Thomson Reuters (ISI) among others.

We invite two kinds of submissions:

1- extended versions of BioNLP-ST 2013 workshop papers. Each extended submission must include a substantial amount of previously unpublished content. In this type of submission, the workshop paper should be cited and the extended contribution has to be clearly stated. Please refer to BMC guidelines: http://www.biomedcentral.com/about/duplicatepublication

2- papers on new contributions to the BioNLP-ST 2013 tasks. The BioNLP-ST 2013 data and evaluation services are kept online at: http://2013.bionlp-st.org/tasks for new participants to register and evaluate their results on these datasets.
For a general overview of BioNLP Shared Task 2013, please refer to the task homepage: http://2013.bionlp-st.org/

All submissions will undergo full peer review by 3 reviewers. We expect that 7-8 papers will be selected to be included in the special issue based on the reviews.

When preparing your manuscript, please follow the author guidelines of BMC Bioinformatics: http://www.biomedcentral.com/bmcbioinformatics/ifora/

Please note also the following:

* The submissions should NOT be made using the general BMC submissions site, but through a special issue submission site that will be available shortly.
* For accepted papers, the authors have to be prepared to pay the article-processing charges. See http://www.biomedcentral.com/bmcbioinformatics/about/faq/openaccess for more details.

IMPORTANT DATES

Jan 27  Manuscript submission due (draft version)
Feb 3   Manuscript submission (final version)
Mar 8   Acceptance notification of first-round review

To prepare the review process, we ask all those planning to submit a manuscript to submit an early version of the paper, including title and abstract, prior to the primary manuscript submission. The draft version is considered a notification of intent to submit and will only be used to coordinate reviews; it will not be considered in review of the manuscript. Note that the initial submissions must include sufficient information for reviewer assignment, e.g. title and abstract. The actual review will begin from Feb 4, so the authors will have a chance to update their manuscripts by Feb 3.
SPECIAL ISSUE EDITORS

Claire Nédellec, National Institute for Agricultural Research (INRA)
Jin-Dong Kim, Database Center for Life Science (DBCLS)
Sampo Pyysalo, University of Turku
Pierre Zweigenbaum, National Center for Scientific Research (CNRS)

Read more: http://2013.bionlp-st.org/bmc-bioinformatics-special-issue

From hamon at LIMSI.FR Sat Dec 7 20:18:07 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:18:07 +0100
Subject: Stage: Stage M2 (recherche), LIRMM, Combining Stochastic and Knowledge-based Modeling for Natural Language Understanding
Message-ID:

Date: Wed, 4 Dec 2013 17:15:39 +0100
From: Jean-Philippe Prost
Message-ID:

Hello,

An M2 (research) internship is available at LIRMM (U. Montpellier 2), from 20 January to mid-June 2014.

*Applications are requested as soon as possible.*

Title: Combining Stochastic and Knowledge-based Modelling for Natural Language Understanding
Supervisor: Jean-Philippe Prost (Prost at lirmm.fr)
Location: LIRMM, Montpellier
Duration: 5 months
Funding: according to statutory rates

Possibility of continuing into a PhD, depending on results (application subject to selection).

This internship topic concerns Natural Language Processing (NLP).
Its aim is to examine one possible way of hybridising logical modelling and probabilistic modelling for the representation of linguistic knowledge. The topic focuses more specifically on the syntactic dimension.

==============

The best-performing parsers today are called "robust" because they can produce a syntactic tree for any input sentence, including an ill-formed one. These parsers are built on machine learning algorithms that construct the most probable tree given the input sentence. This robustness comes at the cost of a substantial loss of information. For example, the question of the well-formedness (grammaticality) of the input is sidestepped, since the parse of a well-formed sentence does not necessarily have maximal probability.

Yet an exact answer to this problem can be obtained simply by plugging a module based on logical reasoning onto the non-deterministic output of a stochastic parser. What would be desirable, however, is to integrate this exact resolution process into the stochastic parsing process itself.

The aim of this internship is therefore to explore various possible approaches to this question. One could, for example, integrate a model checking mechanism (in the sense of logical model theory) into the reranking of the n most probable candidate models generated by a stochastic parser.

The internship will comprise a state-of-the-art survey, a theoretical exploration, and an implementation (programming) part.

JP.
Prost

From hamon at LIMSI.FR Sat Dec 7 20:19:41 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:19:41 +0100
Subject: Stage: Stage M2 recherche en TAL/apprentissage automatique
Message-ID:

Date: Thu, 5 Dec 2013 11:07:34 +0100
From: Isabelle Tellier
Message-Id: <730D5B02-4640-4012-930A-11F49EBF2FC7 at univ-paris3.fr>
X-url: http://ligm.u-pem.fr
X-url: http://www.lattice.cnrs.fr

M2 research internship topic in NLP: acquiring a dependency parser for Medieval French

As part of the ANR project Syntactic Reference Corpus of Medieval French (SRCMF, 2008-2011), a treebank (a collection of trees) of dependency analyses of Medieval French utterances was built. It comprises 260,000 words (including about 27,000 verbal heads) annotated with morphosyntactic tags and linked by labelled dependency relations.

The aim of the internship is to exploit this corpus through machine learning in order to obtain a dependency parser for Medieval French, possibly coupled with a morphosyntactic tagger. These tools can then be applied to new texts. They will make it possible to study precisely the grammatical specificities of Medieval French, in comparison with those of contemporary French.
The internship will be supervised by a specialist in Medieval French (Sophie Prévost) and by specialists in NLP and machine learning (Matthieu Constant and Isabelle Tellier).

Required skills:
- M2 or engineering-degree level in computer science or NLP
- Knowledge of (or at least a well-argued interest in) NLP and machine learning
- Programming skills

Internship conditions:
- Pay: internship stipend (1/3 of the SMIC).
- Duration: 4 to 6 months, starting as soon as possible in 2014.
- Location: the internship will be jointly supervised by LIGM in Marne-la-Vallée (http://ligm.u-pem.fr) and Lattice in Montrouge (http://www.lattice.cnrs.fr); expect travel between the two sites.

Send a CV and cover letter to Matthieu Constant (Matthieu.Constant at u-pem.fr), Sophie Prévost (sophie.prevost at ens.fr) and Isabelle Tellier (isabelle.tellier at univ-paris3.fr).

From hamon at LIMSI.FR Sat Dec 7 20:24:50 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:24:50 +0100
Subject: Appel: 4th DICTAP2014
Message-ID:

Date: Thu, 5 Dec 2013 23:00:49 -0800
Message-ID: <4f9512da14cb095845ce6e8eb6c9d126 at localhost.localdomain>
X-url: http://sdiwc.net/conferences/2014/dictap2014/

Please accept our apologies for multiple postings. Thank you.
==============================================================
The Fourth International Conference on Digital Information and Communication Technology and its Applications (DICTAP2014)
May 6-8, 2014
University of the Thai Chamber of Commerce, Bangkok, Thailand
http://sdiwc.net/conferences/2014/dictap2014/

All registered papers will be included in SDIWC Digital Library.
==============================================================

The conference aims to enable researchers to build connections between different digital applications. The conference welcomes papers on the following (but not limited to) research topics:

- Security in Information and Telecommunication System
- Network Systems and Devices
- Wireless and Optical Communications
- Algorithms, Architecture, and Infrastructures
- Information Content Security
- Cloud Computing and Computer Networks
- Sensor Networks and Embedded System
- E-Learning, E-Commerce, E-Business and E-Government
- Data Exchange Issues and Supply Chain
- Information Retrieval
- Web Services, Web based Application
- Data Grids, Data and Information Quality
- Data Warehouses and Data Mining
- Image Analysis and Image Processing
- Management and Diffusion of Multimedia Applications
- Mobile, Ad Hoc and Sensor Network Security
- Video Search and Video Mining
- Enterprise Computing
- Web Mining including Web Intelligence and Web 3.0
- Knowledge Management
- Compression and Coding
- XML and other extensible languages
- Intelligent and Robust System
- ICT for Social and Humanity
- Security and Access Control
- Constraint Programming
- Ubiquitous Systems
- Semantic Web, Ontologies and Rules

Researchers are encouraged to submit their work electronically. All papers will be fully refereed by a minimum of two specialized referees. Before final acceptance, all referees' comments must be considered.
IMPORTANT DATES:

Submission Deadline        : April 6, 2014
Notification of Acceptance : April 15, 2014
Camera Ready Submission    : April 26, 2014
Registration               : April 26, 2014
Conference Dates           : May 6-8, 2014

----------------------------------------

From hamon at LIMSI.FR Sat Dec 7 20:35:43 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:35:43 +0100
Subject: Appel: SIGDIAL 2014 Conference, June 18th-20th 2014
Message-ID:

Date: Fri, 6 Dec 2013 19:26:40 +0000
From: Helen Hastie
Message-ID:
X-url: http://www.sigdial.org/workshops/conference15
X-url: http://www.cs.jhu.edu/ACL2014/CallforPapers.htm

SIGDIAL 2014 CONFERENCE
Wednesday, June 18 to Friday, June 20, 2014

The 15th Annual Meeting of the Special Interest Group on Discourse and Dialog will be co-located with the 8th International Conference on Natural Language Generation (INLG 2014) in Philadelphia, PA, USA and immediately preceding ACL 2014.

http://www.sigdial.org/workshops/conference15 (under construction)

CALL FOR PAPERS

The 2014 SIGDIAL conference continues a series of fourteen conferences, providing a regular forum for the presentation of cutting edge research across the areas of discourse and dialog and attracting a diverse set of participants from academia and industry. The conference is sponsored by the SIGdial organization, which serves as the Special Interest Group on discourse and dialog for both ACL and ISCA.
TOPICS OF INTEREST

We welcome formal, corpus-based, system-building or analytical work on discourse and dialog including but not restricted to the following themes and topics:

- Discourse Processing and Dialog Systems
- Corpora, Tools and Methodology
- Pragmatic and/or Semantic Modeling
- Dimensions of Interaction
- Open Domain Dialog
- Style, Voice and Personality in Spoken Dialog and Written Text
- Applications of Dialog and Discourse Processing Technology
- Novel Methods for Generation Within Dialog, for a joint special session with INLG

IMPORTANT DATES

Special Session Proposal Deadline: Sunday, 9 February 2014 (23:59, GMT-11)
Special Session Notification: Monday, 17 February 2014
Long, Short and Demonstration Paper Submission Deadline: Sunday, 9 March 2014 (23:59, GMT-11)
Paper Notification: Friday, 18 April 2014
Final Paper Due
- For papers accepted subject to receiving mentoring: Wednesday, 14 May 2014
- For accepted papers: Friday, 23 May 2014
Conference: Wednesday, June 18, 2014 to Friday, June 20, 2014

SUBMISSIONS

Special Session Proposals

The SIGdial organizers welcome the submission of special session proposals. A SIGDIAL special session is the length of a regular session at the conference; may be organized as a poster session, a poster session with panel discussion, or an oral presentation session; and will be held on the last day of the conference. Special sessions may, at the discretion of the SIGdial organizers, be held as parallel sessions.

Those wishing to organize a special session should prepare a two-page proposal containing: a summary of the topic of the special session; a list of organizers and sponsors; a list of people who may submit and participate; and a requested format (poster/panel/oral session). These proposals should be sent to conference[a]sigdial.org by the special session proposal deadline. Special session proposals will be reviewed jointly by the general and program co-chairs.
Papers The program committee welcomes the submission of long papers, short papers, and demonstration descriptions. All accepted submissions will be published in the conference proceedings. Long papers may, at the discretion of the technical program committee, be accepted for oral or poster presentation. They must be no longer than 8 pages, including title, content, and examples. Two additional pages are allowed for references and appendices, which may include extended example discourses or dialogs, algorithms, graphical representations, etc. Short papers will be presented as posters. They should be no longer than 4 pages, including title and content. One additional page is allowed for references and appendices. Demonstration papers should be no longer than 3 pages, including references. A separate one-page document should be provided to the program co-chairs for demonstration descriptions, specifying furniture and equipment needed for the demo. Authors of a submission may designate their paper to be considered for a SIGDIAL special session. However, in 2014 all papers will undergo regular peer review and appear in the SIGDIAL proceedings. Consequently, no paper is guaranteed to appear in any particular session at SIGDIAL. Papers that have been or will be submitted to other meetings or publications must provide this information (see submission format). A paper accepted for presentation at SIGDIAL 2014 must not have been presented at any other meeting with publicly available proceedings. Any questions regarding submissions can be sent to the program co-chairs at program-chairs[at]sigdial.org . Authors are encouraged to submit additional supportive material such as video clips or sound clips and examples of available resources for review purposes. Submission is electronic using paper submission software. FORMAT All long, short, and demonstration submissions should follow the two-column ACL 2014 format. 
We strongly recommend the use of ACL LaTeX style files or Microsoft Word style files tailored for the ACL 2014 conference. Submissions must conform to the official ACL 2014 style guidelines (http://www.cs.jhu.edu/ACL2014/CallforPapers.htm), and they must be electronic in PDF. As in most previous years, submissions will not be anonymous.

MENTORING SERVICE

For several years, the SIGDIAL conference has offered a mentoring service. Submissions with innovative core ideas that may need language (English) or organizational assistance will be flagged for "mentoring" and conditionally accepted with a recommendation to revise with a mentor. An experienced mentor who has previously published in the SIGDIAL venue will then help the authors of these flagged papers prepare their submissions for publication. Any questions about the mentoring service can be addressed to the mentoring chair at mentoring[at]sigdial.org.

STUDENT SUPPORT

SIGdial also offers a limited number of scholarships for students presenting a paper accepted to the conference. Application materials will be posted at the conference website.

BEST PAPER AWARDS

In order to recognize significant advancements in dialog and discourse science and technology, SIGDIAL will present two best paper awards. A selection committee consisting of prominent researchers in the fields of interest will select the recipients of the awards.

SPONSORSHIP

SIGDIAL offers a number of opportunities for sponsors. For more information, email the conference organizers at sponsor-chair[at]sigdial.org.

DIALOG AND DISCOURSE

SIGDIAL authors are encouraged to submit their research to the journal Dialog and Discourse, which is endorsed by SIGdial.
ORGANIZING COMMITTEE

General Co-Chairs
Kallirroi Georgila, University of Southern California, USA
Matthew Stone, Rutgers, The State University of New Jersey, USA

Technical Program Co-Chairs
Helen Hastie, Heriot-Watt University, Edinburgh, UK
Ani Nenkova, University of Pennsylvania, USA

Mentoring Chair
Svetlana Stoyanchev, AT&T Research Labs, USA

Local Chair
Keelan Evanini, Educational Testing Service, USA

Sponsorships Chair
Giuseppe Di Fabbrizio, Amazon.com, USA

SIGdial President
Amanda Stent, Yahoo! Labs, USA

SIGdial Vice President
Jason Williams, Microsoft Research, USA

SIGdial Secretary/Treasurer
Kristiina Jokinen, University of Helsinki, Finland

From hamon at LIMSI.FR Sat Dec 7 20:30:10 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:30:10 +0100
Subject: Info: Newsletter du LPL, No 7 (2013)
Message-ID:

Date: Fri, 6 Dec 2013 15:04:25 +0100
From: "Claudia Pichon-Starke"
Message-ID: <026e01cef28c$1262c850$372858f0$@starke at lpl-aix.fr>
X-url: http://www.lpl.univ-aix.fr/fileadmin/template/NewsLetter/NL7.htm

Hello,

The LPL communication unit is pleased to present the new issue of its newsletter:

http://www.lpl.univ-aix.fr/fileadmin/template/NewsLetter/NL7.htm

We hope you enjoy reading it.
Best regards,

Claudia Pichon-Starke
for the Communication Unit
communication at lpl-aix.fr

From hamon at LIMSI.FR Sat Dec 7 20:33:03 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:33:03 +0100
Subject: Seminaire: Alpage, Djame Seddah, Overview of the SPMRL 2013 Shared Task, 13 decembre 2013
Message-ID:

Date: Fri, 6 Dec 2013 20:00:24 +0100
From: Marie Candito
Message-ID:

************** Alpage team seminar *********

This is the computational linguistics research seminar organised by the Alpage team, a joint INRIA - Paris Diderot team specialising in natural language processing.

Friday 13 December, 11:00 to 12:30, room 106
Bâtiment Olympe de Gouges
rue Albert Einstein
75013 Paris
(last building on rue Albert Einstein, no street number assigned...)

Anyone interested is welcome.

**********************************************************

Djamé Seddah (Alpage)

Title: "Overview of the SPMRL 2013 Shared Task: Cross-Framework Evaluation of Parsing Morphologically Rich Languages"

Abstract: In this presentation, we will present the outcomes of the first shared task on statistical parsing of morphologically rich languages.
The task features data sets from nine languages (Arabic, Basque, French, German, Hebrew, Hungarian, Korean, Polish and Swedish), each available both in constituency and dependency annotation. We report on the preparation of the data sets, on the proposed parsing scenarios, and on the evaluation metrics for parsing MRLs given different representation types. We present and analyze parsing results obtained by the task participants, and then provide an analysis and comparison of the parsers across languages and frameworks, reported for gold input as well as more realistic parsing scenarios. The shared task saw submissions from seven teams, and results produced by more than 14 different systems. The parsing results were obtained in different input scenarios (gold, predicted, and raw) and evaluated using different protocols (cross-framework, cross-scenario, and cross-language). In particular, this is the first time a multilingual evaluation campaign reports on the execution of parsers in realistic, morphologically ambiguous, settings.

Djamé Seddah, joint work with Reut Tsarfaty, Sandra Kübler and many contributors[1]

[1] Marie Candito, Jinho D.
Choi, Richard Farkas, Jennifer Foster, Iakes Goenaga, Koldo Gojenola, Yoav Goldberg, Spence Green, Nizar Habash, Marco Kuhlmann, Wolfgang Maier, Joakim Nivre, Adam Przepiórkowski, Ryan Roth, Wolfgang Seeker, Yannick Versley, Veronika Vincze, Marcin Woliński, Alina Wróblewska, Eric Villemonte de la Clergerie

From hamon at LIMSI.FR Sat Dec 7 20:31:15 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:31:15 +0100
Subject: These: Florent Pompigne, Modelisation logique de la langue et Grammaires Categorielles Abstraites
Message-ID:

Date: Fri, 6 Dec 2013 15:24:14 +0100 (CET)
From: Florent Pompigne
Message-ID: <692013726.115461.1386339854164.JavaMail.root at loria.fr>

Hello,

I am pleased to invite you to the defence of my thesis, entitled "Modélisation logique de la langue et Grammaires Catégorielles Abstraites" (Logical modelling of language and Abstract Categorial Grammars).

The defence will take place on Monday 11 December at 10:30 at LORIA (Campus scientifique, Vandoeuvre-lès-Nancy), in room C005. You are also invited to the reception, which will take place at 16:00.
Best regards,
Florent Pompigne

---
Jury:

Reviewers:
  Christian Retoré, Professor, Université de Bordeaux 1
  Annie Foret, Associate Professor (Maître de conférences), Université de Rennes 1
Examiners:
  Michael Moortgat, Professor, Utrecht University
  Laurent Vigneron, Professor, Université de Lorraine
Supervisors:
  Philippe de Groote, Senior Researcher (Directeur de recherches), INRIA Nancy
  Sylvain Pogodalla, Researcher (Chargé de recherches), INRIA Nancy

---
Abstract

This thesis addresses the modeling of the syntax and the syntax-semantics interface of sentences, and explores the possibility of controlling, at the level of derivation structures, the overgeneration produced when long-distance dependencies are handled with higher-order types. To this end, we study the possibility of extending the type system of Abstract Categorial Grammars with the disjoint sum, Cartesian product and dependent product constructions, which make it possible to label syntactic categories with feature structures. We first prove that the calculus resulting from this extension enjoys the confluence and normalization properties, which makes it possible to identify beta-equivalent terms in the grammatical formalism. We further reduce the same problem for beta-eta-equivalence to a set of initial hypotheses. We then show how this introduction of feature structures can be applied to the control of long-distance dependencies, through the examples of case constraints, extraction islands for overt and covert movement, and multiple interrogative extractions, and we discuss the relevance of placing these controls on derivation structures.
From hamon at LIMSI.FR Sat Dec 7 20:23:29 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:23:29 +0100
Subject: Appel: The 37th Annual ACM SIGIR 2014 conference, Gold Coast Australia, July 6-11 2014
Message-ID:

Date: Fri, 6 Dec 2013 16:53:15 +1000
From: Richi Nayak
Message-ID:
X-url: http://sigir.org/sigir2014/

CALL FOR PAPERS, Submission Due: JAN 27, 2014

ACM SIGIR 2014: THE 37th ANNUAL CONFERENCE
6-11 July, 2014, Gold Coast, Australia
Conference website: http://sigir.org/sigir2014/

SIGIR is the major international forum for the presentation of new research results and for the demonstration of new systems and techniques in the broad field of information retrieval (IR). The Conference and Program Chairs invite all those working in areas related to IR to submit original papers for review. SIGIR 2014 welcomes contributions related to any aspect of IR theory and foundation, techniques, and applications.
Relevant topics include, but are not limited to:

TOPICS
- Document Representation and Content Analysis (e.g., text representation, document structure, linguistic analysis, NLP for IR, cross- and multi-lingual IR, information extraction, sentiment analysis, clustering, classification, topic models, facets, text streams)
- Queries and Query Analysis (e.g., query intent, query suggestion and prediction, query representation and reformulation, query log analysis, conversational search and dialogue, spoken queries, summarization, question answering)
- Retrieval Models and Ranking (e.g., IR theory, language models, probabilistic retrieval models, learning to rank, combining searches, diversity and aggregated search)
- Search Engine Architectures and Scalability (e.g., indexing, compression, distributed IR, P2P IR, mobile IR, cloud IR)
- Users and Interactive IR (e.g., user studies, user and task models, interaction analysis, session analysis, exploratory search, personalized search, social and collaborative search, search interface, whole session support)
- Filtering and Recommending (e.g., content-based filtering, collaborative filtering, recommender systems)
- Evaluation (e.g., test collections, experimental design, effectiveness measures, session-based evaluation, simulation)
- Web IR and Social Media Search (e.g., link analysis, click models/behavioral modeling, social tagging, social network analysis, blog and microblog search, forum search, community-based QA, adversarial IR and spam, vertical and local search)
- IR and Structured Data (e.g., XML search, ranking in databases, desktop search, entity search)
- Multimedia IR (e.g., image search, video search, speech/audio search, music search)
- Other Applications (e.g., digital libraries, enterprise search, genomics IR, legal IR, patent search, text reuse, new retrieval problems)

CONTRIBUTION TYPES
- Full papers (10 pages), Short papers (4 pages), Demos (3 pages), Tutorials, Workshops

INSTRUCTION
Requirements for
paper format and appropriate content are described in the content guidelines. The requirements will be strictly enforced. Papers which do not conform to the requirements may be rejected without review, so please be sure to read this page carefully.

SIGIR 2014 solicits proposals for tutorials of either half-day (3 hours plus breaks) or full day (6 hours plus breaks) on all topics of information retrieval and its applications. Each tutorial should cover a single topic in detail. Submissions should include a cover sheet and an extended abstract.

Proposals for workshops to be held at ACM SIGIR 2014 are also solicited. Workshops will usually last for one day and will be held on Friday 11th July 2014.

IMPORTANT DATES
- 20 January 2014: Abstracts for full research papers due
- 27 January 2014: Full research papers due
- 3 February 2014: Workshop proposals due
- 17 February 2014: Short paper, demonstration, and tutorial submission deadline
- 18 April 2014: Paper, short paper, tutorial, and demonstration acceptance notifications
- 11 May 2014: Camera ready copy due (note the short timeline due to early conference date)
- 16 May 2014: Early bird registration deadline

ORGANIZERS
- General Chairs: Shlomo Geva, Andrew Trotman
- PC Chairs: Peter Bruza, Charles L. A.
Clarke, Kalervo Järvelin

Richi Nayak – Publicity Chair

From hamon at LIMSI.FR Sat Dec 7 20:36:19 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:36:19 +0100
Subject: Journee: Optimisation et traitement automatique du langage, Universite Paris 13, 16 decembre 2013
Message-ID:
X-Url: http://www.atala.org/article.php3?id_article=48
From: Thierry Hamon
X-Archive-Url: http://listserv.linguistlist.org/archives/ln.html
X-Archive-Url: http://liste.cines.fr/info/ln
Date: Sat, 07 Dec 2013 21:29:27 +0100

Date: Fri, 06 Dec 2013 13:46:13 +0100
From: Joseph Le Roux
Message-ID:
X-url: http://doodle.com/87mnbcdbzdmytffc
X-url: http://mathstic.univ-paris13.fr/index.php/axes/oaatcn

Hello,

Optimization methods are increasingly used in natural language processing to design efficient methods for both model learning and decoding, in particular because they offer a formal framework that clearly distinguishes global constraints from local constraints while allowing them to be combined. The RCLN team of the Laboratoire d'Informatique de Paris Nord takes a close interest in these methods for the joint modeling of the different levels of linguistic description.
As part of the math/stic cluster of Paris 13, with the support of LABEX EFL, and in collaboration with the AOC team of LIPN, the RCLN team is organizing a day of seminars on optimization and natural language processing. The event will take place on 16 December on the campus of Université Paris 13 in Villetaneuse, in the Euler lecture hall (Amphi Euler):

09:30-10:00 :: Welcome (Amphi Euler)
10:00-11:00 :: Talk by André Martins (Amphi Euler)
11:15-12:15 :: Talk by Sebastian Riedel (Amphi Euler)
Lunch break (LIPN)
13:45-14:45 :: Talk by Xavier Carreras (Amphi Euler)
15:00-16:00 :: Talk by Joseph Le Roux (Amphi Euler)
16:00 :: Refreshments at LIPN!

Attendance is free. If you plan to attend, we ask you, for practical reasons (notably the organization of the buffet), to let us know at this address: http://doodle.com/87mnbcdbzdmytffc

Website: http://mathstic.univ-paris13.fr/index.php/axes/oaatcn

Abstracts of the four scheduled talks:

--------------------------------------------------------------------------------
André Martins, Priberam, Lisbon
Title: AD3: A New Decoder for Structured Prediction
Abstract: In this talk, I will present AD3 ("Alternating Directions Dual Decomposition"), a new consensus-based decoder for problems representable as factor graphs. AD3 is an approximate decoder that ignores global effects caused by the cycles of the graph, solving a linear relaxation of the original problem. It can handle many scenarios often encountered in NLP and IR applications, such as models with constraints in first-order logic; models involving budget or knapsack constraints; and combinations of structured models which are individually tractable, but hard to decode jointly. Like other dual decomposition algorithms, AD3 has a modular architecture, where local subproblems are solved independently, and their solutions are gathered to compute a global update.
The key characteristic of AD3 is that each local subproblem has a quadratic regularizer, leading to faster convergence (both theoretically and in practice). After providing closed-form solutions for several of these subproblems, I will proceed to discuss a recent active set method that works for arbitrary factors, requiring only a local maximization oracle (the same oracle required in subgradient-based dual decomposition). In the second part of the talk, I will discuss two recent applications of AD3 in NLP problems: dependency parsing and compressive summarization. I will present "Turbo Parser," an open source dependency parser, which was recently improved with AD3 and the active set method to permit fast decoding of non-projective third-order models. Experiments in 14 languages yield state-of-the-art results, with parsing speeds ranging between 700 and 4,000 tokens per second. For compressive summarization, the use of AD3 leads to a system which is modular in the three qualities that define a good summary (conciseness, informativeness, and grammaticality), with state-of-the-art ROUGE scores, and runtimes close to extractive summarizers. This work was done in collaboration with Noah Smith, Mário Figueiredo, Eric Xing, Pedro Aguiar, and Miguel Almeida.

------------------------------------------------------------------------
Sebastian Riedel, UCL, London
Title: Predict, Price and Cut: Column and Row Generation for Structured Prediction
Abstract: Many problems in NLP, and structured prediction in general, can be cast as finding high-scoring structures based on a large set of candidate parts. For example, in second order tagging, we have to select high-scoring transitions between tags in a globally consistent fashion. In second order graph-based dependency parsing we have to choose a quadratic number of first order and a cubic number of second order edges such that the graph is both high-scoring and a tree.
What makes such problems challenging is the large number of possible parts to consider. This number not only affects the cost of search or optimization but also slows down the process of scoring parts before they enter the optimisation problem, and extracting features. In this talk I present an approach that can solve problems with large sets of candidate parts without considering all of these parts in either optimization or scoring. In contrast to most pruning heuristics, our algorithm can give certificates of optimality before having optimized over, or even scored, all parts. It does so without the need for auxiliary models or tuning of threshold parameters. This is achieved by a delayed column and row generation algorithm that iteratively solves an LP relaxation over a small subset of current candidate parts, and then finds new candidates with high scores that can be inserted into the current optimal solution without removing high scoring existing structure. The latter step subtracts from the cost of a part the price of resources the part requires, and is often referred to as pricing. Sometimes parts may score highly after pricing, but are necessary in order to make the current solution feasible. We add such parts in a step that roughly amounts to violated cuts to the LP. We evaluate our approach on two applications: second order dependency parsing and first order tagging with large domains. In both cases we dramatically reduce the number of parts considered, and observe about an order of magnitude speed-up. This is possible without loss of optimality guarantees, and hence accuracy.

------------------------------------------------------------------------
Xavier Carreras, UPC, Barcelona
Title: Learning Automata and Grammars: From Spectral Algorithms to Convex Optimizations
There is an increasing interest in spectral methods to learn latent-variable language models in the form of weighted automata and context-free grammars.
Spectral methods provide an algebraic formulation to the problem of inducing automata or grammars from data, and directly exploit the recurrence relations behind the model. I will review the spectral method from an algebraic perspective, making use of Hankel matrices as the key object behind the method: a Hankel matrix collects all necessary statistics of the distribution we want to learn; and finding a low-rank factorization of this matrix yields the automaton or grammar. Under mild assumptions, it can be shown that this method nicely approximates the target model. From here, I will show how we can reformulate the spectral learning algorithm as a low-rank convex optimization. This will be useful to adapt the method to other settings, by adding linear constraints. I will focus on "unsupervised" induction of context-free grammars, that is, learning a grammar from plain strings. Our formulation involves optimizing for a low-rank Hankel matrix that is linearly constrained to satisfy inside-outside recursions. An analogous method can be formulated to learn finite-state transducers from unaligned parallel strings.

------------------------------------------------------------------------
Joseph Le Roux, LIPN, Paris
Title: Combining PCFG-LA Models with Dual Decomposition: A Case Study with Function Labels and Binarization
Abstract: It has recently been shown that different NLP models can be effectively combined using dual decomposition. In this talk, we present how PCFG-LAs (Probabilistic Context-Free Grammars with Latent Annotations, the state-of-the-art model for unlexicalized constituent parsing) are suitable for combination in this way. We first show how the intractable problem of exact PCFG-LA decoding is approximated with anchored PCFGs. Then we present a method for combining anchored PCFGs based on the partial superposition of tree structures.
We experiment with the different models which result from alternative methods of extracting a grammar from a treebank (retaining or discarding function labels, left binarization versus right binarization) and achieve state-of-the-art parsing performance, with a labeled Parseval F-score of 92.4 on Wall Street Journal Section 23 – this represents an error reduction rate of 7% over a strong PCFG-LA product-model baseline. This work was done in collaboration with Antoine Rozenknop and Jennifer Foster.

From hamon at LIMSI.FR Tue Dec 10 20:43:43 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:43:43 +0100
Subject: Appel: Third Workshop on Computational Linguistics for Literature
Message-ID:

Date: Sun, 08 Dec 2013 17:15:37 -0500
From: Stan Szpakowicz
Message-ID: <52A4EF89.2060908 at eecs.uottawa.ca>

Third Workshop on Computational Linguistics for Literature
April 27, 2014, Göteborg, Sweden, co-located with EACL 2014
https://sites.google.com/site/clfl2014a/

Second Call for Papers
(with apologies for multiple postings)

The purpose of the series of ACL workshops on Computational Linguistics for Literature is to bring together researchers fascinated with literature as a unique type of data which pose distinct challenges. We invite papers on original unpublished work in this broad area.
In particular, we hope to see papers which explore how the state-of-the-art NLP methods can help solve existing research problems in the humanities, or perhaps suggest new problems.

Literary texts revolve around the human condition, emotions, social life and inner life. Naturally, such data abound in common-sense knowledge but are very thin on technical jargon. Can tools and methods developed in the ACL community help process literary data? When do they work, when do they fail and why? What new instruments do we need in order to work with prose and poetry, on a large or small scale? Are there computational solutions to noteworthy problems in the Humanities, Information Science, Library Sciences and other similar disciplines?

Here are some of the topics of interest to the workshop:
- the needs of the readers and how these needs translate into meaningful NLP tasks;
- searching for literature;
- recommendation systems for literature;
- computational modelling of narratives, computational narratology, computational folkloristics;
- summarization of literature;
- differences between literature and other types of writing as relevant to computational linguistics;
- discourse structure in literature;
- emotion analysis for literature;
- profiling and authorship attribution;
- identification and analysis of literary genres;
- building and analyzing social networks of characters;
- generation of literary narrative, dialogue or poetry;
- modelling literary dialogue for generation.

We will consider regular papers which describe experimental methods or theoretical work, and we will gladly welcome position papers. The NLP community does not study literature often enough, so it is important to discuss and formulate the problems before proposing solutions.

The submission deadline is January 23, 2014.
Anna Feldman, Anna Kazantseva, Stan Szpakowicz

From hamon at LIMSI.FR Tue Dec 10 20:52:54 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:52:54 +0100
Subject: Revue: CORELA, Statut et utilisation des corpus
Message-ID:

Date: Tue, 10 Dec 2013 16:54:25 +0100
From: Gilles Col
Message-Id:
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2951
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2952
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2964
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2981
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2994
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3002
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3012
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3023

Dear colleagues,

The editorial team of the journal CORELA - Cognition / Représentation / Langages is pleased to announce the publication of its latest thematic issue, "Statut et utilisation des corpus en linguistique" (Status and Use of Corpora in Linguistics), edited by Laurence Durroux-Vincent.

Contents of the issue:

Laurence Durroux-Vincent. "Statut et utilisation des corpus en linguistique".
http://corela.edel.univ-poitiers.fr/index.php?id=2951

Tobias Scheer. "The Corpus: A Tool among Others".
http://corela.edel.univ-poitiers.fr/index.php?id=2952

Pius ten Hacken, Renáta Panocová. "The Use of Corpora in Word Formation Research".
http://corela.edel.univ-poitiers.fr/index.php?id=2964

Najib Arbach, Saandia Ali. "Aspects théoriques et méthodologiques de la représentativité des corpus".
http://corela.edel.univ-poitiers.fr/index.php?id=2981

Charles Brasart. "Corpus et alternance codique : que peut nous apprendre une approche comparative ?".
http://corela.edel.univ-poitiers.fr/index.php?id=2994

Laurent David. "L'acquisition du Present Perfect chez un enfant anglophone : méthodologie, codage et premiers résultats".
http://corela.edel.univ-poitiers.fr/index.php?id=3002

Jérémi Sauvage, Christelle Dodane, Fabrice Hirsch, Melissa Barkat-Defradas. "Réflexions méthodologiques sur l'analyse phonétique d'un corpus longitudinal et transversal en acquisition du langage".
http://corela.edel.univ-poitiers.fr/index.php?id=3012

Paola Artero, Adriana Serban. "The Status of Qualitative and Quantitative Methods of Enquiry in Translation Research: C. S. Lewis's Narnia in French - A Corpus-based Approach".
http://corela.edel.univ-poitiers.fr/index.php?id=3023

From hamon at LIMSI.FR Tue Dec 10 20:45:22 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:45:22 +0100
Subject: Appel: IC 2014
Message-ID:

Date: Mon, 09 Dec 2013 09:51:41 +0100
From: Catherine Faron Zucker
Message-ID: <52A5849D.90907 at i3s.unice.fr>
X-url: http://www.irit.fr/IC2014

Call for papers for the 25th Journées Francophones d'Ingénierie des Connaissances (French-speaking Knowledge Engineering Days, IC):

Pervasive knowledge: users immersed in knowledge-based systems, and knowledge-based systems immersed in the world

http://www.irit.fr/IC2014
12 to 16 May 2014, Clermont-Ferrand

------------------------------------------------------------------------
Important dates:
Submission deadline: Thursday 19 December 2013
Notification to authors: Thursday 13 February 2014
Deadline for final versions: Monday 17 March 2014
Conference dates: 12 to 16 May 2014
------------------------------------------------------------------------

The Knowledge Engineering conference celebrates its 25th anniversary this year! It is the annual meeting of the French and French-speaking community interested in knowledge engineering issues.
Academics and industry practitioners meet there to exchange ideas and reflect on the research problems that arise in knowledge acquisition, representation and management.

The rise of information and communication sciences and technologies, and in particular of web technologies, throughout society is bringing about changes in individual and collective practices. Knowledge engineering accompanies this evolution by inventing the models, methods and tools that allow knowledge, and reasoning over that knowledge, to be integrated into computing environments. As a result, knowledge is now omnipresent, while the boundaries between research communities in Artificial Intelligence are breaking down in order to address the problems that arise. Thus, building knowledge-based systems draws on knowledge engineering, natural language processing, machine learning and data mining; knowledge processing combines knowledge engineering with information retrieval, operations research and data analysis; modeling computing environments and their human or software actors brings together knowledge engineering and multi-agent systems; finally, knowledge engineering touches on themes that are classic in the human-computer interaction and technology-enhanced learning communities. By placing the user at the heart of computing systems in this way, knowledge engineering is called upon to assist the user in processing the mass of data available on the web: to visualize, contextualize, explain, recommend, rank, predict, approximate, secure and make these data reliable, and to compensate for their imprecision or even inconsistency.
The topics of IC 2014 are the following (non-exhaustive list):

Knowledge representation and reasoning
- models, formalisms, formal and informal knowledge representation languages
- exchange, interoperability and reuse of models, standardization
- terminologies, thesauri, ontologies and lexicons
- types of ontologies and role of reference ontologies
- methods and tools for the life cycle of representations
- management of ontology dynamics (versioning, evolution, integration and alignment)
- logical reasoning, inference, rule-based reasoning
- approximation, statistical reasoning, analogical reasoning, case-based reasoning, reasoning in non-classical logics

Design and construction of knowledge models and ontologies
- population of knowledge models
- knowledge acquisition from texts
- interactive knowledge discovery in data, in histories, in interactions, on the web, etc.
- inference and business rules: acquisition, design, reuse, evolution, evaluation, exploitation
- metrics and evaluation of ontology quality

Web of data, semantic web
- opening and interlinking of data
- impact of open data on the web and of crowdsourcing on knowledge engineering
- social networks
- data stream management
- scaling up: processing of massive data
- processing of heterogeneous data
- provenance of and trust in data
- metrics and evaluation of data quality

User-centered knowledge engineering
- visualization of knowledge and ontologies
- visualization of data interlinking and of reasoning over data
- explanation of reasoning
- user profiles, context models and adaptation, emotion models
- personalization of access to knowledge: trust, recommendation, ranking, summarization
- assistance for applications and services: conversational agents
- technology-enhanced learning environments
- design and evaluation of interactions and interfaces with knowledge-based systems

Interdisciplinarity of knowledge engineering
- epistemology of knowledge engineering
- organization theory and knowledge engineering
- humanities, cognitive sciences and knowledge engineering
- engineering of scientific knowledge

Pervasive knowledge engineering: knowledge engineering in...
- natural language processing
- symbolic learning
- data mining
- information retrieval, indexing
- operations research
- data analysis
- multi-agent systems
- technology-enhanced learning environments
- human-computer interfaces

Applications
- applications in the fields of health, education and learning, culture, document retrieval, intelligent transport, business intelligence (BI), etc.
- emergence of new application domains
- corporate memories
- systems for collaboration and cooperation in organizations and collectives

------------------------------------------------------------------------
Submissions:

Long papers (12 pages (new format), 20-minute oral presentation, 10 minutes of discussion): intended for mature work presenting validated results or in-depth surveys.

Short papers (6 pages (new format), 10-minute oral presentation, 5 minutes of discussion): intended for work in progress or work at the stage of partial research or applied results. This type of submission corresponds to innovative, high-quality work whose maturity does not yet allow full elaboration and definitive conclusions.

Papers must follow the document styles indicated on the conference website: http://www.irit.fr/IC2014. Authors are invited to submit their papers in PDF format on EasyChair: https://www.easychair.org/conferences/?conf=ic2014.

Accepted papers will appear in electronic proceedings, published on HAL and available on a USB key distributed to conference participants. An AFIA best paper award will be given by a jury made up of members of the conference steering committee.
------------------------------------------------------------------------
The Knowledge Engineering conference is held under the auspices of the IC college of AFIA.
------------------------------------------------------------------------
Program committee chair: Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)
Organizing committee chair: Catherine Roussey (Irstea)

From hamon at LIMSI.FR Tue Dec 10 20:49:11 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:49:11 +0100
Subject: These: Pierre Magistry, Unsupervised Word Segmentation and Wordhood Assessment
Message-ID:

Date: Tue, 10 Dec 2013 13:46:04 +0100
From: Pierre Magistry 馬基石
Message-ID:

Dear friends, professors, students and colleagues,

I am pleased to invite you to the defense of my thesis, entitled "Unsupervised Word Segmentation and Wordhood Assessment: the case for Mandarin Chinese" (French title: Segmentation en mots non-supervisée et estimation de la lexicalité : le cas du mandarin).

I carried it out in the Alpage team, at Université Paris-Diderot/INRIA, and will defend it on 19 December 2013 from 9:30, in the thesis room of Paris 7.

Salle des thèses 580F (Halle aux Farines)
10 rue Françoise Dolto
75013 Paris
(map: http://www.openstreetmap.org/?mlat=48.8295&mlon=2.3825#map=16/48.8295/2.3825 )
Take the entrance closest to the Seine.
The defense will be followed by a reception, to which you are also invited.

Best regards,
Pierre Magistry
----
The thesis committee:
PR. 謝舒凱 Hsieh Shukai (reviewer) - National Taiwan University
PR. Sylvain Kahane (supervisor) - Université Paris 10 Nanterre, MoDyCo
PR. Yves Lepage (reviewer) - Waseda University
PR. Marie-Claude Paris (co-supervisor) - Université Paris Diderot, LLF
DR. Benoit Sagot (co-supervisor) - INRIA, ALPAGE
DR. Pierre Zweigenbaum (chair) - CNRS, LIMSI
----
Thesis abstract:
This dissertation addresses unsupervised word segmentation and the assessment of the "wordhood" of forms. Modern Standard Chinese (Mandarin) is used for the experiments and evaluations. The first part deals with linguistic aspects. It presents the criteria commonly used to define the "word" in Chinese linguistics and shows their limits through a discussion grounded in general linguistics, addressing in particular the question of "multiword expressions" and its link with word segmentation problems. We then present the history of Chinese Word Segmentation (CWS) as a typical NLP task, and argue that a degree of arbitrariness in the annotation of evaluation corpora artificially favors supervised learning systems, whereas we consider unsupervised systems to be of greater interest for linguistics. The first part leads us to base our definition of wordhood on two criteria: strong combinatorial autonomy and a high degree of membership in a distributional class. The second part presents an unsupervised method for assessing the combinatorial autonomy of forms, inspired by Harris's hypotheses. With a simple and fast segmentation algorithm based solely on this measure, we obtain results close to the state of the art.
We then discuss the importance of preprocessing and present experiments using the Minimum Description Length (MDL) principle to improve our results. Finally, we propose a method and tools for a more qualitative evaluation of the analyses produced by our system. We also present some preliminary results on other languages that show the generality of our method.

From hamon at LIMSI.FR Tue Dec 10 20:50:08 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:50:08 +0100
Subject: Appel: Atelier SemDis 2014 (semantique distributionnelle), site web
Message-ID:
Date: Tue, 10 Dec 2013 14:35:48 +0100
From: Cécile Fabre
Message-ID: <52A718B4.6030600 at univ-tlse2.fr>
X-url: http://www.irit.fr/semdis2014/

The website describing the tasks organized for the SemDis 2014 workshop and providing the associated data is now available at: http://www.irit.fr/semdis2014/
----
Workshop objectives:
The CLLE-ERSS and IRIT-Melodi teams (Toulouse), organizers of the SemDis workshop at TALN 2013, which was devoted to approaches based on distributional semantic analysis, intend to organize a second edition of the workshop as part of TALN 2014 (subject to acceptance by the TALN program committee).
This year, the call for papers will be complemented by two tasks:
- a competitive lexical substitution task, a French adaptation of the SemEval 2007 "Lexical substitution" task as presented in McCarthy & Navigli (2010).
- an exploratory task for examining in more detail the results of distributional methods on a specialized corpus, the TALN corpus (2007-2013).
---
Schedule:
Subject to acceptance by the TALN program committee. Dates will be confirmed once the TALN workshop schedule is known.
Competitive task (lexical substitution):
- December 2013: release of trial data via the website
- 10 March 2014: release of test data
- 17 March 2014: submission deadline for participants
- early April 2014: announcement of results
- end of April 2014: deadline for submitting a paper to the workshop
- 26 May 2014: final versions due
- 1 July 2014: SemDis workshop
Exploratory task and call for papers outside the tasks:
- December 2013: description of task 2 and release of the corpus via the website
- April 2014: papers due
- 26 May 2014: final versions due
- 1 July 2014: SemDis workshop
----
Contact:
The organizers can be reached at: semdis2014 at gmail.com

From
hamon at LIMSI.FR Tue Dec 10 20:46:47 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:46:47 +0100
Subject: Stage: Transcription de l'oral, Projet europeen (3 a 6 mois)
Message-ID:
Date: Mon, 09 Dec 2013 11:20:26 +0100
From: "Catherine Bolly"
Message-ID:

[Internship] Transcription, spoken language - Marie Curie European project (3 to 6 months, 2014)

SPOKEN LANGUAGE TRANSCRIPTION (software: Praat)
Intern: full-time contract (duration: 3 months) or part-time (duration: 4 to 6 months)
Profile: Master's level
Contract start: February/March 2014

CONTEXT
The European project CorpAGEst studies the language of elderly people, based on the analysis of interviews recorded on audiovisual media (multimodal approach: text, sound, gesture). The analyses focus in particular on discourse markers and gestures whose communicative function is (inter)subjective (e.g. the discourse marker 'tu vois' ('you see') or a glance toward the interlocutor). The research project CorpAGEst "Approche sur corpus de la compétence pragmatique des personnes âgées" (a corpus-based approach to the pragmatic competence of elderly people) is a European project funded by the Marie Curie Actions under the 7th European Community Framework Programme (project No. 328282, FP7-PEOPLE-2012-IEF). Project duration: 2 years (2013-2015).

LOCATION
The internship will be coordinated within the research laboratory by Catherine Bolly (UMR 7023 "Structures Formelles du Langage", CNRS Paris 8 & research centre "Valibel – Discours et variation", UCL Louvain-la-Neuve).

TASKS
The multimodal corpus analysis method adopted involves building and annotating an audiovisual corpus: 8 subjects, approx. 13 hours. The intern will transcribe, using the Praat software, the audio data from the interviews conducted in the field. Transcription standards will follow standard French spelling and the transcription conventions of the Valibel centre (Université catholique de Louvain).
Lively discussions and exchanges within the team will help optimize the transcription procedure based on each member's experience.

PREREQUISITES
Familiarity with basic computer tools (word processors, keyboard, etc.). Knowledge of the Praat software is an asset. Experience in analysis involving spoken language is appreciated. Knowledge of transcription and annotation of spoken corpora would be a plus.

PROFILE
Master's student in the humanities (languages and literature, language sciences, communication sciences, natural language processing, ethnology, etc.). Interest in language sciences, spoken French and/or language aging. Excellent command of French.

DURATION AND CONDITIONS
3-month full-time internship starting February/March 2014, working with a researcher in language sciences. The internship may be done part-time (duration: 4 to 6 months). Flexible and adaptable hours. The work may be integrated into a Master's thesis (to be negotiated).

PAY
€436.05 net per month.

APPLICATION
Cover letter and CV.
To be sent by e-mail no later than 20 December 2013 to:
catherine.bolly at uclouvain.be
dominique_jean.boutet at orange.fr

From hamon at LIMSI.FR Tue Dec 10 20:51:53 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:51:53 +0100
Subject: Ecole: International Terminology Summer School (TSS 2014), Vienna, 14 to 18 July 2014
Message-ID:
Date: Tue, 10 Dec 2013 15:01:08 +0100
From: excellence
Message-ID:
X-url: http://www.termnet.org/english/events/tss_2014/index.php

Dear colleagues,

You can now register for the International Terminology Summer School (TSS 2014) that will take place next year in Vienna, from 14 to 18 July 2014.

TSS is a practice-oriented 5-day training course on terminology management. It addresses language professionals (translators, terminologists, localization experts, etc.), students and scientists who are looking for a comprehensive, state-of-the-art introduction to terminology theory and practice.

Reserve your place at TSS 2014 today! You may register until early June 2014. However, places are limited, so we advise you not to wait too long.
http://www.termnet.org/english/events/tss_2014/index.php

PROGRAM:
UNDERSTANDING TERMINOLOGY MANAGEMENT
- Introduction to Terminology Work
- Applied Principles of Terminology Work
- Retrieving and Validating Relevant Online Information for Terminology Work
TERMINOLOGY MANAGEMENT SKILLS
- Data Modelling: Data Categories for Terminology Management
- Creating a Terminology Database
- Terminology Tools - A closer look at Terminology Management Systems
- Terminology Workflows and Interfaces
TERMINOLOGY STRATEGIES FOR BUSINESS PROCESSES
- Three "P"s in Terminology Management: Project, Product, Process
- Case Studies: Terminology and Technical Documentation
- Case Studies: Legal Translation and Terminology
STANDARDS AND LEGAL ISSUES FOR TERMINOLOGY WORK
- Terminology Work for Literary Translation
- Terminology and other Knowledge Organization Systems
- Standards for Terminology Work
- Copyright Issues for Terminology Management

INSTRUCTORS:
Internationally renowned experts in terminology and terminology management will cover the TSS topics and will also take time throughout the week to help participants with their individual questions and problems.

ECQA CERTIFICATION FOR TERMINOLOGY MANAGERS:
The TSS2014 program is developed in cooperation with the ECQA initiative, which makes it possible for TSS2014 participants to obtain the ECQA Certificate for Terminology Managers by taking the official ECQA exam for the job role Certified Terminology Manager - Basic or Certified Terminology Manager - Advanced.

SOCIAL NETWORKING AND SIDE EVENTS:
TSS gives participants the opportunity to network and talk with peers and colleagues in a relaxed and resourceful environment during the side events planned for this purpose. Enjoy the Terminology Breakfasts every morning before classes, and take part in the "get to know each other" event on Monday after the first day of classes. Meet your colleagues at the different "after school" events during the week.
You can find all information about TSS2014 at: http://www.termnet.org/english/events/tss_2014/index.php (this website will be constantly updated in the next months with more and more information) or contact us (events at termnet.org). We will be happy to answer your questions. Please disseminate this information also among colleagues and friends who might be interested! Apologies for cross-posting. Kind regards, ------------------------------------------------------------ Your TermNet Events Team International Network for Terminology - TermNet Mooslackengasse 17, 1190 Vienna, Austria c/o bena office center doebling GmbH T: +43 1 23060 3965 F: +43 1 23060 3966 events at termnet.org - www.termnet.org ------------------------------------------------------------ TermNet - There is no Knowledge without Terminology - since 1988 ------------------------------------------------------------ From hamon at LIMSI.FR Tue Dec 10 20:47:47 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 10 Dec 2013 21:47:47 +0100 Subject: Appel: TOTh 2014 Message-ID: Date: Tue, 10 Dec 2013 10:18:05 +0100 From: Luc Damas Message-ID: X-url: http://www.porphyre.org/toth/ TOTh 2014 : APPEL A COMMUNICATIONS Terminologie & Ontologie : Théories et applications Chambéry – France Conférence : 12 & 13 juin 2014 Formation : 10 & 11 juin 2014 http://www.porphyre.org/toth/ http://www.porphyre.org/toth/toth-2014/appel-a-communications http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf Dates importantes : ------------------------- 10 février 2014 : Date limite de soumission des résumés (2 à 3 pages) 3 mars 2014: Notification d’acceptation Soumission par EasyChair : ----------------------------------- Les soumissions des résumés étendus (2 à 3 pages) se font exclusivement par EasyChair: http://www.easychair.org/conferences/?conf=toth2014 Présentation : ------------------ La 8ème édition des conférences internationales TOTh se déroulera les jeudi 12 et vendredi 13 juin 2014 à 
l’Université de Savoie (Chambéry, France) avec le soutien du Ministère de la Culture (DGLFLF) et de l’Association Européenne de Terminologie. Les conférences TOTh - Terminologie & Ontologie : Théories et applications - ont pour objectif de rassembler chercheurs, professionnels et, de façon plus générale, toute personne dont les préoccupations relèvent de la langue et de l’ingénierie des connaissances. Le programme comportera, en plus des communications retenues, une conférence invitée, une disputatio et une session poster. Une formation d’un jour et demi sur l’analyse de textes à des fins onomasiologiques précédera la conférence les mardi 10 et mercredi 11 juin. Un prix « Jeune chercheur » est décerné par le comité de programme lors de la conférence Thèmes de la Conférence (liste non exhaustive) : -------------------------------------------------------------- - Langues de spécialité et Lexicographie de spécialité, - Traduction, - Traitement de corpus, - Lexiques, Dictionnaires, Terminologies, - Aménagement linguistique et Aménagement terminologique, - Représentation, Extraction et Gestion des connaissances, - Systèmes conceptuels, - Ontologies, - Thesaurus, - Applications : Recherche d’information, Web des données, Bibliothèques numériques, Environnements de développement, Banques terminologiques, - Normes, Formats d’échange. 
TOTh 2014 http://www.porphyre.org/toth/ http://www.porphyre.org/toth/toth-2014/appel-a-communications http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf ------------------------------------------------- TOTh 2014: CALL FOR PAPERS Terminology & Ontology: Theories and applications Chambéry – France Conference: 12 and 13 June 2014 Training: 10 and 11 June 2014 http://www.porphyre.org/toth/ http://www.porphyre.org/toth/toth-2014-en/call-for-papers http://www.porphyre.org/toth/files/TOTh-2014-Call-for-papers.pdf DEADLINES: ------------------------------- 10 February 2014: Deadline for abstract submissions 3 March 2014: Notification of acceptance Paper submission via Easychair: ----------------------------------------- http://www.easychair.org/conferences/?conf=toth2014 Presentation: ------------------ The 8th TOTh International Conference will be held on Thursday 12th and Friday 13th June 2014 at the University of Savoie (in Chambéry, France) with the support of the French Ministry of Culture (DGLFLF) and of the European Association for Terminology. The TOTh Conferences – Terminology & Ontology: Theories and applications – bring together researchers, professionals and, more generally, all persons interested in issues related to language and knowledge engineering. The program will include accepted communications, a guest lecture, a disputatio, and a poster session. A one and a half day training session (in French) on text analysis for onomasiological purposes will be held just before the conference on Tuesday 10th and Wednesday 11th June. A Young Researcher Prize will be awarded by the Program Committee during the conference. 
Conference topics (non-exhaustive list): -------------------------------------------------- - Language for Special Purposes, Specialized Lexicography, - Translation, - Corpus Analysis, - Lexicon, Dictionary, Terminology, - Linguistic planning and Terminology planning, - Knowledge Representation, Extraction and Management, - Conceptual Systems, - Ontology, - Thesaurus, - Applications: Information retrieval, Web of data, Digital libraries, Software environments, Terminology banks, - Standards, Interchange formats. TOTh 2014 http://www.porphyre.org/toth/ http://www.porphyre.org/toth/toth-2014-en/call-for-papers http://www.porphyre.org/toth/files/TOTh-2014-Call-for-papers.pdf From hamon at LIMSI.FR Sat Dec 14 20:41:30 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 21:41:30 +0100 Subject: Revue: CORELA, vol. num.
2
Message-ID:
Date: Wed, 11 Dec 2013 08:58:20 +0100
From: Gilles Col
Message-Id:
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3050
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3080
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3029
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3120
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2977

Dear colleagues,

The editorial team of the journal CORELA - Cognition / Représentation / Langages is pleased to announce the publication of its latest issue (vol. 11, no. 2).

Submissions for the next issue of the journal (vol. 12, no. 1) should be sent to Sylvester OSU (sylvester.osu at wanadoo.fr) before 15 March 2014.

Contents of issue 11/2:

Philippe MULLER: "Cet obscur objet de désir : à propos de la construction want GN et de ses gloses".
http://corela.edel.univ-poitiers.fr/index.php?id=3050

Elisabet JOHANSSON-MANOURY: "Quelques précisions sur l'anaphore notionnelle".
http://corela.edel.univ-poitiers.fr/index.php?id=3080

Pierre FOURNIER: "Accentuation des composés savants en anglais contemporain : identification morphosémantique et enjeu phonologique".
http://corela.edel.univ-poitiers.fr/index.php?id=3029

Laurence VINCENT-DURROUX: "Les représentations cognitives : pour une mise en évidence de leur rôle dans l'appropriation des langues".
http://corela.edel.univ-poitiers.fr/index.php?id=3120

Chokri RHIBI: "Adverbes de localisation temporelle et enchaînement spatio-temporel : Le cas de dehors et autres expressions apparentées".
http://corela.edel.univ-poitiers.fr/index.php?id=2977

Gilles Col
Editorial Director

From hamon at LIMSI.FR Sat Dec 14 20:44:36 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 21:44:36 +0100
Subject: Appel: EACL 2014 Workshop on Cognitive Aspects of Computational Language Learning
Message-ID:
Date: Wed, 11 Dec 2013 09:18:32 +0100
From: Thierry Poibeau
Message-Id: <2DEB90F7-AE27-4B57-BD5C-E1082484D996 at ens.fr>

[Apologies for cross-postings]

==================================================
Second Call for Papers
==================================================
EACL 2014 Workshop on Cognitive Aspects of Computational Language Learning
April 26, 2014
Gothenburg, Sweden
https://sites.google.com/site/cognitivews2014/

Deadline for Paper Submissions: January 23rd, 2014 (11:59pm GMT -12)

Endorsed by the Special Interest Group of the ACL on Natural Language Learning (SIGNLL)
---------------------------------------------------------------
The human ability to acquire and process language has long attracted interest and generated much debate due to the apparent ease with which such a complex and dynamic system is learnt and used in the face of ambiguity, noise and uncertainty. This subject raises many questions ranging from the nature vs.
nurture debate of how much needs to be innate and how much needs to be learned for acquisition to be successful, to the mechanisms involved in this process (general vs specific) and their representations in the human brain. There are also developmental issues related to the different stages consistently found during acquisition (e.g. one word vs. two words) and possible organizations of this knowledge. These have been discussed in the context of first and second language acquisition and bilingualism, with cross linguistic studies shedding light on the influence of the language and the environment. The past decades have seen a massive expansion in the application of statistical and machine learning methods to natural language processing (NLP). This work has yielded impressive results in numerous speech and language processing tasks, including e.g. speech recognition, morphological analysis, parsing, lexical acquisition, semantic interpretation, and dialogue management. The good results have generally been viewed as engineering achievements. Recently researchers have begun to investigate the relevance of computational learning methods for research on human language acquisition and change. The use of computational modeling is a relatively recent trend boosted by advances in machine learning techniques, and the availability of resources like corpora of child and child-directed sentences, and data from psycholinguistic tasks by normal and pathological groups. Many of the existing computational models attempt to study language tasks under cognitively plausible criteria (such as memory and processing limitations that humans face), and to explain the developmental stages observed in the acquisition and evolution of the language abilities. In doing so, computational modeling provides insight into the plausible mechanisms involved in human language processes, and inspires the development of better language models and techniques. 
These investigations are very important since if computational techniques can be used to improve our understanding of human language acquisition and change, these will not only benefit cognitive sciences in general but will reflect back to NLP and place us in a better position to develop useful language models. Success in this type of research requires close collaboration between the NLP, linguistics, psychology and cognitive science communities. The workshop is targeted at anyone interested in the relevance of computational techniques for understanding first, second and bilingual language acquisition and language change in normal and clinical conditions. Long and short papers are invited on, but not limited to, the following topics: * Computational learning theory and analysis of language learning and organization * Computational models of first, second and bilingual language acquisition * Computational models of language changes in clinical conditions * Computational models and analysis of factors that influence language acquisition and use in different age groups and cultures * Computational models of various aspects of language and their interaction effect in acquisition, processing and change * Computational models of the evolution of language * Data resources and tools for investigating computational models of human language processes * Empirical and theoretical comparisons of the learning environment and its impact on language processes * Cognitively oriented Bayesian models of language processes * Computational methods for acquiring various linguistic information (related to e.g. speech, morphology, lexicon, syntax, semantics, and discourse) and their relevance to research on human language acquisition * Investigations and comparisons of supervised, unsupervised and weakly-supervised methods for learning (e.g. 
machine learning, statistical, symbolic, biologically-inspired, active learning, various hybrid models) from a cognitive perspective --------------------------------------------------------------- SUBMISSIONS We invite three different submission modalities: * Regular long papers (8 content pages + 1 page for references): Long papers should report on original, solid and finished research including new experimental results, resources and/or techniques. * Regular short papers (4 content pages + 1 page for references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion. * System demonstration (2 pages): System demonstration papers should describe and document the demonstrated system or resources. We encourage the demonstration of both early research prototypes and mature systems, that will be presented in a separate demo session. All submissions must be in PDF format and must follow the EACL 2014 formatting requirements (available at http://www.eacl2014.org/files/eacl-2014-styles.zip). We strongly advise the use of the provided Word or LaTeX template files. For long and short papers, the reported research should be substantially original. The papers will be presented orally or as posters. The decision as to which paper will be presented orally and which as poster will be made by the program committee based on the nature rather than on the quality of the work. Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well. Papers that do not conform to these requirements will be rejected without review. Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters. 
Submission and reviewing will be electronic, managed by the START system: https://www.softconf.com/eacl2014/CogACLL/ Submissions must be uploaded onto the START system by the submission deadline: January 23rd, 2014 (11:59pm GMT -12 hours) Please choose the appropriate submission type from the START submission page, according to the category of your paper. --------------------------------------------------------------- IMPORTANT DATES Jan 23, 2014 Long and Short Paper submission deadline Feb 05, 2014 System Demonstrations submission deadline Feb 20, 2014 Notification of acceptance Mar 03, 2014 Camera-ready deadline Apr 26, 2014 Workshop --------------------------------------------------------------- PROGRAM COMMITTEE Afra Alishahi Tilburg University (Netherlands) Colin J Bannard University of Texas at Austin (USA) Marco Baroni University of Trento (Italy) Robert Berwick Massachusetts Institute of Technology (USA) Philippe Blache LPL, CNRS (France) Jim Blevins University of Cambridge (UK) Antal van den Bosch Radboud University Nijmegen (Netherlands) Chris Brew Nuance Communications (USA) Ted Briscoe University of Cambridge (UK) Alexander Clark Royal Holloway, University of London (UK) Robin Clark University of Pennsylvania (USA) Stephen Clark University of Cambridge (UK) Matthew W. 
Crocker Saarland University (Germany) Walter Daelemans University of Antwerp (Belgium) Dan Dediu Max Planck Institute for Psycholinguistics (The Netherlands) Barry Devereux University of Cambridge (UK) Benjamin Fagard Lattice-CNRS (France) Jeroen Geertzen University of Cambridge (UK) Ted Gibson Massachusetts Institute of Technology (USA) Henriette Hendriks University of Cambridge (UK) Marco Idiart Federal University of Rio Grande do Sul (Brazil) Mark Johnson Brown University (USA) Aravind Joshi University of Pennsylvania (USA) Gianluca Lebani University of Pisa (Italy) Igor Malioutov Massachusetts Institute of Technology (USA) Marie-Catherine de Marneffe The Ohio State University (USA) Maria Alice Parente Federal University of ABC (Brazil) Massimo Poesio University of Trento (Italy) Brechtje Post University of Cambridge (UK) Ari Rappoport The Hebrew University of Jerusalem (Israel) Anne Reboul L2C2-CNRS (France) Kenji Sagae University of Southern California (USA) Sabine Schulte im Walde University of Stuttgart (Germany) Ekaterina Shutova University of California, Berkeley (USA) Maity Siqueira Federal University of Rio Grande do Sul (Brazil) Mark Steedman University of Edinburgh (UK) Suzanne Stevenson University of Toronto (Canada) Remi van Trijp Sony Computer Science Laboratory Paris (France) Shuly Wintner University of Haifa (Israel) Charles Yang University of Pennsylvania (USA) Beracah Yankama Massachusetts Institute of Technology (USA) Menno van Zaanen Tilburg University (Netherlands) Alessandra Zarcone University of Stuttgart (Germany) --------------------------------------------------------------- WORKSHOP ORGANIZERS AND CONTACT Alessandro Lenci (University of Pisa, Italy) Muntsa Padró (Federal University of Rio Grande do Sul, Brazil) Thierry Poibeau (LATTICE-CNRS, France) Aline Villavicencio (Federal University of Rio Grande do Sul, Brazil) For any inquiries regarding the workshop please send an email to cognitive2014 at gmail.com 
From hamon at LIMSI.FR Sat Dec 14 21:13:30 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:13:30 +0100 Subject: Appel: EACL 2014 Workshop on Continuous Vector Space Models and their Compositionality Message-ID: Date: Wed, 11 Dec 2013 12:35:49 +0100 From: Alexandre Allauzen Message-ID: <52A84E15.2010007 at limsi.fr> Second Call for Papers (Apologies for multiple postings) ************************************************************************ Workshop on Continuous Vector Space Models and their Compositionality (2nd edition) Co-located with EACL 2014, Gothenburg, Sweden April 27, 2014 Submission deadline: January 23, 2014 https://sites.google.com/site/cvscworkshop2014 ************************************************************************ INVITED SPEAKERS The workshop will showcase presentations from two invited speakers : Geoffrey Zweig (Microsoft Research) and Ivan Titov (University of Amsterdam). AIMS AND SCOPE In recent years, there has been a growing interest in algorithms that learn and use continuous representations for words, phrases, or documents in many natural language processing applications. Among many others, influential proposals that illustrate this trend include latent Dirichlet allocation, neural network based language models and spectral methods.
These approaches are motivated by improving the generalization power of standard discrete models, by dealing with the data sparsity issue and by efficiently handling a wide context. Despite their success, single-word vector space models are limited since they do not capture compositionality. This prevents them from gaining a deeper understanding of the semantics of longer phrases or sentences.

With the growing popularity of these neural and probabilistic methods of language processing, the scope of this second workshop is extended to theoretical and conceptual questions regarding:

* their relation to unsupervised distributional representations,
* the encompassing of the compositional aspects of formal models of semantics,
* the role of linguistic theory in the design and development of these methods.

Some pertinent questions include: Should phrase representations and word representations be of the same sort? Could different linguistic levels require different modeling approaches? Is compositionality determined by syntax, and if so, how do we learn/define it? Should word representations be fixed and obtained distributionally, or should the encoding be variable? Should word representations be task-specific, or should they be general?

In this workshop, we invite submissions of papers on continuous vector space models for natural language processing. Topics of interest include, but are not limited to:

* learning algorithms for continuous vector space models,
* their compositionality,
* their use in NLP applications,
* spectral learning for NLP,
* neural networks for NLP,
* phrase, sentence, and document-level distributional representations,
* tensor models,
* distributed semantic representations,
* the role of syntax in compositional models,
* formal and distributional semantic models.

SUBMISSION INFORMATION

Authors should submit a full paper of up to 8 pages in electronic PDF format, with up to 2 additional pages for references.
The reported research should be substantially original. The papers will be presented orally or as posters. All submissions must be in PDF format and must follow the EACL 2014 formatting requirements (http://www.eacl2014.org/files/eacl-2014-styles.zip). Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well.

Submissions must be made through the Softconf website set up for this workshop: https://www.softconf.com/eacl2014/CVSC/

Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters.

IMPORTANT DATES

23 January 2014: Submission deadline
20 February 2014: Notification of acceptance
3 March 2014: Camera-ready deadline
27 April 2014: Workshop

ORGANIZERS

Alexandre Allauzen (LIMSI-CNRS/Université Paris-Sud, France)
Raffaella Bernardi (University of Trento, Italy)
Edward Grefenstette (University of Oxford, UK)
Hugo Larochelle (Université de Sherbrooke, Canada)
Christopher Manning (Stanford University, USA)
Scott Wen-tau Yih (Microsoft Research, USA)

PROGRAM COMMITTEE

Nicholas Asher (IRIT-Toulouse)
Marco Baroni (University of Trento)
Yoshua Bengio (Université de Montréal)
Gemma Boleda (University of Texas)
Antoine Bordes (Université Technologique de Compiègne)
Johan Bos (University of Groningen)
Léon Bottou (Microsoft Research)
Xavier Carreras (Universitat Politècnica de Catalunya)
Lucas Champollion (New York University)
Stephen Clark (University of Cambridge)
Shay Cohen (Columbia University)
Ido Dagan (Bar Ilan University)
Ronan Collobert (IDIAP Research Institute, Switzerland)
Pino Di Fabbrizio (Amazon)
Georgiana Dinu (University of Trento)
Kevin Duh (Nara Institute of Science and Technology)
Dean Foster (University of Pennsylvania)
Alessandro Lenci (University of Pisa)
Louise McNally (Universitat Pompeu Fabra)
Fabio Massimo Zanzotto (Università degli Studi di Roma)
Mirella Lapata (University of
Edinburgh)
Andriy Mnih (Gatsby Computational Neuroscience Unit)
Larry Moss (Indiana University)
Diarmuid Ó Seaghdha (University of Cambridge)
Sebastian Pado (Universität Stuttgart)
Martha Palmer (University of Colorado)
John Platt (Microsoft Research)
Maarten de Rijke (University of Amsterdam)
Mehrnoosh Sadrzadeh (University of London)
Mark Steedman (University of Edinburgh)
Chung-chieh Shan (Indiana University)
Peter Turney (NRC)
Jason Weston (Google)
Guillaume Wisniewski (LIMSI-CNRS/Université Paris-Sud)

From hamon at LIMSI.FR Sat Dec 14 21:25:15 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 22:25:15 +0100
Subject: Stage: M2, Amelioration du module de desambiguisation du Moteur de Recherche Semantique d'inbenta
Message-ID:

Date: Thu, 12 Dec 2013 17:59:28 +0100
From: Manon Quintana
Message-ID:

*Company overview*
------------------------------
inbenta is a pioneering company in *Natural Language Processing and semantic search*. Building on these innovative concepts, inbenta has been developing software solutions for the websites of major corporate clients since 2005.

*Internship description*
------------------------------
Inbenta has developed an intelligent search engine called *Inbenta Semantic Search Engine* (ISSE).
The two main tasks of this engine are to analyze users' questions and to find the appropriate answer to each query by searching a knowledge base. A *syntactic and semantic disambiguation module* is built into our search engine. This module is very important because the solution depends on it to work properly. The purpose of the internship offered by inbenta is to improve the disambiguation module.

The internship tasks will be:
- Linguistic and editorial management of a dynamic FAQ project, so that the intern becomes familiar with the existing system
- Enrichment of the disambiguation module by adding rules, lexical descriptions and local grammars, plus evaluation of the work
- Investigation of improvements to the disambiguation module at the algorithmic level

*Profile sought*
------------------------------
We are looking for an enthusiastic, organized and conscientious person who is keen to join an international team.
The intern should also have the following skills:
- Studies in Natural Language Processing
- Excellent command of French and good written and oral communication in Spanish, English or Catalan

Bonus:
- Proficiency in at least one programming language (preferably PHP)
- Proficiency in regular expressions and SQL

*Position details*
------------------------------
- Internship of 5 to 6 months (with the possibility of being hired on a permanent contract, CDI)
- Expected pay: 30% of the SMIC (plus a performance-based bonus)
- Start: from February / March 2013
- Location: Toulouse

Please send your CV and cover letter to Manon Quintana at the following email address: *mquintana at inbenta.com*

From hamon at LIMSI.FR Sat Dec 14 21:57:07 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 22:57:07 +0100
Subject: Appel: ESWC 2014, Call for Demos
Message-ID:

Date: Fri, 13 Dec 2013 15:31:40 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20131213143200.7CF23DBB67 at vina.cines.fr>
X-url: http://2014.eswc-conferences.org/important-dates/call-demos

** apologies for cross-posting **

==== Call for Demos ====
http://2014.eswc-conferences.org/important-dates/call-demos

The demonstrations track of the ESWC 2014 complements the overall program of the conference with demonstrations of real-world, applied-and-tested advances of semantic
technologies. In addition to papers and posters presenting latest theoretical achievements, the ESWC 2014 is seeking demonstrations of novel applications of semantics in various sectors, including: eGovernment, eEnvironment, eMobility and smart cities, eHealth, Life Sciences, Sensor networks and RDF Stream Processing, Linked Science, Media and entertainment, Telecommunications, Cultural heritage, Financial services, Energy and utilities, Manufacturing, Digital libraries, Cloud applications, Personal Information Management, Decision support, Emergency and crisis management etc. The demonstrations track is an opportunity for researchers and practitioners to present their innovative prototypes, practical developments, on-going projects, lessons learned and late-breaking results, relevant for the topics of interest of the main conference and the research and poster tracks. Submissions to the demonstrations track should make clear what will be demonstrated, and in particular point out what makes the demonstration a novel showcase. A video of the demo should be provided if a link to the demonstrator is not available. Submission should also specify: * What is the research background and application context of the demonstration, and what makes this a novel showcase for semantic technologies. * What is the key technology used, and how does the demonstrated system, application or infrastructure relate to pre-existing work. * What exactly will be demonstrated? What exactly will a visitor of the demonstration learn? * Submission Guidelines * Submissions to the demonstrations track must be in the form of a textual description of the demonstration to be given at the conference (which may include screenshots and must include either a link to the online demo or a video presenting it). The submissions must be at most 5 pages long including reference list (Springer LNCS style) and submitted as PDF. 
Decisions about acceptance will be based on relevance to the Semantic Web area, originality, potential significance, topicality and clarity. Papers that exceed the given page length or do not follow the LNCS guidelines will be rejected without a review.

Submissions should be uploaded using the Easychair system: https://www.easychair.org/conferences/?conf=eswc2014pd

* Important Dates *

Submission Deadline: March 16, 2014
Notification of Acceptance: April 10, 2014
Camera-Ready Paper: April 25, 2014

* Important Notice *

Demonstrator papers will appear in supplementary post-conference proceedings to be published by Springer in the Lecture Notes in Computer Science series. Participants with an accepted demo paper must register for the conference and present their system during the Demo Session. A space for demonstration will be allocated for each participant. Participants should use their own laptop for the demos. The organizers should be contacted well in advance of the conference in case of any special requirement.

* Poster and Demo Chairs *

Raphael Troncy (Multimedia Department, EURECOM, FR)
Eva Blomqvist (Department of Computer and Information Science, Linkoping University, SE)

From hamon at LIMSI.FR Sat Dec 14 21:15:49 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 22:15:49 +0100
Subject: Journee: Ateliers de formation a TXM, session de janvier-fevrier 2014, Lyon
Message-ID:

Date: Wed, 11 Dec 2013 14:31:15 +0100
From: Serge Heiden
Message-ID: <52A86923.3040905 at ens-lyon.fr>
X-url: http://textometrie.ens-lyon.fr

Hello,

The next TXM workshops will take place in Lyon on the following dates:
- Tuesday, January 21: Advanced TXM;
- Thursday, February 27: Introduction to TXM;
- Friday, February 28: Corpus preparation and import into TXM.

TXM is a free (libre) tool for analyzing corpora of texts and transcriptions.
If you are interested in one of these sessions, please register as soon as possible (places are limited) by sending an email to textometrie <à> ens-lyon fr (registration is required even if you already responded to the Doodle poll).

Full information on the workshops: https://groupes.renater.fr/wiki/txm-users/public/ateliers_txm

We hope to see you there,
Serge Heiden, for the TXM team.

From hamon at LIMSI.FR Sat Dec 14 20:47:36 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 21:47:36 +0100
Subject: Appel: ESSLLI 2014 Student Session
Message-ID:

Date: Wed, 11 Dec 2013 11:44:38 +0100
From: Ronald de Haan
Message-ID:
X-url: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/
X-url: http://www.esslli2014.info/

Please forward to students. Apologies for the multiple messages.

*1st Call for Papers*

*ESSLLI 2014 STUDENT SESSION*
http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/

*Held during*
*The 26th European Summer School in*
*Logic, Language and Information*
Tübingen, Germany, August 11-22, 2014

*Deadline for submissions: April 1st, 2014*
https://www.easychair.org/conferences/?conf=essllistus2014

*ABOUT:*
The Student Session of the 26th European Summer School in Logic, Language, and Information (ESSLLI) will take place in Tübingen, Germany, on August 11-22, 2014.
We invite submissions of original, unpublished work from students in any area at the intersection of Logic & Language, Language & Computation, or Logic & Computation. Submissions will be reviewed by several experts in the field, and accepted papers will be presented orally or as posters and will appear in the student session proceedings. This is an excellent opportunity to receive valuable feedback from expert readers and to present your work to a diverse audience. *SEPARATE POSTER SESSION:* Note that there are two separate kinds of submissions, one for the oral presentations and one for the posters. This means that papers can be directly submitted as posters. Reviewing and ranking will be done separately. We particularly encourage submissions for posters, as they offer an excellent opportunity to present research in progress. More detailed guidelines and policies regarding submission can be found on the Student Session website: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ . Please direct inquiries about submission procedures or other matters relating to the Student Session to: dehaan at kr.tuwien.ac.at. For general inquiries about ESSLLI 2014, please consult the main ESSLLI 2014 page: http://www.esslli2014.info/. 
Kind regards,

*the ESSLLI 2014 Student Session Organization Committee*

*Chair:*
Ronald de Haan (Technische Universität Wien, Austria)

*LoCo co-chairs:*
Zoé Christoff (Universiteit van Amsterdam, the Netherlands)
Aybüke Özgün (Université de Lorraine, France)

*LoLa co-chairs:*
Philip Schulz (Universiteit van Amsterdam, the Netherlands)
Thomas Brochhagen (Universität Düsseldorf, Germany)

*LaCo co-chairs:*
Miriam Kaeshammer (Universität Düsseldorf, Germany)
Ramon Ziai (Universität Tübingen, Germany)

From hamon at LIMSI.FR Sat Dec 14 21:22:54 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 22:22:54 +0100
Subject: Stage: Master recherche appliquee en TAL, region parisienne
Message-ID:

Date: Thu, 12 Dec 2013 17:16:38 +0100
From: philippe.martin at stella-medica.fr
Message-ID:

Stella Medica is a recruitment firm that is well established in the medical sector. In partnership with an IT consulting company, we are developing recruitment management tools, both for our own needs and with a view to commercializing them. As part of this development, you will work on a module for managing the analysis of candidate data.
The algorithms to be integrated or developed will cover the following:
- For CV analysis:
  o Identification of the type and language of the document
  o Detection of information in the text such as (simple examples): last name, first name, date of birth, address, occupation and level of experience
- Detection of similar documents
- Summary generation, and derivation of this summary in several languages (English in particular)
- Improvement of the existing multi-criteria search engine
- Automated search of social networks, highlighting strengths or inconsistencies

You will work in a team made up mostly of software developers and will therefore need to be fairly autonomous. The team will be supervised by an experienced IT project manager, following a professional software development approach: CMMI process, and tools for project management, source sharing and test management. The development platform is mainly Microsoft: ASP.NET MVC in C#, C# components; SQL Server.

Ideally, you are looking for a final-year internship (Bac +5) in Natural Language Processing (TAL) or Language Engineering. The result of your work must be "convertible into a computer program", a phase that you may carry out yourself.

The internship has a flexible length of 4 to 6 months, starting from February 2014, and is located in Rueil-Malmaison. Pay depends on profile; meal vouchers; reimbursement of 50% of the Navigo pass.
Contact Philippe Martin with your CV, specifying your intended internship period: philippe.martin at stella-medica.fr

From hamon at LIMSI.FR Sat Dec 14 21:18:28 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 22:18:28 +0100
Subject: Appel: EACL 2014 Workshop on Multiword Expressions (MWE 2014)
Message-ID:

Date: Thu, 12 Dec 2013 14:53:39 +0100
From: Agata Savary
Message-ID: <52A9BFE3.3070802 at univ-tours.fr>
X-url: http://multiword.sourceforge.net/mwe2014

Second CALL FOR PAPERS

The 10th Workshop on Multiword Expressions (MWE 2014)
http://multiword.sourceforge.net/mwe2014

Workshop at EACL 2014 (Gothenburg, Sweden), April 26-27, 2014

Endorsed by the Special Interest Group on the Lexicon of the Association for Computational Linguistics (SIGLEX; http://www.siglex.org/); SIGLEX's Multiword Expressions Section (SIGLEX-MWE; http://multiword.sourceforge.net/PHITE.php?sitesig=MWE); and PARSEME, European IC1207 COST Action (http://www.parseme.eu)

Submission deadline: Long & short papers - January 23, 2014 at 11:59pm GMT -12 hours

-------------------------
Call For Papers

The term "multiword expression" covers a wide range of linguistic constructions such as idioms ("storm in a teacup", "sweep under the rug"), fixed phrases ("in vitro", "by and large", "rock'n roll"), noun compounds ("olive oil", "laser printer"), compound
verbs ("take a nap", "bring about"), etc. While easily mastered by native speakers, their interpretation poses a major challenge for computational systems, due to their flexible and heterogeneous nature. For a start, MWEs are not nearly as frequent in NLP resources as they are in real-world text, and this problem of coverage may impact the performance of many NLP tasks. Moreover, treating MWEs also involves problems like determining their semantics, which is not always compositional ("to kick the bucket" meaning "to die"). In sum, MWEs are a key issue and a current weakness for natural language parsing and generation, as well as real-life applications depending on language technology, such as machine translation, just to name a prominent one among many. Thanks to the joint efforts of researchers from several fields working on MWEs, significant progress has been made in recent years, especially concerning the construction of large-scale language resources. For instance, there is a large number of recent papers which focus on acquisition of MWEs from corpora, and others that describe a variety of techniques to find paraphrases for MWEs. Current methods use a plethora of tools such as association measures, machine learning, syntactic patterns, web queries, etc. A considerable body of techniques, resources and tools to perform these tasks are now available, and are indicative of the growing importance of the field within the NLP community. Many of these advances are described as part of the annual workshop on MWEs, which attracts the attention of an ever-growing community working on a variety of languages and MWE types. The workshop has been held since 2001 in conjunction with major computational linguistics conferences (ACL, EACL, NAACL, COLING, LREC), providing an important venue for the community to interact, share resources and tools and collaborate on efforts for advancing the computational treatment of MWEs. 
Additionally, special issues on MWEs have been published by leading journals in computational linguistics. The latest such effort is the special issue on "Multiword Expressions: from Theory to Practice and Use", which has recently been published by the ACM Transactions on Speech and Language Processing (http://multiword.sourceforge.net/tslp2011si).

MWE 2014 will be the 10th event in the series. We will be interested in major challenges in the overall process of MWE treatment, both from the theoretical and the computational viewpoint, focusing on original research related (but not limited) to the following topics:

* Manually and automatically constructed resources
* Representation of MWEs in dictionaries and ontologies
* MWEs and user interaction
* Multilingual acquisition
* Multilingualism and MWE processing
* Models of first and second language acquisition of MWEs
* Crosslinguistic studies on MWEs
* The role of MWEs in the domain adaptation of parsers
* Integration of MWEs into NLP applications
* Evaluation of MWE treatment techniques
* Lexical, syntactic or semantic aspects of MWEs

The workshop will feature a "Special Track on Parsing and MWEs" dedicated to "deep" parsing of MWEs, inviting submissions on the following and related challenges:

* Lexicon-grammar interface: representing, at the lexical level, phenomena such as agreement, discontinuity and free word order; construction of MWE lexicons which would be easily convertible and maximally reusable in different parsing frameworks.
* "Deep" parsing techniques for MWEs: optimal representation of MWEs within linguistic frameworks, such as CCG, HPSG, LFG, TAG, minimalism, etc.; processing MWEs before, during or after parsing; representing the semantics of MWEs.
* Hybrid parsing of MWEs: combining data-driven and knowledge-based approaches for efficient and linguistically precise parsers; using unannotated data for improving models based on annotated data.
* Annotating MWEs in treebanks: MWE-aware methodologies of treebank construction, and their increased usability in parsing. This special track is endorsed by PARSEME, European IC1207 COST Action, dedicated to parsing and MWEs (www.parseme.eu). PARSEME will fund travel and stay for at least 20 participants from the Action's member countries (the grant application procedure will be published shortly). A separate time slot will be allocated to the special track within the workshop. Authors may submit papers either to the special track or to the regular workshop. They should follow common submission instructions. Submission modalities For MWE 2014, we will accept the following two types of submissions: Regular long papers (8 content pages + 1 page for references): Long papers should report on solid and finished research including new experimental results, resources and/or techniques. Regular short papers (4 content pages + 1 page for references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion. The reported research should be substantially original. The papers will be presented orally or as posters. The decision as to which papers will be presented orally and which as posters will be made by the program committee based on the nature rather than on the quality of the work. All submissions must be in PDF format and must follow the EACL 2014 formatting requirements (available at http://www.eacl2014.org/files/eacl-2014-styles.zip). We strongly advise the use of the provided Word or LaTeX template files. Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well. Resources submitted with the papers should be anonymized for submission. Papers and/or resources that do not conform to these requirements will be rejected without review. 
Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters.

Submission and reviewing will be electronic, managed by the START system: https://www.softconf.com/eacl2014/MWE/

Submissions must be uploaded onto the START system by the submission deadline: January 23rd, 2014 (11:59pm GMT -12 hours)

Please choose the appropriate submission type from the starting submission page, according to the category of your paper.

Important dates

23 January 2014: Long & short paper submission deadline 11:59pm GMT -12 hours
20 February 2014: Notification of Acceptance
03 March 2014: Camera-ready papers due
26-27 April 2014: Workshop Dates

Program Committee

Iñaki Alegria, University of the Basque Country (Spain)
Dimitra Anastasiou, University of Bremen (Germany)
Doug Arnold, University of Essex (UK)
Eleftherios Avramidis, DFKI GmbH (Germany)
Tim Baldwin, University of Melbourne (Australia)
Núria Bel, Universitat Pompeu Fabra (Spain)
Chris Biemann, Technische Universität Darmstadt (Germany)
Francis Bond, Nanyang Technological University (Singapore)
Lars Borin, University of Gothenburg (Sweden)
António Branco, University of Lisbon (Portugal)
Miriam Butt, Universität Konstanz (Germany)
Aoife Cahill, ETS (USA)
Ken Church, IBM Research (USA)
Matthieu Constant, Université Paris-Est Marne-la-Vallée (France)
Paul Cook, University of Melbourne (Australia)
Béatrice Daille, Nantes University (France)
Koenraad De Smedt, University of Bergen (Norway)
Gaël Dias, University of Caen Basse-Normandie (France)
Gülşen Eryiğit, Istanbul Technical University (Turkey)
Tomaž Erjavec, Jožef Stefan Institute (Slovenia)
Joaquim Ferreira da Silva, New University of Lisbon (Portugal)
Roxana Girju, University of Illinois at Urbana-Champaign (USA)
Chikara Hashimoto, National Institute of Information and Communications Technology (Japan)
Ulrich Heid, Universität Hildesheim (Germany)
Kyo Kageura, University of Tokyo (Japan)
Su Nam Kim,
Monash University (VIC, Australia)
Ioannis Korkontzelos, University of Manchester (UK)
Brigitte Krenn, Austrian Research Institute for Artificial Intelligence (Austria)
Cvetana Krstev, University of Belgrade (Serbia)
Marie-Catherine de Marneffe, The Ohio State University (USA)
Takuya Matsuzaki, National Institute of Informatics (Japan)
Preslav Nakov, Qatar Computing Research Institute (Qatar)
Malvina Nissim, University of Bologna (Italy)
Joakim Nivre, Uppsala University (Sweden)
Diarmuid Ó Séaghdha, University of Cambridge (UK)
Jan Odijk, University of Utrecht (The Netherlands)
Yannick Parmentier, Université d'Orléans (France)
Pavel Pecina, Charles University in Prague (Czech Republic)
Scott Piao, Lancaster University (UK)
Adam Przepiórkowski, Institute of Computer Science, Polish Academy of Sciences (Poland)
Victoria Rosén, University of Bergen (Norway)
Carlos Ramisch, Aix-Marseille University (France)
Manfred Sailer, Goethe University Frankfurt am Main (Germany)
Magali Sanches Duran, University of São Paulo (Brazil)
Violeta Seretan, University of Geneva (Switzerland)
Ekaterina Shutova, University of California, Berkeley (USA)
Jan Šnajder, University of Zagreb (Croatia)
Pavel Straňák, Charles University in Prague (Czech Republic)
Sara Stymne, Uppsala University (Sweden)
Stan Szpakowicz, University of Ottawa (Canada)
Beata Trawinski, Institut für Deutsche Sprache (IDS), Mannheim (Germany)
Yulia Tsvetkov, Carnegie Mellon University (USA)
Yuancheng Tu, Microsoft (USA)
Ruben Urizar, University of the Basque Country (Spain)
Gertjan van Noord, University of Groningen (The Netherlands)
Aline Villavicencio, Federal University of Rio Grande do Sul (Brazil)
Veronika Vincze, Hungarian Academy of Sciences (Hungary)
Martin Volk, University of Zurich (Switzerland)
Tom Wasow, Stanford University (USA)
Shuly Wintner, University of Haifa (Israel)
Dekai Wu, The Hong Kong University of Science & Technology (Hong Kong)

Workshop Organizers

Valia Kordoni (Humboldt-Universität zu
Berlin, Germany)
Markus Egg (Humboldt-Universität zu Berlin, Germany)
Agata Savary (Université François Rabelais Tours, France; Special Track Organiser)
Eric Wehrli (Université de Genève, Switzerland; Special Track Organiser)
Stefan Evert (Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany)

Contact

For any inquiries regarding the workshop please send an email to mweworkshop.eacl2014 at gmail.com

From hamon at LIMSI.FR Sat Dec 14 21:27:38 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 22:27:38 +0100
Subject: Appel: EACL 2014 Workshop on Hybrid Approaches to Translation (3rd HyTra)
Message-ID:

Date: Thu, 12 Dec 2013 20:28:00 +0100
From: "Reinhard Rapp"
Message-ID: <2C7CD65A618846AA874D7EBEE8259000 at ASUSPC>
X-url: http://parles.upf.edu/llocs/plambert/hytra/hytra2014/

Apologies for multiple postings
Please distribute to colleagues

=================================================================
2nd Call for Papers

THIRD WORKSHOP ON HYBRID APPROACHES TO TRANSLATION (HyTra 2014)

Co-located with EACL 2014
http://eacl2014.org/
Gothenburg, Sweden
April 27, 2014

Deadline for paper submissions: January 23, 2014
http://parles.upf.edu/llocs/plambert/hytra/hytra2014/

INVITED SPEAKERS
Hans Uszkoreit (Saarland University and DFKI, Germany)
Joakim Nivre (Uppsala University, Sweden)
=================================================================
WORKSHOP DESCRIPTION The aim of the HyTra workshop series is to bring together researchers developing and applying statistical, example-based, or rule-based translation systems, and those enhancing MT systems by combining elements from different approaches, to promote discussion and sharing of ideas among them. Hereby one relevant focus is on effectively combining linguistic and data driven approaches (rule-based and statistical MT). Another focus is on hybridization in the context of human translation. The 3rd Workshop on Hybrid Approaches to Translation (HyTra-3) intends to continue developing and empowering the research agenda in the area of Hybrid Translation already started at its first and second editions. The previous two editions (see http://www-lium.univ-lemans.fr/esirmt-hytra/ and http://hytra.barcelonamedia.org/hytra2013/) were co-located with EACL 2012 in Avignon and with ACL 2013 in Sofia, and the proceedings were published on the ACL Anthology. TOPICS We solicit contributions including but not limited to the following topics: - ways and techniques of hybridization - architectures for the rapid development of hybrid MT systems - applications of hybrid systems - hybrid systems dealing with under-resourced languages - hybrid systems dealing with morphologically rich languages - using linguistic information (morphology, syntax, semantics) to enhance statistical MT (e.g. with hierarchical or factored models) - using contextual information to enhance statistical MT - bootstrapping rule-based systems from corpora - hybrid methods in spoken language translation - extraction of dictionaries and other large-scale resources for MT from parallel and comparable corpora - induction of morphological, grammatical, and translation rules from corpora - machine learning techniques for hybrid MT - describing structural mappings between languages (e.g. 
tree-structures using synchronous/transduction grammars) - heuristics for limiting the search space in hybrid MT - alternative methods for the fair evaluation of the output of different types of MT systems (e.g. relying on linguistic criteria) - system combination approaches such as multi-engine MT (parallel) or automatic post-editing (sequential) - open source tools and free language resources for hybrid MT SUBMISSIONS Contributions can be short or long papers. Short paper submissions must describe original and unpublished work without exceeding five pages of content plus one extra page for references. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding eight pages of content plus two extra pages for references. Submissions will be judged according to the criteria of the main conference (EACL 2014). Authors are invited to submit papers on original and previously unpublished work. Formatting should be according to EACL 2014 specifications using LaTeX or MS-Word style files, see section "submission format" at http://eacl2014.org/call-for-papers. Reviewing of papers will be double-blind, so the submissions should not reveal the authors' identity. Submission is electronic in PDF format using the START submission system at the following URL: https://www.softconf.com/eacl2014/HyTra/ Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop contact person (see below). If accepted, withdrawals are only possible within two days after notification. For an accepted paper to appear in the proceedings, at least one author must register for the workshop and actually present the paper. 
The papers will be published in the workshop proceedings which will be made available via the ACL Anthology. BEST PAPERS Authors of selected papers will be invited to contribute extended versions of their papers as book chapters for an edited volume on hybrid MT. IMPORTANT DATES January 23, 2014: Deadline for paper submission February 20, 2014: Notification of acceptance March 3, 2014: Camera ready papers due April 27, 2014: Workshop in Gothenburg ORGANIZERS Rafael E. Banchs (Institute for Infocomm Research, Singapore) Marta R. Costa-jussa (Institute for Infocomm Research, Singapore) Reinhard Rapp (Universities of Aix-Marseille and Mainz) Patrik Lambert (Pompeu Fabra University, Barcelona) Kurt Eberle (Lingenio GmbH, Heidelberg) Bogdan Babych (University of Leeds) CONTACT PERSON Rafael E. Banchs: rembanchs (at) i2r (dot) a-star (dot) edu (dot) sg PROGRAMME COMMITTEE * Ahmet Aker, University of Sheffield, UK * Bogdan Babych, University of Leeds, UK * Rafael E. Banchs, Institute for Infocomm Research, Singapore * Alexey Baytin, Yandex, Moscow, Russia * Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain * Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland * Michael Carl, Copenhagen Business School, Denmark * Marta R. Costa-jussa, Institute for Infocomm Research, Singapore * Oliver Culo, University of Mainz, Germany * Kurt Eberle, Lingenio GmbH, Heidelberg, Germany * Andreas Eisele, DGT (European Commission), Luxembourg * Marcello Federico, Fondazione Bruno Kessler, Trento, Italy * Christian Federmann, Language Technology Lab, DFKI, Saarbrücken, Germany * José A. R. Fonollosa, Universitat Politècnica de Catalunya, Barcelona, Spain * Maxim Khalilov, TAUS, Amsterdam, The Netherlands * Patrik Lambert, Pompeu Fabra University, Barcelona, Spain * Udo Kruschwitz, University of Essex, UK * Yanjun Ma, Baidu Inc., Beijing, China * José B. 
Mariño, Universitat Politècnica de Catalunya, Barcelona, Spain * Bart Mellebeek, University of Amsterdam, The Netherlands * Reinhard Rapp, Universities of Aix-Marseille, France, and Mainz, Germany * Anders Søgaard, University of Copenhagen, Denmark * Nasredine Semmar, CEA LIST, Fontenay-aux-Roses, France * Wade Shen, Massachusetts Institute of Technology, Cambridge, USA * Serge Sharoff, University of Leeds, UK * George Tambouratzis, Institute for Language and Speech Processing, Athens, Greece * Jörg Tiedemann, University of Uppsala, Sweden * Dekai Wu, The Hong Kong University of Science and Technology, Hong Kong, China From hamon at LIMSI.FR Sat Dec 14 21:57:55 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:57:55 +0100 Subject: Appel: ESWC 2014, Call for Posters Message-ID: Date: Fri, 13 Dec 2013 15:57:01 +0100 (CET) From: speroni at cs.unibo.it Message-Id: <20131213145722.2F128DDA89 at vina.cines.fr> X-url: http://2014.eswc-conferences.org/important-dates/call-posters X-url: http://2014.eswc-conferences.org ** apologies for cross-posting ** ==== Call for Posters ==== http://2014.eswc-conferences.org/important-dates/call-posters 11th ESWC Conference 2014 Dates: May 25 - 29, 2014 Venue: Anissaras, Crete, Greece Hashtag: #eswc2014 Feed: @eswc_conf Site: http://2014.eswc-conferences.org General Chair: Valentina Presutti (STLab, ISTC-CNR, IT)
Poster and Demo Chairs: - Raphael Troncy (Multimedia Department, EURECOM, FR) - Eva Blomqvist (Department of Computer and Information Science, Linkoping University, SE) The poster track of ESWC 2014 will provide an opportunity for presenting late-breaking results, on-going research projects, and speculative or innovative work in progress. Posters are intended to provide authors and participants with the ability to connect with each other and to engage in discussions about the work. We invite submissions relevant to the area of the Semantic Web that address, but are not limited to, the topics of the Research Track. Theoretical or technical posters, reports on semantic systems (however, demonstrations of such systems should be submitted in the demo track), descriptions of completed work, and work in progress are equally welcome. Papers submitted to the research track will NOT automatically be considered for the poster track. A separate poster submission must be made. For example, a poster can be used to present work that was insufficiently mature for the research track. * Submission Guidelines * Authors must submit a 4-page paper (PDF, Springer LNCS style) with a short abstract for evaluation. The paper must clearly demonstrate relevance to the Semantic Web and the topics of interest of ESWC 2014. Decisions about acceptance will be based on relevance to the Semantic Web, originality, potential significance, topicality and clarity. Papers that exceed the given page length or do not follow the LNCS guidelines will be rejected without a review. Submissions should be uploaded using the Easychair system: https://www.easychair.org/conferences/?conf=eswc2014pd * Important Dates * Submission Deadline: March 16, 2014 Notification of Acceptance: April 10, 2014 Camera-Ready Paper: April 25, 2014 * Important Notes * Poster papers will appear in supplementary post-conference proceedings to be published by Springer in the series Lecture Notes in Computer Science (LNCS). 
Participants with an accepted poster must register for the conference and present their work during the Poster Session. A space will be allocated for each participant. The organizers should be contacted well in advance of the conference in case of any special requirement. From hamon at LIMSI.FR Sat Dec 14 21:48:58 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:48:58 +0100 Subject: Info: Establishing the International Standard Language Resource Number (ISLRN) Message-ID: Date: Fri, 13 Dec 2013 16:45:44 +0100 From: ELRA ELDA Information Message-ID: <52AB2BA8.1030102 at elda.org> X-url: http://liste.cines.fr//ticket/61393947172170 X-url: http://www.elra.info/Establishing-the-ISLRN.html# /Press Release - Immediate - Paris, France, December 13, 2013/ *Establishing the International Standard Language Resource Number (ISLRN)* 12 major NLP organisations announce the establishment of the ISLRN, a Persistent Unique Identifier, to be assigned to each Language Resource. On November 18, 2013, 12 NLP organisations agreed to announce the establishment of the International Standard Language Resource Number (ISLRN), a Persistent Unique Identifier, to be assigned to each Language Resource. Experiment replicability, an essential feature of scientific work, would be enhanced by such a unique identifier. Set up by ELRA, LDC and AFNLP/Oriental-COCOSDA, the ISLRN Portal will provide unique identifiers using a standardised nomenclature, as a service free of charge for all Language Resource providers. It will be supervised by a steering committee composed of representatives of participating organisations and enlarged whenever necessary.
For more information on ELRA and the ISLRN, please contact: Khalid Choukri choukri at elda.org For more information on ELDA, please contact: Hélène Mazo mazo at elda.org (http://www.elra.info/Establishing-the-ISLRN.html#) ELRA 55-57, rue Brillat Savarin 75013 Paris (France) Tel.: +33 1 43 13 33 33 Fax: +33 1 43 13 33 30 *** About ELRA *** The European Language Resources Association (ELRA) is a non-profit making organisation founded by the European Commission in 1995, with the mission of providing a clearing house for Language Resources and promoting Human Language Technologies (HLT). To find out more about ELRA, please visit our web site: www.elra.info *** About NLP12 *** Alliance of Digital Humanities Organizations (ADHO), Association for Computational Linguistics (ACL), Asian Federation of Natural Language Processing (AFNLP), COLING Committee (ICCL), European Data Forum, European Language Resources Association (ELRA), International Association for Machine Translation (IAMT), International Committee for the Coordination & Standardisation of Speech Databases and Assessment Techniques (COCOSDA), International Speech Communication Association (ISCA), Linguistic Data Consortium (LDC), Oriental COCOSDA, Language Resource Management Agency (RMA) To find out more about NLP12, please read the NLP12 Paris Declaration (http://www.elra.info/NLP12-Paris-Declaration.html).
From hamon at LIMSI.FR Sat Dec 14 21:35:11 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:35:11 +0100 Subject: Appel: WIMS'14 (deadline extension) Message-ID: Date: Fri, 13 Dec 2013 10:23:11 +0100 From: Plantié Michel Message-ID: <52AAD1FF.5090101 at mines-ales.fr> X-url: http://wims14.csd.auth.gr *** apologies for cross-postings *** ======================================================================== Due to a number of requests we have decided to extend the paper submission deadlines NEW deadline for paper submission: *** January 20th 2014 *** ======================================================================== CALL FOR RESEARCH / APPLICATION & CASE STUDY PAPERS / POSTERS CALL FOR WORKSHOPS / TUTORIALS / DEMONSTRATIONS KEYNOTES ANNOUNCEMENT WIMS’14 4th International Conference on Web Intelligence, Mining and Semantics Thessaloniki, Greece, 2-4 June 2014 http://wims14.csd.auth.gr About WIMS’14 Conference -------------------------------------- WIMS is a series of peer-reviewed International Computer Science conferences.
It is a forum for researchers and practitioners to present their state-of-the-art results in building Intelligent Web, to examine performance characteristics of various approaches in Web-based intelligent information technology, and to cross-fertilize their ideas on the development of Web-based intelligent information management solutions across different domains. Objectives -------------- The purpose of the WIMS series is: - To provide a forum for established researchers and practitioners to present their contributions to the state of the art research and development in Web technology and applications - To give doctoral students an opportunity to present their research to a friendly and knowledgeable audience and receive valuable feedback - To provide an informal social event where Web technology researchers and practitioners can meet Call for Research Papers/Posters -------------------------------------------- WIMS’14 is intended to foster the dissemination of state-of-the-art research in the area of Web intelligence, Web mining, Web semantics and the fundamental interaction between them. Authors are invited to submit regular (12 pages) or short (6 pages) research papers or posters (4 pages) on all related areas. Papers exploring new directions or areas will receive a thorough and encouraging review. More details on the nature of the different submission types can be found at: http://wims14.csd.auth.gr/?page_id=23 The papers in all the categories should describe original results that have not been accepted or submitted for publication elsewhere. All submissions will be evaluated by at least three members of the international program committee. 
Submissions should be made electronically in PDF or DOC/DOCX (MS/Open Word) format via the electronic submission system of the WIMS’14 Conference Management system at https://www.easychair.org/conferences/?conf=wims14 The submissions should be typeset using the templates of the ACM International Conference Proceedings Series (ICPS). The templates could be retrieved from http://www.acm.org/sigs/publications/proceedings-templates Call for Application and Case Study Track Papers ---------------------------------------------------------------- The goal of this track is to provide opportunities for researchers, innovators, industry leaders, and practitioners to present and exchange ideas, results, and experiences in the applications of Web Intelligence, Mining, and Semantics (WIMS). The Application & Case Studies Track invites submissions describing implementations of Web Intelligence, Mining, and Semantics in industrial or government or other ‘real world’ settings. Papers must include descriptions of how the applications have been conceived, developed, and (probably) deployed. Details about the submission types and the submission procedure of this track can be found at: http://wims14.csd.auth.gr/?page_id=154 Call for Tutorials/Demonstration Proposals ---------------------------------------------------------- WIMS’14 also welcomes Tutorial and/or Demonstration proposals. WIMS’14 will include tutorials and/or demonstrations (on new applications) providing in-depth background on subjects that are of broad interest to the Web Mining and Web Semantics community. The tutorials will be part of the main conference technical program. Proposal papers must be no less than 5 pages and no more than 12 pages, and must provide a sense of both the scope of the tutorial and depth within the scope. 
Details about the submissions of the tutorial/demonstration proposals can be found at: http://wims14.csd.auth.gr/?page_id=148 Call for Workshop Proposals -------------------------------------- WIMS’14 invites proposals for workshops to be held in conjunction with the conference. Organizers may structure workshops as they see fit, possibly including invited talks, panel discussions, presentations of work in progress, peer-reviewed papers, or some combination. A workshop could be scheduled for half a day or a full day, depending on potential interest and the organizers' preference. The organizers of successful workshops will be responsible for their own reviewing process and website, and for co-ordinating the collection and delivery of camera-ready material and ACM copyright transfer. Accepted papers in workshops will be published in the WIMS’14 proceedings and disseminated through the ACM Digital Library. Selected extended papers will be invited to appear in a special issue of a reputed journal in the field. Details about the submissions of the workshop proposals can be found at: http://wims14.csd.auth.gr/?page_id=150 Conference Scope ------------------------- WIMS solicits regular and work-in-progress research, discussion papers and industry experience report papers in related fields. Papers exploring new directions or areas are also welcome.
In particular, but not exclusively, submissions within the following major areas are relevant: - Scalable Web and Data Architectures and Infrastructures - Web Intelligence - Web Mining, Information and Knowledge Extraction - Web Semantics and Reasoning - WIMS Applications - Evaluation and Validation of WIMS Technologies and Applications More details about the conference scope and topics can be found at: http://wims14.csd.auth.gr/?page_id=146 Publications ---------------- Accepted papers/tutorials/posters will be published by ACM and disseminated through the ACM Digital Library in the International Conference Proceedings Series (ICPS). A selection of the best WIMS'14 papers will be invited to be revised and extended for post-conference publication in the Special Issue of the International Journal on Artificial Intelligence Tools (IJAIT). The proceedings of the previous WIMS conferences can be found at: - WIMS’11: https://dl.acm.org/citation.cfm?id=1988688 - WIMS’12: https://dl.acm.org/citation.cfm?id=2254129 - WIMS’13: https://dl.acm.org/citation.cfm?id=2479787 Keynotes ------------- WIMS’14 organizing committee is happy to announce the following significant keynote speakers for this year’s conference: - “Large-Scale Reasoning with (Semantic) Data”: Grigoris Antoniou, Professor, University of Huddersfield - “Semantic Technology for online, broadcast and print media”: Jem Rayfield, Head Architect, Financial Times - “Big Data – from Hype to Reality”: Richard Benjamins, Dr., Director of Business Intelligence, Telefonica Important Dates --------------- 13.01.2014 submission of abstracts for Application and Case Study Track Papers ** NEW ** 20.01.2014 submission of Research Track papers & Application and Case Study Track papers & posters ** NEW ** 20.12.2013 Workshop & Tutorial Proposals submission, review and notification 24.03.2014 notification of acceptance for (all tracks) papers/posters ** NEW ** 07.04.2014 camera ready versions of the accepted (all
tracks) papers, posters, tutorial papers ** NEW ** 14.04.2014 author registration deadline ** NEW ** 02-04.06.2014 Conference Organization ----------------- WIMS'14 is organized under the auspices of the Dept. of Informatics at the Aristotle University of Thessaloniki, Greece. (http://www.csd.auth.gr/en/) WIMS Conferences Chair Rajendra Akerkar, Western Norway Research Institute, Norway General Chairs Ioannis Vlahavas, Aristotle University of Thessaloniki, Greece Takahira Yamaguchi, Keio University, Japan Program Committee Chairs Nick Bassiliades, Aristotle University of Thessaloniki, Greece Vadim Ermolayev, Zaporizhzhya National University, Ukraine Advisory Committee Grigoris Antoniou, University of Huddersfield, UK Harold Boley, Faculty of Computer Science, University of New Brunswick, Fredericton, Canada James Hendler, Rensselaer Polytechnic Institute, Troy, NY, USA Guus Schreiber, VU University, Amsterdam, the Netherlands Amit Sheth, Ohio Center of Excellence on Knowledge-enabled Computing, Ohio, USA Workshops and Tutorials Chairs Costin Badica, University of Craiova, Romania Dimitris Vrakas, Aristotle University of Thessaloniki, Greece Industrial Track Chair John Davies, BT Research & Innovation, UK Publicity Chair Athena Vakali, Aristotle University of Thessaloniki, Greece Local Organization Co-Chairs Yannis Manolopoulos, Aristotle University of Thessaloniki, Greece Athanasios Tsadiras, Aristotle University of Thessaloniki, Greece Web Chairs Efstratios Kontopoulos, International Hellenic University, Greece Fotis Kokkoras, T.E.I. 
of Thessaly, Greece From hamon at LIMSI.FR Sat Dec 14 21:49:50 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:49:50 +0100 Subject: Livre: Ressources Lexicales Message-ID: Date: Fri, 13 Dec 2013 19:19:24 +0100 From: Nuria Gala Message-ID: <52AB4FAC.30805 at lif.univ-mrs.fr> X-url: http://benjamins.com/#catalog/books/lis.30/main Ressources Lexicales: Contenu, construction, utilisation, évaluation (Lexical resources: content, construction, use, evaluation) Edited by Núria Gala and Michael Zock, Aix-Marseille Université & LIF-CNRS http://benjamins.com/#catalog/books/lis.30/main Lexical resources (dictionaries, databases, thesauri, etc.) bring together knowledge about words, their meanings and their uses. For centuries they depended on printing and the textual format, but today a wide variety of tools and resources are available in various electronic formats. As a result, the way lexical resources are viewed has changed considerably in recent decades. In particular, resources have appeared that are no longer designed as static entities but are modelled as databases or graphs in which information is linked and dynamically accessible.
The field of lexical resources, at the crossroads of several disciplines including linguistics, lexicology, lexicography and natural language processing, is undoubtedly thriving. The aim of this volume is to give a general overview that reflects both the current state of the field and the developments under way. From hamon at LIMSI.FR Tue Dec 17 21:15:19 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:15:19 +0100 Subject: Appel: WoLLIC 2014 Message-ID: Date: Mon, 16 Dec 2013 08:09:25 -0200 From: Ruy de Queiroz Message-ID: X-url: http://wollic.org/wollic2014/ [Please circulate. Apologies for any cross-postings.]
WoLLIC 2014 21st Workshop on Logic, Language, Information and Computation September 1st to 4th, 2014 Valparaiso, Chile SCIENTIFIC SPONSORSHIP Interest Group in Pure and Applied Logics (IGPL) The Association for Logic, Language and Information (FoLLI) Association for Symbolic Logic (ASL) European Association for Theoretical Computer Science (EATCS) European Association for Computer Science Logic (EACSL) Sociedade Brasileira de Computação (SBC) Sociedade Brasileira de Lógica (SBL) ORGANISATION Department of Computer Science, Universidad de Chile, Chile Department of Computer Science, Pontificia Universidad Católica de Chile, Chile Centro de Informática, Universidade Federal de Pernambuco, Brazil HOSTED BY Department of Informatics, Universidad Técnica Federico Santa María, Chile CALL FOR PAPERS WoLLIC is an annual international forum on inter-disciplinary research involving formal logic, computing and programming theory, and natural language and reasoning. Each meeting includes invited talks and tutorials as well as contributed papers. The twenty-first WoLLIC will be held at the Universidad Técnica Federico Santa María, from September 1st to 4th, 2014. It is sponsored by the Association for Symbolic Logic (ASL), the Interest Group in Pure and Applied Logics (IGPL), the Association for Logic, Language and Information (FoLLI), the European Association for Theoretical Computer Science (EATCS), the European Association for Computer Science Logic (EACSL), the Sociedade Brasileira de Computação (SBC), and the Sociedade Brasileira de Lógica (SBL). PAPER SUBMISSION Contributions are invited on all pertinent subjects, with particular interest in cross-disciplinary topics.
Typical but not exclusive areas of interest are: foundations of computing and programming; novel computation models and paradigms; broad notions of proof and belief; proof mining, type theory, effective learnability; formal methods in software and hardware development; logical approach to natural language and reasoning; logics of programs, actions and resources; foundational aspects of information organization, search, flow, sharing, and protection. Proposed contributions should be in English, and consist of a scholarly exposition accessible to the non-specialist, including motivation, background, and comparison with related works. They must not exceed 10 pages (in font 10 or higher), with up to 5 additional pages for references and technical appendices. The paper's main results must not be published or submitted for publication in refereed venues, including journals and other scientific meetings. It is expected that each accepted paper be presented at the meeting by one of its authors. Papers must be submitted electronically at the WoLLIC 2014 EasyChair website. (Please go to http://wollic.org/wollic2014/instructions.html for instructions.) A title and single-paragraph abstract should be submitted by Mar 24, 2014, and the full paper by Mar 28, 2014 (firm date). Notifications are expected by May 2, 2014, and final papers for the proceedings will be due by May 15, 2014 (firm date). PROCEEDINGS The proceedings of WoLLIC 2014, including both invited and contributed papers, will be published in advance of the meeting as a volume in Springer's LNCS series. In addition, abstracts will be published in the Conference Report section of the Logic Journal of the IGPL, and selected contributions will be published as a special post-conference WoLLIC 2014 issue of a scientific journal (to be confirmed). 
INVITED SPEAKERS Verónica Becher (U Buenos Aires) Juha Kontinen (U Helsinki) Aarne Ranta (U Gothenburg) Kazushige Terui (U Kyoto) Luca Viganò (King’s College London) Thomas Wilke (U Kiel) STUDENT GRANTS ASL sponsorship of WoLLIC 2014 will permit ASL student members to apply for a modest travel grant (deadline: May 1st, 2014). See http://www.aslonline.org/studenttravelawards.html for details. IMPORTANT DATES Mar 24, 2014: Paper title and abstract deadline Mar 28, 2014: Full paper deadline May 2, 2014: Author notification May 15, 2014: Final version deadline (firm) PROGRAMME COMMITTEE Natasha Alechina (U Nottingham) Eric Allender (Rutgers U) Marcelo Arenas (PUC Chile) Steve Awodey (CMU) Julian Bradfield (U Edinburgh) Xavier Caicedo (U de Los Andes) Olivier Danvy (Aarhus U) Ulrich Kohlenbach (Tech U Darmstadt) (CHAIR) Marcus Kracht (U Bielefeld) Michiel van Lambalgen (U Amsterdam) Klaus Meer (Tech U Cottbus) George Metcalfe (Bern U) Dale Miller (INRIA Saclay / LIX) Russell Miller (CUNY) Sara Negri (U Helsinki) Nicole Schweikardt (U Frankfurt) (more to come) STEERING COMMITTEE Samson Abramsky, Johan van Benthem, Anuj Dawar, Joe Halpern, Wilfrid Hodges, Daniel Leivant, Leonid Libkin, Angus Macintyre, Grigori Mints, Luke Ong, Hiroakira Ono, Ruy de Queiroz. ORGANISING COMMITTEE Pablo Barceló (Universidad de Chile) (Local chair) Anjolina G. de Oliveira (U Fed Pernambuco) Ruy de Queiroz (U Fed Pernambuco) (co-chair) Juan Reutter (Pontificia Universidad Católica de Chile) Cristián Riveros (Pontificia Universidad Católica de Chile) FURTHER INFORMATION Contact one of the Co-Chairs of the Organising Committee.
WEB PAGE http://wollic.org/wollic2014/ From hamon at LIMSI.FR Tue Dec 17 21:20:08 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:20:08 +0100 Subject: Appel: IC 2014, Extension de l'appel a soumission jusqu'au 9 janvier 2014 Message-ID: Date: Mon, 16 Dec 2013 12:50:01 +0100 From: Catherine Faron Zucker Message-ID: <52AEE8E9.1010403 at i3s.unice.fr> X-url: http://www.irit.fr/IC2014 Call for papers for the 25es Journées Francophones d'Ingénierie des Connaissances (25th French-language Knowledge Engineering Days): Pervasive knowledge: users immersed in knowledge-based systems, and knowledge-based systems immersed in the world http://www.irit.fr/IC2014 12 to 16 May 2014 in Clermont-Ferrand ------------------------------------------------------------------------ Important dates: Paper submission deadline: Thursday 9 January 2014 (extended from Thursday 19 December 2013) Notification to authors: Thursday 13 February 2014 Deadline for final versions: Monday 17 March 2014 Conference dates: 12 to 16 May 2014 ------------------------------------------------------------------------ The Ingénierie des Connaissances conference celebrates its 25th anniversary this year! It is the annual meeting of the French and French-speaking community interested in knowledge engineering issues.
Académiques et industriels s'y retrouvent pour échanger et réfléchir sur des problèmes de recherche qui se posent en acquisition, représentation et gestion des connaissances. L'essor des sciences et technologies de l'information et de la communication, et notamment des technologies du web, dans l'ensemble de la société engendre des mutations dans les pratiques individuelles et collectives. L'ingénierie des connaissances accompagne cette évolution, en inventant les modèles, méthodes et outils permettant l'intégration de connaissances et de raisonnements sur ces connaissances dans des environnements informatiques. De fait, on constate aujourd'hui une omniprésence des connaissances, en même temps qu'un décloisonnement des communautés de recherche en Intelligence Artificielle pour répondre aux problèmes qui se posent. Ainsi, la construction des systèmes à base de connaissances fait appel à l'ingénierie des connaissances, au traitement automatique des langues, à l'apprentissage automatique et à la fouille de données; le traitement des connaissances associe ingénierie des connaissances avec recherche d'information, recherche opérationnelle, analyse de données; la modélisation des environnements informatiques, de ses acteurs humains ou logiciels, allie ingénierie des connaissances et systèmes multi-agents; enfin l'ingénierie des connaissances touche à des thématiques classiques dans les communautés de l'interaction homme-machine et des environnements informatiques pour l'apprentissage humain. En plaçant ainsi l'utilisateur au coeur des systèmes informatiques, l'ingénierie des connaissances est sollicitée pour assister l'utilisateur dans le traitement de la masse de données disponibles sur le web, pour les visualiser, contextualiser, expliquer, recommander, classer, prédire, approximer, sécuriser, fiabiliser, pour pallier à leur imprécision voire incohérence. 
Les thèmes d'IC 2014 sont les suivants (liste non exhaustive): Représentation des connaissances et raisonnement - modèles, formalismes, langages formels et informels de représentation de connaissances - échange, interopérabilité et réutilisation des modèles, standardisation - terminologies, thesauri, ontologies et lexiques - types d'ontologies et rôle des ontologies de référence - méthodes et outils pour le cycle de vie des représentations - gestion de la dynamique des ontologies (versionnement, évolution, intégration et alignement) - raisonnements logiques, inférences, raisonnements à base de règles, - approximations, raisonnements statistiques, raisonnements par analogie, raisonnements à partir de cas, raisonnements dans les logiques non classiques Conception et construction de modèles de connaissances et d'ontologies - peuplement de modèles de connaissances - acquisition de connaissances à partir de textes - découverte interactive de connaissances dans les données, dans les historiques, dans les interactions, sur le web, etc. 
- inférence et règles métier : acquisition, conception, réutilisation, évolution, évaluation, exploitations - métrique et évaluation de la qualité des ontologies Web de données, web sémantique - ouverture et interconnexion des données - impact de l'ouverture des données sur le web et de l'externalisation ouverte (crowdsourcing) sur l'ingénierie des connaissances - réseaux sociaux - gestion de flux de données - passage à l'échelle : traitement de données massives - traitement de données hétérogènes - provenance et confiance dans les données - métrique et évaluation de la qualité des données Ingénierie des connaissances centrée utilisateurs - visualisation de connaissances, d'ontologies - visualisation d'interconnexions de données, de raisonnements sur les données - explication des raisonnements - profils utilisateurs, modèles de contexte et adaptation, modèles d'émotion - personnalisation de l'accès aux connaissances: confiance, recommandation, classement (ranking), résumé - assistance d'applications et de services: agents conversationnels - environnements informatiques pour l'apprentissage humain - conception et évaluation d'interactions et d'interfaces avec des systèmes à base de connaissances Interdisciplinarité de l'ingénierie des connaissances - épistémologie de l'ingénierie des connaissances - théorie des organisations et ingénierie des connaissances - sciences humaines et sciences cognitives et ingénierie des connaissances - ingénierie des connaissances scientifiques Ingénierie des connaissances pervasive : IC dans... 
- le traitement automatique des langues - l'apprentissage symbolique - la fouille de données - la recherche d'information, l'indexation - la recherche opérationnelle - l'analyse de données - les systèmes multi-agents - les environnements informatiques pour l'apprentissage humain - les interfaces homme-machine Applications - applications dans les domaines de la santé, de l'éducation et de l'apprentissage, de la culture, de la recherche documentaire, des transports intelligents, de l'informatique décisionnelle (BI), etc. - émergence de nouveaux domaines applicatifs - mémoires d'entreprises - systèmes pour la collaboration et la coopération dans les organisations et les collectifs ------------------------------------------------------------------------ Soumissions : Articles longs(12 pages (nouveau format), présentation orale de 20 mn, 10 mn discussion) : Sont concernés des travaux aboutis, présentant des résultats validés ou des synthèses fouillées. Articles courts(6 pages (nouveau format), présentation orale de 10 mn, 5 mn discussion) : Sont concernés des travaux en cours ou au stade de résultats partiels de recherche ou appliqués. Ce type de soumission correspond à des travaux innovants de qualité, mais dont la maturité ne permet pas des élaborations complètes et des conclusions définitives. Les articles devront respecter les styles de documents indiqués sur le site de la conférence: http://www.irit.fr/IC2014. Les auteurs sont invités à soumettre leurs communications au format PDF sur EasyChair: https://www.easychair.org/conferences/?conf=ic2014. Les articles acceptés feront l'objet d'actes en version électronique, publiés sur HAL, et disponibles sur une clé USB distribuée aux participants de la conférence. Un prix AFIA du meilleur article sera décerné par un jury composé de membres du comité de pilotage de la conférence. 
------------------------------------------------------------------------
The Ingénierie des Connaissances conference is held under the auspices of the IC college of AFIA.
------------------------------------------------------------------------

Programme committee chair: Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)
Organising committee chair: Catherine Roussey (Irstea)

From hamon at LIMSI.FR Tue Dec 17 21:23:28 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 17 Dec 2013 22:23:28 +0100
Subject: Appel: 8eme colloque international du GEM&L
Message-ID:
Date: Tue, 17 Dec 2013 10:09:24 +0100
From: Luc Damas
Message-ID:
X-url: http://geml.eu/colloque/colloque-2014-toulouse-business-school/call-for-papers/

8th international GEM&L conference
Toulouse Business School
Toulouse, 26-28 March 2014

Call for Papers

Language and corporate performance (Langage et performance de l’entreprise)
http://geml.eu/colloque/colloque-2014-toulouse-business-school/call-for-papers/

Schedule
Paper proposal: 10 January 2014
Notification of acceptance / revision: 20 February 2014
Final paper: 10 March 2014

From hamon at LIMSI.FR Tue Dec 17 21:22:04 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 17 Dec 2013 22:22:04 +0100
Subject: Info: EMCL Study grants
Message-ID:
Date: Mon, 16 Dec 2013 16:42:38 +0100
From: Bertram Fronhöfer
Message-ID: <52AF1F6E.40304 at tu-dresden.de>
X-url: http://www.emcl-study.eu/home.html
X-url: http://www.emcl-study.eu/fileadmin/emcl_booklet_tree/ma_em_grant.html

------------------------------------------------------------------------
Dear all,

I'd like to draw your attention to the fact that new Erasmus Mundus scholarships are available for non-European AND European students who enrol in our European Master's Program in Computational Logic in the fall of 2014. The deadline for application is 31 January 2014. More details are given below.

In particular, we are able to provide grants to EU students for doing their project at National ICT Australia (NICTA).

Please spread this information as widely as possible among friends and colleagues, at your former universities and in the places where you currently live and work.

Many thanks
-- Steffen

************************************************************************
The European Master's Program in Computational Logic

We are glad to announce the opportunity to join our European Master's Program in Computational Logic.
This program is offered jointly at the Free University of Bozen-Bolzano in Italy, the Technische Universität Dresden in Germany, the Universidade Nova de Lisboa in Portugal and the Technische Universität Wien in Austria. Within this program you can choose to study at two or three of the four European universities. In addition, you can do your project work at National ICT Australia (NICTA). You will graduate with an MSc in Computer Science and obtain a joint degree.

Information on the universities and the program, including the application procedure, is provided here: http://www.emcl-study.eu/home.html

The language of instruction is English. Tuition fees are 3.000 EUR per year for non-European students and 1.000 EUR per year for European students.

We would like to draw your attention to the ERASMUS-MUNDUS scholarship program. The ERASMUS-MUNDUS consortium offers 2-year scholarships of up to 48.000 EUR for non-EU students and up to 23.000 EUR for EU students of our European Master's Program in Computational Logic.

More information on the scholarship program is available from: http://www.emcl-study.eu/fileadmin/emcl_booklet_tree/ma_em_grant.html

Do not hesitate to contact us if you have any further questions.

Kind regards
-- Steffen Hölldobler

Prof. Dr.
Steffen Hoelldobler
International Center for Computational Logic
Technische Universität Dresden
01062 Dresden, Germany
phone: [+49](351)46 33 83 40
fax: [+49](351)46 33 83 42
email: sh at iccl.tu-dresden.de

From hamon at LIMSI.FR Tue Dec 17 21:27:18 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 17 Dec 2013 22:27:18 +0100
Subject: Soft: JRC's Europe Media Monitor (EMM) - Free APPS for mobile version
Message-ID:
Date: Tue, 17 Dec 2013 15:49:14 +0100
From: Ralf Steinberger
Message-id: <023b01cefb37$27ea4280$77bec780$@jrc.ec.europa.eu>
X-url: http://emm.newsbrief.eu/

Dear all,

This is to let you know that the highly multilingual Europe Media Monitor (EMM) applications are now available as free Apps for mobile devices running iOS and Android. You can download the EMM Apps via the links at the EMM-NewsBrief entry page http://emm.newsbrief.eu/ . Apps are currently available for mobile devices running Android and for the iPad. A version for the iPhone will be released soon.

We would be delighted to receive your feedback on the newly released EMM Apps to the email address emm AT jrc DOT it. We look forward to hearing from you.

EUROPE MEDIA MONITOR (EMM)

EMM gathers a daily average of 175,000 news items in over 70 languages and analyses the news automatically using a wide range of JRC-developed computational linguistics tools.
These include event extraction; automatic entity recognition, classification and disambiguation; name variant mapping; co-reference resolution; quotation recognition; opinion mining; multi-document summarisation; document clustering and classification; machine translation; information aggregation, including across languages; and more.

BACKGROUND AND READING

EMM (http://emm.newsbrief.eu/overview.html) is a freely available, advert-free family of news monitoring and analysis applications developed by the OPTIMA Team (http://ipsc.jrc.ec.europa.eu/?id=178) at the Joint Research Centre (JRC, http://ec.europa.eu/dgs/jrc/), which is the European Commission's in-house science service. Scientific publications on EMM and its individual text mining tools are listed at http://langtech.jrc.ec.europa.eu/JRC_Publications.html, including papers giving an introduction and a generic overview of EMM (http://langtech.jrc.ec.europa.eu/Documents/09_SIGIR-WS_Steinberger+frontmatter.pdf).

FREELY AVAILABLE LINGUISTIC RESOURCES

The JRC has also released a large volume of freely available multilingual linguistic resources (http://ipsc.jrc.ec.europa.eu/index.php?id=61) that can be used to develop or test a variety of multilingual and cross-lingual Natural Language Processing tools. These include parallel corpora in up to 26 languages; readily trained automatic document categorisation software (http://ipsc.jrc.ec.europa.eu/index.php?id=60) in 22 languages; dictionaries of names and their variant spellings (http://ipsc.jrc.ec.europa.eu/?id=42), including across languages and scripts; and more. See http://ipsc.jrc.ec.europa.eu/index.php?id=61 for details and to download the data and the tools.
Ralf Steinberger (http://langtech.jrc.ec.europa.eu/RS.html) and the OPTIMA team
European Commission - Joint Research Centre (JRC)
21027 Ispra (VA), Italy
URL - Applications: http://emm.newsbrief.eu/overview.html
URL - Resources: http://ipsc.jrc.ec.europa.eu/index.php?id=61

From hamon at LIMSI.FR Tue Dec 17 21:29:33 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 17 Dec 2013 22:29:33 +0100
Subject: Seminaire: Pierre-Yves OUDEYER, 19 Decembre 2013, GIPSA-lab
Message-ID:
Date: Tue, 17 Dec 2013 16:18:29 +0100
From: Thomas Hueber
Message-Id: <0167C9FD-3058-4DC0-886F-30E1363406CC at gipsa-lab.grenoble-inp.fr>
X-url: http://www.pyoudeyer.com
X-url: http://flowers.inria.fr
X-url: http://www.gipsa-lab.inpg.fr/~thomas.hueber

Hello everyone,

This week we will welcome Pierre-Yves Oudeyer, from INRIA Bordeaux. He will present his work this Thursday, 19 December, at 1:30 pm in the DPC-Ampère meeting room (B314).

Please come along, and feel free to circulate this announcement in your laboratories.

The seminar team of the Département Parole et Cognition, GIPSA-lab

_________________________
Curiosity-driven learning and development: How robots can help us understand humans

Pierre-Yves Oudeyer
Inria and Ensta ParisTech, France

A great mystery is how human infants develop: how they progressively discover their bodies, how they learn to interact with objects and social peers, and how they accumulate new skills throughout their lives. Constructing robots, and building mechanisms that model such developmental processes, is key to advancing our understanding of human development, in constant dialogue with the human and life sciences. I will present examples of robotic models of curiosity-driven learning and exploration, and show how developmental trajectories can self-organize, starting from discovery of the body, then object affordances, then vocal babbling and vocal interactions with others.
In particular, I will show that the onset of language spontaneously forms out of such sensorimotor development.

Dr. Pierre-Yves Oudeyer is Research Director at Inria and head of the Inria and Ensta-ParisTech FLOWERS team (France). Previously, he was a permanent researcher at Sony Computer Science Laboratory for 8 years (1999-2007). He studied theoretical computer science at Ecole Normale Supérieure in Lyon, and received his Ph.D. degree in artificial intelligence from the University Paris VI, France. After working on computational models of language evolution, he is now working on developmental and social robotics, focusing on sensorimotor development, language acquisition and life-long learning in robots. Strongly inspired by infant development, the mechanisms he studies include artificial curiosity, intrinsic motivation, the role of morphology in learning motor control, human-robot interfaces, joint attention and joint intentional understanding, and imitation learning. He has published a book and more than 80 papers in international journals and conferences, holds 8 patents, has given several invited keynote lectures at international conferences, and has received several prizes for his work in developmental robotics and on the origins of language. In particular, he is a laureate of the ERC Starting Grant EXPLORERS. He is editor of the IEEE CIS Newsletter on Autonomous Mental Development, and associate editor of IEEE Transactions on Autonomous Mental Development, Frontiers in Neurorobotics, and the International Journal of Social Robotics. He also works actively to bring science to the general public, by writing popular science articles and taking part in radio and TV programs as well as science exhibitions.

Web: http://www.pyoudeyer.com and http://flowers.inria.fr

Dr.
Thomas Hueber
CNRS Research Scientist (Chargé de recherche au CNRS)
GIPSA-lab, Département Parole et Cognition
961 rue de la Houille Blanche - Domaine universitaire - BP 46
38402 Saint Martin d'Hères CEDEX
FRANCE
Tel: +33 (0)4 76 57 49 40
Fax: +33 (0)4 76 57 47 10
Email: thomas.hueber at gipsa-lab.grenoble-inp.fr
Web: http://www.gipsa-lab.inpg.fr/~thomas.hueber

From hamon at LIMSI.FR Fri Dec 20 20:46:48 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:46:48 +0100
Subject: Appel: Ateliers TALN 2014
Message-ID:
Date: Thu, 19 Dec 2013 11:44:37 +0100
From: Nuria Gala
Message-ID: <52B2CE15.4070803 at lif.univ-mrs.fr>

TALN 2014 - CALL FOR WORKSHOPS

The 2014 edition of the TALN conference will take place in Marseille from 1 to 4 July 2014. As in previous editions, one day will be devoted to workshops.

OBJECTIVES

A workshop addresses a particular TALN topic in order to bring together a few more focused talks. It has its own chair and its own programme committee. The workshop organiser is responsible for the call for papers and for coordinating the programme committee. The TALN organisers will handle logistics (room management, coffee breaks and distribution of the papers).

Workshops will run in parallel (3 parallel sessions) over a full day or a half day (2 to 4 sessions of 1.5 hours each).
SCHEDULE

Workshop date: 1 July 2014
Deadline for submitting proposals: 31 January 2014
Response from the TALN programme committee: 7 February 2014
Final version of papers: 30 May 2014

HOW TO SUBMIT A PROPOSAL

Workshop proposals should be sent electronically to alexis.nasr[arobas]lif.univ-mrs.fr and nuria.gala[arobas]lif.univ-mrs.fr.

Workshop proposals should include a concise description (2-3 pages) of the topic of the workshop, together with its programme committee and the desired duration (a full day or a half day). The TALN programme committee will select among all the proposals.

FORMAT

Talks will be given in French (or in English for non-French speakers). Papers must follow the TALN format and be 12 to 14 pages long.

From hamon at LIMSI.FR Fri Dec 20 20:28:37 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:28:37 +0100
Subject: Appel: 23rd IEEE Wetice, Track Web2Touch, June 23-25, Parma, Italy
Message-ID:
Date: Wed, 18 Dec 2013 16:26:07 +0100
From: Lina Soualmia
Message-ID:
X-url: http://web2touch2014.tudor.lu/
X-url: http://cmt.dmi.unipr.it/wetice14

CALL FOR PAPERS

Web2Touch 2014: Ontology Applications and Web Semantics
http://web2touch2014.tudor.lu/

Conference Track @ 23rd IEEE WETICE
http://cmt.dmi.unipr.it/wetice14

June 23-25, 2014 - Parma - Italy
http://web2touch2014.tudor.lu

*Objectives*

The 2014 edition of the Web2Touch Track aims at presenting alternatives to address interoperability and distributed knowledge management problems for Web systems. The Web is a living medium for collaboration, where people and computer systems can produce, share and consume information. However, several challenges in facilitating this collaboration remain open, including: storage, retrieval, extraction, and visualisation of huge quantities of information, interlinking, ontology classification and fusion of data. New technologies and standards have been developed to provide more efficient use of the Web, in particular for using Cloud, Linked Open Data, and Big Data.

Key to the success of the Internet is the amount of published data available to everyone, but this also creates problems for end users. Researchers are continuously working to improve Web applications and Web content structures. The aim is to avoid overwhelming human beings with irrelevant data, and to increase the capability of systems to correctly interpret Web content, thereby improving the quality of interaction on the Web.

This Track is intended as a forum to discuss theories, models, techniques and user requirements related to how a number of semantic interoperability problems can be addressed to achieve commercial, scientific, and sociological success for Web applications. Web2Touch is an open forum for studies in multiple application domains including, for example, Web science, eHealth, eGov, eLearning and collaborative systems in general.

*Topics*

Include but are not limited to:
- Case studies of social semantic web.
- Ontology and knowledge reuse and integration.
- Semantic web services.
- Experiences in semantic web support for collaborative work and business.
- Knowledge creation, management, maintenance, and validation.
- Collaborative ontology evolution and alignment.
- Web for sentiment analysis.
- Privacy issues in knowledge sharing.
*Important Dates*

- Submission deadline: Feb 4, 2014
- Notification of acceptance: March 14, 2014
- Camera ready papers: April 11, 2014
- Conference: June 23-25, 2014

*Paper Submission*

Papers submitted to W2T 2014 must be written in English, must be formatted in strict accordance with the Wetice formatting instructions (author kits and paper templates available for *Word*, *PDF*, and *LaTeX*), and must be submitted in PDF format. Papers can be up to 6 pages. By submitting a paper, authors commit to having it presented at the workshop by at least one of them if the paper is accepted.

Papers have to be submitted through the EasyChair system: https://www.easychair.org/conferences/?conf=web2touch

Previous editions of Web2Touch received submissions from many countries; the accepted papers are available in IEEE Xplore. The best papers from 2008 to 2010 were published in special issues of the International Journal of Web Portals (2011, vol 1 and 3), and extended versions of selected papers from Web2Touch 2012 were published in Concurrency and Computation (2013), Web Portals (2013) and Collaborative Enterprise (2013).

Contact: Marcos Da Silveira web2touch at easychair.org

From hamon at LIMSI.FR Fri Dec 20 21:33:54 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:33:54 +0100
Subject: Job: Postdoctoral position in MT, Idiap, Switzerland
Message-ID:
Date: Fri, 20 Dec 2013 18:06:16 +0100
From: Andrei Popescu-Belis
Message-ID: <52B47908.5040007 at idiap.ch>
X-url: http://www.idiap.ch/education-and-jobs

Postdoctoral position in Machine Translation (OP-20131220-162727)
NLP Group, Idiap Research Institute, Martigny, Switzerland

Description:

Applications are invited for a postdoctoral researcher position in the field of machine translation (MT), in the context of the MODERN project (www.idiap.ch/project/modern) funded by the Swiss National Science Foundation.
The MODERN project is headed by Idiap, with partners in Zurich, Geneva and Utrecht, and its overall goal is to model discourse entities and relations for improving MT. The researcher will focus on enhancing the use of text-level features during decoding, in a statistical MT framework. Several features are available from our previous projects, for instance pertaining to discourse relations or verb tenses, but also based on robust semantic relatedness measures. In addition, work in MODERN will focus on discourse entities (coreference and anaphora resolution), which will offer additional text-level constraints to experiment with. We expect the postdoctoral researcher to advance the state of the art in text-level decoding with linguistically motivated features, demonstrating improvement over existing options (such as factored models) through quantitative evaluation.

Candidates should hold a PhD degree in statistical machine translation or natural language processing, and demonstrate a strong publication record at the time of application. Strong skills in programming and statistical modeling are also expected, along with a sensitivity to MT evaluation issues. While English is the main working language, knowledge of French, German or Dutch (the languages targeted in MODERN) would be an advantage.

The PhD degree must be obtained before the starting date of the position. The position is offered for one year, with renewal for a second year conditional on satisfactory results. Review of the applications will start on February 1st, 2014, and will continue until the position is filled. The appointment will start on August 1st, 2014.

How to apply:

Please fill in and submit your application through the Idiap online recruitment system, by clicking on the position's title at http://www.idiap.ch/education-and-jobs.

Contact information:

Further information about this position can be requested via the Idiap online recruitment system or by contacting Dr.
Andrei Popescu-Belis, head of the NLP group.

About Idiap:

Idiap is an independent, non-profit research institute recognized and supported by the Swiss Government, and affiliated with the Ecole Polytechnique Fédérale de Lausanne (EPFL). It is located in the town of Martigny in Valais, a scenic region in the south of Switzerland, surrounded by the highest mountains of Europe, and offering exciting recreational activities, including hiking, climbing and skiing, as well as varied cultural activities. It is close to Geneva and Lausanne. Although Idiap is located in the French-speaking part of Switzerland, English is the working language. Free French lessons are provided.

Idiap offers competitive salaries and conditions at all levels in a young, dynamic, and multicultural environment. Idiap is an equal opportunity employer and is actively involved in the "Advancement of Women in Science" European initiative. The Institute seeks to maintain a principle of open competition (on the basis of merit) to appoint the best candidate, provides equal opportunity for all candidates, and equally encourages both genders to apply.
Andrei Popescu-Belis

From hamon at LIMSI.FR Fri Dec 20 21:14:36 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:14:36 +0100
Subject: Job: Ingenieur en Traitement Automatique des Langues, INIST-CNRS
Message-ID:
Date: Fri, 20 Dec 2013 09:08:06 +0100
From: "GAUTIER, Patricia"
Message-Id:

Natural Language Processing Engineer (Master's level, Bac + 5)

Position: Natural Language Processing Engineer
Contract: fixed-term (CDD)
Employer: CNRS
Place of work: Vandœuvre-lès-Nancy
Send CV and cover letter to patricia.gautier at inist.fr

Job description

INIST-CNRS is a CNRS service unit that collects, processes and disseminates scientific and technical information to the research community. Within the Initiatives d'Excellence programme, CNRS leads the ISTEX project (www.istex.fr). INIST-CNRS, for its part, is in charge of a national platform for storing, hosting and disseminating the archives of scientific documents acquired within this project. The volume of data to be processed, the heterogeneity of the sources and the size of the target community (higher education and research, ESR) are unprecedented. The platform also aims to offer researchers data processing services (data extraction, text mining...).
The person recruited will work exclusively on this project for the duration of the contract. Their mission will be to study and test existing tools for robust text mining and text annotation. They will analyse the available tools, whether commercial or from research, and make recommendations for integrating them into the platform under development. Finally, they will carry out these integrations in collaboration with the development team.

Skills

Required:
- In-depth knowledge of natural language processing methodologies, techniques and tools, and of formats for describing documents, annotations and terminological resources (XML and, for example, TEI, SKOS, TBX, ...)
- Integration of solutions in a Linux environment
- Good analytical and summarising skills

Desirable:
- Ability to develop independently in a language such as Java, Python or PHP
- Experience with large data volumes and semantic web technologies

Profile

You hold a Master's-level degree (Bac+5) in computer science and language engineering and have experience in the field. Whether or not you have held this type of position before, you are available and operational quickly, and you are autonomous, organised and rigorous. You are service-minded, learn and adapt quickly, and have good interpersonal skills.
Location: VANDOEUVRE-LES-NANCY
Contract: 12-month CDD
Salary: according to degree and experience
Start date: 1 February 2014

From hamon at LIMSI.FR Fri Dec 20 21:32:27 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:32:27 +0100
Subject: Info: AFIA, Prix de These en Intelligence Artificielle 2014
Message-ID:

Date: Fri, 20 Dec 2013 17:02:07 +0100 (CET)
From: Amedeo Napoli
Message-ID: <1763010084.15272569.1387555327165.JavaMail.root at loria.fr>
X-url: http://www.afia.asso.fr/tiki-index.php?page=Prix+de+Th%C3%A8se+en+Intelligence+Artificielle

Hello everyone,

with my apologies for multiple postings,

happy end of year to all,

Amedeo Napoli

====================================================================
AFIA: 2014 Thesis Prize in Artificial Intelligence

In 2014 AFIA is again awarding its thesis prize in Artificial Intelligence, to publicise and recognise the best research work by young researchers in Artificial Intelligence. For this 2014 edition, the jury is made up both of members chosen from among the ECCAI Fellows who are AFIA members and of external members chosen for the occasion. The jury is again chaired this year by Amedeo Napoli (LORIA Nancy). The first prize carries an award of 1000 euros.
Depending on the quality of the submissions received, one or two further prizes or runner-up awards (accessits) may be given. Young researchers who obtained a doctorate in France during the 2013 calendar year are eligible to apply. In addition, AFIA would appreciate candidates being up to date with their 2014 AFIA membership fee, which would show that they are aware of the role AFIA can play in promoting Artificial Intelligence in all its forms in France and abroad.

Details on how to prepare the application are available at:
http://www.afia.asso.fr/tiki-index.php?page=Prix+de+Th%C3%A8se+en+Intelligence+Artificielle

The submission deadline is

**** Monday 27 January 2014 ****

Results will be announced from the end of April onwards. The winner will be presented with the prize during the RFIA Congress, held from 30 June to 4 July in Rouen, and will have the opportunity to present their work during a special session of the Rencontres des Jeunes Chercheurs en Intelligence Artificielle (RJCIA 2014).
====================================================================

From hamon at LIMSI.FR Fri Dec 20 21:31:02 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:31:02 +0100
Subject: Ressources: ELRA - Language Resources Catalogue - Update
Message-ID:

Date: Fri, 20 Dec 2013 16:58:03 +0100
From: ELRA ELDA Information
Message-ID: <52B4690B.5000801 at elda.org>
X-url: http://catalog.elra.info/product_info.php?products_id=1214
X-url: http://catalog.elra.info/product_info.php?products_id=1215

Our apologies if you have received multiple copies of this announcement.

We are happy to announce that 1 new Speech Language Resource and 1 new Written Corpus are now available in our catalogue.

*ELRA-S0365 aGender*
aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, and read text + answered some open questions. The corpus contains the voices of 945 German speakers (approx. minimum of 100 speakers per class), each delivering 18 speech items in up to six different sessions.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1214

*ELRA-W0074 Amharic-English bilingual corpus*
The Amharic-English bilingual corpus contains parallel text from legal and news domains in Amharic script, in transliterated form and in English. The corpus comprises 232,653 words in Amharic and 291,701 in English.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1215

For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli at elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html

From hamon at LIMSI.FR Fri Dec 20 20:44:28 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:44:28 +0100
Subject: Seminaire: BLRI, Pascal Belin, Aix-en-Provence, 24 janvier 2014
Message-ID:

Date: Wed, 18 Dec 2013 17:43:15 +0100
From: Nadéra Bureau
Message-ID: <016601cefc10$3fdf5400$bf9dfc00$@bureau at lpl-aix.fr>

Friday 24 January 2014, 11:00
Salle de conférences B011, bât. B
5 avenue Pasteur, Aix-en-Provence
(Labex BLRI)

Pascal Belin (Institut des Neurosciences de La Timone, UMR 7289, CNRS & Aix-Marseille Université, Marseille, France)

A Vocal Brain: Cerebral Processing of Voice Information

Abstract: The human voice carries speech but also a wealth of socially-relevant, speaker-related information. Listeners routinely perceive precious information on the speaker's identity (gender, age), affective state (happy, scared), as well as more subtle cues on perceived personality traits (attractiveness, dominance, etc.), strongly influencing social interactions. Using voice psychoacoustics and neuroimaging techniques, we examine the cerebral processing of person-related information in perceptual and neural voice representations. Results indicate a cerebral architecture of voice cognition sharing many similarities with the cerebral organization of face processing, with the main types of information in voices (identity, affect, speech) processed in interacting, but partly dissociable functional pathways.
From hamon at LIMSI.FR Fri Dec 20 20:22:52 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:22:52 +0100
Subject: Appel: TSD 2014, Preliminary Announcement
Message-ID:

Date: Wed, 18 Dec 2013 14:39:52 +0100
From: TSD 2014
Message-Id:
X-url: http://www.tsdconference.org/

*********************************************************
TSD 2014 - PRELIMINARY ANNOUNCEMENT
*********************************************************

Seventeenth International Conference on TEXT, SPEECH and DIALOGUE (TSD 2014)
Brno, Czech Republic, 8-12 September 2014
http://www.tsdconference.org/

The conference is organized by the Faculty of Informatics, Masaryk University, Brno, and the Faculty of Applied Sciences, University of West Bohemia, Pilsen. The conference is supported by the International Speech Communication Association.

Venue: Brno, Czech Republic

TSD SERIES

The TSD series has evolved into a prime forum for interaction between researchers in both spoken and written language processing from all over the world. Proceedings of TSD form a book published by Springer-Verlag in their Lecture Notes in Artificial Intelligence (LNAI) series.
TOPICS Topics of the conference will include (but are not limited to): Corpora and Language Resources (monolingual, multilingual, text and spoken corpora, large web corpora, disambiguation, specialized lexicons, dictionaries) Speech Recognition (multilingual, continuous, emotional speech, handicapped speaker, out-of-vocabulary words, alternative way of feature extraction, new models for acoustic and language modelling) Tagging, Classification and Parsing of Text and Speech (morphological and syntactic analysis, synthesis and disambiguation, multilingual processing, sentiment analysis, credibility analysis, automatic text labeling, summarization, authorship attribution) Speech and Spoken Language Generation (multilingual, high fidelity speech synthesis, computer singing) Semantic Processing of Text and Speech (information extraction, information retrieval, data mining, semantic web, knowledge representation, inference, ontologies, sense disambiguation, plagiarism detection) Integrating Applications of Text and Speech Processing (machine translation, natural language understanding, question-answering strategies, assistive technologies) Automatic Dialogue Systems (self-learning, multilingual, question-answering systems, dialogue strategies, prosody in dialogues) Multimodal Techniques and Modelling (video processing, facial animation, visual speech synthesis, user modelling, emotions and personality modelling) Papers on processing of languages other than English are strongly encouraged. 
PROGRAM COMMITTEE Hynek Hermansky, USA (general chair) Eneko Agirre, Spain Genevieve Baudoin, France Paul Cook, Australia Jan Cernocky, Czech Republic Simon Dobrisek, Slovenia Karina Evgrafova, Russia Darja Fiser, Slovenia Radovan Garabik, Slovakia Alexander Gelbukh, Mexico Louise Guthrie, GB Jan Hajic, Czech Republic Eva Hajicova, Czech Republic Yannis Haralambous, France Ludwig Hitzenberger, Germany Jaroslava Hlavacova, Czech Republic Ales Horak, Czech Republic Eduard Hovy, USA Maria Khokhlova, Russia Daniil Kocharov, Russia Ivan Kopecek, Czech Republic Valia Kordoni, Germany Steven Krauwer, The Netherlands Siegfried Kunzmann, Germany Natalija Loukachevitch, Russia Vaclav Matousek, Czech Republic Diana McCarthy, United Kingdom France Mihelic, Slovenia Hermann Ney, Germany Elmar Noeth, Germany Karel Oliva, Czech Republic Karel Pala, Czech Republic Nikola Pavesic, Slovenia Fabio Pianesi, Italy Maciej Piasecki, Poland Adam Przepiorkowski, Poland Josef Psutka, Czech Republic James Pustejovsky, USA German Rigau, Spain Leon Rothkrantz, The Netherlands Anna Rumshisky, USA Milan Rusko, Slovakia Mykola Sazhok, Ukraine Pavel Skrelin, Russia Pavel Smrz, Czech Republic Petr Sojka, Czech Republic Stefan Steidl, Germany Georg Stemmer, Germany Marko Tadic, Croatia Tamas Varadi, Hungary Zygmunt Vetulani, Poland Pascal Wiggers, The Netherlands Yorick Wilks, GB Marcin Wolinski, Poland Victor Zakharov, Russia KEYNOTE SPEAKERS Ralph Grishman, New York University, USA Bernardo Magnini, FBK - Fondazione Bruno Kessler, Italy FORMAT OF THE CONFERENCE The conference program will include presentation of invited papers, oral presentations, and poster/demonstration sessions. Papers will be presented in plenary or topic oriented sessions. Social events including a trip in the vicinity of Brno will allow for additional informal interactions. 
CONFERENCE PROGRAM

The conference program will include oral presentations and poster/demonstration sessions with sufficient time for discussions of the issues raised.

IMPORTANT DATES

March 15 2014 ............ Submission of abstract
March 22 2014 ............ Submission of full papers
May 15 2014 .............. Notification of acceptance
May 31 2014 .............. Final papers (camera ready) and registration
August 3 2014 ............ Submission of demonstration abstracts
August 10 2014 ........... Notification of acceptance for demonstrations sent to the authors
September 8-12 2014 ...... Conference date

The contributions to the conference will be published in proceedings that will be made available to participants at the time of the conference.

OFFICIAL LANGUAGE

The official language of the conference is English.

ADDRESS

All correspondence regarding the conference should be addressed to

Ales Horak, TSD 2014
Faculty of Informatics, Masaryk University
Botanicka 68a, 602 00 Brno, Czech Republic
phone: +420-5-49 49 18 63
fax: +420-5-49 49 18 20
email: tsd2014 at tsdconference.org

The official TSD 2014 homepage is: http://www.tsdconference.org/

LOCATION

Brno is the second largest city in the Czech Republic with a population of almost 400,000 and is the country's judiciary and trade-fair center. Brno is the capital of South Moravia, which is located in the south-east part of the Czech Republic and is known for a wide range of cultural, natural, and technical sights. South Moravia is a traditional wine region. Brno has been a Royal City since 1347 and, with its six universities, forms a cultural center of the region. Brno can be reached easily by direct flights from London, Moscow, Saint Petersburg, Eindhoven, Rome and Prague and by trains or buses from Prague (200 km) or Vienna (130 km).
From hamon at LIMSI.FR Fri Dec 20 21:49:06 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Fri, 20 Dec 2013 22:49:06 +0100 Subject: Stage: Internship in NLP and MT, Idiap, Switzerland Message-ID: Date: Fri, 20 Dec 2013 18:06:35 +0100 From: Andrei Popescu-Belis Message-ID: <52B4791B.40306 at idiap.ch> X-url: http://www.idiap.ch/education-and-jobs Internship in Natural Language Processing and Machine Translation (OP-20131220-163254) NLP Group, Idiap Research Institute, Martigny, Switzerland Description: Applications are invited for a 6-month internship at the Master level in the field of natural language processing and statistical machine translation. The internship is offered in relation to the Swiss MODERN project (www.idiap.ch/project/modern) aiming at modeling discourse entities and relations for improving MT. The intern will work within the NLP group at the Idiap Research Institute. The goal of the internship is to integrate existing tools for anaphora resolution (in-house and third party ones) with phrase-based SMT models (mainly using the Moses open-source system). Following previous work, the intern will examine whether statistical predictions about the features of target pronouns (gender, number, grammatical function, and others) can be used when training SMT and when translating (decoding) new texts. The intern will implement methods to derive such features, will use the features in an SMT model (e.g. a factored one), and will measure the variation in translation quality. The applicants should have a background in computer science or linguistics, with previous experience in SMT and NLP being a strong advantage. The applicants should have demonstrable programming skills in at least one programming language such as Perl or Python, Java or C/C++. Experience with the Moses system and/or anaphora resolution systems would be a plus. 
A good command of English is mandatory and knowledge of another European language such as French, German or Dutch would be an advantage.

The screening of the applications will start on February 1st, 2014 and will continue until the position is filled. The intended starting date is in spring 2014. The appointment is for 6 months, with a gross internship salary of 2000 CHF per month. Participation in the MODERN project will entail interaction with project partners at the Universities of Zurich, Geneva, and Utrecht.

How to apply: Please fill in and submit your application through the Idiap online recruitment system, by clicking on the position's title at http://www.idiap.ch/education-and-jobs.

Contact information: Further information about this position can be requested via the Idiap online recruitment system or by contacting Dr. Andrei Popescu-Belis, head of the NLP group.

About Idiap: Idiap is an independent, non-profit research institute recognized and supported by the Swiss Government, and affiliated with the Ecole Polytechnique Fédérale de Lausanne (EPFL). It is located in the town of Martigny in Valais, a scenic region in the south of Switzerland, surrounded by the highest mountains of Europe, and offering exciting recreational activities, including hiking, climbing and skiing, as well as varied cultural activities. It is within close proximity to Geneva and Lausanne. Although Idiap is located in the French part of Switzerland, English is the working language. Free French lessons are provided. Idiap offers competitive salaries and conditions at all levels in a young, dynamic, and multicultural environment. Idiap is an equal opportunity employer and is actively involved in the "Advancement of Women in Science" European initiative. The Institute seeks to maintain a principle of open competition (on the basis of merit) to appoint the best candidate, provides equal opportunity for all candidates, and equally encourages both genders to apply.
Andrei Popescu-Belis

From hamon at LIMSI.FR Fri Dec 20 20:23:39 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:23:39 +0100
Subject: Job: Poste d'Ingenieur d'Etude au GREYC (date limite: 16 Janvier 2014)
Message-ID:

Date: Wed, 18 Dec 2013 15:38:29 +0100
From: Gaël Dias
Message-ID: <52B1B365.1040909 at unicaen.fr>
X-url: http://web-rh.dsi.cnrs.fr/afip/owa/consult.affiche_fonc?code_fonc=V61008&code_corps=&code_dr=19&code_bap=&code_ds=61&nbjours=

--- Apologies for multiple postings ---

==================================================
========== IE POSITION at GREYC ==========
========== Deadline: 16 January 2014 ==========
==================================================

GREYC (UMR 6072) in Caen is seeking an engineer (ingénieur d'étude, IE). In collaboration with the laboratory's research teams, the engineer will work on designing software platforms for image processing, digital documents and languages, problem solving in complex systems, and optimisation and security.

The position is to be filled first through internal CNRS mobility (NOEMI). If no suitable candidate is found, applications from civil servants outside CNRS will be considered. The campaign is open until 16 January 2014.
The job profile is available at the following URL:
http://web-rh.dsi.cnrs.fr/afip/owa/consult.affiche_fonc?code_fonc=V61008&code_corps=&code_dr=19&code_bap=&code_ds=61&nbjours=

For any further information, please contact Gaël Dias (gael.dias at unicaen.fr), Professor at the Université de Caen Basse-Normandie and head of the HULTECH (Human Language Technology) team at GREYC.

From hamon at LIMSI.FR Fri Dec 20 20:20:58 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:20:58 +0100
Subject: Job: Post-doc, Labex OBVIL
Message-ID:

Date: Wed, 18 Dec 2013 09:02:24 +0100
From: Jean-Gabriel Ganascia
Message-ID: <52B15690.6080701 at Ganascia.name>

*Paris - Sorbonne Universités*

*JOB TITLE*

*Engineer attached to the Labex OBVIL (specialty: artificial intelligence)*

/Category: postdoctoral contract/
/Start date: 1 March 2014/
/Duration: renewable fixed-term contract (CDD)/

*Host structure:* *Laboratoire d'excellence OBVIL*
*Location: Maison de la Recherche, 28 rue serpente, 75006 Paris*
*Reporting lines within the structure:* *Director of the LABEX OBVIL, Didier Alexandre; Deputy Director of the LABEX OBVIL, Jean-Gabriel Ganascia*
*Line manager (N+1) of the agent:* *Jean-Gabriel Ganascia*
*Working time for the position: 100%*

*Missions of the unit*

This position is intended for a young researcher in artificial intelligence with an interest in the *digital humanities*.

The Laboratoire d'excellence OBVIL is part of the PRES Sorbonne-Universités and brings together researchers from 7 host teams (équipes d'accueil) and 2 CNRS UMRs. It comprises faculty members and researchers from the universities Paris-Sorbonne and Pierre-et-Marie-Curie, some specialising in literature, others in cognitive science and computer science. It intends to develop all the resources offered by computer applications in order to examine French literature of the past as well as the most contemporary. It is interested in questions of literary value, literary life and authorship, and in the study of translations, transpositions and adaptations, in order to understand how canons are formed. It develops programs and tools within the research programmes of the labex OBVIL.

*Duties of the agent*

The agent's duties are the agent's contribution to the missions of the unit.

*Software development for the digital humanities:*

- Digital tools for text comparison, searching for homologies in texts, named entity detection, semantic indexing, corpus mapping, ontology construction and visualisation, etc. have been built in recent years within the ACASA team of LIP6 - Université Pierre et Marie Curie and the labex OBVIL. The aim now is to make them available to a broad community comprising not only the literature researchers of OBVIL but also the interested public at large. The agent will help validate them on sizeable corpora and build interfaces to make them accessible to everyone on the web.
Activities of the agent

/The activities are the main operations to be carried out to fulfil the duties./
/They are the concrete expression of the duties./

- A first strand will concern the design of interfaces providing access to the tools for text alignment (MEDITE), detection of homologies in sequences (PHOEBUS) and identification of ring structures. The aim is not only to visualise the results but also to make these tools freely accessible on the web.
- A second strand will concern the development of ontologies for indexing literary texts and the introduction of Semantic Web techniques in their processing.
- Finally, a third strand will concern corpus mapping using the memory islands (îles de mémoire) developed within the ACASA team.

Skills

Skills are expressed as knowledge, know-how and interpersonal skills.

Hold a PhD in computer science.

Have a command of:
- the fundamental concepts of artificial intelligence,
- the basic techniques of natural language processing,
- the design of visual interfaces,
- the essential notions of the Semantic Web (ontologies, knowledge representation, etc.).

Be able to integrate into a computer science research team and show an interest in the digital humanities.

*Contact person(s):*

/ALEXANDRE Didier, director of the LABEX OBVIL/
/didier.alexandre at paris-sorbonne.fr/

/GANASCIA Jean-Gabriel, head of the ACASA team of LIP6 - Université Pierre et Marie Curie/
/Jean-Gabriel.Ganascia at lip6.fr/

/The application will consist of a CV, a presentation of the doctoral work and of previous research activities and contracts, and a cover letter./

*Application deadline: 1 February 2014, to be sent to Labex OBVIL, Université Paris-Sorbonne, Maison de la Recherche, 26 rue Serpente, 75006 Paris, or to clarisse.barthelemy at gmail.com*

From hamon at LIMSI.FR Fri Dec 20 21:37:36 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:37:36 +0100
Subject: Sujet de These: PhD Position in NLP for MT, Idiap, Switzerland
Message-ID:

Date: Fri, 20 Dec 2013 18:06:23 +0100
From: Andrei Popescu-Belis
Message-ID: <52B4790F.6030808 at idiap.ch>
X-url: http://www.idiap.ch/education-and-jobs

PhD Position in Natural Language Processing for Machine Translation (OP-20131220-162332)
NLP Group, Idiap Research Institute, Martigny, Switzerland

Description: Applications are invited for a PhD position in natural language processing applied to machine translation, in the context of the MODERN project headed by Idiap, with partners in Zurich, Geneva and Utrecht (www.idiap.ch/project/modern) funded by the Swiss National Science Foundation. The overall goal of the project is to model discourse entities and relations for improving MT. The goal of the doctoral thesis, within MODERN, will be to design a probabilistic model of text-level coreference and anaphora resolution, in relation to discourse structure.
The model will take advantage of rich semi-structured resources and semantic relatedness measures, along with insights on the interplay between referential and discursive relations. The model will be implemented and tested both intrinsically, on coreference tasks for English and a second language, and extrinsically, in combination with machine translation. Work on this latter aspect will involve collaboration with a postdoc in MT.

Candidates should have a strong background in computer science, and good knowledge of natural language processing, in particular of statistical methods based on machine learning. They should have strong programming skills (e.g. in Python, Perl, Java or C/C++) as well as good knowledge of English as a working language. Knowledge of another language such as French, German, or Dutch is an advantage. A strong academic record and a clear aptitude for creative research will be priority selection criteria.

The successful candidate will be registered as a PhD student in the EPFL EDEE doctoral program, to which he/she will have to apply with support from Idiap. The position is funded by the Swiss National Science Foundation, with a maximum duration of four years. Review of the applications will start immediately and will continue until the position is filled. The starting date is March 2014, but later dates can be considered as well.

How to apply: Please fill in and submit your application through the Idiap online recruitment system, by clicking on the position's title at http://www.idiap.ch/education-and-jobs.

Contact information: Further information about this position can be requested via the Idiap online recruitment system or by contacting Dr. Andrei Popescu-Belis, head of the NLP group.

About Idiap: Idiap is an independent, non-profit research institute recognized and supported by the Swiss Government, and affiliated with the Ecole Polytechnique Fédérale de Lausanne (EPFL).
It is located in the town of Martigny in Valais, a scenic region in the south of Switzerland, surrounded by the highest mountains of Europe, and offering exciting recreational activities, including hiking, climbing and skiing, as well as varied cultural activities. It is within close proximity to Geneva and Lausanne. Although Idiap is located in the French part of Switzerland, English is the working language. Free French lessons are provided. Idiap offers competitive salaries and conditions at all levels in a young, dynamic, and multicultural environment. Idiap is an equal opportunity employer and is actively involved in the "Advancement of Women in Science" European initiative. The Institute seeks to maintain a principle of open competition (on the basis of merit) to appoint the best candidate, provides equal opportunity for all candidates, and equally encourages both genders to apply.

Andrei Popescu-Belis

From hamon at LIMSI.FR Fri Dec 20 21:28:31 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:28:31 +0100
Subject: Cursus: Master Erasmus Mundus en TAL
Message-ID:

Date: Fri, 20 Dec 2013 15:25:46 +0100
From: Sylvain Pogodalla
Message-ID: <21172.21354.208537.589923 at fanon.loria.fr>
X-url: http://lct-master.org
X-url: http://ec.europa.eu/education/external-relation-programmes/mundus_en.htm
X-url: http://eacea.ec.europa.eu/erasmus_mundus/
*Master's scholarships* for *European and non-European* students.

Please circulate to students who wish to pursue a master's degree.

The Erasmus Mundus programme in "Language and Communication Technologies" (EMMLCT) http://lct-master.org invites *European* and *non-European* students to apply to take part in the programme.

*The application deadline is 12 January 2014.*

*Key information*
+ teaching in English
+ duration: 2 years
+ each year of study at a different partner institution
+ specialisation in the methods and technologies of computational linguistics
+ scholarships from the European Union's Erasmus Mundus programme, *including for European students*

*The EMMLCT programme is offered by the following consortium of universities*
1. Saarland University in Saarbruecken, Germany (coordinator)
2. University of Trento, Trento, Italy
3. University of Malta, Malta
4. University of Lorraine, Nancy, France
5. Charles University, Prague, Czech Republic
6. Rijksuniversiteit Groningen, The Netherlands
7. Shanghai Jiao Tong University, China
8. The University of Melbourne, Australia

The University of the Basque Country / Euskal Herriko University, San Sebastian, Spain also takes part as an associated partner.

*The EMMLCT programme in brief*

The EMMLCT programme is an international Master's. Students spend each of the two years at a different university of the consortium. Once the requirements have been met, students obtain two Master's degrees, each nationally recognised in the respective country of study. The courses comprise compulsory core modules as well as advanced modules in language and communication technologies and in computer science, an optional project, and a Master's thesis.
The programme aims to give students skills in, and perspectives on, the various disciplines that contribute to the methods used in language and communication technologies, and to strengthen their knowledge of scientific methods. In addition, students acquire practical knowledge by choosing a combination of modules in language technology, computational and theoretical linguistics, and computer science. The EMMLCT programme was designed to meet the demands of industry and research in the field of language technologies. It offers learning and training opportunities

The EMMLCT programme is approved and supported by the European Union's Erasmus Mundus programme. For more information:
http://ec.europa.eu/education/external-relation-programmes/mundus_en.htm
http://eacea.ec.europa.eu/erasmus_mundus/

*Applications*
All applicants must meet the following requirements:
1. Hold a Bachelor's degree (or an equivalent degree) in computer science, linguistics, computational linguistics, cognitive science, mathematics, artificial intelligence or any other relevant discipline. The degree must have been obtained before the EMMLCT Master's courses begin.
2. Language skills: students whose native language is not English must provide a certificate of proficiency in English, the language of instruction (TOEFL, IELTS, Cambridge certificate, etc.)

Applications are submitted online. Further information and application instructions can be found at: http://lct-master.org
Address for further information: lct-info at coli.uni-sb.de

Programme coordination:
Prof. Hans Uszkoreit
Dr. Ing.
Ivana Kruijff-Korbayova Department of Computational Linguistics and Phonetics Saarland University Saarbruecken, Germany http://lct-master.org
From hamon at LIMSI.FR Tue Dec 24 17:44:14 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 24 Dec 2013 18:44:14 +0100 Subject: Appel: COLING 2014, Dublin, Ireland Message-ID: Date: Sun, 22 Dec 2013 08:28:18 +0000 From: 박성배 Message-ID: <01648087DD5D4349A12B758B6F475B842356E92B at SejongSRV.sejong.knu.ac.kr> X-url: http://www.coling-2014.org We apologize if you received multiple copies ------------------------------------------------------------------------ 1st (Preliminary) Call for Papers - Coling 2014 The 25th International Conference on Computational Linguistics August 23 - 29, 2014 Dublin, Ireland http://www.coling-2014.org The International Committee on Computational Linguistics (ICCL) is pleased to announce the 25th International Conference on Computational Linguistics (Coling 2014), at Dublin City University (DCU, Dublin, Ireland, European Union). DCU is a young, dynamic and ambitious university with a mission to transform lives and societies through education, research and innovation. 
Most of the local organizers are from CNGL, Ireland’s Centre for Global Intelligent Content (formerly the Centre for Next Generation Localization), which embodies the leading position of Ireland in the global localization/internationalization business, a strong focus on language technologies including machine translation, computational linguistics and natural language processing, as well as on intelligent management, search, retrieval, transformation and adaptation of content. Coling will cover a broad spectrum of technical areas related to natural language and computation. The conference will include full papers (presented as oral presentations or posters), demonstrations, tutorials, and workshops. TOPICS OF INTEREST Coling 2014 solicits papers and demonstrations on original and unpublished research on the following topics, including, but not limited to: - pragmatics, semantics, syntax, grammars and the lexicon; - cognitive, mathematical and computational models of language processing; - models of communication by language; - lexical semantics and ontologies; - word segmentation, tagging and chunking; - parsing, both syntactic and deep; - generation and summarization; - paraphrasing, textual entailment and question answering; - speech recognition, text-to-speech and spoken language understanding; - multimodal and natural language interfaces and dialogue systems; - information retrieval, information extraction and knowledge base linking; - machine learning for natural language; - modeling of discourse and dialogue; - sentiment analysis, opinion mining and social media; - multilingual processing, machine translation and translation aids; - applications, tools and language resources; - system evaluation methodology and metrics. In all relevant areas, we encourage authors to include analysis of the influence of theories (intuitions, methodologies, insights, ...) to technologies (computational algorithms, methods, tools, data, ...) 
and/or contributions of technologies to theory development. In technologically oriented papers, we encourage in-depth analysis and discussion of errors made in the experiments described, if possible linking them to the presence or absence of linguistically-motivated features. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome. PAPER REQUIREMENTS Papers should describe original work; they should emphasize completed work or well-advanced ongoing research rather than intended work, and should indicate clearly the state of completion of the reported results. Wherever appropriate, concrete evaluation results should be included. Submissions will be judged on correctness, originality, technical strength, significance and relevance to the conference, and interest to the attendees. Submissions presented at the conference should mostly contain new material that has not been presented at any other meeting with publicly available proceedings. Papers that are being submitted in parallel to other conferences or workshops must indicate this on the title page, as must papers that contain significant overlap with previously published work. REVIEWING Reviewing will be double blind. It will be managed by an international Conference Program Committee consisting of Program Chairs, members of the Scientific Advisory Board and Area Chairs, who will be assisted by invited reviewers. INSTRUCTIONS FOR AUTHORS For Coling 2014, there will be one category of research papers only. All of the papers will be included in conference proceedings, this time in electronic form only. The maximum submission length is 8 pages (A4), plus two extra pages for references. Authors of accepted papers will be given additional space in the camera-ready version to reflect space needed for changes stemming from reviewers' comments. Authors can indicate their preference for presentation mode (i.e. 
oral or poster presentation) in the submission form, and the reviewers will recommend an appropriate mode of presentation to the program committee which will then decide. There will be no distinction in the proceedings between research papers presented orally vs. as posters. Papers shall be submitted in English, anonymized with regard to the authors and/or their institution (no author-identifying information on the title page nor anywhere in the paper), including referencing style as usual. Papers must conform to official Coling 2014 style guidelines, which will be available on the Coling 2014 website. Submission and reviewing will be managed online by the START system. The only accepted format for submitted papers is in Adobe’s PDF. Submissions must be uploaded on the START system by the submission deadlines; submissions after that time will not be reviewed. To minimize network congestion, we request authors to upload their submissions as early as possible. Important Notice [1] In order to allow participants to be acquainted with the published papers ahead of time which in turn should facilitate discussions at Coling 2014, we have set the official publication date two weeks before the conference, i.e., on August 11, 2014. On that day, the papers will be available online for all participants to download, print and read. If your employer is taking steps to protect intellectual property related to your paper, please inform them about this timing. [2] While submissions are anonymous, we strongly encourage authors to plan for depositing language resources and other data as well as tools used and/or developed for the experiments described in the papers, if the paper is accepted. 
In this respect, we encourage authors then to deposit resources and tools to available open-access repositories of language resources and/or repositories of tools (such as META-SHARE, Clarin, ELRA, LDC or AFNLP/COCOSDA for data, and github, sourceforge, CPAN and similar for software and tools) and refer to them instead of submitting them with the paper, even though it will also be an open possibility (through the START system). The details will be given in the submission site for camera-ready versions of accepted papers. [3] There will be a separate call for demonstrations in February. Accepted papers on demonstrations will also be included in the proceedings. IMPORTANT DATES January, 2014: Opening of the submission website March 21, 2014: Paper submission deadline May 9-12, 2014: Author response period May 23, 2014: Author notification June 6, 2014: Camera-ready PDF due August 11, 2014: Official paper publication date August 25-29, 2014: Main conference PROGRAM COMMITTEE Program Committee Co-chairs Junichi Tsujii (Microsoft Research, China) Jan Hajic (Charles University in Prague, Czech Republic) Scientific Advisory Board members Ralph Grishman (New York University, USA) Yuji Matsumoto (Nara Institute of Science and Technology, Japan) Joakim Nivre (Uppsala University, Sweden) Michael Picheny (IBM T. J. Watson Research Center, USA) Donia Scott (University of Sussex, United Kingdom) Chengqing Zong (Chinese Academy of Sciences, China) Area Chairs 1. Linguistic Issues in CL and NLP Emily M. Bender (University of Washington, USA) Eva Hajicova (Charles University in Prague, Czech Republic) Igor Boguslavsky (Universidad Politecnica de Madrid, Spain) 2. Machine Learning for CL and NLP Jason Eisner (Johns Hopkins University, USA) Yoshimasa Tsuruoka (University of Tokyo, Japan) 3. Cognitive Issues in CL and NLP Philippe Blache (CNRS & Aix-Marseille Université, France) Ted Gibson (MIT, USA) 4. 
Morphology, Word Segmentation, Tagging and Chunking Reut Tsarfaty (Weizmann Institute of Science, Israel) Yue Zhang (Singapore University of Technology and Design, Singapore) 5. Syntax, Grammar Induction, Syntactic and Semantic Parsing Laura Kallmeyer (Heinrich-Heine-Universität, Germany) Ryan McDonald (Google, USA) 6. Lexical Semantics and Ontologies Chu-Ren Huang (Hong Kong Polytechnic University, Hong Kong) Alessandro Oltramari (Carnegie Mellon University, USA) 7. Semantic Processing, Distributional Semantics and Compositional Semantics Stephen Clark (University of Cambridge, UK) Alessandro Lenci (University of Pisa, Italy) 8. Modeling of Discourse and Dialogue Nicolas Asher (CNRS & Université Paul Sabatier, France) Marilyn Walker (University of California Santa Cruz, USA) 9. Natural Language Generation and Summarization Albert Gatt (University of Malta, Malta) Advaith Siddharthan (University of Aberdeen, UK) 10. Paraphrasing and Textual Entailment Ido Dagan (Bar Ilan University, Israel) Kentaro Inui (Tohoku University, Japan) 11. Sentiment Analysis, Opinion Mining and Social Media Rada Mihalcea (University of Michigan, USA) Bing Liu (University of Illinois at Chicago, USA) 12. Information Retrieval and Question Answering Gareth Jones (Dublin City University, Ireland) Siddharth Patwardhan (IBM Research, USA) 13. Information Extraction and Database Linking James Curran (University of Sydney, Australia) Seung-won Hwang (Postec, Korea) 14. Applications Srinivas Bangalore (AT&T Labs-Research, USA) Heyan Huang (Beijing Institute of Technology, China) Guillaume Jacquet (Joint Research Centre, Italy) 15. Multimodal and Natural Language Interfaces and Dialog Systems Kristiina Jokinen (University of Helsinki, Finland) David Traum (University of Southern California, USA) 16. Speech Recognition, Text-To-Speech, Spoken Language Understanding Nick Campbell (Trinity College Dublin, Ireland) Alex Potamianos (National Technical University Crete, Greece) 17. 
Machine Translation Phillip Koehn (University of Edinburgh, UK / Johns Hopkins University, USA) Chris Quirk (Microsoft Research, USA) Tiejun Zhao (Harbin Institute of Technology, China) 18. Resources Pushpak Bhattacharyya (IIT Bombay, India) Nicoletta Calzolari (ILC-CNR, Pisa, Italy) Martha Palmer (University of Colorado, USA) 19. Languages with less resources Steven Bird (University of Melbourne, Australia) Mark Liberman (University of Pennsylvania, USA) Rajeev Sangal (IIT Banaras Hindu University, India) Koenraad De Smedt (University of Bergen, Norway) 20. Software and Tools Jesús Cardeñosa (Universidad Politecnica de Madrid, Spain) Jing-Shin Chang (National Chi Nan University,Taiwan)
From hamon at LIMSI.FR Tue Dec 24 17:34:18 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 24 Dec 2013 18:34:18 +0100 Subject: Appel: LREC 2014, Workshops and Tutorials Message-ID: Date: Tue, 24 Dec 2013 15:35:58 +0100 From: ELRA ELDA Information Message-ID: <52B99BCE.9030106 at elda.org> X-url: http://lrec2014.lrec-conf.org/en/conference-programme/workshops-and-tutorials [Apologies for multiple postings] Schedule of all the LREC 2014 Workshops and Tutorials is now online at http://lrec2014.lrec-conf.org/en/conference-programme/workshops-and-tutorials. On this web page you will find a link to each Workshop Call for Papers and each Tutorial Outline, when available. 
Don't hesitate to contact Workshop and/or Tutorial organisers if you have specific questions on their event. For general LREC 2014 matters, please contact us at http://lrec2014.lrec-conf.org/en/contact/ www.lrec-conf.org/lrec2014 Follow us on Twitter: @LREC2014 (https://twitter.com/LREC2014) From hamon at LIMSI.FR Tue Dec 24 17:45:38 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 24 Dec 2013 18:45:38 +0100 Subject: Appel: LREC 2014 Workshop on Building and Using Comparable Corpora (7th BUCC) Message-ID: Date: Sun, 22 Dec 2013 14:42:26 +0100 From: "Reinhard Rapp" Message-ID: <3654EC959E8D49ABA97A0E21F4DA346A at ASUSPC> X-url: http://comparable.limsi.fr/bucc2014/ We apologize for multiple postings Please distribute to interested colleagues ============================================================ 1st Call for Papers 7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA Building Resources for Machine Translation Research http://comparable.limsi.fr/bucc2014/ May 27, 2014 Co-located with LREC 2014 Harpa Conference Centre, Reykjavik (Iceland) DEADLINE FOR PAPERS: February 10, 2014 https://www.softconf.com/lrec2014/BUCC2014/ ============================================================ MOTIVATION In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons. In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. 
We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora. The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages. Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. However, beyond a few language pairs such as English- French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web. To exemplify such issues in a practical setting, this year's special focus will be on Building Resources for Machine Translation Research This special topic aims to address the need for: (1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and (2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora. 
TOPICS We solicit contributions including but not limited to the following topics: Topics related to the special theme: * Methods and tools for collecting and processing MT data, including crowdsourcing * Methods and tools for quality control * Tools for efficient annotation * Bilingual term and named entity collections * Multilingual treebanks, wordnets, propbanks, etc. * Comparable corpora with parallel units annotated * Comparable corpora for under-resourced languages and specific domains * Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc. * Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc. * Legal issues with collecting and redistributing data and generating derivatives Building comparable corpora: * Human translations * Automatic and semi-automatic methods * Methods to mine parallel and non-parallel corpora from the Web * Tools and criteria to evaluate the comparability of corpora * Parallel vs non-parallel corpora, monolingual corpora * Rare and minority languages, across language families * Multi-media/multi-modal comparable corpora Applications of comparable corpora: * Human translations * Language learning * Cross-language information retrieval & document categorization * Bilingual projections * Machine translation * Writing assistance Mining from comparable corpora: * Extraction of parallel segments or paraphrases from comparable corpora * Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc. 
IMPORTANT DATES February 10, 2014 Deadline for submission of full papers March 10, 2014 Notification of acceptance March 27, 2014 Camera-ready papers due May 27, 2014 Workshop date SUBMISSION INFORMATION Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF-file via the START workshop manager at https://www.softconf.com/lrec2014/BUCC2014/ Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages. Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings. Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc. 
For further information, please contact Pierre Zweigenbaum pz (at) limsi (dot) fr ORGANISERS Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France) Ahmet Aker, University of Sheffield (UK) Serge Sharoff, University of Leeds (UK) Stephan Vogel, QCRI (Qatar) Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France) SCIENTIFIC COMMITTEE * Ahmet Aker, University of Sheffield (UK) * Srinivas Bangalore (AT&T Labs, US) * Caroline Barrière (CRIM, Montréal, Canada) * Chris Biemann (TU Darmstadt, Germany) * Hervé Déjean (Xerox Research Centre Europe, Grenoble, France) * Kurt Eberle (Lingenio, Heidelberg, Germany) * Andreas Eisele (European Commission, Luxembourg) * Éric Gaussier (Université Joseph Fourier, Grenoble, France) * Gregory Grefenstette (INRIA, Saclay, France) * Silvia Hansen-Schirra (University of Mainz, Germany) * Hitoshi Isahara (Toyohashi University of Technology) * Kyo Kageura (University of Tokyo, Japan) * Adam Kilgarriff (Lexical Computing Ltd, UK) * Natalie Kübler (Université Paris Diderot, France) * Philippe Langlais (Université de Montréal, Canada) * Michael Mohler (Language Computer Corp., US) * Emmanuel Morin (Université de Nantes, France) * Dragos Stefan Munteanu (Language Weaver, Inc., US) * Lene Offersgaard (University of Copenhagen, Denmark) * Ted Pedersen (University of Minnesota, Duluth, US) * Reinhard Rapp (Université Aix-Marseille, France) * Sujith Ravi (Google, US) * Serge Sharoff (University of Leeds, UK) * Michel Simard (National Research Council Canada) * Richard Sproat (OGI School of Science & Technology, US) * Tim Van de Cruys (IRIT-CNRS, Toulouse, France) * Stephan Vogel, QCRI (Qatar) * Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France) * Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France) ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : 
http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN ------------------------------------------------------------------------- From hamon at LIMSI.FR Tue Dec 24 17:39:18 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 24 Dec 2013 18:39:18 +0100 Subject: Appel: Entity Extraction and Linking Challenge (#Microposts2014 @ WWW2014) Message-ID: Date: Sat, 21 Dec 2013 15:22:13 +0000 From: ampaeli cano Message-ID: X-url: http://www.scc.lancs.ac.uk/microposts2014/challenge/index.html ======================================================================= Entity Extraction and Linking Challenge at the 4th Making Sense of Microposts Workshop (#Microposts2014) @ WWW 2014 http://www.scc.lancs.ac.uk/microposts2014/challenge/index.html 7 April 2014, Seoul, Republic of Korea ======================================================================= Microposts are a highly popular medium for sharing facts, opinions or emotions. They are an invaluable wealth of data, ready to be mined for training predictive models. This year the #Microposts 2014 Workshop will host an "Entity Extraction and Linking Challenge". The overall task of the challenge is to automatically extract entities from English microposts, and link them to the corresponding English DBpedia v3.9 resources (if the linkage exists). In the linking stage, we aim to disambiguate expressions that are formed by discrete (and typically short) sequences of words. Existing entity linking tools are intended for use over news corpora and similar document-based corpora with relatively long texts. We organise this challenge to foster research into novel, more accurate solutions for automatic entity linking in (much shorter) micropost data. 
We will ask the participants to automatically extract entities (e.g., Obama, London, Rakuten) belonging to all entity types (e.g., Person, Location, Organisation) from a collection of microposts. Participants will have to automatically provide context-relevant DBpedia resources for each entity in a micropost. DATASET ------- The dataset comprises 3.5K tweets extracted from a much larger collection of over 18 million tweets. This collection, provided by the Redites project (http://demeter.inf.ed.ac.uk/redites/), covers event-annotated tweets collected from 15th July 2011 to 15th August 2011 (31 days). It spans multiple noteworthy events, including the death of Amy Winehouse, the London Riots and the Oslo bombing. Since the task of this challenge is to automatically extract and link entities, we have built our dataset considering both event and non-event tweets. While event tweets are more likely to contain entities, non-event tweets enable us to evaluate the performance of the system in avoiding false positives in the entity extraction phase. The dataset has been split into training (70%) and test (30%) sets. Following the Twitter TOS, we will only provide tweet IDs and annotations for the training set, and tweet IDs for the test set. We will also provide a common framework to mine these datasets from Twitter. The training set will be released as a TSV file where each line consists of: - tweet_id - entity_mention_1 - entity_uri_1 ... - entity_mention_n - entity_uri_n Fields are separated by TABs. Entity mentions and URIs are listed in their order of appearance in the tweet. We will announce the release of the data sets in a timely manner on the workshop mailing list. Please subscribe to https://groups.google.com/d/forum/microposts2014. More information about dates is available on the Challenge website. 
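The line format described for the training set could be read with a few lines of Python. This is only a minimal sketch under stated assumptions: the name `parse_annotation_line`, the `AnnotatedTweet` type and the sample tweet ID are hypothetical, and the call does not say how a mention without a DBpedia link is encoded, so the sketch simply rejects lines with an unpaired field.

```python
from typing import List, NamedTuple, Tuple


class AnnotatedTweet(NamedTuple):
    """One line of the training file (hypothetical container type)."""
    tweet_id: str
    # (entity mention, entity URI) pairs, in order of appearance in the tweet
    entities: List[Tuple[str, str]]


def parse_annotation_line(line: str) -> AnnotatedTweet:
    """Split a TAB-separated line into a tweet ID and mention/URI pairs."""
    fields = line.rstrip("\r\n").split("\t")
    tweet_id, rest = fields[0], fields[1:]
    # Assumption: every mention is followed by exactly one URI field.
    if len(rest) % 2 != 0:
        raise ValueError(f"unpaired mention/URI field for tweet {tweet_id}")
    entities = list(zip(rest[0::2], rest[1::2]))
    return AnnotatedTweet(tweet_id, entities)


if __name__ == "__main__":
    # Made-up example line; the tweet ID is not taken from the real data set.
    sample = (
        "123\tObama\thttp://dbpedia.org/resource/Barack_Obama"
        "\tLondon\thttp://dbpedia.org/resource/London\n"
    )
    print(parse_annotation_line(sample))
```

A tweet without any entity would, under the same assumption, be a line holding only the tweet ID, which the sketch returns with an empty entity list.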
EVALUATION ---------- The evaluation consists of two separate stages: 1.- Paper peer review: a community of domain experts will judge the quality and applicability of the approaches taken, to provide useful insights on your research; 2.- Precision and Recall: F1 (F-measure with beta = 1) will be computed on a gold standard manually created from the test set. Both the automatically extracted entities and the links will be matched against this ground truth. Submissions will be ranked solely according to the F1 of each participant's best submission. SUBMISSIONS ----------- Submissions should be provided as a zip file using your system name as the file name (e.g. 'awesome.zip'), containing: 1. a TSV file with your system name (e.g. 'awesome.tsv'). We accept up to 3 different submissions, and we will consider *only* the best. If you submit more than one, you must specify clearly in your paper the modifications applied to each labelled submission. In this case the submission should contain up to 3 TSV files, each named with the tool/system name with "_n" appended (e.g. awesome_1.tsv, awesome_2.tsv, awesome_3.tsv). In order to evaluate your submissions we require you to submit a TSV file following the format in which the training set is provided. 2. a paper of 6 pages describing your approach and how you tuned/tested it using the training split. All submissions must be in English. Submissions must be in PDF, formatted in the style of the Springer Publications format for Lecture Notes in Computer Science (LNCS) [http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0]. For details on the LNCS style, see Springer’s Author Instructions. Submissions are not anonymous. Please send us your submission before the deadline through Easychair [https://www.easychair.org/conferences/?conf=microposts2014]. 
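For participants who want to score their own output on the training split, the F1 measure named in the evaluation section can be sketched as follows. The matching criterion is an assumption made here for illustration (exact match on a (tweet_id, mention, URI) triple, pooled over all tweets); the organisers' actual scorer may match entities and links differently. The function name `precision_recall_f1` and the sample data are hypothetical.

```python
from typing import Set, Tuple

# (tweet_id, entity mention, entity URI); exact-match criterion is an assumption
Annotation = Tuple[str, str, str]


def precision_recall_f1(
    predicted: Set[Annotation], gold: Set[Annotation]
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) with beta = 1 over exact-match triples."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Made-up annotations for illustration only.
    gold = {
        ("1", "Obama", "http://dbpedia.org/resource/Barack_Obama"),
        ("1", "London", "http://dbpedia.org/resource/London"),
        ("2", "Rakuten", "http://dbpedia.org/resource/Rakuten"),
    }
    predicted = {
        ("1", "London", "http://dbpedia.org/resource/London"),
        ("2", "Rakuten", "http://dbpedia.org/resource/Tokyo"),  # wrong link
    }
    print(precision_recall_f1(predicted, gold))
```

Treating a correct mention with a wrong link as an error on both precision and recall is the strictest reading of "entities and links will be both matched"; a looser scorer could evaluate extraction and linking separately.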
All accepted submissions will be invited for short presentations during the #Microposts2014 workshop and will be published independently from the workshop proceedings on the challenge page and on CEUR [http://ceur-ws.org/] (note that a minimum number of papers should be submitted in order to be able to publish them on CEUR). IMPORTANT DATES --------------- Intent to participate: 13 Jan 2014 (soft) Release of training set: 14 Jan 2014 Release of test set: 17 Feb 2014 Challenge Submission deadline: 21 Feb 2014 (hard) Challenge Notification: 14 Mar 2014 (hard) Challenge camera-ready deadline: 24 Mar 2014 (hard) Workshop program issued: 15 Mar 2014 Challenge proceedings to be published via CEUR Workshop - 07 Apr 2014 (Registration open to all) (All deadlines 23:59 Hawaii Time) PRIZE ----- to be announced CONTACT ------- E-mail: microposts2014 at easychair.org Facebook Group: http://www.facebook.com/#!/home.php?sk=group_180472611974910 Facebook Public Event page: http://www.facebook.com/events/116134955169543 Google group : https://groups.google.com/forum/#!forum/microposts2014 Twitter hashtag: #microposts2014challenge Twitter account: @Microposts2014 W3C Microposts Community Group: http://www.w3.org/community/microposts Challenge Organizers: ------------------------------ Challenge Chair: A. 
Elizabeth Cano, Aston University, UK Giuseppe Rizzo, Università di Torino, Italy Dataset Chair: Andrea Varga, The University of Sheffield, UK Challenge Committee: --------------------- Ebrahim Bagheri, Ryerson University, Canada Pierpaolo Basile, Dipartimento di Informatica - University of Bari, Italy Uldis Bojars, SIOC Project Óscar Corcho, Universidad Politécnica de Madrid, Spain Leon Derczynski, The University of Sheffield, UK Guillaume Erétéo, Orange Labs Miriam Fernandez, Knowledge Media Institute, The Open University, UK Andrés García-Silva, Ontology Engineering Group, Facultad de Informática, Universidad Politécnica de Madrid, Spain Anna Lisa Gentile, The University of Sheffield, UK Robert Jäschke, L3S Research Center, Germany Diana Maynard, The University of Sheffield, UK José M. Morales-Del-Castillo, El Colegio de México, Mexico Georgios Paltoglou, University of Wolverhampton, UK Bernardo Pereira Nunes, PUC-Rio, Brazil Daniel Preoţiuc-Pietro, The University of Sheffield, UK Raphaël Troncy, EURECOM, France Mischa Tuffield, PeerIndex Victoria Uren, Aston University, UK
From hamon at LIMSI.FR Tue Dec 24 17:46:48 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 24 Dec 2013 18:46:48 +0100 Subject: Stage: Extraction d'information dans la presse financiere, MarketScience Message-ID: Date: Mon, 23 Dec 2013 17:05:42 +0100 From: Aude Message-ID: 
<52B85F56.1060608 at marketscience.fr>

Title: "Information extraction from the financial press"

MarketScience is a finance start-up developing a data-mining centre integrated into an application. The application will be sold on the Bloomberg platform to finance professionals. MarketScience combines financial time-series modelling with media mining. We extract the financial information specific to our needs using NLP algorithms, which:

1. determine the nature of an article and the probability that it contains information relevant to our analysis
2. isolate the required information and relate it to movements in certain markets
3. categorize this information within pre-identified economic factors

This work is done first in English (80% of the daily volume), but also in French, German, Italian, Russian and Chinese.

You will work mainly with the following tools: Python, Unitex and MySQL/SQL. Knowledge of NLP techniques (TAL, Traitement Automatique des Langues) or of machine learning and classification techniques (SVM, CRF, etc.) is considered highly desirable.

The internship will take place within a multidisciplinary team of statisticians, economists, programmers and NLP engineers.

Duration: 4 to 6 months, starting February 2014. A job offer is possible at the end of the internship.
Location: Orléans (45)
Pay: 1200 EUR net minimum.

Contact me on LinkedIn: Nicolas Boitout.
From hamon at LIMSI.FR Fri Dec 27 20:22:39 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 27 Dec 2013 21:22:39 +0100
Subject: Appel: LREC 2014 Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools
Message-ID:

Date: Thu, 26 Dec 2013 20:05:38 +0000
From: OSACT
Message-ID: <76B661A7930AFC49A0DC6418253B1FA2C6AC8E at ex-itu01-002>
X-url: http://www.kacstac.org.sa/osact/index.html

We apologize for multiple postings. Please distribute to interested colleagues.
----------------------------------------------

1st Call for Papers

WORKSHOP ON Free/Open-Source Arabic Corpora and Corpora Processing Tools
http://www.kacstac.org.sa/osact/index.html

May 27, 2014
Co-located with LREC 2014
Harpa Conference Centre, Reykjavik (Iceland)

DEADLINE FOR PAPERS: February 10, 2014
https://www.softconf.com/lrec2014/OSACT/
============================================================

Workshop description

In the Natural Language Processing (NLP) and Computational Linguistics (CL) communities, Arabic has long been regarded as a resource-poor language, and this was thought to explain the lack of corpus-based studies of Arabic. In recent years, however, a considerable number of new free Arabic corpora have emerged, along with, to a lesser extent, Arabic corpora processing tools.

Freely available Arabic corpora can be divided into two groups.
The first group contains large Arabic corpora designed and built primarily for Arabic linguistics research, and possibly for Arabic NLP. These corpora cover diverse genres, and their sizes range from one million to 700 million words. The second group contains corpora designed primarily for Arabic text classification and clustering; they consist mainly of newspaper articles and range from less than 1 million to 11 million words.

Some Arabic corpora, mostly the large ones, can be explored on the web using different tools, while others are available only for download. For downloadable corpora, the user may need standalone corpus processing tools, which offer functions such as word frequency, concordance and collocation.

Yet despite the availability of large and diverse Arabic corpora, the situation has not changed: there is still a lack of Arabic corpus-based studies. Is this because of the representativeness of these corpora? Because of the functions and tools available with them? Or because they are not well enough known in the Arabic linguistics community?

Motivation and topics of interest

This half-day workshop aims to encourage researchers and developers to use freely available Arabic corpora and open-source Arabic corpora processing tools, to help highlight the drawbacks of these resources, and to discuss techniques and approaches for improving them.

The workshop topics include, but are not limited to:
- Surveying and critiquing the design of freely available Arabic corpora, their associated tools and standalone Arabic corpora processing tools.
- The applications and uses of freely available Arabic language resources in fields such as Arabic language education (e.g. L1 and L2).
- Arabic language modeling.
- Corpus-based Arabic lexicography.
- Lexical semantics and word sense.
- Corpus-based Arabic syntax.
- Corpus-based Arabic morphology.
- Development of Arabic mobile applications based on the available Arabic language resources.
- Evaluation and assessment of Arabic Corpora and Corpora Processing Tools.
- Future directions of Free/Open Arabic Corpora and Corpora Processing Tools.

Organising Committee
- Hend Al-Khalifa, King Saud University, KSA
- Abdulmohsen Al-Thubaity, King Abdul Aziz City for Science and Technology, KSA

Program Committee
- Eric Atwell, University of Leeds, UK
- Khaled Shaalan, The British University in Dubai (BUiD), UAE
- Dilworth Parkinson, Brigham Young University, USA
- Nizar Habash, Columbia University, USA
- Khurshid Ahmad, Trinity College Dublin, Ireland
- Abdulmalik AlSalman, King Saud University, KSA
- Maha Alrabiah, King Saud University, KSA
- Saleh Alosaimi, Imam University, KSA
- Sultan Almujaiwel, King Saud University, KSA
- Adam Kilgarriff, Lexical Computing Ltd, UK
- Amal AlSaif, Imam University, KSA
- Maha AlYahya, King Saud University, KSA
- Auhood AlFaries, King Saud University, KSA
- Salwa Hamada, Taibah University, KSA
- Mansour Algamdi, King Abdul Aziz City for Science and Technology, KSA
- Abdullah Alfaifi, University of Leeds, UK

Important Dates
- Submission deadline: 10 February 2014
- Notification of acceptance: 10 March 2014
- Final submission of manuscripts: 21 March 2014
- Workshop date: 27 May 2014 (morning session)

Submissions

The language of the workshop is English, and submissions should follow the LREC 2014 paper submission instructions. All papers will be peer reviewed, possibly by three independent referees. Papers must be submitted electronically in PDF format to the START system.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research.
Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc.

From hamon at LIMSI.FR Wed Dec 4 12:56:04 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 13:56:04 +0100
Subject: These: Laurence Longo, Vers des moteurs de recherche intelligents
Message-ID:

Date: Sat, 30 Nov 2013 19:51:20 +0100
From: Laurence Longo
Message-ID:
X-url: http://www.unistra.fr/plans/centre/72/index.html

Hello,

I am pleased to invite you to the defence of my thesis, entitled: "Vers des moteurs de recherche 'intelligents' : un outil de détection automatique de thèmes.
Méthode basée sur l'identification automatique des chaînes de référence." (Towards 'intelligent' search engines: a tool for automatic topic detection. A method based on the automatic identification of reference chains.)

It will take place on Thursday 12 December 2013 from 2:30 pm in the thesis room (salle des thèses) of the Nouveau Patio, Campus Universitaire Esplanade, 20a rue René Descartes, 67084 Strasbourg (map: http://www.unistra.fr/plans/centre/72/index.html).

The defence will be followed by the traditional post-defence reception (from 6:30 pm, room 0-01), to which you are warmly invited.

Kind regards,
Laurence Longo

--------------------

Jury members:
---------
Mme Catherine Schnedecker - Professor, LiLPa, Université de Strasbourg - Supervisor
Mme Amalia Todirascu - Associate Professor (Maître de conférences), LiLPa, Université de Strasbourg - Co-supervisor
M. Yves Bestgen - Professor, Université catholique de Louvain - Reviewer
M. Denis Maurel - Professor, Université François Rabelais - Reviewer
Mme Agnès Tutin - Professor, Université Stendhal Grenoble - Examiner
M. Frédéric Landragin - CNRS Research Scientist (Chargé de recherche), Université Paris 3 - Examiner

Résumé :
--------
Cette thèse se situe dans le domaine du Traitement Automatique des Langues et vise à optimiser la classification des documents dans les moteurs de recherche. Les travaux se concentrent sur le développement d'un outil de détection automatique des thèmes des documents (ATDS-fr). Utilisant peu de connaissances, la méthode hybride adoptée allie des techniques statistiques de segmentation thématique à des méthodes linguistiques identifiant des marqueurs de cohésion. Parmi eux, les chaînes de référence - séquence d'expressions référentielles se rapportant à la même entité du discours (e.g. Paul ... il ... cet homme) - ont fait l'objet d'une attention particulière, car elles constituent un indice textuel important dans la détection des thèmes (i.e. ce sont des marqueurs d'introduction, de maintien et de changement thématique). Ainsi, à partir d'une étude des chaînes de référence menée dans un corpus issu de genres textuels variés (articles de journaux, rapports publics, lois européennes, éditoriaux, roman), nous avons développé un module d'identification automatique des chaînes de référence RefGen qui a été évalué suivant les métriques actuelles de la coréférence.

Mots-clés :
-----------
Détection automatique de thèmes, chaînes de référence, traitement automatique des langues, sémantique lexicale, coréférence, genres textuels, segmentation thématique, marqueurs linguistiques, cohésion, linguistique de corpus

Abstract :
----------
This thesis in the field of Natural Language Processing aims to optimize document classification in search engines. The work focuses on the development of a tool that automatically detects document topics (ATDS-fr). Using little knowledge, the hybrid method combines statistical techniques for topic segmentation with linguistic methods that identify cohesive markers. Among these markers, reference chains - sequences of referential expressions referring to the same entity (e.g. Paul ... he ... this man) - have been given special attention, as they are important topic markers (i.e. they are markers of topic introduction, maintenance and change). Thus, from a study of reference chains extracted from a corpus composed of various textual genres (newspapers, public reports, European laws, editorials and a novel), we developed RefGen, an automatic reference chain identification module, which was evaluated according to current coreference metrics.
Keywords:
---------
Topic detection, reference chains, natural language processing, lexical semantics, coreference, textual genre, topic segmentation, linguistic markers, cohesion, corpus linguistics

From hamon at LIMSI.FR Wed Dec 4 13:02:08 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:02:08 +0100
Subject: Appel: ESWC 2014, Call for Challenge, Linked Open Data-enabled Recommender Systems
Message-ID:

Date: Tue, 3 Dec 2013 09:24:36 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20131203082456.E1E05F983E at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/RecSys

** apologies for cross-posting **

=== Call for Challenge: Linked Open Data-enabled Recommender Systems ===

Challenge Website: http://challenges.2014.eswc-conferences.org/RecSys
Call Web page: http://2014.eswc-conferences.org/important-dates/call-RecSys

MOTIVATION AND OBJECTIVES

People generally need more and more advanced tools that go beyond those implementing the canonical search paradigm for seeking relevant information. A new search paradigm is emerging, where the user perspective is completely reversed: from finding to being found. Recommender systems may help to support this new perspective, because they have the effect of pushing relevant objects, selected from a large space of possible options, to potentially interested users.
To achieve this result, recommendation techniques generally rely on data referring to three kinds of objects: users, items and their relations.

Recent developments in the Semantic Web community offer novel strategies to represent data about users, items and their relations that might improve the current state of the art of recommender systems, in order to move towards a new generation of recommender systems which fully understand the items they deal with.

More and more semantic data are published following the Linked Data principles, which make it possible to set up links between objects in different data sources, connecting information in a single global data space: the Web of Data. Today, the Web of Data includes different types of knowledge represented in a homogeneous form: sedimentary knowledge (encyclopedic, cultural, linguistic, common-sense) and real-time knowledge (news, data streams, ...). These data might be useful to interlink diverse information about users, items, and their relations and to implement reasoning mechanisms that can support and improve the recommendation process.

The primary goal of this challenge is twofold. On the one hand, we want to create a link between the Semantic Web and the Recommender Systems communities. On the other hand, we aim to show how Linked Open Data and semantic technologies can boost the creation of a new breed of knowledge-enabled and content-based recommender systems.

TARGET AUDIENCE

The target audience is all of those communities, both academic and industrial, which are interested in personalized information access with a particular emphasis on Linked Open Data. During the last ACM RecSys conference more than 60% of participants were from industry. This clearly attests to the current interest in recommender systems for industrial applications that are ready to be released on the market.
TASKS

Task 1: Rating prediction in cold-start situations

This task deals with the rating prediction problem, in which a system is requested to estimate the value of unknown numeric scores (a.k.a. ratings) that a target user would assign to available items, indicating whether she likes or dislikes them. In order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims at predicting ratings in cold-start situations, that is, predicting ratings for users who have only a few past ratings, and predicting ratings of items that have been rated by only a few users. The dataset to use in the task - DBbook - relates to the book domain. It contains explicit numeric ratings assigned by users to books. For each book we provide the corresponding DBpedia URI. Participants will have to exploit the provided ratings as training sets, and will have to estimate unknown ratings in a non-provided evaluation set. Recommendation approaches will be evaluated on the evaluation set by means of a metric that measures the differences between real and estimated ratings, namely the Root Mean Square Error (RMSE).

Task 2: Top-N recommendation from binary user feedback

This task deals with the top-N recommendation problem, in which a system is requested to find and recommend a limited set of N items that best match a user profile, instead of correctly predicting the ratings for all available items. Similarly to Task 1, in order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims at generating ranked lists of items for which no graded ratings are available, but only binary ones. Also in this case, the DBbook dataset will be provided. In this task, the accuracy of recommendation approaches will be evaluated on an evaluation set using the F-measure.
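The two metrics named for Task 1 and Task 2 can be sketched as follows. This is only an illustration under stated assumptions: the call does not specify the F-measure variant or the cut-off, so the balanced F1 over the top N items is assumed here, and the function names and toy data are hypothetical (the official evaluation relies on the organizers' own Java classes and Web Service).

```python
import math


def rmse(actual, predicted):
    """Root Mean Square Error over the (user, item) pairs found in `actual`."""
    if not actual:
        raise ValueError("no ratings to evaluate")
    squared = [(predicted[key] - rating) ** 2 for key, rating in actual.items()]
    return math.sqrt(sum(squared) / len(squared))


def f_measure_at_n(recommended, relevant, n):
    """Balanced F-measure (F1) of the top-n recommended items (assumed variant)."""
    top_n = recommended[:n]
    hits = len(set(top_n) & set(relevant))
    if hits == 0:
        return 0.0
    precision = hits / len(top_n)
    recall = hits / len(relevant)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data, not taken from DBbook.
true_ratings = {("u1", "b1"): 4.0, ("u1", "b2"): 2.0, ("u2", "b1"): 5.0}
estimated = {("u1", "b1"): 3.0, ("u1", "b2"): 2.0, ("u2", "b1"): 4.0}
print(rmse(true_ratings, estimated))

ranked = ["b3", "b1", "b7", "b2", "b9"]  # system output, best first
liked = {"b1", "b2", "b4"}               # items with positive binary feedback
print(f_measure_at_n(ranked, liked, 5))
```

In the toy case two of the three estimates are off by one rating point, and two of the five recommended books are among the three liked ones, so both numbers are easy to check by hand.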
Task 3: Diversity

A very interesting aspect of content-based recommender systems, and hence of LOD-enabled ones, is that they make it possible to evaluate the diversity of recommended items in a straightforward way. This is a very popular topic in content-based recommender systems, which usually suffer from over-specialization. In this task, the evaluation will consider a combination of both the accuracy of the recommendation list and the diversity of the items belonging to it. The DBbook dataset will also be used for this task. Given the domain of books, diversity with respect to the two properties dbpedia-owl:author and skos:subject will be considered.

EVALUATION DATASET AND EVALUATION METRICS

DBbook dataset

This dataset relies on user data and preferences retrieved from the Web. The books available in the dataset have been mapped to their corresponding DBpedia URIs. The mapping contains 8170 DBpedia URIs. These mappings can be used to extract semantic features from DBpedia or other LOD repositories to be exploited by the recommendation approaches proposed in the challenge.

The dataset is split into a training set and an evaluation set. In the former, user ratings are provided to train a system, while in the latter, ratings have been removed and will be used in the final evaluation step. Participants will generate the ratings (Task 1) or the ranking (Task 2 and Task 3) for the test set, and their results will be compared and evaluated with respect to actual user ratings (hidden evaluation data).

The two sets will be provided at the challenge's website, in the following plain text files:
* lodrecsys2014-DBbook-training-ratings.txt: Contains tuples that may be used to build the recommendation approaches.
* lodrecsys2014-DBbook-test-ratings.txt: Contains tuples that will be used to evaluate the recommendation approaches.

Together with the two previous files, a further one will be provided containing the mapping to DBpedia.
* lodrecsys2014-DBbook-mappings.txt: Contains tuples associated with the books in DBbook that will be used to extract and exploit semantic features from DBpedia and other LOD repositories. Recommendation approaches will be evaluated with the metrics requested in each task. We provide a number of Java classes that compute the different metrics and a Web Service to test intermediate results. A description of the metrics to compute and further details regarding the evaluation will be available at the challenge website. JUDGING AND PRIZES After a first round of reviews, the Program Committee and the chairs will select a number of submissions that will have to satisfy the challenge's requirements, and will have to be presented at the conference. Submissions accepted for presentation will receive constructive reviews from the Program Committee, and will be included in post-proceedings. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC'14, and will be invited to submit a paper to a dedicated Semantic Web Journal special issue. For each task we will select: * the best performing tool, given to the paper which will get the highest score in the evaluation * the most original approach, selected by the Challenge Program Committee with the reviewing process HOW TO PARTICIPATE The following information has to be provided: * Abstract: no more than 200 words. * Description: It should contain the details of the system, including why the system is innovative, how it uses Semantic Web, which features or functions the system provides, what design choices were made, and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. 
Papers must be submitted in PDF format, following the style of the Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length. * Result evaluation: For the three tasks previously described, a Web-accessible service will be provided in order to evaluate the produced results. All the details about the format and the service URL will be provided on the website. All submissions should be provided via EasyChair https://www.easychair.org/conferences/?conf=eswc2014-challenges MAILING LIST We invite the potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge. https://lists.sti2.org/mailman/listinfo/eswc2014-recsys-challenge IMPORTANT DATES * March 7, 2014, 23:59 (Hawaii time): Abstract Submission * March 14, 2014, 23:59 (Hawaii time): Submission * April 9, 2014, 23:59 (Hawaii time): Notification of acceptance * May 27-29, 2014: Challenge days CHALLENGE CHAIRS * Tommaso Di Noia (Polytechnic University of Bari, IT) * Ivan Cantador (Universidad Autonoma de Madrid, ES) EVALUATION COORDINATOR * Vito Claudio Ostuni (Polytechnic University of Bari, IT) PROGRAM COMMITTEE (to be completed) * Oscar Corcho (Universidad Politecnica de Madrid, ES) * Marco de Gemmis (University of Bari Aldo Moro, IT) * Frank Hopfgartner (Technische Universitat Berlin, DE) * Andreas Hotho (Universitat Wurzburg, DE) * Dietmar Jannach (TU Dortmund University, DE) * Pasquale Lops (University of Bari Aldo Moro, IT) * Valentina Maccatrozzo (Delft University of Technology, NL) * Francesco Ricci (Free University of Bozen-Bolzano, IT) * Giovanni Semeraro (University of Bari Aldo Moro, IT) * David Vallet (NICTA, AU) * Manolis Wallace (University of Peloponnese, GR) * Markus Zanker (Alpen-Adria-Universitaet Klagenfurt, AT) ESWC CHALLENGE COORDINATOR * Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR) From hamon at LIMSI.FR Wed Dec 4 12:57:53 
2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 13:57:53 +0100
Subject: Stage: Prediction automatique de relations d'implication entre verbes, IRIT
Message-ID:

Date: Mon, 02 Dec 2013 13:48:03 +0100
From: Benamara Zitoune Farah
Message-ID: <529C8183.5030300 at irit.fr>

Automatic prediction of entailment relations between verbs

Supervision: Marta Abrusan (marta.abrusan at irit.fr), Stergos Afantenos (stergos.afantenos at irit.fr), Farah Benamara (farah.benamara at irit.fr)
Location: IRIT, Université Paul Sabatier
Funding: internship stipend
Duration: 5 months

The semantic understanding of a text is one of the major challenges of natural language processing (NLP). This task is essential for a great many applications, such as the automatic generation of texts and conversations, automatic summarization, automatic paraphrasing of a text, and information retrieval. The aim of this internship is to contribute to this research by focusing on the prediction of relations between verbs. Predicting a relation between verbs consists in determining, for a pair of verbs (v1,v2), with or without an associated context, the type of semantic relation that links them, cf. Chklovski and Pantel (2004), Tremper and Frank (2013). These relations can be of different kinds: synonymy, antonymy, causality, entailment, etc. In this internship, we will focus on entailment relations. For example, the verbs "se balader" (to go for a stroll) and "bouger" (to move) are linked by an entailment relation, which will allow a computer to answer a query such as "Did Pierre go for a stroll?", given that the knowledge base indicates that Pierre moved.

To identify these relations, the intern will have to use the most suitable machine learning methods. The aim of these methods is to learn a function f : X --> Y, where X represents a set of features over pairs of verbs and Y is a Boolean indicating whether or not an entailment relation holds between these verbs. Different methods will be explored. We will start with supervised methods, which generalize from observations made on a corpus of data. However, this approach presupposes that the number of training instances is sufficiently large, which is unfortunately not always the case for the many tasks where the human cost of annotating instances is high. The second step will therefore be to explore semi-supervised learning methods in order to reduce this cost.

References

Timothy Chklovski and Patrick Pantel (2004): VerbOcean: Mining the web for fine-grained semantic verb relations. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, pages 33-40, Barcelona, Spain.

Tremper, G. and A. Frank (2013): A Discriminative Analysis of Fine-Grained Semantic Relations including Presupposition: Annotation and Classification. In: Dialogue and Discourse, 4 (2), Special Issue: Beyond Semantics. The Challenge of Annotating Pragmatic and Discourse Phenomena, edited by S. Dipper, H. Zinsmeister and B. Webber, 282-322.

From hamon at LIMSI.FR Wed Dec 4 13:04:50 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:04:50 +0100
Subject: Appel: ESWC 2014, Call for Challenge, Semantic Publishing
Message-ID:

Date: Tue, 3 Dec 2013 09:39:17 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20131203083937.F044EDDAAB at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemPub

** apologies for cross-posting **

==== Call for Challenge: Semantic Publishing ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemPub
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemPub

MOTIVATION AND OBJECTIVES

Scholarly publishing is increasingly enabling a new wave of applications that better support researchers in disseminating, exploiting and evaluating their results. The potential of publishing scientific papers enriched with semantic information is huge and raises interesting and challenging issues. Semantic Web technologies play a central role in this context, as they can help publishers to make scientific results available in an open format the whole research community can benefit from.

The Semantic Publishing Challenge 2014 is intended to be the first in a series of events at ESWC for producing and exploiting semantic publishing data. The main focus this year is on extracting information and using this information to assess the quality of scientific productions. Linked open datasets about scientific production exist - e.g. DBLP - but they usually cover basic bibliographic information, which is not sufficient to assess quality.
Quality-related information, such as the number of journal articles that cite a given publication, the number of times a workshop has been run by the same chairs and received many submissions, or the actual function of a citation, is often hidden and not yet available as LOD. The main goal of the Challenge is to build high-quality LOD that contains such information. Building on this year's outcomes, we plan to investigate the final publication and exploitation side in the next editions.

TARGET AUDIENCE

The Challenge is open to everyone from industry and academia.

TASKS

We ask challengers to automatically annotate a set of multi-format and multi-source input documents and to produce a Linked Open Dataset that fully describes these documents, their context, and relevant parts of their content. The evaluation will consist of running a set of queries against the produced dataset to assess its correctness and completeness. The input dataset will be split into two parts: a training/testing part and an evaluation part, which will be disclosed a few days before the submission deadline. Participants will be asked to run their tool on the evaluation dataset and to produce the final Linked Open Dataset. Further details about the organization of the Challenge will be provided.

The Challenge will include two tasks:

Task 1: Extraction and assessment of workshop proceedings information

Participants are required to extract information from a set of HTML tables of contents, partly including microformat and RDFa annotations but not necessarily being valid HTML, of selected computer science workshop proceedings published with the CEUR-WS.org open access service. The extracted information is expected to answer queries about the quality of these workshops, for instance by measuring their growth, longevity, connection with other events, distribution of papers and authors.

Task 2: Extraction and characterization of citations

Participants are required to extract information about the citations in scientific journals and their relevance. Input documents are in XML JATS and TaxPub, an official extension of JATS customized for taxonomic treatments, and are selected from the PubMedCentral Open Access Subset and the Pensoft Biodiversity Data Journal and ZooKeys archive. The extracted information is expected to be used for assessing the value of citations, for instance by considering their position in the paper, their co-location with other citations or their purpose.

EVALUATION

Participants will be requested to submit the LOD that their tool produces from the evaluation dataset, as well as a paper that describes their approach. They will also be given a set of queries in natural language form and will be asked to translate those queries into a SPARQL form that works on their LOD. The results of the queries on the produced LOD will be compared with the expected output, and precision and recall will be measured to identify the best performing approach. Separately, the most original approach will be selected by the Program Committee. Further details about the evaluation will be provided on the challenge wiki.

FEEDBACK AND DISCUSSION

A discussion group is open for participants to ask questions and to receive updates about the challenge (see link at bottom). Participants are invited to subscribe to this group as soon as possible and to communicate their intention to participate. They are also invited to use this channel to discuss problems in the input dataset and to suggest changes.

JUDGING AND PRIZES

After a first round of review, the Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work.
Submissions accepted for presentation will receive constructive reviews from the Program Committee, will be included in the Springer LNCS post-proceedings of ESWC, and will have a presentation slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a revised and extended paper to a dedicated Semantic Web Journal special issue.

Four winners will be selected. For each of the two tasks we will select:
* best performing tool, given to the paper which gets the highest score in the evaluation
* most original approach, selected by the Challenge Committee through the reviewing process

HOW TO PARTICIPATE

Participants are required to submit:
* Abstract: no more than 200 words.
* Description: It should explain the details of the automated annotation system, including why the system is innovative, how it uses Semantic Web technology, what features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. An outlook towards how the data could be consumed is appreciated but not strictly required. Papers must be submitted in PDF format, following the style of Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and must not exceed 5 pages in length.
* The Linked Open Dataset produced by their tool on the evaluation dataset (as a file or as a URL, in Turtle or RDF/XML).
* A set of SPARQL queries that work on that LOD and correspond to the natural language queries provided as input.

Participants will also be asked to submit their tool (source and/or binaries, or a link from which these can be downloaded, or a web service URL) for verification purposes. Further submission instructions will be published on the challenge wiki.
All submissions should be provided via EasyChair
https://www.easychair.org/conferences/?conf=eswc2014-challenges

MAILING LIST

We invite potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge.
https://lists.sti2.org/mailman/listinfo/eswc2014-sempub-challenge

IMPORTANT DATES

* December 3, 2013: Publication of the full description of tasks, rules and queries; publication of the training/testing dataset
* January 15, 2014, 23:59 (Hawaii time): Deadline for making remarks on the training/testing dataset
* January 20, 2014: Publication of the final training/testing dataset
* March 7, 2014, 23:59 (Hawaii time): Abstract Submission
* March 11, 2014: Publication of the evaluation dataset
* March 14, 2014, 23:59 (Hawaii time): Submission
* April 9, 2014, 23:59 (Hawaii time): Notification of acceptance
* May 27-29, 2014: Challenge days

CHALLENGE CHAIRS

* Angelo Di Iorio (Department of Computer Science and Engineering, University of Bologna, IT)
* Christoph Lange (Enterprise Information Systems, University of Bonn / Fraunhofer IAIS, DE)

PROGRAM COMMITTEE

Sören Auer (University of Bonn / Fraunhofer IAIS, DE) (supervisor)
Sarven Capadisli (University of Leipzig, DE)
Alexander Constantin (University of Manchester, UK)
Alexander Garcia Castro (Florida State University, US)
Leyla Jael Garcia Castro (Bundeswehr University of Munich, DE)
Aidan Hogan (DERI Galway, IE)
Evangelos Milios (Dalhousie University, CA)
Lyubomir Penev (Pensoft Publishers, BG)
Robert Stevens (University of Manchester, UK)
Jun Zhao (Lancaster University, UK)

We are inviting further members.

ESWC CHALLENGE COORDINATOR

* Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)

From hamon at LIMSI.FR Wed Dec 4 13:10:55 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:10:55 +0100
Subject: Appel: ESWC 2014, Call for Challenge, Concept-Level Sentiment Analysis
Message-ID:

Date: Tue, 3 Dec 2013 12:17:24 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20131203111744.59CD6DBB48 at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemSA

==== Call for Challenge: Concept-Level Sentiment Analysis ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemSA
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemSA

MOTIVATION AND OBJECTIVES

Mining opinions and sentiments from natural language is an extremely difficult task, as it involves a deep understanding of most of the explicit and implicit, regular and irregular, syntactic and semantic rules of a language. Existing approaches mainly rely on parts of text in which opinions and sentiments are explicitly expressed, such as polarity terms, affect words and their co-occurrence frequencies. However, opinions and sentiments are often conveyed implicitly through latent semantics, which makes purely syntactic approaches ineffective. To this end, concept-level sentiment analysis aims to go beyond a mere word-level analysis of text and provide novel approaches to opinion mining and sentiment analysis that allow a more efficient passage from (unstructured) textual information to (structured) machine-processable data, in potentially any domain.

Concept-level sentiment analysis focuses on a semantic analysis of text through the use of web ontologies or semantic networks, which allow the aggregation of conceptual and affective information associated with natural language opinions.
By relying on large semantic knowledge bases, concept-level sentiment analysis steps away from the blind use of keywords and word co-occurrence counts, and relies instead on the implicit features associated with natural language concepts.

This Challenge focuses on the introduction, presentation, and discussion of novel approaches to concept-level sentiment analysis. Participants will have to design a concept-level opinion-mining engine that exploits common-sense knowledge bases, e.g., SenticNet, and/or Linked Data and Semantic Web ontologies, e.g., DBPedia, to perform multi-domain sentiment analysis. The main motivation for the Challenge, in particular, is to go beyond a mere word-level analysis of natural language text and provide novel concept-level tools and techniques that allow a more efficient passage from (unstructured) natural language to (structured) machine-processable data, in potentially any domain.

Systems must have a semantic flavor (e.g., by making use of Linked Data or known semantic networks within their core functionalities) and authors need to show how the introduction of semantics can be used to obtain valuable information, functionality or performance. Existing natural language processing methods or statistical approaches can be used too, as long as semantics plays a main role within the core approach (engines based merely on syntax/word-count will be excluded from the competition).

TARGET AUDIENCE

The Challenge is open to everyone from industry and academia.

TASKS

The Concept-Level Sentiment Analysis Challenge is defined in terms of different tasks. The first task is elementary whereas the others are more advanced. The input units of each task are sentences. Sentences are assumed to be in grammatically correct American English and have to be processed according to the input format specified at http://sentic.net/challenge/sentence.

Elementary Task: Polarity Detection

The main goal of the task is polarity detection.
The proposed systems will be assessed according to precision, recall and F-measure of detected binary polarity values (1=positive; 0=negative) for each input sentence of the evaluation dataset, following the same format as in http://sentic.net/challenge/task0. The problem of subjectivity detection is not addressed within this Challenge, hence participants can assume that there will be no neutral sentences. Participants are encouraged to use the Sentic API or further develop and apply sentic computing tools.

Advanced Task #1: Aspect-Based Sentiment Analysis

The output of this task will be a set of aspects of the reviewed product and a binary polarity value associated with each such aspect, in the format specified at http://sentic.net/challenge/task1. So, for example, while for the Elementary task an overall polarity (positive or negative) is expected for a review about a mobile phone, this task requires a set of aspects (such as 'speaker', 'touchscreen', 'camera', etc.) and a polarity value (positive OR negative) associated with each such aspect. Systems will be assessed according to both aspect extraction and aspect polarity detection.

Advanced Task #2: Semantic Parsing

As suggested by the title, the Challenge focuses on sentiment analysis at concept level. This means that the proposed systems are not supposed to work at word/syntax level but rather to work with concepts/semantics. Hence, this task will evaluate the capability of the proposed systems to deconstruct natural language text into concepts, following the same format as in http://sentic.net/challenge/task2. SenticNet will be taken as a reference to test the efficiency of the proposed parsers, but extracted concepts will not necessarily have to match SenticNet concepts. The proposed systems, for example, are supposed to be able to extract a multi-word expression like 'buy christmas present' from sentences such as 'Today I bought a lot of very nice Christmas presents'.
The number of extracted concepts per sentence will be assessed through precision, recall and F-measure against the evaluation dataset.

Advanced Task #3: Topic Spotting

Input sentences will be about four different domains, namely: books, DVDs, electronics, and kitchen appliances. This task focuses on the automatic classification of sentences into one of these domains, in the format specified at http://sentic.net/challenge/task3. All sentences are assumed to belong to only one of the above-mentioned domains. The proposed systems are supposed to exploit the extracted concepts to infer which domain each sentence belongs to. Classification accuracy will be evaluated in terms of precision, recall and F-measure against the evaluation dataset.

EVALUATION DATASET

Systems will be evaluated against a testing dataset which will be revealed and released after the first round of evaluation during the Conference. The dataset will be made public on the challenge website. Participants are encouraged to train and/or test their own systems using the Blitzer Dataset. The testing dataset will be constructed in the same way and from the same sources as the Blitzer dataset.

EVALUATION

The evaluation will be performed by the members of the Program Committee. For systems that can be tuned with different parameters, please indicate a range of up to 4 sets of settings. Settings with the best F-measures will be considered for judgment. For each system, reviewers will give a numerical score within the range [1-10] and details motivating their choice. Scores will be given for the following aspects:
1. Use of common-sense knowledge and semantics;
2. Precision, recall, and F-measure with respect to the selected task;
3. Computational time;
4. Innovative nature of the approach.

JUDGING AND PRIZES

After a first round of review, the Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work.
Submissions accepted for presentation will be included in post-proceedings and will receive constructive reviews from the Program Committee. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a paper to a dedicated Semantic Web Journal special issue.

For the Concept-Level Sentiment Analysis Challenge there will be two awards for each task:
* Quantitative: the system with the highest average score in items 1-3 above;
* Innovative: the system with the highest score in item 4 above.

There will be a board of judges at the conference who will evaluate the systems again in more detail. The judges will then meet in private to discuss the entries and to determine the winners. The same system may be in contention for both awards.

HOW TO PARTICIPATE

The following information has to be provided:
* Abstract: no more than 200 words.
* Description: It should contain the details of the system, including why the system is innovative, how it uses Semantic Web technology, which features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. Papers must be submitted in PDF format, following the style of Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and must not exceed 5 pages in length.
* Web Access: The application can either be accessible via the web or downloadable. If the application is not publicly accessible, a password must be provided. A short set of instructions on how to use the application should be provided as well.

All submissions should be provided via EasyChair
https://www.easychair.org/conferences/?conf=eswc2014-challenges

Please share comments and questions with the challenge mailing list.
The organizers will assist you with any issues that may arise.

MAILING LIST

We invite potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge.
https://lists.sti2.org/mailman/listinfo/eswc2014-semsa-challenge

IMPORTANT DATES

* March 7, 2014, 23:59 (Hawaii time): Abstract Submission
* March 14, 2014, 23:59 (Hawaii time): Submission
* April 9, 2014, 23:59 (Hawaii time): Notification of acceptance
* May 27-29, 2014: Challenge days

CHALLENGE CHAIRS

* Erik Cambria (National University of Singapore, SG)
* Diego Reforgiato Recupero (CNR STLAB Laboratory, IT)

PROGRAM COMMITTEE

* Newton Howard (MIT Media Laboratory, US)
* ChengXiang Zhai (University of Illinois at Urbana-Champaign, US)
* Rada Mihalcea (University of North Texas, US)
* Ping Chen (University of Houston-Downtown, US)
* Yongzheng Zhang (LinkedIn Inc., US)
* Giuseppe Di Fabbrizio (Amazon Inc., US)
* Rui Xia (Nanjing University of Science and Technology, CN)
* Rafal Rzepka (Hokkaido University, JP)
* Amir Hussain (University of Stirling, UK)
* Alexander Gelbukh (National Polytechnic Institute, MX)
* Bjoern Schuller (Technical University of Munich, DE)
* Amitava Das (Samsung Research India, IN)
* Dipankar Das (National Institute of Technology, IN)
* Carlo Strapparava (Fondazione Bruno Kessler, IT)
* Stefano Squartini (Marche Polytechnic University, IT)
* Cristina Bosco (University of Torino, IT)
* Paolo Rosso (Technical University of Valencia, ES)

ESWC CHALLENGE COORDINATOR

* Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)

From hamon at LIMSI.FR Wed Dec 4 13:25:22 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:25:22 +0100
Subject: Appel: RECITAL 2014
Message-ID:

Date: Tue, 03 Dec 2013 17:37:52 +0100
From: Nuria Gala
Message-ID: <529E08E0.4010502 at lif.univ-mrs.fr>

RECITAL 2014: First call for papers
-----------------------------------------------------------------

RECITAL 2014
16th Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (Meeting of Student Researchers in Computer Science for Natural Language Processing).
Faculté Saint Charles, Aix Marseille Université (Marseille), 1-4 July 2014.

Important dates
-------------------------
- Submission deadline: Saturday 5 April 2014
- Notification to authors: Saturday 3 May 2014
- Final version: Saturday 17 May 2014

Presentation
-----------------
RECITAL 2014, the annual conference for young researchers associated with TALN, will take place in Marseille from 1 to 4 July 2014. RECITAL gives young researchers in Natural Language Processing (NLP) the opportunity to present their work and compare their approaches. It is reserved for students (master's and doctoral) and for young researchers who obtained their doctorate less than a year ago. Building on the success of previous years, we encourage the submission of work that is still preliminary, of thesis projects, and of work from the first months of research (state of the art, first leads, etc.).

The primary objective of RECITAL is to support the work of young researchers in NLP and to ease their integration into our community. To that end, we aim for:
- instructive reviews: authors must be able to understand the mistakes they may have made so that they can correct them and improve the quality of their work;
- encouraging reviews: there is never any need to discourage a young researcher; the watchwords should be to encourage and to guide;
- direct exchange: reviews will be sent to authors signed, and therefore non-anonymous. Authors (or, better still, reviewers) are free to discuss informally with each other during the conference.

The RECITAL best paper award will be presented at the closing ceremony.

Main topics
--------------------------
Papers may address the usual NLP topics:
- Analysis and generation in the following areas:
  + Phonetics
  + Phonology
  + Morphology
  + Syntax
  + Semantics
  + Discourse
- Development of linguistic resources for NLP:
  + Databases containing morphological, syntactic, semantic and/or phonological information
  + Grammars
  + Lexicons
  + Ontologies
  + Corpus linguistics
- NLP applications:
  + Sentiment or opinion analysis
  + Automatic categorization or classification
  + Word sense disambiguation
  + Natural-language human-machine dialogue
  + Computer-assisted teaching
  + Automatic indexing
  + Information retrieval and extraction
  + Automatic summarization
  + Anaphora resolution
  + Question answering systems
  + Machine translation
  + Semantic Web
- Approaches:
  + Formal linguistics intended to support automatic processing
  + Symbolic
  + Logical
  + Statistical
  + Based on machine learning

This list is not exhaustive, and the program committee will judge whether a proposed paper fits the conference.

Selection criteria
----------------------------
Authors must be students or young doctors who defended their thesis less than a year ago. Papers co-authored with established researchers (which includes thesis supervisors) must be submitted to TALN and not to RECITAL. Authors are invited to submit original research work that has not been previously published. Submissions will be reviewed by at least two specialists in the field. The following will be considered in particular:
- The correctness of the scientific and technical content
- The positioning of the work in the context of international research
- The organization and clarity of the presentation
- The relevance to the conference topics

Selected papers will be published in the conference proceedings. Depending on the program committee's recommendation, presentations will be given either orally or as posters.

Submission procedure
---------------------------------
Papers are to be written in French by French speakers, and in English by those who are not proficient in French. Papers must be 8 to 14 pages long. A LaTeX style file, a Word template and a LibreOffice template will be available on the conference website (www.taln2014.org).

Contact: nuria.gala(at)lif.univ-mrs.fr and/or klim.peshkov(at)lpl-aix.fr

From hamon at LIMSI.FR Wed Dec 4 13:21:25 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:21:25 +0100
Subject: Seminaire: L'informatique : des biens communs ?, Centre d'Alembert, Orsay, 18 decembre 2013
Message-ID:

Date: Tue, 3 Dec 2013 15:48:42 +0100
From: Centre d'Alembert
Message-Id:
X-url: http://www.centre-dalembert.u-psud.fr/

2013-2014 Seminar of the Centre d'Alembert (http://www.centre-dalembert.u-psud.fr/)
"Les sciences comme biens communs" ("The sciences as commons")

Session of Wednesday 18 December 2013
(NB: there is no session on 4 December, contrary to what was announced in our program sent on 5 November 2013)

"L'INFORMATIQUE : DES BIENS COMMUNS ?" ("Computing: a commons?") (provisional title)

Speakers: Jean-Pierre ARCHAMBAULT and Gilles DOWEK

Further information about the session will be sent to you later.

-----
Detailed program and contact:
Web: http://www.centre-dalembert.u-psud.fr
Email: centre.dalembert at u-psud.fr

Sessions take place at the Faculté des Sciences d'Orsay, conference building (bâtiment des colloques, 338), lecture room (1st floor, on the left), from 13:45 to 15:45 (free admission)
Access: http://goo.gl/maps/cXezZ

From hamon at LIMSI.FR Wed Dec 4 13:26:50 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:26:50 +0100
Subject: Appel: CNL 2014
Message-ID:

Date: Tue, 3 Dec 2013 20:46:53 +0100
From: Camilo Thorne
Message-ID:
X-url: http://attempto.ifi.uzh.ch/site/cnl2014/

Announcement and Call for Papers

*** FOURTH WORKSHOP ON CONTROLLED NATURAL LANGUAGE (CNL 2014) ***

Co-located with COLING 2014
20-22 August 2014
Galway, Ireland
http://attempto.ifi.uzh.ch/site/cnl2014/

Proceedings in Springer LNCS.

---------------------------------------------------------------------

This workshop on controlled natural language (CNL) has a broad scope and embraces all approaches that are based on natural language and apply restrictions on vocabulary, grammar, and/or semantics. This includes (but is certainly not limited to) approaches that have been called simplified language, plain language, formalized language, processable language, fragments of language, phraseologies, conceptual authoring, language generation, and guided natural language interfaces.

Some CNLs are designed to improve communication among humans, especially for non-native speakers of the respective natural language. In other cases, the restrictions on the language are supposed to make it easier for computers to analyze such texts in order to improve computer-aided, semi-automatic, or automatic translations into other languages. A third group of CNLs aims to enable reliable automated reasoning and formal knowledge representation from seemingly natural texts. All these types of CNL are covered by this workshop.

Topics
------
Possible topics for CNL 2014 include:
- CNL for knowledge representation
- CNL for query interfaces
- CNL for specifications
- CNL for business rules
- CNL for dialogue systems
- CNL for machine translation
- CNL for improved understandability of texts
- CNL for natural language generation
- design of CNLs
- CNL applications
- CNL evaluation
- usability and acceptance of CNL
- CNL grammars and lexica
- multilingual CNLs
- reasoning in CNL
- spoken CNL
- CNL in the context of the Semantic Web and Linked Open Data
- CNL in the government
- CNL in industry
- CNL use cases
- theoretical properties of CNL

Important Dates
---------------
Submission deadline: 14 March 2014
Notification of acceptance: 12 May 2014
Deadline for revised papers: 9 June 2014
Workshop: 20-22 August 2014

Submissions and Proceedings
---------------------------
We invite researchers to submit papers with novel contributions in the area of CNL. These research papers should be formatted according to the Springer LNCS format and should not exceed 10 pages. Papers should be submitted via the EasyChair conference system:
http://www.easychair.org/conferences/?conf=cnl2014

Accepted papers will be included in the workshop proceedings to be published by Springer in their LNCS/LNAI series and indexed in all major citation databases including ISI Web of Science and Scopus.

Venue
-----
The workshop takes place in Ireland at INSIGHT at NUI Galway (formerly DERI).

Organization Committee
----------------------
- Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Kaarel Kaljurand (University of Zurich, Switzerland)
- Tobias Kuhn (ETH Zurich, Switzerland)

Program Committee
-----------------
- Krasimir Angelov (Chalmers University, Sweden)
- Johan Bos (University of Groningen, Netherlands)
- Paul Buitelaar (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Olga Caprotti (University of Helsinki, Finland)
- Rogan Creswick (Galois, USA)
- Danica Damljanovic (University of Sheffield, UK)
- Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Ronald Denaux (iSOCO, Spain)
- Vania Dimitrova (University of Leeds, UK)
- Ramona Enache (Chalmers University, Sweden)
- Esra Erdem (Sabanci University, Turkey)
- Sébastien Ferré (University Rennes 1, France)
- Norbert E. Fuchs (University of Zurich, Switzerland)
- Normunds Grūzītis (University of Latvia)
- Siegfried Handschuh (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Stefan Höfler (University of Zurich, Switzerland)
- Kaarel Kaljurand (University of Zurich, Switzerland)
- Peter Koepke (University of Bonn, Germany)
- Tobias Kuhn (ETH Zurich, Switzerland)
- Hans Leiß
(University of Munich, Germany)
- Reinhard Muskens (Tilburg University, Netherlands)
- Adegboyega Ojo (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Gordon Pace (University of Malta)
- Richard Power (The Open University, UK)
- Laurette Pretorius (University of South Africa)
- Allan Ramsay (University of Manchester, UK)
- Mike Rosner (University of Malta)
- Aarne Ranta (University of Gothenburg, Sweden)
- Rolf Schwitter (Macquarie University, Australia)
- Geoff Sutcliffe (University of Miami, USA)
- Irina Temnikova (University of Wolverhampton, UK)
- Allan Third (The Open University, UK)
- Camilo Thorne (Free University of Bozen-Bolzano, Italy)
- Silvie Spreeuwenberg (LibRT, Netherlands)
- Uta Schwertel (imc, Germany)
- Jeroen Van Grondelle (Be Informed, Netherlands)
- Yorick Wilks (University of Sheffield, UK)
- Adam Wyner (University of Aberdeen, UK)

From hamon at LIMSI.FR Wed Dec 4 13:27:43 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:27:43 +0100
Subject: Job: Postdoc position in WSD, LIMSI-CNRS
Message-ID:

Date: Wed, 4 Dec 2013 11:19:06 +0100
From: Marianna Apidianaki
Message-Id: <1EFA7D3B-766B-4504-90F2-3F188CEA6D13 at limsi.fr>
X-url: http://transread.limsi.fr
X-url: http://www.limsi.fr/Scientifique/tlp/mt/

[apologies for cross-posting]

******************************************************************
POSTDOC POSITION IN WORD SENSE DISAMBIGUATION AT LIMSI-CNRS, Orsay (Paris area), FRANCE
******************************************************************

A postdoctoral position in Word Sense Disambiguation (WSD) is available at the LIMSI-CNRS lab. The position is funded by the national (ANR) project TransRead (http://transread.limsi.fr) aimed at developing new multilingual text processing applications and facilitating the reading of multilingual documents for foreign language readers and learners.

The postdoctoral researcher will work on both data-driven and knowledge-based Word Sense Disambiguation methods exploiting information in parallel corpora and large-scale multilingual resources (such as BabelNet and Wiktionary). The disambiguation output will serve to facilitate foreign language reading by providing semantic annotations in different languages. Research results will also be exploited in other applications developed in the group, such as Machine Translation.

Requirements:
- Ph.D.
(or completed by February 2014) in Computer Science, Natural Language Processing, Computational Linguistics - Solid programming skills - Experience with statistical approaches to semantics, cross-lingual methods for NLP or Statistical Machine Translation - Strong publication record - Fluent English The successful candidate will join a dynamic research team working on various aspects of Statistical Machine Translation, Word Sense Disambiguation and paraphrasing. For information regarding our activities, see http://www.limsi.fr/Scientifique/tlp/mt/ Duration: 12 months Application deadline: Open until filled Interested candidates should send their CV and the names and contact information of 2 referees to Marianna Apidianaki (marianna at limsi.fr). Informal inquiries can be sent to the same address. About the LIMSI-CNRS The LIMSI-CNRS lab is situated at Orsay, a green area 25 km south of Paris. A suburban train connects Orsay to Paris city center. Detailed information about the LIMSI lab can be found at http://www.limsi.fr From hamon at LIMSI.FR Wed Dec 4 13:23:19 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 4 Dec 2013 14:23:19 +0100 Subject: These: Assaf Urieli, Analyse syntaxique robuste du fran=?ISO-8859-1?Q?=E7ais?= Message-ID: Date: Tue, 3 Dec 2013 16:34:19 +0100 From: Assaf Urieli Message-ID: Bonjour, J'ai le plaisir de vous inviter ? la soutenance de ma th?se intitul?e "Analyse syntaxique robuste du fran?ais : concilier m?thodes statistiques et connaissances linguistiques dans l'outil Talismane". La soutenance aura lieu le mardi 17 d?cembre 2013 ? 14h00 dans la salle D31 de la Maison de la Recherche de l'Universit? de Toulouse le Mirail. La Maison de la Recherche se trouve juste ? c?t? de la biblioth?que universitaire. Un plan se trouve ? 
http://www.univ-tlse2.fr/accueil/infos/pratique/adresses-et-acces/plan-du-campus-avec-l-index-des-ufr-services-et-salles-de-cours-6337.kjsp

You are also invited to the reception (pot de thèse) that will follow the defense.

Best regards,
Assaf Urieli

------------------------------------------------------------------------

The thesis committee will consist of:
Mr. Alexis Nasr, Professor, LIF, Marseille - Reviewer (rapporteur)
Mr. Eric Wehrli, Professor, LATL, Geneva - Reviewer (rapporteur)
Ms. Marie Candito, Associate Professor (Maître de Conférences), ALPAGE, Paris Diderot - Examiner
Mr. Nabil Hathout, Senior Researcher (Directeur de Recherche), CLLE, Toulouse - Examiner
Mr. Ludovic Tanguy, Associate Professor (Maître de Conférences HDR), CLLE, Toulouse - Thesis supervisor

Title in French: Analyse syntaxique robuste du français : concilier méthodes statistiques et connaissances linguistiques dans l'outil Talismane

Abstract in French:
Dans cette thèse, nous explorons l'analyse syntaxique statistique robuste du français. Notre principal souci est de trouver des méthodes qui permettent au linguiste d'injecter des connaissances et/ou des ressources linguistiques dans un moteur statistique afin d'améliorer les résultats à la fois globalement et pour certains phénomènes spécifiques. D'abord, nous décrivons le schéma d'annotation en dépendances du français, et les algorithmes capables de produire cette annotation, en particulier le parsing par transitions. Après avoir exploré les algorithmes d'apprentissage automatique supervisé pour les problèmes de classification en TAL, nous présentons l'analyseur syntaxique Talismane, développé dans le cadre de cette thèse, qui comprend quatre modules statistiques - le découpage en phrases, la segmentation en mots, l'étiquetage morpho-syntaxique et le parsing - ainsi que les diverses ressources linguistiques utilisées par les modèles de base. Nos premières expériences ont permis d'identifier la meilleure configuration d'apprentissage parmi les nombreuses configurations possibles.
Ensuite, nous explorons les améliorations apportées par le principe de recherche par faisceau (beam search) et la propagation du faisceau d'un module à un autre. Finalement, nous présentons une série d'expériences dont le but est de corriger des erreurs linguistiques spécifiques au moyen de descripteurs ciblés pour l'apprentissage. Une de nos innovations est l'introduction des règles qui imposent ou interdisent certaines décisions locales, permettant ainsi de contourner le modèle statistique, et nous explorons l'utilisation de règles pour les erreurs que les descripteurs n'ont pas pu corriger. Finalement, nous explorons l'utilisation de ressources linguistiques à large couverture, au travers d'une expérience d'apprentissage semi-supervisé avec une ressource lexicale produite par une analyse sémantique distributionnelle.

Title in English: Robust French syntax analysis: reconciling statistical methods and linguistic knowledge in the Talismane toolkit

Abstract in English:
In this thesis we explore robust statistical syntax analysis for French. Our main concern is to explore methods whereby the linguist can inject linguistic knowledge and/or resources into the robust statistical engine in order to improve results both globally and for specific phenomena. We first explore the dependency annotation schema for French, concentrating on certain phenomena. Next, we look into the various algorithms capable of producing this annotation, and in particular the transition-based parsing algorithm used in the rest of this thesis. After exploring supervised machine learning algorithms for NLP classification problems, we present the Talismane toolkit for syntax analysis, built within the framework of this thesis, including four statistical modules - sentence boundary detection, tokenisation, pos-tagging and parsing - as well as the various linguistic resources used for the baseline model, including corpora, lexicons and feature sets.
Our first experiments enabled us to identify the best baseline among various machine learning configurations. We then look into improvements made possible by a beam search and by beam propagation from one module to the next. Finally, we present a series of experiments aimed at correcting errors related to specific linguistic phenomena, using targeted features. One of our innovations is the introduction of rules that can impose or prohibit certain decisions locally, thus bypassing the statistical model. We explore the usage of rules for errors that the features are unable to correct. Finally, we look into the enhancement of targeted features by large-scale linguistic resources, and in particular a semi-supervised approach using a lexical resource constructed using distributional semantic techniques.

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version :
Archives : http://listserv.linguistlist.org/archives/ln.html
http://liste.cines.fr/info/ln
La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues)
Information et adhesion : http://www.atala.org/
ATALA décline toute responsabilité
concernant le contenu des messages diffusés sur la liste LN
-------------------------------------------------------------------------

From hamon at LIMSI.FR Wed Dec 4 13:17:51 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:17:51 +0100
Subject: Appel: 19th Conference on Formal Grammar
Message-ID:

Date: Tue, 3 Dec 2013 15:34:11 +0100 (CET)
From: Frank Richter
Message-ID:
X-url: http://fg.phil.hhu.de/

FIRST CALL FOR PAPERS

FG-2014: The 19th Conference on Formal Grammar
Tuebingen, Germany, August 16-17 2014
http://fg.phil.hhu.de/

Collocated with the European Summer School in Logic, Language and Information

Background

FG-2014 is the 19th conference on Formal Grammar, to be held in conjunction with the European Summer School in Logic, Language and Information, which takes place in 2014 in Tuebingen, Germany.

Previous Formal Grammar meetings were held in Barcelona (1995), Prague (1996), Aix-en-Provence (1997), Saarbruecken (1998), Utrecht (1999), Helsinki (2001), Trento (2002), Vienna (2003), Nancy (2004), Edinburgh (2005), Malaga (2006), Dublin (2007), Hamburg (2008), Bordeaux (2009), Copenhagen (2010), Ljubljana (2011), Opole (2012) and Duesseldorf (2013).

Aims and Scope

FG provides a forum for the presentation of new and original research on formal grammar, mathematical linguistics and the application of formal and mathematical methods to the study of natural language.

Themes of interest include, but are not limited to,
* formal and computational phonology, morphology, syntax, semantics and pragmatics;
* model-theoretic and proof-theoretic methods in linguistics;
* logical aspects of linguistic structure;
* constraint-based and resource-sensitive approaches to grammar;
* learnability of formal grammar;
* integration of stochastic and symbolic models of grammar;
* foundational, methodological and architectural issues in grammar and linguistics;
* mathematical foundations of statistical approaches to linguistic analysis.
Previous conferences in this series have welcomed papers from a wide variety of frameworks. Submission Details We invite **electronic** submissions of original, 16-page papers (including references and possible technical appendices). Authors are encouraged to use the Springer-Verlag LNCS style: http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0 The submission deadline is **February 23 2014**. Papers must be submitted electronically at EasyChair, https://www.easychair.org. A submission URL for FG-2014 will be provided soon. Papers should report original work which was not presented in other conferences. However, simultaneous submission is allowed, provided that the authors indicate other conferences to which the work was submitted in a footnote. Note that accepted papers can only be presented in one of the venues. Submissions will be reviewed anonymously by at least three reviewers. Accepted papers will be published as a volume in the Springer LNCS series, under the FoLLI subline, either separately or jointly with the papers from FG-2015, depending on the number of accepted papers. 
Important Dates

* February 23, 2014: Deadline for paper submission
* April 20, 2014: Notification of acceptance
* June 1, 2014: Camera ready copies due
* August 16-17, 2014: Conference dates

Standing committee

* Glyn Morrill (Universitat Politècnica de Catalunya, Spain)
* Reinhard Muskens (Tilburg University, The Netherlands)
* Rainer Osswald (Heinrich-Heine-Universitaet Duesseldorf, Germany)
* Frank Richter (Heinrich-Heine-Universitaet Duesseldorf, Germany)

From hamon at LIMSI.FR Wed Dec 4 13:16:57 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:16:57 +0100
Subject: Ressources: ELRA - Language Resources Catalogue - Update
Message-ID:

Date: Tue, 03 Dec 2013 14:12:06 +0100
From: ELRA ELDA Information
Message-ID: <529DD8A6.4060109 at elda.org>

[Our apologies if you have received multiple copies of this announcement.]

We are happy to announce that 2 new Pronunciation Dictionaries from the GlobalPhone database (Chinese-Mandarin and Korean) are now available in our catalogue.

*The GlobalPhone Pronunciation Dictionaries:*

GlobalPhone is a multilingual speech and text database collected at Karlsruhe University, Germany. The GlobalPhone pronunciation dictionaries contain the pronunciations of all word forms found in the transcription data of the GlobalPhone speech & text database.
The pronunciation dictionaries are currently available in 17 languages: Arabic (29230 entries/27059 words), Bulgarian (20193 entries), Croatian (23497 entries/20628 words), Czech (33049 entries/32942 words), French (36837 entries/20710 words), German (48979 entries/46035 words), Hausa (42662 entries/42079 words), Japanese (18094 entries), Polish (36484 entries), Portuguese (Brazilian) (54146 entries/54130 words), Russian (28818 entries/27667 words), Spanish (Latin American) (43264 entries/33960 words), Swedish (about 25000 entries), Turkish (31330 entries/31087 words), Vietnamese (38504 entries/29974 words), Chinese-Mandarin (73388 pronunciations), and Korean (3500 syllables).

Special prices are offered for a combined purchase of several GlobalPhone languages.

Available GlobalPhone Pronunciation Dictionaries are listed below (click on the links for further details):

*ELRA-S0340 GlobalPhone French Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1197

*ELRA-S0341 GlobalPhone German Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1198

*ELRA-S0348 GlobalPhone Japanese Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1199

*ELRA-S0350 GlobalPhone Arabic Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1200

*ELRA-S0351 GlobalPhone Bulgarian Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1201

*ELRA-S0352 GlobalPhone Czech Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1202

*ELRA-S0353 GlobalPhone Hausa Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1203

*ELRA-S0354 GlobalPhone Polish Pronunciation Dictionary*
For more information, see:
http://catalog.elra.info/product_info.php?products_id=1204

*ELRA-S0355 GlobalPhone Portuguese (Brazilian) Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1205

*ELRA-S0356 GlobalPhone Swedish Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1206

*ELRA-S0358 GlobalPhone Croatian Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1207

*ELRA-S0359 GlobalPhone Russian Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1208

*ELRA-S0360 GlobalPhone Spanish (Latin American) Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1209

*ELRA-S0361 GlobalPhone Turkish Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1210

*ELRA-S0362 GlobalPhone Vietnamese Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1211

*** NEW ***

*ELRA-S0363 GlobalPhone Chinese-Mandarin Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1212

*ELRA-S0364 GlobalPhone Korean Pronunciation Dictionary*
For more information, see: http://catalog.elra.info/product_info.php?products_id=1213

For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli at elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version :
Archives : http://listserv.linguistlist.org/archives/ln.html
http://liste.cines.fr/info/ln
La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues)
Information et adhesion : http://www.atala.org/
ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN
-------------------------------------------------------------------------

From hamon at LIMSI.FR Wed Dec 4 13:13:22 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 4 Dec 2013 14:13:22 +0100
Subject: Appel: Special issue of BMC Bioinformatics on BioNLP Shared Task
Message-ID:

Date: Tue, 03 Dec 2013 14:03:35 +0100
From: Claire Nedellec
Message-ID: <529DD6A7.9040804 at jouy.inra.fr>
X-url: http://2013.bionlp-st.org/bmc-bioinformatics-special-issue

====================================================================
CALL FOR PAPERS
Special issue of BMC Bioinformatics on BioNLP Shared Task 2013
====================================================================

We are pleased to announce a special issue of BMC Bioinformatics on the BioNLP Shared Task (BioNLP-ST) 2013. BMC Bioinformatics is a premier open-access journal for BioNLP research, with an Impact Factor of 3.02, and is indexed by PubMed, Scopus, ACM and Thomson Reuters (ISI) among others.

We invite two kinds of submissions:

1- extended versions of BioNLP-ST 2013 workshop papers. Each extended submission must include a substantial amount of previously unpublished content. In this type of submission, the workshop paper should be cited and the extended contribution has to be clearly stated. Please refer to BMC guidelines: http://www.biomedcentral.com/about/duplicatepublication

2- papers on new contributions to the BioNLP-ST 2013 tasks.

The BioNLP-ST 2013 data and evaluation services are kept online at: http://2013.bionlp-st.org/tasks for new participants to register and evaluate their results on these datasets.
For a general overview on BioNLP Shared Task 2013, please refer to the task homepage: http://2013.bionlp-st.org/ All submissions will undergo full peer review by 3 reviewers. We expect that 7-8 papers will be selected to be included in the special issue based on the reviews. When preparing your manuscript, please follow the author guidelines of BMC Bioinformatics: http://www.biomedcentral.com/bmcbioinformatics/ifora/ Please note also the following: * The submissions should NOT be made using the general BMC submissions site, but through a special issue submission site that will be available shortly. * For accepted papers, the authors have to be prepared to pay the article-processing charges. See http://www.biomedcentral.com/bmcbioinformatics/about/faq/openaccess for more details. IMPORTANT DATES Jan 27 Manuscript submission due (draft version) Feb 3 Manuscript submission (final version) Mar 8 Acceptance notification of first-round review To prepare the review process, we ask all those planning to submit a manuscript to submit an early version of the paper, including title and abstract, prior to the primary manuscript submission. The draft version is considered a notification of intent to submit and will only be used to coordinate reviews; it will not be considered in review of the manuscript. Note that the initial submissions must include sufficient information for reviewer assignment, e.g. title and abstract. The actual review will begin from Feb 4. So, the authors will have a chance to update their manuscripts by Feb 3. 
SPECIAL ISSUE EDITORS

Claire Nédellec, National Institute for Agricultural Research (INRA)
Jin-Dong Kim, Database Center for Life Science (DBCLS)
Sampo Pyysalo, University of Turku
Pierre Zweigenbaum, National Center for Scientific Research (CNRS)

Read more: http://2013.bionlp-st.org/bmc-bioinformatics-special-issue

From hamon at LIMSI.FR Sat Dec 7 20:18:07 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:18:07 +0100
Subject: Stage: Stage M2 (recherche), LIRMM, Combining Stochastic and Knowledge-based Modeling for Natural Language Understanding
Message-ID:

Date: Wed, 4 Dec 2013 17:15:39 +0100
From: Jean-Philippe Prost
Message-ID:

Hello,

An M2 (research Master's) internship is available at LIRMM (U. Montpellier 2), from January 20 to mid-June 2014.

*Applications are requested as soon as possible.*

Title: Combining Stochastic and Knowledge-based Modelling for Natural Language Understanding
Supervisor: Jean-Philippe Prost (Prost at lirmm.fr)
Location: LIRMM, Montpellier
Duration: 5 months
Funding: according to the statutory pay scale
A PhD may follow, depending on results (application subject to selection).

This internship topic is in Natural Language Processing (NLP; TAL in French).
Its aim is to investigate one possible way of hybridizing logical modelling and probabilistic modelling for the representation of linguistic knowledge. The topic focuses more specifically on the syntactic dimension.

==============

The best-performing parsers at present are called "robust" because they can produce a syntactic tree for any input sentence, including ill-formed ones. These parsers are built on machine learning algorithms that construct the most probable tree given the input sentence. This robustness comes at the cost of a substantial loss of information. For example, the question of the well-formedness (grammaticality) of the input is sidestepped, since the analysis of a well-formed sentence does not necessarily have maximal probability.

Yet an exact answer to this problem is possible, simply by plugging a logic-based reasoning module onto the non-deterministic output of a stochastic parser. What would be desirable, however, is to integrate this exact resolution process into the stochastic parsing process itself.

The aim of this internship is therefore to explore different possible approaches to this question. One could, for example, integrate a model-checking mechanism (in the sense of logical model theory) into the reranking of the n most probable candidate models generated by a stochastic parser.

The internship will include a state-of-the-art survey, a theoretical exploration part, and an implementation (programming) part.

JP.
Prost

From hamon at LIMSI.FR Sat Dec 7 20:19:41 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:19:41 +0100
Subject: Stage: Stage M2 recherche en TAL/apprentissage automatique
Message-ID:

Date: Thu, 5 Dec 2013 11:07:34 +0100
From: Isabelle Tellier
Message-Id: <730D5B02-4640-4012-930A-11F49EBF2FC7 at univ-paris3.fr>
X-url: http://ligm.u-pem.fr
X-url: http://www.lattice.cnrs.fr

M2 research internship topic in NLP: learning a dependency parser for Medieval French

As part of the ANR project Syntactic Reference Corpus of Medieval French (SRCMF, 2008-2011), a treebank (a collection of trees) of dependency analyses of Medieval French utterances was built. It contains 260,000 words (including about 27,000 verbal heads) annotated with morpho-syntactic tags and linked by labelled dependency relations.

The goal of the internship is to exploit this corpus with machine learning in order to obtain a dependency parser for Medieval French, possibly coupled with a morpho-syntactic tagger. These tools can then be applied to new texts. They will make it possible to study precisely the grammatical specificities of Medieval French, in comparison with those of contemporary French.
The internship will be supervised by a specialist in Medieval French (Sophie Prévost) and by specialists in NLP and machine learning (Matthieu Constant and Isabelle Tellier).

Required skills:
- M2 or engineering-degree level in computer science or NLP
- Knowledge of (or at least a well-argued interest in) NLP and machine learning
- Programming skills

Internship conditions:
- Pay: internship allowance (1/3 of the SMIC, the French minimum wage).
- Duration: 4 to 6 months, starting as soon as possible in 2014.
- Location: the internship will be jointly supervised by the LIGM in Marne-la-Vallée (http://ligm.u-pem.fr) and the Lattice in Montrouge (http://www.lattice.cnrs.fr); expect to travel between the two sites

Send a CV and cover letter to Matthieu Constant (Matthieu.Constant at u-pem.fr), Sophie Prévost (sophie.prevost at ens.fr), Isabelle Tellier (isabelle.tellier at univ-paris3.fr)

From hamon at LIMSI.FR Sat Dec 7 20:24:50 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:24:50 +0100
Subject: Appel: 4th DICTAP2014
Message-ID:

Date: Thu, 5 Dec 2013 23:00:49 -0800
Message-ID: <4f9512da14cb095845ce6e8eb6c9d126 at localhost.localdomain>
X-url: http://sdiwc.net/conferences/2014/dictap2014/

Please Accept our Apology for Multiple Posting. THANK YOU.
==============================================================
The Fourth International Conference on Digital Information and Communication Technology and its Applications (DICTAP2014)
May 6-8, 2014
University of the Thai Chamber of Commerce, Bangkok, Thailand
http://sdiwc.net/conferences/2014/dictap2014/
All registered papers will be included in SDIWC Digital Library.
==============================================================

The conference aims to enable researchers to build connections between different digital applications. The conference welcomes papers on the following (but not limited to) research topics:

- Security in Information and Telecommunication System
- Network Systems and Devices
- Wireless and Optical Communications
- Algorithms, Architecture, and Infrastructures
- Information Content Security
- Cloud Computing and Computer Networks
- Sensor Networks and Embedded System
- E-Learning, E-Commerce, E-Business and E-Government
- Data Exchange Issues and Supply Chain
- Information Retrieval
- Web Services, Web based Application
- Data Grids, Data and Information Quality
- Data Warehouses and Data Mining
- Image Analysis and Image Processing
- Management and Diffusion of Multimedia Applications
- Mobile, Ad Hoc and Sensor Network Security
- Video Search and Video Mining
- Enterprise Computing
- Web Mining including Web Intelligence and Web 3.0
- Knowledge Management
- Compression and Coding
- XML and other extensible languages
- Intelligent and Robust System
- ICT for Social and Humanity
- Security and Access Control
- Constraint Programming
- Ubiquitous Systems
- Semantic Web, Ontologies and Rules

Researchers are encouraged to submit their work electronically. All papers will be fully refereed by a minimum of two specialized referees. Before final acceptance, all referees' comments must be considered.
IMPORTANT DATES: Submission Deadline : April 6, 2014 Notification of Acceptance: April 15, 2014 Camera Ready Submission : April 26, 2014 Registration : April 26, 2014 Conference Dates : May 6-8, 2014 ---------------------------------------- From hamon at LIMSI.FR Sat Dec 7 20:35:43 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 7 Dec 2013 21:35:43 +0100 Subject: Appel: SIGDIAL 2014 Conference, June 18th-20th 2014 Message-ID: Date: Fri, 6 Dec 2013 19:26:40 +0000 From: Helen Hastie Message-ID: X-url: http://www.sigdial.org/workshops/conference15 X-url: http://www.cs.jhu.edu/ACL2014/CallforPapers.htm SIGDIAL 2014 CONFERENCE Wednesday, June 18 to Friday, June 20, 2014 The 15th Annual Meeting of the Special Interest Group on Discourse and Dialog will be co-located with the 8th International Conference on Natural Language Generation (INLG 2014) in Philadelphia, PA, USA and immediately preceding ACL 2014. http://www.sigdial.org/workshops/conference15 (under construction) CALL FOR PAPERS The 2014 SIGDIAL conference continues a series of fourteen conferences, providing a regular forum for the presentation of cutting edge research across the areas of discourse and dialog and attracting a diverse set of participants from academia and industry. The conference is sponsored by the SIGdial organization, which serves as the Special Interest Group on discourse and dialog for both ACL and ISCA. 
TOPICS OF INTEREST We welcome formal, corpus-based, system-building or analytical work on discourse and dialog including but not restricted to the following themes and topics: - Discourse Processing and Dialog Systems - Corpora, Tools and Methodology - Pragmatic and/or Semantic Modeling - Dimensions of Interaction - Open Domain Dialog - Style, Voice and Personality in Spoken Dialog and Written Text - Applications of Dialog and Discourse Processing Technology - Novel Methods for Generation Within Dialog, for a joint special session with INLG IMPORTANT DATES Special Session Proposal Deadline: Sunday, 9 February 2014 (23:59, GMT-11) Special Session Notification: Monday, 17 February 2014 Long, Short and Demonstration Paper Submission Deadline: Sunday, 9 March 2014 (23:59, GMT-11) Paper Notification: Friday, 18 April 2014 Final Paper Due - For papers accepted subject to receiving mentoring Wednesday, 14 May 2014 - For accepted papers Friday, 23 May 2014 Conference Wednesday, June 18, 2014 to Friday, June 20, 2014 SUBMISSIONS Special Session Proposals The SIGdial organizers welcome the submission of special session proposals. A SIGDIAL special session is the length of a regular session at the conference; may be organized as a poster session, a poster session with panel discussion, or an oral presentation session; and will be held on the last day of the conference. Special sessions may, at the discretion of the SIGdial organizers, be held as parallel sessions. Those wishing to organize a special session should prepare a two-page proposal containing: a summary of the topic of the special session; a list of organizers and sponsors; a list of people who may submit and participate; and a requested format (poster/panel/oral session). These proposals should be sent to conference[a]sigdial.org by the special session proposal deadline. Special session proposals will be reviewed jointly by the general and program co-chairs. 
Papers The program committee welcomes the submission of long papers, short papers, and demonstration descriptions. All accepted submissions will be published in the conference proceedings. Long papers may, at the discretion of the technical program committee, be accepted for oral or poster presentation. They must be no longer than 8 pages, including title, content, and examples. Two additional pages are allowed for references and appendices, which may include extended example discourses or dialogs, algorithms, graphical representations, etc. Short papers will be presented as posters. They should be no longer than 4 pages, including title and content. One additional page is allowed for references and appendices. Demonstration papers should be no longer than 3 pages, including references. A separate one-page document should be provided to the program co-chairs for demonstration descriptions, specifying furniture and equipment needed for the demo. Authors of a submission may designate their paper to be considered for a SIGDIAL special session. However, in 2014 all papers will undergo regular peer review and appear in the SIGDIAL proceedings. Consequently, no paper is guaranteed to appear in any particular session at SIGDIAL. Papers that have been or will be submitted to other meetings or publications must provide this information (see submission format). A paper accepted for presentation at SIGDIAL 2014 must not have been presented at any other meeting with publicly available proceedings. Any questions regarding submissions can be sent to the program co-chairs at program-chairs[at]sigdial.org . Authors are encouraged to submit additional supportive material such as video clips or sound clips and examples of available resources for review purposes. Submission is electronic using paper submission software. FORMAT All long, short, and demonstration submissions should follow the two-column ACL 2014 format. 
We strongly recommend the use of ACL LaTeX style files or Microsoft Word style files tailored for the ACL 2014 conference. Submissions must conform to the official ACL 2014 style guidelines (http://www.cs.jhu.edu/ACL2014/CallforPapers.htm), and they must be electronic in PDF. As in most previous years, submissions will not be anonymous.

MENTORING SERVICE

For several years, the SIGDIAL conference has offered a mentoring service. Submissions with innovative core ideas that may need language (English) or organizational assistance will be flagged for "mentoring" and conditionally accepted with recommendation to revise with a mentor. An experienced mentor who has previously published in the SIGDIAL venue will then help the authors of these flagged papers prepare their submissions for publication. Any questions about the mentoring service can be addressed to the mentoring chair at mentoring[at]sigdial.org.

STUDENT SUPPORT

SIGdial also offers a limited number of scholarships for students presenting a paper accepted to the conference. Application materials will be posted at the conference website.

BEST PAPER AWARDS

In order to recognize significant advancements in dialog and discourse science and technology, SIGDIAL will recognize two best paper awards. A selection committee consisting of prominent researchers in the fields of interest will select the recipients of the awards.

SPONSORSHIP

SIGDIAL offers a number of opportunities for sponsors. For more information, email the conference organizers at sponsor-chair[at]sigdial.org.

DIALOG AND DISCOURSE

SIGDIAL authors are encouraged to submit their research to the journal Dialog and Discourse, which is endorsed by SIGdial.

ORGANIZING COMMITTEE

General Co-Chairs
Kallirroi Georgila, University of Southern California, USA
Matthew Stone, Rutgers, The State University of New Jersey, USA

Technical Program Co-Chairs
Helen Hastie, Heriot-Watt University, Edinburgh, UK
Ani Nenkova, University of Pennsylvania, USA

Mentoring Chair
Svetlana Stoyanchev, AT&T Research Labs, USA

Local Chair
Keelan Evanini, Educational Testing Service, USA

Sponsorships Chair
Giuseppe Di Fabbrizio, Amazon.com, USA

SIGdial President
Amanda Stent, Yahoo! Labs, USA

SIGdial Vice President
Jason Williams, Microsoft Research, USA

SIGdial Secretary/Treasurer
Kristiina Jokinen, University of Helsinki, Finland

From hamon at LIMSI.FR Sat Dec 7 20:30:10 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:30:10 +0100
Subject: Info: Newsletter du LPL, No 7 (2013)
Message-ID:

Date: Fri, 6 Dec 2013 15:04:25 +0100
From: "Claudia Pichon-Starke"
Message-ID: <026e01cef28c$1262c850$372858f0$@starke at lpl-aix.fr>
X-url: http://www.lpl.univ-aix.fr/fileadmin/template/NewsLetter/NL7.htm

Hello,

The LPL communication unit is pleased to present the new issue of its Newsletter:
http://www.lpl.univ-aix.fr/fileadmin/template/NewsLetter/NL7.htm

We hope you enjoy reading it.

Best regards,
Claudia Pichon-Starke
for the Cellule Com
communication at lpl-aix.fr

From hamon at LIMSI.FR Sat Dec 7 20:33:03 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:33:03 +0100
Subject: Seminaire: Alpage, Djame Seddah, Overview of the SPMRL 2013 Shared Task, 13 decembre 2013
Message-ID:

Date: Fri, 6 Dec 2013 20:00:24 +0100
From: Marie Candito
Message-ID:

************** Alpage team seminar *********

This is the computational linguistics research seminar organized by the Alpage team, a joint INRIA - Paris Diderot team specializing in natural language processing.

Friday 13 December, 11:00 to 12:30, room 106, Bâtiment Olympe de Gouges, rue Albert Einstein, 75013 Paris (the last building on rue Albert Einstein; no street number assigned).

Anyone interested is welcome.

**********************************************************

Djamé Seddah (Alpage)

Title: "Overview of the SPMRL 2013 Shared Task: Cross-Framework Evaluation of Parsing Morphologically Rich Languages"

Abstract: In this presentation, we will present the outcomes of the first shared task on statistical parsing of morphologically rich languages (MRLs). The task features data sets from nine languages (Arabic, Basque, French, German, Hebrew, Hungarian, Korean, Polish and Swedish), each available in both constituency and dependency annotation. We report on the preparation of the data sets, on the proposed parsing scenarios, and on the evaluation metrics for parsing MRLs given different representation types. We present and analyze parsing results obtained by the task participants, and then provide an analysis and comparison of the parsers across languages and frameworks, reported for gold input as well as more realistic parsing scenarios. The shared task saw submissions from seven teams, and results produced by more than 14 different systems.
The parsing results were obtained in different input scenarios (gold, predicted, and raw) and evaluated using different protocols (cross-framework, cross-scenario, and cross-language). In particular, this is the first time a multilingual evaluation campaign has reported on the execution of parsers in realistic, morphologically ambiguous settings.

Djamé Seddah, joint work with Reut Tsarfaty, Sandra Kübler and many contributors [1]

[1] Marie Candito, Jinho D. Choi, Richard Farkas, Jennifer Foster, Iakes Goenaga, Koldo Gojenola, Yoav Goldberg, Spence Green, Nizar Habash, Marco Kuhlmann, Wolfgang Maier, Joakim Nivre, Adam Przepiórkowski, Ryan Roth, Wolfgang Seeker, Yannick Versley, Veronika Vincze, Marcin Woliński, Alina Wróblewska, Eric Villemonte de la Clergerie

From hamon at LIMSI.FR Sat Dec 7 20:31:15 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:31:15 +0100
Subject: These: Florent Pompigne, Modelisation logique de la langue et Grammaires Categorielles Abstraites
Message-ID:

Date: Fri, 6 Dec 2013 15:24:14 +0100 (CET)
From: Florent Pompigne
Message-ID: <692013726.115461.1386339854164.JavaMail.root at loria.fr>

Hello,

I am pleased to invite you to the defense of my thesis, entitled "Modélisation logique de la langue et Grammaires Catégorielles Abstraites" (Logical Modeling of Language and Abstract Categorial Grammars).

The defense will take place on Monday 11 December at 10:30 at LORIA (Campus scientifique, Vandoeuvre-lès-Nancy), in room C005. You are also invited to the reception, which will take place at 16:00.

Best regards,
Florent Pompigne

---
Jury:

Reviewers:
Christian Retoré, Professor, Université de Bordeaux 1
Annie Foret, Maître de conférences, Université de Rennes 1

Examiners:
Michael Moortgat, Professor, Utrecht University
Laurent Vigneron, Professor, Université de Lorraine

Supervisors:
Philippe de Groote, Research Director, INRIA Nancy
Sylvain Pogodalla, Research Scientist, INRIA Nancy

---
Abstract

This thesis addresses the modeling of the syntax and the syntax-semantics interface of the sentence, and explores the possibility of controlling, at the level of derivation structures, the overgeneration produced by treating long-distance dependencies with higher-order types. To this end, we study the possibility of extending the type system of Abstract Categorial Grammars with the constructions of disjoint sum, Cartesian product and dependent product, which make it possible to label syntactic categories with feature structures.

We first prove that the calculus resulting from this extension enjoys the properties of confluence and normalization, which makes it possible to identify beta-equivalent terms in the grammatical formalism. We furthermore reduce the same problem for beta-eta-equivalence to a set of starting hypotheses.

We then show how this introduction of feature structures can be applied to the control of long-distance dependencies, through the examples of case constraints, extraction islands for overt and covert movement, and multiple interrogative extractions, and we discuss the relevance of placing these controls on derivation structures.

From hamon at LIMSI.FR Sat Dec 7 20:23:29 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:23:29 +0100
Subject: Appel: The 37th Annual ACM SIGIR 2014 conference, Gold Coast Australia, July 6-11 2014
Message-ID:

Date: Fri, 6 Dec 2013 16:53:15 +1000
From: Richi Nayak
Message-ID:
X-url: http://sigir.org/sigir2014/

CALL FOR PAPERS, Submission Due: JAN 27, 2014

ACM SIGIR 2014: THE 37th ANNUAL CONFERENCE
6-11 July, 2014, Gold Coast, Australia
Conference website: http://sigir.org/sigir2014/

SIGIR is the major international forum for the presentation of new research results and for the demonstration of new systems and techniques in the broad field of information retrieval (IR). The Conference and Program Chairs invite all those working in areas related to IR to submit original papers for review. SIGIR 2014 welcomes contributions related to any aspect of IR theory and foundation, techniques, and applications.
Relevant topics include, but are not limited to:

TOPICS

- Document Representation and Content Analysis (e.g., text representation, document structure, linguistic analysis, NLP for IR, cross- and multi-lingual IR, information extraction, sentiment analysis, clustering, classification, topic models, facets, text streams)
- Queries and Query Analysis (e.g., query intent, query suggestion and prediction, query representation and reformulation, query log analysis, conversational search and dialogue, spoken queries, summarization, question answering)
- Retrieval Models and Ranking (e.g., IR theory, language models, probabilistic retrieval models, learning to rank, combining searches, diversity and aggregated search)
- Search Engine Architectures and Scalability (e.g., indexing, compression, distributed IR, P2P IR, mobile IR, cloud IR)
- Users and Interactive IR (e.g., user studies, user and task models, interaction analysis, session analysis, exploratory search, personalized search, social and collaborative search, search interface, whole session support)
- Filtering and Recommending (e.g., content-based filtering, collaborative filtering, recommender systems)
- Evaluation (e.g., test collections, experimental design, effectiveness measures, session-based evaluation, simulation)
- Web IR and Social Media Search (e.g., link analysis, click models/behavioral modeling, social tagging, social network analysis, blog and microblog search, forum search, community-based QA, adversarial IR and spam, vertical and local search)
- IR and Structured Data (e.g., XML search, ranking in databases, desktop search, entity search)
- Multimedia IR (e.g., image search, video search, speech/audio search, music search)
- Other Applications (e.g., digital libraries, enterprise search, genomics IR, legal IR, patent search, text reuse, new retrieval problems)

CONTRIBUTION TYPES

- Full papers (10 pages), Short papers (4 pages), Demos (3 pages), Tutorials, Workshops

INSTRUCTION

Requirements for
paper format and appropriate content are described in the content guidelines. The requirements will be strictly enforced. Papers which do not conform to the requirements may be rejected without review, so please be sure to read this page carefully.

SIGIR 2014 solicits proposals for tutorials of either half-day (3 hours plus breaks) or full day (6 hours plus breaks) on all topics of information retrieval and its applications. Each tutorial should cover a single topic in detail. Submissions should include a cover sheet and an extended abstract.

Proposals for workshops to be held at ACM SIGIR 2014 are also solicited. Workshops will usually last for one day and will be held on Friday 11th July 2014.

IMPORTANT DATES

- 20 January 2014: Abstracts for full research papers due
- 27 January 2014: Full research papers due
- 3 February 2014: Workshop proposals due
- 17 February 2014: Short paper, demonstration, and tutorial submission deadline
- 18 April 2014: Paper, short paper, tutorial, and demonstration acceptance notifications
- 11 May 2014: Camera ready copy due (note the short timeline due to early conference date)
- 16 May 2014: Early bird registration deadline

ORGANIZERS

- General Chairs: Shlomo Geva, Andrew Trotman
- PC Chairs: Peter Bruza, Charles L. A. Clarke, Kalervo Järvelin

Richi Nayak - Publicity Chair

From hamon at LIMSI.FR Sat Dec 7 20:36:19 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 7 Dec 2013 21:36:19 +0100
Subject: Journee: Optimisation et traitement automatique du langage, Universite Paris 13, 16 decembre 2013
Message-ID:
X-Url: http://www.atala.org/article.php3?id_article=48
From: Thierry Hamon
X-Archive-Url: http://listserv.linguistlist.org/archives/ln.html
X-Archive-Url: http://liste.cines.fr/info/ln
Date: Sat, 07 Dec 2013 21:29:27 +0100

Date: Fri, 06 Dec 2013 13:46:13 +0100
From: Joseph Le Roux
Message-ID:
X-url: http://doodle.com/87mnbcdbzdmytffc
X-url: http://mathstic.univ-paris13.fr/index.php/axes/oaatcn

Hello,

Optimization methods are increasingly used in natural language processing to design efficient methods for both model learning and decoding, notably by providing a formal framework that clearly distinguishes global constraints from local constraints while allowing them to be combined. The RCLN team of the Laboratoire d'Informatique de Paris Nord takes a close interest in these methods for the joint modeling of the different levels of linguistic description.

As part of the math/stic cluster of Paris 13, with the support of the LABEX EFL, and in collaboration with the AOC team of LIPN, the RCLN team is organizing a day of seminars on the themes of optimization and natural language processing. The event will take place on 16 December on the campus of Université Paris 13 in Villetaneuse, in the Euler lecture hall (Amphi Euler):

09:30-10:00 :: Welcome (Amphi Euler)
10:00-11:00 :: Talk by André Martins (Amphi Euler)
11:15-12:15 :: Talk by Sebastian Riedel (Amphi Euler)
Lunch break (LIPN)
13:45-14:45 :: Talk by Xavier Carreras (Amphi Euler)
15:00-16:00 :: Talk by Joseph Le Roux (Amphi Euler)
16:00 :: Refreshments at LIPN!

Attendance is free. If you plan to attend, we ask that, for practical reasons (notably organizing the buffet), you let us know at this address: http://doodle.com/87mnbcdbzdmytffc

Website: http://mathstic.univ-paris13.fr/index.php/axes/oaatcn

Abstracts of the four scheduled talks:

--------------------------------------------------------------------------------
André Martins, Priberam, Lisbon

Title: AD3: A New Decoder for Structured Prediction

Abstract: In this talk, I will present AD3 ("Alternating Directions Dual Decomposition"), a new consensus-based decoder for problems representable as factor graphs. AD3 is an approximate decoder that ignores global effects caused by the cycles of the graph, solving a linear relaxation of the original problem. It can handle many scenarios often encountered in NLP and IR applications, such as models with constraints in first-order logic; models involving budget or knapsack constraints; and combinations of structured models which are individually tractable, but hard to decode jointly.

Like other dual decomposition algorithms, AD3 has a modular architecture, where local subproblems are solved independently, and their solutions are gathered to compute a global update. The key characteristic of AD3 is that each local subproblem has a quadratic regularizer, leading to faster convergence (both theoretically and in practice). After providing closed-form solutions for several of these subproblems, I will proceed to discuss a recent active set method that works for arbitrary factors, requiring only a local maximization oracle (the same oracle required in subgradient-based dual decomposition).

In the second part of the talk, I will discuss two recent applications of AD3 in NLP problems: dependency parsing and compressive summarization.
I will present "Turbo Parser," an open source dependency parser, which was recently improved with AD3 and the active set method to permit fast decoding of non-projective third-order models. Experiments in 14 languages yield state-of-the-art results, with parsing speeds ranging between 700 and 4,000 tokens per second. For compressive summarization, the use of AD3 leads to a system which is modular in the three qualities that define a good summary (conciseness, informativeness, and grammaticality), with state-of-the-art ROUGE scores, and runtimes close to those of extractive summarizers.

This work was done in collaboration with Noah Smith, Mário Figueiredo, Eric Xing, Pedro Aguiar, and Miguel Almeida.

------------------------------------------------------------------------
Sebastian Riedel, UCL, London

Title: Predict, Price and Cut: Column and Row Generation for Structured Prediction

Abstract: Many problems in NLP, and structured prediction in general, can be cast as finding high-scoring structures based on a large set of candidate parts. For example, in second order tagging, we have to select high-scoring transitions between tags in a globally consistent fashion. In second order graph-based dependency parsing we have to choose a quadratic number of first order and a cubic number of second order edges such that the graph is both high-scoring and a tree. What makes such problems challenging is the large number of possible parts to consider. This number not only affects the cost of search or optimization but also slows down the process of extracting features and scoring parts before they enter the optimization problem.

In this talk I present an approach that can solve problems with large sets of candidate parts without considering all of these parts in either optimization or scoring. In contrast to most pruning heuristics, our algorithm can give certificates of optimality before having optimized over, or even scored, all parts.
It does so without the need for auxiliary models or tuning of threshold parameters. This is achieved by a delayed column and row generation algorithm that iteratively solves an LP relaxation over a small subset of current candidate parts, and then finds new candidates with high scores that can be inserted into the current optimal solution without removing high-scoring existing structure. The latter step subtracts from the cost of a part the price of the resources the part requires, and is often referred to as pricing. Sometimes parts may score highly after pricing, but are necessary in order to make the current solution feasible. We add such parts in a step that roughly amounts to adding violated cuts to the LP.

We evaluate our approach on two applications: second order dependency parsing and first order tagging with large domains. In both cases we dramatically reduce the number of parts considered, and observe about an order of magnitude speed-up. This is possible without loss of optimality guarantees, and hence accuracy.

------------------------------------------------------------------------
Xavier Carreras, UPC, Barcelona

Title: Learning Automata and Grammars: From Spectral Algorithms to Convex Optimizations

Abstract: There is an increasing interest in spectral methods to learn latent-variable language models in the form of weighted automata and context-free grammars. Spectral methods provide an algebraic formulation of the problem of inducing automata or grammars from data, and directly exploit the recurrence relations behind the model. I will review the spectral method from an algebraic perspective, making use of Hankel matrices as the key object behind the method: a Hankel matrix collects all necessary statistics of the distribution we want to learn, and finding a low-rank factorization of this matrix yields the automaton or grammar. Under mild assumptions, it can be shown that this method nicely approximates the target model.
From here, I will show how we can reformulate the spectral learning algorithm as a low-rank convex optimization. This will be useful to adapt the method to other settings, by adding linear constraints. I will focus on "unsupervised" induction of context-free grammars, that is, learning a grammar from plain strings. Our formulation involves optimizing for a low-rank Hankel matrix that is linearly constrained to satisfy inside-outside recursions. An analogous method can be formulated to learn finite-state transducers from unaligned parallel strings.

------------------------------------------------------------------------
Joseph Le Roux, LIPN, Paris

Title: Combining PCFG-LA Models with Dual Decomposition: A Case Study with Function Labels and Binarization

Abstract: It has recently been shown that different NLP models can be effectively combined using dual decomposition. In this talk, we present how PCFG-LAs (Probabilistic Context-Free Grammars with Latent Annotations, the state-of-the-art model for unlexicalized constituent parsing) are suitable for combination in this way. We first show how the intractable problem of exact PCFG-LA decoding is approximated with anchored PCFGs. Then we present a method for combining anchored PCFGs based on the partial superposition of tree structures. We experiment with the different models which result from alternative methods of extracting a grammar from a treebank (retaining or discarding function labels, left binarization versus right binarization) and achieve state-of-the-art parsing performance, with a labeled Parseval F-score of 92.4 on Wall Street Journal Section 23; this represents an error reduction rate of 7% over a strong PCFG-LA product-model baseline.

This work was done in collaboration with Antoine Rozenknop and Jennifer Foster.

From hamon at LIMSI.FR Tue Dec 10 20:43:43 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:43:43 +0100
Subject: Appel: Third Workshop on Computational Linguistics for Literature
Message-ID:

Date: Sun, 08 Dec 2013 17:15:37 -0500
From: Stan Szpakowicz
Message-ID: <52A4EF89.2060908 at eecs.uottawa.ca>

Third Workshop on Computational Linguistics for Literature
April 27, 2014, Göteborg, Sweden, co-located with EACL 2014
https://sites.google.com/site/clfl2014a/

Second Call for Papers
(with apologies for multiple postings)

The purpose of the series of ACL workshops on Computational Linguistics for Literature is to bring together researchers fascinated with literature as a unique type of data which poses distinct challenges. We invite papers on original unpublished work in this broad area. In particular, we hope to see papers which explore how state-of-the-art NLP methods can help solve existing research problems in the humanities, or perhaps suggest new problems.

Literary texts revolve around the human condition, emotions, social life and inner life. Naturally, such data abound in common-sense knowledge but are very thin on technical jargon. Can tools and methods developed in the ACL community help process literary data? When do they work, when do they fail and why?
What new instruments do we need in order to work with prose and poetry, on a large or small scale? Are there computational solutions to noteworthy problems in the Humanities, Information Science, Library Sciences and other similar disciplines?

Here are some of the topics of interest to the workshop:

- the needs of the readers and how these needs translate into meaningful NLP tasks;
- searching for literature;
- recommendation systems for literature;
- computational modelling of narratives, computational narratology, computational folkloristics;
- summarization of literature;
- differences between literature and other types of writing as relevant to computational linguistics;
- discourse structure in literature;
- emotion analysis for literature;
- profiling and authorship attribution;
- identification and analysis of literary genres;
- building and analyzing social networks of characters;
- generation of literary narrative, dialogue or poetry;
- modelling literary dialogue for generation.

We will consider regular papers which describe experimental methods or theoretical work, and we will gladly welcome position papers. The NLP community does not study literature often enough, so it is important to discuss and formulate the problems before proposing solutions.

The submission deadline is January 23, 2014.

Anna Feldman, Anna Kazantseva, Stan Szpakowicz

From hamon at LIMSI.FR Tue Dec 10 20:52:54 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:52:54 +0100
Subject: Revue: CORELA, Statut et utilisation des corpus
Message-ID:

Date: Tue, 10 Dec 2013 16:54:25 +0100
From: Gilles Col
Message-Id:
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2951
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2952
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2964
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2981
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2994
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3002
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3012
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3023

Dear colleagues,

The editorial team of the journal CORELA - Cognition / Représentation / Langages is pleased to announce the publication of its latest thematic issue, "Statut et utilisation des corpus en linguistique", edited by Laurence Durroux-Vincent.

Contents of the issue:

Laurence Durroux-Vincent. "Statut et utilisation des corpus en linguistique".
http://corela.edel.univ-poitiers.fr/index.php?id=2951

Tobias Scheer. "The Corpus: A Tool among Others".
http://corela.edel.univ-poitiers.fr/index.php?id=2952

Pius ten Hacken, Renáta Panocová. "The Use of Corpora in Word Formation Research".
http://corela.edel.univ-poitiers.fr/index.php?id=2964

Najib Arbach, Saandia Ali. "Aspects théoriques et méthodologiques de la représentativité des corpus".
http://corela.edel.univ-poitiers.fr/index.php?id=2981

Charles Brasart. "Corpus et alternance codique : que peut nous apprendre une approche comparative ?".
http://corela.edel.univ-poitiers.fr/index.php?id=2994

Laurent David. "L'acquisition du Present Perfect chez un enfant anglophone : méthodologie, codage et premiers résultats".
http://corela.edel.univ-poitiers.fr/index.php?id=3002

Jérémi Sauvage, Christelle Dodane, Fabrice Hirsch, Melissa Barkat-Defradas. "Réflexions méthodologiques sur l'analyse phonétique d'un corpus longitudinal et transversal en acquisition du langage".
http://corela.edel.univ-poitiers.fr/index.php?id=3012

Paola Artero, Adriana Serban. "The Status of Qualitative and Quantitative Methods of Enquiry in Translation Research: C. S. Lewis's Narnia in French - A Corpus-based Approach".
http://corela.edel.univ-poitiers.fr/index.php?id=3023

From hamon at LIMSI.FR Tue Dec 10 20:45:22 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:45:22 +0100
Subject: Appel: IC 2014
Message-ID:

Date: Mon, 09 Dec 2013 09:51:41 +0100
From: Catherine Faron Zucker
Message-ID: <52A5849D.90907 at i3s.unice.fr>
X-url: http://www.irit.fr/IC2014

Call for papers for the 25th Journées Francophones d'Ingénierie des Connaissances (French-speaking Knowledge Engineering Days):

Pervasive knowledge: users immersed in knowledge-based systems, and knowledge-based systems immersed in the world

http://www.irit.fr/IC2014
12 to 16 May 2014, Clermont-Ferrand

------------------------------------------------------------------------
Important dates:
Paper submission deadline: Thursday 19 December 2013
Notification to authors: Thursday 13 February 2014
Deadline for final versions: Monday 17 March 2014
Conference dates: 12 to 16 May 2014
------------------------------------------------------------------------

The Ingénierie des Connaissances (Knowledge Engineering) conference celebrates its 25th anniversary this year! It is the annual meeting of the French and French-speaking community interested in issues related to knowledge engineering. Academics and industry practitioners gather there to exchange ideas and reflect on research problems arising in knowledge acquisition, representation and management.

The rise of information and communication sciences and technologies, and in particular web technologies, throughout society is transforming individual and collective practices. Knowledge engineering accompanies this evolution by inventing the models, methods and tools that allow knowledge, and reasoning over that knowledge, to be integrated into computing environments. Indeed, knowledge is now omnipresent, and at the same time the research communities in Artificial Intelligence are opening up to one another to address the problems that arise. Thus, building knowledge-based systems draws on knowledge engineering, natural language processing, machine learning and data mining; knowledge processing combines knowledge engineering with information retrieval, operations research and data analysis; modeling computing environments and their human or software actors brings together knowledge engineering and multi-agent systems; finally, knowledge engineering touches on classic themes in the communities of human-computer interaction and computer environments for human learning.

By thus placing the user at the heart of computer systems, knowledge engineering is called upon to assist the user in processing the mass of data available on the web: to visualize, contextualize, explain, recommend, rank, predict, approximate, secure and make the data reliable, and to compensate for its imprecision or even inconsistency.

The themes of IC 2014 are the following (non-exhaustive list):

Knowledge representation and reasoning
- models, formalisms, formal and informal knowledge representation languages
- exchange, interoperability and reuse of models, standardization
- terminologies, thesauri, ontologies and lexicons
- types of ontologies and the role of reference ontologies
- methods and tools for the life cycle of representations
- managing the dynamics of ontologies (versioning, evolution, integration and alignment)
- logical reasoning, inference, rule-based reasoning
- approximation, statistical reasoning, analogical reasoning, case-based reasoning, reasoning in non-classical logics

Design and construction of knowledge models and ontologies
- populating knowledge models
- knowledge acquisition from texts
- interactive knowledge discovery in data, in histories, in interactions, on the web, etc.
- inference and business rules: acquisition, design, reuse, evolution, evaluation, use
- metrics and evaluation of ontology quality

Web of data, semantic web
- opening up and interlinking data
- impact of opening up data on the web and of crowdsourcing on knowledge engineering
- social networks
- data stream management
- scaling up: processing massive data
- processing heterogeneous data
- provenance of and trust in data
- metrics and evaluation of data quality

User-centred knowledge engineering
- visualisation of knowledge and ontologies
- visualisation of data interconnections and of reasoning over data
- explanation of reasoning
- user profiles, context models and adaptation, emotion models
- personalised access to knowledge: trust, recommendation, ranking, summarisation
- assistance for applications and services: conversational agents
- computer-based environments for human learning
- design and evaluation of interactions and interfaces with knowledge-based systems

Interdisciplinarity of knowledge engineering
- epistemology of knowledge engineering
- organisation theory and knowledge engineering
- human sciences, cognitive sciences and knowledge engineering
- engineering of scientific knowledge

Pervasive knowledge engineering: knowledge engineering in...
- natural language processing
- symbolic learning
- data mining
- information retrieval, indexing
- operations research
- data analysis
- multi-agent systems
- computer-based environments for human learning
- human-computer interfaces

Applications
- applications in health, education and learning, culture, document retrieval, intelligent transport, business intelligence (BI), etc.
- emergence of new application domains
- corporate memories
- systems for collaboration and cooperation in organisations and groups
------------------------------------------------------------------------
Submissions:

Long papers (12 pages (new format), 20-minute oral presentation, 10 minutes of discussion): completed work presenting validated results or in-depth surveys.

Short papers (6 pages (new format), 10-minute oral presentation, 5 minutes of discussion): work in progress, or work at the stage of partial research or applied results. This type of submission is for innovative, high-quality work that is not yet mature enough for complete elaboration and definitive conclusions.

Papers must follow the document styles given on the conference website: http://www.irit.fr/IC2014. Authors are invited to submit their papers in PDF format on EasyChair: https://www.easychair.org/conferences/?conf=ic2014.

Accepted papers will be published in electronic proceedings on HAL and made available on a USB key distributed to conference participants. An AFIA best paper prize will be awarded by a jury made up of members of the conference steering committee.
------------------------------------------------------------------------
The Ingénierie des Connaissances conference is held under the aegis of the IC college of AFIA
------------------------------------------------------------------------
Programme committee chair: Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)
Organising committee chair: Catherine Roussey (Irstea)

From hamon at LIMSI.FR Tue Dec 10 20:49:11 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:49:11 +0100
Subject: These: Pierre Magistry, Unsupervised Word Segmentation and Wordhood Assessment
Message-ID:

Date: Tue, 10 Dec 2013 13:46:04 +0100
From: Pierre Magistry
Message-ID:

Dear friends, professors, students and colleagues,

I am pleased to invite you to the defence of my thesis, entitled "Unsupervised Word Segmentation and Wordhood Assessment: the case for Mandarin Chinese" (Segmentation en mots non-supervisée et estimation de la lexicalité : le cas du mandarin).

I carried it out in the Alpage team at Université Paris-Diderot/INRIA and will defend it on 19 December 2013 from 9:30 am, in the Paris 7 thesis room.

Salle des thèses 580F (Halle aux Farines)
10 rue Françoise Dolto
75013 Paris
(map: http://www.openstreetmap.org/?mlat=48.8295&mlon=2.3825#map=16/48.8295/2.3825 )
Use the entrance closest to the Seine.
The defence will be followed by a reception, to which you are also invited.

Best regards,
Pierre Magistry

----
The thesis committee will be composed of:
PR. Hsieh Shukai (reviewer) - National Taiwan University
PR. Sylvain Kahane (supervisor) - Université Paris 10 Nanterre, MoDyCo
PR. Yves Lepage (reviewer) - Waseda University
PR. Marie-Claude Paris (co-supervisor) - Université Paris Diderot, LLF
DR. Benoit Sagot (co-supervisor) - INRIA, ALPAGE
DR. Pierre Zweigenbaum (chair) - CNRS, LIMSI

----
Thesis abstract:

This dissertation addresses unsupervised word segmentation and the assessment of the "wordhood" of forms. Modern Standard Chinese (Mandarin) is chosen for the experiments and evaluations.

The first part deals with linguistic aspects. It presents the criteria commonly used to define the "word" in Chinese linguistics and shows their limits through a general-linguistics discussion, addressing in particular the question of "multi-word expressions" and its link to word segmentation problems. We then present the history of the development of Chinese Word Segmentation (CWS) as a typical NLP task, and argue that a degree of arbitrariness in the annotation of evaluation corpora artificially favours supervised learning systems, whereas we consider unsupervised systems to be of greater interest for linguistics. The first part leads us to base our definition of wordhood on two criteria: strong combinatorial autonomy and a high degree of membership in a distributional class.

The second part presents an unsupervised method for assessing the combinatorial autonomy of forms, inspired by Harris's hypotheses. With a simple and fast segmentation algorithm based solely on this measure, we obtain results close to the state of the art.
We then discuss the importance of pre-processing and present experiments using the Minimum Description Length (MDL) principle to improve our results. Finally, we propose a method and tools for a more qualitative evaluation of the analyses produced by our system. We also present some preliminary results on other languages, which show the genericity of our method.

From hamon at LIMSI.FR Tue Dec 10 20:50:08 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:50:08 +0100
Subject: Appel: Atelier SemDis 2014 (semantique distributionnelle), site web
Message-ID:

Date: Tue, 10 Dec 2013 14:35:48 +0100
From: Cécile Fabre
Message-ID: <52A718B4.6030600 at univ-tlse2.fr>
X-url: http://www.irit.fr/semdis2014/

The website describing the tasks organised for the SemDis 2014 workshop, and providing the associated data, is now available at: http://www.irit.fr/semdis2014/

----
Workshop objectives:

The CLLE-ERSS and IRIT-Melodi teams (Toulouse), organisers of the SemDis workshop at TALN 2013, devoted to approaches based on distributional semantic analysis, intend to organise a 2nd edition of the workshop as part of TALN 2014 (subject to acceptance by the TALN programme committee). This year, the call for papers will be complemented by 2 tasks:
- a competitive lexical substitution task, a French adaptation of the SemEval 2007 "Lexical substitution" task, as presented in McCarthy & Navigli (2010).
- an exploratory task for examining in more detail the results of distributional methods on a specialised corpus, the TALN corpus (2007-2013).

---
Schedule:

Subject to acceptance by the TALN programme committee. The dates will be confirmed once the TALN workshop schedule is known.

Competitive task (lexical substitution):
- December 2013: release of trial data via the website
- 10 March 2014: release of test data
- 17 March 2014: submission deadline for participants
- early April 2014: announcement of results
- end of April 2014: deadline for submitting a paper to the workshop
- 26 May 2014: final versions due
- 1 July 2014: SemDis workshop

Exploratory task and call for papers outside the tasks:
- December 2013: description of task 2 and release of the corpus via the website
- April 2014: papers due
- 26 May 2014: final versions due
- 1 July 2014: SemDis workshop

----
Contact:

The organisers can be contacted at: semdis2014 at gmail.com

From hamon at LIMSI.FR Tue Dec 10 20:46:47 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:46:47 +0100
Subject: Stage: Transcription de l'oral, Projet europeen (3 a 6 mois)
Message-ID:

Date: Mon, 09 Dec 2013 11:20:26 +0100
From: "Catherine Bolly"
Message-ID:

[Internship] Transcription, spoken language - Marie Curie European project (3 to 6 months, 2014)

TRANSCRIPTION OF SPOKEN LANGUAGE (software: Praat)
Intern: full-time contract (duration: 3 months) or part-time (duration: 4 to 6 months)
Profile: Master's level
Start of contract: February/March 2014

CONTEXT
The European project CorpAGEst studies the language of older people, based on the analysis of interviews recorded on audiovisual media (multimodal approach: text, sound, gesture). The analyses focus in particular on discourse markers and gestures whose communicative function is (inter)subjective (e.g. the discourse marker 'tu vois' or a glance towards the interlocutor). The research project CorpAGEst, "Approche sur corpus de la compétence pragmatique des personnes âgées" (a corpus-based approach to the pragmatic competence of older people), is a European project funded by the Marie Curie Actions under the 7th European Community Framework Programme (project No. 328282, FP7-PEOPLE-2012-IEF). Project duration: 2 years (2013-2015).

LOCATION
The internship will be coordinated within the research laboratory by Catherine Bolly (UMR 7023 "Structures Formelles du Langage", CNRS Paris 8 & research centre "Valibel - Discours et variation", UCL Louvain-la-Neuve).

TASKS
The multimodal corpus analysis method adopted involves building and annotating an audiovisual corpus: 8 subjects, approx. 13 hours. The person recruited will be responsible for transcribing, using the Praat software, the audio data from the interviews conducted in the field.
Transcription standards will follow standard French spelling and the transcription conventions of the Valibel centre (Université catholique de Louvain). Lively discussions and exchanges within the team will help optimise the transcription procedure on the basis of everyone's experience.

PREREQUISITES
Familiarity with basic computing tools (word processors, keyboard, etc.). Knowledge of the Praat software is an asset. Experience in analysis involving spoken language is appreciated. Knowledge of the transcription and annotation of spoken corpora would be a plus.

PROFILE
Master's student in the humanities (languages and literature, language sciences, communication sciences, natural language processing, ethnology, etc.). Interest in language sciences, spoken French and/or language ageing. Excellent command of French.

DURATION AND CONDITIONS
3-month full-time internship starting in February/March 2014 with a researcher in language sciences. The internship may be done part-time (duration: 4 to 6 months). Flexible and adaptable hours. The work may be integrated into a Master's thesis (to be negotiated).

REMUNERATION
436.05 € net per month.

APPLICATION
Cover letter and CV, to be sent by 20 December 2013 at the latest by e-mail to:
catherine.bolly at uclouvain.be
dominique_jean.boutet at orange.fr

From hamon at LIMSI.FR Tue Dec 10 20:51:53 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:51:53 +0100
Subject: Ecole: International Terminology Summer School (TSS 2014), Vienna, 14 to 18 July 2014
Message-ID:

Date: Tue, 10 Dec 2013 15:01:08 +0100
From: excellence
Message-ID:
X-url: http://www.termnet.org/english/events/tss_2014/index.php

Dear colleagues,

You can now register for the International Terminology Summer School (TSS 2014) that will take place next year in Vienna, from 14 to 18 July 2014.

TSS is a practice-oriented 5-day training course on terminology management. It addresses language professionals (translators, terminologists, localization experts, etc.), students and scientists who are looking for a comprehensive, state-of-the-art introduction to terminology theory and practice.

Reserve your place at TSS 2014 today! You may register until early June 2014. However, places are limited, so we advise you not to wait too long.
http://www.termnet.org/english/events/tss_2014/index.php

PROGRAM:

UNDERSTANDING TERMINOLOGY MANAGEMENT
- Introduction to Terminology Work
- Applied Principles of Terminology Work
- Retrieving and Validating Relevant Online Information for Terminology Work

TERMINOLOGY MANAGEMENT SKILLS
- Data Modelling: Data Categories for Terminology Management
- Creating a Terminology Database
- Terminology Tools - A closer look at Terminology Management Systems
- Terminology Workflows and Interfaces

TERMINOLOGY STRATEGIES FOR BUSINESS PROCESSES
- Three "P"s in Terminology Management: Project, Product, Process
- Case Studies: Terminology and Technical Documentation
- Case Studies: Legal Translation and Terminology

STANDARDS AND LEGAL ISSUES FOR TERMINOLOGY WORK
- Terminology Work for Literary Translation
- Terminology and other Knowledge Organization Systems
- Standards for Terminology Work
- Copyright Issues for Terminology Management

INSTRUCTORS:
Internationally renowned experts in terminology and terminology management will cover the TSS topics and will also take time throughout the week to help participants with their individual questions and problems.

ECQA CERTIFICATION FOR TERMINOLOGY MANAGERS:
The TSS2014 program is developed in cooperation with the ECQA initiative. This makes it possible for TSS2014 participants to obtain the ECQA Certificate for Terminology Managers by taking the official ECQA exam for the job role Certified Terminology Manager - Basic or Certified Terminology Manager - Advanced.

SOCIAL NETWORKING AND SIDE EVENTS:
TSS gives participants the opportunity to network and hold discussions with peers and colleagues in a relaxed and resourceful environment during the side events planned for this purpose. Enjoy the Terminology Breakfasts every morning before classes, and participate in the "get to know each other" event on Monday after the first day of classes. Meet your colleagues at the different "after school" events during the week.
You can find all information about TSS2014 at:
http://www.termnet.org/english/events/tss_2014/index.php
(this website will be constantly updated in the next months with more and more information)
or contact us (events at termnet.org). We will be happy to answer your questions.

Please disseminate this information also among colleagues and friends who might be interested!
Apologies for cross-posting.

Kind regards,
------------------------------------------------------------
Your TermNet Events Team
International Network for Terminology - TermNet
Mooslackengasse 17, 1190 Vienna, Austria
c/o bena office center doebling GmbH
T: +43 1 23060 3965
F: +43 1 23060 3966
events at termnet.org - www.termnet.org
------------------------------------------------------------
TermNet - There is no Knowledge without Terminology - since 1988
------------------------------------------------------------

From hamon at LIMSI.FR Tue Dec 10 20:47:47 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 10 Dec 2013 21:47:47 +0100
Subject: Appel: TOTh 2014
Message-ID:

Date: Tue, 10 Dec 2013 10:18:05 +0100
From: Luc Damas
Message-ID:
X-url: http://www.porphyre.org/toth/

TOTh 2014 : APPEL A COMMUNICATIONS

Terminologie & Ontologie : Théories et applications
Chambéry - France
Conférence : 12 & 13 juin 2014
Formation : 10 & 11 juin 2014

http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2014/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf

Dates importantes :
-------------------------
10 février 2014 : Date limite de soumission des résumés (2 à 3 pages)
3 mars 2014 : Notification d'acceptation

Soumission par EasyChair :
-----------------------------------
Les soumissions des résumés étendus (2 à 3 pages) se font exclusivement par EasyChair :
http://www.easychair.org/conferences/?conf=toth2014

Présentation :
------------------
La 8ème édition des conférences internationales TOTh se déroulera les jeudi 12 et vendredi 13 juin 2014 à
l'Université de Savoie (Chambéry, France) avec le soutien du Ministère de la Culture (DGLFLF) et de l'Association Européenne de Terminologie.

Les conférences TOTh - Terminologie & Ontologie : Théories et applications - ont pour objectif de rassembler chercheurs, professionnels et, de façon plus générale, toute personne dont les préoccupations relèvent de la langue et de l'ingénierie des connaissances.

Le programme comportera, en plus des communications retenues, une conférence invitée, une disputatio et une session poster. Une formation d'un jour et demi sur l'analyse de textes à des fins onomasiologiques précédera la conférence les mardi 10 et mercredi 11 juin.

Un prix « Jeune chercheur » est décerné par le comité de programme lors de la conférence.

Thèmes de la Conférence (liste non exhaustive) :
--------------------------------------------------------------
- Langues de spécialité et Lexicographie de spécialité,
- Traduction,
- Traitement de corpus,
- Lexiques, Dictionnaires, Terminologies,
- Aménagement linguistique et Aménagement terminologique,
- Représentation, Extraction et Gestion des connaissances,
- Systèmes conceptuels,
- Ontologies,
- Thesaurus,
- Applications : Recherche d'information, Web des données, Bibliothèques numériques, Environnements de développement, Banques terminologiques,
- Normes, Formats d'échange.

TOTh 2014
http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2014/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf
-------------------------------------------------

TOTh 2014: CALL FOR PAPERS

Terminology & Ontology: Theories and applications
Chambéry -
France
Conference: 12 and 13 June 2014
Training: 10 and 11 June 2014

http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2014-en/call-for-papers
http://www.porphyre.org/toth/files/TOTh-2014-Call-for-papers.pdf

DEADLINES:
-------------------------------
10 February 2014: Deadline for abstract submissions
3 March 2014: Notification of acceptance

Paper submission via Easychair:
-----------------------------------------
http://www.easychair.org/conferences/?conf=toth2014

Presentation:
------------------
The 8th TOTh International Conference will be held on Thursday 12th and Friday 13th June 2014 at the University of Savoie (in Chambéry, France) with the support of the French Ministry of Culture (DGLFLF) and of the European Association for Terminology.

The TOTh Conferences - Terminology & Ontology: Theories and applications - bring together researchers, professionals and, more generally, all persons interested in issues related to language and knowledge engineering.

The program will include accepted communications, a guest lecture, a disputatio, and a poster session. A one and a half day training session (in French) on text analysis for onomasiological purposes will be held just before the conference on Tuesday 10th and Wednesday 11th June.

A Young Researcher Prize will be awarded by the Program Committee during the conference.

Conference topics (non-exhaustive list):
--------------------------------------------------
- Language for Special Purposes, Specialized Lexicography,
- Translation,
- Corpus Analysis,
- Lexicon, Dictionary, Terminology,
- Linguistic planning and Terminology planning,
- Knowledge Representation, Extraction and Management,
- Conceptual Systems,
- Ontology,
- Thesaurus,
- Applications: Information retrieval, Web of data, Digital libraries, Software environments, Terminology banks,
- Standards, Interchange formats.
TOTh 2014
http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2014-en/call-for-papers
http://www.porphyre.org/toth/files/TOTh-2014-Call-for-papers.pdf

From hamon at LIMSI.FR Sat Dec 14 20:41:30 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 21:41:30 +0100
Subject: Revue: CORELA, vol. num. 2
Message-ID:

Date: Wed, 11 Dec 2013 08:58:20 +0100
From: Gilles Col
Message-Id:
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3050
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3080
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3029
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=3120
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2977

Dear colleagues,

The editorial team of the journal CORELA - Cognition / Représentation / Langages is pleased to announce the publication of its latest issue (vol. 11, num. 2).

Submissions for the next issue of the journal (vol. 12, num. 1) should be sent to Sylvester OSU (sylvester.osu at wanadoo.fr) before 15 March 2014.

Contents of issue 11/2:

Philippe MULLER: "Cet obscur objet de désir : à propos de la construction want GN et de ses gloses".
http://corela.edel.univ-poitiers.fr/index.php?id=3050

Elisabet JOHANSSON-MANOURY: "Quelques précisions sur l'anaphore notionnelle".
http://corela.edel.univ-poitiers.fr/index.php?id=3080

Pierre FOURNIER: "Accentuation des composés savants en anglais contemporain : identification morphosémantique et enjeu phonologique".
http://corela.edel.univ-poitiers.fr/index.php?id=3029

Laurence VINCENT-DURROUX: "Les représentations cognitives : pour une mise en évidence de leur rôle dans l'appropriation des langues".
http://corela.edel.univ-poitiers.fr/index.php?id=3120

Chokri RHIBI: "Adverbes de localisation temporelle et enchaînement spatio-temporel : Le cas de dehors et autres expressions apparentées".
http://corela.edel.univ-poitiers.fr/index.php?id=2977

Gilles Col
Editorial Director

From hamon at LIMSI.FR Sat Dec 14 20:44:36 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 14 Dec 2013 21:44:36 +0100
Subject: Appel: EACL 2014 Workshop on Cognitive Aspects of Computational Language Learning
Message-ID:

Date: Wed, 11 Dec 2013 09:18:32 +0100
From: Thierry Poibeau
Message-Id: <2DEB90F7-AE27-4B57-BD5C-E1082484D996 at ens.fr>

[Apologies for cross-postings]

==================================================
Second Call for Papers
==================================================

EACL 2014 Workshop on Cognitive Aspects of Computational Language Learning
April 26, 2014
Gothenburg, Sweden
https://sites.google.com/site/cognitivews2014/

Deadline for Paper Submissions: January 23rd, 2014 (11:59pm GMT -12)

Endorsed by the Special Interest Group of the ACL on Natural Language Learning (SIGNLL)
---------------------------------------------------------------

The human ability to acquire and process language has long attracted interest and generated much debate due to the apparent ease with which such a complex and dynamic system is learnt and used in the face of ambiguity, noise and uncertainty. This subject raises many questions ranging from the nature vs. nurture debate of how much needs to be innate and how much needs to be learned for acquisition to be successful, to the mechanisms involved in this process (general vs specific) and their representations in the human brain. There are also developmental issues related to the different stages consistently found during acquisition (e.g. one word vs. two words) and possible organizations of this knowledge. These have been discussed in the context of first and second language acquisition and bilingualism, with cross-linguistic studies shedding light on the influence of the language and the environment.
The past decades have seen a massive expansion in the application of statistical and machine learning methods to natural language processing (NLP). This work has yielded impressive results in numerous speech and language processing tasks, including e.g. speech recognition, morphological analysis, parsing, lexical acquisition, semantic interpretation, and dialogue management. The good results have generally been viewed as engineering achievements. Recently researchers have begun to investigate the relevance of computational learning methods for research on human language acquisition and change. The use of computational modeling is a relatively recent trend boosted by advances in machine learning techniques, and the availability of resources like corpora of child and child-directed sentences, and data from psycholinguistic tasks by normal and pathological groups. Many of the existing computational models attempt to study language tasks under cognitively plausible criteria (such as memory and processing limitations that humans face), and to explain the developmental stages observed in the acquisition and evolution of the language abilities. In doing so, computational modeling provides insight into the plausible mechanisms involved in human language processes, and inspires the development of better language models and techniques. These investigations are very important since if computational techniques can be used to improve our understanding of human language acquisition and change, these will not only benefit cognitive sciences in general but will reflect back to NLP and place us in a better position to develop useful language models. Success in this type of research requires close collaboration between the NLP, linguistics, psychology and cognitive science communities. 
The workshop is targeted at anyone interested in the relevance of computational techniques for understanding first, second and bilingual language acquisition and language change in normal and clinical conditions. Long and short papers are invited on, but not limited to, the following topics: * Computational learning theory and analysis of language learning and organization * Computational models of first, second and bilingual language acquisition * Computational models of language changes in clinical conditions * Computational models and analysis of factors that influence language acquisition and use in different age groups and cultures * Computational models of various aspects of language and their interaction effect in acquisition, processing and change * Computational models of the evolution of language * Data resources and tools for investigating computational models of human language processes * Empirical and theoretical comparisons of the learning environment and its impact on language processes * Cognitively oriented Bayesian models of language processes * Computational methods for acquiring various linguistic information (related to e.g. speech, morphology, lexicon, syntax, semantics, and discourse) and their relevance to research on human language acquisition * Investigations and comparisons of supervised, unsupervised and weakly-supervised methods for learning (e.g. machine learning, statistical, symbolic, biologically-inspired, active learning, various hybrid models) from a cognitive perspective --------------------------------------------------------------- SUBMISSIONS We invite three different submission modalities: * Regular long papers (8 content pages + 1 page for references): Long papers should report on original, solid and finished research including new experimental results, resources and/or techniques. 
* Regular short papers (4 content pages + 1 page for references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion. * System demonstration (2 pages): System demonstration papers should describe and document the demonstrated system or resources. We encourage the demonstration of both early research prototypes and mature systems, that will be presented in a separate demo session. All submissions must be in PDF format and must follow the EACL 2014 formatting requirements (available at http://www.eacl2014.org/files/eacl-2014-styles.zip). We strongly advise the use of the provided Word or LaTeX template files. For long and short papers, the reported research should be substantially original. The papers will be presented orally or as posters. The decision as to which paper will be presented orally and which as poster will be made by the program committee based on the nature rather than on the quality of the work. Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well. Papers that do not conform to these requirements will be rejected without review. Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters. Submission and reviewing will be electronic, managed by the START system: https://www.softconf.com/eacl2014/CogACLL/ Submissions must be uploaded onto the START system by the submission deadline: January 23rd, 2014 (11:59pm GMT -12 hours) Please choose the appropriate submission type from the START submission page, according to the category of your paper. 
--------------------------------------------------------------- IMPORTANT DATES Jan 23, 2014 Long and Short Paper submission deadline Feb 05, 2014 System Demonstrations submission deadline Feb 20, 2014 Notification of acceptance Mar 03, 2014 Camera-ready deadline Apr 26, 2014 Workshop --------------------------------------------------------------- PROGRAM COMMITTEE Afra Alishahi Tilburg University (Netherlands) Colin J Bannard University of Texas at Austin (USA) Marco Baroni University of Trento (Italy) Robert Berwick Massachusetts Institute of Technology (USA) Philippe Blache LPL, CNRS (France) Jim Blevins University of Cambridge (UK) Antal van den Bosch Radboud University Nijmegen (Netherlands) Chris Brew Nuance Communications (USA) Ted Briscoe University of Cambridge (UK) Alexander Clark Royal Holloway, University of London (UK) Robin Clark University of Pennsylvania (USA) Stephen Clark University of Cambridge (UK) Matthew W. Crocker Saarland University (Germany) Walter Daelemans University of Antwerp (Belgium) Dan Dediu Max Planck Institute for Psycholinguistics (The Netherlands) Barry Devereux University of Cambridge (UK) Benjamin Fagard Lattice-CNRS (France) Jeroen Geertzen University of Cambridge (UK) Ted Gibson Massachusetts Institute of Technology (USA) Henriette Hendriks University of Cambridge (UK) Marco Idiart Federal University of Rio Grande do Sul (Brazil) Mark Johnson Brown University (USA) Aravind Joshi University of Pennsylvania (USA) Gianluca Lebani University of Pisa (Italy) Igor Malioutov Massachusetts Institute of Technology (USA) Marie-Catherine de Marneffe The Ohio State University (USA) Maria Alice Parente Federal University of ABC (Brazil) Massimo Poesio University of Trento (Italy) Brechtje Post University of Cambridge (UK) Ari Rappoport The Hebrew University of Jerusalem (Israel) Anne Reboul L2C2-CNRS (France) Kenji Sagae University of Southern California (USA) Sabine Schulte im Walde University of Stuttgart (Germany) Ekaterina Shutova 
University of California, Berkeley (USA) Maity Siqueira Federal University of Rio Grande do Sul (Brazil) Mark Steedman University of Edinburgh (UK) Suzanne Stevenson University of Toronto (Canada) Remi van Trijp Sony Computer Science Laboratory Paris (France) Shuly Wintner University of Haifa (Israel) Charles Yang University of Pennsylvania (USA) Beracah Yankama Massachusetts Institute of Technology (USA) Menno van Zaanen Tilburg University (Netherlands) Alessandra Zarcone University of Stuttgart (Germany) --------------------------------------------------------------- WORKSHOP ORGANIZERS AND CONTACT Alessandro Lenci (University of Pisa, Italy) Muntsa Padró (Federal University of Rio Grande do Sul, Brazil) Thierry Poibeau (LATTICE-CNRS, France) Aline Villavicencio (Federal University of Rio Grande do Sul, Brazil) For any inquiries regarding the workshop please send an email to cognitive2014 at gmail.com
From hamon at LIMSI.FR Sat Dec 14 21:13:30 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:13:30 +0100 Subject: Appel: EACL 2014 Workshop on Continuous Vector Space Models and their Compositionality Message-ID: Date: Wed, 11 Dec 2013 12:35:49 +0100 From: Alexandre Allauzen Message-ID: <52A84E15.2010007 at limsi.fr> Second Call for Papers (Apologies for multiple postings) ************************************************************************ Workshop on Continuous Vector Space Models and their Compositionality (2nd edition) Co-located with EACL 2014, Gothenburg, Sweden April 27, 2014 Submission deadline: January 23, 2014 https://sites.google.com/site/cvscworkshop2014 ************************************************************************ INVITED SPEAKERS The workshop will showcase presentations from two invited speakers: Geoffrey Zweig (Microsoft Research) and Ivan Titov (University of Amsterdam). AIMS AND SCOPE In recent years, there has been a growing interest in algorithms that learn and use continuous representations for words, phrases, or documents in many natural language processing applications. Among many others, influential proposals that illustrate this trend include latent Dirichlet allocation, neural network based language models and spectral methods. These approaches are motivated by improving the generalization power of the discrete standard models, by dealing with the data sparsity issue and by efficiently handling a wide context. Despite the success of single word vector space models, they are limited since they do not capture compositionality. This prevents them from gaining a deeper understanding of the semantics of longer phrases or sentences.
With the growing popularity of these neural and probabilistic methods of language processing, the scope of this second workshop is extended to theoretical and conceptual questions regarding: * their relation to unsupervised distributional representations, * the encompassing of the compositional aspects of formal models of semantics, * the role of linguistic theory in the design and development of these methods. Some such pertinent questions include: Should phrase representations and word representations be of the same sort? Could different linguistic levels require different modeling approaches? Is compositionality determined by syntax, and if so, how do we learn/define it? Should word representations be fixed and obtained distributionally, or should the encoding be variable? Should word representations be task-specific, or should they be general? In this workshop, we invite submissions of papers on continuous vector space models for natural language processing. Topics of interest include, but are not limited to: * learning algorithms for continuous vector space models, * their compositionality, * their use in NLP applications, * spectral learning for NLP, * neural networks for NLP, * phrase, sentence, and document-level distributional representations, * tensor models, * distributed semantic representations, * the role of syntax in compositional models, * formal and distributional semantic models. SUBMISSION INFORMATION Authors should submit a full paper of up to 8 pages in electronic, PDF format, with up to 2 additional pages for references. The reported research should be substantially original. The papers will be presented orally or as posters. All submissions must be in PDF format and must follow the EACL 2014 formatting requirements (http://www.eacl2014.org/files/eacl-2014-styles.zip ). Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well. 
Submissions must be made through the Softconf website set up for this workshop: https://www.softconf.com/eacl2014/CVSC/ Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters. IMPORTANT DATES 23 January 2014: Submission deadline 20 February 2014: Notification of acceptance 3 March 2014: Camera-ready deadline 27 April 2014: Workshop ORGANIZERS Alexandre Allauzen (LIMSI-CNRS/Université Paris-Sud, France) Raffaella Bernardi (University of Trento, Italy) Edward Grefenstette (University of Oxford, UK) Hugo Larochelle (Université de Sherbrooke, Canada) Christopher Manning (Stanford University, USA) Scott Wen-tau Yih (Microsoft Research, USA) PROGRAM COMMITTEE Nicholas Asher (IRIT-Toulouse) Marco Baroni (University of Trento) Yoshua Bengio (Université de Montréal) Gemma Boleda (University of Texas) Antoine Bordes (Université Technologique de Compiègne) Johan Bos (University of Groningen) Léon Bottou (Microsoft Research) Xavier Carreras (Universitat Politècnica de Catalunya) Lucas Champollion (New York University) Stephen Clark (University of Cambridge) Shay Cohen (Columbia University) Ido Dagan (Bar Ilan University) Ronan Collobert (IDIAP Research Institute, Switzerland) Pino Di Fabbrizio (Amazon) Georgiana Dinu (University of Trento) Kevin Duh (Nara Institute of Science and Technology) Dean Foster (University of Pennsylvania) Alessandro Lenci (University of Pisa) Louise McNally (Universitat Pompeu Fabra) Fabio Massimo Zanzotto (Università degli Studi di Roma) Mirella Lapata (University of Edinburgh) Andriy Mnih (Gatsby Computational Neuroscience Unit) Larry Moss (Indiana University) Diarmuid Ó
Seaghdha (University of Cambridge) Sebastian Pado (Universität Stuttgart) Martha Palmer (University of Colorado) John Platt (Microsoft Research) Maarten de Rijke (University of Amsterdam) Mehrnoosh Sadrzadeh (University of London) Mark Steedman (University of Edinburgh) Chung-chieh Shan (Indiana University) Peter Turney (NRC) Jason Weston (Google) Guillaume Wisniewski (LIMSI-CNRS/Université Paris-Sud) From hamon at LIMSI.FR Sat Dec 14 21:25:15 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:25:15 +0100 Subject: Stage: M2, Amelioration du module de desambiguisation du Moteur de Recherche Semantique d'inbenta Message-ID: Date: Thu, 12 Dec 2013 17:59:28 +0100 From: Manon Quintana Message-ID: *Company overview* ------------------------------ inbenta is a pioneering company in *Natural Language Processing and semantic search*. Building on these innovative concepts, inbenta has been developing software solutions for the websites of major corporate accounts since 2005. *Description of the position* ------------------------------ Inbenta has developed an intelligent search engine called the *Inbenta Semantic Search Engine* (ISSE). The engine's two main tasks are to analyze users' questions and to find the appropriate answer to the query by searching a knowledge base.
A *syntactic and semantic disambiguation module* is integrated into our search engine. This module is very important because the solution depends on it to work properly. The aim of the internship offered by inbenta is to improve the disambiguation module. The intern's tasks will be: - Linguistic and editorial management of a dynamic FAQ project, so that the intern becomes familiar with the existing system - Enriching the disambiguation module by adding rules, lexical descriptions and local grammars, and evaluating the work - Exploring ways to improve the disambiguation module at the algorithmic level *Candidate profile* ------------------------------ We are looking for an enthusiastic, organized and conscientious person who is keen to join an international team. The intern should also have the following skills: - Studies in Natural Language Processing - Excellent command of French and good written and oral communication in Spanish, English or Catalan Bonus: - Proficiency in at least one programming language (preferably PHP) - Proficiency in regular expressions and SQL *Terms of the position* ------------------------------ - Internship of 5 to 6 months (with the possibility of subsequent hiring on a permanent contract, CDI) - Expected pay: 30% of the SMIC (French minimum wage), plus a performance-based bonus - Start: from February / March 2013 - Location: Toulouse Please send your CV and cover letter to Quintana Manon at
the following email address: *mquintana at inbenta.com* From hamon at LIMSI.FR Sat Dec 14 21:57:07 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:57:07 +0100 Subject: Appel: ESWC 2014, Call for Demos Message-ID: Date: Fri, 13 Dec 2013 15:31:40 +0100 (CET) From: speroni at cs.unibo.it Message-Id: <20131213143200.7CF23DBB67 at vina.cines.fr> X-url: http://2014.eswc-conferences.org/important-dates/call-demos ** apologies for cross-posting ** ==== Call for Demos ==== http://2014.eswc-conferences.org/important-dates/call-demos The demonstrations track of the ESWC 2014 complements the overall program of the conference with demonstrations of real-world, applied-and-tested advances of semantic technologies. In addition to papers and posters presenting latest theoretical achievements, the ESWC 2014 is seeking demonstrations of novel applications of semantics in various sectors, including: eGovernment, eEnvironment, eMobility and smart cities, eHealth, Life Sciences, Sensor networks and RDF Stream Processing, Linked Science, Media and entertainment, Telecommunications, Cultural heritage, Financial services, Energy and utilities, Manufacturing, Digital libraries, Cloud applications, Personal Information Management, Decision support, Emergency and crisis management etc.
The demonstrations track is an opportunity for researchers and practitioners to present their innovative prototypes, practical developments, on-going projects, lessons learned and late-breaking results, relevant for the topics of interest of the main conference and the research and poster tracks. Submissions to the demonstrations track should make clear what will be demonstrated, and in particular point out what makes the demonstration a novel showcase. A video of the demo should be provided if a link to the demonstrator is not available. Submission should also specify: * What is the research background and application context of the demonstration, and what makes this a novel showcase for semantic technologies. * What is the key technology used, and how does the demonstrated system, application or infrastructure relate to pre-existing work. * What exactly will be demonstrated? What exactly will a visitor of the demonstration learn? * Submission Guidelines * Submissions to the demonstrations track must be in the form of a textual description of the demonstration to be given at the conference (which may include screenshots and must include either a link to the online demo or a video presenting it). The submissions must be at most 5 pages long including reference list (Springer LNCS style) and submitted as PDF. Decisions about acceptance will be based on relevance to the Semantic Web area, originality, potential significance, topicality and clarity. Papers that exceed the given page length or do not follow the LNCS guidelines will be rejected without a review. 
Submissions should be uploaded using the Easychair system: https://www.easychair.org/conferences/?conf=eswc2014pd * Important Dates * Submission Deadline: March 16, 2014 Notification of Acceptance: April 10, 2014 Camera-Ready Paper: April 25, 2014 * Important Notice * Demonstrator papers will appear in supplementary post-conference proceedings to be published by Springer in the Lecture Notes in Computer Science series. Participants with an accepted demo paper must register for the conference and present their system during the Demo Session. A space for demonstration will be allocated for each participant. Participants should use their own laptop for the demos. The organizers should be contacted well in advance of the conference in case of any special requirement. * Poster and Demo Chairs * Raphael Troncy (Multimedia Department, EURECOM, FR) Eva Blomqvist (Department of Computer and Information Science, Linkoping University, SE) From hamon at LIMSI.FR Sat Dec 14 21:15:49 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:15:49 +0100 Subject: Journee: Ateliers de formation a TXM, session de janvier-fevrier 2014, Lyon Message-ID: Date: Wed, 11 Dec 2013 14:31:15 +0100 From: Serge Heiden Message-ID: <52A86923.3040905 at ens-lyon.fr> X-url: http://textometrie.ens-lyon.fr Hello, The next TXM workshops will take place on the following dates (in Lyon): - Tuesday 21 January: Advanced TXM; - Thursday 27 February: Introduction to TXM; - Friday 28 February: Corpus preparation and import into TXM. TXM is a free (libre) tool for analyzing corpora of texts and transcriptions. If you are interested in one of these sessions, you must register by sending an email as soon as possible (the number of places is limited) to textometrie ens-lyon fr (registration is required even if you responded to the Doodle poll). Full information about the workshops: https://groupes.renater.fr/wiki/txm-users/public/ateliers_txm We hope to see you
there, Serge Heiden, for the TXM team. From hamon at LIMSI.FR Sat Dec 14 20:47:36 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 21:47:36 +0100 Subject: Appel: ESSLLI 2014 Student Session Message-ID: Date: Wed, 11 Dec 2013 11:44:38 +0100 From: Ronald de Haan Message-ID: X-url: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ X-url: http://www.esslli2014.info/ Please forward to students. Apologies for the multiple messages. *1st Call for Papers* *ESSLLI 2014 STUDENT SESSION* http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ *Held during* *The 26th European Summer School in* *Logic, Language and Information* Tübingen, Germany, August 11-22, 2014 *Deadline for submissions: April 1st, 2014* https://www.easychair.org/conferences/?conf=essllistus2014 *ABOUT:* The Student Session of the 26th European Summer School in Logic, Language, and Information (ESSLLI) will take place in Tübingen, Germany, on August 11-22, 2014. We invite submissions of original, unpublished work from students in any area at the intersection of Logic & Language, Language & Computation, or Logic & Computation. Submissions will be reviewed by several experts in the field, and accepted papers will be presented orally or as posters and will appear in the student session proceedings.
This is an excellent opportunity to receive valuable feedback from expert readers and to present your work to a diverse audience. *SEPARATE POSTER SESSION:* Note that there are two separate kinds of submissions, one for the oral presentations and one for the posters. This means that papers can be directly submitted as posters. Reviewing and ranking will be done separately. We particularly encourage submissions for posters, as they offer an excellent opportunity to present research in progress. More detailed guidelines and policies regarding submission can be found on the Student Session website: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ . Please direct inquiries about submission procedures or other matters relating to the Student Session to: dehaan at kr.tuwien.ac.at. For general inquiries about ESSLLI 2014, please consult the main ESSLLI 2014 page: http://www.esslli2014.info/. Kind regards, *the ESSLLI 2013 Student Session Organization Committee* *Chair:* Ronald de Haan (Technische Universität Wien, Austria) *LoCo co-chairs:* Zoé Christoff (Universiteit van Amsterdam, the Netherlands) Aybüke Özgün (Université de Lorraine, France) *LoLa co-chairs:* Philip Schulz (Universiteit van Amsterdam, the Netherlands) Thomas Brochhagen (Universität Düsseldorf, Germany) *LaCo co-chairs:* Miriam Kaeshammer (Universität Düsseldorf, Germany) Ramon Ziai (Universität Tübingen, Germany)
From hamon at LIMSI.FR Sat Dec 14 21:22:54 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:22:54 +0100 Subject: Stage: Master recherche appliquee en TAL, region parisienne Message-ID: Date: Thu, 12 Dec 2013 17:16:38 +0100 From: philippe.martin at stella-medica.fr Message-ID: Stella Medica is a recruitment firm well established in the medical sector. In partnership with an IT consulting company, we are developing recruitment management tools, both for our own needs and with a view to marketing them. As part of this development, you will work on a module for managing the analysis of candidate data. The algorithms to be integrated or developed will cover the following: - For CV analysis: o Identifying the type and language of the document o Detecting information in the text such as (simple examples): last name, first name, date of birth, address, occupation and level of experience - Detecting similar documents - Generating a summary, and deriving versions of this summary in several languages (English in particular) - Improving the existing multi-criteria search engine - Automatically searching social networks and highlighting strengths or inconsistencies You will work in a team made up mainly of software developers and will therefore be fairly autonomous. The team will be supervised by an experienced IT project manager, following a professional software development approach: CMMI process, tools for project management, source sharing and test management. The development platform is mainly Microsoft: ASP.NET MVC in C#, C# components; SQL Server. Ideally, you are looking for a final-year internship (Bac+5, Master's level) in Natural Language Processing (TAL) or Language Engineering.
The result of your work must be "convertible into a computer program", a phase that you may carry out yourself if appropriate. Internship of flexible length, from 4 to 6 months, starting in February 2014, located in Rueil-Malmaison. Pay depending on profile, meal vouchers, reimbursement of 50% of the Navigo pass. Contact Philippe Martin with your CV, stating your intended internship period: philippe.martin at stella-medica.fr From hamon at LIMSI.FR Sat Dec 14 21:18:28 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:18:28 +0100 Subject: Appel: EACL 2014 Workshop on Multiword Expressions (MWE 2014) Message-ID: Date: Thu, 12 Dec 2013 14:53:39 +0100 From: Agata Savary Message-ID: <52A9BFE3.3070802 at univ-tours.fr> X-url: http://multiword.sourceforge.net/mwe2014 Second CALL FOR PAPERS The 10th Workshop on Multiword Expressions (MWE 2014) http://multiword.sourceforge.net/mwe2014 Workshop at EACL 2014 (Gothenburg, Sweden), April 26-27, 2014 Endorsed by the Special Interest Group on the Lexicon of the Association for Computational Linguistics (SIGLEX; http://www.siglex.org/); SIGLEX's Multiword Expressions Section (SIGLEX-MWE; http://multiword.sourceforge.net/PHITE.php?sitesig=MWE); and PARSEME, European IC1207 COST Action (http://www.parseme.eu) Submission deadline: Long & short papers - January 23, 2014 at 11:59pm GMT -12
hours ------------------------- Call For Papers Under the denomination "multiword expression", one assumes a wide range of linguistic constructions such as idioms ("storm in a teacup", "sweep under the rug"), fixed phrases ("in vitro", "by and large", "rock'n roll"), noun compounds ("olive oil", "laser printer"), compound verbs ("take a nap", "bring about"), etc. While easily mastered by native speakers, their interpretation poses a major challenge for computational systems, due to their flexible and heterogeneous nature. For a start, MWEs are not nearly as frequent in NLP resources as they are in real-world text, and this problem of coverage may impact the performance of many NLP tasks. Moreover, treating MWEs also involves problems like determining their semantics, which is not always compositional ("to kick the bucket" meaning "to die"). In sum, MWEs are a key issue and a current weakness for natural language parsing and generation, as well as real-life applications depending on language technology, such as machine translation, just to name a prominent one among many. Thanks to the joint efforts of researchers from several fields working on MWEs, significant progress has been made in recent years, especially concerning the construction of large-scale language resources. For instance, there is a large number of recent papers which focus on acquisition of MWEs from corpora, and others that describe a variety of techniques to find paraphrases for MWEs. Current methods use a plethora of tools such as association measures, machine learning, syntactic patterns, web queries, etc. A considerable body of techniques, resources and tools to perform these tasks are now available, and are indicative of the growing importance of the field within the NLP community. Many of these advances are described as part of the annual workshop on MWEs, which attracts the attention of an ever-growing community working on a variety of languages and MWE types. 
The workshop has been held since 2001 in conjunction with major computational linguistics conferences (ACL, EACL, NAACL, COLING, LREC), providing an important venue for the community to interact, share resources and tools and collaborate on efforts for advancing the computational treatment of MWEs. Additionally, special issues on MWEs have been published by leading journals in computational linguistics. The latest such effort is the special issue on "Multiword Expressions: from Theory to Practice and Use", which has recently been published by the ACM Transactions on Speech and Language Processing (http://multiword.sourceforge.net/tslp2011si). MWE 2014 will be the 10th event in the series. We will be interested in major challenges in the overall process of MWE treatment, both from the theoretical and the computational viewpoint, focusing on original research related (but not limited) to the following topics: * Manually and automatically constructed resources * Representation of MWEs in dictionaries and ontologies * MWEs and user interaction * Multilingual acquisition * Multilingualism and MWE processing * Models of first and second language acquisition of MWEs * Crosslinguistic studies on MWEs * The role of MWEs in the domain adaptation of parsers * Integration of MWEs into NLP applications * Evaluation of MWE treatment techniques * Lexical, syntactic or semantic aspects of MWEs The workshop will feature a "Special Track on Parsing and MWEs" dedicated to "deep" parsing of MWEs, inviting submissions on the following and related challenges: * Lexicon-grammar interface: representing, at the lexical level, phenomena such as agreement, discontinuity and free word order; construction of MWE lexicons which would be easily convertible and maximally reusable in different parsing frameworks. 
* "Deep" parsing techniques for MWEs: optimal representation of MWEs within linguistic frameworks, such as CCG, HPSG, LFG, TAG, minimalism, etc.; processing MWEs before, during or after parsing; representing the semantics of MWEs. * Hybrid parsing of MWEs: combining data-driven and knowledge-based approaches for efficient and linguistically precise parsers; using unannotated data for improving models based on annotated data. * Annotating MWEs in treebanks: MWE-aware methodologies of treebank construction, and their increased usability in parsing. This special track is endorsed by PARSEME, European IC1207 COST Action, dedicated to parsing and MWEs (www.parseme.eu). PARSEME will fund travel and stay for at least 20 participants from the Action's member countries (the grant application procedure will be published shortly). A separate time slot will be allocated to the special track within the workshop. Authors may submit papers either to the special track or to the regular workshop. They should follow common submission instructions. Submission modalities For MWE 2014, we will accept the following two types of submissions: Regular long papers (8 content pages + 1 page for references): Long papers should report on solid and finished research including new experimental results, resources and/or techniques. Regular short papers (4 content pages + 1 page for references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion. The reported research should be substantially original. The papers will be presented orally or as posters. The decision as to which papers will be presented orally and which as posters will be made by the program committee based on the nature rather than on the quality of the work. All submissions must be in PDF format and must follow the EACL 2014 formatting requirements (available at http://www.eacl2014.org/files/eacl-2014-styles.zip).
We strongly advise the use of the provided Word or LaTeX template files. Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well. Resources submitted with the papers should be anonymized for submission. Papers and/or resources that do not conform to these requirements will be rejected without review. Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters. Submission and reviewing will be electronic, managed by the START system: https://www.softconf.com/eacl2014/MWE/ Submissions must be uploaded onto the START system by the submission deadline: January 23rd, 2014 (11:59pm GMT -12 hours) Please choose the appropriate submission type from the starting submission page, according to the category of your paper. Important dates 23 January 2014: Long & short paper submission deadline 11:59pm GMT -12 hours 20 February 2014: Notification of Acceptance 03 March 2014: Camera-ready papers due 26-27 April 2014: Workshop Dates Program Committee Iñaki Alegria, University of the Basque Country (Spain) Dimitra Anastasiou, University of Bremen (Germany) Doug Arnold, University of Essex (UK) Eleftherios Avramidis, DFKI GmbH (Germany) Tim Baldwin, University of Melbourne (Australia) Núria Bel, Universitat Pompeu Fabra (Spain) Chris Biemann, Technische Universität Darmstadt (Germany) Francis Bond, Nanyang Technological University (Singapore) Lars Borin, University of Gothenburg (Sweden) António Branco, University of Lisbon (Portugal) Miriam Butt, Universität Konstanz (Germany) Aoife Cahill, ETS (USA) Ken Church, IBM Research (USA) Matthieu Constant, Université
Paris-Est Marne-la-Vallée (France) Paul Cook, University of Melbourne (Australia) Béatrice Daille, Nantes University (France) Koenraad De Smedt, University of Bergen (Norway) Gaël Dias, University of Caen Basse-Normandie (France) Gülşen Eryiğit, Istanbul Technical University (Turkey) Tomaž Erjavec, Jožef Stefan Institute (Slovenia) Joaquim Ferreira da Silva, New University of Lisbon (Portugal) Roxana Girju, University of Illinois at Urbana-Champaign (USA) Chikara Hashimoto, National Institute of Information and Communications Technology (Japan) Ulrich Heid, Universität Hildesheim (Germany) Kyo Kageura, University of Tokyo (Japan) Su Nam Kim, Monash University (VIC, Australia) Ioannis Korkontzelos, University of Manchester (UK) Brigitte Krenn, Austrian Research Institute for Artificial Intelligence (Austria) Cvetana Krstev, University of Belgrade (Serbia) Marie-Catherine de Marneffe, The Ohio State University (USA) Takuya Matsuzaki, National Institute of Informatics (Japan) Preslav Nakov, Qatar Computing Research Institute (Qatar) Malvina Nissim, University of Bologna (Italy) Joakim Nivre, Uppsala University (Sweden) Diarmuid Ó Séaghdha, University of Cambridge (UK) Jan Odijk, University of Utrecht (The Netherlands) Yannick Parmentier, Université
d'Orléans (France) Pavel Pecina, Charles University in Prague (Czech Republic) Scott Piao, Lancaster University (UK) Adam Przepiórkowski, Institute of Computer Science, Polish Academy of Sciences (Poland) Victoria Rosén, University of Bergen (Norway) Carlos Ramisch, Aix-Marseille University (France) Manfred Sailer, Goethe University Frankfurt am Main (Germany) Magali Sanches Duran, University of São Paulo (Brazil) Violeta Seretan, University of Geneva (Switzerland) Ekaterina Shutova, University of California, Berkeley (USA) Jan Šnajder, University of Zagreb (Croatia) Pavel Straňák, Charles University in Prague (Czech Republic) Sara Stymne, Uppsala University (Sweden) Stan Szpakowicz, University of Ottawa (Canada) Beata Trawinski, Institut für Deutsche Sprache (IDS), Mannheim (Germany) Yulia Tsvetkov, Carnegie Mellon University (USA) Yuancheng Tu, Microsoft (USA) Ruben Urizar, University of the Basque Country (Spain) Gertjan van Noord, University of Groningen (The Netherlands) Aline Villavicencio, Federal University of Rio Grande do Sul (Brazil) Veronika Vincze, Hungarian Academy of Sciences (Hungary) Martin Volk, University of Zurich (Switzerland) Tom Wasow, Stanford University (USA) Shuly Wintner, University of Haifa (Israel) Dekai Wu, The Hong Kong University of Science & Technology (Hong Kong) Workshop Organizers Valia Kordoni (Humboldt-Universität zu Berlin, Germany) Markus Egg (Humboldt-Universität zu Berlin, Germany) Agata Savary (Université François Rabelais Tours, France; Special Track Organiser) Eric Wehrli (Université
de Genève, Switzerland; Special Track Organiser) Stefan Evert (Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany) Contact For any inquiries regarding the workshop please send an email to mweworkshop.eacl2014 at gmail.com From hamon at LIMSI.FR Sat Dec 14 21:27:38 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:27:38 +0100 Subject: Appel: EACL 2014 Workshop on Hybrid Approaches to Translation (3rd HyTra) Message-ID: Date: Thu, 12 Dec 2013 20:28:00 +0100 From: "Reinhard Rapp" Message-ID: <2C7CD65A618846AA874D7EBEE8259000 at ASUSPC> X-url: http://parles.upf.edu/llocs/plambert/hytra/hytra2014/ Apologies for multiple postings Please distribute to colleagues ================================================================= 2nd Call for Papers THIRD WORKSHOP ON HYBRID APPROACHES TO TRANSLATION (HyTra 2014) Co-located with EACL 2014 http://eacl2014.org/ Gothenburg, Sweden April 27, 2014 Deadline for paper submissions: January 23, 2014 http://parles.upf.edu/llocs/plambert/hytra/hytra2014/ INVITED SPEAKERS Hans Uszkoreit (Saarland University and DFKI, Germany) Joakim Nivre (Uppsala University, Sweden) ================================================================= WORKSHOP DESCRIPTION The aim of the HyTra workshop series is to bring together researchers developing and applying statistical, example-based, or rule-based translation systems,
and those enhancing MT systems by combining elements from different approaches, to promote discussion and sharing of ideas among them. Hereby one relevant focus is on effectively combining linguistic and data driven approaches (rule-based and statistical MT). Another focus is on hybridization in the context of human translation. The 3rd Workshop on Hybrid Approaches to Translation (HyTra-3) intends to continue developing and empowering the research agenda in the area of Hybrid Translation already started at its first and second editions. The previous two editions (see http://www-lium.univ-lemans.fr/esirmt-hytra/ and http://hytra.barcelonamedia.org/hytra2013/) were co-located with EACL 2012 in Avignon and with ACL 2013 in Sofia, and the proceedings were published on the ACL Anthology. TOPICS We solicit contributions including but not limited to the following topics: - ways and techniques of hybridization - architectures for the rapid development of hybrid MT systems - applications of hybrid systems - hybrid systems dealing with under-resourced languages - hybrid systems dealing with morphologically rich languages - using linguistic information (morphology, syntax, semantics) to enhance statistical MT (e.g. with hierarchical or factored models) - using contextual information to enhance statistical MT - bootstrapping rule-based systems from corpora - hybrid methods in spoken language translation - extraction of dictionaries and other large-scale resources for MT from parallel and comparable corpora - induction of morphological, grammatical, and translation rules from corpora - machine learning techniques for hybrid MT - describing structural mappings between languages (e.g. tree-structures using synchronous/transduction grammars) - heuristics for limiting the search space in hybrid MT - alternative methods for the fair evaluation of the output of different types of MT systems (e.g. 
relying on linguistic criteria) - system combination approaches such as multi-engine MT (parallel) or automatic post-editing (sequential) - open source tools and free language resources for hybrid MT SUBMISSIONS Contributions can be short or long papers. Short paper submissions must describe original and unpublished work without exceeding five pages of content plus one extra page for references. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding eight pages of content plus two extra pages for references. Submissions will be judged according to the criteria of the main conference (EACL 2014). Authors are invited to submit papers on original and previously unpublished work. Formatting should be according to EACL 2014 specifications using LaTeX or MS-Word style files, see section "submission format" at http://eacl2014.org/call-for-papers. Reviewing of papers will be double-blind, so the submissions should not reveal the authors' identity. Submission is electronic in PDF format using the START submission system at the following URL: https://www.softconf.com/eacl2014/HyTra/ Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop contact person (see below). If accepted, withdrawals are only possible within two days after notification. For an accepted paper to appear in the proceedings, at least one author must register for the workshop and actually present the paper. The papers will be published in the workshop proceedings which will be made available via the ACL Anthology. BEST PAPERS Authors of selected papers will be invited to contribute extended versions of their papers as book chapters for an edited volume on hybrid MT. 
IMPORTANT DATES January 23, 2014: Deadline for paper submission February 20, 2014: Notification of acceptance March 3, 2014: Camera ready papers due April 27, 2014: Workshop in Gothenburg ORGANIZERS Rafael E. Banchs (Institute for Infocomm Research, Singapore) Marta R. Costa-jussa (Institute for Infocomm Research, Singapore) Reinhard Rapp (Universities of Aix-Marseille and Mainz) Patrik Lambert (Pompeu Fabra University, Barcelona) Kurt Eberle (Lingenio GmbH, Heidelberg) Bogdan Babych (University of Leeds) CONTACT PERSON Rafael E. Banchs: rembanchs (at) i2r (dot) a-star (dot) edu (dot) sg PROGRAMME COMMITTEE * Ahmet Aker, University of Sheffield, UK * Bogdan Babych, University of Leeds, UK * Rafael E. Banchs, Institute for Infocomm Research, Singapore * Alexey Baytin, Yandex, Moscow, Russia * Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain * Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland * Michael Carl, Copenhagen Business School, Denmark * Marta R. Costa-jussa, Institute for Infocomm Research, Singapore * Oliver Culo, University of Mainz, Germany * Kurt Eberle, Lingenio GmbH, Heidelberg, Germany * Andreas Eisele, DGT (European Commission), Luxembourg * Marcello Federico, Fondazione Bruno Kessler, Trento, Italy * Christian Federmann, Language Technology Lab, DFKI, Saarbrücken, Germany * José A. R. Fonollosa, Universitat Politècnica de Catalunya, Barcelona, Spain * Maxim Khalilov, TAUS, Amsterdam, The Netherlands * Patrik Lambert, Pompeu Fabra University, Barcelona, Spain * Udo Kruschwitz, University of Essex, UK * Yanjun Ma, Baidu Inc., Beijing, China * José B.
Mariño, Universitat Politècnica de Catalunya, Barcelona, Spain * Bart Mellebeek, University of Amsterdam, The Netherlands * Reinhard Rapp, Universities of Aix-Marseille, France, and Mainz, Germany * Anders Søgaard, University of Copenhagen, Denmark * Nasredine Semmar, CEA LIST, Fontenay-aux-Roses, France * Wade Shen, Massachusetts Institute of Technology, Cambridge, USA * Serge Sharoff, University of Leeds, UK * George Tambouratzis, Institute for Language and Speech Processing, Athens, Greece * Jörg Tiedemann, University of Uppsala, Sweden * Dekai Wu, The Hong Kong University of Science and Technology, Hong Kong, China
From hamon at LIMSI.FR Sat Dec 14 21:57:55 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:57:55 +0100 Subject: Appel: ESWC 2014, Call for Posters Message-ID: Date: Fri, 13 Dec 2013 15:57:01 +0100 (CET) From: speroni at cs.unibo.it Message-Id: <20131213145722.2F128DDA89 at vina.cines.fr> X-url: http://2014.eswc-conferences.org/important-dates/call-posters X-url: http://2014.eswc-conferences.org ** apologies for cross-posting ** ==== Call for Posters ==== http://2014.eswc-conferences.org/important-dates/call-posters 11th ESWC Conference 2014 Dates: May 25 - 29, 2014 Venue: Anissaras, Crete, Greece Hashtag: #eswc2014 Feed: @eswc_conf Site: http://2014.eswc-conferences.org General Chair: Valentina Presutti (STLab, ISTC-CNR, IT) Poster and Demo Chairs: - Raphael Troncy (Multimedia Department, EURECOM, FR) - Eva Blomqvist (Department of Computer and Information Science, Linkoping University, SE) The poster track of ESWC 2014 will provide an opportunity for presenting late-breaking results, on-going research projects, and speculative or innovative work in progress. Posters are intended to provide authors and participants with the ability to connect with each other and to engage in discussions about the work. We invite submissions relevant to the area of the Semantic Web that address, but are not limited to, the topics of the Research Track. Theoretical or technical posters, reports on semantic systems (however, demonstrations of such systems should be submitted in the demo track), descriptions of completed work, and work in progress are equally welcome. Papers submitted to the research track will NOT automatically be considered for the poster track. A separate poster submission must be made. For example, a poster can be used to present work that was insufficiently mature for the research track.
* Submission Guidelines * Authors must submit a 4-page paper (PDF, Springer LNCS style) with a short abstract for evaluation. The paper must clearly demonstrate relevance to the Semantic Web and the topics of interest of ESWC 2014. Decisions about acceptance will be based on relevance to the Semantic Web, originality, potential significance, topicality and clarity. Papers that exceed the given page length or do not follow the LNCS guidelines will be rejected without a review. Submissions should be uploaded using the Easychair system: https://www.easychair.org/conferences/?conf=eswc2014pd * Important Dates * Submission Deadline: March 16, 2014 Notification of Acceptance: April 10, 2014 Camera-Ready Paper: April 25, 2014 * Important Notes * Poster papers will appear in supplementary post-conference proceedings to be published by Springer in the series Lecture Notes in Computer Science (LNCS). Participants with an accepted poster must register for the conference and present their work during the Poster Session. A space will be allocated for each participant. The organizers should be contacted well in advance of the conference in case of any special requirement. From hamon at LIMSI.FR Sat Dec 14 21:48:58 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:48:58 +0100 Subject: Info: Establishing the International Standard Language Resource Number (ISLRN) Message-ID: Date: Fri, 13 Dec 2013 16:45:44 +0100 From: ELRA ELDA Information Message-ID: <52AB2BA8.1030102 at elda.org> X-url: http://liste.cines.fr//ticket/61393947172170 X-url: http://www.elra.info/Establishing-the-ISLRN.html# /Press Release - Immediate - Paris, France, December 13, 2013/ *Establishing the International Standard Language Resource Number (ISLRN)* 12 major NLP organisations announce the establishment of the ISLRN, a Persistent Unique Identifier, to be assigned to each Language Resource. 
On November 18, 2013, 12 NLP organisations agreed to announce the establishment of the International Standard Language Resource Number (ISLRN), a Persistent Unique Identifier, to be assigned to each Language Resource. Experiment replicability, an essential feature of scientific work, would be enhanced by such a unique identifier. Set up by ELRA, LDC and AFNLP/Oriental-COCOSDA, the ISLRN Portal will provide unique identifiers using a standardised nomenclature, as a service free of charge for all Language Resource providers. It will be supervised by a steering committee composed of representatives of participating organisations and enlarged whenever necessary. For more information on ELRA and the ISLRN, please contact: Khalid Choukri choukri at elda.org For more information on ELDA, please contact: Hélène Mazo mazo at elda.org (http://www.elra.info/Establishing-the-ISLRN.html#) ELRA 55-57, rue Brillat Savarin 75013 Paris (France) Tel.: +33 1 43 13 33 33 Fax: +33 1 43 13 33 30 *** About ELRA *** The European Language Resources Association (ELRA) is a non-profit making organisation founded by the European Commission in 1995, with the mission of providing a clearing house for Language Resources and promoting Human Language Technologies (HLT).
To find out more about ELRA, please visit our web site: www.elra.info *** About NLP12 *** Alliance of Digital Humanities Organizations (ADHO), Association for Computational Linguistics (ACL), Asian Federation of Natural Language Processing (AFNLP), COLING Committee (ICCL), European Data Forum, European Language Resources Association (ELRA), International Association for Machine Translation (IAMT), International Committee for the Coordination & Standardisation of Speech Databases and Assessment Techniques (COCOSDA), International Speech Communication Association (ISCA), Linguistic Data Consortium (LDC), Oriental COCOSDA, Language Resource Management Agency (RMA) To find out more about NLP12, please read the NLP12 Paris Declaration (http://www.elra.info/NLP12-Paris-Declaration.html). From hamon at LIMSI.FR Sat Dec 14 21:35:11 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:35:11 +0100 Subject: Appel: WIMS'14 (deadline extension) Message-ID: Date: Fri, 13 Dec 2013 10:23:11 +0100 From: Plantié
Michel Message-ID: <52AAD1FF.5090101 at mines-ales.fr> X-url: http://wims14.csd.auth.gr *** apologies for cross-postings *** ======================================================================== Due to a number of requests we have decided to extend the paper submission deadlines NEW deadline for paper submission: *** January 20th 2014 *** ======================================================================== CALL FOR RESEARCH / APPLICATION & CASE STUDY PAPERS / POSTERS CALL FOR WORKSHOPS / TUTORIALS / DEMONSTRATIONS KEYNOTES ANNOUNCEMENT WIMS'14 4th International Conference on Web Intelligence, Mining and Semantics Thessaloniki, Greece, 2-4 June 2014 http://wims14.csd.auth.gr About WIMS'14 Conference -------------------------------------- WIMS is a series of peer-reviewed International Computer Science conferences. It is a forum for researchers and practitioners to present their state-of-the-art results in building Intelligent Web, to examine performance characteristics of various approaches in Web-based intelligent information technology, and to cross-fertilize their ideas on the development of Web-based intelligent information management solutions across different domains. Objectives -------------- The purpose of the WIMS series is: - To provide a forum for established researchers and practitioners to present their contributions to the state of the art research and development in Web technology and applications - To give doctoral students an opportunity to present their research to a friendly and knowledgeable audience and receive valuable feedback - To provide an informal social event where Web technology researchers and practitioners can meet Call for Research Papers/Posters -------------------------------------------- WIMS'14 is intended to foster the dissemination of state-of-the-art research in the area of Web intelligence, Web mining, Web semantics and the fundamental interaction between them.
Authors are invited to submit regular (12 pages) or short (6 pages) research papers or posters (4 pages) on all related areas. Papers exploring new directions or areas will receive a thorough and encouraging review. More details on the nature of the different submission types can be found at: http://wims14.csd.auth.gr/?page_id=23 The papers in all the categories should describe original results that have not been accepted or submitted for publication elsewhere. All submissions will be evaluated by at least three members of the international program committee. Submissions should be made electronically in PDF or DOC/DOCX (MS/Open Word) format via the electronic submission system of the WIMS'14 Conference Management system at https://www.easychair.org/conferences/?conf=wims14 The submissions should be typeset using the templates of the ACM International Conference Proceedings Series (ICPS). The templates can be retrieved from http://www.acm.org/sigs/publications/proceedings-templates Call for Application and Case Study Track Papers ---------------------------------------------------------------- The goal of this track is to provide opportunities for researchers, innovators, industry leaders, and practitioners to present and exchange ideas, results, and experiences in the applications of Web Intelligence, Mining, and Semantics (WIMS). The Application & Case Studies Track invites submissions describing implementations of Web Intelligence, Mining, and Semantics in industrial or government or other "real world" settings. Papers must include descriptions of how the applications have been conceived, developed, and (probably) deployed. Details about the submission types and the submission procedure of this track can be found at: http://wims14.csd.auth.gr/?page_id=154 Call for Tutorials/Demonstration Proposals ---------------------------------------------------------- WIMS'14 also welcomes Tutorial and/or Demonstration proposals.
WIMS'14 will include tutorials and/or demonstrations (on new applications) providing in-depth background on subjects that are of broad interest to the Web Mining and Web Semantics community. The tutorials will be part of the main conference technical program. Proposal papers must be no less than 5 pages and no more than 12 pages, and must provide a sense of both the scope of the tutorial and depth within the scope. Details about the submissions of the tutorial/demonstration proposals can be found at: http://wims14.csd.auth.gr/?page_id=148 Call for Workshop Proposals -------------------------------------- WIMS'14 invites proposals for workshops to be held in conjunction with the conference. Organizers may structure workshops as they see fit, possibly including invited talks, panel discussions, presentations of work in progress, peer-reviewed papers, or some combination. A workshop could be scheduled for half a day or a full day, depending on potential interest and the organizers' preference. The organizers of successful workshops will be responsible for their own reviewing process and website, and for co-ordinating the collection and delivery of camera ready material and ACM copyright transfer. Accepted papers in workshops will be published in the WIMS'14 proceedings and disseminated through the ACM Digital Library. Selected extended papers will be invited to appear in a special issue of a reputed journal in the field. Details about the submissions of the workshop proposals can be found at: http://wims14.csd.auth.gr/?page_id=150 Conference Scope ------------------------- WIMS solicits regular and work-in-progress research, discussion papers and industry experience report papers in related fields. Papers exploring new directions or areas are also welcome.
In particular but not exclusively the submissions within the following major areas are relevant: - Scalable Web and Data Architectures and Infrastructures - Web Intelligence - Web Mining, Information and Knowledge Extraction - Web Semantics and Reasoning - WIMS Applications - Evaluation and Validation of WIMS Technologies and Applications More details about the conference scope and topics can be found at: http://wims14.csd.auth.gr/?page_id=146 Publications ---------------- Accepted papers/tutorials/posters will be published by ACM and disseminated through the ACM Digital Library through the International Conference Proceedings Series (ICPS). A selection of the best WIMS'14 papers will be invited to be revised and extended for the post-conference publication in the Special Issue of the International Journal on Artificial Intelligence Tools (IJAIT). The proceedings of the previous WIMS conferences could be checked at: - WIMS'11: https://dl.acm.org/citation.cfm?id=1988688 - WIMS'12: https://dl.acm.org/citation.cfm?id=2254129 - WIMS'13: https://dl.acm.org/citation.cfm?id=2479787 Keynotes ------------- WIMS'14 organizing committee is happy to announce the following significant keynote speakers for this year's conference: - "Large-Scale Reasoning with (Semantic) Data": Grigoris Antoniou, Professor, University of Huddersfield - "Semantic Technology for online, broadcast and print media": Jem Rayfield, Head Architect, Financial Times - "Big Data - from Hype to Reality": Richard Benjamins, Dr., Director of Business Intelligence, Telefonica Important Dates --------------- 13.01.2014 submission of abstracts for Application and Case Study Track Papers ** NEW ** 20.01.2014 submission of Research Track papers & Application and Case Study Track papers & posters ** NEW ** 20.12.2013 Workshop & Tutorial Proposals submission, review and notification 24.03.2014 notification of acceptance for (all tracks) papers/posters ** NEW ** 07.04.2014 camera ready versions of the accepted (all tracks) papers, posters, tutorial papers ** NEW ** 14.04.2014 author registration deadline ** NEW ** 02-04.06.2014 Conference Organization ----------------- WIMS'14 is organized under the auspices of the Dept. of Informatics at the Aristotle University of Thessaloniki, Greece. (http://www.csd.auth.gr/en/) WIMS Conferences Chair Rajendra Akerkar, Western Norway Research Institute, Norway General Chairs Ioannis Vlahavas, Aristotle University of Thessaloniki, Greece Takahira Yamaguchi, Keio University, Japan Program Committee Chairs Nick Bassiliades, Aristotle University of Thessaloniki, Greece Vadim Ermolayev, Zaporizhzhya National University, Ukraine Advisory Committee Grigoris Antoniou, University of Huddersfield, UK Harold Boley, Faculty of Computer Science, University of New Brunswick, Fredericton, Canada James Hendler, Rensselaer Polytechnic Institute, Troy, NY, USA Guus Schreiber, VU University, Amsterdam, the Netherlands Amit Sheth, Ohio Center of Excellence on Knowledge-enabled Computing, Ohio, USA Workshops and Tutorials Chairs Costin Badica, University of Craiova, Romania Dimitris Vrakas, Aristotle University of Thessaloniki, Greece Industrial Track Chair John Davies, BT Research & Innovation, UK Publicity Chair Athena Vakali, Aristotle University of Thessaloniki, Greece Local Organization Co-Chairs Yannis Manolopoulos, Aristotle University of Thessaloniki, Greece Athanasios Tsadiras, Aristotle University of Thessaloniki, Greece Web
Chairs Efstratios Kontopoulos, International Hellenic University, Greece Fotis Kokkoras, T.E.I. of Thessaly, Greece From hamon at LIMSI.FR Sat Dec 14 21:49:50 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 14 Dec 2013 22:49:50 +0100 Subject: Livre: Ressources Lexicales Message-ID: Date: Fri, 13 Dec 2013 19:19:24 +0100 From: Nuria Gala Message-ID: <52AB4FAC.30805 at lif.univ-mrs.fr> X-url: http://benjamins.com/#catalog/books/lis.30/main Ressources Lexicales: Contenu, construction, utilisation, évaluation Edited by Núria Gala and Michael Zock, Aix-Marseille Université & LIF-CNRS http://benjamins.com/#catalog/books/lis.30/main Lexical resources (dictionaries, databases, thesauri, etc.) gather knowledge about words, their meanings and their uses. While for centuries they depended on printing and the textual format, there is nowadays a wide variety of tools and resources available in various electronic formats. The way lexical resources are viewed has thus changed considerably in recent decades. In particular, resources have appeared that are no longer designed as static entities but are modelled as databases or graphs, in which information is linked and dynamically accessible.
The field of lexical resources, at the crossroads of several disciplines including linguistics, lexicology, lexicography and natural language processing, is without doubt flourishing. The aim of this volume is to give a general overview of the field that reflects the current state of the art and ongoing developments. From hamon at LIMSI.FR Tue Dec 17 21:15:19 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:15:19 +0100 Subject: Appel: WoLLIC 2014 Message-ID: Date: Mon, 16 Dec 2013 08:09:25 -0200 From: Ruy de Queiroz Message-ID: X-url: http://wollic.org/wollic2014/ [Please circulate. Apologies for any cross-postings.]
WoLLIC 2014
21st Workshop on Logic, Language, Information and Computation
September 1st to 4th, 2014
Valparaiso, Chile

SCIENTIFIC SPONSORSHIP
Interest Group in Pure and Applied Logics (IGPL)
The Association for Logic, Language and Information (FoLLI)
Association for Symbolic Logic (ASL)
European Association for Theoretical Computer Science (EATCS)
European Association for Computer Science Logic (EACSL)
Sociedade Brasileira de Computação (SBC)
Sociedade Brasileira de Lógica (SBL)

ORGANISATION
Department of Computer Science, Universidad de Chile, Chile
Department of Computer Science, Pontificia Universidad Católica de Chile, Chile
Centro de Informática, Universidade Federal de Pernambuco, Brazil

HOSTED BY
Department of Informatics, Universidad Técnica Federico Santa María, Chile

CALL FOR PAPERS
WoLLIC is an annual international forum on inter-disciplinary research involving formal logic, computing and programming theory, and natural language and reasoning. Each meeting includes invited talks and tutorials as well as contributed papers. The twenty-first WoLLIC will be held at the Universidad Técnica Federico Santa María, from September 1st to 4th, 2014. It is sponsored by the Association for Symbolic Logic (ASL), the Interest Group in Pure and Applied Logics (IGPL), the Association for Logic, Language and Information (FoLLI), the European Association for Theoretical Computer Science (EATCS), the European Association for Computer Science Logic (EACSL), the Sociedade Brasileira de Computação (SBC), and the Sociedade Brasileira de Lógica (SBL).

PAPER SUBMISSION
Contributions are invited on all pertinent subjects, with particular interest in cross-disciplinary topics.
Typical but not exclusive areas of interest are: foundations of computing and programming; novel computation models and paradigms; broad notions of proof and belief; proof mining, type theory, effective learnability; formal methods in software and hardware development; logical approach to natural language and reasoning; logics of programs, actions and resources; foundational aspects of information organization, search, flow, sharing, and protection. Proposed contributions should be in English, and consist of a scholarly exposition accessible to the non-specialist, including motivation, background, and comparison with related works. They must not exceed 10 pages (in font 10 or higher), with up to 5 additional pages for references and technical appendices. The paper's main results must not be published or submitted for publication in refereed venues, including journals and other scientific meetings. It is expected that each accepted paper be presented at the meeting by one of its authors. Papers must be submitted electronically at the WoLLIC 2014 EasyChair website. (Please go to http://wollic.org/wollic2014/instructions.html for instructions.) A title and single-paragraph abstract should be submitted by Mar 24, 2014, and the full paper by Mar 28, 2014 (firm date). Notifications are expected by May 2, 2014, and final papers for the proceedings will be due by May 15, 2014 (firm date).

PROCEEDINGS
The proceedings of WoLLIC 2014, including both invited and contributed papers, will be published in advance of the meeting as a volume in Springer's LNCS series. In addition, abstracts will be published in the Conference Report section of the Logic Journal of the IGPL, and selected contributions will be published as a special post-conference WoLLIC 2014 issue of a scientific journal (to be confirmed).

INVITED SPEAKERS
Verónica Becher (U Buenos Aires) Juha Kontinen (U Helsinki) Aarne Ranta (U Gothenburg) Kazushige Terui (U Kyoto) Luca Viganò
(King's College London) Thomas Wilke (U Kiel)

STUDENT GRANTS
ASL sponsorship of WoLLIC 2014 will permit ASL student members to apply for a modest travel grant (deadline: May 1st, 2014). See http://www.aslonline.org/studenttravelawards.html for details.

IMPORTANT DATES
Mar 24, 2014: Paper title and abstract deadline
Mar 28, 2014: Full paper deadline
May 2, 2014: Author notification
May 15, 2014: Final version deadline (firm)

PROGRAMME COMMITTEE
Natasha Alechina (U Nottingham) Eric Allender (Rutgers U) Marcelo Arenas (PUC Chile) Steve Awodey (CMU) Julian Bradfield (U Edinburgh) Xavier Caicedo (U de Los Andes) Olivier Danvy (Aarhus U) Ulrich Kohlenbach (Tech U Darmstadt) (CHAIR) Marcus Kracht (U Bielefeld) Michiel van Lambalgen (U Amsterdam) Klaus Meer (Tech U Cottbus) George Metcalfe (Bern U) Dale Miller (INRIA Saclay / LIX) Russell Miller (CUNY) Sara Negri (U Helsinki) Nicole Schweikardt (U Frankfurt) (more to come)

STEERING COMMITTEE
Samson Abramsky, Johan van Benthem, Anuj Dawar, Joe Halpern, Wilfrid Hodges, Daniel Leivant, Leonid Libkin, Angus Macintyre, Grigori Mints, Luke Ong, Hiroakira Ono, Ruy de Queiroz.

ORGANISING COMMITTEE
Pablo Barceló (Universidad de Chile) (Local chair) Anjolina G. de Oliveira (U Fed Pernambuco) Ruy de Queiroz (U Fed Pernambuco) (co-chair) Juan Reutter (Pontificia Universidad Católica de Chile) Cristián Riveros (Pontificia Universidad Católica de Chile)

FURTHER INFORMATION
Contact one of the Co-Chairs of the Organising Committee.
WEB PAGE http://wollic.org/wollic2014/

From hamon at LIMSI.FR Tue Dec 17 21:20:08 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:20:08 +0100 Subject: Appel: IC 2014, Extension de l'appel a soumission jusqu'au 9 janvier 2014 Message-ID: Date: Mon, 16 Dec 2013 12:50:01 +0100 From: Catherine Faron Zucker Message-ID: <52AEE8E9.1010403 at i3s.unice.fr> X-url: http://www.irit.fr/IC2014

Call for papers for the 25th Journées Francophones d'Ingénierie des Connaissances: Pervasive knowledge: users immersed in knowledge-based systems and knowledge-based systems immersed in the world
http://www.irit.fr/IC2014
12 to 16 May 2014, Clermont-Ferrand
------------------------------------------------------------------------
Important dates:
Submission deadline: Thursday 9 January 2014 (extended from Thursday 19 December 2013)
Notification to authors: Thursday 13 February 2014
Final versions due: Monday 17 March 2014
Conference dates: 12 to 16 May 2014
------------------------------------------------------------------------
The Ingénierie des Connaissances (Knowledge Engineering) conference celebrates its 25th anniversary this year! It is the annual meeting of the French and French-speaking community interested in issues related to knowledge engineering.
Academics and industry practitioners meet there to exchange ideas and reflect on research problems arising in knowledge acquisition, representation and management. The rise of information and communication sciences and technologies, web technologies in particular, throughout society is transforming individual and collective practices. Knowledge engineering supports this evolution by devising the models, methods and tools that allow knowledge, and reasoning over that knowledge, to be integrated into computing environments. Indeed, knowledge is now ubiquitous, and at the same time the boundaries between Artificial Intelligence research communities are breaking down in order to address the problems that arise. Thus, building knowledge-based systems draws on knowledge engineering, natural language processing, machine learning and data mining; knowledge processing combines knowledge engineering with information retrieval, operations research and data analysis; modelling computing environments and their human or software actors brings together knowledge engineering and multi-agent systems; finally, knowledge engineering touches on topics that are classic in the human-computer interaction and technology-enhanced learning communities. By placing the user at the heart of computer systems in this way, knowledge engineering is called upon to assist users in processing the mass of data available on the web: to visualize, contextualize, explain, recommend, rank, predict, approximate, secure and make the data reliable, and to compensate for its imprecision or even inconsistency.
The topics of IC 2014 include (non-exhaustive list):

Knowledge representation and reasoning
- models, formalisms, formal and informal knowledge representation languages
- exchange, interoperability and reuse of models, standardization
- terminologies, thesauri, ontologies and lexicons
- types of ontologies and the role of reference ontologies
- methods and tools for the life cycle of representations
- managing ontology dynamics (versioning, evolution, integration and alignment)
- logical reasoning, inference, rule-based reasoning
- approximation, statistical reasoning, analogical reasoning, case-based reasoning, reasoning in non-classical logics

Design and construction of knowledge models and ontologies
- population of knowledge models
- knowledge acquisition from text
- interactive knowledge discovery in data, in logs, in interactions, on the web, etc.
- inference and business rules: acquisition, design, reuse, evolution, evaluation, exploitation
- metrics and evaluation of ontology quality

Web of data, semantic web
- opening and interlinking of data
- impact of open data on the web and of crowdsourcing on knowledge engineering
- social networks
- data stream management
- scalability: processing massive data
- processing heterogeneous data
- data provenance and trust
- metrics and evaluation of data quality

User-centred knowledge engineering
- visualization of knowledge and ontologies
- visualization of data interlinks and of reasoning over data
- explanation of reasoning
- user profiles, context models and adaptation, emotion models
- personalized access to knowledge: trust, recommendation, ranking, summarization
- assistance for applications and services: conversational agents
- technology-enhanced learning environments
- design and evaluation of interactions and interfaces with knowledge-based systems

Interdisciplinarity of knowledge engineering
- epistemology of knowledge engineering
- organization theory and knowledge engineering
- humanities, cognitive science and knowledge engineering
- engineering of scientific knowledge

Pervasive knowledge engineering: knowledge engineering in...
- natural language processing
- symbolic learning
- data mining
- information retrieval, indexing
- operations research
- data analysis
- multi-agent systems
- technology-enhanced learning environments
- human-computer interfaces

Applications
- applications in health, education and learning, culture, document retrieval, intelligent transport, business intelligence (BI), etc.
- emergence of new application domains
- corporate memories
- systems for collaboration and cooperation in organizations and groups
------------------------------------------------------------------------
Submissions:
Long papers (12 pages (new format), 20-minute oral presentation, 10 minutes of discussion): for completed work presenting validated results or in-depth surveys.
Short papers (6 pages (new format), 10-minute oral presentation, 5 minutes of discussion): for work in progress or work with partial research or applied results. This type of submission is for high-quality innovative work that is not yet mature enough for full elaboration and definitive conclusions.
Papers must follow the document styles given on the conference website: http://www.irit.fr/IC2014. Authors are invited to submit their papers in PDF format via EasyChair: https://www.easychair.org/conferences/?conf=ic2014. Accepted papers will appear in electronic proceedings, published on HAL and available on a USB key distributed to conference participants. An AFIA best paper award will be given by a jury made up of members of the conference steering committee.
------------------------------------------------------------------------
The Ingénierie des Connaissances conference is held under the auspices of the IC college of AFIA.
------------------------------------------------------------------------
Programme committee chair: Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)
Organizing committee chair: Catherine Roussey (Irstea)

From hamon at LIMSI.FR Tue Dec 17 21:23:28 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:23:28 +0100 Subject: Appel: 8eme colloque international du GEM&L Message-ID: Date: Tue, 17 Dec 2013 10:09:24 +0100 From: Luc Damas Message-ID: X-url: http://geml.eu/colloque/colloque-2014-toulouse-business-school/call-for-papers/

8th GEM&L International Conference
Toulouse Business School, Toulouse, 26-28 March 2014
Call for Papers
Langage et performance de l'entreprise (Language and corporate performance)
http://geml.eu/colloque/colloque-2014-toulouse-business-school/call-for-papers/
Schedule
Paper proposal: 10 January 2014
Notification of acceptance / revision: 20 February 2014
Final paper: 10 March 2014

From hamon at LIMSI.FR Tue Dec 17 21:22:04 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:22:04 +0100 Subject: Info: EMCL Study grants Message-ID: Date: Mon, 16 Dec 2013 16:42:38 +0100 From: Bertram Fronhöfer Message-ID: <52AF1F6E.40304 at tu-dresden.de> X-url: http://www.emcl-study.eu/home.html X-url: http://www.emcl-study.eu/fileadmin/emcl_booklet_tree/ma_em_grant.html
------------------------------------------------------------------------
Dear all, I'd like to draw your attention to the fact that fresh Erasmus Mundus scholarships are available for Non-European AND European students who enrol in our European Master's Program in Computational Logic in the fall of 2014. The deadline for application is 31 January, 2014. More details are given below. In particular, I'd like to draw your attention to the fact that we are able to provide grants to EU students for doing their project at the National ICT of Australia (NICTA). Please spread this information as widely as possible among friends and colleagues, at your old universities and at the places where you currently live and work. Many thanks -- Steffen
************************************************************************
The European Master's Program in Computational Logic
We are glad to announce the opportunity to join our European Master's Program in Computational Logic. This program is offered jointly at the Free University of Bozen-Bolzano in Italy, the Technische Universität Dresden in Germany, the Universidade Nova de Lisboa in Portugal and the Technische Universität Wien in Austria. Within this program you have the choice to study at two or three of the four European universities. In addition, you can do your project work at the National ICT of Australia (NICTA). You will graduate with an MSc in Computer Science and obtain a joint degree.
Information on the universities and the program, including the application procedure, is provided here: http://www.emcl-study.eu/home.html The language of instruction is English. Tuition fees are 3.000 EUR (for non-European students) and 1.000 EUR (for European students) per year. We would like to draw your attention to the ERASMUS-MUNDUS scholarship program. The ERASMUS-MUNDUS consortium offers 2-year scholarships of up to 48.000 EUR for non-EU students and up to 23.000 EUR for EU students of our European Master's Program in Computational Logic. More information on the scholarship program is available from: http://www.emcl-study.eu/fileadmin/emcl_booklet_tree/ma_em_grant.html Do not hesitate to contact us if you have any further questions. Kind regards -- Steffen Hölldobler
Prof. Dr. Steffen Hoelldobler International Center for Computational Logic Technische Universität Dresden 01062 Dresden, Germany phone: [+49](351)46 33 83 40 fax: [+49](351)46 33 83 42 email: sh at iccl.tu-dresden.de

From hamon at LIMSI.FR Tue Dec 17 21:27:18 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:27:18 +0100 Subject: Soft: JRC's Europe Media Monitor (EMM) - Free APPS for mobile version Message-ID: Date: Tue, 17 Dec 2013 15:49:14 +0100 From: Ralf Steinberger Message-id: <023b01cefb37$27ea4280$77bec780$@jrc.ec.europa.eu> X-url: http://emm.newsbrief.eu/

Dear all, this is to let you know that the highly multilingual Europe Media Monitor (EMM) applications are now available as free Apps for mobile devices running iOS and Android. You can download the EMM Apps via the links at the EMM-NewsBrief entry page http://emm.newsbrief.eu/ . Apps are currently available for mobile devices running Android and for the iPad. A version for the iPhone will be released soon. We would be delighted to receive your feedback on the newly released EMM Apps at the email address emm AT jrc DOT it. We look forward to hearing from you.

EUROPE MEDIA MONITOR (EMM)
EMM gathers a daily average of 175,000 news items in over 70 languages and analyses the news automatically using a wide range of JRC-developed computational linguistics tools. These include event extraction; automatic entity recognition, classification and disambiguation; name variant mapping; co-reference resolution; quotation recognition; opinion mining; multi-document summarisation; document clustering and classification; machine translation; information aggregation, including across languages; and more.

BACKGROUND AND READING
EMM (http://emm.newsbrief.eu/overview.html) is a freely available advert-free family of news monitoring and analysis applications developed by the OPTIMA Team (http://ipsc.jrc.ec.europa.eu/?id=178) at the Joint Research Centre (http://ec.europa.eu/dgs/jrc/) (JRC), which is the European Commission's in-house science service.
You can find scientific publications on the Europe Media Monitor (EMM) and its individual text mining tools at http://langtech.jrc.ec.europa.eu/JRC_Publications.html, including papers giving an introduction and a generic overview of EMM (http://langtech.jrc.ec.europa.eu/Documents/09_SIGIR-WS_Steinberger+frontmatter.pdf).

FREELY AVAILABLE LINGUISTIC RESOURCES
The JRC has also released a large volume of freely available multilingual linguistic resources (http://ipsc.jrc.ec.europa.eu/index.php?id=61) that can be used to develop or test a variety of multilingual and cross-lingual Natural Language Processing tools. These include parallel corpora in up to 26 languages; readily trained automatic document categorisation software (http://ipsc.jrc.ec.europa.eu/index.php?id=60) in 22 languages; dictionaries of names and their variant spellings (http://ipsc.jrc.ec.europa.eu/?id=42), including across languages and scripts; and more. See http://ipsc.jrc.ec.europa.eu/index.php?id=61 for details and to download the data and the tools.

Ralf Steinberger (http://langtech.jrc.ec.europa.eu/RS.html) and the OPTIMA team
European Commission - Joint Research Centre (JRC) 21027 Ispra (VA), Italy
URL - Applications: http://emm.newsbrief.eu/overview.html
URL - Resources: http://ipsc.jrc.ec.europa.eu/index.php?id=61

From hamon at LIMSI.FR Tue Dec 17 21:29:33 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 17 Dec 2013 22:29:33 +0100 Subject: Seminaire: Pierre-Yves OUDEYER, 19 Decembre 2013, GIPSA-lab Message-ID: Date: Tue, 17 Dec 2013 16:18:29 +0100 From: Thomas Hueber Message-Id: <0167C9FD-3058-4DC0-886F-30E1363406CC at gipsa-lab.grenoble-inp.fr> X-url: http://www.pyoudeyer.com X-url: http://flowers.inria.fr X-url: http://www.gipsa-lab.inpg.fr/~thomas.hueber

Hello everyone, This week we will welcome Pierre-Yves Oudeyer, from INRIA Bordeaux.
He will present his work this Thursday, 19 December, at 1:30 pm in the DPC-Ampère meeting room (B314). Please come along! (And feel free to circulate this announcement in your laboratories.)
The seminar team of the Speech and Cognition Department (Département Parole et Cognition), GIPSA-lab
_________________________
Curiosity-driven learning and development: How robots can help us understand humans
Pierre-Yves Oudeyer, Inria and Ensta ParisTech, France

A great mystery is how human infants develop: how they progressively discover their bodies, how they learn to interact with objects and social peers, and how they accumulate new skills throughout their lives. Constructing robots, and building mechanisms that model such developmental processes, is key to advancing our understanding of human development, in constant dialogue with the human and life sciences. I will present examples of robotics models of curiosity-driven learning and exploration, and show how developmental trajectories can self-organize, starting from discovery of the body, then object affordances, then vocal babbling and vocal interactions with others. In particular, I will show that the onset of language spontaneously forms out of such sensorimotor development.

Dr. Pierre-Yves Oudeyer is Research Director at Inria and head of the Inria and Ensta-ParisTech FLOWERS team (France). Before that, he was a permanent researcher at Sony Computer Science Laboratory for 8 years (1999-2007). He studied theoretical computer science at Ecole Normale Supérieure in Lyon, and received his Ph.D. degree in artificial intelligence from the University Paris VI, France. After working on computational models of language evolution, he is now working on developmental and social robotics, focusing on sensorimotor development, language acquisition and life-long learning in robots.
Strongly inspired by infant development, the mechanisms he studies include artificial curiosity, intrinsic motivation, the role of morphology in learning motor control, human-robot interfaces, joint attention and joint intentional understanding, and imitation learning. He has published a book and more than 80 papers in international journals and conferences, holds 8 patents, has given several invited keynote lectures at international conferences, and has received several prizes for his work in developmental robotics and on the origins of language. In particular, he is a laureate of the ERC Starting Grant EXPLORERS. He is editor of the IEEE CIS Newsletter on Autonomous Mental Development, and associate editor of IEEE Transactions on Autonomous Mental Development, Frontiers in Neurorobotics, and the International Journal of Social Robotics. He also works actively on the dissemination of science to the general public, through the writing of popular science articles and participation in radio and TV programs as well as science exhibitions. Web: http://www.pyoudeyer.com and http://flowers.inria.fr

Dr. Thomas Hueber
CNRS Researcher (Chargé de recherche), GIPSA-lab, Département Parole et Cognition
961 rue de la Houille Blanche - Domaine universitaire - BP 46 38402 Saint Martin d'Hères CEDEX FRANCE
Tel: +33 (0)4 76 57 49 40 Fax: +33 (0)4 76 57 47 10
Email: thomas.hueber at gipsa-lab.grenoble-inp.fr
Web: http://www.gipsa-lab.inpg.fr/~thomas.hueber

From hamon at LIMSI.FR Fri Dec 20 20:46:48 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Fri, 20 Dec 2013 21:46:48 +0100 Subject: Appel: Ateliers TALN 2014 Message-ID: Date: Thu, 19 Dec 2013 11:44:37 +0100 From: Nuria Gala Message-ID: <52B2CE15.4070803 at lif.univ-mrs.fr>

TALN 2014 - CALL FOR WORKSHOPS
The 2014 edition of the TALN conference will take place in Marseille from 1 to 4 July 2014. As in previous editions, one day will be devoted to workshops.

OBJECTIVES
A workshop addresses a particular TALN topic in order to bring together a few more focused talks. It has its own chair and its own programme committee. The workshop organizer is responsible for the call for papers and for coordinating the programme committee. The TALN organizers will handle logistics (room management, coffee breaks and distribution of the papers). Workshops will run in parallel (3 parallel sessions) over a full day or a half day (2 to 4 slots of 1h30 each).

CALENDAR
Workshop date: 1 July 2014
Proposal submission deadline: 31 January 2014
Response from the TALN programme committee: 7 February 2014
Final version of papers: 30 May 2014

PROPOSAL PROCEDURE
Workshop proposals should be sent electronically to alexis.nasr[arobas]lif.univ-mrs.fr and nuria.gala[arobas]lif.univ-mrs.fr. Proposals should include a concise description (2-3 pages) of the proposed theme, the programme committee and the desired duration (a full day or a half day). The TALN programme committee will select among all proposals.

FORMAT
Talks will be given in French (or in English for non-French speakers). Papers must follow the TALN format and be 12 to 14 pages long.
From hamon at LIMSI.FR Fri Dec 20 20:28:37 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Fri, 20 Dec 2013 21:28:37 +0100 Subject: Appel: 23rd IEEE Wetice, Track Web2Touch, June 23-25, Parma, Italy Message-ID: Date: Wed, 18 Dec 2013 16:26:07 +0100 From: Lina Soualmia Message-ID: X-url: http://web2touch2014.tudor.lu/ X-url: http://cmt.dmi.unipr.it/wetice14

CALL FOR PAPERS
Web2Touch 2014: Ontology Applications and Web Semantics
http://web2touch2014.tudor.lu/
Conference Track @ 23rd IEEE WETICE http://cmt.dmi.unipr.it/wetice14
June 23-25, 2014 - Parma - Italy
http://web2touch2014.tudor.lu

*Objectives*
The 2014 edition of the Web2Touch Track aims to present alternatives for addressing interoperability and distributed knowledge management problems in Web systems. The Web is a living medium for collaboration where people and computer systems can produce, share and consume information. However, several challenges in facilitating this collaboration remain open, including storage, retrieval, extraction and visualisation of huge quantities of information, interlinking, ontology classification and data fusion. New technologies and standards have been developed to provide more efficient use of the Web, in particular for using Cloud, Linked Open Data, and Big Data.
The key to the success of the Internet is the amount of published data available to everyone, but this also creates problems for end-users. Researchers are continuously working to improve Web applications and Web content structures. The aim is to avoid overwhelming human beings with irrelevant data, and to increase the capability of systems to correctly interpret Web content, thereby improving the quality of interaction on the Web.

This Track is intended as a forum to discuss theories, models, techniques and user requirements related to how a number of semantic interoperability problems can be coped with to achieve commercial, scientific, and sociological success for Web applications. Web2Touch is an open forum for studies in multiple application domains including, for example, Web science, eHealth, eGov, eLearning and collaborative systems in general.

*Topics*

Include but are not limited to:
- Case studies of social semantic web.
- Ontology and knowledge reuse and integration.
- Semantic web services.
- Experiences in semantic web support for collaborative work and business.
- Knowledge creation, management, maintenance, and validation.
- Collaborative ontology evolution and alignment.
- Web for sentiment analysis.
- Privacy issues in knowledge sharing.

*Important Dates*

- Submission deadline: Feb 4, 2014
- Notification of acceptance: March 14, 2014
- Camera ready papers: April 11, 2014
- Conference: June 23-25, 2014

*Paper Submission*

Papers submitted to W2T 2014 must be written in English, must be formatted in strict accordance with the Wetice formatting instructions (author kits and paper templates available for *Word*, *PDF*, and *LaTeX*) and must be submitted in PDF format. Papers can be up to 6 pages. By submitting a paper, authors commit to have the paper presented at the workshop by at least one of them if the paper is accepted.
Papers have to be submitted through the easychair system: https://www.easychair.org/conferences/?conf=web2touch

The previous editions of Web2Touch received contributions from many countries; the accepted ones are available in IEEE Xplore. The best papers from 2008 to 2010 were published in special issues of the International Journal of Web Portals (2011, vol 1 and 3), and extended versions of selected papers from Web2Touch 2012 were published in Concurrency and Computation (2013), Web Portals (2013) and Collaborative Enterprise (2013).

Contact: Marcos Da Silveira
web2touch at easychair.org

From hamon at LIMSI.FR Fri Dec 20 21:33:54 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:33:54 +0100
Subject: Job: Postdoctoral position in MT, Idiap, Switzerland
Message-ID:

Date: Fri, 20 Dec 2013 18:06:16 +0100
From: Andrei Popescu-Belis
Message-ID: <52B47908.5040007 at idiap.ch>
X-url: http://www.idiap.ch/education-and-jobs

Postdoctoral position in Machine Translation (OP-20131220-162727)
NLP Group, Idiap Research Institute, Martigny, Switzerland

Description:
Applications are invited for a postdoctoral researcher position in the field of machine translation (MT), in the context of the MODERN project (www.idiap.ch/project/modern) funded by the Swiss National Science Foundation. The MODERN project is headed by Idiap, with partners in Zurich, Geneva and Utrecht, and its overall goal is to model discourse entities and relations for improving MT.

The researcher will focus on enhancing the use of text-level features upon decoding, in a statistical MT framework. Several features are available from our previous projects, for instance pertaining to discourse relations or verb tenses, but also based on robust semantic relatedness measures. In addition, work in MODERN will focus on discourse entities (coreference and anaphora resolution) which will offer additional text-level constraints to experiment with.
We expect the postdoctoral researcher to advance the state of the art in text-level decoding with linguistically motivated features, demonstrating improvement over existing options (such as factored models) through quantitative evaluation.

Candidates should hold a PhD degree in statistical machine translation or natural language processing, and demonstrate a strong publication record at the time of application. Strong skills in programming and statistical modeling are also expected, along with a sensitivity to MT evaluation issues. While English is the main working language, knowledge of French, German or Dutch (the languages targeted in MODERN) would be an advantage. The PhD degree must be obtained before the starting date of the position.

The position is offered for one year, with renewal for a second year conditional on satisfactory results. Review of the applications will start on February 1st, 2014, and will continue until the position is filled. The appointment will start on August 1st, 2014.

How to apply:
Please fill in and submit your application through the Idiap online recruitment system, by clicking on the position's title at http://www.idiap.ch/education-and-jobs.

Contact information:
Further information about this position can be requested via the Idiap online recruitment system or by contacting Dr. Andrei Popescu-Belis, head of the NLP group.

About Idiap:
Idiap is an independent, non-profit research institute recognized and supported by the Swiss Government, and affiliated with the Ecole Polytechnique Fédérale de Lausanne (EPFL). It is located in the town of Martigny in Valais, a scenic region in the south of Switzerland, surrounded by the highest mountains of Europe, and offering exciting recreational activities, including hiking, climbing and skiing, as well as varied cultural activities. It is within close proximity to Geneva and Lausanne. Although Idiap is located in the French part of Switzerland, English is the working language.
Free French lessons are provided. Idiap offers competitive salaries and conditions at all levels in a young, dynamic, and multicultural environment. Idiap is an equal opportunity employer and is actively involved in the "Advancement of Women in Science" European initiative. The Institute seeks to maintain a principle of open competition (on the basis of merit) to appoint the best candidate, provides equal opportunity for all candidates, and equally encourages both genders to apply.

Andrei Popescu-Belis

From hamon at LIMSI.FR Fri Dec 20 21:14:36 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:14:36 +0100
Subject: Job: Ingenieur en Traitement Automatique des Langues, INIST-CNRS
Message-ID:

Date: Fri, 20 Dec 2013 09:08:06 +0100
From: "GAUTIER, Patricia"
Message-Id:

Natural Language Processing Engineer (Master's level, Bac + 5)

Position: Natural Language Processing Engineer
Contract: fixed-term (CDD)
Employer: CNRS
Workplace: Vandœuvre-lès-Nancy

Send a CV and cover letter to patricia.gautier at inist.fr

Job description

INIST-CNRS is a CNRS service unit that collects, processes and disseminates scientific and technical information to the research community. As part of the Initiatives d'Excellence program, CNRS leads the ISTEX project (www.istex.fr).
For its part, INIST-CNRS is in charge of a national platform for storing, hosting and disseminating the archives of scientific documents acquired within this project. The volume of data to be processed, the heterogeneity of the sources and the size of the target community (ESR, higher education and research) are unprecedented. The platform also aims to offer researchers data processing services (data extraction, text mining...).

The person recruited will work exclusively on this project for the duration of the contract, and will be tasked with studying and testing existing tools for robust text mining and annotation. He or she will analyze the available tools, whether commercial or stemming from research work, and will make recommendations for integrating them into the platform under development. Finally, he or she will carry out these integrations in collaboration with the development team.

Skills

Required:
In-depth knowledge of Natural Language Processing methodologies, techniques and tools, and of formats for describing documents, annotations and terminological resources (XML and, for example, TEI, SKOS, TBX, ...)
Integration of solutions in a Linux environment
Good analytical and summarizing skills

Desirable:
Ability to develop autonomously in a language such as Java, Python or PHP
Experience with large data volumes and Semantic Web technologies

Profile

You hold a Master's-level degree (Bac+5) in computer science and language engineering, and you have experience in the field. Whether or not you have already held this type of position, you are available and quickly operational, and you are autonomous, organized and rigorous. You are service-minded, learn and adapt quickly, and have good interpersonal skills.
Location: VANDOEUVRE-LES-NANCY
Contract: 12-month fixed-term contract (CDD)
Salary: according to degree and experience
Start date: February 1, 2014

From hamon at LIMSI.FR Fri Dec 20 21:32:27 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:32:27 +0100
Subject: Info: AFIA, Prix de These en Intelligence Artificielle 2014
Message-ID:

Date: Fri, 20 Dec 2013 17:02:07 +0100 (CET)
From: Amedeo Napoli
Message-ID: <1763010084.15272569.1387555327165.JavaMail.root at loria.fr>
X-url: http://www.afia.asso.fr/tiki-index.php?page=Prix+de+Th%C3%A8se+en+Intelligence+Artificielle

Hello everyone,
with my apologies for multiple postings,
best wishes for the end of the year,
Amedeo Napoli

====================================================================
AFIA: 2014 PhD Thesis Prize in Artificial Intelligence (Prix de Thèse en Intelligence Artificielle)

In 2014, AFIA is again awarding its PhD thesis prize in Artificial Intelligence, to promote and recognize the best research work by young researchers in Artificial Intelligence.

For this 2014 edition, the jury is made up both of members chosen among the ECCAI Fellows who are AFIA members and of external members chosen for the occasion. The jury is once again chaired this year by Amedeo Napoli (LORIA Nancy).

The first prize carries an award of 1000 euros. Depending on the quality of the submissions received, one or two further prizes or honorable mentions may be awarded.

Young researchers who obtained a doctorate in France during the 2013 calendar year may apply. In addition, AFIA would appreciate it if the candidates were up to date with their 2014 AFIA membership fee, which would show that they are aware of the role AFIA can play in promoting Artificial Intelligence in all its forms, in France and abroad.

The application requirements are detailed at:
http://www.afia.asso.fr/tiki-index.php?page=Prix+de+Th%C3%A8se+en+Intelligence+Artificielle

The submission deadline is
**** Monday, January 27, 2014 ****

The results will be announced from the end of April onwards. The winner will receive the prize during the RFIA conference, which takes place from June 30 to July 4 in Rouen, and will have the opportunity to present his or her work during a special session of the Rencontres des Jeunes Chercheurs en Intelligence Artificielle (RJCIA 2014).
====================================================================

From hamon at LIMSI.FR Fri Dec 20 21:31:02 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:31:02 +0100
Subject: Ressources: ELRA - Language Resources Catalogue - Update
Message-ID:

Date: Fri, 20 Dec 2013 16:58:03 +0100
From: ELRA ELDA Information
Message-ID: <52B4690B.5000801 at elda.org>
X-url: http://catalog.elra.info/product_info.php?products_id=1214
X-url: http://catalog.elra.info/product_info.php?products_id=1215

Our apologies if you have received multiple copies of this announcement.

We are happy to announce that 1 new Speech Language Resource and 1 new Written Corpus are now available in our catalogue.

*ELRA-S0365 aGender*
aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, read text and answered some open questions. The corpus contains the voices of 945 German speakers (approx. minimum of 100 speakers per class), each delivering 18 speech items in up to six different sessions.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1214

*ELRA-W0074 Amharic-English bilingual corpus*
The Amharic-English bilingual corpus contains parallel text from legal and news domains in Amharic script, in transliterated form and in English. The corpus contains 232,653 words in Amharic and 291,701 in English.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1215

For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli at elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html

From hamon at LIMSI.FR Fri Dec 20 20:44:28 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:44:28 +0100
Subject: Seminaire: BLRI, Pascal Belin, Aix-en-Provence, 24 janvier 2014
Message-ID:

Date: Wed, 18 Dec 2013 17:43:15 +0100
From: Nadéra Bureau
Message-ID: <016601cefc10$3fdf5400$bf9dfc00$@bureau at lpl-aix.fr>

Friday, January 24, 2014, 11:00
Conference room B011, building B
5 avenue Pasteur, Aix-en-Provence (Labex BLRI)

Pascal Belin (Institut des Neurosciences de La Timone, UMR 7289, CNRS & Aix-Marseille Université, Marseille, France)

A Vocal Brain: Cerebral Processing of Voice Information

Abstract:
The human voice carries speech but also a wealth of socially-relevant, speaker-related information. Listeners routinely perceive precious information on the speaker's identity (gender, age), affective state (happy, scared), as well as more subtle cues on perceived personality traits (attractiveness, dominance, etc.), strongly influencing social interactions.
Using voice psychoacoustics and neuroimaging techniques, we examine the cerebral processing of person-related information in perceptual and neural voice representations. Results indicate a cerebral architecture of voice cognition sharing many similarities with the cerebral organization of face processing, with the main types of information in voices (identity, affect, speech) processed in interacting, but partly dissociable functional pathways.

From hamon at LIMSI.FR Fri Dec 20 20:22:52 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:22:52 +0100
Subject: Appel: TSD 2014, Preliminary Announcement
Message-ID:

Date: Wed, 18 Dec 2013 14:39:52 +0100
From: TSD 2014
Message-Id:
X-url: http://www.tsdconference.org/

*********************************************************
TSD 2014 - PRELIMINARY ANNOUNCEMENT
*********************************************************

Seventeenth International Conference on TEXT, SPEECH and DIALOGUE (TSD 2014)
Brno, Czech Republic, 8-12 September 2014
http://www.tsdconference.org/

The conference is organized by the Faculty of Informatics, Masaryk University, Brno, and the Faculty of Applied Sciences, University of West Bohemia, Pilsen. The conference is supported by the International Speech Communication Association.
Venue: Brno, Czech Republic TSD SERIES TSD series evolved as a prime forum for interaction between researchers in both spoken and written language processing from all over the world. Proceedings of TSD form a book published by Springer-Verlag in their Lecture Notes in Artificial Intelligence (LNAI) series. TOPICS Topics of the conference will include (but are not limited to): Corpora and Language Resources (monolingual, multilingual, text and spoken corpora, large web corpora, disambiguation, specialized lexicons, dictionaries) Speech Recognition (multilingual, continuous, emotional speech, handicapped speaker, out-of-vocabulary words, alternative way of feature extraction, new models for acoustic and language modelling) Tagging, Classification and Parsing of Text and Speech (morphological and syntactic analysis, synthesis and disambiguation, multilingual processing, sentiment analysis, credibility analysis, automatic text labeling, summarization, authorship attribution) Speech and Spoken Language Generation (multilingual, high fidelity speech synthesis, computer singing) Semantic Processing of Text and Speech (information extraction, information retrieval, data mining, semantic web, knowledge representation, inference, ontologies, sense disambiguation, plagiarism detection) Integrating Applications of Text and Speech Processing (machine translation, natural language understanding, question-answering strategies, assistive technologies) Automatic Dialogue Systems (self-learning, multilingual, question-answering systems, dialogue strategies, prosody in dialogues) Multimodal Techniques and Modelling (video processing, facial animation, visual speech synthesis, user modelling, emotions and personality modelling) Papers on processing of languages other than English are strongly encouraged. 
PROGRAM COMMITTEE Hynek Hermansky, USA (general chair) Eneko Agirre, Spain Genevieve Baudoin, France Paul Cook, Australia Jan Cernocky, Czech Republic Simon Dobrisek, Slovenia Karina Evgrafova, Russia Darja Fiser, Slovenia Radovan Garabik, Slovakia Alexander Gelbukh, Mexico Louise Guthrie, GB Jan Hajic, Czech Republic Eva Hajicova, Czech Republic Yannis Haralambous, France Ludwig Hitzenberger, Germany Jaroslava Hlavacova, Czech Republic Ales Horak, Czech Republic Eduard Hovy, USA Maria Khokhlova, Russia Daniil Kocharov, Russia Ivan Kopecek, Czech Republic Valia Kordoni, Germany Steven Krauwer, The Netherlands Siegfried Kunzmann, Germany Natalija Loukachevitch, Russia Vaclav Matousek, Czech Republic Diana McCarthy, United Kingdom France Mihelic, Slovenia Hermann Ney, Germany Elmar Noeth, Germany Karel Oliva, Czech Republic Karel Pala, Czech Republic Nikola Pavesic, Slovenia Fabio Pianesi, Italy Maciej Piasecki, Poland Adam Przepiorkowski, Poland Josef Psutka, Czech Republic James Pustejovsky, USA German Rigau, Spain Leon Rothkrantz, The Netherlands Anna Rumshisky, USA Milan Rusko, Slovakia Mykola Sazhok, Ukraine Pavel Skrelin, Russia Pavel Smrz, Czech Republic Petr Sojka, Czech Republic Stefan Steidl, Germany Georg Stemmer, Germany Marko Tadic, Croatia Tamas Varadi, Hungary Zygmunt Vetulani, Poland Pascal Wiggers, The Netherlands Yorick Wilks, GB Marcin Wolinski, Poland Victor Zakharov, Russia KEYNOTE SPEAKERS Ralph Grishman, New York University, USA Bernardo Magnini, FBK - Fondazione Bruno Kessler, Italy FORMAT OF THE CONFERENCE The conference program will include presentation of invited papers, oral presentations, and poster/demonstration sessions. Papers will be presented in plenary or topic oriented sessions. Social events including a trip in the vicinity of Brno will allow for additional informal interactions. 
CONFERENCE PROGRAM The conference program will include oral presentations and poster/demonstration sessions with sufficient time for discussions of the issues raised. IMPORTANT DATES March 15 2014 ............ Submission of abstract March 22 2014 ............ Submission of full papers May 15 2014 .............. Notification of acceptance May 31 2014 .............. Final papers (camera ready) and registration August 3 2014 ............ Submission of demonstration abstracts August 10 2014 ........... Notification of acceptance for demonstrations sent to the authors September 8-12 2014 ...... Conference date The contributions to the conference will be published in proceedings that will be made available to participants at the time of the conference. OFFICIAL LANGUAGE The official language of the conference is English. ADDRESS All correspondence regarding the conference should be addressed to Ales Horak, TSD 2014 Faculty of Informatics, Masaryk University Botanicka 68a, 602 00 Brno, Czech Republic phone: +420-5-49 49 18 63 fax: +420-5-49 49 18 20 email: tsd2014 at tsdconference.org The official TSD 2014 homepage is: http://www.tsdconference.org/ LOCATION Brno is the second largest city in the Czech Republic with a population of almost 400.000 and is the country's judiciary and trade-fair center. Brno is the capital of South Moravia, which is located in the south-east part of the Czech Republic and is known for a wide range of cultural, natural, and technical sights. South Moravia is a traditional wine region. Brno had been a Royal City since 1347 and with its six universities it forms a cultural center of the region. Brno can be reached easily by direct flights from London, Moscow, Saint Petersburg, Eindhoven, Rome and Prague and by trains or buses from Prague (200 km) or Vienna (130 km). 
From hamon at LIMSI.FR Fri Dec 20 21:49:06 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Fri, 20 Dec 2013 22:49:06 +0100 Subject: Stage: Internship in NLP and MT, Idiap, Switzerland Message-ID: Date: Fri, 20 Dec 2013 18:06:35 +0100 From: Andrei Popescu-Belis Message-ID: <52B4791B.40306 at idiap.ch> X-url: http://www.idiap.ch/education-and-jobs Internship in Natural Language Processing and Machine Translation (OP-20131220-163254) NLP Group, Idiap Research Institute, Martigny, Switzerland Description: Applications are invited for a 6-month internship at the Master level in the field of natural language processing and statistical machine translation. The internship is offered in relation to the Swiss MODERN project (www.idiap.ch/project/modern) aiming at modeling discourse entities and relations for improving MT. The intern will work within the NLP group at the Idiap Research Institute. The goal of the internship is to integrate existing tools for anaphora resolution (in-house and third party ones) with phrase-based SMT models (mainly using the Moses open-source system). Following previous work, the intern will examine whether statistical predictions about the features of target pronouns (gender, number, grammatical function, and others) can be used when training SMT and when translating (decoding) new texts. The intern will implement methods to derive such features, will use the features in an SMT model (e.g. a factored one), and will measure the variation in translation quality. The applicants should have a background in computer science or linguistics, with previous experience in SMT and NLP being a strong advantage. The applicants should have demonstrable programming skills in at least one programming language such as Perl or Python, Java or C/C++. Experience with the Moses system and/or anaphora resolution systems would be a plus. 
Good command of English is mandatory and knowledge of another European language such as French, German or Dutch would be an advantage. The screening of the applications will start on February 1st, 2014 and will continue until the position is filled. The intended starting date is in spring 2014. The appointment is for 6 months, with a gross internship salary of 2000 CHF per month. Participation in the MODERN project will entail interaction with project partners at the Universities of Zurich, Geneva, and Utrecht. How to apply: Please fill in and submit your application through the Idiap online recruitment system, by clicking on the position's title at http://www.idiap.ch/education-and-jobs. Contact information: Further information about this position can be requested via the Idiap online recruitment system or by contacting Dr. Andrei Popescu-Belis, head of the NLP group. About Idiap: Idiap is an independent, non-profit research institute recognized and supported by the Swiss Government, and affiliated with the Ecole Polytechnique F?d?rale de Lausanne (EPFL). It is located in the town of Martigny in Valais, a scenic region in the south of Switzerland, surrounded by the highest mountains of Europe, and offering exciting recreational activities, including hiking, climbing and skiing, as well as varied cultural activities. It is within close proximity to Geneva and Lausanne. Although Idiap is located in the French part of Switzerland, English is the working language. Free French lessons are provided. Idiap offers competitive salaries and conditions at all levels in a young, dynamic, and multicultural environment. Idiap is an equal opportunity employer and is actively involved in the "Advancement of Women in Science" European initiative. The Institute seeks to maintain a principle of open competition (on the basis of merit) to appoint the best candidate, provides equal opportunity for all candidates, and equally encourage both genders to apply. 
Andrei Popescu-Belis

From hamon at LIMSI.FR Fri Dec 20 20:23:39 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:23:39 +0100
Subject: Job: Poste d'Ingenieur d'Etude au GREYC (date limite: 16 Janvier 2014)
Message-ID:

Date: Wed, 18 Dec 2013 15:38:29 +0100
From: Gaël Dias
Message-ID: <52B1B365.1040909 at unicaen.fr>
X-url: http://web-rh.dsi.cnrs.fr/afip/owa/consult.affiche_fonc?code_fonc=V61008&code_corps=&code_dr=19&code_bap=&code_ds=61&nbjours=

--- Apologies for multiple postings ---

==================================================
========== IE POSITION at GREYC ==========
========== Deadline: January 16, 2014 ==========
==================================================

GREYC (UMR 6072) in Caen is seeking an engineer (ingénieur d'étude, IE). Working with the laboratory's research teams, the engineer will contribute to the design of software platforms for image processing, digital documents and languages, problem solving in complex systems, optimization, and security.

The position is to be filled in the first instance through internal CNRS mobility (NOEMI). If no candidate is suitable, however, applications from civil servants outside CNRS will be considered. The campaign is open until January 16, 2014.

The job profile can be found at the following URL:
http://web-rh.dsi.cnrs.fr/afip/owa/consult.affiche_fonc?code_fonc=V61008&code_corps=&code_dr=19&code_bap=&code_ds=61&nbjours=

For any further information, do not hesitate to contact Gaël Dias (gael.dias at uniaen.fr), full professor (Professeur des Universités) at the Université de Caen Basse-Normandie and head of the HULTECH (Human Language Technology) team at GREYC.

From hamon at LIMSI.FR Fri Dec 20 20:20:58 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 21:20:58 +0100
Subject: Job: Post-doc, Labex OBVIL
Message-ID:

Date: Wed, 18 Dec 2013 09:02:24 +0100
From: Jean-Gabriel Ganascia
Message-ID: <52B15690.6080701 at Ganascia.name>

*Paris - Sorbonne Universités*

*JOB TITLE*
*Engineer attached to the Labex OBVIL (specialty: artificial intelligence)*

Category: post-doctoral contract
Start date: March 1, 2014
Duration: renewable fixed-term contract (CDD)

Host structure: Laboratoire d'excellence OBVIL
Location: Maison de la Recherche, 28 rue serpente, 75006 Paris
Reporting lines within the structure: Director of the LABEX OBVIL, Didier Alexandre; Deputy Director of the LABEX OBVIL, Jean-Gabriel Ganascia
Direct supervisor (N+1): Jean-Gabriel Ganascia
Working time for the position: 100%

*Missions of the unit*

This position is intended for a young researcher in artificial intelligence with an interest in the *digital humanities*.

The Laboratoire d'excellence OBVIL is part of the PRES Sorbonne-Universités and brings together researchers from 7 host teams (équipes d'accueil) and 2 CNRS UMRs. It gathers faculty members and researchers from the Paris-Sorbonne and Pierre-et-Marie-Curie universities, some of them specialists in literature, others in cognitive science and computer science. It aims to develop all the resources offered by computer applications to examine French literature, from that of the past to the most contemporary. It is concerned with questions of literary value, literary life and authorship, and with the study of translations, transpositions and adaptations, in order to understand how canons are formed. It develops programs and tools as part of the research programs of the labex OBVIL.

*Functions of the position*

The functions are the post-holder's contribution to the missions of the unit.

*Software development for the digital humanities:*

- Digital tools for text comparison, search for homologies in texts, named entity detection, semantic indexing, corpus mapping, ontology construction and visualization, etc. have been built in recent years within the ACASA team of LIP6 - Université Pierre et Marie Curie and the labex OBVIL. The task now is to make them available to a broad community comprising not only the literature researchers of OBVIL but also the interested public at large. The post-holder will help validate them on sizeable corpora and build interfaces to make them accessible to everyone on the web.

Activities of the position

The activities are the main operations to be carried out in order to fulfil the functions. They are the concrete expression of the functions.

- A first strand will concern the design of interfaces providing access to the tools for text alignment (MEDITE), detection of homologies in sequences (PHOEBUS) and identification of ring structures. The aim is not only to visualize the results, but also to make these tools freely accessible on the web.
- A second strand will concern the development of ontologies for indexing literary texts and the introduction of Semantic Web techniques into their processing.
- Finally, a third strand will concern corpus mapping using the memory islands (îles de mémoire) developed within the ACASA team.

Skills

Skills take the form of knowledge, know-how and interpersonal skills.

Hold a PhD in computer science.
Master:
- the fundamental concepts of artificial intelligence,
- the basic techniques of natural language processing,
- the design of visual interfaces,
- the essential notions of the Semantic Web (ontologies, knowledge representation, etc.).
Be able to integrate into a computer science research team and show an interest in the digital humanities.

*Contact person(s):*
ALEXANDRE Didier, director of the LABEX OBVIL
didier.alexandre at paris-sorbonne.fr
GANASCIA Jean-Gabriel, head of the ACASA team of LIP6 - Université Pierre et Marie Curie
Jean-Gabriel.Ganascia at lip6.fr

The application will consist of a CV, a presentation of the doctoral work and of previous research activities and contracts, and a cover letter.

*Application deadline: February 1, 2014, to be sent to the Labex OBVIL, Université Paris-Sorbonne, Maison de la Recherche, 26 rue Serpente, 75006 Paris, or to clarisse.barthelemy at gmail.com*

From hamon at LIMSI.FR Fri Dec 20 21:37:36 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:37:36 +0100
Subject: Sujet de These: PhD Position in NLP for MT, Idiap, Switzerland
Message-ID:

Date: Fri, 20 Dec 2013 18:06:23 +0100
From: Andrei Popescu-Belis
Message-ID: <52B4790F.6030808 at idiap.ch>
X-url: http://www.idiap.ch/education-and-jobs

PhD Position in Natural Language Processing for Machine Translation (OP-20131220-162332)
NLP Group, Idiap Research Institute, Martigny, Switzerland

Description:
Applications are invited for a PhD position in natural language processing applied to machine translation, in the context of the MODERN project headed by Idiap, with partners in Zurich, Geneva and Utrecht (www.idiap.ch/project/modern) funded by the Swiss National Science Foundation. The overall goal of the project is to model discourse entities and relations for improving MT.

The goal of the doctoral thesis, within MODERN, will be to design a probabilistic model of text-level coreference and anaphora resolution, in relation to discourse structure. The model will take advantage of rich semi-structured resources and semantic relatedness measures, along with insights on the interplay between referential and discursive relations.
The model will be implemented and tested both intrinsically, on coreference tasks for English and a second language, and extrinsically, in combination with machine translation. Work on this latter aspect will involve collaboration with a postdoc in MT.

Candidates should have a strong background in computer science, and good knowledge of natural language processing, in particular of statistical methods based on machine learning. They should have strong programming skills (e.g. in Python, Perl, Java or C/C++) as well as good knowledge of English as a working language. Knowledge of another language such as French, German, or Dutch is an advantage. A strong academic record and a clear aptitude for creative research will be priority selection criteria.

The successful candidate will be registered as a PhD student in the EPFL EDEE doctoral program, to which he/she will have to apply with support from Idiap. The position is funded by the Swiss National Science Foundation, with a maximum duration of four years. Review of the applications will start immediately and will continue until the position is filled. The starting date is March 2014, but later dates can be considered as well.

How to apply: Please fill in and submit your application through the Idiap online recruitment system, by clicking on the position's title at http://www.idiap.ch/education-and-jobs.

Contact information: Further information about this position can be requested via the Idiap online recruitment system or by contacting Dr. Andrei Popescu-Belis, head of the NLP group.

About Idiap: Idiap is an independent, non-profit research institute recognized and supported by the Swiss Government, and affiliated with the Ecole Polytechnique Fédérale de Lausanne (EPFL).
It is located in the town of Martigny in Valais, a scenic region in the south of Switzerland, surrounded by the highest mountains of Europe, and offering exciting recreational activities, including hiking, climbing and skiing, as well as varied cultural activities. It is within close proximity to Geneva and Lausanne. Although Idiap is located in the French part of Switzerland, English is the working language. Free French lessons are provided.

Idiap offers competitive salaries and conditions at all levels in a young, dynamic, and multicultural environment. Idiap is an equal opportunity employer and is actively involved in the "Advancement of Women in Science" European initiative. The Institute seeks to maintain a principle of open competition (on the basis of merit) to appoint the best candidate, provides equal opportunity for all candidates, and equally encourages both genders to apply.

Andrei Popescu-Belis

From hamon at LIMSI.FR Fri Dec 20 21:28:31 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 20 Dec 2013 22:28:31 +0100
Subject: Cursus: Master Erasmus Mundus en TAL
Message-ID:

Date: Fri, 20 Dec 2013 15:25:46 +0100
From: Sylvain Pogodalla
Message-ID: <21172.21354.208537.589923 at fanon.loria.fr>
X-url: http://lct-master.org
X-url: http://ec.europa.eu/education/external-relation-programmes/mundus_en.htm
X-url: http://eacea.ec.europa.eu/erasmus_mundus/

*Master's scholarships* for *European and non-European* students. Please forward to students who wish to pursue a master's degree.

The Erasmus Mundus programme in "Language and Communication Technologies" (EMMLCT) http://lct-master.org invites *European* and *non-European* students to apply to take part in the programme.

*The application deadline is 12 January 2014.*

*Key information*

+ teaching in English
+ duration: 2 years
+ each year of study at a different partner institution
+ specialisation in the methods and technologies of computational linguistics
+ scholarships from the European Union's Erasmus Mundus programme, *including for European students*

*The EMMLCT programme is offered by the following consortium of universities*

1. Saarland University in Saarbruecken, Germany (coordinator)
2. University of Trento, Trento, Italy
3. University of Malta, Malta
4. University of Lorraine, Nancy, France
5. Charles University, Prague, Czech Republic
6. Rijksuniversiteit Groningen, The Netherlands
7. Shanghai Jiao Tong University, China
8. The University of Melbourne, Australia

The University of the Basque Country / Euskal Herriko University, San Sebastian, Spain also takes part as an associated partner.

*The EMMLCT programme in brief*

The EMMLCT programme is an international Master's. Students spend each of the two years at a different university of the consortium. Once the requirements have been met, students obtain two Master's degrees that are nationally recognised in each of the countries of study. Courses comprise compulsory core modules as well as advanced modules in language and communication technologies and in computer science, an optional project, and a Master's thesis.

The programme aims to give students skills and perspectives on the various disciplines that contribute to the methods used in language and communication technologies, and to strengthen their knowledge of scientific methods. In addition, students acquire practical knowledge by choosing a combination of modules in language technology, computational and theoretical linguistics, and computer science.

The EMMLCT programme was designed to take into account the demands of industry and research in the field of language technologies. It offers learning and training opportunities.

The EMMLCT programme is approved and supported by the European Union's Erasmus Mundus programme. For more information:
http://ec.europa.eu/education/external-relation-programmes/mundus_en.htm
http://eacea.ec.europa.eu/erasmus_mundus/

*Applications*

All applicants must meet the following conditions:

1. Hold a Bachelor's degree (Licence or equivalent) in computer science, linguistics, computational linguistics, cognitive science, mathematics, artificial intelligence or any other relevant discipline. This degree must have been obtained before the start of the EMMLCT master's courses.
2. Language skills: for students whose native language is not English, a certificate of proficiency in English, the language of instruction, is required (TOEFL, IELTS, Cambridge certificate, etc.).

Applications are made online. More information and application instructions are available at: http://lct-master.org

Address for further information: lct-info at coli.uni-sb.de

Programme coordination:
Prof. Hans Uszkoreit
Dr. Ing. Ivana Kruijff-Korbayova
Department of Computational Linguistics and Phonetics
Saarland University
Saarbruecken, Germany
http://lct-master.org

From hamon at LIMSI.FR Tue Dec 24 17:44:14 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 24 Dec 2013 18:44:14 +0100
Subject: Appel: COLING 2014, Dublin, Ireland
Message-ID:

Date: Sun, 22 Dec 2013 08:28:18 +0000
From: ???
Message-ID: <01648087DD5D4349A12B758B6F475B842356E92B at SejongSRV.sejong.knu.ac.kr>
X-url: http://www.coling-2014.org

We apologize if you received multiple copies
------------------------------------------------------------------------

1st (Preliminary) Call for Papers - Coling 2014
The 25th International Conference on Computational Linguistics
August 23 - 29, 2014
Dublin, Ireland
http://www.coling-2014.org

The International Committee on Computational Linguistics (ICCL) is pleased to announce the 25th International Conference on Computational Linguistics (Coling 2014), at Dublin City University (DCU, Dublin, Ireland, European Union). DCU is a young, dynamic and ambitious university with a mission to transform lives and societies through education, research and innovation.
Most of the local organizers are from CNGL, Ireland's Centre for Global Intelligent Content (formerly the Centre for Next Generation Localization), which embodies the leading position of Ireland in the global localization/internationalization business, a strong focus on language technologies including machine translation, computational linguistics and natural language processing, as well as on intelligent management, search, retrieval, transformation and adaptation of content.

Coling will cover a broad spectrum of technical areas related to natural language and computation. The conference will include full papers (presented as oral presentations or posters), demonstrations, tutorials, and workshops.

TOPICS OF INTEREST

Coling 2014 solicits papers and demonstrations on original and unpublished research on the following topics, including, but not limited to:

- pragmatics, semantics, syntax, grammars and the lexicon;
- cognitive, mathematical and computational models of language processing;
- models of communication by language;
- lexical semantics and ontologies;
- word segmentation, tagging and chunking;
- parsing, both syntactic and deep;
- generation and summarization;
- paraphrasing, textual entailment and question answering;
- speech recognition, text-to-speech and spoken language understanding;
- multimodal and natural language interfaces and dialogue systems;
- information retrieval, information extraction and knowledge base linking;
- machine learning for natural language;
- modeling of discourse and dialogue;
- sentiment analysis, opinion mining and social media;
- multilingual processing, machine translation and translation aids;
- applications, tools and language resources;
- system evaluation methodology and metrics.

In all relevant areas, we encourage authors to include analysis of the influence of theories (intuitions, methodologies, insights, ...) on technologies (computational algorithms, methods, tools, data, ...)
and/or contributions of technologies to theory development. In technologically oriented papers, we encourage in-depth analysis and discussion of errors made in the experiments described, if possible linking them to the presence or absence of linguistically-motivated features. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome.

PAPER REQUIREMENTS

Papers should describe original work; they should emphasize completed work or well-advanced ongoing research rather than intended work, and should indicate clearly the state of completion of the reported results. Wherever appropriate, concrete evaluation results should be included. Submissions will be judged on correctness, originality, technical strength, significance and relevance to the conference, and interest to the attendees.

Submissions presented at the conference should mostly contain new material that has not been presented at any other meeting with publicly available proceedings. Papers that are being submitted in parallel to other conferences or workshops must indicate this on the title page, as must papers that contain significant overlap with previously published work.

REVIEWING

Reviewing will be double blind. It will be managed by an international Conference Program Committee consisting of Program Chairs, members of the Scientific Advisory Board and Area Chairs, who will be assisted by invited reviewers.

INSTRUCTIONS FOR AUTHORS

For Coling 2014, there will be one category of research papers only. All of the papers will be included in conference proceedings, this time in electronic form only. The maximum submission length is 8 pages (A4), plus two extra pages for references. Authors of accepted papers will be given additional space in the camera-ready version to reflect space needed for changes stemming from reviewers' comments. Authors can indicate their preference for presentation mode (i.e.
oral or poster presentation) in the submission form, and the reviewers will recommend an appropriate mode of presentation to the program committee, which will then decide. There will be no distinction in the proceedings between research papers presented orally vs. as posters.

Papers shall be submitted in English, anonymized with regard to the authors and/or their institution (no author-identifying information on the title page nor anywhere in the paper), including referencing style as usual. Papers must conform to official Coling 2014 style guidelines, which will be available on the Coling 2014 website.

Submission and reviewing will be managed online by the START system. The only accepted format for submitted papers is Adobe PDF. Submissions must be uploaded on the START system by the submission deadlines; submissions after that time will not be reviewed. To minimize network congestion, we request authors to upload their submissions as early as possible.

Important Notice

[1] In order to allow participants to be acquainted with the published papers ahead of time, which in turn should facilitate discussions at Coling 2014, we have set the official publication date two weeks before the conference, i.e., on August 11, 2014. On that day, the papers will be available online for all participants to download, print and read. If your employer is taking steps to protect intellectual property related to your paper, please inform them about this timing.

[2] While submissions are anonymous, we strongly encourage authors to plan for depositing language resources and other data as well as tools used and/or developed for the experiments described in the papers, if the paper is accepted.
In this respect, we encourage authors then to deposit resources and tools to available open-access repositories of language resources and/or repositories of tools (such as META-SHARE, Clarin, ELRA, LDC or AFNLP/COCOSDA for data, and github, sourceforge, CPAN and similar for software and tools) and refer to them instead of submitting them with the paper, even though it will also be an open possibility (through the START system). The details will be given in the submission site for camera-ready versions of accepted papers.

[3] There will be a separate call for demonstrations in February. Accepted papers on demonstrations will also be included in the proceedings.

IMPORTANT DATES

January, 2014: Opening of the submission website
March 21, 2014: Paper submission deadline
May 9-12, 2014: Author response period
May 23, 2014: Author notification
June 6, 2014: Camera-ready PDF due
August 11, 2014: Official paper publication date
August 25-29, 2014: Main conference

PROGRAM COMMITTEE

Program Committee Co-chairs
Junichi Tsujii (Microsoft Research, China)
Jan Hajic (Charles University in Prague, Czech Republic)

Scientific Advisory Board members
Ralph Grishman (New York University, USA)
Yuji Matsumoto (Nara Institute of Science and Technology, Japan)
Joakim Nivre (Uppsala University, Sweden)
Michael Picheny (IBM T. J. Watson Research Center, USA)
Donia Scott (University of Sussex, United Kingdom)
Chengqing Zong (Chinese Academy of Sciences, China)

Area Chairs

1. Linguistic Issues in CL and NLP
Emily M. Bender (University of Washington, USA)
Eva Hajicova (Charles University in Prague, Czech Republic)
Igor Boguslavsky (Universidad Politecnica de Madrid, Spain)

2. Machine Learning for CL and NLP
Jason Eisner (Johns Hopkins University, USA)
Yoshimasa Tsuruoka (University of Tokyo, Japan)

3. Cognitive Issues in CL and NLP
Philippe Blache (CNRS & Aix-Marseille Université, France)
Ted Gibson (MIT, USA)

4. Morphology, Word Segmentation, Tagging and Chunking
Reut Tsarfaty (Weizmann Institute of Science, Israel)
Yue Zhang (Singapore University of Technology and Design, Singapore)

5. Syntax, Grammar Induction, Syntactic and Semantic Parsing
Laura Kallmeyer (Heinrich-Heine-Universität, Germany)
Ryan McDonald (Google, USA)

6. Lexical Semantics and Ontologies
Chu-Ren Huang (Hong Kong Polytechnic University, Hong Kong)
Alessandro Oltramari (Carnegie Mellon University, USA)

7. Semantic Processing, Distributional Semantics and Compositional Semantics
Stephen Clark (University of Cambridge, UK)
Alessandro Lenci (University of Pisa, Italy)

8. Modeling of Discourse and Dialogue
Nicolas Asher (CNRS & Université Paul Sabatier, France)
Marilyn Walker (University of California Santa Cruz, USA)

9. Natural Language Generation and Summarization
Albert Gatt (University of Malta, Malta)
Advaith Siddharthan (University of Aberdeen, UK)

10. Paraphrasing and Textual Entailment
Ido Dagan (Bar Ilan University, Israel)
Kentaro Inui (Tohoku University, Japan)

11. Sentiment Analysis, Opinion Mining and Social Media
Rada Mihalcea (University of Michigan, USA)
Bing Liu (University of Illinois at Chicago, USA)

12. Information Retrieval and Question Answering
Gareth Jones (Dublin City University, Ireland)
Siddharth Patwardhan (IBM Research, USA)

13. Information Extraction and Database Linking
James Curran (University of Sydney, Australia)
Seung-won Hwang (Postec, Korea)

14. Applications
Srinivas Bangalore (AT&T Labs-Research, USA)
Heyan Huang (Beijing Institute of Technology, China)
Guillaume Jacquet (Joint Research Centre, Italy)

15. Multimodal and Natural Language Interfaces and Dialog Systems
Kristiina Jokinen (University of Helsinki, Finland)
David Traum (University of Southern California, USA)

16. Speech Recognition, Text-To-Speech, Spoken Language Understanding
Nick Campbell (Trinity College Dublin, Ireland)
Alex Potamianos (National Technical University Crete, Greece)

17. Machine Translation
Philipp Koehn (University of Edinburgh, UK / Johns Hopkins University, USA)
Chris Quirk (Microsoft Research, USA)
Tiejun Zhao (Harbin Institute of Technology, China)

18. Resources
Pushpak Bhattacharyya (IIT Bombay, India)
Nicoletta Calzolari (ILC-CNR, Pisa, Italy)
Martha Palmer (University of Colorado, USA)

19. Languages with less resources
Steven Bird (University of Melbourne, Australia)
Mark Liberman (University of Pennsylvania, USA)
Rajeev Sangal (IIT Banaras Hindu University, India)
Koenraad De Smedt (University of Bergen, Norway)

20. Software and Tools
Jesús Cardeñosa (Universidad Politecnica de Madrid, Spain)
Jing-Shin Chang (National Chi Nan University, Taiwan)

From hamon at LIMSI.FR Tue Dec 24 17:34:18 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 24 Dec 2013 18:34:18 +0100
Subject: Appel: LREC 2014, Workshops and Tutorials
Message-ID:

Date: Tue, 24 Dec 2013 15:35:58 +0100
From: ELRA ELDA Information
Message-ID: <52B99BCE.9030106 at elda.org>
X-url: http://lrec2014.lrec-conf.org/en/conference-programme/workshops-and-tutorials

[Apologies for multiple postings]

Schedule of all the LREC 2014 Workshops and Tutorials is now online at http://lrec2014.lrec-conf.org/en/conference-programme/workshops-and-tutorials. On this web page you will find a link to each Workshop Call for Papers and each Tutorial Outline, when available.
Don't hesitate to contact Workshop and/or Tutorial organisers if you have specific questions on their event. For general LREC 2014 matters, please contact us at http://lrec2014.lrec-conf.org/en/contact/ www.lrec-conf.org/lrec2014 Follow us on Twitter: @LREC2014 (https://twitter.com/LREC2014) From hamon at LIMSI.FR Tue Dec 24 17:45:38 2013 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 24 Dec 2013 18:45:38 +0100 Subject: Appel: LREC 2014 Workshop on Building and Using Comparable Corpora (7th BUCC) Message-ID: Date: Sun, 22 Dec 2013 14:42:26 +0100 From: "Reinhard Rapp" Message-ID: <3654EC959E8D49ABA97A0E21F4DA346A at ASUSPC> X-url: http://comparable.limsi.fr/bucc2014/ We apologize for multiple postings Please distribute to interested colleagues ============================================================ 1st Call for Papers 7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA Building Resources for Machine Translation Research http://comparable.limsi.fr/bucc2014/ May 27, 2014 Co-located with LREC 2014 Harpa Conference Centre, Reykjavik (Iceland) DEADLINE FOR PAPERS: February 10, 2014 https://www.softconf.com/lrec2014/BUCC2014/ ============================================================ MOTIVATION In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons. In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. 
We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora. The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages. Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. However, beyond a few language pairs such as English- French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web. To exemplify such issues in a practical setting, this year's special focus will be on Building Resources for Machine Translation Research This special topic aims to address the need for: (1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and (2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora. 
TOPICS

We solicit contributions including but not limited to the following topics:

Topics related to the special theme:
* Methods and tools for collecting and processing MT data, including crowdsourcing
* Methods and tools for quality control
* Tools for efficient annotation
* Bilingual term and named entity collections
* Multilingual treebanks, wordnets, propbanks, etc.
* Comparable corpora with parallel units annotated
* Comparable corpora for under-resourced languages and specific domains
* Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc.
* Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc.
* Legal issues with collecting and redistributing data and generating derivatives

Building comparable corpora:
* Human translations
* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the Web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages, across language families
* Multi-media/multi-modal comparable corpora

Applications of comparable corpora:
* Human translations
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual projections
* Machine translation
* Writing assistance

Mining from comparable corpora:
* Extraction of parallel segments or paraphrases from comparable corpora
* Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc.
IMPORTANT DATES February 10, 2014 Deadline for submission of full papers March 10, 2014 Notification of acceptance March 27, 2014 Camera-ready papers due May 27, 2014 Workshop date SUBMISSION INFORMATION Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF-file via the START workshop manager at https://www.softconf.com/lrec2014/BUCC2014/ Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages. Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings. Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc. 
For further information, please contact Pierre Zweigenbaum pz (at) limsi (dot) fr ORGANISERS Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France) Ahmet Aker, University of Sheffield (UK) Serge Sharoff, University of Leeds (UK) Stephan Vogel, QCRI (Qatar) Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France) SCIENTIFIC COMMITTEE * Ahmet Aker, University of Sheffield (UK) * Srinivas Bangalore (AT&T Labs, US) * Caroline Barri?re (CRIM, Montr?al, Canada) * Chris Biemann (TU Darmstadt, Germany) * Herv? D?jean (Xerox Research Centre Europe, Grenoble, France) * Kurt Eberle (Lingenio, Heidelberg, Germany) * Andreas Eisele (European Commission, Luxembourg) * ?ric Gaussier (Universit? Joseph Fourier, Grenoble, France) * Gregory Grefenstette (INRIA, Saclay, France) * Silvia Hansen-Schirra (University of Mainz, Germany) * Hitoshi Isahara (Toyohashi University of Technology) * Kyo Kageura (University of Tokyo, Japan) * Adam Kilgarriff (Lexical Computing Ltd, UK) * Natalie K?bler (Universit? Paris Diderot, France) * Philippe Langlais (Universit? de Montr?al, Canada) * Michael Mohler (Language Computer Corp., US) * Emmanuel Morin (Universit? de Nantes, France) * Dragos Stefan Munteanu (Language Weaver, Inc., US) * Lene Offersgaard (University of Copenhagen, Denmark) * Ted Pedersen (University of Minnesota, Duluth, US) * Reinhard Rapp (Universit? Aix-Marseille, France) * Sujith Ravi (Google, US) * Serge Sharoff (University of Leeds, UK) * Michel Simard (National Research Council Canada) * Richard Sproat (OGI School of Science & Technology, US) * Tim Van de Cruys (IRIT-CNRS, Toulouse, France) * Stephan Vogel, QCRI (Qatar) * Guillaume Wisniewski (Universit? 
Paris Sud & LIMSI-CNRS, Orsay, France)
* Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)

-------------------------------------------------------------------------

From hamon at LIMSI.FR Tue Dec 24 17:39:18 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 24 Dec 2013 18:39:18 +0100
Subject: Appel: Entity Extraction and Linking Challenge (#Microposts2014 @ WWW2014)
Message-ID:

Date: Sat, 21 Dec 2013 15:22:13 +0000
From: ampaeli cano
Message-ID:
X-url: http://www.scc.lancs.ac.uk/microposts2014/challenge/index.html

=======================================================================
Entity Extraction and Linking Challenge
at the 4th Making Sense of Microposts Workshop (#Microposts2014) @ WWW 2014
http://www.scc.lancs.ac.uk/microposts2014/challenge/index.html
7 April 2014, Seoul, Republic of Korea
=======================================================================

Microposts are a highly popular medium for sharing facts, opinions or emotions. They are an invaluable wealth of data, ready to be mined for training predictive models.

This year the #Microposts 2014 Workshop will host an "Entity Extraction and Linking Challenge". The overall task of the challenge is to automatically extract entities from English microposts, and link them to the corresponding English DBpedia v3.9 resources (if the linkage exists).
In the linking stage we aim to disambiguate expressions that are formed by discrete (and typically short) sequences of words. Existing entity linking tools are intended for use over news corpora and similar corpora of relatively long documents. We organise this challenge to foster research into novel, more accurate solutions for automatic entity linking in (much shorter) micropost data.

We will ask the participants to automatically extract entities (e.g., Obama, London, Rakuten) belonging to all entity types (e.g., Person, Location, Organisation) from a collection of microposts. Participants will have to automatically provide context-relevant DBpedia resources for each entity in a micropost.

DATASET
-------

The dataset comprises 3.5K tweets extracted from a much larger collection of over 18 million tweets. This collection, provided by the Redites project (http://demeter.inf.ed.ac.uk/redites/), covers event-annotated tweets collected for the period of 15th July 2011 to 15th August 2011 (31 days). It extends over multiple noteworthy events including the death of Amy Winehouse, the London Riots and the Oslo bombing.

Since the task of this challenge is to automatically extract and link entities, we have built our dataset considering both event and non-event tweets. While event tweets are more likely to contain entities, non-event tweets enable us to evaluate the performance of the system in avoiding false positives in the entity extraction phase.

The dataset has been split into training (70%) and test (30%) sets. Following the Twitter TOS we will only provide tweet IDs and annotations for the training set, and tweet IDs for the test set. We will also provide a common framework to mine these datasets from Twitter.

The training set will be released as a TSV file in which each line consists of:

- tweet_id
- entity_mention_1
- entity_uri_1
...
- entity_mention_n
- entity_uri_n

Tokens are separated by TABs.
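The training-set line format described here can be read with a few lines of Python. This is only a minimal sketch under stated assumptions: the function name `parse_annotation_line` is hypothetical, the sample tweet ID and DBpedia URIs are made up for illustration, and the real files may differ in details (for example, header lines or escaping) that the call does not specify.

```python
from typing import List, Tuple


def parse_annotation_line(line: str) -> Tuple[str, List[Tuple[str, str]]]:
    """Split one TAB-separated line into a tweet ID and (mention, URI) pairs.

    Assumes the layout given in the call: tweet_id, then alternating
    entity mention and entity URI fields. A tweet without entities is
    assumed to be a line holding only the tweet ID.
    """
    fields = line.rstrip("\n").split("\t")
    tweet_id, rest = fields[0], fields[1:]
    if len(rest) % 2 != 0:
        # Every mention must be followed by exactly one URI.
        raise ValueError(f"unpaired mention/URI field for tweet {tweet_id}")
    # Even positions are mentions, odd positions are their URIs.
    pairs = list(zip(rest[0::2], rest[1::2]))
    return tweet_id, pairs


# Hypothetical sample line; the ID and URIs are illustrative only.
sample = (
    "1234567890\tObama\thttp://dbpedia.org/resource/Barack_Obama"
    "\tLondon\thttp://dbpedia.org/resource/London\n"
)
tweet_id, pairs = parse_annotation_line(sample)
for mention, uri in pairs:
    print(tweet_id, mention, uri)
```

Keeping the pairs in a list rather than a dictionary preserves the order of the fields on the line and allows the same mention to occur twice in one tweet.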
Entity mentions and URIs are listed in their order of appearance in the tweet.

We will announce the release of the data sets promptly on the workshop mailing list. Please subscribe to https://groups.google.com/d/forum/microposts2014. More information about dates is available on the Challenge website.

EVALUATION
----------

The evaluation consists of two separate stages:

1. Paper peer review: a community of domain experts will judge the quality and applicability of the approaches taken, to provide useful insights on your research;

2. Precision and Recall: F1 (F-measure with beta = 1) will be computed on a gold standard manually created from the test set. The automatically extracted entities and links will both be matched against this ground truth.

Submissions will be ranked solely according to the F1 of each team's best submission.

SUBMISSIONS
-----------

Submissions should be provided as a zip file using your system name as the file name (e.g. 'awesome.zip'), containing:

1. a TSV file with your system name (e.g. 'awesome.tsv'). We accept up to 3 different submissions, and we will consider *only* the best. If you submit more than one, you must specify clearly in your paper the modifications applied to each labelled submission. In this case the submission should contain up to 3 TSV files named after the tool/system with "_n" appended to each (e.g. awesome_1.tsv, awesome_2.tsv, awesome_3.tsv). In order to evaluate your submissions we require you to submit a TSV file following the format in which the training set is provided.

2. a paper of 6 pages describing your approach and how you tuned/tested it using the training split.

All submissions must be in English. Submissions must be in PDF, formatted in the style of the Springer Publications format for Lecture Notes in Computer Science (LNCS) [http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0]. For details on the LNCS style, see Springer's Author Instructions. Submissions are not anonymous.
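For participants tuning a system on the training split, the F1 measure named in the EVALUATION section can be sketched as follows. This is an illustration, not the organisers' scorer: the call does not say how extracted entities and links are matched against the gold standard, so the sketch assumes exact matching of (tweet_id, mention, uri) triples, and the function name `precision_recall_f1` and the sample triples are hypothetical.

```python
from typing import Set, Tuple

# One annotation: (tweet_id, entity_mention, entity_uri).
Annotation = Tuple[str, str, str]


def precision_recall_f1(
    predicted: Set[Annotation], gold: Set[Annotation]
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) under exact-match scoring.

    Exact matching of whole triples is an assumption; the official
    evaluation may match mentions and links differently.
    """
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F-measure with beta = 1: harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical gold and system output for a single tweet.
gold = {
    ("1", "Obama", "http://dbpedia.org/resource/Barack_Obama"),
    ("1", "London", "http://dbpedia.org/resource/London"),
}
predicted = {
    ("1", "Obama", "http://dbpedia.org/resource/Barack_Obama"),
    ("1", "London", "http://dbpedia.org/resource/London,_Ontario"),
}
print(precision_recall_f1(predicted, gold))
```

In this sample one of the two predicted links is wrong, so the system is penalised on both precision and recall.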
Please send us your submission before the deadline through Easychair [https://www.easychair.org/conferences/?conf=microposts2014].

All accepted submissions will be invited for short presentations during the #Microposts2014 workshop and will be published independently from the workshop proceedings on the challenge page and on CEUR [http://ceur-ws.org/] (note that a minimum number of papers should be submitted in order to be able to publish them on CEUR).

IMPORTANT DATES
---------------

Intent to participate: 13 Jan 2014 (soft)
Release of training set: 14 Jan 2014
Release of test set: 17 Feb 2014
Challenge Submission deadline: 21 Feb 2014 (hard)
Challenge Notification: 14 Mar 2014 (hard)
Challenge camera-ready deadline: 24 Mar 2014 (hard)
Workshop program issued: 15 Mar 2014
Challenge proceedings to be published via CEUR
Workshop - 07 Apr 2014 (Registration open to all)

(All deadlines 23:59 Hawaii Time)

PRIZE
-----

To be announced.

CONTACT
-------

E-mail: microposts2014 at easychair.org
Facebook Group: http://www.facebook.com/#!/home.php?sk=group_180472611974910
Facebook Public Event page: http://www.facebook.com/events/116134955169543
Google group: https://groups.google.com/forum/#!forum/microposts2014
Twitter hashtag: #microposts2014challenge
Twitter account: @Microposts2014
W3C Microposts Community Group: http://www.w3.org/community/microposts

Challenge Organizers:
------------------------------

Challenge Chair:
A. Elizabeth Cano, Aston University, UK
Giuseppe Rizzo, Università
di Torino, Italy

Dataset Chair:
Andrea Varga, The University of Sheffield, UK

Challenge Committee:
---------------------

Ebrahim Bagheri, Ryerson University, Canada
Pierpaolo Basile, Dipartimento di Informatica - University of Bari, Italy
Uldis Bojars, SIOC Project
Óscar Corcho, Universidad Politécnica de Madrid, Spain
Leon Derczynski, The University of Sheffield, UK
Guillaume Erétéo, Orange Labs
Miriam Fernandez, Knowledge Media Institute, The Open University, UK
Andrés García-Silva, Ontology Engineering Group, Facultad de Informática, Universidad Politécnica de Madrid, Spain
Anna Lisa Gentile, The University of Sheffield, UK
Robert Jäschke, L3S Research Center, Germany
Diana Maynard, The University of Sheffield, UK
José M. Morales-Del-Castillo, El Colegio de México, Mexico
Georgios Paltoglou, University of Wolverhampton, UK
Bernardo Pereira Nunes, PUC-Rio, Brazil
Daniel Preoţiuc-Pietro, The University of Sheffield, UK
Raphaël Troncy, EURECOM, France
Mischa Tuffield, PeerIndex
Victoria Uren, Aston University, UK

-------------------------------------------------------------------------

From hamon at LIMSI.FR Tue Dec 24 17:46:48 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 24 Dec 2013 18:46:48 +0100
Subject: Stage: Extraction d'information dans la presse financiere, MarketScience
Message-ID:

Date: Mon, 23 Dec 2013 17:05:42 +0100
From: Aude
Message-ID: <52B85F56.1060608 at marketscience.fr>

Title: "Information extraction from the financial press"

MarketScience is a finance start-up developing a data-mining centre integrated into an app. The app will be sold on the Bloomberg platform to finance professionals. MarketScience combines modelling of financial time series with media mining.

We extract the financial information specific to our needs using NLP algorithms. These algorithms make it possible to:

1. determine the nature of an article and the probability that it contains information relevant to our analysis
2. isolate the required information and relate it to movements on certain markets
3. categorise this information within pre-identified economic factors

This work is done first of all in English (80% of the daily volume), but also in French, German, Italian, Russian and Chinese.

You will work mainly with the following tools: Python, Unitex and MySQL/SQL. Knowledge of NLP techniques or of machine learning and classification techniques (SVM, CRF, etc.) is considered highly desirable.

The internship will take place within a multidisciplinary team of statisticians, economists, programmers and NLP engineers.

Duration of the internship: 4 to 6 months, starting in February 2014, with the prospect of a job at the end.
Location: Orléans (45)
Pay: at least EUR 1,200 net.

Contact me on LinkedIn: Nicolas Boitout.
-------------------------------------------------------------------------

From hamon at LIMSI.FR Fri Dec 27 20:22:39 2013
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 27 Dec 2013 21:22:39 +0100
Subject: Appel: LREC 2014 Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools
Message-ID:

Date: Thu, 26 Dec 2013 20:05:38 +0000
From: OSACT
Message-ID: <76B661A7930AFC49A0DC6418253B1FA2C6AC8E at ex-itu01-002>
X-url: http://www.kacstac.org.sa/osact/index.html

We apologize for multiple postings. Please distribute to interested colleagues.

----------------------------------------------
1st Call for Papers

WORKSHOP ON Free/Open-Source Arabic Corpora and Corpora Processing Tools
http://www.kacstac.org.sa/osact/index.html

May 27, 2014
Co-located with LREC 2014
Harpa Conference Centre, Reykjavik (Iceland)

DEADLINE FOR PAPERS: February 10, 2014
https://www.softconf.com/lrec2014/OSACT/
============================================================

Workshop description

In the Natural Language Processing (NLP) and Computational Linguistics (CL) communities, Arabic has long been regarded as a resource-poor language. This was thought to be the reason for the lack of corpus-based studies of Arabic. However, recent years have witnessed the emergence of a considerable number of new free Arabic corpora and, to a lesser extent, Arabic corpus processing tools.

Freely available Arabic corpora can be divided into two groups.
The first group contains large Arabic corpora, which are designed and constructed primarily for Arabic linguistics research and activities, and possibly for Arabic NLP. These corpora are diverse in the genres they cover, and their sizes range from one million words to 700 million words. The second group contains corpora that were designed primarily for Arabic text classification and clustering; they mainly contain newspaper articles. They range from less than 1 million words to 11 million words.

Some Arabic corpora, mostly the large ones, can be explored on the web using different tools, while other corpora are only available for download. For the corpora that are available for download, the user may need to use standalone corpus processing tools. These tools offer many functions such as word frequency, concordance, collocation, etc.

Nevertheless, despite the availability of large and diverse Arabic corpora, the situation has not changed. There is still a lack of Arabic corpus-based studies. Is this because of the representativeness of these corpora? The available functions and tools associated with these corpora? Or is it because they are not well enough known in the Arabic linguistics community?

Motivation and topics of interest

This half-day workshop aims to encourage researchers and developers to make greater use of freely available Arabic corpora and open-source Arabic corpus processing tools, to help highlight the drawbacks of these resources, and to discuss techniques and approaches for improving them.

The workshop topics include but are not limited to:

- Surveying and critiquing the design of freely available Arabic corpora, their associated tools and standalone Arabic corpus processing tools.
- The applications and uses of freely available Arabic language resources in fields such as Arabic language education, e.g. L1 and L2.
- Arabic language modeling.
- Corpus-based Arabic lexicography.
- Lexical semantics and word sense.
- Corpus-based Arabic syntax.
- Corpus-based Arabic morphology.
- Development of Arabic mobile applications based on the available Arabic language resources.
- Evaluation and assessment of Arabic Corpora and Corpora Processing Tools.
- Future directions of Free/Open Arabic Corpora and Corpora Processing Tools.

Organising Committee

- Hend Al-Khalifa, King Saud University, KSA
- Abdulmohsen Al-Thubaity, King Abdul Aziz City for Science and Technology, KSA

Program Committee

- Eric Atwell, University of Leeds, UK
- Khaled Shaalan, The British University in Dubai (BUiD), UAE
- Dilworth Parkinson, Brigham Young University, USA
- Nizar Habash, Columbia University, USA
- Khurshid Ahmad, Trinity College Dublin, Ireland
- Abdulmalik AlSalman, King Saud University, KSA
- Maha Alrabiah, King Saud University, KSA
- Saleh Alosaimi, Imam University, KSA
- Sultan Almujaiwel, King Saud University, KSA
- Adam Kilgarriff, Lexical Computing Ltd, UK
- Amal AlSaif, Imam University, KSA
- Maha AlYahya, King Saud University, KSA
- Auhood AlFaries, King Saud University, KSA
- Salwa Hamada, Taibah University, KSA
- Mansour Algamdi, King Abdul Aziz City for Science and Technology, KSA
- Abdullah Alfaifi, University of Leeds, UK

Important Dates

- Submission deadline: 10 February 2014
- Notification of acceptance: 10 March 2014
- Final submission of manuscripts: 21 March 2014
- Workshop date: 27 May 2014 (morning session)

Submissions

The language of the workshop is English, and submissions should follow the LREC 2014 paper submission instructions. All papers will be peer reviewed, possibly by three independent referees. Papers must be submitted electronically in PDF format to the START system.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research.
Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc.