From hamon at LIMSI.FR Sat Feb 1 08:19:48 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:19:48 +0100
Subject: Appel: revue TAL - Note de lecture (BUVET)
Message-ID:

Date: Wed, 29 Jan 2014 16:59:15 +0100 (CET)
From: Denis Maurel
Message-ID: <799078705.18806356.1391011155799.JavaMail.zimbra at univ-tours.fr>

The journal TAL regularly publishes book reviews. We are looking for a colleague willing to read the book:

"Pierre-André BUVET. La Dimension lexicale de la détermination en français. Honoré Champion. 2013. 473 pages."

and to write a review of it for TAL (the book will be sent free of charge in return for the review).

The review must be written in French (three pages maximum, in the journal's format) and sent in at the end of March 2014.

Other reviews are possible if you have recently read a book that interested you and are willing to share your reading with the community...

Denis Maurel

-------------------------------------------------------------------------
Message distributed by the Langage Naturel list
Information, subscription: http://www.atala.org/article.php3?id_article=48
English version :
Archives: http://listserv.linguistlist.org/archives/ln.html
http://liste.cines.fr/info/ln

The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues)
Information and membership: http://www.atala.org/

ATALA accepts no responsibility for the content of messages distributed on the LN list
-------------------------------------------------------------------------

From hamon at LIMSI.FR Sat Feb 1 08:25:24 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:25:24 +0100
Subject: Appel: 15th International Workshop on Non-Monotonic Reasoning (NMR 2014)
Message-ID:

Date: Thu, 30 Jan 2014 10:59:24 +0900
From: Sébastien Konieczny
Message-Id:
X-url: http://www.kr.tuwien.ac.at/nmr14/

CALL FOR PAPERS

15th International Workshop on Non-Monotonic Reasoning (NMR
2014)
http://www.kr.tuwien.ac.at/nmr14/

Vienna, Austria, July 17–19, 2014

Co-located with KR 2014 [http://kr.org/KR2014/], DL 2014 [http://www.dbai.tuwien.ac.at/dl2014/], FLoC 2014, and Logic Colloquium 2014.

NMR 2014 is part of the Vienna Summer of Logic [http://vsl2014.at/]
______________________________________________________________________

* Aims and Scope *

The NMR workshop series is the premier specialized forum for researchers in non-monotonic reasoning and related areas. This will be the 15th workshop in this series. Its aim is to bring together active researchers in the broad area of non-monotonic reasoning, including belief revision, reasoning about actions, argumentation, declarative programming, preferences, non-monotonic reasoning for ontologies, uncertainty, and other related topics.

NMR will share a joint session with the 27th International Workshop on Description Logics (DL 2014).

* Topics *

NMR 2014 welcomes the submission of papers broadly centered on issues and research in non-monotonic reasoning. We welcome papers of either theoretical or practical nature. Topics of interest include (but are not limited to):

- abductive reasoning and diagnosis,
- algorithms and complexity analysis,
- argumentation and dialog,
- answer-set programming,
- belief revision, belief update, and belief merging,
- benchmarks for non-monotonic reasoning,
- declarative programming for non-monotonic reasoning,
- default reasoning,
- empirical studies of reasoning strategies,
- foundations of non-monotonic reasoning,
- hybrid approaches (non-monotonic reasoning combined with other computing paradigms),
- inconsistency handling,
- implementations and systems,
- non-monotonic logics in multi-agent interaction, including negotiation and dispute resolution,
- non-monotonic reasoning for ontologies,
- reasoning and decision making under uncertainty,
- reasoning with preferences,
- representing actions and planning,
- causal reasoning, and
- similarity-based reasoning.

* Tracks *

To focus the different topics of submissions, the workshop comprises the following thematic tracks:

1. Actions, Causality, and Belief Change;
2. Declarative Programming;
3. Argumentation and Dialog;
4. Preferences, Norms, and Trust;
5. NMR and Uncertainty;
6. Commonsense and NMR for Ontologies;

as well as the following special tracks:

7. Systems and Applications;
8. Benchmarks for NMR.

* Systems and Applications Track *

Recent years have witnessed the development of mature solver technology for some NMR-based formalisms and, accordingly, successful real-world applications. This track welcomes papers describing implemented NMR systems as well as papers presenting applications of NMR formalisms and systems. Topics of interest include pure system descriptions (providing information on the basic functionality and usability of the respective systems), the comparison and evaluation of NMR systems, NMR applications in industry and academia, software engineering and modeling methodology aspects, and reports from the field.

* Benchmarks for NMR special track *

The aim of the Benchmarks for NMR special track is to discuss the construction of benchmarks for NMR. Benchmarks have proved useful in a variety of domains for developing efficient algorithms and methods. They are, for the moment, insufficiently developed for the main NMR areas. We want to discuss this issue at NMR 2014. Typical questions of interest could be:

- How to obtain benchmarks from real application cases?
- How to build sensible random benchmarks?
- How to export existing benchmarks in some formalism into the formalisms of other domains?
- Etc.

Papers related to these issues, descriptions of existing systems of benchmarks, etc., are welcome.

* Submissions *

Papers should be between 4 and 10 pages in AAAI style (http://www.aaai.org/Publications/Author/author.php) including references, figures, and appendixes if any. System descriptions can typically be at the lower end of the page range.
Paper submission will be handled electronically through the EasyChair system. Papers must be submitted in PDF only. The submission page is available at http://www.easychair.org/conferences/?conf=nmr2014

Papers already published at other conferences that may be of interest to an NMR audience are welcome at NMR 2014, provided that the initial publication is mentioned in a footnote on the first page.

Each submission must indicate which of the tracks listed above it is intended for, and whether it presents new research or recently published research.

* Proceedings *

There are no formal proceedings for NMR. The accepted papers will be published as a technical report and will be made available in the CoRR Computing Research Repository, see http://arxiv.org/corr/home. The copyright of the papers lies with the authors, and as far as NMR is concerned, they are free to submit to other conferences and workshops as well. Similarly, papers already published can be submitted (but this has to be indicated in the submission).

* Important Dates *

Submission deadline: February 14, 2014
Notification: April 2, 2014
Camera-ready articles due: April 25, 2014
NMR 2014: July 17-19, 2014

* Location *

NMR 2014 will be held at the Vienna University of Technology and is part of the Vienna Summer of Logic, http://vsl2014.at/, which will probably be the largest scientific logic event in known history.

* Workshop Chairs *

Sébastien Konieczny (CNRS, Université d'Artois, France)
Hans Tompits (Vienna University of Technology, Austria)

* Track Chairs *

1. Actions, Causality, and Belief Change
   Renata Wasserman (Universidade de São Paulo, Brasil)
2. Declarative Programming
   Tomi Janhunen (Aalto University, Finland)
3. Argumentation and Dialog
   Paul E. Dunne (University of Liverpool, UK)
4. Preferences, Norms, and Trust
   Mehdi Dastani (Utrecht University, Netherlands)
5. NMR and Uncertainty
   Lluis Godo (Universitat Autonoma de Barcelona, Spain)
6.
Commonsense and NMR for Ontologies
   Guilin Qi (Southeast University, China)
7. Systems and Applications
   Esra Erdem (Sabanci University, Turkey)
8. Benchmarks for NMR
   Sébastien Konieczny (CNRS, Université d'Artois, France)

Email: nmr14 [at] kr [dot] tuwien [dot] ac [dot] at

* Homepage *

http://www.kr.tuwien.ac.at/nmr14/

From hamon at LIMSI.FR Sat Feb 1 08:33:57 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:33:57 +0100
Subject: Stage: Offre de stage SNCF, Extraction d'informations sur les pratiques de mobilite
Message-ID:

Date: Fri, 31 Jan 2014 09:52:57 +0100
From: "REUTENAUER Coralie (SNCF / DIRECTION SI2R / DIR I&R PRODUCTION)"
Message-ID: <17936745830A554B9A3D9540F41B312D02DF5B26 at se3p08bm045.COMMUN.AD.SNCF.FR>

*******************
Title: Information extraction on mobility practices

SNCF's Innovation and Research Department (Direction Innovation et Recherche) is looking for an intern to work on a project studying passenger mobility through the analysis of textual data.

*Internship activities*
------------------------------
Building a platform for the semantic tagging of textual data, for the analysis of mobility practices.

*Topic*
------------------------------
For several years, society has been undergoing major changes in mobility practices, owing to new forms of work organisation, the emergence of new modes of transport, and so on. Travellers write about their journeys on the social web, under normal as well as disrupted conditions. Their messages contain information about travellers' activities, their sociological characteristics and their motivations. A semantic analysis based on such criteria is likely to yield better knowledge of behaviours, needs and expectations. It allows a nuanced and differentiated understanding of mobility.

The aim of the internship is to contribute to setting up a data analysis platform for extracting information on mobility practices.

*Description*
------------------------------
The intern will:

- become familiar with the context of the internship (SNCF, Innovation & Research Department, objectives of the internship and working framework, current situation and contacts for the topics concerned, ...)
- review the state of the art of the tools available on the market, analysing their capabilities, advantages and drawbacks
- set up a management interface for the database (collection, structuring and interface)
- define and implement methods for the semantic tagging of the data, according to a typology that will be specified beforehand
- exploit the annotation results and propose avenues for improvement

Presentations and reports:

- presentation at the start of the internship at SNCF (after one month): internship context, work schedule and first work carried out
- complete final internship report covering: methodology chosen, work carried out, results obtained and problems encountered...

2 end-of-internship defences: one at the school and one at SNCF.
Presentations within SNCF or externally may also be given.

*Candidate profile*
------------------------------
Level: Master's level (Bac+5) in Natural Language Processing or Computer Science (engineering degree or Master 2).

Desired skills:

- Analytical, writing and summarising skills
- Autonomy, interpersonal skills, presentation skills (oral/written)
- Handling and testing NLP tools
- Knowledge of NLP and linguistics
- Good computing skills (programming, database management)
- Knowledge of statistics would be appreciated

Bonus:

- Proficiency in at least one programming language (preferably PHP)
- Proficiency in regular expressions and SQL

*Terms of the position*
------------------------------
- Duration: 4 months
- Compensation: internship allowance + SNCF travel pass
- Start: from June 2014
- Location: Paris

Please send your CV and cover letter to Coralie Reutenauer at the following email address: coralie.reutenauer at sncf.fr

From hamon at LIMSI.FR Sat Feb 1 08:21:11 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:21:11 +0100
Subject: Seminaire: Conferences Nick Riemer, Universite Paris Diderot
Message-ID:

Date: Wed, 29 Jan 2014 18:07:32 +0100
From: Thierry Poibeau
Message-Id: <2BD46089-6E29-4586-B4B5-650ADC89F01E at ens.fr>
X-url:
http://www.linguist.univ-paris-diderot.fr/linglunch.html

Dear colleagues,

Nick Riemer, Professor at the University of Sydney and visiting researcher at Université Paris Diderot, will be at the university from 22 January to 22 February 2014, at the initiative of the UFR de Linguistique and the Laboratoire d'Histoire des Théories Linguistiques (UMR 7597, CNRS/Paris Diderot/Sorbonne Nouvelle).

He will give three talks:

- Tuesday 4 February (14:00-16:00): « L'invariance, l'expressivité et le sens : vers une typologie de la signification lexicale » (Invariance, expressivity and meaning: towards a typology of lexical meaning)
  talk organised by HTL/UMR7597 and LaTTiCe/UMR8094, ODG room 209

- Thursday 6 February (12:00-13:00): « Six reasons not to believe in the descriptive/expressive distinction »
  Linglunch seminar, UFR Linguistique, ODG room 533
  http://www.linguist.univ-paris-diderot.fr/linglunch.html

- Friday 14 February (13:00-15:00): « La sémantique mentaliste : enjeux épistémologiques » (Mentalist semantics: epistemological issues)
  HTL/UMR7597 and Labex EFL seminar « Linguistique et théories de l'esprit avant et après l'émergence de la psychologie expérimentale » (Linguistics and theories of mind before and after the emergence of experimental psychology), ODG room 131

Venue: Université Paris Diderot, Bât.
Olympe de Gouges (ODG), rue Einstein, 75013 Paris

Best regards,
Thierry Poibeau and Jacqueline Léon

From hamon at LIMSI.FR Sat Feb 1 08:40:44 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:40:44 +0100
Subject: Appel: 4th Workshop on Controlled Languages (CNL 2014)
Message-ID:

Date: Fri, 31 Jan 2014 13:29:44 +0100
From: Camilo Thorne
Message-ID:
X-url: http://attempto.ifi.uzh.ch/site/cnl2014/

Second Call for Papers

*** FOURTH WORKSHOP ON CONTROLLED NATURAL LANGUAGE (CNL 2014) ***

Co-located with COLING 2014
20-22 August 2014
Galway, Ireland
http://attempto.ifi.uzh.ch/site/cnl2014/

Proceedings in Springer LNCS.
---------------------------------------------------------------------

This workshop on controlled natural language (CNL) has a broad scope and embraces all approaches that are based on natural language and apply restrictions on vocabulary, grammar, and/or semantics. This includes (but is certainly not limited to) approaches that have been called simplified language, plain language, formalized language, processable language, fragments of language, phraseologies, conceptual authoring, language generation, and guided natural language interfaces.

Some CNLs are designed to improve communication among humans, especially for non-native speakers of the respective natural language.
In other cases, the restrictions on the language are supposed to make it easier for computers to analyze such texts in order to improve computer-aided, semi-automatic, or automatic translations into other languages. A third group of CNLs aims to enable reliable automated reasoning and formal knowledge representation from seemingly natural texts. All these types of CNL are covered by this workshop.

Topics
------

Possible topics for CNL 2014 include:

- CNL for knowledge representation
- CNL for query interfaces
- CNL for specifications
- CNL for business rules
- CNL for dialogue systems
- CNL for machine translation
- CNL for improved understandability of texts
- CNL for natural language generation
- design of CNLs
- CNL applications
- CNL evaluation
- usability and acceptance of CNL
- CNL grammars and lexica
- multilingual CNLs
- reasoning in CNL
- spoken CNL
- CNL in the context of the Semantic Web and Linked Open Data
- CNL in the government
- CNL in industry
- CNL use cases
- theoretical properties of CNL

Important Dates
---------------

Submission deadline: 14 March 2014
Notification of acceptance: 12 May 2014
Deadline for revised papers: 9 June 2014
Workshop: 20-22 August 2014

Submissions and Proceedings
---------------------------

We invite researchers to submit papers with novel contributions in the area of CNL. These research papers should be formatted according to the Springer LNCS format and should not exceed 10 pages. Papers should be submitted via the EasyChair conference system: http://www.easychair.org/conferences/?conf=cnl2014

Accepted papers will be included in the workshop proceedings to be published by Springer in their LNCS/LNAI series and indexed in all major citation databases including ISI Web of Science and Scopus.

Venue
-----

The workshop takes place in Ireland at INSIGHT at NUI Galway (formerly DERI).

Organization Committee
----------------------

- Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Kaarel Kaljurand (University of Zurich, Switzerland)
- Tobias Kuhn (ETH Zurich, Switzerland)

Program Committee
-----------------

- Krasimir Angelov (Chalmers University, Sweden)
- Johan Bos (University of Groningen, Netherlands)
- Paul Buitelaar (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Olga Caprotti (University of Helsinki, Finland)
- Rogan Creswick (Galois, USA)
- Danica Damljanovic (University of Sheffield, UK)
- Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Ronald Denaux (iSOCO, Spain)
- Vania Dimitrova (University of Leeds, UK)
- Ramona Enache (Chalmers University, Sweden)
- Esra Erdem (Sabanci University, Turkey)
- Sébastien Ferré (University Rennes 1, France)
- Norbert E. Fuchs (University of Zurich, Switzerland)
- Normunds Grūzītis (University of Latvia)
- Siegfried Handschuh (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Stefan Höfler (University of Zurich, Switzerland)
- Kaarel Kaljurand (University of Zurich, Switzerland)
- Peter Koepke (University of Bonn, Germany)
- Tobias Kuhn (ETH Zurich, Switzerland)
- Hans Leiß (University of Munich, Germany)
- Reinhard Muskens (Tilburg University, Netherlands)
- Adegboyega Ojo (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Gordon Pace (University of Malta)
- Richard Power (The Open University, UK)
- Laurette Pretorius (University of South Africa)
- Allan Ramsay (University of Manchester, UK)
- Mike Rosner (University of Malta)
- Aarne Ranta (University of Gothenburg, Sweden)
- Rolf Schwitter (Macquarie University, Australia)
- Geoff Sutcliffe (University of Miami, USA)
- Irina Temnikova (University of Wolverhampton, UK)
- Allan Third (The Open University, UK)
- Camilo Thorne (Free University of Bozen-Bolzano, Italy)
- Silvie Spreeuwenberg (LibRT, Netherlands)
- Uta Schwertel (imc, Germany)
- Jeroen Van Grondelle (Be Informed, Netherlands)
- Yorick Wilks (University of
Sheffield, UK)
- Adam Wyner (University of Aberdeen, UK)

From hamon at LIMSI.FR Sat Feb 1 08:43:19 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:43:19 +0100
Subject: Appel: IC 2014, 2nd appel a ateliers et tutoriels
Message-ID:

Date: Fri, 31 Jan 2014 18:51:08 +0100
From: Catherine Faron Zucker
Message-ID: <52EBE28C.7050606 at i3s.unice.fr>
X-url: http://www.irit.fr/IC2014

*2nd Call for Workshops and Tutorials of the*
25èmes Journées Francophones d'Ingénierie des Connaissances:
/Pervasive knowledge: users immersed in knowledge-based systems (SBC) and knowledge-based systems immersed in the world/
http://www.irit.fr/IC2014
12 to 16 May 2014 in Clermont-Ferrand
------------------------------------------------------------------------
*Important dates:*
Deadline for submitting workshop and tutorial proposals: 7 February 2014
Response from the programme committee: 11 February 2014
------------------------------------------------------------------------
Workshops are an opportunity to bring together interest groups on a specific subject, to launch a working group on a new theme, and to compare, discuss and debate different approaches to a shared problem.

Tutorials are an opportunity to train members of the knowledge engineering community in models, methods, techniques or tools.

The schedule of the IC 2014 conference allows workshops and tutorials to be held on the afternoon of Monday 12 May and on Tuesday 13 May. Several parallel workshops may be considered. A workshop may last half a day or a full day.
------------------------------------------------------------------------
*Submissions*

Workshop or tutorial proposals will be submitted to the IC2014 programme committee for review. They must be sent by email to (faron at i3s.unice.fr). The following information must be provided:

- Workshop title
- Surname and first name of the organiser (contact)
- Email address of the organiser (contact)
- Steering committee (surname, first name, email address, web page for each person)
- Text of the workshop's call for papers
- Equipment required
- Workshop duration: half a day or a full day
- Preferred slot(s): Monday afternoon / Tuesday morning / Tuesday afternoon

The organiser of a workshop or tutorial will be responsible for its communication, in particular creating a website or a page on the conference site, circulating a call for papers, organising a review process for submissions, notifying authors, collecting the final versions of accepted papers and compiling the workshop proceedings. He or she must ensure that organisers and authors register for the workshop, and encourage registrations for the conference.

As a guide, the workshop schedule could be:

- Call for papers issued: 12 February 2014
- Submission deadline: 10 March 2014
- Notification to authors: 31 March 2014
- Final versions of contributions: 14 April 2014

Keeping to this schedule will make it possible to distribute the workshop proceedings on the USB key given to participants.
------------------------------------------------------------------------
The Ingénierie des Connaissances conference is held under the auspices of the IC college of AFIA
------------------------------------------------------------------------
*Programme committee chair:* Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)
*Organising committee chair:* Catherine Roussey (IRSTEA)

From hamon at LIMSI.FR Sat Feb 1 08:40:00 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:40:00 +0100
Subject: Soft: Public release of the OpenDial toolkit
Message-ID:

Date: Fri, 31 Jan 2014 12:43:38 +0100
From: Pierre Lison
Message-Id: <7D2E2BBB-45C5-4FC9-AED8-60D1ED025321 at ifi.uio.no>
X-url: http://opendial.googlecode.com

Dear all,

I’m pleased to announce the first public release of the OpenDial toolkit!

OpenDial is a Java-based software toolkit that can be used to develop robust and adaptive dialogue systems for various domains. Dialogue understanding, management and generation models are expressed in OpenDial through probabilistic rules encoded in a simple XML format.

You can find more information about the toolkit on the project website: http://opendial.googlecode.com

The current release contains a completely refactored code base, a large set of unit tests and a full-fledged graphical interface.
You can also find on the website practical examples of dialogue domains and step-by-step documentation on how to use the toolkit.

Feel free to let me know if you have any comments or suggestions,

Pierre Lison (PhD Research Fellow)
Department of Informatics, University of Oslo
Mobile: +47.967.998.12
Web: http://folk.uio.no/plison

From hamon at LIMSI.FR Sat Feb 1 08:23:13 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:23:13 +0100
Subject: Appel: Final Call for SIGIR'14 Workshop Proposals (due Feb 3)
Message-ID:

Date: Thu, 30 Jan 2014 09:15:25 +1000
From: Richi Nayak
Message-ID:
X-url: http://sigir.org/sigir2014/

[Apologies if you receive this more than once]

This is the Final Call for SIGIR'14 Workshop Proposals:

- Deadline Monday 3 February 2014 (any time-zone)
- On any topic, in particular novel ideas and emerging areas in IR
- In a workshop format (not just presenting papers)
- Proposals in pdf, ACM style max.
4 pages
- Submit at (https://www.easychair.org/conferences/?conf=sigir2014workshops)

Full details:

--
Call for Workshop Proposals
37th Annual ACM SIGIR Conference, SIGIR 2014
Gold Coast, Australia, 6-11 July, 2014
http://sigir.org/sigir2014/

* SIGIR 2014 Workshop Program

SIGIR is the major international forum for the presentation of new research results and for the demonstration of new systems and techniques in the broad field of information retrieval (IR). Proposals are solicited for workshops to be held at ACM SIGIR 2014 in Gold Coast, Australia. Workshops will last for one day and will be held on Friday 11 July 2014. Workshop topics will typically match those identified in the SIGIR 2014 general call for contributions, but proposals concerned with other areas of IR are welcome.

SIGIR 2014 workshops will provide a platform for presenting novel ideas and emerging areas in IR, in a less formal and possibly more focused way than the conference itself. The format of each workshop is to be determined by the organizers, but it is expected that workshops will contain ample time for general discussion and engagement by all participants -- not just those presenting papers. Workshops that foster collaboration, discussion, group problem-solving and community building initiatives are particularly encouraged. Workshops that only involve the presentation of papers in a "mini conference" format are discouraged.

Researchers and practitioners from all areas of IR are invited to submit proposals for review. The organizers of approved workshops will be expected to define the workshop's focus, gather and review submissions, and decide upon final program content. Organizers (including co-organizers) are expected to attend their entire workshop and to provide an article for ACM SIGIR Forum summarizing the event.

* Submission Requirements

Workshop proposals should include the following information:

1. Title
2. Motivation for the workshop and appropriateness to SIGIR.
3.
Theme and purpose of the workshop.
4. Format and planned activities and a tentative schedule of events.
5. Selection process for participants and/or presenters and maximum number of participants (if limited).
6. List of organizers with a short biographical sketch of each organizer, describing relevant qualifications and experience.
7. Names of potential program committee members.
8. Related workshops (if applicable). If the workshop has been held previously at SIGIR or another conference, then the organizers should indicate this and describe briefly past attendance and outcomes, and why another workshop is needed.

Proposals should be submitted via the EasyChair workshop submission webpage (https://www.easychair.org/conferences/?conf=sigir2014workshops) in pdf format and should not exceed 4 pages in ACM style.

* Important Dates

- Monday 3 February 2014: Workshop proposals due
- Monday 10 March 2014: Notification of workshop acceptance [We try to notify 1-2 weeks early!]
- Sunday 6 to Friday 11 July, 2014: SIGIR Conference
- Friday 11 July, 2014: SIGIR Workshops day

* SIGIR Workshop Co-Chairs

Jaap Kamps, University of Amsterdam, Email: sigir2014workshops at easychair.org
Gabriella Kazai, Microsoft Research and Bing, Email: sigir2014workshops at easychair.org

From hamon at LIMSI.FR Sat Feb 1 08:31:09 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:31:09 +0100
Subject: Seminaire: Alpage, Eric de la Clergerie, 7 fevrier 2014
Message-ID:

Date: Fri, 31 Jan 2014 08:33:43 +0100
From: Marie Candito
Message-ID:

************** Alpage team seminar *********

This is the research seminar in computational linguistics organised by the Alpage team, a joint INRIA - Paris Diderot team specialising in natural language processing.

Friday 7 February, 11:00 to 12:30, room 267
Bâtiment Olympe de Gouges
rue Albert Einstein
75013 Paris
(This is the last building on rue Albert Einstein, on the recently inaugurated "place Paul Ricoeur")

Anyone interested is welcome.

***********************************************************
Eric de la Clergerie (Alpage, INRIA / Université Paris Diderot)

Title: Analyse syntaxique statistique et hybride: 2 expériences (Statistical and hybrid parsing: 2 experiments)

Abstract:

I present 2 parsing experiments whose only common point is that they rely on the DyALog system.

The first experiment, begun in the context of the SPMRL 2013 evaluation campaign, concerns the rapid development of a statistical Shift-Reduce parser (à la MALT), but with a beam and the use of dynamic programming techniques. Its other original feature is the ability to take word lattices as input.

The second experiment is part of the long-term development of FRMG, a parser based on a wide-coverage symbolic TAG grammar for French. Using weakly supervised learning techniques on the French TreeBank (FTB) has substantially improved the performance of FRMG's disambiguation module. The results obtained through this symbolic/statistical hybridisation are on a par with those of purely statistical parsers trained and evaluated on the FTB.
From hamon at LIMSI.FR Sat Feb 1 08:28:11 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:28:11 +0100
Subject: Revue: Journal of Language Modelling 1(2)
Message-ID:

Date: Thu, 30 Jan 2014 10:34:49 +0100
From: Adam Przepiorkowski
Message-ID: <87zjmdhkxy.fsf at bach.ipipan.waw.pl>
X-url: http://jlm.ipipan.waw.pl/

It is our pleasure to announce the publication of issue 1(2) of the Journal of Language Modelling, a free open-access peer-reviewed journal aiming to bridge the gap between theoretical, formal and computational linguistics: http://jlm.ipipan.waw.pl/ (click on “CURRENT” to see this issue).

TABLE OF CONTENTS:

"Populating a multilingual ontology of proper names from open sources"
Agata SAVARY, Leszek MANICKI, Małgorzata BARON

"Grammatical typology and frequency analysis: number availability and number use"
Dunstan BROWN, Greville G. CORBETT, Sebastian FEDDEN, Andrew HIPPISLEY, Paul MARRIOTT

"Reactivation of antecedents by overt vs null pronouns: Evidence from Persian"
Niloofar KESHTIARI, Shravan VASISHTH

"Syntax-driven semantic frame composition in Lexicalized Tree Adjoining Grammars"
Laura KALLMEYER, Rainer OSSWALD

"External reviewers 2012–2013"

The current make-up of the JLM Editorial Board is enclosed below.
Best regards,
Adam Przepiórkowski (for JLM editors)

======================================================================
EDITORIAL BOARD:
Steven Abney, University of Michigan, USA
Ash Asudeh, Carleton University, CANADA; University of Oxford, UNITED KINGDOM
Chris Biemann, Technische Universität Darmstadt, GERMANY
Igor Boguslavsky, Technical University of Madrid, SPAIN; Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, RUSSIA
António Branco, University of Lisbon, PORTUGAL
David Chiang, University of Southern California, Los Angeles, USA
Greville Corbett, University of Surrey, UNITED KINGDOM
Dan Cristea, University of Iași, ROMANIA
Jan Daciuk, Gdańsk University of Technology, POLAND
Mary Dalrymple, University of Oxford, UNITED KINGDOM
Darja Fišer, University of Ljubljana, SLOVENIA
Anette Frank, Universität Heidelberg, GERMANY
Claire Gardent, CNRS/LORIA, Nancy, FRANCE
Jonathan Ginzburg, Université Paris-Diderot, FRANCE
Stefan Th. Gries, University of California, Santa Barbara, USA
Heiki-Jaan Kaalep, University of Tartu, ESTONIA
Laura Kallmeyer, Heinrich-Heine-Universität Düsseldorf, GERMANY
Jong-Bok Kim, Kyung Hee University, Seoul, KOREA
Kimmo Koskenniemi, University of Helsinki, FINLAND
Jonas Kuhn, Universität Stuttgart, GERMANY
Alessandro Lenci, University of Pisa, ITALY
Ján Mačutek, Comenius University in Bratislava, SLOVAKIA
Igor Mel’čuk, University of Montreal, CANADA
Glyn Morrill, Technical University of Catalonia, Barcelona, SPAIN
Stefan Müller, Freie Universität Berlin, GERMANY
Reinhard Muskens, Tilburg University, NETHERLANDS
Mark-Jan Nederhof, University of St Andrews, UNITED KINGDOM
Petya Osenova, Sofia University, BULGARIA
David Pesetsky, Massachusetts Institute of Technology, USA
Maciej Piasecki, Wrocław University of Technology, POLAND
Christopher Potts, Stanford University, USA
Louisa Sadler, University of Essex, UNITED KINGDOM
Ivan A. Sag†, Stanford University, USA
Agata Savary, Université François Rabelais Tours, FRANCE
Sabine Schulte im Walde, Universität Stuttgart, GERMANY
Stuart M. Shieber, Harvard University, USA
Mark Steedman, University of Edinburgh, UNITED KINGDOM
Stan Szpakowicz, Electrical Engineering and Computer Science, University of Ottawa, CANADA; Institute of Computer Science, Polish Academy of Sciences, Warsaw, POLAND
Shravan Vasishth, Universität Potsdam, GERMANY
Zygmunt Vetulani, Adam Mickiewicz University, Poznań, POLAND
Aline Villavicencio, Federal University of Rio Grande do Sul, Porto Alegre, BRAZIL
Veronika Vincze, University of Szeged, HUNGARY
Yorick Wilks, Florida Institute of Human and Machine Cognition, USA
Shuly Wintner, University of Haifa, ISRAEL
Zdeněk Žabokrtský, Charles University in Prague, CZECH REPUBLIC
======================================================================

From hamon at LIMSI.FR Sat Feb 1 08:46:21 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:46:21 +0100
Subject: Appel: Medias et bien-etre, discours et representations
Message-ID:

date: Sat, 01 Feb 2014 07:52:32 +0100
from: "Pascale Vergely"
message-id: <612-52ec9980-1-6511a680 at 8997611>

Call for papers

Médias et bien-être : discours et représentations (Media and well-being: discourse and representations)

Following the international conference "Médias et santé publique" organized by the SPH laboratory of the Université Bordeaux in October 2012, the University of Bologna (DIT, Department of Interpreting and Translation, Forlì), in partnership with the universities of Bordeaux and Rome 3 and in collaboration with Dorif Università, wishes to continue the discussion begun there by focusing on the notion of well-being.

At once physical, psychological and social, the notion of well-being is central to our complex societies. Its dictionary definition stresses that it is as much a "pleasant sensation brought about by the satisfaction of physical needs" and by the "absence of psychological tension" as a "material situation that makes it possible to satisfy the needs of existence". In this conference we intend to focus on the dimensions of well-being centered on health, whether in the field of nutrition (diets, organic products and cooking, natural products...), sport (fitness gymnastics or physical training) or therapies (spas, unconventional medicine, alternative medicine, paramedical products...). The normative ideal, or social injunction, to seek a state of "well-being" will also be examined.

The conference, to be held in Forlì on 16 and 17 October 2014, aims to bring together researchers from different backgrounds (Linguistics and Language Sciences, Translation Studies, Information and Communication Sciences, Sociology, Philosophy...), as well as health, well-being and media professionals, to compare their thinking on texts and discourse related to well-being.

Papers, in French or Italian, may address the following themes (non-exhaustive list):

Translation and translation studies:
- Translating texts centered on well-being: theoretical questions and translation strategies.
- Translation and textual genres relating to well-being: which strategies for a wide variety of genres, from scientific texts written by specialists to promotional or popularizing texts?
- Translating well-being texts between the hard sciences and the humanities: from terminological problems to stylistic issues.
- Localizing websites devoted to well-being into different languages and cultures.
- What training for translators of texts of such great linguistic, discursive and cultural variety and heterogeneity?

Discourse analysis:
- Discourse(s) and texts about well-being: what relations hold between the situation of enunciation, interdiscourse and socio-discursive formation? What rhetorical strategies are used?
- How should the heterogeneity of the media carrying texts and discourse about well-being (advertisements, placards, walls, screens, audiovisual and internet documents...) be handled, and how do these media relate to the surrounding context?
- What links exist between the axis of intentionality (production) and the axis of acceptability (reception and interpretation) for these discourses and texts?
- Do scientific discourses transform subjectivities?

Popularization between discourse analysis and text linguistics:
- Popularization as intralingual translation: what linguistic and semiotic markers (reformulations, qualifying designations, infographics, boxes and didactic illustrations, etc.) appear in corpora about well-being?
- Well-being texts at the crossroads of popularization and promotion: ethos, polyphony and enunciative issues.
- Terminological issues and reformulation in popularization texts.
- Popularization and text types (the different ways well-being is presented in the media, in specialist publications, in magazines, in information brochures from spas or alternative medicine centers, etc.).
- Popularization strategies and age groups: what differences between texts for adults and texts for children?

Schedule
31 March 2014: Deadline for paper proposals (300-400 words), accompanied by a short bio-bibliographical note. Proposals (in French or Italian) should be sent to:
Roberta Pederzoli: r.pederzoli at unibo.it
Licia Reggiani: licia.reggiani at unibo.it
Laura Santone: laura.santone at uniroma3.it
End of May 2014: Notification of acceptance
16 and 17 October 2014: Conference

Scientific committee
Josiane Boutet (Paris 7), Bruna Donatelli (Univ. Roma Tre), Enrica Galazzi (Univ. Cattolica, Milan), Chiara Elefante (Univ. Bologna), Danielle Londei (Univ. Bologna), Olivier Laught (Univ. Bordeaux), Sophie Moirand (Univ. Paris 3), Paola Paissa (Univ. Turin), Roberta Pederzoli (Univ. Bologna), Sandrine Reboul-Touré (Univ. Paris 3), Laura Santone (Univ. Roma Tre), Pascale Vergely (Univ. Bordeaux).
From hamon at LIMSI.FR Sat Feb 1 08:44:46 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:44:46 +0100
Subject: Appel: Semantic Sentiment Analysis 2014, workshop ESWC 2014
Message-ID:

Date: Fri, 31 Jan 2014 23:44:41 +0100
From: Diego Reforgiato
Message-ID: <52EC2759.5050501 at istc.cnr.it>
X-url: http://ontologydesignpatterns.org/wiki/SemanticSentimentAnalysis2014

** apologies for cross-posting **

============================================
Call for Workshop: Semantic Sentiment Analysis 2014
Semantic Web and Sentiment Analysis
============================================
http://ontologydesignpatterns.org/wiki/SemanticSentimentAnalysis2014

Dates: May 25, 2014
Hashtag: #ssa2014
Venue: Anissaras, Crete, Greece
Jointly with the 11th ESWC 2014 (http://2014.eswc-conferences.org)

Workshop Chairs:
- Aldo Gangemi (U. Paris Nord France/ISTC-CNR Rome Italy)
- Harith Alani (KMI-OU Milton Keynes UK)
- Malvina Nissim (University of Bologna, Italy)
- Erik Cambria (NUS Singapore)
- Diego Reforgiato Recupero (ISTC-CNR Catania Italy)

Mining opinions and sentiments from natural language is a difficult task, as opinions and sentiments are often conveyed implicitly through latent semantics, which makes approaches based on surface lexical hints less effective in those cases.
To this end, semantic sentiment analysis may go beyond word-level analysis of text, and provide novel approaches to opinion mining and sentiment analysis that allow a more efficient passage from (unstructured) textual information to (structured) machine-processable data, in potentially any domain. Semantic sentiment analysis may benefit from the use of web ontologies (e.g. linked data, multilingual linked lexica, linked vocabularies), or graph-based quasi-ontologies (e.g. ConceptNet, SenticNet, Nell, OIE, etc.) which enable the aggregation of conceptual and affective information associated with natural language opinions.

This workshop focuses on the introduction, presentation, and discussion of novel approaches to semantic sentiment analysis. Special focus will be given to semantic methods, models, and tools that exploit common-sense knowledge bases to perform multi- or open-domain sentiment analysis.

The workshop is intended for researchers from academia and industry, as well as professionals and industrial practitioners, who wish to discuss and exchange positions on new hybrid techniques that use semantics for sentiment analysis. The expected number of participants ranges between 20 and 40.

* Topics of Interest *
Topics include, but are not limited to:
- Ontologies and knowledge bases for sentiment analysis
- Topic and entity based sentiment analysis
- Evolution of sentiment within and across social media systems and topics
- Semantic processing of social media for sentiment analysis
- Contextualised sentiment analysis
- Comparison of semantic approaches for sentiment analysis
- Personalised sentiment analysis and monitoring
- Prediction of sentiment towards events, people, organisations, etc.
- Baselines and datasets for semantic sentiment analysis
- Concept-level sentiment analysis
- Biologically-inspired opinion mining

* Submission Guidelines *
Submissions must comply with the Springer LNCS style and must be made using EasyChair at the following link: https://www.easychair.org/conferences/?conf=ssa14

Authors are invited to submit full papers (up to 8 pages, e.g. original work) or short papers (up to 4 pages, e.g. pilot experiments, position statements, short communications). Accepted papers will be published by CEUR-WS. Moreover, selected expanded versions of workshop papers will be invited for a special issue of Springer Cognitive Computation (http://springer.com/journal/12559). At least one author of each accepted paper must register for both the main conference and the workshop for the paper to be included in the workshop proceedings.

* Important Dates *
Submission Deadline: March 6, 2014
Notification of Acceptance: April 1, 2014
Camera-Ready Paper: April 15, 2014

From hamon at LIMSI.FR Mon Feb 3 09:08:06 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Mon, 3 Feb 2014 10:08:06 +0100
Subject: Appel: NLDB'2014, Montpellier, France (Extended deadline)
Message-ID:

Date: Mon, 3 Feb 2014 05:32:23 +0100
From: Mathieu Roche
Message-Id: <00AB2166-CA0A-472B-9333-3F5B6CD90005 at lirmm.fr>
X-url: http://www.nldb.org/

==============
CALL FOR PAPERS - NLDB'2014
19th International Conference on Application of Natural Language to Information Systems
18-20 June 2014 - Montpellier, France
http://www.nldb.org/
Submission deadline: February 10, 2014 [EXTENDED DEADLINE]
==============

Since 1995, the NLDB conference has aimed to bring together researchers, industry practitioners and potential users interested in various applications of Natural Language in the Database and Information Systems field. The integration of databases and natural language has been a utopia for many years.
However, progress has been made and this is now an established field thanks to developments in Natural Language and technologies that made the storage and manipulation of large electronic dictionaries possible. As Information Systems are now evolving into the communication area, the term databases should be considered in the broader sense of information and communication systems. The use of Natural Language in Software Engineering has contributed to both improving the development process from the viewpoints of developers (improve the process of conceptual modeling, validation, etc) and the usability of applications by users (natural language query interfaces, semantic webs, etc). NLDB'2014 will take place in Montpellier (France). The conference invites researchers from academia and industry to submit papers for oral or poster presentations on recent, unpublished research that addresses theoretical aspects, algorithms, applications, architectures for applied and integrated NLP, resources for applied NLP, and other aspects of NLP, as well as review and discussion papers. 
Topics of interest include but are not limited to: - Applications of NLP in Information Systems: Multilingual Information Systems, NLP in Requirement Engineering, NLP in Knowledge Management, Semantic Data Integration and Data Cleaning - Social Media and Web Data: Corpus analysis, Language identification, Text normalization, Robust NLP for social media, Text classification, Information Extraction and Sentiment Analysis for social media - Big Data and Natural Language - Semantic Web Open Linked Data: Ontology Learning and Alignment, Populating ontologies, Querying Ontologies and linked data, Semantic tagging and classification, Ontology-driven NLP - Question Answering (QA): NL interfaces to databases, QA using web data, multi-lingual QA, Non-factoid QA (how/why/opinion questions, lists), geographical QA, QA corpora and training sets - Natural language and Ubiquitous Computing: Pervasive Computing, Embedded, Robotic and Mobile Applications. - Natural Language in Conceptual Modeling: Analysis of Natural Language Descriptions, Terminological Ontologies, Consistency Checking, Metadata Creation and Harvesting, Ontology-driven Systems Integration, Ontology Management - NLP Applications: Business Intelligence, Subjectivity and Sentiment Analysis, QA systems, Event Detection, Named Entity and Event Detection, Information Extraction, Summarization, NLP for Data Mining, NLP for Data Warehouses, Plagiarism detection, Identity detection ============== Submission guidelines: Authors should follow the LNCS format and submit their manuscripts in pdf. Submissions can be full papers (12 pages maximum including references), short papers (6 pages) or papers for a poster presentation (4 pages) : https://www.easychair.org/conferences/?conf=nldb2014 The programme committee may decide to accept some full papers as short papers or poster contributions. The Proceedings of NLDB 2014 will be published by Springer LNCS Series. 
We plan to publish extended versions of a selection of the best papers after the conference in the 'Data and Knowledge Engineering Journal' as a special issue.

==============
Conference Chair:
- Elisabeth Métais, Conservatoire National des Arts et Métiers, Paris, France

Programme Committee Chairs:
- Mathieu Roche, Cirad, TETIS, France
- Maguelonne Teisseire, Irstea, TETIS, France

==============
Important dates:
- Paper submission: February 10, 2014 [extended deadline]
- Notification of acceptance: March 24, 2014
- Camera-ready papers: April 7, 2014
- Conference: 18-20 June 2014

From hamon at LIMSI.FR Tue Feb 4 22:04:20 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 4 Feb 2014 23:04:20 +0100
Subject: Appel: TSD 2014
Message-ID:

Date: Tue, 04 Feb 2014 22:30:47 +0100
From: TSD 2014
Message-Id:
X-url: http://www.tsdconference.org/

*********************************************************
TSD 2014 - FIRST CALL FOR PAPERS
*********************************************************

Seventeenth International Conference on TEXT, SPEECH and DIALOGUE (TSD 2014)
Brno, Czech Republic, 8-12 September 2014
http://www.tsdconference.org/

The conference is organized by the Faculty of Informatics, Masaryk University, Brno, and the Faculty of Applied Sciences, University of West Bohemia, Pilsen. The conference is supported by the International Speech Communication Association.
Venue: Brno, Czech Republic THE SUBMISSION DEADLINES: March 15 2014 ............ Submission of abstracts March 22 2014 ............ Submission of full papers Submission of abstract serves for better organization of the review process only - for the actual review a full paper submission is necessary. TSD SERIES TSD series evolved as a prime forum for interaction between researchers in both spoken and written language processing from all over the world. Proceedings of TSD form a book published by Springer-Verlag in their Lecture Notes in Artificial Intelligence (LNAI) series. TSD Proceedings are regularly indexed by Thomson Reuters Conference Proceedings Citation Index. Moreover, LNAI series are listed in all major citation databases such as DBLP, SCOPUS, EI, INSPEC or COMPENDEX. TOPICS Topics of the conference will include (but are not limited to): Corpora and Language Resources (monolingual, multilingual, text and spoken corpora, large web corpora, disambiguation, specialized lexicons, dictionaries) Speech Recognition (multilingual, continuous, emotional speech, handicapped speaker, out-of-vocabulary words, alternative way of feature extraction, new models for acoustic and language modelling) Tagging, Classification and Parsing of Text and Speech (morphological and syntactic analysis, synthesis and disambiguation, multilingual processing, sentiment analysis, credibility analysis, automatic text labeling, summarization, authorship attribution) Speech and Spoken Language Generation (multilingual, high fidelity speech synthesis, computer singing) Semantic Processing of Text and Speech (information extraction, information retrieval, data mining, semantic web, knowledge representation, inference, ontologies, sense disambiguation, plagiarism detection) Integrating Applications of Text and Speech Processing (machine translation, natural language understanding, question-answering strategies, assistive technologies) Automatic Dialogue Systems (self-learning, multilingual, 
question-answering systems, dialogue strategies, prosody in dialogues) Multimodal Techniques and Modelling (video processing, facial animation, visual speech synthesis, user modelling, emotions and personality modelling) Papers on processing of languages other than English are strongly encouraged. PROGRAM COMMITTEE Hynek Hermansky, USA (general chair) Eneko Agirre, Spain Genevieve Baudoin, France Paul Cook, Australia Jan Cernocky, Czech Republic Simon Dobrisek, Slovenia Karina Evgrafova, Russia Darja Fiser, Slovenia Radovan Garabik, Slovakia Alexander Gelbukh, Mexico Louise Guthrie, GB Jan Hajic, Czech Republic Eva Hajicova, Czech Republic Yannis Haralambous, France Ludwig Hitzenberger, Germany Jaroslava Hlavacova, Czech Republic Ales Horak, Czech Republic Eduard Hovy, USA Maria Khokhlova, Russia Daniil Kocharov, Russia Ivan Kopecek, Czech Republic Valia Kordoni, Germany Steven Krauwer, The Netherlands Siegfried Kunzmann, Germany Natalija Loukachevitch, Russia Vaclav Matousek, Czech Republic Diana McCarthy, United Kingdom France Mihelic, Slovenia Hermann Ney, Germany Elmar Noeth, Germany Karel Oliva, Czech Republic Karel Pala, Czech Republic Nikola Pavesic, Slovenia Fabio Pianesi, Italy Maciej Piasecki, Poland Adam Przepiorkowski, Poland Josef Psutka, Czech Republic James Pustejovsky, USA German Rigau, Spain Leon Rothkrantz, The Netherlands Anna Rumshisky, USA Milan Rusko, Slovakia Mykola Sazhok, Ukraine Pavel Skrelin, Russia Pavel Smrz, Czech Republic Petr Sojka, Czech Republic Stefan Steidl, Germany Georg Stemmer, Germany Marko Tadic, Croatia Tamas Varadi, Hungary Zygmunt Vetulani, Poland Pascal Wiggers, The Netherlands Yorick Wilks, GB Marcin Wolinski, Poland Victor Zakharov, Russia KEYNOTE SPEAKERS Ralph Grishman, New York University, USA Bernardo Magnini, FBK - Fondazione Bruno Kessler, Italy Salim Roukos, IBM, USA FORMAT OF THE CONFERENCE The conference program will include presentation of invited papers, oral presentations, and poster/demonstration sessions. 
Papers will be presented in plenary or topic-oriented sessions. Social events, including a trip in the vicinity of Brno, will allow for additional informal interactions.

SUBMISSION OF PAPERS

Authors are invited to submit a full paper not exceeding 8 pages formatted in the LNCS style (see below). Those accepted will be presented either orally or as posters. The decision about the presentation format will be based on the recommendation of the reviewers. The authors are asked to submit their papers using the on-line form accessible from the conference website.

Papers submitted to TSD 2014 must not be under review by any other conference or publication during the TSD review cycle, and must not be previously published or accepted for publication elsewhere.

As reviewing will be blind, the paper should not include the authors' names and affiliations. Furthermore, self-references that reveal the author's identity, e.g., "We previously showed (Smith, 1991) ...", should be avoided. Instead, use citations such as "Smith previously showed (Smith, 1991) ...". Papers that do not conform to the requirements above may be rejected without review.

The authors are strongly encouraged to write their papers in TeX or LaTeX formats. These formats are necessary for the final versions of the papers that will be published in the Springer Lecture Notes. Authors using WORD-compatible software for the final version must use the LNCS template for WORD and, during the submission process, ask the Proceedings Editors to convert the paper to LaTeX format. For this service, a service-and-license fee of CZK 1500 will be levied automatically. The paper submitted for review must be a PDF or PostScript file with all required fonts included.
Upon notification of acceptance, presenters will receive further information on submitting their camera-ready and electronic sources (for detailed instructions on the final paper format see http://www.springer.de/comp/lncs/authors.html#Proceedings, Sample File typeinst.zip).

Authors are also invited to present actual projects, developed software or interesting material relevant to the topics of the conference. The presenters of demonstrations should provide an abstract not exceeding one page. The demonstration abstracts will not appear in the conference proceedings.

IMPORTANT DATES

March 15 2014 ............ Submission of abstracts
March 22 2014 ............ Submission of full papers
May 15 2014 .............. Notification of acceptance
May 31 2014 .............. Final papers (camera ready) and registration
August 3 2014 ............ Submission of demonstration abstracts
August 10 2014 ........... Notification of acceptance for demonstrations sent to the authors
September 8-12 2014 ...... Conference date

Submission of abstracts serves for better organization of the review process only - for the actual review a full paper submission is necessary.

The accepted conference contributions will be published in proceedings that will be made available to participants at the time of the conference.

OFFICIAL LANGUAGE

The official language of the conference is English.

ACCOMMODATION

The organizing committee will arrange discounts on accommodation in the 4-star hotel at the conference venue. The current prices of the accommodation will be available at the conference website.
ADDRESS

All correspondence regarding the conference should be addressed to
Ales Horak, TSD 2014
Faculty of Informatics, Masaryk University
Botanicka 68a, 602 00 Brno, Czech Republic
phone: +420-5-49 49 18 63
fax: +420-5-49 49 18 20
email: tsd2014 at tsdconference.org

The official TSD 2014 homepage is: http://www.tsdconference.org/

LOCATION

Brno is the second largest city in the Czech Republic, with a population of almost 400,000, and is the country's judiciary and trade-fair center. Brno is the capital of South Moravia, which is located in the south-east part of the Czech Republic and is known for a wide range of cultural, natural, and technical sights. South Moravia is a traditional wine region. Brno has been a Royal City since 1347, and with its six universities it forms a cultural center of the region.

Brno can be reached easily by direct flights from London, Moscow, and Eindhoven, and by trains or buses from Prague (200 km) or Vienna (130 km).

For participants with some extra time, nearby places may also be of interest. Local ones include: Brno Castle, now called Spilberk; Veveri Castle; the Old and New City Halls; the Augustine Monastery with St. Thomas Church and the crypt of the Moravian Margraves; the Church of St. James; the Cathedral of St. Peter & Paul; the Carthusian Monastery in Kralovo Pole; and the famous Villa Tugendhat designed by Mies van der Rohe, along with other important buildings of interwar Czech architecture.

For those willing to venture out of Brno, the following are all within easy reach: the Moravian Karst with Macocha Chasm and the Punkva caves; the battlefield of the Battle of the Three Emperors (Napoleon, Alexander of Russia and Franz of Austria; the Battle of Austerlitz); the Chateau of Slavkov (Austerlitz); Pernstejn Castle; Buchlov Castle; Lednice Chateau; Buchlovice Chateau; Letovice Chateau; Mikulov, with one of the largest Jewish cemeteries in Central Europe; Telc, a town on the UNESCO heritage list; and many others.
From hamon at LIMSI.FR Tue Feb 4 21:36:02 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 4 Feb 2014 22:36:02 +0100
Subject: Appel: KI 2014
Message-ID:

Date: Mon, 3 Feb 2014 14:39:45 +1100
From: Michael Thielscher
Message-Id: <1F4275CC-271B-4C6E-82C8-B0B40944A797 at cse.unsw.edu.au>
X-url: http://www.easychair.org/conferences/?conf=ki2014

**** Please accept our apologies if you receive multiple copies of this call ****

======================================================================
KI 2014 - The 37th German Conference on Artificial Intelligence
Stuttgart, September 22--26, 2014
======================================================================
www.ki2014.de

KI 2014 is the 37th edition of the German Conference on Artificial Intelligence, which traditionally brings together academic and industrial researchers from all areas of AI, providing a premier forum for exchanging news and research results on theory and applications of intelligent system technology. The technical program of KI 2014 will comprise paper and poster presentations and a variety of workshops and tutorials.

KI 2014 will be co-located with Informatik 2014 (Annual Conference of the German Informatics Society) and MATES 2014 (The 12th German Conference on Multi-Agent System Technologies).

The conference invites original research papers from all areas of AI, its fundamentals, its algorithms, its history and its applications.
You are invited to submit original research and application papers on all aspects of AI research, including but not limited to the following: - Agent-based and multiagent systems - AI applications and innovations - Cognitive modeling - Commonsense reasoning - Computer vision - Constraint satisfaction, search, and optimization - Evolutionary computation - Game playing and interactive entertainment - Information retrieval, integration, and extraction - Knowledge acquisition and ontologies - Knowledge representation and reasoning - Machine learning and data mining - Multidisciplinary AI - Natural language processing - Planning and scheduling - Robotics - Uncertainty in AI - Web and information systems We especially welcome application papers that provide novel insights on the interplay of AI and the real world, as well as papers that bring useful computational technologies from other areas of computer science into AI. ======== Important Dates ======== Workshop/Tutorial submission: March 15 Full/Short Paper submission: May 1 Acceptance notification: June 23 Final version due: July 4 KI Workshops and Conference: September 22-26 ======== Workshops, Tutorials, Doctoral Consortium ======== Together with the main conference, we plan to organize a small number of high-quality workshops and tutorials. We especially encourage workshops organized by AI Special Interest Groups (GI-Fachgruppen), workshops that bring together researchers from different disciplines, and workshops that highlight emerging topics of AI research. Tutorials should target a large percentage of conference participants, including graduate students as well as experienced researchers, and practitioners. The technical program of KI2014 is also complemented by a doctoral consortium that invites participation by PhD students at any stage and from any subject area within AI. 
The KI doctoral consortium is organized as a joint event with the doctoral program of Informatik 2014, and includes a mentoring program, a PhD workshop, and poster presentations. For details on submitting workshop and tutorial proposals and on applying to the doctoral consortium, please see the KI2014 web page. ======== Submission ======== Submitted papers, which have to be in English, must not exceed 12 pages in Springer LNCS style for full technical contributions and 6 pages for short papers. Full technical papers are expected to report on new research that makes a substantial technical contribution to the field. Short papers are also included in the main proceedings and can report on new research or other issues of interest to the AI community. Examples of work suitable for short paper submissions include: novel ideas whose scope is not large enough for a full paper; important implementation techniques; novel interesting benchmark problems; short experimental studies; interesting applications that are not yet completely solved or analyzed; position or challenge papers; etc. Short paper submissions are especially invited for software demonstrations or PhD work in progress. Submission will be through the Easychair conference management system: http://www.easychair.org/conferences/?conf=ki2014 Papers will be subject to blind peer review. All papers will be reviewed based on the standard criteria of relevance, significance of results, originality of ideas, soundness, and quality of the presentation. All accepted papers will be published in the main conference proceedings, and will be presented at the conference. At least one author of each accepted paper must register for the conference and present the contribution. The KI 2014 proceedings will be published by Springer as a volume of the LNAI (Lecture Notes in Artificial Intelligence) series. 
======== Main Organizers ======== Program Chairs * Carsten Lutz (Universität Bremen) * Michael Thielscher (University of New South Wales) Workshop and Tutorial Chair * Frieder Stolzenburg (Hochschule Harz) Doctoral Consortium Chair * Markus Krötzsch (TU Dresden) For enquiries, please contact ki2014 at easychair.org. From hamon at LIMSI.FR Tue Feb 4 21:39:13 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:39:13 +0100 Subject: Appel: RECITAL 2014 Message-ID: Date: Mon, 03 Feb 2014 11:52:07 +0100 From: Nuria Gala Message-ID: <52EF74D7.3000409 at lif.univ-mrs.fr> RECITAL 2014: second call for papers ----------------------------------------------- RECITAL 2014 16th Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (meeting of student researchers in computer science for natural language processing). Faculté Saint Charles, Aix Marseille Université (Marseille), July 1-4, 2014. Important dates ----------------- - Submission deadline: Saturday, April 5, 2014 - Notification to authors: Saturday, May 3, 2014 - Final version: Saturday, May 17, 2014 Overview ------------ RECITAL 2014, the annual young researchers' conference associated with TALN, will take place in Marseille from July 1 to 4, 2014. RECITAL gives young researchers in natural language processing (NLP; in French, Traitement Automatique des Langues, TAL) the opportunity to present their work and compare their approaches. 
It is open only to students (master's and doctoral) and to young researchers who obtained their doctorate less than a year ago. Building on the success of previous years, we encourage the submission of work even at a preliminary stage, thesis projects, and work from the first months of research (state of the art, initial directions, etc.). The primary aim of RECITAL is to support the work of young researchers in natural language processing (NLP) and to ease their integration into our community. To that end, we aim for: - pedagogical reviews: authors must be able to understand the mistakes they may have made so that they can correct them and improve the quality of their work; - positive reviews: it is never necessary to discourage a young researcher; the watchwords are to encourage and to guide; - direct exchange: reviews will be sent to authors signed, and therefore not anonymous. Authors (or, better, reviewers) are free to discuss them informally with each other at the conference. The RECITAL best paper award will be presented at the closing ceremony. 
Main topics ----------------- Papers may address the usual NLP topics: - Analysis and generation in the following areas: + Phonetics + Phonology + Morphology + Syntax + Semantics + Discourse - Development of linguistic resources for NLP: + Databases containing morphological, syntactic, semantic, and/or phonological information + Grammars + Lexicons + Ontologies + Corpus linguistics - NLP applications: + Sentiment or opinion analysis + Automatic categorization or classification + Word sense disambiguation + Human-machine dialogue in natural language + Computer-assisted teaching + Automatic indexing + Information retrieval and extraction + Automatic summarization + Anaphora resolution + Question answering systems + Machine translation + Semantic Web - Approaches: + Formal linguistic approaches intended to support automatic processing + Symbolic + Logical + Statistical + Based on machine learning This list is not exhaustive, and the program committee will judge whether a proposed paper is suitable for the conference. Selection criteria --------------------- Authors must be students or recent PhD graduates who defended their thesis less than a year ago. Papers co-authored with established researchers (which includes thesis supervisors) must be submitted to TALN, not to RECITAL. Authors are invited to submit original research work that has not been published previously. Submissions will be reviewed by at least two specialists in the field. 
The following will be considered in particular: - Correctness of the scientific and technical content - Positioning of the work within international research - Organization and clarity of the presentation - Relevance to the conference topics Selected papers will be published in the conference proceedings. Depending on the program committee's decision, papers will be presented either orally or as posters. Submission procedure ----------------------- Papers are to be written in French by French speakers, and in English by those who are not proficient in French. Papers must be 8 to 14 pages long. A LaTeX style file, a Word template, and a LibreOffice template will be available on the conference website (www.taln2014.org). Contact: nuria.gala(at)lif.univ-mrs.fr and/or klim.peshkov(at)lpl-aix.fr From hamon at LIMSI.FR Tue Feb 4 21:37:17 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:37:17 +0100 Subject: Appel: 19th Conference on Formal Grammar, FG-2014 Message-ID: Date: Mon, 3 Feb 2014 11:42:11 +0100 (CET) From: Frank Richter Message-ID: X-url: http://fg.phil.hhu.de/ FINAL CALL FOR PAPERS FG-2014: The 19th Conference on Formal Grammar Tuebingen, Germany, August 16-17 2014 http://fg.phil.hhu.de/ Collocated with the European Summer School in Logic, Language and 
Information Background FG-2014 is the 19th conference on Formal Grammar, to be held in conjunction with the European Summer School in Logic, Language and Information, which takes place in 2014 in Tuebingen, Germany. Previous Formal Grammar meetings were held in Barcelona (1995), Prague (1996), Aix-en-Provence (1997), Saarbruecken (1998), Utrecht (1999), Helsinki (2001), Trento (2002), Vienna (2003), Nancy (2004), Edinburgh (2005), Malaga (2006), Dublin (2007), Hamburg (2008), Bordeaux (2009), Copenhagen (2010), Ljubljana (2011), Opole (2012) and Duesseldorf (2013). Aims and Scope FG provides a forum for the presentation of new and original research on formal grammar, mathematical linguistics and the application of formal and mathematical methods to the study of natural language. Themes of interest include, but are not limited to, * formal and computational phonology, morphology, syntax, semantics and pragmatics; * model-theoretic and proof-theoretic methods in linguistics; * logical aspects of linguistic structure; * constraint-based and resource-sensitive approaches to grammar; * learnability of formal grammar; * integration of stochastic and symbolic models of grammar; * foundational, methodological and architectural issues in grammar and linguistics; * mathematical foundations of statistical approaches to linguistic analysis. Previous conferences in this series have welcomed papers from a wide variety of frameworks. Submission Details We invite **electronic** submissions of original, 16-page papers (including references and possible technical appendices). Authors are encouraged to use the Springer-Verlag LNCS style: http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0 The submission deadline is **February 23 2014**. Papers must be submitted electronically at https://www.easychair.org/conferences/?conf=fg2014 Papers should report original work which was not presented in other conferences. 
However, simultaneous submission is allowed, provided that the authors indicate other conferences to which the work was submitted in a footnote. Note that accepted papers can only be presented in one of the venues. Submissions will be reviewed anonymously by at least three reviewers. Accepted papers will be published as a volume in the Springer LNCS series, under the FoLLI subline, either separately or jointly with the papers from FG-2015, depending on the number of accepted papers. Important Dates * February 23, 2014: Deadline for paper submission * April 20, 2014: Notification of acceptance * June 1, 2014: Camera ready copies due * August 16-17, 2014: Conference dates Invited Speakers Thomas Icard (Stanford University, USA) Christian Retoré (Université Bordeaux 1, France) Program Committee Alexander Clark (King's College London) Benoit Crabbé (Université Paris 7, France) Berthold Crysmann (CNRS - LLF, France) Denys Duchier (Université d'Orleans, France) Annie Foret (IRISA - IFSIC, France) Nissim Francez (Technion, Israel) Philippe de Groote (Inria Nancy) Laura Kallmeyer (University of Düsseldorf, Germany) Makoto Kanazawa (National Institute of Informatics, Japan) Greg Kobele (University of Chicago, USA) Valia Kordoni (Humboldt-Universität Berlin, Germany) Wolfgang Maier (University of Düsseldorf, Germany) Stefan Müller (Freie Universität Berlin, Germany) Gerald Penn (University of Toronto, Canada) Christian Retoré (Université Bordeaux 1, France) Manfred Sailer (Goethe University Frankfurt, Germany) Ed Stabler (UCLA, USA) Jesse Tseng (CNRS - CLLE-ERSS, France) Oriol Valentín (Universitat Politècnica de Catalunya, Spain) Standing committee * Glyn Morrill (Universitat Politècnica de Catalunya, Spain) * Reinhard Muskens (Tilburg University, The Netherlands) * Rainer Osswald (Heinrich-Heine-Universitaet Duesseldorf, Germany) * Frank Richter (Heinrich-Heine-Universitaet Duesseldorf, Germany) ------------------------------------------------------------------------- 
From hamon at LIMSI.FR Tue Feb 4 21:39:50 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:39:50 +0100 Subject: Appel: QALD-4 training phase starts now! Message-ID: Date: Mon, 3 Feb 2014 13:51:53 +0100 (CET) From: Elena Cabrio Message-ID: <743639560.1084915.1391431913412.JavaMail.root at inria.fr> X-url: http://nlp.uned.es/clef-qa/ X-url: http://www.sc.cit-ec.uni-bielefeld.de/qald Dear all, We are very happy to announce QALD-4, the fourth in a series of evaluation campaigns on multilingual question answering over linked data, this time with a strong emphasis on interlinked data and hybrid question answering. In the context of the question answering lab at CLEF 2014 (http://nlp.uned.es/clef-qa/), we offer three open challenges: Task 1: Multilingual question answering over DBpedia Task 2: Biomedical question answering over interlinked data Task 3: Hybrid question answering All relevant information about training and test phase as well as the lab can be found on our website: http://www.sc.cit-ec.uni-bielefeld.de/qald The training phase starts now, and the further schedule is as follows: Release of test data: April 1, 2014 Submission deadline: May 1, 2014 Release of results: May 8, 2014 Workshop: September, 2014 We cordially invite everyone working on question answering over linked data to participate, and look forward to another exciting challenge! 
With best regards, Christina Unger, Vanessa Lopez, Philipp Cimiano, Elena Cabrio, Axel Ngonga, Sebastian Walter, Corina Forascu From hamon at LIMSI.FR Tue Feb 4 21:40:34 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:40:34 +0100 Subject: Conf: TOTh 2014, Opening Conference Message-ID: Date: Mon, 3 Feb 2014 14:08:44 +0100 From: Luc Damas Message-ID: ------------------------------------------------------ TOTh 2014 - Opening Conference 12-13 June 2014 - www.porphyre.org/toth/ "Ontology meets natural language: What can linguists learn?" Christiane Fellbaum - Princeton University The creation of large lexical databases for Natural Language Processing has renewed interest in the structure of the lexicon and challenged many linguists' theories of systemic regularities in the face of empirical investigation. The need for mapping lexicons across languages and research in automatic inferencing and reasoning have resulted in fruitful collaboration between lexicology and the neighboring discipline of ontology and have afforded new perspectives on the lexicon. My talk will address the following questions and provide some answers based on recent research. * To what extent can a formal ontology be mapped onto the lexicon of a natural language and reflect both its surface and its hidden structure? * How much lexical semantics can ontology capture, and how useful is an ontological perspective of the lexicon for word sense disambiguation? * How can ontology inform the construction of lexical databases for automatic reasoning, e.g., by introducing categorial distinctions such as between types and roles? 
--------------------------------------------------- From hamon at LIMSI.FR Tue Feb 4 21:42:59 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:42:59 +0100 Subject: Job: Research Assistant in Text Simplification, UPF/TALN/, Barcelona Message-ID: Date: Mon, 3 Feb 2014 18:22:51 +0100 From: Horacio Saggion Message-ID: X-url: http://www.dtic.upf.edu/~hsaggion/ TALN Research Group, Department of Information and Communication Technologies (DTIC), Universitat Pompeu Fabra (UPF), Barcelona, Spain Applications are invited for a 12-month research position in the area of text simplification. Funding for the position comes from a new collaborative European project. The research will be carried out at the Natural Language Processing research group (TALN) in Barcelona under the supervision of Dr. Horacio Saggion as PI. *Brief Description of Work* The work to be carried out involves the development/adaptation of a text simplification system to the English language and improvement of technology already developed in the Simplext project. *Qualifications:* Applicants should have: *** an MSc or PhD in computer science, artificial intelligence, or computational linguistics *** Knowledge of machine-learning and rule-based approaches *** Knowledge of Java is a plus *Additionally:* *** Knowledge of the GATE system would be appreciated *** Highly motivated individual with experience or interest in the area of automatic text simplification and/or summarization. *Language requirements:* *** good knowledge of English *Starting date:* The position is expected to start early March/April 2014. *Educational level required:* MSc or PhD. *Salary and duration of the contract* We offer a gross salary of EURO 20.000 - EURO 22.000 which can be complemented with teaching if desired. The duration of the contract is 12 months. The position can be extended depending on available funds. 
*How to apply:* Send a cover letter, curriculum vitae, and the names and contact information of at least two referees to horacio.saggion at upf.edu. Use "Research Position Simplification/NLP" as the subject of your message. *Selection process* The selection process will start immediately. From hamon at LIMSI.FR Tue Feb 4 21:41:37 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:41:37 +0100 Subject: Appel: FLAIRS-27, Call for Poster Abstracts (Submissions Due February 10, 2014) Message-ID: Date: Mon, 3 Feb 2014 14:49:45 +0100 From: Christophe Jouis Message-Id: <1050D157-C8B8-43C7-9F45-F6DC9D33C4E7 at lip6.fr> X-url: http://www.FLAIRS-27.info ======================================================================= The 27th International Conference of the Florida Artificial Intelligence Research Society (FLAIRS-27) Poster abstract submission deadline: February 10, 2014 www.FLAIRS-27.info May 21-23, 2014 Pensacola Beach, Florida, USA Hilton Pensacola Beach Gulf Front http://hiltonpensacolabeach.com/ ======================================================================= FLAIRS-27 continues a tradition of presenting and discussing state-of-the-art artificial intelligence and related research in a sociable atmosphere within a beautiful setting. Events will include invited speakers, special tracks, and presentations of papers and posters. As always, there will be a Best Paper award and a Best Poster award. In addition, because FLAIRS has a rich tradition of encouraging student authors, there will be a Best Student Paper award for the best paper written primarily by a student. A second submission round of poster abstracts (250 words) for presentation at the conference is now invited. 
Topics of interest are in all areas of (or related to) artificial intelligence, including, but not limited to Foundations: * Cognitive modeling * Knowledge representation * Learning * Perception * Reasoning and programming * Search Architectures: * Agents and distributed AI * Information retrieval * Intelligent user interfaces * Natural language systems * Robotics Applications: * Aviation and aerospace * Education * Entertainment * Management and manufacturing * Medicine * World Wide Web Implications: * Evaluation of AI systems * Philosophical foundations * Social impact and ethics * Teaching AI Special Areas: * AI and Cyber Security * Culture-Aware AI Systems * AI in Business * Semantics, Logic and Information Extraction in AI * Affective Computing * Intelligent Tutoring Systems * Games and Multimedia * Data Mining * Case-Based Reasoning * Uncertain Reasoning * Intelligent Autonomous Systems * Robotics * Healthcare Informatics * Applied Natural Language Processing ============= Location ============= The 27th International Conference of FLAIRS takes place on the beachfront in the center of everything on Pensacola Beach, Florida! Enjoy captivating views of the emerald waters of the Gulf of Mexico and 52 miles of snow white sand beaches from your private Gulf Front balcony. Our hotel is within walking distance from restaurants, vibrant nightlife and recreational activities such as charter fishing, paddle boarding, sailing, kayaking, biking, and miniature golf. We are just a quick drive to historic downtown Pensacola and regional attractions such as the National Naval Aviation Museum, Gulf Islands National Seashore and Fort Pickens, Pensacola Lighthouse, Blue Wahoos Baseball Stadium, Veterans Memorial Park, Pensacola Civic Center, and the Saenger Theater. 
============= Important Dates ============= February 10, 2014 Second submission (Abstract only) February 17, 2014 Abstract only notification February 24, 2014 Camera ready version due ============= Submissions ============= We are offering a second round for submission of poster abstracts (up to 250 words) to give new and developing research an outlet for peer feedback. Each submitted poster abstract will be subject to an editorial review to determine acceptability. Editors can make spelling, grammatical, and formatting changes to the abstracts. All posters will be presented according to space and time availability. An author of each accepted poster abstract is required to register, attend, and present the poster. It is a requirement of FLAIRS that there will be at least one early regular author registration per submission. Submissions must be original, and not submitted concurrently to another conference. Abstract papers must be in AAAI format, and submitted as PDF through the EasyChair conference system, which can be accessed through the main conference web site (www.FLAIRS-27.info). The proceedings of FLAIRS-27 will be published by the AAAI. Authors of accepted papers will be required to sign a form transferring copyright of their contribution to AAAI. An author of each accepted paper is required to register, attend, and present the poster at FLAIRS. All submissions and registrations must comply with FLAIRS requirements as posted on the conference website. ============= ORGANIZATION ============= Conference Chair: G. 
Michael Youngblood, PARC, a Xerox company Program Chairs: Bill Eberle, Tennessee Technological University Chutima Boonthum-Denecke, Hampton University Special Tracks Coordinator: Ingrid Russell, University of Hartford Contact Bill Eberle: weberle at tntech.edu From hamon at LIMSI.FR Tue Feb 4 21:46:39 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:46:39 +0100 Subject: Appel: Hackathon Participation, WASSA 2014@ACL 2014 Message-ID: Date: Tue, 04 Feb 2014 11:33:06 +0100 From: Alexandra Balahur Dobrescu Message-id: <52F0C1E2.4060806 at jrc.ec.europa.eu> X-url: http://optima.jrc.it/wassa2014/ Apologies for cross-postings ***************************************************************** 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA 2014) http://optima.jrc.it/wassa2014/ - Endorsed by SIGNLL and SIGSEM - Sentiment analysis systems hackathon Since different sentiment analysis systems have been proposed and showcased in the past years, we feel there is a growing need to make other researchers and users familiar with these systems and have them employ them for building an end application. The word "Hackathon" stands for "Hacking Marathon", and its purpose is to introduce a system and have users employ it to build an application for a real-life task. We would like to invite researchers who have a working prototype of their system to announce their availability to participate in the WASSA 2014 Hackathon. 
The systems can be oriented to the following tasks: * The use of semantic resources and methods (knowledge bases, semantic representations, inference mechanisms) for subjectivity, sentiment and emotion analysis * Opinion retrieval, extraction, categorization, aggregation and summarization * Trend detection in social media using subjectivity and sentiment analysis techniques * Data linking through social networks based on affect-related NLP methods * Mass opinion estimation based on NLP and statistical models * Online reputation management * Subjectivity, sentiment and emotion detection in social networks * Classification of stance in dialogues * Subjectivity and sentiment analysis in social media To participate with a system or as a "hacker" of one of the participating systems, please send an e-mail to: Alexandra.Balahur at jrc.ec.europa.eu In order to speed up the process, we will put the "hackers" in contact with the system authors, so that the applications can be explored prior to the WASSA 2014 workshop. We will also invite papers describing the results of these experiments as part of the final workshop. We are looking forward to your participation! On behalf of the WASSA 2014 Organizers, Alexandra Balahur From hamon at LIMSI.FR Tue Feb 4 21:48:07 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:48:07 +0100 Subject: Job: Recherche un ingenieur d'etudes, ELDA Message-ID: Date: Tue, 04 Feb 2014 16:29:09 +0100 From: ELRA ELDA Information Message-ID: <52F10745.8010207 at elda.org> X-url: http://www.elra.info X-url: http://www.elda.org ELDA (European Language Resources Distribution Agency) is seeking to fill immediately a position of research and development engineer. He/she will be in charge of studies and technical development in the field of language engineering, in particular in speech, multimedia, and multimodal technologies. 
Within the R&D department, he/she will work on projects developing tools for the production, validation, and management of language resources, within collaborative and international projects. This position is an excellent opportunity for creative and motivated candidates who wish to take an active part in building the field of language engineering in the European Union. Profile: * Higher-education degree (engineering degree or doctorate) in automatic speech processing or multimedia/multimodal processing (speech recognition/synthesis, image/video processing, document analysis/OCR, etc.). * Experience with and/or knowledge of research programs in Europe, the United States, or Japan. * Experience in project management; involvement in managing European projects would be a significant plus. * Good command of the Django programming environment, Django-CMS, Python, and PHP would be a plus. * Experience with and/or knowledge of issues related to language resources and the evaluation of language technologies. * Ability to work both independently and within a team, in particular the ability to supervise members of a multidisciplinary team. * Fluent English required. Salary: depending on qualifications and experience. Applications will be considered until the position is filled. The position is based in Paris. Candidates must be citizens of a European Union country or hold a residence permit. 
Applications should be sent by email (cover letter and curriculum vitae) to: Khalid CHOUKRI ELRA / ELDA 55-57, rue Brillat Savarin 75013 Paris FRANCE Fax: +33 1 43 13 33 30 Email: job at elda.org ELDA, the operational unit of the ELRA association, is responsible for promoting the development of language resources in all usable electronic forms, in particular spoken and written corpora, lexicons, and terminology databases. Since its creation in 1995, ELDA has established itself as a unique centre in Europe for the distribution of language resources, able to meet the various needs of technology developers. Its activities are now expanding to new types of language resources (multimodal/multimedia data). Some language resources are designed in the course of projects (co-)funded by ELDA. These are then compiled into a catalogue of language resources. ELDA is involved in a number of European and national projects. ELDA also takes an interest in legal issues relating to language resources, regularly carries out market studies on user needs, and works on improving resource validation procedures. 
For further information about ELRA/ELDA, see: http://www.elra.info or www.elda.org From hamon at LIMSI.FR Tue Feb 4 21:44:50 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:44:50 +0100 Subject: Appel: Roadmapping Workshop on Linked Data for Language Technology - #LiderEU Message-ID: Date: Tue, 4 Feb 2014 09:24:51 +0100 From: Maud Ehrmann Message-ID: X-url: http://2014.data-forum.eu/ X-url: http://lider-project.eu/ X-url: http://www.multilingualweb.eu/ X-url: http://nlp2rdf.org X-url: http://linguistics.okfn.org/ X-url: http://dbpedia.org Dear all, I would like to draw your attention to the Linked Data for Language Technology (LD4LT) workshop (http://www.w3.org/community/ld4lt/wiki/LD4LT_Group_Kick-Off_and_Roadmap_Meeting) that will be organized as part of the European Data Forum 2014 (http://2014.data-forum.eu/). The goal of the workshop is to gather use cases and requirements as a basis to develop a European R&D road-map and agenda for content analytics solutions based on Linked Data. It is a free community event - there is no admission fee for participants, but registration is required (https://www.w3.org/2002/09/wbs/1/lider-rmws-20140321/). You are encouraged to provide a title for a position statement in your registration form. 
This is a simple, short statement that summarizes your ideas / technologies / use cases related to Linked Data and Language Technology. The meeting is supported by the LIDER (http://lider-project.eu/) project, the MultilingualWeb community (http://www.multilingualweb.eu/), the NLP2RDF project (http://nlp2rdf.org), the Working Group for Open Data in Linguistics (http://linguistics.okfn.org/) as well as the DBpedia Project (http://dbpedia.org). As input to the discussion and the work of the LD4LT group, you may also want to fill in the first LIDER survey (https://www.w3.org/2002/09/wbs/1/ld4lt-wbs1/). We hope to see you at the workshop. Best Regards, Maud Ehrmann, on behalf of the LIDER consortium. From hamon at LIMSI.FR Sat Feb 8 11:10:30 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:10:30 +0100 Subject: Stage: Development of linguistic resources to improve an information extraction tool Message-ID: Date: Thu, 6 Feb 2014 15:28:38 +0100 From: Dhouha Bouamor Message-ID: Trooclick France is a company that specializes in the development of web applications for the automatic processing of information. Our goal is to create services that rebuild the user's trust in digital content. Up to now, Web players were able to enhance the relevance of this content; we go a step further and contribute to improving its reliability. Trooclick was created in November 2012. Just a few months later, in April 2013, it received financial support from the BPI (French public investment bank) and in June 2013 the French government granted it the status of "Young Innovative Company" (JEI), recognizing its innovative nature. It now counts twelve committed and passionate members in its tight-knit team. The company carries out R&D projects in search of technical solutions in the Artificial Intelligence field. Due to its growth, Trooclick is now looking for candidates for a 6-month internship at its office in Paris (17ème). 
Missions: As a member of the technical team, you will benefit from ongoing training and you will help us design and build our information extraction framework based on advanced NLP technologies. You will turn ideas into well-documented and reliable linguistic resources (both dictionaries and extraction rules) to ensure efficiency, quality, performance and scalability. A great team player, you will interact with other departments to understand and fine tune specifications. You will carry out unitary testing, create and maintain our test validation corpus and participate in editing technical documents. All developments will be done in English. Qualifications: - BSc/MSc - Experience with NLP tools such as Gate, Treetagger, NooJ, Stanford for linguistic annotation, named entity recognition, relationship and fact extraction, sentiment analysis, etc. - Experience in scripting languages such as Perl or Python as well as XML format to be autonomous in completing some technical tasks. - Experience with basic database management operations (SQL language) Knowledge of Semantic Web technologies (RDF, OWL, SKOS, etc.) will be a plus. - Excellent communication skills in English and French - We are open to new ideas that will significantly contribute to our success. Our friendly team will provide the opportunity for valuable collaboration. - We offer you career perspectives in a young and dynamic company with an interesting and diversified scope of duties at the cutting edge of research. We welcome applications from highly motivated individuals able to learn new techniques and share knowledge and experience with the team. Interested? Then send your application to jobs at trooclick.com! 
From hamon at LIMSI.FR Sat Feb 8 10:54:13 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:54:13 +0100 Subject: Appel: INTERSPEECH 2014 Message-ID: Date: Thu, 06 Feb 2014 09:56:31 +0800 From: "Organization @ Interspeech 2014" Message-ID: <52F2EBCF.5070801 at interspeech2014.org> X-url: http://www.interspeech2014.org --- INTERSPEECH 2014 - SINGAPORE --- September 14-18, 2014 --- http://www.interspeech2014.org INTERSPEECH is the world's largest and most comprehensive conference on issues surrounding the science and technology of spoken language processing, both in humans and in machines. The theme of INTERSPEECH 2014 is --- Celebrating the Diversity of Spoken Languages --- INTERSPEECH 2014 emphasizes an interdisciplinary approach covering all aspects of speech science and technology spanning basic theories to applications. In addition to regular oral and poster sessions, the conference will also feature plenary talks by internationally renowned experts, tutorials, special sessions, show & tell sessions, and exhibits. A number of satellite events will take place immediately before and after the conference. Please follow the details of these and other news at the INTERSPEECH website www.interspeech2014.org . 
--------- SCOPE ----------- We invite you to submit original papers in any related area, including but not limited to: 1: Speech Perception and Production 2: Prosody, Phonetics, Phonology, and Para-/Non- Linguistic Information 3: Analysis of Speech and Audio Signals 4: Speech Coding and Enhancement 5: Speaker and Language Identification 6: Speech Synthesis and Spoken Language Generation 7: Speech Recognition - Signal Processing, Acoustic Modelling, Robustness, and Adaptation 8: Speech Recognition - Architecture, Search & Linguistic Components 9: LVCSR and Its Applications, Technologies and Systems for New Applications 10: Spoken Language Processing - Dialogue, Summarization, Understanding 11: Spoken Language Processing -Translation, Info Retrieval 12: Spoken Language Evaluation, Standardization and Resources A detailed description of these areas is accessible at: http://www.interspeech2014.org/public.php?page=conference_areas.html ------- SUBMISSIONS ------- Papers for the INTERSPEECH 2014 proceedings should be up to 4 pages of text, plus one page (maximum) for references only. Paper submissions must conform to the format defined in the paper preparation guidelines and provided in the Authors' kit, on the INTERSPEECH 2014 website, along with the Call for Papers. Optionally, authors may submit additional files, such as multimedia files, which will be included in the official conference proceedings USB drive. Authors must declare that their contributions are original and are not being submitted for publication elsewhere (e.g. another conference, workshop, or journal). Papers must be submitted via the online paper submission system, which will be opened in February 2014. The conference will be conducted in English. 
Information on the paper submission procedure is available at: http://www.interspeech2014.org/public.php?page=submission_procedure.html ----- IMPORTANT DATES ----- 24 March, 2014 Full Paper submission deadline 10 June, 2014 Notification of acceptance 20 June, 2014 Camera-ready paper due 10 July, 2014 Early registration deadline 14-18 September, 2014 Conference dates We look forward to welcoming you to INTERSPEECH 2014 in Singapore! - The Organizing team of INTERSPEECH 2014 - Email: tpc at interspeech2014.org From hamon at LIMSI.FR Sat Feb 8 11:21:37 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:21:37 +0100 Subject: Appel: TOTh 2014, Extension de la date limite Message-ID: Date: Fri, 7 Feb 2014 17:52:33 +0100 From: Luc Damas Message-ID: X-url: http://www.porphyre.org/toth/ X-url: http://www.porphyre.org/toth/toth-2014/appel-a-communications The deadline for abstract submissions for the TOTh 2014 Conference has been extended to Monday 24th February. 
----------------------------------------------------- TOTh 2014: CALL FOR PAPERS Terminology & Ontology: Theories and Applications Chambéry - France 24 February 2014: Deadline for abstract submission (2 to 3 pages) Conference: 12 & 13 June 2014 Training: 10 & 11 June 2014 http://www.porphyre.org/toth/ http://www.porphyre.org/toth/toth-2014/appel-a-communications http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf Submission via EasyChair: ----------------------------------- Extended abstracts (2 to 3 pages) must be submitted exclusively via EasyChair: http://www.easychair.org/conferences/?conf=toth2014 Presentation: ------------------ The 8th edition of the TOTh international conferences will take place on Thursday 12 and Friday 13 June 2014 at the Université de Savoie (Chambéry, France), with the support of the French Ministry of Culture (DGLFLF) and the European Association for Terminology. The TOTh conferences (Terminology & Ontology: Theories and Applications) aim to bring together researchers, professionals and, more generally, anyone whose interests lie in language and knowledge engineering. In addition to the accepted papers, the programme will include an invited talk, a disputatio and a poster session. A one-and-a-half-day training session on the analysis of texts for onomasiological purposes will precede the conference on Tuesday 10 and Wednesday 11 June. 
A "Young Researcher" prize is awarded by the programme committee during the conference. Conference topics (non-exhaustive list): -------------------------------------------------------------- - Specialised languages and specialised lexicography, - Translation, - Corpus processing, - Lexicons, Dictionaries, Terminologies, - Language planning and terminology planning, - Knowledge representation, extraction and management, - Conceptual systems, - Ontologies, - Thesauri, - Applications: Information retrieval, Web of data, Digital libraries, Development environments, Term banks, - Standards, Exchange formats. TOTh 2014 http://www.porphyre.org/toth/ http://www.porphyre.org/toth/toth-2014/appel-a-communications http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf From hamon at LIMSI.FR Sat Feb 8 11:02:25 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:02:25 +0100 Subject: Job: CDI, Web-Miner, MarketScience Message-ID: Date: Thu, 06 Feb 2014 11:40:20 +0100 From: Aude Robert Message-ID: <52F36694.8060503 at marketscience.fr> Title : CDI, Web-Miner, MarketScience Company: MarketScience Department: IT Software City : Orléans (France, 45) Job Type : Full time / CDI Languages : English and French IDEAL CANDIDATE: We are looking for a candidate who will work on our 
Media Mining module. The module will aggregate and store information extracted from the Internet that will be integrated into a financial market Bloomberg application.
REQUIREMENTS :
Autonomous and very ambitious
Good experience in Python or PHP, and HTML is necessary.
Web crawler/information retrieval knowledge
Database management (MySQL/SQL Server) knowledge
Focused on code quality
Knowledge in Natural Language Processing, Text Mining or Information Retrieval
Very good English writing and communication skills
Planning and long-term thinking
CAREER LEVEL : 3 Year Exp.
RESPONSIBILITIES :
Develop and maintain a web mining system.
Develop and maintain a storage solution for the extracted information.
Process extracted information (Natural Language Processing)
Ensure long-term scaling of the solution.
Excellent compensation for an excellent & ambitious candidate.
DESCRIPTION OF COMPANY : MarketScience is a French/Romanian Start-up in Information Technology/Finance. Salary will be defined according to profile and experience. 
For more information, please contact aude.robert at marketscience.fr To submit an application, please send your CV and cover letter to both nicolasboitout at hotmail.com From hamon at LIMSI.FR Sat Feb 8 10:43:02 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:43:02 +0100 Subject: Appel: CBMI2014, Submission deadline 16 Feb, Workshop on Content-Based Multimedia Indexing Message-ID: Date: Wed, 05 Feb 2014 09:59:19 +0100 From: Bernard Merialdo Message-ID: <52F1FD67.1080806 at eurecom.fr> X-url: http://cbmi2014.itec.aau.at/ (our apologies for duplicate postings) ************************************************************************* CALL FOR PAPERS Paper submission deadline: February 16, 2014 C B M I 2014 12th International Workshop on Content-Based Multimedia Indexing 18-20 June 2014, Klagenfurt, Austria http://cbmi2014.itec.aau.at/ in cooperation with IEEE CSS and ACM SIGMM ************************************************************************* The 12th International Content Based Multimedia Indexing Workshop aims to bring together the various communities involved in all aspects of content-based multimedia indexing, retrieval, browsing and presentation. 
Following the eleven successful previous events of CBMI (Toulouse 1999, Brescia 2001, Rennes 2003, Riga 2005, Bordeaux 2007, London 2008, Chania 2009, Grenoble 2010, Madrid 2011, Annecy 2012, and Veszprem 2013), CBMI 2014 will take place in Klagenfurt, in the very south of Austria from June 18th to June 20th 2014. CBMI 2014 is organized in cooperation with IEEE Circuits and Systems Society and ACM SIG Multimedia. Topics of the workshop include but are not limited to visual indexing, audio and multi-modal indexing, multimedia information retrieval, and multimedia browsing and presentation. Additional special sessions are planned in the fields of endoscopic videos and images and multimedia metadata. Paper submission: Authors are invited to submit full-length and special session papers of 6 pages and short (poster) and demo papers of 4 pages maximum. All peer-reviewed, accepted and registered papers will be published in the CBMI 2014 workshop proceedings to be indexed and distributed by the IEEE Xplore. The submissions are peer reviewed in single blind process, the language of the workshop is English. Selected papers of the conference will be invited to submit extended versions of their contributions to a special issue of Multimedia Tools and Applications journal (MTAP). * Important dates: Paper submission deadline: February 16, 2014 Notification of acceptance: March 30, 2014 Camera-ready papers due: April 14, 2014 Author registration: April 14, 2014 Early registration: May 25, 2014 * Topics of interest include, but are not limited to: * Visual Indexing (image, video, graphics) * Visual content extraction Identification and tracking of semantic regions * Identification of semantic events * Audio and Multi-modal Indexing * Audio indexing (audio, speech, music) * Audio content extraction * Multi-modal and cross-modal indexing * Multimedia fusion * Metadata generation, coding and transformation * Multimedia Information Retrieval (image, audio, video, ...) 
* Matching and similarity search * Content-based search * Multimedia data mining * Multimedia recommendation * Large scale multimedia database management * Multimedia Browsing and Presentation * Summarization, browsing and organization of multimedia content * Personalization and content adaptation * User interaction and relevance feedback * Multimedia interfaces, presentation and visualization tools * Contact For more information please visit http://cbmi2014.itec.aau.at/ and for additional questions, remarks, or clarifications please contact cbmi2014 at itec.aau.at From hamon at LIMSI.FR Sat Feb 8 10:49:09 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:49:09 +0100 Subject: Appel: LRE-Rel2 Religious Text Analytics Workshop, LREC 2014 Message-ID: Date: Wed, 5 Feb 2014 15:06:58 +0000 (GMT) From: Eric Atwell Message-ID: X-url: http://www.comp.leeds.ac.uk/arabic/lre-rel2.html 2nd Workshop on Language Resources and Evaluation for Religious Texts (LRE-Rel2) 31st May 2014 (afternoon session) Harpa Conference Center, Reykjavik, Iceland http://www.comp.leeds.ac.uk/arabic/lre-rel2.html Second Call for Papers, deadline: 19.02.2014 After a successful launch at LREC 2012, we are organising a second workshop on Language Resources and Evaluation for Religious Texts hosted by LREC 2014 in Reykjavik, Iceland. This is an inclusive workshop title, aimed at researchers with a generic interest in religious texts to raise awareness of different perspectives and practices, and to identify some common themes. Our first workshop attracted a range of scholarship, particularly on Arabic and Islamic Studies, and this year we are keen to extend this range to canonical texts from other languages and religions - Christian, Jewish, Sikh, Buddhist, and other - and, given the Icelandic venue, to iconic texts from historical religions such as The Prose Edda of Norse mythology. 
We are also keen to foster inter-faith corpus studies, tracing similarities as well as differences in religious texts, where this genre includes: the faith-defining religious canon; authoritative interpretations and commentary; sermons, liturgy, prayers, poetry, and lyrics. Workshop Attendance: Prospective authors should note that attendance at the workshop is necessary for their paper to be published in the workshop proceedings, though for papers with multiple authors, only one author will be required to attend. Therefore, prospective authors are strongly encouraged to check the LREC website and other sources for information on visa restrictions, and likely travel and hotel costs. Workshop Topics: Submissions are invited for (but not limited to) the following topics: - measuring semantic relatedness between multiple religious texts and corpora from different religions; - analysis of ceremonial, liturgical, and ritual speech; recitation styles; speech decorum; discourse analysis for religious texts; - formulaic language and multi-word expressions in religious texts; - suitability of modal and other logic types for knowledge representation and inference in religious texts; - issues in, and evaluation of, machine translation in religious texts; - text-mining, stylometry, and authorship attribution for religious texts; - corpus query languages and tools for exploring religious corpora; - dictionaries, thesauri, Wordnet, and ontologies for religious texts; - (new) corpora and rich and novel annotation schemes for religious texts; - annotation and analysis of religious metaphor; - genre analysis for religious texts; - application in other disciplines (e.g. theology, classics, philosophy, literature) of computer-mediated methods for analysing religious texts. 
Important Dates: 19.02.2014 Deadline for paper submissions 05.03.2014 Notification of acceptance 21.03.2014 Camera-ready copies due from authors 06.04.2014 LREC deadline for camera-ready copy of Workshop Proceedings 31.05.2014 LRE-Rel2 Workshop (afternoon session) Submission of Papers: Please submit your papers via the LRE-Rel2 submission page in LREC's START Manager. The URL for this is: https://www.softconf.com/lrec2014/LRE-Rel2/ Authors should submit a full paper, not an abstract. They should also follow LREC guidelines for length of papers, and are advised to use the LREC template for paper submissions. This template will be published by LREC in due course. In the meantime, authors can use the LREC 2012 template: http://www.lrec-conf.org/lrec2012/?Authors-Kit Request from LREC Organisers: "...When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc..." 
LRE-Rel2 Organising Committee: Eric Atwell and Claire Brierley: Computing, University of Leeds, LEEDS, LS2 9JT, UK E.S.Atwell at leeds.ac.uk; C.Brierley at leeds.ac.uk Majdi Sawalha and Bassam Hammo: Computer Information Systems, University of Jordan, AMMAN 11942, Jordan sawalha.majdi at leeds.ac.uk; LRE-Rel2 Programme Committee (provisional): Eric Atwell, School of Computing, University of Leeds, UK Claire Brierley, School of Computing, University of Leeds, UK Majdi Sawalha, Computer Information Systems, University of Jordan, Jordan Bassam Hammo, Computer Information Systems, University of Jordan, Jordan Sane Yagi, Department of Linguistics, University of Jordan, Jordan Dag Haug, Department of Philosophy, History of Art and Ideas, University of Oslo, Norway Moshe Koppel, Department of Computer Science, Bar-Ilan University, Israel Nils Reiter: Department of Computational Linguistics, Heidelberg University, Germany Andrew Wilson, Department of Linguistics and English Language, University of Lancaster, UK Claudia Resch, Institute for Corpus Linguistics and Text Technology, Austrian Academy of Sciences, Austria Nadeem Obaid, Computer Information Systems, University of Jordan, Jordan Muhammad A.M. 
Abushariah, Computer Information Systems, University of Jordan, Jordan Mortaza Rezaee, Islamic College, London, UK Janet Watson, Arabic and Middle Eastern Studies, and Linguistics and Phonetics, University of Leeds, UK John Lee, Halliday Centre for Intelligent Applications of Language Studies, City University of Hong Kong, (HK) Mohamed Menacer, Taibah University, Saudi Arabia Deryle Lonsdale, Department of Linguistics and English Language, Brigham Young University, US Bob MacDonald, Research and Development, Anthony Macauley Associates, Canada Kais Dukes, School of Computing, University of Leeds, UK Aida Mustapha, Department of Computer Science and Information Technology, Putra University, Malaysia Liviu Dinu, Centre for Computational Linguistics, University of Bucharest, Romania Behrooz Minaei, School of Computer Engineering, Iran University of Science and Technology, Iran Gurpreet Singh, Centre for Language and Communication Studies, Trinity College Dublin, Ireland Azzeddine Mazroui, Mathematics and Computer Science, Mohammed 1st University, Morocco From hamon at LIMSI.FR Sat Feb 8 11:03:30 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:03:30 +0100 Subject: These: Rima Harastani, Alignement lexical en corpus comparables Message-ID: Date: Thu, 6 Feb 2014 12:40:10 +0100 From: Rima Harastani Message-ID: Hello, I am pleased to invite you to the defence of my PhD thesis, carried out at the Laboratoire d'Informatique de Nantes Atlantique (LINA) and entitled "Alignement lexical en corpus comparables : le cas des composés savants et des adjectifs relationnels" (Lexical alignment in comparable corpora: the case of neoclassical compounds and relational adjectives). The defence will take place on Monday 10 February 2014 at 2:00 pm in room 3 at LINA (building 11, Faculté des Sciences et des Techniques). You are also invited to the reception that will follow. Best regards, Rima Harastani *************************** The thesis committee will be composed of: Mr Hervé BLANCHON, Associate Professor, Université Pierre Mendès - Reviewer Mr 
Ulrich HEID, Full Professor, Université de Stuttgart - Reviewer Mr Vincent CLAVEAU, Research Scientist, CNRS-IRISA Rennes - Examiner Mr Holger SCHWENK, Full Professor, Université du Maine - Examiner Ms Béatrice DAILLE, Full Professor, Université de Nantes - thesis co-supervisor Mr Emmanuel MORIN, Full Professor, Université de Nantes - thesis supervisor *************************** Summary: Our work concerns the automatic extraction of a list of terms aligned with their translations (that is, a specialized bilingual lexicon) from a comparable corpus in a specialized domain. A comparable corpus comprises texts written in two different languages that are not translations of one another but belong to the same domain. The contributions of this thesis concern improving the quality of a specialized bilingual lexicon extracted from a comparable corpus. We propose methods devoted to the translation of two types of terms, which either share characteristics across several languages or are inherently problematic for translation: neoclassical compounds (terms containing at least one root borrowed from Greek or Latin) and terms composed of a noun and a relational adjective. We also develop a method that exploits contexts rich in terms specific to the corpus domain in order to re-rank, within a specialized bilingual lexicon, the candidate translations provided for a term. The experiments are carried out on two specialized comparable corpora, the first in the domain of breast cancer and the second in the domain of renewable energy, for French, English, German and Spanish. Keywords: comparable corpora, specialized language, multilingual alignment, neoclassical compounds, relational adjectives. 
*************************** Abstract: Our work concerns the automatic extraction of a list of aligned terms with their translations (i.e. specialized bilingual lexicon) from comparable corpora belonging to a specific domain. Comparable corpora include texts written in two languages which are not translations of each other but belong to the same domain. This thesis contributes to the improvement of the quality of an extracted bilingual lexicon. We propose methods dedicated to the translation of two types of terms that have common characteristics among many languages or that cause specific problems for translation due to their nature. These types of terms are the neoclassical compounds (terms containing at least one root borrowed from Greek or Latin) and the terms composed of one noun and one relational adjective. We also propose a method, which exploits contexts rich in domain-specific terms to re-rank some provided translations in a bilingual lexicon for a given term. The experiments are performed using two specialized comparable corpora, the first in the domain of breast cancer and the second in the domain of renewable energy, on the languages French, English, German and Spanish. Keywords: comparable corpora, specialized language, multilingual alignment, neoclassical compounds, relational adjectives. 
From hamon at LIMSI.FR Sat Feb 8 10:44:31 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:44:31 +0100 Subject: Appel: Journal of Language Modelling, Special issue on Grammar Engineering (deadline extension) Message-ID: Date: Wed, 5 Feb 2014 10:53:22 +0100 From: Yannick Parmentier Message-ID: <20140205095322.GB1924 at haku> X-url: http://jlm.ipipan.waw.pl/ Message-ID: <20140205095322.GB1924 at haku> User-Agent: Mutt/1.5.21 (2010-09-15) **Apologies for cross-posting. Please post or distribute.** *DEADLINE EXTENSION: February, 23rd* CALL FOR PAPERS Journal of Language Modelling SPECIAL ISSUE ON High-level Methodologies for Grammar Engineering http://jlm.ipipan.waw.pl/ Special Issue Description ------------------------- Many grammatical frameworks have been proposed over the last decades to describe the syntax (and semantics) of natural language. Among the most widely used, one may cite (in chronological order) Tree-Adjoining Grammar (TAG) [Joshi et al., 1975], Lexical-Functional Grammar (LFG) [Bresnan and Kaplan, 1982], Combinatory Categorial Grammar (CCG) [Steedman, 1987], or Head-driven Phrase Structure Grammar (HPSG) [Pollard and Sag, 1994]. These frameworks present theoretical and practical interests. From a theoretical point of view, they provide a formal device for the linguist to experiment with her/his theories. 
From a practical point of view, they make it possible to automatically process natural language in applications such as dialog systems, machine translation, etc. They differ in their expressivity and complexity. Some prove more suitable for the description of a given language than others. Still, for many of these frameworks, large resources (e.g., grammars) have been designed, at first by hand, and later via dedicated tools (e.g., integrated grammar environments such as XLE for LFG [King et al., 2000]). In this Special Issue, we are concerned with this complex task of grammar engineering, keeping in mind the two above-mentioned theoretical and practical interests. Topics of Interest ------------------ This special issue welcomes contributions from the following topics (without being limited to these): 1. New or updated work on the traditional topics of GEAF workshops The traditional topics in the series of "Grammar Engineering Across Frameworks" workshops, "Formal Grammar" conferences and / or the "High-level Methodologies for Grammar Engineering" ESSLLI 2013 workshop are appropriate. The submitted paper must clearly deal with language modelling. An extended version of a workshop/conference paper can be submitted, provided that the contribution has been substantially updated. The forums for any preliminary versions of the paper must be indicated. 2. Study of new fundamental and / or applied results in relation with grammar development, maintenance and enhancement - dedicated description languages for grammar engineering - semi-automatic acquisition of grammars - development of grammars for under-resourced languages - cross-framework grammar engineering - cross-language grammar engineering - seamless construction of grammars from dynamically changing linguistic descriptions - designs of weighted grammars - efficient and dynamic construction of structures from updated rules or logical formulas 3. 
Study of new methods for grammar engineering and their applications - debugging environments for grammar design - systems based on large-scale grammars 4. In addition, the topics of interest include reports on implementations - new kinds of implementations of grammar compilers, libraries and on-demand operations - benchmarks suitable for evaluation of grammars - methods that construct, minimize or decompose grammars Important Dates --------------- - Deadline for submissions: 23 February 2014 - First decision: 12 May 2014 - Submission of revised version: 17 July 2014 - Final decision: 22 September 2014 - Submission of camera-ready versions: 19 October 2014 Submission ---------- Articles submitted to this special issue must adhere to the JLM instructions for contributors and use the JLM's style files, see: http://jlm.ipipan.waw.pl/about/submissions#authorGuidelines Paper submissions should be made through JLM's electronic submission system: http://jlm.ipipan.waw.pl/about/submissions#onlineSubmissions All submitted articles should be anonymous and any obvious self-references should be avoided. The contribution of the paper must be substantial. All articles will be evaluated by at least 3 reviewers, including at least one member of JLM's Editorial Board, and at least one member of the issue's Guest Editorial Board. 
Editorial Board --------------- Guest Editors - Denys Duchier, Université d'Orléans, FRANCE - Yannick Parmentier, Université d'Orléans, FRANCE Guest Editorial Board - Emily Bender, University of Washington, USA - Philippe Blache, CNRS, Université d'Aix-en-Provence, FRANCE - Eric De La Clergerie, INRIA Paris, FRANCE - Benoît Crabbé, Université Diderot Paris 7, FRANCE - Berthold Crysmann, CNRS Paris, FRANCE - Denys Duchier, Université d'Orléans, FRANCE - Claire Gardent, CNRS/LORIA, Nancy, FRANCE - Joseph Le Roux, Université Paris Nord, FRANCE - Timm Lichte, Universität Düsseldorf, GERMANY - Montserrat Marimon, Universitat de Barcelona, SPAIN - Yusuke Miyao, National Institute of Informatics, Tokyo, JAPAN - Stefan Müller, Freie Universität Berlin, GERMANY - Yannick Parmentier, Université d'Orléans, FRANCE Contact ------- Yannick Parmentier Department of Computer Science, University Institute of Technology University of Orléans firstname (dot) lastname (at) univ-orleans (dot) fr From hamon at LIMSI.FR Sat Feb 8 11:13:20 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:13:20 +0100 Subject: Seminaire: Exploration textometrique des thematiques dans un corpus, 13 fevrier 2014, Universite de Paris Est Marne la Vallee Message-ID: Date: Thu, 06 Feb 2014 16:12:20 +0100 From: Nicolas Turenne Message-ID: <52F3A654.3000909 at yahoo.fr> X-url: 
http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

[apologies for cross-posting]

Dear colleagues,

For the fifteenth session of the CorText group's 'traces digitales' (digital traces) seminar, we are pleased to welcome Philippe Gambette (associate professor, Université Paris-Est Marne la Vallée) and Mathieu Brugidou (senior researcher (HDR), EDF), who will speak about the statistical analysis of textual data. The session will take place on Thursday 13 February from 10:00 to 13:00 in meeting room C219 of the Latts (Université Paris-Est Marne la Vallée, rue Galilée, Bois de l'Etang building, 2nd floor, wing C). See http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

10:00-11:30 Philippe Gambette (associate professor, LIGM, Université Paris-Est Marne-la-Vallée)
Textometric exploration of themes in a corpus of biodiversity research projects
Abstract: Drawing on a corpus of more than 4,000 abstracts of projects funded by some fifteen agencies in various European countries, we give an overview of the temporal and geographical trends in biodiversity research funded in Europe between 2004 and 2011. To do so, we combine factor analyses, vocabulary specificity scores and tree cloud visualizations in an exploratory analysis enriched by the perspective of a domain expert. The strengths and limitations of tree cloud visualization will be discussed in particular, and examined further through other use cases.

11:30-13:00 Mathieu Brugidou (senior researcher (HDR), EDF)
The Grenelle de l'environnement: a morphological and formal approach using the textual data analysis software Alceste and Tropes.
Abstract: The working hypothesis of this study is that the Grenelle de l'environnement can be regarded as a collective writing mechanism.
It is not only a mechanism for stakeholder participation and consultation, strongly marked by negotiation, as the reuse of the term Grenelle indicates. This consultation process also turns out to be a collective writing mechanism intended to produce summaries and public policy measures. Two types of approach are favored in this analysis. The first, morphological, is oriented towards analyzing how problems are represented; the second, formal [Lassegue, 2010], is devoted to analyzing how measures are formulated:

The morphological perspective seeks to bring out the thematic structures of the various corpora and their dynamics. We hypothesize that the Grenelle process, and in particular its design in phases, produces maps of the problems, and that these problems can only be represented relationally. For the Grenelle participants, defining the problems involves sorting the issues, first into those that do or do not fall within the scope of the Grenelle de l'environnement, then among those that define the different problems identified during the Grenelle. Textual data analysis methods are favored here, in particular those in the tradition of Benzécri, such as the Alceste method developed by M. Reinert, which are geared towards an empirical approach to structures;

A second perspective can be described as formal insofar as it focuses on the effects of the writing mechanisms on the form of the measures. The aim is to describe how the measures are given form, on the hypothesis that, beyond their content, they have specific linguistic characteristics that follow a grammar [Lemieux, 2009] which defines their pragmatic properties. This second, more analytical approach favors diachronic studies that follow the phasing of the process.
These will mainly be morphosyntactic analyses carried out with the Tropes software. They will be performed on corpora built according to different parameters of time (the phases of the negotiation) and of discourse space (the various writing mechanisms, which correspond to the phases of the Grenelle but also to subspaces of the reports, for example the contrast between summary and appendices).

From hamon at LIMSI.FR Sat Feb 8 11:25:34 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:25:34 +0100 Subject: Stage: Master recherche, Annotation semantique dynamique/Dynamic semantic annotation, LIPN, Universite Paris 13 Message-ID: Date: Fri, 7 Feb 2014 18:49:22 +0100 From: Adeline Nazarenko Message-Id: <300A46BB-69F2-47EC-9292-B3576E7A713E at lipn.univ-paris13.fr>

(English version below)

Research master's internship proposal
Dynamic semantic annotation

Keywords: natural language processing, semantic annotation, content management, knowledge engineering, semantic web
Supervisors: Adeline Nazarenko and François Lévy (LIPN, Université Paris 13 – Sorbonne Paris Cité & CNRS)
Duration: 4 to 6 months (spring-summer 2014)
Stipend: €430/month (Labex EFL)

Problem statement
Semantic annotation of documents now plays a key role in many applications for managing textual content (text navigation, semantic information retrieval, document restructuring, etc.). Semantic annotation consists in attaching to a text information, or metadata, whose semantics is carried by a formal semantic model (an indexing language, thesaurus or ontology, for example) [13, 6, 14]. The text is thus associated with a formal semantic representation, and search engines or software agents can exploit both the textual content (full-text search, distributional computations) and the formal semantics associated with it. First-generation annotation tools are fairly crude: they often merely link mentions of named entities identified in texts to existing or new instances of concepts in an ontology [10, 4]. The development of specialized content management applications and the rise of the web of data now call for a rethink of semantic annotation methods: we need methods and tools that offer richer annotation expressiveness (e.g. annotating not only concept instances but also concepts and relations) while being robust, generic and adaptable to different domains and contexts of use.

Goal of the internship
The internship will lead to a semantic annotation method that incorporates annotation quality measures and allows the annotation to be revised dynamically. The semantic model is assumed to be an ontology. If we consider that an annotation system S = <O, T, A> is composed of an ontology O, a text T and a set A of annotations or links associating segments of T with entities of O, then the system S must be revised whenever one of its components is updated (the text is modified, the ontology is enriched or restructured) or when inconsistencies or coverage gaps are detected.
The internship will consist in studying the different situations in which such an annotation system must be revised and in proposing a dynamic annotation method that integrates revision processes. The dynamic annotation method must 1) integrate consistency criteria and coverage measures to identify when an annotation system needs revising, 2) propose revision procedures suited to the different situations and 3) control the convergence of the overall revision process. Starting with the simplest types of annotation (e.g. a text annotated with the instances and concepts of an ontology), the intern will propose a dynamic annotation method. He or she will be able to draw on the RCLN team's existing semantic annotation tools, on the expertise of the team members and on real use cases to assess the benefit of this dynamic approach to annotation. Ideally the proposed method will be integrated directly into an existing annotation tool, but it may also be tested in simulation if integration proves too costly.

Description of the work
The internship will comprise several parts: 1) a state of the art on semantic annotation and a survey of existing tools (those of the RCLN team or others); 2) description, modeling and implementation of the dynamic annotation process (for the simplest types of annotation, building on existing tools and/or semantic technologies); 3) analysis, testing and evaluation of the proposed approach on simplified real use cases provided by the RCLN team. In addition, with a view to continuing into a PhD, the intern may work on specifying a richer semantic annotation method that takes a wider range of annotation types into account.

Context and practical information
The internship will be supervised by Adeline Nazarenko and François Lévy.
The intern will join the RCLN team and benefit from its expertise in natural language processing, text-based knowledge engineering and the semantic web. In particular, the RCLN team has solid experience in semantic annotation (manual annotation [2, 3] or automatic annotation by machine learning [8], formalisms and resources for annotation [9, 12]) and in building ontologies from texts [1]. It also has experience integrating these annotation and acquisition methods into content analysis tools [7, 5, 11]. The intern will work at LIPN (Université Paris 13 – Sorbonne Paris Cité & CNRS), where he or she will be given an office, with access to all the technical resources and data needed for the work. The internship is planned to last 6 months and should start in spring 2014. It will be funded as part of a research operation of the "Computational semantic analysis" strand of the Labex "Empirical Foundations of Linguistics". Applications should be sent to François Lévy (francois.levy (at) lipn.univ-paris13.fr) before 7 March 2014: send a cover letter, a CV and master's transcripts.

References
[1] N. Aussenac-Gilles, S. Després, and S. Szulman. « The TERMINAE Method and Platform for Ontology Engineering from texts ». In Paul Buitelaar and Philipp Cimiano, editors, Bridging the Gap between Text and Knowledge - Selected Contributions to Ontology Learning and Population from Text, pages 199–223. IOS Press, January 2008.
[2] K. Fort. Les ressources annotées, un enjeu pour l’analyse de contenu : vers une méthodologie de l’annotation manuelle de corpus. PhD thesis in computer science, Université Paris 13 – Sorbonne Paris Cité, Villetaneuse, France, 2012.
[3] K. Fort, A. Nazarenko, S. Rosset. « Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis ».
In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, December 2012.
[4] C. Giuliano, A. Gliozzo. « Instance-based ontology population exploiting named-entity substitution ». In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pages 265–272, Manchester, August 2008.
[5] A. Guissé, F. Lévy, A. Nazarenko. Un moteur sémantique pour explorer des textes réglementaires. In Actes des 22èmes journées francophones d'Ingénierie des Connaissances, Chambéry, 2011.
[6] A. Kiryakov, B. Popov, I. Terziev, D. Manov, and D. Ognyanoff. « Semantic annotation, indexing, and retrieval ». Journal of Web Semantics, 2(1):49–79, 2004.
[7] F. Lévy, A. Nazarenko, A. Guissé. « Annotation, indexation et parcours de documents numériques ». Revue des Sciences et Technologies de l'Information, 13(3/2010):121–152, 2010.
[8] Y. Ma, F. Lévy, A. Nazarenko. Annotation sémantique pour des domaines spécialisés et des ontologies riches. In Actes de la 20ème conférence du Traitement Automatique du Langage Naturel (TALN 2013), pp. 464-478, 17-21 June 2013, Les Sables d’Olonne.
[9] Y. Ma, A. Nazarenko, L. Audibert. « Formal description of resources for ontology-based semantic annotation ». In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2010), Malta, May 2010. ELRA.
[10] B. Magnini, A. Pianta, O. Popescu, M. Speranza. « Ontology population from textual mentions: Task definition and benchmark ». In Proceedings of the OLP2 workshop on Ontology Population and Learning, Sydney, Australia, 2006.
[11] A. Nazarenko, A. Guissé, F. Lévy, N. Omrane, S. Szulman. « Integrating Written Policies in Business Rule Management Systems ». In Rule-Based reasoning, Programming, and Applications, volume 6826 of Lecture Notes in Computer Science, pages 99–113, Barcelona, Spain, 2011.
[12] N. Omrane, A. Nazarenko, P. Rosina, S. Szulman, C. Westphal.
« Lexicalized ontology for a business rules management platform: An automotive use case ». In Proceedings of the 5th International Symposium on Rules, International Business Rules Forum (RuleML@BRF), Ft Lauderdale, Florida, USA, November 2011.
[13] B. Popov, A. Kiryakov, D. Ognyanoff, D. Manov, A. Kirilov. « Kim – a semantic platform for information extraction and retrieval ». Natural Language Engineering, 10(3-4):375–392, 2004.
[14] V. Uren, P. Cimiano, J. Iria, S. Handschuh, M. Vargas-Vera, E. Motta, F. Ciravegna. « Semantic annotation for knowledge management: Requirements and a survey of the state of the art ». Journal of Web Semantics, 4, 2006.

-------------------------------------------------------
(French version above)

Proposal for a master internship
Dynamic semantic annotation

Keywords: Natural Language Processing, Semantic Annotation, Content Management, Knowledge Engineering, Semantic Web
Supervision: Adeline Nazarenko and François Lévy (LIPN, Université Paris 13 – Sorbonne Paris Cité & CNRS)
Duration: 4-6 months (spring-summer 2014)
Stipend: €430/month (Labex EFL)

Problem
The semantic annotation of documents plays a key role in many textual content management applications (e.g. navigation, semantic information retrieval, publication). Semantic annotation consists in enriching a text with metadata whose semantics is given by a formal semantic model (e.g. indexing language, thesaurus, ontology) [13, 6, 14]. A formal semantic representation is thus associated with the text so that search engines or software agents can jointly exploit the textual content (plain text search, distributional measures) and the formal semantics associated with it. The first-generation annotation tools are quite simple. They often merely link mentions of named entities identified in the texts to existing or new instances of concepts in an ontology [10, 4].
However, the development of specialized content management applications and of linked data calls for renewed methods of semantic annotation: we need methods and tools that provide richer annotation expressiveness (e.g. annotation with respect to concepts and relations, not only instances) while being robust, generic and adaptable to different domains and use cases.

Goal
The goal of the internship is to design a semantic annotation method that incorporates annotation quality measures and enables the dynamic revision of annotations, assuming that the semantic model is ontological. If we consider that an annotation system S = <O, T, A> consists of an ontology O, a text T and a set A of annotations or links associating segments of T with entities of O, the system S must be revised whenever one of its components is updated (the text is modified, the ontology is enriched or restructured) or when inconsistencies or gaps in coverage are detected. The Master student will study the different scenarios requiring the revision of such an annotation system and propose a dynamic annotation method that integrates such a revision process. The dynamic annotation method must 1) integrate consistency criteria and coverage metrics to identify when the revision of an annotation system is necessary, 2) propose revision procedures adapted to the different scenarios and 3) control the convergence of the overall revision process. Starting with the simplest types of annotation (e.g. a text annotated with instances and concepts of an ontology), the student will propose a method for dynamic annotation. He or she will be able to rely on existing semantic annotation tools, on the expertise of RCLN team members and on real use cases to assess the contribution of this dynamic annotation. Ideally, the proposed method will be integrated directly into an existing annotation tool; it may instead be tested through simulation if integration proves too complex.
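As an illustration only, the triple S = <O, T, A> and the first requirement (deciding when a revision is needed) can be sketched in a few lines of Python. Everything here is an assumption made for the example and is not taken from the RCLN tools: the class and function names, the reduction of the ontology to a flat set of entity names, the use of character offsets for segments, and the 0.5 coverage threshold are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Annotation:
    """Hypothetical link from the text segment [start, end) to an ontology entity."""
    start: int
    end: int
    entity: str


@dataclass
class AnnotationSystem:
    """S = <O, T, A>; the ontology is simplified to a set of entity names."""
    ontology: set
    text: str
    annotations: list = field(default_factory=list)


def inconsistencies(system):
    """Annotations that no longer fit the current ontology or text."""
    return [
        a for a in system.annotations
        if a.entity not in system.ontology
        or not (0 <= a.start < a.end <= len(system.text))
    ]


def coverage(system):
    """Share of non-space characters covered by a consistent annotation."""
    bad = set(inconsistencies(system))
    covered = set()
    for a in system.annotations:
        if a not in bad:
            covered.update(range(a.start, a.end))
    chars = [i for i, c in enumerate(system.text) if not c.isspace()]
    if not chars:
        return 1.0  # nothing to annotate, so nothing is missing
    return sum(i in covered for i in chars) / len(chars)


def needs_revision(system, min_coverage=0.5):
    """Revise when an inconsistency exists or coverage drops too low."""
    return bool(inconsistencies(system)) or coverage(system) < min_coverage


# Toy usage: restructuring the ontology invalidates one annotation.
system = AnnotationSystem(
    ontology={"City", "Country"},
    text="Paris is in France",
    annotations=[Annotation(0, 5, "City"), Annotation(12, 18, "Country")],
)
before = needs_revision(system)
system.ontology.discard("Country")
after = needs_revision(system)
```

In this toy run the system is acceptable before the ontology changes and flagged for revision afterwards. A real consistency criterion would of course check far more than entity membership, for example the class hierarchy and relation constraints of the ontology.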
Description of work
The work will include several parts:
- state of the art on semantic annotation and review of existing tools;
- description, modeling and implementation of the dynamic annotation process (for the simplest types of annotations and based on existing tools and/or semantic technologies);
- analysis, test and evaluation of the proposed approach on simple but real use cases provided by the RCLN team.
In addition, and in anticipation of a PhD follow-up, the student may start to specify a richer semantic annotation method taking into account a wider range of annotation types.

Context and Practical Information
The work will be supervised by Prof. Adeline Nazarenko and Prof. François Lévy. The intern will be integrated in the RCLN team and benefit from its expertise in natural language processing, knowledge engineering and semantic web. In particular, RCLN has solid experience in semantic annotation (manual annotation [2, 3] or based on machine learning [8], formalisms and resources for annotation [9, 12]) and text-based ontology design [1]. It also has experience integrating those methods of acquisition and annotation into content analysis tools [7, 5, 11]. The intern will work at LIPN (University Paris 13 - Sorbonne Paris Cité & CNRS) where he/she will be assigned a desk. He/she will have access to local facilities and data resources. The internship is for a period of 6 months. It should start in spring 2014. It will be funded by the Labex "Empirical Foundations of Linguistics" (research strand “computational semantic analysis”). Applications should be addressed to François Lévy (francois.levy (at) lipn.univ-paris13.fr) before March 7, 2014: send a cover letter, a CV and transcripts.

References
[1] N. Aussenac-Gilles, S. Després, and S. Szulman. « The TERMINAE Method and Platform for Ontology Engineering from texts ».
In Paul Buitelaar and Philipp Cimiano, editors, Bridging the Gap between Text and Knowledge - Selected Contributions to Ontology Learning and Population from Text, pages 199–223. IOS Press, January 2008.
[2] K. Fort. Les ressources annotées, un enjeu pour l’analyse de contenu : vers une méthodologie de l’annotation manuelle de corpus. PhD thesis in computer science, Université Paris 13 – Sorbonne Paris Cité, Villetaneuse, France, 2012.
[3] K. Fort, A. Nazarenko, S. Rosset. « Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis ». In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, December 2012.
[4] C. Giuliano, A. Gliozzo. « Instance-based ontology population exploiting named-entity substitution ». In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pages 265–272, Manchester, August 2008.
[5] A. Guissé, F. Lévy, A. Nazarenko. Un moteur sémantique pour explorer des textes réglementaires. In Actes des 22èmes journées francophones d'Ingénierie des Connaissances, Chambéry, 2011.
[6] A. Kiryakov, B. Popov, I. Terziev, D. Manov, and D. Ognyanoff. « Semantic annotation, indexing, and retrieval ». Journal of Web Semantics, 2(1):49–79, 2004.
[7] F. Lévy, A. Nazarenko, A. Guissé. « Annotation, indexation et parcours de documents numériques ». Revue des Sciences et Technologies de l'Information, 13(3/2010):121–152, 2010.
[8] Y. Ma, F. Lévy, A. Nazarenko. Annotation sémantique pour des domaines spécialisés et des ontologies riches. In Actes de la 20ème conférence du Traitement Automatique du Langage Naturel (TALN 2013), pp. 464-478, 17-21 June 2013, Les Sables d’Olonne.
[9] Y. Ma, A. Nazarenko, L. Audibert. « Formal description of resources for ontology-based semantic annotation ». In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2010), Malta, May 2010. ELRA.
[10] B. Magnini, A. Pianta, O. Popescu, M. Speranza.
« Ontology population from textual mentions: Task definition and benchmark ». In Proceedings of the OLP2 workshop on Ontology Population and Learning, Sydney, Australia, 2006.
[11] A. Nazarenko, A. Guissé, F. Lévy, N. Omrane, S. Szulman. « Integrating Written Policies in Business Rule Management Systems ». In Rule-Based reasoning, Programming, and Applications, volume 6826 of Lecture Notes in Computer Science, pages 99–113, Barcelona, Spain, 2011.
[12] N. Omrane, A. Nazarenko, P. Rosina, S. Szulman, C. Westphal. « Lexicalized ontology for a business rules management platform: An automotive use case ». In Proceedings of the 5th International Symposium on Rules, International Business Rules Forum (RuleML@BRF), Ft Lauderdale, Florida, USA, November 2011.
[13] B. Popov, A. Kiryakov, D. Ognyanoff, D. Manov, A. Kirilov. « Kim – a semantic platform for information extraction and retrieval ». Natural Language Engineering, 10(3-4):375–392, 2004.
[14] V. Uren, P. Cimiano, J. Iria, S. Handschuh, M. Vargas-Vera, E. Motta, F. Ciravegna. « Semantic annotation for knowledge management: Requirements and a survey of the state of the art ». Journal of Web Semantics, 4, 2006.
From hamon at LIMSI.FR Sat Feb 8 10:47:33 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:47:33 +0100 Subject: Journee: Formation au logiciel TXM (textometrie), dates pour mars-avril 2014 Message-ID: Date: Wed, 05 Feb 2014 15:53:08 +0100 From: Serge Heiden Message-ID: <52F25054.2030106 at ens-lyon.fr> X-url: http://textometrie.ens-lyon.fr

Hello,

TXM, the open-source textometry software whose development is coordinated by the ICAR laboratory in Lyon, supports documentary (qualitative) and statistical (quantitative) analysis of textual data (including transcribed speech), for personal research corpora or for publishing corpora online: http://textometrie.ens-lyon.fr

The next TXM training workshops will take place in Lyon, at the ENS Descartes site, on the following days (8:30/9:30 to 17:30/18:00):
- Monday 7 April: Introduction to TXM;
- Tuesday 8 April: Corpus preparation and import into TXM.
(The February workshops are full.)

An "Advanced TXM" workshop may also be organized if we can gather a small group of experienced users (who already run analyses with TXM on their own data).

If you are interested in one of these sessions, you must register by sending an email to textometrie <at> ens-lyon fr (registration is required even if you answered the Doodle poll).
Full information about the workshops: https://groupes.renater.fr/wiki/txm-users/public/ateliers_txm

We hope to see you at one of these sessions,
Serge Heiden, for the TXM team.

From hamon at LIMSI.FR Sat Feb 8 10:46:34 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:46:34 +0100 Subject: Appel: LREC 2014 Workshop on Building and Using Comparable Corpora (7th BUCC) Message-ID: Date: Wed, 5 Feb 2014 14:06:23 +0100 From: "Reinhard Rapp" Message-ID: <231BF614593D4F2A923714DCB28EF473 at ASUSPC> X-url: http://comparable.limsi.fr/bucc2014/

We apologize for multiple postings
Please distribute to interested colleagues

============================================================
Final Call for Papers
7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Building Resources for Machine Translation Research
http://comparable.limsi.fr/bucc2014/
May 27, 2014
Co-located with LREC 2014
Harpa Conference Centre, Reykjavik (Iceland)
DEADLINE FOR PAPERS: February 10, 2014
https://www.softconf.com/lrec2014/BUCC2014/
*** INVITED SPEAKER ***
Chris Callison-Burch (University of Pennsylvania)
============================================================

MOTIVATION
In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons.
In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora. The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages. Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. 
However, beyond a few language pairs such as English- French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web. To exemplify such issues in a practical setting, this year's special focus will be on Building Resources for Machine Translation Research This special topic aims to address the need for: (1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and (2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora. TOPICS We solicit contributions including but not limited to the following topics: Topics related to the special theme: * Methods and tools for collecting and processing MT data, including crowdsourcing * Methods and tools for quality control * Tools for efficient annotation * Bilingual term and named entity collections * Multilingual treebanks, wordnets, propbanks, etc. * Comparable corpora with parallel units annotated * Comparable corpora for under-resourced languages and specific domains * Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc. * Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc. 
* Legal issues with collecting and redistributing data and generating derivatives Building comparable corpora: * Human translations * Automatic and semi-automatic methods * Methods to mine parallel and non-parallel corpora from the Web * Tools and criteria to evaluate the comparability of corpora * Parallel vs non-parallel corpora, monolingual corpora * Rare and minority languages, across language families * Multi-media/multi-modal comparable corpora Applications of comparable corpora: * Human translations * Language learning * Cross-language information retrieval & document categorization * Bilingual projections * Machine translation * Writing assistance Mining from comparable corpora: * Extraction of parallel segments or paraphrases from comparable corpora * Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc. IMPORTANT DATES February 10, 2014 Deadline for submission of full papers March 10, 2014 Notification of acceptance March 27, 2014 Camera-ready papers due May 27, 2014 Workshop date SUBMISSION INFORMATION Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF-file via the START workshop manager at https://www.softconf.com/lrec2014/BUCC2014/ Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages. Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings. 
Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc. JOURNAL SPECIAL ISSUE Authors of selected papers will be encouraged to submit substantially extended versions of their manuscripts to an upcoming special issue on ‘Machine Translation Using Comparable Corpora’ of the Journal of Natural Language Engineering. ORGANISERS Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France) Ahmet Aker, University of Sheffield (UK) Serge Sharoff, University of Leeds (UK) Stephan Vogel, QCRI (Qatar) Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France) FURTHER INFORMATION Pierre Zweigenbaum: pz (at) limsi (dot) fr SCIENTIFIC COMMITTEE * Ahmet Aker, University of Sheffield (UK) * Srinivas Bangalore (AT&T Labs, US) * Caroline Barrière (CRIM, Montréal, Canada) * Chris Biemann (TU Darmstadt, Germany) * Hervé Déjean (Xerox Research Centre Europe, Grenoble, France) * Kurt Eberle (Lingenio, Heidelberg, Germany) * Andreas Eisele (European Commission, Luxembourg) * Éric Gaussier (Université Joseph Fourier, Grenoble, France) * Gregory Grefenstette (INRIA, Saclay, France) * Silvia Hansen-Schirra (University of Mainz, Germany) * Hitoshi Isahara (Toyohashi University of Technology) * Kyo Kageura (University of Tokyo, Japan) * Adam Kilgarriff (Lexical Computing Ltd, UK) * Natalie Kübler (Université Paris Diderot, France) * Philippe Langlais (Université de Montréal, Canada) * Michael Mohler (Language Computer Corp., US) * 
Emmanuel Morin (Université de Nantes, France) * Dragos Stefan Munteanu (Language Weaver, Inc., US) * Lene Offersgaard (University of Copenhagen, Denmark) * Ted Pedersen (University of Minnesota, Duluth, US) * Reinhard Rapp (Université Aix-Marseille, France) * Sujith Ravi (Google, Mountain View, US) * Serge Sharoff (University of Leeds, UK) * Michel Simard (National Research Council Canada) * Richard Sproat (OGI School of Science & Technology, US) * Tim Van de Cruys (IRIT-CNRS, Toulouse, France) * Stephan Vogel (QCRI, Qatar) * Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France) * Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)
From hamon at LIMSI.FR Sat Feb 8 11:20:43 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:20:43 +0100 Subject: Seminaire: Programme Seminaire Linguistique Theorique, Universite de Bourgogne Message-ID: Date: Fri, 7 Feb 2014 16:12:55 +0100 (CET) From: Philippe Monneret Message-ID: <447312548.2154087.1391785975799.JavaMail.root at u-bourgogne.fr> 2014 SEMINAR: QUESTIONS IN THEORETICAL LINGUISTICS Philippe Monneret Université de Bourgogne / GReLiSC (EA 4178 CPTC) Droit-Lettres building, 10:00-12:00 ------------------------------------------------------------------------
Thursday 23 January (room 105 PEG) Presentation and introduction of the seminar "Analogy and similarity in the language sciences" Philippe Monneret (Université de Bourgogne)
Thursday 6 February (room 155) "Why so many modal logics?" Guillaume Coqui (Université de Bourgogne)
Thursday 13 February (Conseil Lettres room) "Remarks on determination in French" Pierre-André Buvet (Université de Paris 13)
Thursday 20 February Conference "Towards a history of phonosymbolic theories" Organized by Luca Nobile (Université de Bourgogne)
Thursday 6 March (room 155) "On the possible in linguistics" Thomas Verjans (Université de Bourgogne)
Thursday 13 March (Conseil Lettres room) "My name is PERSON: who is I, who is YOU, who is HE?" Georges Kleiber (Université de Strasbourg)
Thursday 20 March (Conseil Lettres room) "The expression of purpose in French" Gaston Gross (Université de Paris 13)
Friday 28 March (Conseil Droit room) "Philosophy of language, philosophy of linguistics, philosophy of languages" Christian Puech (Université de Paris 3)
Thursday 3 April (room 155) "On the disappearance of cuidier" Thierry Ponchon (Université de Reims, Champagne-Ardenne)
Friday 11 April (Conseil Lettres room) "An attempt at a typology of linguistic iconicity" Luca Nobile (Université de Bourgogne)
Thursday 17 April (room 155) "Thinking without language. The case of children in the pre-linguistic period" Anna Ciaunica (Mind, Language and Action Lab - University of Porto)
From hamon at LIMSI.FR Sat Feb 8 11:01:30 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:01:30 +0100 Subject: Appel: SIGDIAL 2014 Conference, June 18th-20th, Philadelphia, USA Message-ID: Date: Thu, 6 Feb 2014 09:47:00 +0000 From: Helen Hastie Message-ID: X-url: http://www.sigdial.org/workshops/conference15 *SIGDIAL 2014 CONFERENCE* *Wednesday, June 18 to Friday, June 20, 2014 * The 15th Annual Meeting of the Special Interest Group on Discourse and Dialog will be co-located with the 8th International Conference on Natural Language Generation (INLG 2014 (http://inlg2014.wordpress.com/)) in Philadelphia, PA, USA and immediately preceding ACL 2014. http://www.sigdial.org/workshops/conference15 *CALL FOR PAPERS* The 2014 SIGDIAL conference continues a series of fourteen conferences, providing a regular forum for the presentation of cutting edge research across the areas of discourse and dialog and attracting a diverse set of participants from academia and industry. The conference is sponsored by the SIGdial organization, which serves as the Special Interest Group on discourse and dialog for both ACL and ISCA. *KEYNOTE SPEAKERS* We are pleased to announce our keynote speakers will be Prof.
Steve Young from the University of Cambridge and Prof. Lillian Lee from Cornell University. *TOPICS OF INTEREST* We welcome formal, corpus-based, system-building or analytical work on discourse and dialog including but not restricted to the following themes and topics: - Discourse Processing and Dialog Systems - Corpora, Tools and Methodology - Pragmatic and/or Semantic Modeling - Dimensions of Interaction - Open Domain Dialog - Style, Voice and Personality in Spoken Dialog and Written Text - Applications of Dialog and Discourse Processing Technology - Novel Methods for Generation Within Dialog for a joint special session with INLG *SPECIAL SESSIONS* There will be one special session co-located with INLG on the morning of June 20th. There will be a second special session on the Dialogue State Tracking Challenge (DSTC) (http://camdial.org/~mh521/dstc/). Proposals for additional special sessions are also still welcome. Special session submissions will undergo the regular SIGdial review process. *IMPORTANT DATES* *Special Session Proposal Deadline: * Sunday, 9 February 2014 (23:59, GMT-11) *Special Session Notification: * Monday, 17 February 2014 *Long, Short and Demonstration* *Paper Submission Deadline: * Sunday, 9 March 2014 (23:59, GMT-11) *Paper Notification: * Friday, 18 April 2014 *Final Paper Due* - For papers accepted subject to receiving mentoring Wednesday, 14 May 2014 - For accepted papers Friday, 23 May 2014 *Conference * Wednesday, June 18, 2014 to Friday, June 20, 2014 *SUBMISSIONS* *Special Session Proposals* The SIGdial organizers welcome the submission of special session proposals. A SIGDIAL special session is the length of a regular session at the conference; may be organized as a poster session, a poster session with panel discussion, or an oral presentation session; and will be held on the last day of the conference. Special sessions may, at the discretion of the SIGdial organizers, be held as parallel sessions. 
Those wishing to organize a special session should prepare a two-page proposal containing: a summary of the topic of the special session; a list of organizers and sponsors; a list of people who may submit and participate; and a requested format (poster/panel/oral session). These proposals should be sent to *conference[a]sigdial.org (http://sigdial.org)* by the special session proposal deadline. Special session proposals will be reviewed jointly by the general and program co-chairs. *Papers* The program committee welcomes the submission of long papers, short papers, and demonstration descriptions. All accepted submissions will be published in the conference proceedings. Long papers may, at the discretion of the technical program committee, be accepted for oral or poster presentation. They must be no longer than 8 pages, including title, content, and examples. Two additional pages are allowed for references and appendices, which may include extended example discourses or dialogs, algorithms, graphical representations, etc. Short papers will be presented as posters. They should be no longer than 4 pages, including title and content. One additional page is allowed for references and appendices. Demonstration papers should be no longer than 3 pages, including references. A separate one-page document should be provided to the program co-chairs for demonstration descriptions, specifying furniture and equipment needed for the demo. Authors of a submission may designate their paper to be considered for a SIGDIAL special session, which would highlight a particular area or topic. All papers will undergo regular peer review. Papers that have been or will be submitted to other meetings or publications must provide this information (see submission format). A paper accepted for presentation at SIGDIAL 2014 must not have been presented at any other meeting with publicly available proceedings. 
Any questions regarding submissions can be sent to the program co-chairs at program-chairs[at]sigdial.org. Authors are encouraged to submit additional supportive material such as video clips or sound clips and examples of available resources for review purposes. Submission is electronic using paper submission software. *FORMAT* All long, short, and demonstration submissions should follow the two-column ACL 2014 format. We strongly recommend the use of ACL LaTeX style files or Microsoft Word style files tailored for the ACL 2014 conference. Submissions must conform to the official ACL 2014 style guidelines ( http://www.cs.jhu.edu/ACL2014/CallforPapers.htm), and they must be electronic in PDF. As in most previous years, submissions will not be anonymous. *MENTORING SERVICE* For several years, the SIGDIAL conference has offered a mentoring service. Submissions with innovative core ideas that may need language (English) or organizational assistance will be flagged for "mentoring" and conditionally accepted with recommendation to revise with a mentor. An experienced mentor who has previously published in the SIGDIAL venue will then help the authors of these flagged papers prepare their submissions for publication. Any questions about the mentoring service can be addressed to the mentoring chair at mentoring[at]sigdial.org. *STUDENT SUPPORT* SIGdial also offers a limited number of scholarships for students presenting a paper accepted to the conference. Application materials will be posted at the conference website. *BEST PAPER AWARDS* In order to recognize significant advancements in dialog and discourse science and technology, SIGDIAL will recognize two best paper awards. A selection committee consisting of prominent researchers in the fields of interest will select the recipients of the awards. *SPONSORSHIP* SIGDIAL offers a number of opportunities for sponsors. For more information, email the conference organizers at sponsor-chair[at]sigdial.org . 
*DIALOG AND DISCOURSE* SIGDIAL authors are encouraged to submit their research to the journal Dialog and Discourse, which is endorsed by SIGdial. *ORGANIZING COMMITTEE* General Co-Chairs Kallirroi Georgila, University of Southern California, USA Matthew Stone, Rutgers, The State University of New Jersey, USA Technical Program Co-Chairs Helen Hastie, Heriot-Watt University, Edinburgh, UK Ani Nenkova, University of Pennsylvania, USA Mentoring Chair Svetlana Stoyanchev, AT&T Research Labs, USA Local Chair Keelan Evanini, Educational Testing Service, USA Sponsorships Chair Giuseppe Di Fabbrizio, Amazon.com, USA SIGdial President Amanda Stent, Yahoo! Labs, USA SIGdial Vice President Jason Williams, Microsoft Research, USA SIGdial Secretary/Treasurer Kristiina Jokinen, University of Helsinki, Finland From hamon at LIMSI.FR Sat Feb 8 11:16:21 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:16:21 +0100 Subject: Conf: WWW'14, Tutorial on Concept-Level Sentiment Analysis, Seoul, Korea Message-ID: Date: Fri, 7 Feb 2014 17:43:30 +0800 From: Erik Cambria Message-ID: <6BBCDA75-D5AB-4402-BC54-3E4965361F62 at nus.edu.sg> Apologies for cross-posting, Participants are invited to the WWW’14 tutorial on Concept-Level Sentiment Analysis, which will be held within the World Wide Web conference this April in Seoul, Korea. The tutorial aims to provide its participants means to efficiently design models, techniques, tools, and services for concept-level sentiment analysis and their commercial realizations. The tutorial draws on insights resulting from the recent IEEE Intelligent Systems special issues on Concept-Level Opinion and Sentiment Analysis and the IEEE CIM special issue on Computational Intelligence for Natural Language Processing. The tutorial includes a hands-on session to illustrate how to build a concept-level opinion-mining engine step-by-step, from semantic parsing to concept-level reasoning. 
BACKGROUND AND MOTIVATIONS As the Web rapidly evolves, Web users are evolving with it. In an era of social connectedness, people are becoming increasingly enthusiastic about interacting, sharing, and collaborating through social networks, online communities, blogs, Wikis, and other online collaborative media. In recent years, this collective intelligence has spread to many different areas, with particular focus on fields related to everyday life such as commerce, tourism, education, and health, causing the size of the Social Web to expand exponentially. The distillation of knowledge from such a large amount of unstructured information, however, is an extremely difficult task, as the contents of today’s Web are perfectly suitable for human consumption, but remain hardly accessible to machines. The opportunity to capture the opinions of the general public about social events, political movements, company strategies, marketing campaigns, and product preferences has raised growing interest both within the scientific community, leading to many exciting open challenges, and in the business world, due to the remarkable benefits to be had from marketing and financial market prediction. Mining opinions and sentiments from natural language, however, is an extremely difficult task as it involves a deep understanding of most of the explicit and implicit, regular and irregular, syntactical and semantic rules proper to a language. Existing approaches mainly rely on parts of text in which opinions and sentiments are explicitly expressed such as polarity terms, affect words and their co-occurrence frequencies. However, opinions and sentiments are often conveyed implicitly through latent semantics, which makes purely syntactical approaches ineffective. 
Concept-level sentiment analysis focuses on a semantic analysis of text through the use of web ontologies or semantic networks, which allow the aggregation of conceptual and affective information associated with natural language opinions. By relying on external knowledge, such approaches step away from blind use of keywords and word co-occurrence count, but rather rely on the implicit features associated with natural language concepts. Unlike purely syntactical techniques, concept-based approaches are able to detect also sentiments that are expressed in a subtle manner, e.g., through the analysis of concepts that do not explicitly convey any emotion, but which are implicitly linked to other concepts that do so. The bag-of-concepts model can represent semantics associated with natural language much better than bags-of-words. In the bag-of-words model, in fact, a concept such as cloud computing would be split into two separate words, disrupting the semantics of the input sentence (in which, for example, the word cloud could wrongly activate concepts related to weather). The analysis at concept-level allows for the inference of semantic and affective information associated with natural language text and, hence, enables comparative fine-grained feature-based sentiment analysis. Rather than gathering isolated opinions about a whole item (e.g., iPhone5), users are generally more interested in comparing different products according to specific features (e.g., iPhone5’s vs Galaxy S3’s touchscreen), or even sub-features (e.g., fragility of iPhone5’s vs Galaxy S3’s touchscreen). In this context, the construction of comprehensive common and common-sense knowledge bases is key for feature-spotting and polarity detection, respectively. 
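The contrast between the two representations can be sketched in a few lines of Python. This is only an illustrative sketch, not the tutorial's own code: the `CONCEPTS` set, the function names and the greedy longest-match strategy are all assumptions made for the example, and a real system would draw its concepts from a knowledge base rather than a hand-written set.

```python
import re

# Hypothetical mini-lexicon of multi-word concepts (assumption for this sketch).
CONCEPTS = {"cloud computing", "touch screen"}

def bag_of_words(text):
    """Split text into lowercase single-word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def bag_of_concepts(text, concepts=CONCEPTS, max_len=3):
    """Greedy longest match: keep known multi-word concepts as one token."""
    words = bag_of_words(text)
    out, i = [], 0
    while i < len(words):
        # Try the longest candidate first, falling back to a single word.
        for n in range(min(max_len, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + n])
            if n == 1 or candidate in concepts:
                out.append(candidate)
                i += n
                break
    return out

sentence = "Cloud computing is cheap"
print(bag_of_words(sentence))     # "cloud" and "computing" become separate tokens
print(bag_of_concepts(sentence))  # "cloud computing" survives as one concept
```

With the bag-of-words split, the isolated token "cloud" is free to match weather-related entries; keeping "cloud computing" as one unit avoids that.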
Common-sense, in particular, is necessary to properly deconstruct natural language text into sentiments – for example, to appraise the concept small room as negative for a hotel review and small queue as positive for a post office, or the concept go read the book as positive for a book review but negative for a movie review. TUTORIAL PROGRAM - Introduction (5 mins) - New Avenues in Sentiment Analysis Research - From Heuristics to Discourse Structure (5 mins) - From Coarse to Fine-Grained Analysis (5 mins) - From Keywords to Concepts (10 mins) - Concept-Level Models - Knowledge acquisition models (10 mins) - Emotion categorization models (10 mins) - Vector space models (10 mins) - Concept-Level Techniques - Analogical reasoning (10 mins) - Parallel analogy (10 mins) - Spreading activation (10 mins) - Concept-Level Tools - Sentiment resources (15 mins) - Common knowledge repositories (15 mins) - Aspect mining and polarity detection (10 mins) - Building a Concept-Level Opinion-Mining Engine - Semantic parsing (15 mins) - Sentic API (15 mins) - Application Samples (20 mins) - Conclusion (5 mins) IMPACT AND RELEVANCE The World Wide Web Conference is a global event bringing together key researchers, innovators, decision-makers, technologists, and business experts trying to make meaning out of Web data. Within this research and business area, opinion mining and sentiment analysis have become increasingly important subtasks in recent years. However, there are still many challenges, including social information understanding and integration, that need to be addressed. For these reasons, a tutorial on concept-level sentiment analysis is strongly relevant to WWW’14. TARGET AUDIENCE AND PREREQUISITES The target audience includes researchers and professionals in the fields of sentiment analysis, Web data mining, and related areas. 
The tutorial also aims to attract researchers from the industry community as it covers research efforts for the development of applications in fields such as commerce, tourism, education, and health. The audience is expected to have basic computer science skills, but psychologists and sociologists are also very welcome. The tutorial not only covers state-of-the-art approaches to concept-level sentiment analysis, but also provides information about techniques and tools to be used for practical opinion mining. ABOUT THE TUTOR Erik Cambria received his BEng and MEng with honors in Electronic Engineering from the University of Genova, in 2005 and 2008 respectively. In 2011, he was awarded a PhD in Computing Science and Mathematics, following the completion of an industrial Cooperative Awards in Science and Engineering (CASE) research project, funded by the UK Engineering and Physical Sciences Research Council (EPSRC), which was born from the collaboration between the University of Stirling and the MIT Media Laboratory. Today, Erik is the lead investigator of a MINDEF-funded project on Commonsense Knowledge Representation & Reasoning at the National University of Singapore (Temasek Laboratories) and an associate researcher at the MIT Media Laboratory (Synthetic Intelligence Project). His interests include AI, Semantic Web, KR, NLP, opinion mining and sentiment analysis, affective and cognitive modeling, intention awareness, HCI, and e-health. Erik is also chair of several international conferences, e.g., Extreme Learning Machines (ELM), and workshop series, e.g., ICDM SENTIRE. He is on the editorial board of Springer Cognitive Computation and he is a guest editor of many other leading AI journals. Erik is also a fellow of the Brain Sciences Foundation, the Chinese Academy of Sciences, National Taiwan University, Microsoft Research Asia, and HP Labs India. 
From hamon at LIMSI.FR Sat Feb 8 10:51:47 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:51:47 +0100 Subject: Stage: Stage de Master 2, Analyse des citations et des enumerations dans les fora de sante, Toulouse Message-ID: Date: Wed, 05 Feb 2014 16:43:23 +0100 From: Mustapha Mojahid Message-ID: <52F25C1B.2050808 at irit.fr> X-url: http://www.msh-m.fr/programmes/programmes-2013/parlons-de-nous Title: Analysis of quotations and enumerations in health forums. *Location:* IRIT laboratory (Institut de Recherche en Informatique de Toulouse), Université Paul Sabatier, ELIPSE team (Etude de L'Interaction Personne SystèmE).
Context and research question This internship topic is part of the national project "Parlons de nous" (http://www.msh-m.fr/programmes/programmes-2013/parlons-de-nous), which studies health forums in an attempt to answer the question "what are patients thinking about?", and of the interMSH project that follows on from it. Health forums are places where patients exchange many points of view and pieces of advice, raise questions and discuss, in a context very different from one-to-one consultations with health professionals, which are constrained in time and place. The relative anonymity of the exchanges, the recurring involvement of certain participants in the forums, and the questions debated can reveal points of view, knowledge or misunderstanding of medical information, and alerts coming from the general public. In this highly subjective context, characterizing and understanding perceptions in forum discussion threads is difficult, but also particularly interesting and instructive with a view to improving public health programmes.
*Project objective* One of the project's objectives is to develop a platform to help researchers (linguists, sociologists and psychologists) and physicians observe certain behaviours in discussion threads in health forums. We started this platform as part of a PhD project in computer science launched this year on the topic of context and medical information. The platform aims to offer an interface for representing and visualizing forum discussion threads (or excerpts from them) schematically, through features and criteria chosen by the researchers. We thereby hope to associate certain discussion patterns with an informational quality of the thread under study (for example, a discussion that drifts from the initial topic and narrows down to two regular participants may become an off-topic aside). The criteria currently taken into account are contextual (user profiles (age, sex), time, number of contributions, length of exchanges, micro-exchanges...). We wish to explore other criteria that take linguistic processing of the discussions into account, in order to obtain a set of views on the discussions. We wish to assess whether this information supports our hypothesis on the characterization of discussion threads.
*Internship objective* This internship has two objectives: 1. Explore linguistic avenues identified as promising, such as - studying the use of quotations: forum users quote and reply to one another a great deal in the course of discussions, and visualizing these interconnections within a thread might give us a broader understanding of it; - identifying enumerations in discussion threads (which discourse cues and layout cues can be detected, and what can be inferred from them?); - analysing, using existing medical terminologies, the semantic proximity between the different posts in a thread. 2. Devise strategies for combining text-architecture cues with context-related cues (user profile, topic addressed, status of the message in the discussion, etc.). This work will be carried out in collaboration with the PhD student. With this platform, by manipulating gauges built from these criteria, we want to observe and characterize how answers are constructed. Are answers provided by regulars or by novices? Can certain discussions be set aside (vulgarity, scattering...), or, conversely, can informative or even crucial content be anticipated? What twists occur? What refocusing?... We will focus on a subset of these needs. The envisaged "configurable" platform should allow researchers and physicians to observe stereotypical behaviours and "discussion histories".
*Prospects* Two possible PhD theses following on from this master's topic (1. on the study of enumerations in history and geography school textbooks; 2. on improving textual accessibility for blind people).
*Internship details* Supervisors: Lydia-Mai Ho-Dac, CLLE-ERSS, Université Toulouse le Mirail; Nathalie Souf, IRIT-ELIPSE, Université Paul Sabatier and ISIS Castres; Mustapha Mojahid, IRIT-ELIPSE, Université Paul Sabatier. Duration: 5-6 months. Pay: as set by regulations, i.e. 436.05 EUR per month. Start: from March-April 2014.
*Candidate profile* The candidate must be enrolled in a second-year master's programme (Master 2) in natural language processing.
*Required skills* - skills in natural language processing and/or corpus linguistics. - basic computing skills and, ideally, command of corpus analysis tools and/or programming languages such as Perl and Python.
*How to apply* Send a CV (detailing the courses and grades for both years of the master's) and a cover letter to: Mustapha.Mojahid at irit.fr
From hamon at LIMSI.FR Sat Feb 8 11:27:00 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 12:27:00 +0100 Subject: These: Inga Gheorghita, Construction automatique de hierarchies semantiques a partir du TLFi Message-ID: Date: Fri, 07 Feb 2014 23:24:56 +0100 From: Inga Gheorghita Message-ID: Hello, I am pleased to invite you to the defence of my doctoral thesis in computational linguistics, entitled: "Automatic construction of semantic hierarchies from the Trésor de la langue française informatisé (TLFi): application to image indexing and retrieval". The thesis will be presented and defended on Monday 17 February 2014 at 9:30 at the Université de Lorraine, Campus Lettres et Sciences Humaines et Sociales, CNRS-ATILF building, room Imbs. The defence will be followed by the traditional thesis reception, to which you are all warmly invited.
======================= Jury members: ======================= Béatrice DAILLE, Professor at the Université de Nantes (reviewer) Brigitte GRAU, Professor at ENSIIE (reviewer) Cyril MARCH, Engineer, Managing Director of Xilopix (invited member) Jean-Marie PIERREL, Professor at the Université de Lorraine (thesis supervisor) Alain POLGUÈRE, Professor at the Université de Lorraine (examiner) Yannick TOUSSAINT, Research Scientist (Chargé de Recherche HDR) at INRIA (examiner)
========== Abstract: ========== The main objective of this thesis is to show that lexical information drawn from a language dictionary, such as the Trésor de la langue française informatisé (TLFi), can improve image indexing and retrieval. The problem with using such a resource is that it is not sufficiently formalized to be exploited directly in such an application domain. To solve this problem, we first propose an approach for automatically constructing semantic hierarchies from the TLFi. After defining a quantitative (measurable) and comparable characteristic of the nouns appearing in lexicographic definitions, through a weighting formula that selects the noun of maximum weight as the hypernym of a given TLFi lexeme, we propose an algorithm for the automatic construction of semantic hierarchies for the lexemes of the TLFi's vocables.
Once our approach has been validated through manual evaluations, we then show that the semantic hierarchies obtained from the TLFi can be used to enrich a manually built thesaurus as well as to automatically index images from their associated textual descriptions. We also demonstrate that exploiting such a resource in image retrieval improves retrieval precision by structuring the results according to the domains to which the concepts in the search query may refer. Building a prototype allowed us to evaluate and validate the proposed approaches.
========== Keywords: ========== semantic hierarchies, lexical resources, weighting, language dictionary, TLFi, image indexing and retrieval, thesaurus Looking forward to seeing you there, Inga Gheorghita
From hamon at LIMSI.FR Sat Feb 8 10:53:09 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:53:09 +0100 Subject: Appel: ACFAS Colloque 635, Langues Naturelles, Informatique et Sciences Cognitives Message-ID: Date: Wed, 5 Feb 2014 12:04:51 -0500 From: Fatiha Sadat Message-ID: X-url: http://www.acfas.ca/evenements/congres/programme_preliminaire/82/600/635/C Call for papers - SUBMISSION DEADLINE EXTENDED TO 14 February 2014
This colloquium deals with computational linguistics, and more particularly natural language processing (NLP; French: TALN/TAL), and its interaction with the cognitive sciences. The field lies at the crossroads of linguistics, artificial intelligence and the cognitive sciences in their research on natural language. The colloquium covers the various applications of NLP in monolingual, bilingual or multilingual settings, such as the development of the methodologies needed to build and enrich linguistic resources. All aspects of natural language processing are considered, in particular theoretical work aiming to describe linguistic phenomena through computational models; the development of software that assists linguists in their research, such as corpus studies; and the development of software and resources for using, processing, organizing and preserving written or spoken linguistic data. Presenters will be asked to highlight the aspects related to natural language processing, whatever the topic addressed, as well as their relevance to the cognitive sciences. More specifically, we propose the following topics in the first instance: - computational models of natural languages, - large-scale semantic networks, - cognitive and psychological models of natural languages, - evolution of natural languages, - discourse processing, - pragmatics, cognitive and social processes of languages, - automatic summarization and information extraction, - dialogue interfaces and systems, - machine translation, - processing of multilingual resources, - construction and enrichment of NLP tools and resources, - linguistic corpora, - data mining.
Each submitted proposal must include the speaker's last name and first name, position, affiliation and email address. In addition, an abstract of at most 1500 characters (including spaces), with a title of no more than 180 characters (including spaces), is required. Proposals must be sent as a Word or RTF document.

*SCHEDULE*
Submission deadline: 14 February 2014 (extended)
Notification to authors: 25 February 2014
Colloquium date: 13 May 2014

*SUBMISSION PROCEDURE*
Papers are to be written in French and must be submitted through the free conference system EasyChair at the following link: https://www.easychair.org/conferences/?conf=lnisc2014

Organizers:
Fatiha SADAT *UQAM - Université du Québec à Montréal*,
Alexandre BLONDIN MASSÉ *UQAC - Université du Québec à Chicoutimi*,
Stevan HARNAD *UQAM - Université du Québec à Montréal*

This call is available at: http://www.acfas.ca/evenements/congres/programme_preliminaire/82/600/635/C

From hamon at LIMSI.FR Tue Feb 11 20:50:18 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:50:18 +0100
Subject: Appel: YSIP
Message-ID:

Date: Sat, 08 Feb 2014 17:45:54 +0100
From: Bertram Fronhöfer
Message-ID: <52F65F42.8080406 at tu-dresden.de>
X-url: http://infocom.iitit.ru/
X-url: http://ysip.computational-logic.org/

The Institute for Information Processing and Telecommunication of the North-Caucasus Federal University and the International Center for Computational Logic of the Technische Universität Dresden are jointly organizing the Young Scientists' International Workshop on Trends in Information Processing (YSIP) at the North-Caucasus Federal University in Stavropol from April 22 until April 25, as part of the International Conference on Infocommunicational Technologies in Science, Production and Education (INFOCOM-6).

We invite young scientists (typically master or PhD students) to submit new scientific results (as obtained, for example, in their bachelor thesis, project work, master thesis, or PhD project) in all areas of Information Processing.
In particular, papers are welcome in, but are not limited to, the following areas:

* Semantic Methods
* Semantic Web
* Languages for Semantic Information Processing
* Applications of Semantic Information Processing
* Formal Concept Analysis
* Big Data
* Information Retrieval
* Question-Answering Systems
* Information Security
* Problem Solving
* Knowledge Representation and Reasoning
* Common Sense Reasoning
* Machine Learning
* Pattern Recognition

Important Dates:
28 February 2014: Deadline of Submission
15 March 2014: Information about Acceptance or Rejection
20 March 2014: Registration
31 March 2014: Submission of Final Version
22-25 April 2014: Workshop

For more information see: http://ysip.computational-logic.org/

From hamon at LIMSI.FR Tue Feb 11 20:41:48 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:41:48 +0100
Subject: Appel: TKE 2014, extended deadline
Message-ID:

Date: Sat, 8 Feb 2014 15:15:44 +0100
From: Laurent Romary
Message-Id:
X-url: http://tke2014.coreon.com

+++ Extended deadline +++

TKE 2014 Ontology, terminology & text mining: Call for papers and workshop proposals

The 11th international conference on Terminology and Knowledge Engineering, TKE 2014, is organised by GTW and DIN Deutsches Institut für Normung e. V. in cooperation with Inria, Coreon, Cologne University of Applied Sciences and Copenhagen Business School.
The theme of TKE 2014 will be ‘Ontology, terminology & text mining’.

Venue: DIN Deutsches Institut für Normung e. V., Am DIN-Platz, Burggrafenstr. 6, 10787 Berlin

Dates:
- Main conference: 19th-20th June 2014
- Post-conference workshops: 21st June 2014

See more information on the conference web site: http://tke2014.coreon.com.

Important dates:
[NEW] 1st March 2014: Submission of papers and workshop proposals
31st March 2014: Notification of acceptance of papers and workshop proposals
15th May 2014: Final versions for the proceedings

Submissions should be anonymous and should be made on the web site: http://tke2014.sciencesconf.org. See information on format for papers on http://tke2014.coreon.com.

From hamon at LIMSI.FR Tue Feb 11 20:54:36 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:54:36 +0100
Subject: Appel: Workshop Affect, Compagnon Artificiel, Interaction (WACAI) 2014
Message-ID:

Date: Mon, 10 Feb 2014 14:33:10 +0100
From: Magalie Ochs
Message-ID: <52F8D516.8050508 at telecom-paristech.fr>
X-url: http://wacai14.litislab.fr/

*Call for contributions: French-language Workshop Affect, Compagnon Artificiel, Interaction (WACAI) 2014, 30 June and 1 July in Rouen*
http://wacai14.litislab.fr/

The aim of the WACAI 2014 workshop is to bring together ongoing research and development on /embodied conversational agents (Agents Conversationnels Animés, ACA)/ and /social robots/. The meeting focuses in particular on the study, modelling, development and evaluation of the interaction between interactive systems and their partners (human or artificial). At the interface between the engineering sciences and the humanities and social sciences, WACAI's ambition is to grasp person-system interaction in all its complexity (biological, linguistic, social, cultural and emotional).

Following the previous biennial editions of the WACA workshop, held successively in Grenoble (2005), Toulouse (2006), Paris (2008) and Lille (2010), and of WACAI in Grenoble (2012), this new edition will take place in Rouen on 30 June and 1 July 2014.

Three kinds of contributions (in French or in English) are expected:
- scientific papers (max. 6 pages);
- surveys or state-of-the-art reviews (max. 8 pages), in particular on the links between the shared issues and the specificities of the ACA and robotics communities;
- descriptions of ongoing implementations or experiments (2 pages);
in the areas of /Affective Computing/, /Social Signal Processing/, /Embodied Conversational Agents/ and /Social Robots/.
*Important dates*
Paper submission: 23 April 2014
Notification to authors: 14 May 2014
Final version: 28 May 2014

From hamon at LIMSI.FR Tue Feb 11 20:59:29 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:59:29 +0100
Subject: Appel: ESWC 2014, Challenge Semantic Publishing
Message-ID:

Date: Mon, 10 Feb 2014 15:22:49 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20140210142309.BD95211973C at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemPub

** apologies for cross-posting **

==== Second Call for Challenge: Semantic Publishing ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemPub
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemPub

** NEWS: ** an open in-use task has been added!
11th Extended Semantic Web Conference (ESWC) 2014
Dates: May 25 - 29, 2014
Venue: Anissaras, Crete, Greece
Hashtag: #eswc2014
Feed: @eswc_conf
Site: http://2014.eswc-conferences.org
General Chair: Valentina Presutti (STLab, ISTC-CNR, IT)
Challenge Coordinator: Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)
Challenge Chairs:
- Angelo Di Iorio (Department of Computer Science and Engineering, University of Bologna, IT)
- Christoph Lange (Enterprise Information Systems, University of Bonn / Fraunhofer IAIS, DE)

MOTIVATION AND OBJECTIVES

Scholarly publishing is increasingly enabling a new wave of applications that better support researchers in disseminating, exploiting and evaluating their results. The potential of publishing scientific papers enriched with semantic information is huge and raises interesting and challenging issues. Semantic Web technologies play a central role in this context, as they can help publishers to make scientific results available in an open format that the whole research community can benefit from.

The Semantic Publishing Challenge 2014 is intended to be the first in a series of events at ESWC for producing and exploiting semantic publishing data. The main focus this year is on extracting information and using this information to assess the quality of scientific productions. Linked open datasets about scientific production exist (e.g. DBLP), but they usually cover only basic bibliographic information, which is not sufficient to assess quality. Quality-related information is often hidden and not yet available as LOD.

There is also a growing interest in alternative forms of publishing scientific data as (semantic) datasets that can be more easily shared, linked to each other, and reasoned on. Alternative metrics for scientific impact are also gaining relevance. We are seeking the most innovative and impactful applications in these emerging contexts.

TARGET AUDIENCE

The Challenge is open to everyone from industry and academia.
TASKS

The Challenge includes three tasks. Participants can participate in as many tasks as they like.

= Extraction Tasks =

We ask challengers to automatically annotate a set of multi-format and multi-source input documents and to produce a Linked Open Dataset that fully describes these documents, their context, and relevant parts of their content. The evaluation will consist of running a set of queries against the produced dataset to assess its correctness and completeness. The input dataset will be split into two parts: a training/testing part and an evaluation part, which will be disclosed a few days before the submission deadline. Participants will be asked to run their tool on the evaluation dataset and to produce the final Linked Open Dataset.

== Task 1: Extraction and assessment of workshop proceedings information ==

Participants are required to extract information from a set of HTML tables of contents of selected computer science workshop proceedings published with the CEUR-WS.org open access service. These tables of contents partly include microformat and RDFa annotations but are not necessarily valid HTML. The extracted information is expected to answer queries about the quality of these workshops, for instance by measuring their growth, longevity, connection with other events, and distribution of papers and authors.

== Task 2: Extraction and characterization of citations ==

Participants are required to extract information about the citations in scientific journals and their relevance. Input documents are in XML JATS and TaxPub, an official extension of JATS customized for taxonomic treatments, and are selected from the PubMedCentral Open Access Subset and the Pensoft Biodiversity Data Journal and ZooKeys archive. The extracted information is expected to be used for assessing the value of citations, for instance by considering their position in the paper, their co-location with other citations or their purpose.
== In-use Task 3: Semantic technologies in improving scientific production == Participants are asked to submit demos that showcase the potential of Semantic Web technology for enhancing and assessing the quality of scientific production. The task has a completely open structure and is, in particular, independent from tasks 1 and 2: participants are free to decide which tool to show and which dataset to use. The evaluation will be different from other tasks and will consist of two phases: after a first round of review, a number of submissions will be invited to demo their work at ESWC. The final decision will be taken at the Conference by a jury formed of PC members present at the event and other invited experts. Further details are available at: http://challenges.2014.eswc-conferences.org/index.php/SemPub/Task3 EVALUATION = Extraction Tasks 1 and 2 = Participants will be requested to submit the LOD that their tool produces from the evaluation dataset, as well as a paper that describes their approach. They will also be given a set of queries in natural language form and will be asked to translate those queries into a SPARQL form that works on their LOD. The results of the queries on the produced LOD will be compared with the expected output, and precision and recall will be measured to identify the best performing approach. Separately, the most original approach will be assigned by the Program Committee. = In-use Task 3 = Participants are required to submit a paper description as for tasks 1 and 2 and a demo version of the tool (open source appreciated but not mandatory). The evaluation will consist of two phases: after a first round of review, a number of submissions will be invited to demo their work at ESWC. The final decision will be taken at the Conference by a jury formed of PC members present at the event and other invited experts. 
The winner will be selected according to its potential impact, originality, breakthrough, the quality of the demo, and the appropriateness for ESWC. Further details about the evaluation are provided on the challenge wiki. FEEDBACK AND DISCUSSION A discussion group is open for participants to ask questions and to receive updates about the challenge (see link at bottom). Participants are invited to subscribe to this group as soon as possible and to communicate their intention to participate. They are also invited to use this channel to discuss problems in the input dataset and to suggest changes. JUDGING AND PRIZES The Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work. Submissions accepted for presentation will receive constructive reviews from the Program Committee, they will be included in the Springer LNCS post-proceedings of ESWC, and they will also have a presentation slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a revised and extended paper to a dedicated Semantic Web Journal special issue. Five winners will be selected. For each of Tasks 1 and 2 we will select: * best performing tool, given to the paper which will get the highest score in the evaluation * most original approach, selected by the Challenge Committee with the reviewing process The winner of Task 3 will be selected by the jury according to its potential impact, originality, breakthrough, the quality of the demo, and the appropriateness for ESWC. Winners will be selected only for tasks with at least 3 participants. In any case all submissions will be reviewed and, if accepted, published in ESWC post-proceedings. An amount of 700 Euro has already been secured for the final prize. We are currently working on securing further funding. 
HOW TO PARTICIPATE Participants are required to submit: * Abstract: no more than 200 words. * Description: It should explain the details of the automated annotation system, including why the system is innovative, how it uses Semantic Web technology, what features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. An outlook towards how the data could be consumed is appreciated but not strictly required. Papers must be submitted in PDF format, following the style of the Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length. Submissions for task 1 and task 2 also have to include: * The Linked Open Dataset produced by the tool on the evaluation dataset (as a file or as a URL, in Turtle or RDF/XML). * A set of SPARQL queries that work on that LOD and correspond to the natural language queries provided as input * Participants will also be asked to submit their tool (source and/or binaries, or a link these can be downloaded from, or a web service URL) for verification purposes. Submissions for the in-use task 3 have to include: * a demo version of the tool. The demo must be made available along with the paper submission but participants are allowed to refine it until the presentation at ESWC-14. All papers submissions should be provided via EasyChair: https://www.easychair.org/conferences/?conf=eswc2014-challenges MAILING LIST We invite the potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge. 
https://lists.sti2.org/mailman/listinfo/eswc2014-sempub-challenge IMPORTANT DATES * December 3, 2013: Publication of the full description of the extraction tasks 1 and 2, rules and queries; publication of the training/testing dataset * January 31, 2014, 23:59 CET: Deadline for making remarks to the task 1 and 2 training/testing datasets * February 5, 2014: Publication of the final task 1 and 2 training/testing datasets * March 7, 2014, 23:59 CET: Abstract submission * March 11, 2014: Publication of the task 1 and 2 evaluation dataset * March 14, 2014, 23:59 CET: Submission due * April 9, 2014, 23:59 CET: Notification of acceptance * May 27-29, 2014: Demo at ESWC-14, and winner selection PROGRAM COMMITTEE Soren Auer (University of Bonn / Fraunhofer IAIS, DE) (supervisor) Chris Bizer (University of Mannheim, DE) Sarven Capadisli (University of Leipzig, DE) Alexander Constantin (University of Manchester, UK) Jeremy Debattista (University of Bonn / Fraunhofer IAIS, DE) Alexander Garcia Castro (Florida State University, US) Leyla Jael Garcia Castro (Bundeswehr University of Munich, DE) Paul Groth (VU University of Amsterdam, Netherlands) Rinke Hoekstra (VU University of Amsterdam, Netherlands) Aidan Hogan (DCC, Universidad de Chile) Evangelos Milios (Dalhousie University, CA) Lyubomir Penev (Pensoft Publishers, BG) Robert Stevens (University of Manchester, UK) Jun Zhao (Lancaster University, UK) We are inviting further members. 
From hamon at LIMSI.FR Tue Feb 11 21:11:22 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:11:22 +0100
Subject: Appel: ECML PKDD 2014 (Reminder)
Message-ID:

Date: Mon, 10 Feb 2014 21:42:19 +0100 (CET)
From: Amedeo Napoli
Message-ID: <1665645114.3459231.1392064939712.JavaMail.root at loria.fr>
X-url: http://www.ecmlpkdd2014.org/

Calls -- ECML PKDD 2014 (Reminder)
Apologize for multiple receptions

-----------------------------------
ECML PKDD 2014
September 15-19 2014, Nancy France
http://www.ecmlpkdd2014.org/
-----------------------------------

-----
CALLS
-----

Call For Workshops (workshops at ecmlpkdd2014.org)
  Workshop proposal deadline: 2 March 2014
  Workshop acceptance notification: 21 March 2014

Call For Tutorials (tutorials at ecmlpkdd2014.org)
  Tutorial proposal deadline: 7 March 2014
  Tutorial acceptance notification: 28 March 2014

Call For Papers (pcchairs at ecmlpkdd2014.org)
  Abstract submission deadline: 12 April 2014
  Paper submission deadline: 16 April 2014

Call For Demos (demos at ecmlpkdd2014.org)
  Demo submission deadline: 16 May 2014
  Notification of acceptance: 15 June 2014

Call For Nectar Track (nectar at ecmlpkdd2014.org)
  Submission deadline: 14 June 2014
  Notifications of acceptance: 28 June 2014

---------------------------------------------------------------------------

From hamon at LIMSI.FR Tue Feb 11 21:16:27 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:16:27 +0100
Subject: Appel: COLING 2014 Update, Tutorials & Workshops Announced, Paper Submission
Message-ID:
Date: Tue, 11 Feb 2014 12:25:34 +0000 From: John Judge Message-ID: <52FA16BE.5000801 at computing.dcu.ie> X-url: http://www.coling-2014.org ********** Apologies for cross-posting ********** COLING 2014 Dublin, Ireland, 23-29 August, 2014 COLING 2014 (http://www.coling-2014.org), the 25th International Conference on Computational Linguistics, will be organised by CNGL (Centre for Global Intelligent Content) at the Helix Convention Centre at Dublin City University (DCU) from 23-29 August 2014. The COLING conference is organised under the auspices of the International Committee on Computational Linguistics (ICCL). Accepted Tutorials and Workshops We are pleased to announce that the workshops and tutorials for COLING 2014 have been finalised. In total there will be 6 tutorials and 18 workshops covering a wide range of topics. You will be able to register for tutorials and workshops when the main conference registration opens. Accepted Tutorials: - Dependency Parsing: Past, Present, and Future - Wenliang Chen, Zhenghua Li and Min Zhang - Using Neural Networks for Modeling and Representing Natural Languages - Thomas Mikolov - Multilingual Word Sense Disambiguation and Entity Linking - Roberto Navigli and Andrea Moro - Selection Bias, Label Bias, and Bias in Ground Truth - Anders Søgaard, Barbara Plank and Dick Hovy - Automated Grammatical Error Correction for Language Learners - Joel Tetreault and Claudia Leacock - Biomedical/clinical NLP - Ozlem Uzuner and Meliha Yetisgen Accepted Workshops: - Workshop on Lexical and Grammatical Resources for Language Processing - First Joint Workshop on Statistical Parsing of Morphologically Rich Languages and Syntactic Analysis of Non-Canonical Languages (SPMRL-SANCL) - Celtic Language Technology Workshop (CLTW) - 5th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP) - Synchronic and Diachronic Approaches to Analyzing Technical Language (SADAATL) - The 3rd Workshop on Vision and Language (VL'14) - First Joint 
Workshop on Multidisciplinary Approaches to Big Social Data Analysis and Social NLP (MABSDA-SocialNLP) - First Joint Workshop on Open Infrastructures for HLT Resource Processing and Development and Text Analysis Frameworks - Cognitive Aspects of the Lexicon (CogALex-IV) - Semantic Web and Information Extraction (SWAIE) - VarDial: Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects (VarDial) - SemEval-2014: Semantic Evaluation Exercises - International Workshop on Semantic Evaluation (SemEval) - The AHA!-Workshop on Information Discovery in Text (AHA!) - CompuTerm 2014. 4th International Workshop on Computational Terminology (CompuTerm) - The First Workshop on Computational Approaches to Compound Analysis (ComAComA) - Automatic Text Simplification? Methods and Applications in the Multilingual Society (ATS-MA) - The 8th Linguistic Annotation Workshop (The LAW VIII) - The third Joint Conference on Lexical and Computational Semantics (*SEM-2014) Call for Papers: The first call for papers for the main conference is open. The paper submission system is now open at https://www.softconf.com/coling2014/main/. Papers should be prepared using the paper templates available here http://www.coling-2014.org/doc/coling2014.zip. We invite authors to submit papers in all relevant areas and encourage authors to include analysis of the influence of theories (intuitions, methodologies, insights), to technologies (computational algorithms, methods, tools, data), and/or contributions of technologies to theory development. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome. For more information and instructions for authors see the preliminary call for papers here http://www.coling-2014.org/call-for-papers.php. 
Sponsorship and Promotion Opportunities

With the tremendous impact and growth of COLING over many years, we have set out a comprehensive offering of corporate sponsorship and support opportunities for industry participants. The COLING 2014 organising committee is committed to working in close partnership with corporate supporters to maximise the benefits of sponsorship and provide opportunities for sponsors to engage appropriately with COLING delegates. With this in mind, if you wish to explore additional options for support or means of engagement with the COLING 2014 community please feel free to contact sponsorship at coling-2014.org.

From hamon at LIMSI.FR Tue Feb 11 20:57:26 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:57:26 +0100
Subject: Appel: ESWC 2014, Challenge Linked Open Data-enabled
Message-ID:

Date: Mon, 10 Feb 2014 15:09:59 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20140210141019.5E85E119754 at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/RecSys

** apologies for cross-posting **

= Second Call for Challenge: Linked Open Data-enabled Recommender Systems =

Challenge Website: http://challenges.2014.eswc-conferences.org/RecSys
Call Web page: http://2014.eswc-conferences.org/important-dates/call-RecSys

11th Extended Semantic Web Conference (ESWC) 2014
Dates: May 25 - 29, 2014
Venue: Anissaras, Crete, Greece
Hashtag: #eswc2014
Feed: @eswc_conf
Site: http://2014.eswc-conferences.org
General Chair: Valentina Presutti (STLab, ISTC-CNR, IT)
Challenge Coordinator: Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)
Challenge Chairs:
- Tommaso Di Noia (Polytechnic University of Bari, IT)
- Ivan Cantador (Universidad Autonoma de Madrid, ES)

MOTIVATION AND OBJECTIVES

People generally need more and more advanced tools that go beyond those implementing the canonical search paradigm for seeking relevant information. A new search paradigm is emerging, where the user perspective is completely reversed: from finding to being found. Recommender systems may help to support this new perspective, because they have the effect of pushing relevant objects, selected from a large space of possible options, to potentially interested users. To achieve this result, recommendation techniques generally rely on data referring to three kinds of objects: users, items and their relations.

Recent developments in the Semantic Web community offer novel strategies to represent data about users, items and their relations that might improve the current state of the art of recommender systems, in order to move towards a new generation of recommender systems that fully understand the items they deal with.

More and more semantic data are published following the Linked Data principles, which make it possible to set up links between objects in different data sources, connecting information in a single global data space: the Web of Data. Today, the Web of Data includes different types of knowledge represented in a homogeneous form: sedimentary knowledge (encyclopedic, cultural, linguistic, common-sense) and real-time knowledge (news, data streams, ...). These data might be useful to interlink diverse information about users, items, and their relations and to implement reasoning mechanisms that can support and improve the recommendation process.

The primary goal of this challenge is twofold.
On the one hand, we want to create a link between the Semantic Web and the Recommender Systems communities. On the other hand, we aim to show how Linked Open Data (LOD) and semantic technologies can boost the creation of a new breed of knowledge-enabled and content-based recommender systems.

TARGET AUDIENCE

The target audience comprises the Semantic Web and Recommender Systems communities, both academic and industrial, that are interested in personalized information access, with a particular emphasis on Linked Open Data. At the last ACM RecSys conference, more than 60% of participants were from industry. This attests to the current interest in recommender systems for industrial applications ready to be released on the market.

TASKS

* Task 1: Rating prediction in cold-start situations

This task deals with the rating prediction problem, in which a system is requested to estimate the value of unknown numeric scores (a.k.a. ratings) that a target user would assign to available items, indicating whether she likes or dislikes them. In order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims at predicting ratings in cold-start situations, that is, predicting ratings for users who have only a few past ratings, and predicting ratings of items that have been rated by only a few users.

The dataset to use in the task, DBbook, relates to the book domain. It contains explicit numeric ratings assigned by users to books. For each book we provide the corresponding DBpedia URI. Participants will have to use the provided ratings as training sets, and will have to estimate unknown ratings in a non-provided evaluation set. Recommendation approaches will be evaluated on the evaluation set by means of a metric that measures the differences between real and estimated ratings, namely the Root Mean Square Error (RMSE).
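
As a rough illustration of the Task 1 metric, the sketch below computes RMSE as the square root of the mean squared difference between real and estimated ratings. It is only a minimal sketch: the function name `rmse` and the sample ratings are made up for illustration, and the organizers' actual evaluation script may differ in details such as how missing predictions are handled.

```python
import math


def rmse(actual, predicted):
    """Root Mean Square Error between two equally long rating sequences."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("at least one rating is required")
    # Mean of the squared differences, then the square root.
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical ratings on the 0-5 scale used by the challenge.
real_ratings = [4, 2, 5]
estimated_ratings = [3.5, 2.5, 4.0]
print(round(rmse(real_ratings, estimated_ratings), 3))  # 0.707
```

Lower values are better; a system that predicts every rating exactly scores 0.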
* Task 2: Top-N recommendation from binary user feedback This task deals with the top-N recommendation problem, in which a system is requested to find and recommend a limited set of N items that best match a user profile, instead of correctly predicting the ratings for all available items. Similarly to Task 1, in order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims to generate ranked lists of items for which no graded ratings are available, but only binary ones. Also in this case, the DBbook dataset is used. In this task, the accuracy of recommendation approaches will be evaluated on an evaluation set using the F-measure. * Task 3: Diversity A very interesting aspect of content-based recommender systems, and hence of LOD-enabled ones, is that they make it possible to evaluate the diversity of recommended items in a straightforward way. This is a very popular topic in content-based recommender systems, which usually suffer from over-specialization. In this task, the evaluation will be made by considering a combination of both accuracy (F-measure) of the recommendation list and the diversity (Intra-List Diversity) of items belonging to it. Also for this task, the DBbook dataset is used. Given the domain of books, diversity with respect to the two properties http://dbpedia.org/ontology/author and http://purl.org/dc/terms/subject will be considered. DATASET * DBbook dataset This dataset relies on user data and preferences retrieved from the Web. The books available in the dataset have been mapped to their corresponding DBpedia URIs. The mapping contains 8170 DBpedia URIs. These mappings can be used to extract semantic features from DBpedia or other LOD repositories to be exploited by the recommendation approaches proposed in the challenge. The dataset is split into a training set and an evaluation set.
In the former, user ratings are provided to train a system while in the latter, ratings have been removed, and they will be used in the final evaluation step. The mapping file is available at: http://sisinflab.poliba.it/semanticweb/lod/recsys/2014challenge/DBbook_Items_DBpedia_mapping.tsv.zip It contains a tab-separated values file where each line has the following format: DBbook_ItemID \t name \t DBpedia_URI. We suggest extracting semantic descriptions for all the items present in this mapping file by starting from the DBpedia URIs. The training sets are available at: * Task 1: http://sisinflab.poliba.it/semanticweb/lod/recsys/2014challenge/DBbook_train_ratings.zip The archive contains a tab-separated values file containing the training data and a README describing its content. Each line in the file is composed of: userID \t itemID \t rating. The ratings are on a 0-5 scale. The training set contains 75559 ratings. There are 6181 users and 6166 items which have been rated by at least one user. * Task 2 and Task 3: http://sisinflab.poliba.it/semanticweb/lod/recsys/2014challenge/DBbook_train_binary.zip The archive contains a tab-separated values file containing the training data and a README describing its content. Each line in the file is composed of: userID \t itemID \t rating. The ratings are on a binary scale. 1 means that the item is relevant for the user, 0 means irrelevant. The training set contains 72372 ratings. There are 6181 users and 6733 items which have been rated by at least one user. ADDITIONAL DATASETS Although not used in the challenge, two additional rating datasets linked to DBpedia are provided, namely the well-known MovieLens10M dataset and the Last.fm dataset published at the HetRec'11 workshop. http://sisinflab.poliba.it/semanticweb/lod/recsys/datasets/ We encourage participants to use these datasets for testing the developed recommendation approaches on several domains.
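As a rough illustration of the data format and metrics mentioned above, the following Python sketch parses lines of the form userID \t itemID \t rating and computes the RMSE (Task 1) and an F-measure over a top-N list (Task 2). It is only a sketch under stated assumptions: the function names are hypothetical, the files are assumed to have no header row (the README in each archive is the authority on that), and the way the official evaluation service chooses N or averages the F-measure over users is not described in this call, so the per-user F1 below is an assumption.

```python
import math


def parse_ratings(lines):
    """Parse 'userID <TAB> itemID <TAB> rating' lines into {(user, item): rating}.

    Assumption: no header row; blank lines are skipped.
    """
    ratings = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        user_id, item_id, rating = line.split("\t")
        ratings[(user_id.strip(), item_id.strip())] = float(rating)
    return ratings


def rmse(actual, predicted):
    """Root Mean Square Error over the (user, item) pairs present in both dicts."""
    keys = actual.keys() & predicted.keys()
    if not keys:
        raise ValueError("no overlapping (user, item) pairs to evaluate")
    squared_error = sum((actual[k] - predicted[k]) ** 2 for k in keys)
    return math.sqrt(squared_error / len(keys))


def f_measure_at_n(recommended, relevant, n):
    """F1 of the first n recommended items against one user's relevant items.

    Assumption: balanced F1 computed per user; the challenge's exact
    definition may differ.
    """
    top_n = recommended[:n]
    hits = sum(1 for item in top_n if item in relevant)
    if hits == 0:
        return 0.0
    precision = hits / len(top_n)
    recall = hits / len(relevant)
    return 2 * precision * recall / (precision + recall)


# Toy data (made up for illustration, not taken from DBbook).
toy_lines = ["u1\ti1\t4", "u1\ti2\t2"]
toy_actual = parse_ratings(toy_lines)
toy_predicted = {("u1", "i1"): 3.0, ("u1", "i2"): 4.0}
print(round(rmse(toy_actual, toy_predicted), 4))
print(f_measure_at_n(["a", "b", "c"], {"a", "d"}, 2))
```

To use it on the real training data, one could unzip an archive and pass the opened text file directly to parse_ratings, since a file object iterates over its lines.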
JUDGING AND PRIZES After a first round of reviews, the Program Committee and the chairs will select a number of submissions that will have to satisfy the challenge requirements, and will have to be presented at the conference. Submissions accepted for presentation will receive constructive reviews from the Program Committee, and will be included in post-proceedings. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC'14, and will be invited to submit a paper to a dedicated Semantic Web Journal special issue. For each task we will select: * the best performing tool, given to the paper which will get the highest score in the evaluation * the most original approach, selected by the Challenge Program Committee with the reviewing process An amount of 700 Euro has already been secured for the final prize. We are currently working on securing further funding. Winners will be selected only for tasks with at least 3 participants. In any case, all submissions will be reviewed and, if accepted, published at ESWC post-proceedings. HOW TO PARTICIPATE 1. Make your result submission * Register your group using the registration web form available at: http://193.204.59.20:8181/eswc2014lodrecsys/signup.html * Choose one or more tasks among Task 1, Task 2 and Task 3 * Build your recommender system using the provided training data. * Evaluate your approach by submitting your results using the evaluation service. * Your final score will be the one computed with respect to the last result submission made before March 7, 2014, 23:59 CET. 2. Submit your paper The following information has to be provided: * Abstract: no more than 200 words. 
* Description: It should contain the details of the system, including why the system is innovative, how it uses Semantic Web, which features or functions the system provides, what design choices were made, and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. Papers must be submitted in PDF format, following the style of the Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length. All submissions should be provided via EasyChair: https://www.easychair.org/conferences/?conf=eswc2014-challenges MAILING LIST We invite the potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge. https://lists.sti2.org/mailman/listinfo/eswc2014-recsys-challenge IMPORTANT DATES * March 7, 2014, 23:59 CET: Result submission due * March 14, 2014, 23:59 CET: Paper submission due * April 9, 2014, 23:59 CET: Notification of acceptance * May 27-29, 2014: The Challenge takes place at ESWC'14 EVALUATION COORDINATOR * Vito Claudio Ostuni (Polytechnic University of Bari, IT) PROGRAM COMMITTEE (to be completed) * Pablo Castells, Universidad Autonoma de Madrid, Spain * Oscar Corcho, Universidad Politecnica de Madrid, Spain * Marco de Gemmis, University of Bari Aldo Moro, Italy * Frank Hopfgartner, Technische Universitat Berlin, Germany * Andreas Hotho, Universitat Wurzburg, Germany * Dietmar Jannach, TU Dortmund University, Germany * Pasquale Lops, University of Bari Aldo Moro, Italy * Valentina Maccatrozzo, VU University Amsterdam, The Netherlands * Roberto Mirizzi, Polytechnic University of Bari, Italy * Alexandre Passant, seevl.fm, Ireland * Francesco Ricci, Free University of Bozen-Bolzano, Italy * Giovanni Semeraro, University of Bari Aldo Moro, Italy * David Vallet, NICTA, Australia * Manolis Wallace, University of Peloponnese, Greece * Markus Zanker, 
Alpen-Adria-Universitaet Klagenfurt, Austria * Tao Ye, Pandora Internet Radio, USA From hamon at LIMSI.FR Tue Feb 11 21:08:25 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 11 Feb 2014 22:08:25 +0100 Subject: Job: Open Position for Graduate Fellowship in Semantic Web Technologies, CNR-STLAB Message-ID: Date: Mon, 10 Feb 2014 16:48:46 +0100 (CET) From: speroni at cs.unibo.it Message-Id: <20140210154907.4FE7D119754 at vina.cines.fr> X-url: http://istc.cnr.it/vacancy/assegno-di-ricerca-n%C2%B0-2092014-tecnologie-semantiche-e-open-data-legovernment-e-la-fruizione- TITLE: Open Position for Graduate Fellowship in Semantic Web Technologies at CNR-STLAB Important notice: send by February 19th to stlab at cnr.it your CV, a motivational statement, and the contact of at least one referee (or a recommendation letter). The official procedure for being admitted to the selection is described on the official application post (see below) and the deadline is February 27th. Topic: Theories and methods for knowledge extraction and representation at a web scale and their application to cultural heritage and eGovernment. Type of Grant: Graduate Fellowship Employer: Institute of Cognitive Science and Technologies of CNR Salary: EUR 19.367,00 (nineteen-thousand-three-hundred-sixtyseven/00) net of expenses in charge of CNR. Starting from: April 2014 Duration: 12 months Location: Rome, Italy Official application deadline: 27 February, 2014 Link to the official application post: http://istc.cnr.it/vacancy/assegno-di-ricerca-n%C2%B0-2092014-tecnologie-semantiche-e-open-data-legovernment-e-la-fruizione- Scientific responsible: Dr. Valentina Presutti Contact person (administrative issues): stefania.capotosti at istc.cnr.it Type of Grant: Graduate Fellowship There will be a public selection procedure, based on qualifications and an interview, for the assignment of n. 
1 (one) - Graduate Fellowship in order to conduct research related to the Scientific Area Information Sciences AND Computer Sciences at the Institute of Cognitive Sciences and Technologies, CNR, in the scope of the projects: eGovernment, Digital Libraries and Hermes, under the scientific responsibility of Dr. Valentina Presutti. Individuals of any nationality or age may apply to the selection, provided that they meet the following requirements at the deadline for submission of applications: a) Degree in Computer Science or Engineering or Literature and Philosophy in accordance with the legislation in force before DM 509/99 or Degree in Computer Science, Engineering, Linguistics or Science of Language (or equivalent) in accordance with the regulations referred to in DM 509/99 or Master's Degree in Computer Science, Engineering, Linguistics or Science of Language (or equivalent) in accordance with the regulations referred to in DM 270/04, with professional resume suitable for the conduct of research according to the specifications given in the following points (the candidate is responsible, on penalty of exclusion, for demonstrating the equivalence of graduation diplomas); b) All qualifications obtained abroad (bachelor's degree, doctorate, and any other qualification) shall be previously recognized in Italy in accordance with current legislation (information on the website of the Ministry of University and Scientific Research: www.miur.it). The equivalence of those diplomas obtained abroad that have not already been recognized in Italy with the expected formal procedure above will be evaluated, for the sole purpose of the present selection, by the Examining Committee constituted according to art. 6, paragraph 1 of the Regulations; c) Documented experience of research, development and application of semantic technologies.
In particular, expertise is required in at least one of the following areas: ontology design and open data, knowledge representation, and natural language processing; d) Excellent knowledge of OWL, RDF, and SPARQL; e) Knowledge of mobile application development platforms; f) Knowledge and documented experience of Java development; preference will be given to candidates who also know other programming languages; g) English proficiency. THE ENGLISH CALL ON THE WEB-SITE DOES NOT HAVE LEGAL VALUE IN ITSELF, AND THUS DOES NOT SUPERSEDE THE ITALIAN VERSION OF THE CALL ANNOUNCEMENT (BANDO). From hamon at LIMSI.FR Tue Feb 11 20:39:46 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 11 Feb 2014 21:39:46 +0100 Subject: Appel: INTERSPEECH 2014, Special Sessions Message-ID: Date: Mon, 10 Feb 2014 12:20:47 +0800 From: "Organization @ Interspeech 2014" Message-ID: <52F8539F.8050905 at interspeech2014.org> X-url: http://www.INTERSPEECH2014.org --- INTERSPEECH 2014 - SINGAPORE --- September 14-18, 2014 --- http://www.INTERSPEECH2014.org INTERSPEECH is the world's largest and most comprehensive conference on issues surrounding the science and technology of spoken language processing, both in humans and in machines. The theme of INTERSPEECH 2014 is --- Celebrating the Diversity of Spoken Languages --- INTERSPEECH 2014 includes a number of special sessions covering interdisciplinary topics and/or important new emerging areas of interest related to the main conference topics.
Special sessions proposed for the forthcoming edition are: - A Re-evaluation of Robustness - Deep Neural Networks for Speech Generation and Synthesis - Exploring the Rich Information of Speech Across Multiple Languages - INTERSPEECH 2014 Computational Paralinguistics ChallengE (ComParE) - Multichannel Processing for Distant Speech Recognition - Open Domain Situated Conversational Interaction - Phase Importance in Speech Processing Applications - Speaker Comparison for Forensic and Investigative Applications - Text-dependent for Short-duration Speaker Verification - Tutorial Dialogues and Spoken Dialogue Systems - Visual Speech Decoding A description of each special session is given below. For paper submission, please follow the main conference procedure and choose the Special Session track when selecting your paper area. Paper submission procedure is described at: http://www.INTERSPEECH2014.org/public.php?page=submission_procedure.html For more information, feel free to contact the Special Session Chair, Dr. Tomi H. Kinnunen, at email tkinnu [at]cs.uef.fi ------------------------------------------------------------------------ Special Session Description ------------------------------------------------------------------------ A Re-evaluation of Robustness The goal of the session is to facilitate a re-evaluation of robust speech recognition in the light of recent developments. It’s a re-evaluation at two levels: - A re-evaluation in perspective, brought by breakthroughs in performance obtained by Deep Neural Networks, which leads to a fresh questioning of the role and contribution of robust feature extraction. - A literal re-evaluation on common databases to be able to present and compare performances of different algorithms and system approaches to robustness.
Paper submissions are invited on the theme of noise robust speech recognition and are required to report results on the Aurora 4 database to facilitate cross comparison of the performance between different techniques. Recent developments raise interesting research questions, and the session aims to help make progress on them by bringing focus to and exploration of these issues. For example: 1. What role is there for signal processing to create feature representations to use as inputs to Deep Learning or can deep learning do all the work? 2. What feature representations can be automatically learnt in a deep learning architecture? 3. What other techniques can give great improvement in robustness? 4. What techniques don’t work and why? The session organizers wish to encourage submissions that bring insight and understanding to the issues highlighted above. Authors are requested not only to present absolute performance of the whole system but also to highlight the contribution made by various components in a complex system. Papers that are accepted for the session are encouraged to also evaluate their techniques on new test data sets (available in July) and submit their results at the end of August. Session organization The session will be structured as a combination of 1. Invited talks 2. Oral paper presentations 3. Poster presentations 4. Summary of contributions and results on newly released test sets 5.
Discussion Organizers: David Pearce, Audience dpearce [at]audience.com Hans-Guenter Hirsch, Niederrhein University of Applied Sciences, hans-guenter.hirsch [at]hs-niederrhein.de Reinhold Haeb-Umbach, University of Paderborn, haeb [at]nt.uni-paderborn.de Michael Seltzer, Microsoft, mseltzer [at]microsoft.com Keikichi Hirose, The University of Tokyo, hirose [at]gavo.t.u-tokyo.ac.jp Steve Renals, University of Edinburgh, s.renals [at]ed.ac.uk Sim Khe Chai, National University of Singapore, simkc [at]comp.nus.edu.sg Niko Moritz, Fraunhofer IDMT, Oldenburg, niko.moritz [at]idmt.fraunhofer.de K K Chin, Google, kkchin [at]google.com Deep Neural Networks for Speech Generation and Synthesis This special session aims to bring together researchers who work actively on deep neural networks for speech research, particularly in generation and synthesis, to promote and better understand state-of-the-art DNN research in statistical learning and to compare results with the parametric HMM-GMM model, which has been well established for speech synthesis, generation, and conversion. A DNN, with its neuron-like structure, can simulate the human speech production system in a layered, hierarchical, nonlinear and self-organized network. It can transform linguistic text information into intermediate semantic, phonetic and prosodic content and finally generate speech waveforms. Many possible neural network architectures or topologies exist, e.g. feed-forward NN with multiple hidden layers, stacked RBM or CRBM, Recurrent Neural Net (RNN), which have been used for speech/image recognition and other applications. We would like to use this special session as a forum to present updated results in the research frontiers, algorithm development and application scenarios. Particular focus areas will be parametric TTS synthesis, voice conversion, speech compression, de-noising and speech enhancement. Organizers: Yao Qian, Microsoft Research Asia, yaoqian [at]microsoft.com Frank K.
Soong, Microsoft Research Asia, frankkps [at]microsoft.com Exploring the Rich Information of Speech Across Multiple Languages Spoken language is the most direct means of communication between human beings. However, speech communication often demonstrates its language-specific characteristics because of, for instance, the linguistic difference (e.g., tonal vs. non-tonal, monosyllabic vs. multisyllabic) across languages. Our knowledge of the diversities of speech science across languages is still limited, including speech perception, linguistic and non-linguistic (e.g., emotion) information, etc. This knowledge is of great significance to facilitate our design of language-specific application of speech techniques (e.g., automatic speech recognition, assistive hearing devices) in the future. This special session will provide an opportunity for researchers from various communities (including speech science, medicine, linguistics and signal processing) to stimulate further discussion and new research in the broad cross-language area, and present their latest research on understanding the language-specific features of speech science and their applications in the speech communication of machines and human beings. This special session encourages contributions from all fields of speech science, e.g., production and perception, but with a focus on presenting the language-specific characteristics and discussing their implications to improve our knowledge of the diversities of speech science across multiple languages. Topics of interest include, but are not limited to: 1. characteristics of acoustic, linguistic and language information in speech communication across multiple languages; 2. diversity of linguistic and non-linguistic (e.g., emotion) information among multiple spoken languages; 3. language-specific speech intelligibility enhancement and automatic speech recognition techniques; and 4.
comparative cross-language assessment of speech perception in challenging environments. Organizers: Junfeng Li, Institute of Acoustics, Chinese Academy of Sciences, junfeng.li.1979 [at]gmail.com Fei Chen, The University of Hong Kong, feichen1 [at]hku.hk INTERSPEECH 2014 Computational Paralinguistics ChallengE (ComParE) The INTERSPEECH 2014 Computational Paralinguistics ChallengE (ComParE) is an open Challenge dealing with speaker characteristics as manifested in their speech signal's acoustic properties. This year, it introduces new tasks by the Cognitive Load Sub-Challenge, the Physical Load Sub-Challenge, and a Multitask Sub-Challenge: For these Challenge tasks, the COGNITIVE-LOAD WITH SPEECH AND EGG database (CLSE), the MUNICH BIOVOICE CORPUS (MBC), and the ANXIETY-DEPRESSION-EMOTION-SLEEPINESS audio corpus (ADES) with high diversity of speakers and different languages covered (Australian English and German) are provided by the organizers. All corpora provide fully realistic data in challenging acoustic conditions and feature rich annotation such as speaker meta-data. They are given with distinct definitions of test, development, and training partitions, incorporating speaker independence as needed in most real-life settings. Benchmark results of the most popular approaches are provided as in the years before. Transcription of the train and development sets will be known. All Sub-Challenges allow contributors to find their own features with their own machine learning algorithm. However, a standard feature set will be provided per corpus that may be used. Participants will have to stick to the definition of training, development, and test sets. They may report on results obtained on the development set, but have only five trials to upload their results on the test sets, whose labels are unknown to them. 
Each participation will be accompanied by a paper presenting the results that undergoes peer-review and has to be accepted for the conference in order to participate in the Challenge. The results of the Challenge will be presented in a Special Session at INTERSPEECH 2014 in Singapore. Further, contributions using the Challenge data or related to the Challenge but not competing within the Challenge are also welcome. More information is given also on the Challenge homepage: http://emotion-research.net/sigs/speech-sig/is14-compare Organizers: Björn Schuller, Imperial College London / Technische Universität München, schuller [at]IEEE.org Stefan Steidl, Friedrich-Alexander-University, stefan.steidl [at]fau.de Anton Batliner, Technische Universität München / Friedrich-Alexander-University, batliner [at]cs.fau.de Jarek Krajewski, Bergische Universität Wuppertal, krajewsk [at]uni-wuppertal.de Julien Epps, The University of New South Wales / National ICT Australia, j.epps [at]unsw.edu.au Multichannel Processing for Distant Speech Recognition Distant speech recognition in real-world environments is still a challenging problem: reverberation and dynamic background noise represent major sources of acoustic mismatch that heavily decrease ASR performance, which, on the contrary, can be very good in close-talking microphone setups. In this context, a particularly interesting topic is the adoption of distributed microphones for the development of voice-enabled automated home environments based on distant-speech interaction: microphones are installed in different rooms and the resulting multichannel audio recordings capture multiple audio events, including voice commands or spontaneous speech, generated in various locations and characterized by a variable amount of reverberation as well as possible background noise. The focus of the proposed special session will be on multichannel processing for automatic speech recognition (ASR) in such a setting.
Unlike other robust ASR tasks, where static adaptation or training with noisy data sensibly ameliorates performance, the distributed microphone scenario requires full exploitation of multichannel information to reduce the highly variable dynamic mismatch. To facilitate better evaluation of the proposed algorithms the organizers will provide a set of multichannel recordings in a domestic environment. The recordings will include spoken commands mixed with other acoustic events occurring in different rooms of a real apartment. The data is being created in the frame of the EC project DIRHA (Distant speech Interaction for Robust Home Applications) which addresses the challenges of speech interaction for home automation. The organizers will release the evaluation package (datasets and scripts) on February 17; the participants are asked to submit a regular paper reporting speech recognition results on the evaluation set and comparing their performance with the provided reference baseline. Further details are available at: http://dirha.fbk.eu/INTERSPEECH2014 Organizers: Marco Matassoni, Fondazione Bruno Kessler, matasso [at]fbk.eu Ramon Fernandez Astudillo, Instituto de Engenharia de Sistemas e Computadores, ramon.astudillo [at]inesc-id.pt Athanasios Katsamanis, National Technical University of Athens, nkatsam [at]cs.ntua.gr Open Domain Situated Conversational Interaction Robust conversational systems have the potential to revolutionize our interactions with computers. Building on decades of academic and industrial research, we now talk to our computers, phones, and entertainment systems on a daily basis. However, current technology typically limits conversational interactions to a few narrow domains/topics (e.g., weather, traffic, restaurants). Users increasingly want the ability to converse with their devices over broad web-scale content. Finding something on your PC or the web should be as simple as having a conversation. 
A promising approach to address this problem is situated conversational interaction. The approach leverages the situation and/or context of the conversation to improve system accuracy and effectiveness. Sources of context include visual content being displayed to the user, Geo-location, prior interactions, multi-modal interactions (e.g., gesture, eye gaze), and the conversation itself. For example, while a user is reading a news article on their tablet PC, they initiate a conversation to dig deeper on a particular topic. Or a user is reading a map and wants to learn more about the history of events at mile marker 121. Or a gamer wants to interact with a game’s characters to find the next clue in their quest. All of these interactions are situated – rich context is available to the system as a source of priors/constraints on what the user is likely to say. This special session will provide a forum to discuss research progress in open domain situated conversational interactions. Topics of the session will include: - Situated context in spoken dialog systems - Visual/dialog/personal/geo situated context - Inferred context through interpretation and reasoning - Open domain spoken dialog systems - Open domain spoken/natural language understanding and generation - Open domain semantic interpretation - Open domain dialog management (large-scale belief state/policy) - Conversational Interactions - Multi-modal inputs in situated open domains (speech/text + gesture, touch, eye gaze) - Multi-human situated interactions Organizers: Larry Heck, Microsoft Research, larry [at]ieee.org Dilek Hakkani-Tür, Microsoft Research, dilek [at]ieee.org Gokhan Tur, Microsoft Research, gokhan [at]ieee.org Steve Young, Cambridge University, sjy [at]eng.cam.ac.uk Phase Importance in Speech Processing Applications In the past decades, the amplitude of speech spectrum is considered to be the most important feature in different speech processing applications and phase of the speech signal has 
received less attention. Recently, several findings justify the phase importance in speech and audio processing communities. The importance of phase estimation along with amplitude estimation in speech enhancement, complementary phase-based features in speech and speaker recognition and phase-aware acoustic modeling of environment are the most prominent reported works scattered in different communities of speech and audio processing. These examples suggest that incorporating the phase information can push the limits of state-of-the-art phase-independent solutions employed for long in different aspects of audio and speech signal processing. This Special Session aims to explore the recent advances and methodologies to exploit the knowledge of signal phase information in different aspects of speech processing. Without a dedicated effort to bring researchers from different communities, a quick advance in investigation towards the phase usefulness in speech processing applications is difficult to achieve. Therefore, as the first step in this direction, we aim to promote the "phase-aware speech and audio signal processing" to form a community of researchers to organize the next steps. Our initiative is to unify these efforts to better understand the pros and cons of using phase and the degree of feasibility for phase estimation/enhancement in different areas of speech processing including: speech enhancement, speech separation, speech quality estimation, speech and speaker recognition, voice transformation and speech analysis and synthesis. The goal is to promote the importance of the phase-based signal processing and studying its importance and sharing interesting findings from different speech processing applications. 
Organizers: Pejman Mowlaee, Graz University of Technology, pejman.mowlaee [at]tugraz.at Rahim Saeidi, University of Eastern Finland, rahim.saeidi [at]uef.fi Yannis Stylianou, Toshiba Labs Cambridge UK / University of Crete, yannis [at]csd.uoc.gr Speaker Comparison for Forensic and Investigative Applications In speaker comparison, speech/voice samples are compared by humans and/or machines for use in investigation or in court to address questions that are of interest to the legal system. Speaker comparison is a high-stakes application that can change people’s lives and it demands the best that science has to offer; however, methods, processes, and practices vary widely. These variations are not necessarily for the better and, though recognized, are not generally appreciated and acted upon. Methods, processes, and practices grounded in science are critical for the proper application (and non-application) of speaker comparison to a variety of international investigative and forensic applications.
This special session will contribute to scientific progress through 1) understanding speaker comparison for investigative and forensic application (e.g., describe what is currently being done and critically analyze performance and lessons learned); 2) improving speaker comparison for investigative and forensic applications (e.g., propose new approaches/techniques, understand the limitations, and identify challenges and opportunities); 3) improving communications between communities of researchers, legal scholars, and practitioners internationally (e.g., directly address some central legal, policy, and societal questions such as allowing speaker comparisons in court, requirements for expert witnesses, and requirements for specific automatic or human-based methods to be considered scientific); 4) using best practices (e.g., reduction of bias and presentation of evidence); 5) developing a roadmap for progress in this session and future sessions; and 6) producing a documented contribution to the field. Some of these objectives will need multiple sessions to fully achieve and some are complicated due to differing legal systems and cultures. This special session builds on previous successful special sessions and tutorials in forensic applications of speaker comparison at INTERSPEECH beginning in 2003. Wide international participation is planned, including researchers from the ISCA SIGs for the Association Francophone de la Communication Parlée (AFCP) and the Speaker and Language Characterization (SpLC). Organizers: Joseph P. Campbell, PhD, MIT Lincoln Laboratory, jpc [at]ll.mit.edu Jean-François Bonastre, l'Université d'Avignon, jean-francois.bonastre [at]univ-avignon.fr Text-dependent for Short-duration Speaker Verification In recent years, speaker verification engines have reached maturity and have been deployed in commercial applications. 
Ergonomics of such applications is especially demanding and imposes a drastic limitation in terms of speech duration during authentication. A well known tactic to address the problem of lack of data, due to short duration, is using text-dependency. However, recent breakthroughs achieved in the context of text-independent speaker verification in terms of accuracy and robustness do not benefit text-dependent applications. Indeed, large development data required by the recent approaches is not available in the text-dependent context. The purpose of this special session is to gather the research efforts from both academia and industry toward a common goal of establishing a new baseline and explore new directions for text-dependent speaker verification. The focus of the session is on robustness with respect to duration and modeling of lexical information. To support the development and evaluation of text-dependent speaker verification technologies, the Institute for Infocomm Research (I2R) has recently released the RSR2015 database, including 150 hours of data recorded from 300 speakers. The papers submitted to the special session are encouraged, but not limited, to provide results based on the RSR2015 database in order to enable comparison of algorithms and methods. For this purpose, the organizers strongly encourage the participants to report performance on the protocol delivered with the database in terms of EER and minimum cost (in the sense of NIST 2008 Speaker Recognition evaluation). To get the database, please contact the organizers. 
Further details are available at: http://www1.i2r.a-star.edu.sg/~kalee/is2014/tdspk.html Organizers: Anthony LARCHER (alarcher [at]i2r.a-star.edu.sg) Institute for Infocomm Research Hagai ARONOWITZ (hagaia [at]il.ibm.com) IBM Research – Haifa Kong Aik LEE (kalee [at]i2r.a-star.edu.sg) Institute for Infocomm Research Patrick KENNY (patrick.kenny [at]crim.ca) CRIM – Montréal Tutorial Dialogues and Spoken Dialogue Systems The growing interest in educational applications that use spoken interaction and dialogue technology has boosted research and development of interactive tutorial systems, and over the recent years, advances have been achieved in both spoken dialogue community and education research community, with sophisticated speech and multi-modal technology which allows functionally suitable and reasonably robust applications to be built. The special session combines spoken dialogue research, interaction modeling, and educational applications, and brings together the two INTERSPEECH SIG communities: SLaTE and SIGdial. The session focuses on methods, problems and challenges that are shared by both communities, such as sophistication of speech processing and dialogue management for educational interaction, integration of the models with theories of emotion, rapport, and mutual understanding, as well as application of the techniques to novel learning environments, robot interaction, etc. The session aims to survey issues related to the processing of spoken language in various learning situations, modeling of the teacher-student interaction in MOOC-like environments, as well as evaluating tutorial dialogue systems from the point of view of natural interaction, technological robustness, and learning outcome. The session encourages interdisciplinary research and submissions related to the special focus of the conference, "Celebrating the Diversity of Spoken Languages". 
For further information, see http://junionsjlee.wix.com/INTERSPEECH

Organizers:
Maxine Eskenazi, max+ [at]cs.cmu.edu
Kristiina Jokinen, kristiina.jokinen [at]helsinki.fi
Diane Litman, litman [at]cs.pitt.edu
Martin Russell, M.J.RUSSELL [at]bham.ac.uk

---

Visual Speech Decoding

Speech perception is a bi-modal process that takes into account both the acoustic (what we hear) and visual (what we see) speech information. It has been widely acknowledged that visual cues play a critical role in automatic speech recognition (ASR), especially when audio is corrupted by, for example, background noise or voices from untargeted speakers, or is even inaccessible. Decoding visual speech is essential if ASR technologies are to be widely deployed and to enable truly natural human-computer interaction. Despite the advances in acoustic ASR, visual speech decoding remains a challenging problem. The special session aims to attract more effort to tackle this important problem. In particular, we would like to encourage researchers to focus on some critical questions in the area. We propose the following four questions as a starting point:
1. How to deal with speaker dependency in visual speech data?
2. How to cope with head-pose variation?
3. How to encode temporal information in visual features?
4. How to automatically adapt the fusion rule when the quality of the two individual (audio and visual) modalities varies?
Researchers and participants are encouraged to raise more questions related to visual speech decoding. We expect the session to draw wide attention from both the speech recognition and machine vision communities to the problem of visual speech decoding.
Organizers:
Ziheng Zhou, University of Oulu, ziheng.zhou [at]ee.oulu.fi
Matti Pietikäinen, University of Oulu, matti.pietikainen [at]ee.oulu.fi
Guoying Zhao, University of Oulu, gyzhao [at]ee.oulu.fi

From hamon at LIMSI.FR Tue Feb 11 21:12:49 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:12:49 +0100
Subject: Job: Deep neural networks for statistical machine translation, several positions available (Postdoc, PhD, engineers, etc), France
Message-ID:

Date: Mon, 10 Feb 2014 22:06:10 +0100
From: Holger Schwenk
Message-ID: <52F93F42.7090004 at lium.univ-lemans.fr>

In recent years, there have been several breakthroughs in the use of neural networks for natural language processing, in particular using deep architectures. The computer science laboratory of the University of Le Mans (LIUM) has been working for many years on statistical machine translation (SMT), and we were among the first researchers to successfully use neural networks, for instance in continuous space language and translation models. We want to substantially increase our research efforts in this area, hoping to achieve significant advances in SMT. Our goal is to build state-of-the-art large-scale SMT systems using deep neural networks.
In this major research effort, we have openings at different levels:
- postdoc positions
- PhD positions
- short-term visits with a well-focused research project
- engineers

The candidates are expected to have demonstrated knowledge in at least one of the following fields:
- neural networks (feed-forward, recurrent NN, deep learning, etc.)
- statistical machine translation
- efficient implementation of machine learning algorithms (GPU, MPI, etc.)

The positions are immediately available. Applications are accepted until the positions are filled. Initial appointment is for one year, renewable for up to three years. Competitive salaries are available, including health care and other social benefits, travel support, etc. The working language is English or French.

LIUM is participating in several international projects, financed by the European Commission, DARPA and the French government. We collaborate with leading research groups in the USA and Europe. A large computer cluster is available to support the research (700 CPU cores with a total of 6 TBytes of memory and more than 250 TBytes of RAID disk space). We also own a cluster with 30 Tesla K20 and K40 GPU cards, connected by a fast Infiniband network.

Le Mans is located between Paris and the Atlantic Ocean. Both can be reached in about 1 hour by high-speed train. The Loire Valley, with many wineries and other attractions, is just a short drive away.

Applications should include a CV, a list of publications and the names of two references. We will invite interesting candidates for further discussions.
For more information, please contact Holger Schwenk by email: Holger.Schwenk at lium.univ-lemans.fr

From hamon at LIMSI.FR Tue Feb 11 20:51:33 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:51:33 +0100
Subject: Stage: M2 internship, automatic analysis of medical consultation reports, IRIT
Message-ID:

Date: Mon, 10 Feb 2014 13:19:07 +0100
From: philippe muller
Message-ID: <52F8C3BB.8060604 at irit.fr>

M2(R) internship proposal

Title: Automatic analysis of medical consultation reports

Keywords: natural language processing, information extraction, machine learning, medicine

Abstract: This research M2 internship targets the automatic analysis of medical consultation reports in order to extract information needed for an epidemiological study. The reports consist of free text summarizing sociodemographic data, the diagnosis, the symptoms, and the results of any tests performed on these patients. Two main objectives are considered: first, classifying the reports according to whether or not a pathology is present; second, extracting a number of specific pieces of information related to these diseases. The intern will evaluate and compare classical information extraction strategies from natural language processing in specialized domains with the application of machine learning methods, for both objectives. The data used is a corpus of de-identified consultation reports, in French.
Supervision: This internship will take place at IRIT (Institut de Recherche en Informatique de Toulouse), in cooperation with INSERM, and will be co-supervised by Philippe Muller (IRIT, MELODI team, "MEthodes et ingénierie des Langues, des Ontologies et du DIscours") and Virginie Gardette ("Vieillissement et maladie d'Alzheimer" team of UMR INSERM 1027 "Epidémiologie et analyses en santé publique").
http://www.irit.fr/-Equipe-MELODI-
http://www.u1027.inserm.fr/42537678/0/fiche___pagelibre/&RH=1303915788348

Required skills: The intern should be enrolled in an M2 program in computer science, ideally with skills in machine learning and/or natural language processing.

Duration: 4-6 months, starting in March or April.

Stipend: 436.05 €/month, in accordance with regulations.

Application: Send a CV (with recent transcripts) and a cover letter to philippe.muller at irit.fr and virginie.gardette at univ-tlse3.fr

From hamon at LIMSI.FR Tue Feb 11 21:03:16 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:03:16 +0100
Subject: Appel: ESWC 2014, Challenge Concept-Level Sentiment Analysis
Message-ID:

Date: Mon, 10 Feb 2014 15:55:04 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20140210145524.577BFDBA76 at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemSA

** apologies for cross-posting
**

==== Second Call for Challenge: Concept-Level Sentiment Analysis ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemSA
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemSA

11th Extended Semantic Web Conference (ESWC) 2014
Dates: May 25 - 29, 2014
Venue: Anissaras, Crete, Greece
Hashtag: #eswc2014
Feed: @eswc_conf
Site: http://2014.eswc-conferences.org
General Chair: Valentina Presutti (STLab, ISTC-CNR, IT)
Challenge Coordinator: Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)
Challenge Chairs:
- Erik Cambria (National University of Singapore, SG)
- Diego Reforgiato (STLab, ISTC-CNR, IT)

MOTIVATION AND OBJECTIVES

Mining opinions and sentiments from natural language is an extremely difficult task, as it involves a deep understanding of most of the explicit and implicit, regular and irregular, syntactic and semantic rules of a language. Existing approaches mainly rely on parts of text in which opinions and sentiments are explicitly expressed, such as polarity terms, affect words and their co-occurrence frequencies. However, opinions and sentiments are often conveyed implicitly through latent semantics, which makes purely syntactic approaches ineffective. To this end, concept-level sentiment analysis aims to go beyond a mere word-level analysis of text and provide novel approaches to opinion mining and sentiment analysis that allow a more efficient passage from (unstructured) textual information to (structured) machine-processable data, in potentially any domain. Concept-level sentiment analysis focuses on a semantic analysis of text through the use of web ontologies or semantic networks, which allow the aggregation of conceptual and affective information associated with natural language opinions.
By relying on large semantic knowledge bases, concept-level sentiment analysis steps away from blind use of keywords and word co-occurrence count, but rather relies on the implicit features associated with natural language concepts. This Challenge focuses on the introduction, presentation, and discussion of novel approaches to concept-level sentiment analysis. Participants will have to design a concept-level opinion-mining engine that exploits common-sense knowledge bases, e.g., SenticNet, and/or Linked Data and Semantic Web ontologies, e.g., DBPedia, to perform multi-domain sentiment analysis. The main motivation for the Challenge, in particular, is to go beyond a mere word-level analysis of natural language text and provide novel concept-level tools and techniques that allow a more efficient passage from (unstructured) natural language to (structured) machine-processable data, in potentially any domain. Systems must have a semantics flavor (e.g., by making use of Linked Data or known semantic networks within their core functionalities) and authors need to show how the introduction of semantics can be used to obtain valuable information, functionality or performance. Existing natural language processing methods or statistical approaches can be used too as long as the semantics plays a main role within the core approach (engines based merely on syntax/word-count will be excluded from the competition). TARGET AUDIENCE The Challenge is open to everyone from industry and academia. TASKS The Concept-Level Sentiment Analysis Challenge is defined in terms of different tasks. The first task is elementary whereas the others are more advanced. The input units of each task are sentences. Sentences are assumed to be in grammatically correct American English and have to be processed according to the input format specified at http://sentic.net/challenge/sentence. * Elementary Task: Polarity Detection The main goal of the task is polarity detection. 
The proposed systems will be assessed according to precision, recall and F-measure of detected binary polarity values (1=positive; 0=negative) for each input sentence of the evaluation dataset, following the same format as in http://sentic.net/challenge/task0. The problem of subjectivity detection is not addressed within this Challenge, hence participants can assume that there will be no neutral sentences. Participants are encouraged to use the Sentic API or further develop and apply sentic computing tools.

* Advanced Task #1: Aspect-Based Sentiment Analysis
The output of this task will be a set of aspects of the reviewed product and a binary polarity value associated with each such aspect, in the format specified at http://sentic.net/challenge/task1. So, for example, while for the Elementary task an overall polarity (positive or negative) is expected for a review about a mobile phone, this task requires a set of aspects (such as 'speaker', 'touchscreen', 'camera', etc.) and a polarity value (positive OR negative) associated with each such aspect. Systems will be assessed according to both aspect extraction and aspect polarity detection.

* Advanced Task #2: Semantic Parsing
As suggested by the title, the Challenge focuses on sentiment analysis at concept level. This means that the proposed systems are not supposed to work at word/syntax level but rather to work with concepts/semantics. Hence, this task will evaluate the capability of the proposed systems to deconstruct natural language text into concepts, following the same format as in http://sentic.net/challenge/task2. SenticNet will be taken as a reference to test the efficiency of the proposed parsers, but extracted concepts will not necessarily have to match SenticNet concepts. The proposed systems, for example, are supposed to be able to extract a multi-word expression like 'buy christmas present' from sentences such as 'Today I bought a lot of very nice Christmas presents'.
The number of extracted concepts per sentence will be assessed through precision, recall and F-measure against the evaluation dataset.

* Advanced Task #3: Topic Spotting
Input sentences will be about four different domains, namely: books, DVDs, electronics, and kitchen appliances. This task focuses on the automatic classification of sentences into one of these domains, in the format specified at http://sentic.net/challenge/task3. All sentences are assumed to belong to only one of the above-mentioned domains. The proposed systems are supposed to exploit the extracted concepts to infer which domain each sentence belongs to. Classification accuracy will be evaluated in terms of precision, recall and F-measure against the evaluation dataset.

EVALUATION DATASET

Systems will be evaluated against a testing dataset which will be revealed and released after the first round of evaluation during the Conference. The dataset will be made public on the challenge website. Participants are advised to train and/or test their own systems using the Blitzer Dataset. The testing dataset will be constructed in the same way and from the same sources as the Blitzer dataset.

EVALUATION

The evaluation will be performed by the members of the Program Committee. For systems that can be tuned with different parameters, please indicate a range of up to 4 sets of settings. Settings with the best F-measures will be considered for judgment. For each system, reviewers will give a numerical score within the range [1-10] and details motivating their choice. The scores will be given for the following aspects:
1. Use of common-sense knowledge and semantics;
2. Precision, recall, and F-measure with respect to the selected task;
3. Computational time;
4. Innovative nature of the approach.

JUDGING AND PRIZES

After a first round of review, the Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work.
Submissions accepted for presentation will be included in post-proceedings and will receive constructive reviews from the Program Committee. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a paper to a dedicated Semantic Web Journal special issue.

For the Concept-Level Sentiment Analysis Challenge there will be two awards for each task:
* Quantitative: the system with the highest average score in items 1-3 above;
* Innovative: the system with the highest score in item 4 above.

There will be a board of judges at the conference who will evaluate the systems again in more detail. The judges will then meet in private to discuss the entries and to determine the winners. The same system may compete for both awards. Winners will be selected only for tasks with at least 3 participants. In any case, all submissions will be reviewed and, if accepted, published in ESWC post-proceedings. An amount of 700 euros has already been secured for the first task, as far as the first of the evaluation aspects is concerned. We are currently working on securing further funding.

HOW TO PARTICIPATE

The following information has to be provided:
* Abstract: no more than 200 words.
* Description: It should contain the details of the system, including why the system is innovative, how it uses the Semantic Web, which features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. Papers must be submitted in PDF format, following the style of Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length.
* Web Access: The application can either be accessible via the web or downloadable.
If the application is not publicly accessible, a password must be provided. A short set of instructions on how to use the application should be provided as well.

All submissions should be provided via EasyChair: https://www.easychair.org/conferences/?conf=eswc2014-challenges

MAILING LIST

We invite potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge.
https://lists.sti2.org/mailman/listinfo/eswc2014-semsa-challenge

IMPORTANT DATES

* March 14, 2014, 23:59 CET: Submission due
* April 9, 2014, 23:59 CET: Notification of acceptance
* May 27-29, 2014: The Challenge takes place at ESWC-14

PROGRAM COMMITTEE

* Newton Howard, MIT Media Laboratory (USA)
* Cheng Xiang Zhai, University of Illinois at Urbana-Champaign (USA)
* Rada Mihalcea, University of North Texas (USA)
* Ping Chen, University of Houston-Downtown (USA)
* Yongzheng Zhang, LinkedIn Inc. (USA)
* Giuseppe Di Fabbrizio, Amazon Inc. (USA)
* Rui Xia, Nanjing University of Science and Technology (China)
* Rafal Rzepka, Hokkaido University (Japan)
* Amir Hussain, University of Stirling (UK)
* Alexander Gelbukh, National Polytechnic Institute (Mexico)
* Bjoern Schuller, Technical University of Munich (Germany)
* Amitava Das, Samsung Research India (India)
* Dipankar Das, National Institute of Technology (India)
* Carlo Strapparava, Fondazione Bruno Kessler (Italy)
* Stefano Squartini, Marche Polytechnic University (Italy)
* Cristina Bosco, University of Torino (Italy)
* Paolo Rosso, Technical University of Valencia (Spain)

From hamon at LIMSI.FR Tue Feb 11 21:23:29 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:23:29 +0100
Subject: Job: Professor of computer science, UFR Mathématiques et Informatique, Université de Lorraine
Message-ID:

Date: Tue, 11 Feb 2014 18:07:36 +0100
From: smaili
Message-ID: <52FA58D8.1040706 at loria.fr>

The UFR Mathématiques et Informatique and LORIA at the Université de Lorraine (UL) are offering a professorship in computer science, whose teaching and research profiles are given below. In addition, on March 19 we are organizing a presentation day at the UFR MI, where candidates who wish to do so can present their applications to the teaching teams of the UFR Mathématiques et Informatique.

Teaching profile: The professor is expected to join the teaching team of the bachelor's (licence) and master's programs in cognitive science. Cognitive science is concerned with understanding artificial, animal or human thought. The skills sought therefore range from computer science to language sciences, by way of biology. The UFR will recruit a professor who has as many skills related to cognitive science as possible and who could teach courses on: cognitive aspects of computational linguistics, neuroscience, knowledge representation, artificial intelligence, collective intelligence, etc. The candidate must have solid competence and experience in this field in order to be operational from the start of the 2014-2015 academic year, when the needs of our programs are substantial. The candidate will join the teaching team and contribute to the creation of innovative projects. On the administrative side, the candidate must be able to take on heavy responsibilities very quickly: setting up new degrees, writing accreditation applications, heading degree programs or the UFR. The recruited professor is expected to be a driving force in the future projects we are currently developing.

Department/UFR: Mathématiques et Informatique
UFR contact: K. Smaïli
Teaching keywords: cognitive science, natural language processing, neuroscience, artificial intelligence

Research profile: The recruited professor will conduct research at the Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA) and is expected to join Department 4 (Natural language and knowledge processing).
All teams of the department are concerned. This department works on three closely related notions: document, language and knowledge. In a broad sense, one can speak of computational semantics. Indeed, in document analysis, the characters and symbols in a document can be regarded as low-level cues and combined to build much more complex entities. Speech recognition and sentence analysis study the semantics related to language. And, at a higher level, representations of discourse or knowledge require the study of logical formalisms, their expressive power, the mapping from one formalism to another, reasoning, and proof. The teams primarily concerned are Cello, Orpailleurs, Parole (Multispeech and SMarT), Qgar, Read, Semagramme, Synalp.

Laboratory 1 name: LORIA
Laboratory 1 unit number: UMR 7503
Research keywords: computational semantics, natural language processing, computational logic, document analysis, pattern recognition, stochastic modeling, text/data processing, Bayesian networks, knowledge discovery, knowledge representation, knowledge systems, systems biology, molecular docking ("accueil moléculaire" in the original), linear logic, categorial grammars, syntax, speech recognition, articulatory modeling, handwriting recognition, incremental classifiers

From hamon at LIMSI.FR Tue Feb 11 21:17:55 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:17:55 +0100
Subject: Soft: Talismane, a robust dependency parser
Message-ID:

Date: Tue, 11 Feb 2014 16:27:03 +0100
From: Assaf Urieli
Message-ID:
X-url: http://redac.univ-tlse2.fr/applications/talismane.html

Talismane: a robust dependency parser
*************************************

Talismane is a free, complete tool for dependency parsing, ready to use for French.

Talismane can be downloaded from: http://redac.univ-tlse2.fr/applications/talismane.html

Talismane was developed by Assaf URIELI as part of his PhD thesis within the NLP group (axe TAL) of CLLE-ERSS (UMR 5263), Université de Toulouse II le Mirail.
- Thesis: http://w3.erss.univ-tlse2.fr/textes/pagespersos/urieli/URIELI-thesis-2013.pdf
- CLLE-ERSS laboratory: http://w3.erss.univ-tlse2.fr/

It is free (open source) software written in Java, distributed under the Affero GPL v3 license: http://www.gnu.org/licenses/agpl-3.0.html

For French, Talismane performs:
- sentence segmentation;
- word segmentation (tokenization) + lemmatization;
- part-of-speech tagging (pos-tagging);
- dependency parsing.

Talismane was trained on the French Treebank (Abeillé et al, 2003) for sentence and word segmentation as well as for pos-tagging, and on the French Treebank converted to dependencies (Candito et al, 2010) for parsing. It uses LeFFF as its default lexicon (Sagot 2010). It is fully configurable (training, features, rules, tagset, lexicon...) and can be retrained for other languages.
In addition, Talismane makes it possible to:
- analyze text quickly (2 million words per hour in the basic configuration);
- parse XML or HTML easily, by adding filters indicating which parts to analyze;
- add rules to force or prevent the system from choosing a given segmentation, tag or dependency link, depending on context;
- choose between a higher-quality analysis (wide beam) and a faster one (narrow beam);
- keep track of the exact position of each analyzed token in the original file;
- report the system's confidence in each decision made.

A user manual is available on the website.

Best regards,
Assaf URIELI
CLLE-ERSS laboratory, UMR 5263
Université de Toulouse II le Mirail

From hamon at LIMSI.FR Tue Feb 11 21:28:45 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:28:45 +0100
Subject: Appel: 7th BUCC Workshop at LREC: Deadline extension until Feb.
23 and Journal Special Issue Message-ID: Date: Tue, 11 Feb 2014 21:29:13 +0100 From: "Reinhard Rapp" Message-ID: <5441EDC688574124BDEA8066E6719EDA at ASUSPC> X-url: http://comparable.limsi.fr/bucc2014/ We apologize for multiple postings Please distribute to interested colleagues ============================================================ DEADLINE EXTENSION AND JOURNAL SPECIAL ISSUE ============================================================ 7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA Building Resources for Machine Translation Research http://comparable.limsi.fr/bucc2014/ May 27, 2014 Co-located with LREC 2014 Harpa Conference Centre, Reykjavik (Iceland) EXTENDED DEADLINE FOR PAPERS: February 23, 2014 https://www.softconf.com/lrec2014/BUCC2014/ *** INVITED SPEAKER *** Chris Callison-Burch (University of Pennsylvania) ============================================================ MOTIVATION In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons. In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora. 
The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages. Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. However, beyond a few language pairs such as English- French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web. To exemplify such issues in a practical setting, this year's special focus will be on Building Resources for Machine Translation Research This special topic aims to address the need for: (1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and (2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora. 
TOPICS We solicit contributions including but not limited to the following topics: Topics related to the special theme: * Methods and tools for collecting and processing MT data, including crowdsourcing * Methods and tools for quality control * Tools for efficient annotation * Bilingual term and named entity collections * Multilingual treebanks, wordnets, propbanks, etc. * Comparable corpora with parallel units annotated * Comparable corpora for under-resourced languages and specific domains * Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc. * Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc. * Legal issues with collecting and redistributing data and generating derivatives Building comparable corpora: * Human translations * Automatic and semi-automatic methods * Methods to mine parallel and non-parallel corpora from the Web * Tools and criteria to evaluate the comparability of corpora * Parallel vs non-parallel corpora, monolingual corpora * Rare and minority languages, across language families * Multi-media/multi-modal comparable corpora Applications of comparable corpora: * Human translations * Language learning * Cross-language information retrieval & document categorization * Bilingual projections * Machine translation * Writing assistance Mining from comparable corpora: * Extraction of parallel segments or paraphrases from comparable corpora * Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc. 
IMPORTANT DATES February 23, 2014 Deadline for submission of papers (extended) March 10, 2014 Notification of acceptance March 27, 2014 Camera-ready papers due May 27, 2014 Workshop date SUBMISSION INFORMATION Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF-file via the START workshop manager at https://www.softconf.com/lrec2014/BUCC2014/ Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages. Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings. Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc. JOURNAL SPECIAL ISSUE Authors of selected papers will be encouraged to submit substantially extended versions of their manuscripts to an upcoming special issue on ‘Machine Translation Using Comparable Corpora’ of the Journal of Natural Language Engineering. 
ORGANISERS Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France) Ahmet Aker, University of Sheffield (UK) Serge Sharoff, University of Leeds (UK) Stephan Vogel, QCRI (Qatar) Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France) CONTACT Pierre Zweigenbaum: pz (at) limsi (dot) fr SCIENTIFIC COMMITTEE * Ahmet Aker, University of Sheffield (UK) * Srinivas Bangalore (AT&T Labs, US) * Caroline Barrière (CRIM, Montréal, Canada) * Chris Biemann (TU Darmstadt, Germany) * Hervé Déjean (Xerox Research Centre Europe, Grenoble, France) * Kurt Eberle (Lingenio, Heidelberg, Germany) * Andreas Eisele (European Commission, Luxembourg) * Éric Gaussier (Université Joseph Fourier, Grenoble, France) * Gregory Grefenstette (INRIA, Saclay, France) * Silvia Hansen-Schirra (University of Mainz, Germany) * Hitoshi Isahara (Toyohashi University of Technology) * Kyo Kageura (University of Tokyo, Japan) * Adam Kilgarriff (Lexical Computing Ltd, UK) * Natalie Kübler (Université Paris Diderot, France) * Philippe Langlais (Université de Montréal, Canada) * Michael Mohler (Language Computer Corp., US) * Emmanuel Morin (Université de Nantes, France) * Dragos Stefan Munteanu (Language Weaver, Inc., US) * Lene Offersgaard (University of Copenhagen, Denmark) * Ted Pedersen (University of Minnesota, Duluth, US) * Reinhard Rapp (Université Aix-Marseille, France) * Sujith Ravi (Google, Mountain View, US) * Serge Sharoff (University of Leeds, UK) * Michel Simard (National Research Council Canada) * Richard Sproat (OGI School of Science & Technology, US) * Tim Van de Cruys (IRIT-CNRS, Toulouse, France) * Stephan Vogel (QCRI, Qatar) * Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France) * Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France) ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : 
http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN
-------------------------------------------------------------------------

From hamon at LIMSI.FR Tue Feb 11 21:10:05 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:10:05 +0100
Subject: Call: RJCIA 2014 (15 days before the deadline)
Message-ID:
Date: Mon, 10 Feb 2014 17:19:02 +0100
From: Grégory Bonnet
Message-ID: <52F8FBF6.8060809 at unicaen.fr>
X-url: http://rjcia2014.greyc.fr

-- Please circulate this call as widely as possible, in particular within doctoral student associations and on the mailing lists of your doctoral schools and laboratories. Apologies for multiple postings. --

**********************************************************************
Call for papers
RJCIA 2014: 12th Rencontres des Jeunes Chercheurs en Intelligence Artificielle (meeting of young researchers in artificial intelligence)
30 June - 1 July 2014, Rouen, France
http://rjcia2014.greyc.fr
**********************************************************************

Overview

The twelfth Rencontres des Jeunes Chercheurs en Intelligence Artificielle (RJCIA 2014) will take place on 30 June and 1 July 2014 in Rouen. The meeting is hosted by RFIA'14 (Reconnaissance des Formes et l'Intelligence Artificielle), which is organised by the LITIS laboratory and its supervising institutions, INSA de Rouen, Université de Rouen and Université du Havre, on behalf of the Association Française d'Intelligence Artificielle (AFIA) and the Association Française de Reconnaissance et d'Interprétation des Formes (AFRIF).

RJCIA 2014 is intended for young researchers in artificial intelligence: doctoral students, or those who have held a doctorate for less than one year.
The event has two aims:

- to let young researchers who are preparing a thesis in artificial intelligence, or who have recently defended one, meet and present their work, and so build contacts with other young researchers and broaden their outlook through exchanges with specialists in other areas of artificial intelligence;
- to train young researchers in preparing a paper, revising it in response to the programme committee's comments, and presenting it to an audience of specialists, so that they receive feedback from researchers in their own or related fields.

For further information, contact Grégory Bonnet (gregory.bonnet[at]unicaen.fr), programme committee chair.

Call for papers

RJCIA 2014 aims to complement the overview of French-language research in artificial intelligence offered by RFIA'14 by presenting the work of young researchers in the field. Any contribution relating to artificial intelligence is therefore welcome. The indicative list of topics below is not exhaustive.
- Machine learning (supervised, unsupervised, reinforcement)
- Large-scale data mining, mining of large graphs, text mining
- Knowledge extraction and use by and for (large-scale) data mining
- Constraint satisfaction
- Knowledge representation, reasoning models, reasoning, planning
- Autonomous agents and multi-agent systems (individual and collective decision-making, interactions)
- Affective computing, socio-emotional virtual agents, human-agent interaction
- Autonomous robotics, intelligent control, decision support in robotic systems
- Knowledge management: modelling, organisation, optimisation and processing
- Natural language processing
- Linked data, open data, data publication
- Big data, Web of data, Semantic Web, ontologies
- Information extraction, semantic content indexing
- Data and knowledge visualisation
- Validation and evaluation of artificial intelligence systems
- Applications of artificial intelligence (medicine, transport, agronomy, law)

Submission requirements

Authors are invited to submit papers of at most 6 pages in two-column format, following the RFIA'14 style, in one of the following two categories:

Survey papers: RJCIA 2014 gives first-year doctoral students the opportunity to present their thesis topic in a survey paper. Such a paper presents the state of the art, identifies research problems and/or proposes initial lines of work. Authors are encouraged to describe the context of their work clearly and to highlight the problems encountered.

Research papers: RJCIA 2014 also invites more experienced young researchers to present original research, whether theoretical, experimental or application-oriented.
Papers must be written with care, in both form and scientific content. Young researchers who wish to do so are also invited to submit their paper to the AI session of RFIA'14. Authors are asked to indicate at submission time the category their paper falls under. The first author of the paper must be a doctoral student or have held a doctorate for less than one year (thesis in progress, or defended in 2013). No exclusivity is required: we want papers submitted to international conferences or workshops to be eligible for submission to RJCIA 2014 as well. The conference language is preferably French, otherwise English. Each submission will be reviewed by at least three members of the programme committee. Review criteria include originality of ideas, technical quality, significance of results and quality of presentation.

Important dates

Submission site opens: 6 January 2014
Submission deadline: 24 February 2014
Notification to authors: 25 April 2014
Final versions due: 16 May 2014
RJCIA 2014: Monday 30 June and Tuesday 1 July 2014

Paper submission

The submission format is available on the conference website: http://rjcia2014.greyc.fr
Submissions are made via EasyChair: https://www.easychair.org/conferences/?conf=rjcia2014

Programme committee

Nicolas Béchet, IRISA, Université de Bretagne-Sud
Lamia Belouaer, IRENav, Ecole Navale
Grégory Bonnet, GREYC, Université de Caen Basse-Normandie
Elise Bonzon, LIPADE, Université Paris Descartes
Gauvain Bourgne, LIP6, Université Pierre et Marie Curie
Sylvain Bouveret, LIG, INP Grenoble
Amélie Cordier, LIRIS, Université Lyon 1
Alice Hermann, LORIA, Université de Lorraine
Abir Béatrice Karami, LIRIS, Université Lyon 1
Jean-Marie Lagniez, CRIL, Université d'Artois
Robin Lamarche-Perrin, Max Planck Institute for Mathematics in the Sciences
Marie Lefevre, LIRIS, Université Lyon 1
Cyrille Martin, DAI-Labor (CC NGS), Technische Universität Berlin
Benoit Mathern, LIRIS, Université Lyon 1
Laëtitia Matignon, LIRIS, Université Lyon 1
Alexandre Niveau, GREYC, Université de Caen Basse-Normandie
Antoine Nongaillard, DISP, Université Lyon 2
Alexandre Pauchet, LITIS, INSA Rouen
Cédric Pralet, Onera
Mathieu Roche, TETIS, Cirad
Julien Rossit, LIPADE, Université Paris Descartes
Stéphanie Roussel, CRIL
Karima Sedki, Lim&Bio, Université Paris 13

Practical information

See the RFIA'14 website (http://rfia2014.univ-rouen.fr/) for practical information on registering and getting to LITIS.

For the RJCIA 2014 programme committee,
Grégory Bonnet

From hamon at LIMSI.FR Tue Feb 11 21:21:26 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:21:26 +0100
Subject: Internship: Master's 2nd year, building a digital prototype of a series of documents on archaeology
Message-ID:
Date: Tue, 11 Feb 2014 16:37:08 +0100
From: Thierry Poibeau
Message-Id: <46F083B2-0D77-4D14-8508-E665CCD78F4A at ens.fr>

Second-year Master's internship offer: building a digital prototype of a series of documents on archaeology

Description: The internship falls within the field of digital humanities, as part of EITAB, a PEPS PSL-CNRS project, which involves a collaboration
between the Lattice laboratory (UMR8094, linguistic analysis and natural language processing, http://www.lattice.cnrs.fr/) and the AOROC laboratory (UMR 8546, archaeology, http://www.archeo.ens.fr/). Both laboratories are part of the Ecole normale supérieure and work together within the TransferS laboratory of excellence (http://www.transfers.ens.fr/).

The internship aims to create an ebook prototype from a text document on archaeology. The goal is to provide relevant, user-friendly access to a specialised text, based on data extracted from the text (terms structured into indexes, etc.). Particular attention will therefore go to tools for structuring data (databases, links between terms, etc.) and for formatting it (xml, html, css). Close collaboration with the archaeology partners is essential. LATTICE has a term extractor, and standard tools for data structuring and ontology creation may be used.

The internship focuses mainly on the interaction between text and index. Depending on the skills of the successful candidate, the work may also cover the interaction between text and maps (to visualise the main archaeological sites, link objects of study to geographical areas, and answer questions such as: "at which sites in the Tours region have flanged tiles been found?"). Time permitting, it may also address adapting the final rendering to the device (computer, tablet, etc.).

The internship is above all practical in purpose: the aim is to build a working prototype that shows a concrete example of what can be achieved. Archaeology lends itself particularly well to multimedia enrichment of digital text.

Internship tasks

Create the digital prototype of a series of documents on archaeology.
- ensure data consistency (correct and enrich the existing indexes)
- structure the text document
- create indexes and dynamic queries.

Required skills

- knowledge of electronic publishing tools
- knowledge of document markup languages (xml, html, css, etc.)
- interest in building application solutions and taking user needs into account
- knowledge of databases
- good writing skills in French and English
- an interest in archaeology would be a plus

Further information on EITAB: http://www.archeo.ens.fr/spip.php?article586

Conditions

The internship will take place at the Lattice laboratory (in Montrouge, http://www.lattice.cnrs.fr/) for 6 months, starting in April 2014, in close collaboration with the AOROC laboratory (at the ENS, 45 rue d'Ulm, Paris). The internship is paid according to the rules in force, thanks to a site PEPS project co-funded by CNRS and PSL.

Location

LATTICE laboratory, Montrouge.

Supervisors

Frédérique Mélanie Becquet and Thierry Poibeau for LATTICE, and Katherine Gruel for AOROC.

How to apply

Send a CV and a cover letter to Thierry Poibeau and Frédérique Mélanie (first name, then surname, separated by a dot, at ens.fr) as soon as possible and, in any case, before 20 February 2014.

From hamon at LIMSI.FR Fri Feb 14 21:03:27 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:03:27 +0100
Subject: Call: Arabic natural dialect processing, Workshop ICCTIM2014
Message-ID:
Date: Wed, 12 Feb 2014 17:19:40 +0100
From: smaili
Message-ID: <52FB9F1C.4020401 at loria.fr>
X-url: http://sdiwc.net/conferences/2014/icctim2014/workshops/

We apologize for multiple postings; please distribute to interested colleagues.

DEADLINE *March 10, 2014*

* Workshop at **ICCTIM2014**, Dubai, April 10, 2014 *
http://sdiwc.net/conferences/2014/icctim2014/workshops/

Title: Arabic natural dialect processing

Summary

Modern Standard Arabic (MSA) is the language of more than 250 million people. It is used mainly in writing and in formal speech. Unfortunately, most Arab people do not use MSA in their daily conversations; as a result, different Arabic dialects are spoken across more than twenty countries. In fact, MSA is not acquired as a mother tongue; rather, it is learned as a second language at school and through exposure to formal broadcast programs (such as the daily news), religious practice, and newspapers. Spoken Arabic is often referred to as colloquial Arabic, dialects, or vernaculars.
It is a mixed form with many variations, often strongly influenced by local languages (predating the introduction of Arabic) and by the languages of the countries that occupied the Arabic region. Differences between the variants of spoken Arabic can be large enough to make them incomprehensible to Arabic speakers from other regions. Hence, given these large differences, we can consider the spoken varieties as disparate languages or, more precisely, as different dialects that depend on the geographical place in which they are spoken: Morocco, Algeria, Egypt, ... Because they are generally not written, corpora are not available. Such corpora are essential for text mining and for developing applications based on statistical models, such as speech recognition or machine translation. The only existing corpora, as yet unexplored, are those found on social networks, which cannot be used easily because of the multiplicity of formats, the number of foreign words, the mixing of dialects with French or English, and so on.

This workshop is an opportunity for the NLP community to focus on this challenging topic, to develop new Arabic dialect resources and Arabic dialect corpus processing tools, and to highlight the difficulties of processing Arabic dialects, especially those that use many foreign words adapted lexically and grammatically to Arabic.

The workshop topics include, but are not limited to:

1. Collecting Arabic dialect corpora
2. Diacritization of Arabic dialects
3. Mining Arabic social networks
4. Language modelling
5. Arabic dialect morphology
6. Development of mobile Arabic dialects applications: speech recognition, machine translation, ...
7. Tagging Arabic dialects corpora
8. Maghreb Arabic dialects versus Orient Arabic dialects: linguistic study

Format and duration: a full-day workshop will be held on April 10, 2014.
The language of the workshop is English, and submissions should follow the ICCTIM2014 paper submission instructions. All papers will be peer reviewed. Papers must be submitted electronically in PDF format as soon as possible and before March 10, 2014. When you submit through the OpenConf management system, please select "others" in the proposed topics and enter "Arabic dialect" in the keywords. In all cases, when you submit, please also send an email to the workshop chair: smaili at loria.fr

From hamon at LIMSI.FR Fri Feb 14 20:45:15 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 21:45:15 +0100
Subject: Journal: Texto ! Textes & Cultures
Message-ID:
Date: Wed, 12 Feb 2014 14:40:39 +0100
From: Carine Duteil-Mougel
Message-Id:
X-url: http://www.revue-texto.net

Hello,

Below is the table of contents of the new issue of the journal Texto ! Textes et Cultures.

http://www.revue-texto.net

Enjoy your reading!

///// TEXTO ! Volume XIX - n°1 (2014). Edited by Régis Missire

Dits et inédits

Dénier, théoriser, éliminer : le « travail » de Duch Sur L’élimination de Rithy Panh avec Christophe Bataille
Elise LAMY-RESTED

À la découverte du sens historique. Alfred Baeumler et la Nouvelle vision du monde
LÉONORE BAZINEK

L'expérience de la parole : le thème du sujet parlant
ANTONINO BONDI

Le discours de Dakar - Représentations et stéréotypes dans un discours en Afrique sur l’Afrique
MICHELLE LECOLLE

Parutions et trésors

Pour une conception intégrale de la création lexicale
CHRISTOPHE GÉRARD

Constantes et variations du rituel politique: l'exemple des vœux présidentiels (Démonstration Lexico 3/TextObserver)
JEAN-MARC LEBLANC

Sémantique du site web de la restauration: étude thématique et expressive d'un genre polysémiotique
ERIC TRUDEL

Outils et méthodes pour appliquer le savoir-faire des sémanticiens aux traitements des masses documentaires
MATHIEU VALETTE, EGLE EENSOO, EVELYNE BOURION & MONIQUE SLODZIAN

Note de lecture. Gaston Gross.
Manuel d’analyse linguistique : Approche sémantico-syntaxique du lexique.
PASCAL VAILLANT

Dynamiques du changement sémantique. Détection, analyse et modélisation du changement sémantique en corpus en diachronie courte - Présentation de thèse
ARMELLE BOUSSIDAN

Repères pour l'étude

La sémantique interprétative - Du mot au corpus et du sème aux formes sémantiques
ALI BELGHANEM

À la recherche du temps perdu, ou comment rendre l’espace immatériel
THIERRY MÉZAILLE

Saussurismes et sémiotique

Triple articulation de la langue et articulation herméneutique du langage - Quand De l’essence double du langage réinterprète les textes saussuriens
SIMON BOUQUET

From hamon at LIMSI.FR Fri Feb 14 21:15:38 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:15:38 +0100
Subject: Call: COLING 2014, February 2014 Newsletter
Message-ID:
Date: Thu, 13 Feb 2014 17:07:23 +0000
From: 박성배
Message-ID: <01648087DD5D4349A12B758B6F475B842CD7C9EF at SejongSRV.sejong.knu.ac.kr>
X-url: http://www.coling-2014.org

We apologize if you received multiple copies.

COLING 2014
Dublin, Ireland, 23-29 August, 2014

COLING 2014 (http://www.coling-2014.org), the 25th International Conference on Computational Linguistics, will be organised by CNGL (Centre for Global Intelligent Content) at the Helix Convention Centre at Dublin City University (DCU) from 23 to 29 August 2014.
The COLING conference is organised under the auspices of the International Committee on Computational Linguistics (ICCL).

Accepted Tutorials and Workshops

We are pleased to announce that the workshops and tutorials for COLING 2014 have been finalised. In total there will be 6 tutorials and 18 workshops covering a wide range of topics. You will be able to register for tutorials and workshops when the main conference registration opens.

Accepted Tutorials:
- Dependency Parsing: Past, Present, and Future - Wenliang Chen, Zhenghua Li and Min Zhang
- Using Neural Networks for Modeling and Representing Natural Languages - Tomas Mikolov
- Multilingual Word Sense Disambiguation and Entity Linking - Roberto Navigli and Andrea Moro
- Selection Bias, Label Bias, and Bias in Ground Truth - Anders Søgaard, Barbara Plank and Dirk Hovy
- Automated Grammatical Error Correction for Language Learners - Joel Tetreault and Claudia Leacock
- Biomedical/clinical NLP - Ozlem Uzuner and Meliha Yetisgen

Accepted Workshops:
- Workshop on Lexical and Grammatical Resources for Language Processing
- First Joint Workshop on Statistical Parsing of Morphologically Rich Languages and Syntactic Analysis of Non-Canonical Languages (SPMRL-SANCL)
- Celtic Language Technology Workshop (CLTW)
- 5th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP)
- Synchronic and Diachronic Approaches to Analyzing Technical Language (SADAATL)
- The 3rd Workshop on Vision and Language (VL'14)
- Second Workshop on Natural Language Processing for Social Media (SocialNLP 2014)
- First Joint Workshop on Open Infrastructures for HLT Resource Processing and Development and Text Analysis Frameworks
- Cognitive Aspects of the Lexicon (CogALex-IV)
- Semantic Web and Information Extraction (SWAIE)
- VarDial: Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects (VarDial)
- SemEval-2014: Semantic Evaluation Exercises - International Workshop on Semantic Evaluation (SemEval)
- The
AHA!-Workshop on Information Discovery in Text (AHA!)
- CompuTerm 2014. 4th International Workshop on Computational Terminology (CompuTerm)
- The First Workshop on Computational Approaches to Compound Analysis (ComAComA)
- Automatic Text Simplification - Methods and Applications in the Multilingual Society (ATS-MA)
- The 8th Linguistic Annotation Workshop (The LAW VIII)
- The third Joint Conference on Lexical and Computational Semantics (*SEM-2014)

Call for Papers:

The first call for papers for the main conference is open, and the paper submission system is available at https://www.softconf.com/coling2014/main/. Papers should be prepared using the templates available at http://www.coling-2014.org/doc/coling2014.zip. We invite authors to submit papers in all relevant areas and encourage authors to include analysis of the influence of theories (intuitions, methodologies, insights) on technologies (computational algorithms, methods, tools, data), and/or contributions of technologies to theory development. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome. For more information and instructions for authors, see the preliminary call for papers at http://www.coling-2014.org/call-for-papers.php.

Sponsorship and Promotion Opportunities

With the tremendous impact and growth of COLING over many years, we have set out a comprehensive offering of corporate sponsorship and support opportunities for industry participants. The COLING 2014 organising committee is committed to working in close partnership with corporate supporters to maximise the benefits of sponsorship and provide opportunities for sponsors to engage appropriately with COLING delegates. With this in mind, if you wish to explore additional options for support or means of engagement with the COLING 2014 community, please feel free to contact sponsorship at coling-2014.org.

From hamon at LIMSI.FR Fri Feb 14 20:49:03 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 21:49:03 +0100
Subject: Internship: Configuring and evaluating an automatic classification system for safety reports, Toulouse
Message-ID:
Date: Wed, 12 Feb 2014 16:03:13 +0100
From: Ludovic Tanguy
Message-ID: <52FB8D31.4090104 at univ-tlse2.fr>

Master 2 / engineering school internship: configuring and evaluating an automatic classification system for safety reports.

The company: CFH (Conseil en facteurs humains)/SafetyDATA is a small company specialising in natural language processing for the safety domain. It works with CLLE-ERSS, a linguistics laboratory, and in particular with its natural language processing team (TAL, Traitement automatique des langues). The internship will be co-supervised by CFH and the CLLE laboratory.

Context: CFH has designed a natural language processing system that analyses incident reports in order to propose one or more categories for indexing them in a database. The system is currently deployed and analyses several hundred documents (in French and English) each month, using automatically learned rules based on detecting certain terms in the reports.
More details at http://www.safety-data-analysis.com/

The internship aims to evolve this classification system, in particular by considering the use of a statistical supervised learning system. It has two objectives:
1/ Identify and quantify the gain obtained by using a statistical model (SVM, logistic regression, Bayesian network, etc.) compared with the current system;
2/ Measure the impact on system performance of the various linguistic processing steps applied to the documents before analysis (error correction, normalisation of surface forms, identification of complex expressions, use of semantic classes, etc.).

Profile sought: Student in the second year of a master's degree or the final year of engineering school, in computer science or natural language processing.

Required skills: Automatic classification systems based on machine learning. The student must understand how these systems work and be able to use and evaluate them independently (scripts, handling large volumes of data, etc.). The student is also expected to be able to examine the data closely and to fit into an interdisciplinary environment.
Details:
- paid internship under a formal agreement, based in Toulouse
- duration: 4 to 6 months, starting in March

Contact:
Céline Raynal raynal at conseil-fh.fr
Ludovic Tanguy tanguy at univ-tlse2.fr

From hamon at LIMSI.FR Fri Feb 14 20:55:23 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 21:55:23 +0100
Subject: Appel: SEPLN 2014
Message-ID:

Date: Wed, 12 Feb 2014 16:28:44 +0100
From: Horacio Saggion
Message-ID:
X-url: http://www.taln.upf.edu/pages/sepln2014/es/index.html
X-url: http://www.taln.upf.edu/pages/sepln2014/en/index.html

CALL FOR PAPERS: 30th CONFERENCE OF THE SPANISH SOCIETY FOR NATURAL LANGUAGE PROCESSING (SEPLN 2014)
September 17-19, 2014
Universitat de Girona
http://www.taln.upf.edu/pages/sepln2014/es/index.html
--------------------------------------------------------------------

INTRODUCTION
-----------------------
The 30th edition of the Annual Conference of the Spanish Society for Natural Language Processing (SEPLN) will take place at the Universitat de Girona, Girona, Spain, on 17-19 September 2014. We also expect to organize associated workshops.

The huge amount of information available in digital format and in different languages calls for systems that enable us to access this vast library in an increasingly structured way.
In this same area, there is a renewed interest in improving information accessibility and information exploitation in multilingual environments. Many of the formal foundations for dealing appropriately with these needs have been, and are still being, established in the area of Natural Language Processing and its many branches: information extraction and retrieval, question answering systems, machine translation, automatic analysis of textual content, text summarization, text generation, and speech recognition and synthesis.

The aim of the conference is to provide a forum for discussion and communication where the latest research work and developments in the field of Natural Language Processing (NLP) can be presented by scientific and business communities. The conference also aims at exposing new possibilities of real applications and R&D projects in this field. Moreover, as in previous editions, there is the intention of identifying future guidelines or paths for basic research and foreseen software applications, in order to compare them against market needs. Finally, the conference intends to be an appropriate forum for helping new professionals to become active members in this field.

TOPICS
-----------
Researchers and companies are encouraged to send communications, project abstracts or demonstrations related to any language technology topic including but not limited to the following:
* Linguistic, mathematical and psycholinguistic models of language.
* Machine learning in NLP.
* Computational lexicography and terminology.
* Corpus linguistics.
* Development of linguistic resources and tools.
* Grammars and formalisms for morphological and syntactic analysis.
* Semantics, pragmatics and discourse.
* Lexical ambiguity resolution.
* Monolingual and multilingual text generation.
* Machine translation.
* Speech synthesis and recognition.
* Dialogue systems.
* Audio indexing.
* Monolingual and multilingual information extraction and retrieval.
* Question answering systems.
* Evaluation of NLP systems.
* Automatic textual content analysis.
* Sentiment analysis and opinion mining.
* Plagiarism detection.
* Negation and speculation processing.
* Text mining in blogosphere and social networks.
* Text summarization.
* Image retrieval.
* NLP in biomedical domain.
* NLP-based generation of teaching resources.
* NLP for languages with limited resources.
* NLP industrial applications.

CONTACT
--------------
All information related to the conference can be found on the web: http://www.taln.upf.edu/pages/sepln2014/en/index.html

STRUCTURE OF THE CONFERENCE
--------------------------------------------------
The conference will last three days and will consist of sessions devoted to presenting papers, posters, tutorials, ongoing research projects and prototype or product demonstrations connected with the topics addressed in the conference. In addition, we expect to organize associated workshops.

SUBMISSION OF CONTRIBUTIONS
-------------------------------------------------
Authors are encouraged to send theoretical or application-oriented proposals related to NLP. The proposals must meet the following requirements:
* The title of the communication.
* An abstract in English and Spanish (maximum 150 words) and a list of keywords.
* The paper can be written in Spanish or English. Its overall maximum length will be 8 pages, including references.
* The documents must not include headers or footers.
* Papers should NOT include the names of the authors.

The proposed papers will be reviewed by at least three reviewers, and can be accepted for presentation either as posters or as communications, depending on the needs of the program. However, no distinction will be made between communications and posters in the printed version of the SEPLN journal.
*** IMPORTANT NOTE ON CAMERA READY ****
The final version of the paper (camera ready) should be submitted together with a cover letter explaining how the suggestions of the reviewers were implemented in the final version.
**********************************************

Please send your proposals using the following link: http://www.sepln.org/myreview-sepln53/

The format of the SEPLN journal must be followed: http://www.sepln.org/?page_id=1285&lang=en

In addition, all proposals will have to comply with the following requirements, depending on whether they are papers, demos or projects.

PROJECTS AND DEMOS
----------------------------------
As in previous editions, the organizers encourage participants to give oral presentations of R&D projects and demos of systems or tools related to the NLP field.

For oral presentations on R&D projects to be accepted, the following information must be included:
* Project title.
* Name, affiliation, address, e-mail and phone number of the project director.
* Funding institutions.
* Groups participating in the project.
* Abstract (4 pages maximum, including references).

For demonstrations to be accepted, the following information is mandatory:
* Demo title.
* Name, affiliation, e-mail and phone number of the authors.
* Abstract (4 pages maximum, including references).
* Time estimation for the whole presentation.
**** SEE NOTE ON CAMERA READY ABOVE ****

IMPORTANT DATES
----------------------------
Deadline for full papers, demos, and projects: 10th April 2014
Notifications: 26th May 2014
Camera Ready: 7th June 2014

From hamon at LIMSI.FR Fri Feb 14 21:01:10 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:01:10 +0100
Subject: Job: Professeur en informatique, UFR Mathematiques et Informatique, Universite de Lorraine, Modification du profil
Message-ID:

Date: Wed, 12 Feb 2014 17:12:37 +0100
From: smaili
Message-ID: <52FB9D75.9050402 at loria.fr>
X-url: http://www.loria.fr/la-recherche/departements/traitement-automatique-des-langues-et-des-connaissances

The UFR Mathématiques et Informatique and LORIA, at the Université de Lorraine (UL), are offering a full professorship in computer science. The teaching and research profiles are given below.

In addition, on 19 March we are holding a presentation day at the UFR MI, where candidates who wish to do so may present their applications to the teaching teams of the UFR Mathématiques et Informatique.

Teaching profile:

The professor must join the teaching team of the bachelor's and master's programmes in cognitive science. Cognitive science is concerned with understanding artificial, animal or human thought. The skills sought therefore range from computer science to language sciences, by way of biology. The UFR will recruit a professor who has as many skills related to cognitive science as possible and who could teach in the following courses: cognitive aspects of computational linguistics, neuroscience, knowledge representation, artificial intelligence, collective intelligence, etc. The teacher must have proven competence and experience in this field in order to be operational from the start of the 2014-2015 academic year, when the needs of our programmes are substantial.
The candidate will join the teaching team and contribute to the creation of innovative projects. On the administrative side, the candidate must be able to take on heavy responsibilities very quickly: setting up new degrees, drafting accreditation applications, heading degree programmes or the UFR. The professor recruited will be expected to be a driving force in the future projects we are currently developing.

Department/UFR: Mathématiques et Informatique
UFR contact: K. Smaïli
Teaching keywords: cognitive science, natural language processing, neuroscience, artificial intelligence

Research profile:

The professor recruited will carry out research at the Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and will be expected to join Department 4 (Natural language and knowledge processing) http://www.loria.fr/la-recherche/departements/traitement-automatique-des-langues-et-des-connaissances. All teams in the department are concerned.

This department studies three closely related areas: natural languages, knowledge and documents. The language-related topics cover a broad spectrum and include speech (analysis, recognition, synthesis and perception), syntax (using symbolic or statistical methods), semantics and discourse representation. In the knowledge area, the work covers, among other things, knowledge representation and the formalisation of reasoning, knowledge discovery using symbolic or numerical data mining methods, learning methods and classification. Regarding documents, the main topics are pattern recognition, symbol recognition and the modelling of handwriting.

The teams concerned are Cello, Orpailleur, Parole (Multispeech and SMarT), Qgar, Read, Sémagramme and Synalp.
Keywords: natural language processing, computational linguistics, formal grammars, machine translation, speech analysis/synthesis, speech recognition, articulatory modelling of speech, audio signal processing, sound source separation, probabilistic models, complex data mining, symbolic and numerical learning, knowledge representation and reasoning, classification, pattern recognition, handwriting modelling

From hamon at LIMSI.FR Fri Feb 14 21:19:39 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:19:39 +0100
Subject: Appel: Traitement automatique des langues africaines, TALAf 2014
Message-ID:

Date: Fri, 14 Feb 2014 15:34:37 +0100
From: Mathieu Mangeot
Message-Id: <9E5A0F68-F283-4A20-BF7C-6F42C360D4B0 at imag.fr>
X-url: http://jibiki.univ-savoie.fr/~mangeot/TALAf/2014/

Call for papers
TALAf 2014: Traitement automatique des langues africaines (natural language processing for African languages, written and spoken)
TALN 2014 workshop - Marseille, 1 July 2014

HTML version: http://jibiki.univ-savoie.fr/~mangeot/TALAf/2014/
PDF version: http://jibiki.univ-savoie.fr/~mangeot/TALAf/2014/atelierTALAf2014v5.pdf

PRESENTATION

Following the first TALAf workshop, held on 8 June 2012 in Grenoble during the JEP-TALN-RECITAL 2012 conference (see the proceedings: http://aclweb.org/anthology//W/W12/#1300), we are proposing a new edition of this workshop at the TALN 2014 conference, on 1 July in Marseille. We welcome work on all under-resourced languages of Africa. Work on the dialectal Arabic of North Africa (Maghrebi) is also welcome.

Research on natural language processing for African languages is currently on the verge of major developments. Efforts to recognise national languages and to standardise their alphabets are beginning to bear fruit. In Niger, for example, the alphabets of Fulfulde, Hausa, Kanuri, Songhai-Zarma and Tamajaq were defined by government decrees in 1999. In addition, a number of colleagues trained in countries of the North are returning to their home countries with the intention of continuing their work on local languages. There are also diasporas with the technological means to contribute directly online on a volunteer basis.

Nevertheless, the national languages of most African countries are under-resourced (π-languages): the available electronic resources are scarce, poorly distributed or nonexistent. Only editing and printing functions are available, which makes these languages difficult to exploit. At a time when there are plans to introduce them into the education system, to create standardised and stable writing norms and, above all, to develop their written and spoken use in administration and daily life, developing these languages is a vital necessity.

Developing natural language processing for African languages requires building the resources on which more elaborate processing can then be based. It appears essential to begin by building annotated written and spoken corpora that are as large as possible.
From such corpora, it is possible to extract examples to help build dictionaries, or to develop language models for speech recognition. However, building such corpora remains a delicate undertaking for under-resourced languages, because transcriptions suffer from the lack of standardisation of the language and corpus enrichment remains very costly.

The development of speech-based applications can be considered a priority in regions with an oral tradition. Moreover, the very widespread use of mobile phones makes a rapid deployment of such applications conceivable.

Dictionaries are also needed to build basic tools such as spelling checkers (which can in turn be used to correct written corpora) or to assist in the transcription of spoken corpora. Bilingual dictionaries pairing the official language with a national language sometimes exist. For example, in Mali, Father Charles Bailleul is the author of a Bambara-French dictionary; in Niger, the SOUTÉBA educational project produced five bilingual dictionaries intended for primary school children. But these exist only on paper or as word processor files (.doc format). Making these dictionaries usable by natural language processing tools first requires digitising them (converting them into a format that such tools can use) and supplementing them with missing information: pronunciation, inflection rules, examples and translations drawn from corpora, etc.

Workarounds can sometimes be devised to compensate for the lack of resources.
For example, if no transcribed spoken corpora exist, a spoken corpus of read texts can be built.

Finally, the socio-economic constraints on the speaker population must be taken into account: economic resources are limited, qualified human resources are scarce, research is sporadic and isolated, and results are confidential and fragmentary. It is therefore necessary to define methodologies that are economical in software purchase costs and in qualified working time, and that aim to produce results that are lasting, shared and easy to enrich.

Building linguistic resources in general, and for African languages even more so, should therefore follow a number of principles: use of open-source tools, definition and use of standards (ISO, Unicode), knowledge transfer between colleagues from countries of the North and the South, availability of resources under an open licence (Creative Commons), etc.

The aims of this workshop are to survey current work on building basic linguistic resources (dictionaries, spoken and written corpora), to develop simple and economical methodologies for building resources, to exchange ideas on techniques for doing without certain nonexistent resources, and to set out a number of principles for future work in the field.

The workshop will last half a day or a full day, depending on the submissions and on the final decision of the TALN programme committee.

The TALAf workshops are supported by the LTT association (Lexicologie Terminologie Traduction).
TYPES OF PAPER

TOPICS

The workshop is open to presentations of research work on the following topics:

Resources:
- building written corpora (monolingual, aligned bilingual or comparable)
- building spoken corpora (including transcription)
- developing lexicons and dictionaries (monolingual, bilingual)
- evaluating the quality of resources

Tools:
- morphological analysers, spelling checkers
- syntactic parsers, grammar checkers
- MT systems (statistical or rule-based)
- speech recognition
- speech synthesis

SELECTION CRITERIA

Submissions will be reviewed by at least two specialists in the field. For research work, the following will be considered in particular:
- relevance to the workshop topics,
- importance and originality of the contribution,
- correctness of the scientific and technical content,
- organisation and clarity of the presentation.

SUBMISSION PROCEDURE

Papers are to be written in French by French speakers, and in English by those who are not proficient in French.
The exact submission formats are available for Word and LaTeX on the taln2014 site: http://www.taln2014.org/site/soumission/

Paper proposals must be sent in PDF form to the following address: https://www.easychair.org/conferences/?conf=talaf20140

PROGRAMME COMMITTEE

Laurent Besacier (LIG, Grenoble, France)
Philippe Bretier (Voxygen, Pleumeur-Bodou, France)
Khalid Choukri (ELDA, Paris, France)
Mame Thierno Cissé (ARCIV, Université Cheikh Anta Diop, Dakar, Sénégal)
Denys Duchier (Université d'Orléans, Orléans, France)
Chantal Enguehard (LINA, Nantes, France)
Gil Francopoulo (Tagmatica, Paris, France)
Mathieu Mangeot (LIG, Grenoble, France)
Chérif Mbodj (Centre de Linguistique Appliquée de Dakar, Sénégal)
Kamal Naït-Zerrad (INALCO, Paris, France)
Pascal Nocera (Université d'Avignon, France)
Francois Pellegrino (DDL, Lyon, France)
Fatiha Sadat (UQAM, Montréal, Canada)
Mamadou Lamine Sanogo (INSS, Ouagadougou, Burkina-Faso)
Emmanuel Schang (Université d'Orléans, Orléans, France)
Gilles Sérasset (LIG, Grenoble, France)
Valentin Vydrin (LLACAN-INALCO, Paris, France)

SCHEDULE

- Submission deadline: 26 April 2014
- Notification to authors: 24 May 2014
- Deadline for final versions: 15 June 2014
- Workshop: 1 July 2014

From hamon at LIMSI.FR Fri Feb 14 21:21:54 2014
From: hamon at LIMSI.FR
(Thierry Hamon)
Date: Fri, 14 Feb 2014 22:21:54 +0100
Subject: Appel: Atelier IC&Sante, associe a la conference IC 2014
Message-ID:

Date: Fri, 14 Feb 2014 16:49:13 +0100
From: Sandra Bringay
Message-Id: <428D889D-DA62-4520-BD59-81DEE14FB069 at lirmm.fr>
X-url: https://www.lirmm.fr/ic-sante/

Workshop on Knowledge Engineering and Health (Atelier Ingénierie des connaissances et Santé)
https://www.lirmm.fr/ic-sante/

In association with the 25th Journées francophones d'Ingénierie des Connaissances, Clermont Ferrand, 12 to 16 May 2014 (http://www.irit.fr/IC2014)

In recent years, the volume of medical data has grown steadily as healthcare professionals have increasingly moved to computerised systems. These data too often remain under-exploited. Knowledge engineering methods offer relevant solutions for handling the full potential richness of these databases, even though questions of ethics, deployment, acquisition and security remain open.

The workshop will be an opportunity to discuss the open challenges in modelling, structuring, extracting, searching and mining health data of all types. It will provide a venue for presenting innovative research activities, methods, technologies and demonstrators developed in academia, in laboratories affiliated with healthcare institutions, and by industry players involved in knowledge engineering.

The aim of this workshop is to offer a forum for exchange between researchers, industry and users interested in the latest scientific advances in knowledge engineering for health. Topics of interest include, for example:

* Medical terminologies and ontologies: interoperability of health information systems, terminological indexing of the medical literature, of patient medical records and of clinical narratives, document annotation, representation formalisms, ontology alignment.
* Use of Web 2.0 in health: collaborative networks of healthcare practitioners, patients and families, bibliographic networks in the medical domain, social annotation of medical resources.
* Processing of multimodal information: text, image and sound processing, classification and integration of multimodal, multi-source medical content.
* Knowledge extraction from medical content: anonymisation, extraction of named entities, of pathological variables and/or indicators, search for correlations, medical data mining.
* Health information systems: representation of, processing of and access to health information and knowledge, decision-support health information systems, management of patient medical records, personalisation and health.

We invite researchers and experts in the field to present their work. We also encourage young researchers and industry participants interested in these questions to take part in this event and to make themselves known. Each submission will be evaluated by several reviewers from the programme committee. The workshop is open to presentations of work at various stages of progress.

Workshop website:
--------------------------
https://www.lirmm.fr/ic-sante/

Submissions:
--------------------------
Texts (submission and final version) must be written in French (in English for non-French speakers), anonymously, with a maximum of 6 pages including all references and figures, in the format of the Ingénierie des connaissances conference (http://www.irit.fr/IC2014/node/16).
Papers are to be submitted via https://www.easychair.org/conferences/?conf=icsant2014

Important dates:
--------------------------
* Submission deadline: 10 March 2014
* Notification to authors: 31 March 2014
* Final versions of contributions: 14 April 2014

Organisers:
---------------------
* Sandra Bringay, LIRMM-UM3, http://www.lirmm.fr/~bringay/
* Nathalie Souf, IRIT-UPS, http://www.irit.fr/~Nathalie.Souf
* Lynda Tamine-Lechani, IRIT-UPS, http://www.irit.fr/~Lynda.Tamine-Lechani/

Programme committee:
-------------------------------
Jérome Azé, LIRMM, Montpellier
Audrey Baneyx, Sciences Politiques, Paris
Catherine Berrut, LIG, Grenoble
Sandra Bringay, LIRMM, Montpellier
Jean Charlet, AP-HP & INSERM U1142, Paris
Adrien Coulet, LORIA, Nancy
Olivier Cure, LIGM, Paris
Cedrick Fairon, Université de Louvain
Natalia Grabar, STL, Lille
Brigitte Grau, LIMSI, Paris
Marie-Christine Jaulent, INSERM U1142, Paris
Clément Jonquet, LIRMM, Montpellier
Vanda Luengo, LIG, Grenoble
Fleur Mougin, ERIAS - ISPED, Université de Bordeaux
Pascal Poncelet, LIRMM, Montpellier
Mathieu Roche, CIRAD, Montpellier
Lina Soualmia, LITIS, Rouen
Nathalie Souf, IRIT, Toulouse
Lynda Tamine Lechani, IRIT, Toulouse
Maguelonne Teisseire, IRSTEA, Montpellier
Mouna Torjmen Khemakhem, Université de Sfax, Tunisie
Pierre Zweigenbaum, LIMSI, Paris

From hamon at LIMSI.FR Fri Feb 14
21:20:31 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:20:31 +0100
Subject: Job: CDI linguiste-informaticien H/F, Eptica-Lingway
Message-ID:

Date: Fri, 14 Feb 2014 16:36:55 +0100
From: Hugues de Mazancourt
Message-Id: <8111AB85-F54E-4A54-9DDB-7CF69EA3FBE0 at eptica.com>

Permanent position (CDI)
Location: Boulogne-Billancourt
Salary: depending on experience

Eptica Lingway, a subsidiary of the Eptica group, develops and markets the LEA range for e-recruitment. The range comprises LEA CV for automatic CV analysis, LEA Search for semantic search, in particular over job offers and CVs, and LEA Matching for matching job offers with CVs. These products rely on analysis grammars, dictionaries and a multilingual semantic network adapted to the world of recruitment. The range also implements dedicated document retrieval strategies, in particular for matching job offers with CVs. The LEA range is available in several European and Asian languages.

As part of the development of the LEA offering, Eptica-Lingway is looking for a computational linguist on a permanent contract.

Within the technical team, the successful candidate will have the following duties:
- take part in the design, development and maintenance of the linguistic features of the LEA product range,
- provide expertise on our customers' technical issues,
- carry out corpus analyses,
- provide second-level support for customers or partners,
- take part in project meetings when needed,
- extend and run the product quality tests.

Required skills:
- Natural language processing (corpus studies, search engines, local grammars, machine learning techniques),
- Knowledge of programming/scripting (Java, Groovy, Perl, ...)
and databases,
- Fluency in English,
- Fluency in other European languages is a plus,
- Good analytical skills,
- Ability to work well in a team.

Send a CV and cover letter to hugues.de-mazancourt at eptica.com

From hamon at LIMSI.FR Fri Feb 14 21:04:25 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:04:25 +0100
Subject: Appel: Call for Demos, NLDB'2014
Message-ID:

Date: Wed, 12 Feb 2014 18:16:32 +0100
From: Mathieu Roche
Message-ID: <74038619aa63fbdf9b4901f47d8e716a at lirmm.fr>
X-url: http://www.nldb.org/

*******************************************
Call for Demos - NLDB'2014
18-20 June 2014 - Montpellier, France
http://www.nldb.org/
*******************************************

The 19th International Conference on Application of Natural Language to Information Systems (NLDB'2014) invites submissions of demonstrations of state-of-the-art research or industrial prototypes related to all aspects of Natural Language in the Database and Information Systems field.

Topics of interest include but are not limited to:
- Applications of NLP in Information Systems
- Social Media and Web Data
- Big Data and Natural Language
- Semantic Web and Open Linked Data
- Question Answering (QA)
- Natural language and Ubiquitous Computing
- Natural Language in Conceptual Modeling
- NLP Applications (Opinion Mining, Information Extraction, ...)
Demo submissions will be handled online via the easychair conference management system: https://www.easychair.org/conferences/?conf=nldb2014demonstratio

Demonstration paper submissions should have 4 pages (LNCS format). Developers should outline the design of their system and provide details to allow the evaluation of its validity, quality, originality, and relevance to NLP in Information Systems.

The accepted papers for demos will be included in the conference proceedings, to be published by Springer Verlag in the "Lecture Notes in Computer Science" (LNCS) Series. The demos will be presented in a special demonstration session. At least one of the demo submitters must register for the conference and perform the demo on site.

Important Dates:
- Demo submission deadline (firm): March 13, 2014
- Notification of acceptance: March 28, 2014
- Camera-ready paper due: April 7, 2014

From hamon at LIMSI.FR Fri Feb 14 21:06:20 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:06:20 +0100
Subject: Appel: Atelier TALN SemDis 2014
Message-ID:

Date: Thu, 13 Feb 2014 09:42:24 +0100
From: Cécile Fabre
Message-ID: <52FC8570.2090902 at univ-tlse2.fr>
X-url: http://www.irit.fr/semdis2014/

CALL FOR PARTICIPATION
SemDis2014 Workshop (Distributional Semantics)
Workshop held as part of the TALN 2014 conference
Marseille, 1 July 2014
************************************************************
Chairs: Cécile Fabre (CLLE-ERSS) and Tim Van de Cruys (IRIT-MELODI)
Website: www.irit.fr/semdis2014/
Contact: semdis2014 at gmail.com
************************************************************

SemDis 2014 is the 2nd edition of the workshop devoted to NLP approaches based on distributional semantic analysis. This year the workshop is organized around two tasks on the processing of French:

1. a competitive lexical substitution task, a French adaptation of the SemEval 2007 task (McCarthy & Navigli 2010).
Given a target word in a complete sentence, the goal is to propose one or more substitutes that do not alter the overall meaning of the utterance. The choice of substitute is free and will be compared against the answers provided by human annotators. The evaluation set comprises 300 sentences drawn from the frWaC corpus. This task will produce a ranking of the competing systems. Task participants must submit their results and a paper describing the system used.

2. an exploratory task on a specialized corpus, the TALN corpus, a selection of French-language papers from the TALN and RECITAL conferences over the period 2007 to 2013. We invite participants to apply to this corpus one or more distributional analysis techniques of their choice, according to their own objectives. To facilitate discussion based on shared examples, a selection of words is proposed (calculer, complexe, précis, fréquence, graphe, méthode, sémantique, trait). This is not a competitive task; participants must submit a paper describing the work carried out on this corpus.

The full description of the tasks and data is available on the workshop website: www.irit.fr/semdis2014/

Important dates:
*****************
Participation in task 1:
10 March 2014: release of the test data
17 March 2014: deadline for participants to submit results
7 April 2014: announcement of results

Paper submission for both tasks:
22 April 2014: paper submission
13 May: notification
30 May 2014: final versions due

Types of papers:
***********************
Papers should be written in French by French speakers, and in English by those who are not proficient in French. They must follow the TALN 2014 format. Papers submitted for task 1 should be 6 to 12 pages long. Papers submitted for task 2 should be 12 to 14 pages long.
A LaTeX style file and LibreOffice and Word templates are available on the conference website (http://www.taln2014.org/site/soumission/). The length of the talks will be announced later and will depend on the number of accepted papers.

Submission procedure:
**********************
Papers, in PDF format, must be submitted at: https://www.easychair.org/conferences/?conf=semdis2014

Organizing committee:
*******************
The workshop is jointly organized by the CLLE-ERSS and IRIT-MELODI teams (Toulouse).
Cécile Fabre (CLLE-ERSS)
Nabil Hathout (CLLE-ERSS)
Mai Ho-Dac (CLLE-ERSS)
François Morlane-Hondère (CLLE-ERSS)
Philippe Muller (IRIT-MELODI)
Franck Sajous (CLLE-ERSS)
Ludovic Tanguy (CLLE-ERSS)
Tim Van de Cruys (IRIT-MELODI)

Program committee:
********************
Stergos Afantenos - IRIT, Toulouse
Ann Bertels - ILT-QLVL, Leuven
Yves Bestgen - UCL/CECL, Louvain-La-Neuve
Marie Candito - ALPAGE, Paris
Eric de la Clergerie - ALPAGE, Paris
Cécile Fabre - CLLE, Toulouse
Olivier Ferret - CEA-LIST, Fontenay-aux-Roses
Nabil Hathout - CLLE, Toulouse
Olivier Kraif - LIDILEM, Grenoble
Philippe Langlais - RALI, Montréal
Emmanuel Morin - LINA, Nantes
Philippe Muller - IRIT, Toulouse
Adeline Nazarenko - LIPN, Paris
Pascale Sébillot - IRISA, Rennes
Ludovic Tanguy - CLLE, Toulouse
Tim Van de Cruys - IRIT, Toulouse
From hamon at LIMSI.FR Fri Feb 14 21:17:19 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:17:19 +0100
Subject: Stage: Pretraitement de donnees textuelles pour un systeme d'analyse semantique, Strasbourg
Message-ID:

Date: Fri, 14 Feb 2014 12:50:12 +0400
From: Yuliya <4arly at bk.ru>
Message-ID: <1392367811.166452886 at f389.i.mail.ru>
X-url: http://macaon.lif.univ-mrs.fr/

Computer science internship offer
Preprocessing of textual data for a semantic analysis system

Company: Rebuz SAS, Strasbourg
Duration: 2-5 months
Level: M1-M2
Pay: 436.05 euros/month
Contact: Ms Yuliya Goncharova, 4arly at bk.ru

Details:
Rebuz is a company specializing in text analysis for business intelligence. The original system relies on semantic analysis supported by principles from cognitive linguistics. The internship will focus on improving the existing preprocessing module. This topic will be of particular interest to students wishing to learn more about Natural Language Processing (NLP) and Information Retrieval (IR). Tasks will be adapted to the level and preferences of the selected candidate.
Internship objectives
* Review of the current module (written in Java)
* Code cleanup and optimization
* Integration of the MACAON part-of-speech tagger [1]
* Test series and refinement

Skills sought:
- fluency in Java programming (a code sample will be requested)
- good knowledge of unit testing (JUnit)
- good teamwork skills
- rigor
- sense of responsibility
- autonomy

[1] http://macaon.lif.univ-mrs.fr/

From hamon at LIMSI.FR Fri Feb 14 21:23:35 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:23:35 +0100
Subject: Appel: Colloque international Langage et Analogie. Figement. Polysemie
Message-ID:

Date: Fri, 14 Feb 2014 18:52:51 +0100
From: Pierre-André Buvet
Message-ID: <52FE57F3.9070204 at ldi.univ-paris13.fr>

International conference
LANGAGE ET ANALOGIE. FIGEMENT. POLYSÉMIE (Language and Analogy. Fixedness. Polysemy)
University of Granada, Faculty of Translation and Interpreting
17-19 September 2014

The vitality of a discipline is measured by its capacity to integrate the sciences and technologies of its time. In linguistics, a large share of theoretical innovation develops through contact with two particularly productive fields: the cognitive sciences and computer science.
The former are uncovering a number of constraints, not specifically linguistic, that linguistic models cannot ignore unless they give up any claim to realism in the psychological sense of the term. The social dimension of languages itself can no longer be conceived independently of the fact that languages manifest themselves and develop through the activity of the individual brains of the speakers who speak and understand them. Moreover, on the technological side, advances in computing make it possible to explore ever larger quantities of linguistic data in ever more numerous corpora, and hence to hope not only for descriptions that tend toward exhaustiveness but also for an automatic processing of language that approaches, at least in its aims, its natural processing.

The concepts of analogy and fixedness, to which this conference is devoted, play an essential role in linking the language sciences with cognitive psychology and computational tools. The central role of analogy in human cognition is now well documented: human beings have a far more developed capacity than other animal species to perceive, conceive, and process a multiplicity of relevant similarities in their environment, thanks to which this environment appears to them at every moment as structured or, in other words, categorized and conceptualized.
At the linguistic level, analogical processes can involve similarities on at least three fundamental planes: on the plane of content, similarities between extra-linguistic entities produce concepts that give rise either to a semiotic reification in the form of denominations, or to a procedural unification by means of grammatical words with instructional meaning; on the formal plane, similarities between signifiers produce the regularities present wherever a paradigm exists; finally, the similarity between forms and contents is reflected in the motivated character of linguistic signs, whether this involves internal motivation, that is, iconicity in the strict sense (imagic iconicity, sound symbolism, etc.), or external or syntagmatic motivation (Saussurean relative motivation, diagrammatic iconicity in the Peircean sense, or the iconicity of contemporary cognitive linguistics, which appears in cognitive theories of metaphor and, more broadly, of embodiment).

Yet most of the concepts at play in analogical linguistics (denomination, motivation, iconicity, polysemy, etc.) have been profoundly renewed under the influence of developments in phraseology research. And it is well known that the problem of fixed expressions remains a technological bottleneck in computational linguistics, one that constitutes a major challenge for contemporary research on natural language processing.

The aim of this biennial conference, whose first session took place in Tunisia in October 2012, is therefore to bring together research in analogical linguistics and research in phraseology, in order to open up new perspectives whose innovative character rests on the plurality of approaches. The 2014 session will pay particular attention to polysemy, from the perspectives of analogical linguistics and of the linguistics of fixedness.
According to the former, polysemy is an intermediate situation between the configuration of isomorphism (one form, one meaning) and the disjunctive configurations of homonymy (one signifier for two distinct signifieds) and polymorphism (one signified for two distinct signifiers). The main field of investigation therefore concerns the qualitative analysis of the similarity between the senses of polysemous words and the cognitive processing of the analogies that this similarity conditions (relevance of a basic sense, existence of a metaphorical feeling, etc.). As for multiword units, it is established that their polysemy is restricted compared with that of single-word units, since fixed expressions consist of units saturated in their immediate context, and consequently the contextual variation associated with polysemous variation can operate only on the expression as a whole. The fact remains that some fixed expressions are polysemous (e.g. perdre pied), and moreover that the question of the link between the literal meaning and the phraseological meaning of fixed expressions tests the limits of the concept of polysemy.

In the general systematics of languages, a tension exists between two main principles of cognitive economy: quantitative economy and procedural economy. The latter is maximal in situations of isomorphism (diagrammatic analogy) and iconicity (imaginal analogy). Semiotic systems are indeed all the simpler in their functioning as they are regular, and all the easier to memorize as they are motivated.
However, these principles are opposed by the tendency toward quantitative economy, which aims to reduce the number of signs to be memorized. This tendency results either in polysemy (analogy between signs based on the similarity of signifieds), which increases the cognitive yield of a sign by giving it several uses, or in fixedness, which makes it possible to construct a new meaning by means of lexical units already available, the fixed sequence being most often motivated (analogy between form and content). One of the aims of this conference will therefore be to better understand how this tension is resolved across the languages and discourses considered. More broadly, the conference will welcome any contribution that sheds light on the relations between at least two of the three concepts examined: analogy, fixedness, and polysemy.

Invited speakers
Xavier Blanco (Autonomous University of Barcelona)
Mirella Conenna (Aldo Moro University, Bari)
Christian Cuxac (University of Paris VIII)
Maribel González Rey (USC, Santiago de Compostela)
Jean-René Ladmiral (University of Paris X)
Pedro Mogorron (University of Alicante)
Emmanuel Sander (University of Paris VIII)
Olivier Soutet (University of Paris IV)
Mark Turner (Case Western Reserve University)

Honorary committee
Francisco Lodeiro González, Rector of the University of Granada
Angela Collado Ais, Dean of the Faculty, University of Granada
Antonio Chicharro Chamorro, Head of the Department of General Linguistics and Literary Theory, University of Granada
Sylvie Laigneau-Fontaine, Director of the CPTC Laboratory (EA 4178), University of Burgundy
Eric Heilmann, Dean of the Faculty of Letters and Philosophy, University of Burgundy

Scientific committee
Didier Bottineau (University of Paris X)
Samir Bajric (University of Paris IV)
Carmen Mellado Blanco (University of Santiago de Compostela)
Pierre-André Buvet (University of Paris XIII)
Gaston Gross (University of Paris XIII)
Georges Kleiber (University of Strasbourg)
Juan de Dios Luque Durán (University of Granada)
Lucía Luque Nadal (University of Córdoba)
Angel López García (Universidad de Valencia)
Aleksandar Mijatovic (University of Rijeka)
Rosemeire Monteiro-Plantin (Federal University of Ceará)
Julia Sevilla Muñoz (Complutense University of Madrid)
Franck Neveu (University of Paris IV)
Luca Nobile (University of Burgundy)
José Manuel Pazos Bretaña (University of Granada)
Thierry Ponchon (University of Reims)
Christian Puech (University of Paris III)
Dan Savatovsky (University of Paris III)
Jean-Pierre Thibaud (University of Burgundy)
Thomas Verjans (University of Burgundy)

Proposals are to be submitted in French. They must be sent to the following address: colloqueanalogie at gmail.com.

Further information is available on the conference website at https://sites.google.com/site/colloquelangageetanalogie/accueil-grenade-2014-1

Submission deadline (5000 characters + bibliographic references): 5 May 2014
Response from the scientific committee: 30 June 2014
Submission of full papers: 31 January 2015 at the latest.
Publication of the proceedings: 2015

From hamon at LIMSI.FR Mon Feb 17 14:25:22 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Mon, 17 Feb 2014 15:25:22 +0100
Subject: Appel: TALN 2014, Extension de date limite
Message-ID:

Date: Mon, 17 Feb 2014 15:05:16 +0100
From: Philippe Blache
Message-Id: <0403127F-6FE2-4B9D-9767-C60B58ED1DC1 at lpl-aix.fr>
X-url: http://www.taln2014.org

CALL FOR PAPERS
TALN-2014
21st conference on Natural Language Processing (Traitement Automatique des Langues Naturelles)
http://www.taln2014.org
1-4 July 2014
Marseille

--> Submission deadline extension
--> New date: Thursday 27 February 2014
---------------------

SCHEDULE
==========

1. Long papers
- Submission deadline: 20 February 2014
- Notification to authors: 14 April 2014
- Deadline for final versions: 30 May 2014

2.
Short papers
- Submission deadline: 24 April 2014
- Notification to authors: 22 May 2014
- Deadline for final versions: 30 May 2014

3. Demonstrations
- Submission deadline: 21 April 2014
- Notification to authors: 10 May 2014
- Deadline for final versions: 30 May 2014

PRESENTATION
============

Organized by the LPL (Laboratoire Parole et Langage) and the LIF (Laboratoire d’Informatique Fondamentale), the TALN (Traitement Automatique des Langues Naturelles) conference will be held, for its 20th anniversary, in Marseille (Faculté Saint-Charles, Université d'Aix-Marseille) from 1 to 4 July 2014.

The TALN’2014 conference, organized under the aegis of ATALA, will be held jointly with the 16th edition of the Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL’2014).

TALN’2014 will include oral presentations of research work and position papers, poster presentations, invited talks, and demonstrations.

The official language of the conference is French. Papers in English are accepted from non-French-speaking participants.

TYPES OF PAPERS
=======================

Two paper formats are planned: long papers (12 to 14 pages) and short papers (6 to 8 pages).

Authors are invited to submit two types of papers:
- papers presenting original research work
- position papers presenting a point of view on the state of research in NLP, grounded in solid experience of the field

Papers must present original work that makes a substantial contribution relative to other work that may already have been published. Translations of papers published elsewhere in another language are not accepted.
Long papers will be presented as oral presentations, short papers as posters. Papers may address any topic in NLP.

SELECTION CRITERIA
=====================

Submissions will be reviewed by at least two specialists in the field.

For research papers, the following will be considered in particular:
- relevance to the conference topics
- the importance and originality of the contribution
- the correctness of the scientific and technical content
- critical discussion of the results, in particular in relation to other work in the field
- the positioning of the work within international research
- the organization and clarity of the presentation

For position papers, priority will be given to:
- breadth of view and coverage of the state of the art
- the originality and impact of the point of view presented

Selected papers will be published in the conference proceedings. From among the accepted papers, the program committee will select one paper (TALN prize) to be recommended for publication (in an extended version) in the journal Traitement Automatique des Langues (TAL).

SUBMISSION PROCEDURE
=======================

Papers should be written in French by French speakers, and in English by those who are not proficient in French.
A LaTeX style file and a Word template are available on the conference website: http://www.taln2014.org

ORGANIZING COMMITTEE
=====================

Philippe Blache
Frédéric Béchet
Carine André
Brigitte Bigi
Sébastien Bermond
Cyril Deniaud
Nadéra Bureau
Benoît Favre
Stéphanie Desous
Joëlle Lavaud
Nuria Gala
Frédéric Lefevre
Grégoire Montcheuil
Alexis Nasr
Catherine Perrot
Klim Peshkov
Laurent Prévot
Carlos Ramisch
Stéphane Rauzy
Claudia Starke

CONTACTS
========

philippe.blache[arobas]lpl-aix.fr
nadera.bureau[arobas]lpl-aix.fr

From hamon at LIMSI.FR Tue Feb 18 21:50:56 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:50:56 +0100
Subject: Seminaire: Data, digital methods and mapping social complexity, ENSCI, Paris
Message-ID:

Date: Tue, 18 Feb 2014 18:35:45 +0100
From: Audrey Baneyx - Sciences Po
Message-ID: <530399F1.6080804 at sciencespo.fr>
X-url: http://www.digitalmethods-seminar.org

Dear colleague,

We are pleased to invite you to take part in the new joint research seminar of the médialab (Sciences Po) and the CorTexT platform (IFRIS), "*Data, digital methods and mapping social complexity: Visualizing social and semantic dynamics in the social sciences*".
The seminar will be hosted by ENSCI in Paris and run by a team of researchers from Sciences Po and IFRIS from February to July 2014. It is intended for PhD students, researchers, engineers, and designers. Our meetings will take place on eight Thursday afternoons and two full days in the Espace Viénot at ENSCI, located on the ground floor of 48 rue Saint Sabin, Paris. The working language will be French, except for sessions to which English-speaking researchers are invited.

You will find below the presentation of the seminar and the session program. We would be delighted to have you with us. Please let us know that you will attend by registering directly on our website: www.digitalmethods-seminar.org.

Best regards,
/The Organizing Committee/

--------------------------------------------------

Data, digital methods and mapping social complexity: Visualizing social and semantic dynamics in the social sciences

*Presentation of the seminar*

Maps are visual representations of space. They are codified images that indicate the presence of elements in a given territory, such as objects, zones, or flows. The art of cartography has a long and rich history. Initially designed to assist the movement of people and goods, the map aims to represent geographical space by positioning graphic elements on a two-dimensional surface. As a figurative tool, the map quickly became an instrument of research and management. Indeed, cartographic practice has strongly shaped our ways of representing, navigating, understanding, organizing, and contesting the spaces we inhabit.

The recent advent of computing and satellites has profoundly transformed cartographic practice.
Geographic Information Systems (GIS) now make it possible to capture, store, manipulate, analyze, and manage spatial data in support of science and decision-making. Satellites make it possible to position and track movements over the entire globe, while millions of users access interactive maps via their computers or mobile phones.

In parallel with these developments, a growing number of social science researchers are taking an interest in the representation of non-spatial data. These representations are regularly presented as maps of unknown territories, capable of depicting the emergence, stabilization, and destabilization of complex social phenomena. By drawing on tools for the extraction, automated processing, and visualization of data from heterogeneous sources (document corpora, databases, web data), these researchers hope to map social phenomena dynamically at different levels of granularity. The technical and epistemological transition opened up by these tools is sometimes called /Digital Humanities/.

These developments seem particularly attractive to researchers interested in the quantitative study of social phenomena. These tools could thus bridge the methodological gap that separates the analysis of specific interactions from that of global structures. Given the constant expansion of the production and circulation of digital data, it seems essential for social science researchers to examine these new data sources as well as the tools and methods for analyzing them. Among the latter, network analysis makes it possible to position actors by visualizing their interactions. It complements statistical analyses (regression, correspondence analysis, etc.), offering a new angle from which to study society /made/ and /in the making/.
Such an approach opens up new perspectives for many disciplinary fields in the social sciences, ranging from the sociology of science and technology to political science.

But where do these tools come from? The development of data processing and visualization software such as Gephi, the CorTexT Manager, Science Scape, and Hyphe is the fruit of interdisciplinary work at the intersection of computational linguistics, data mining, artificial intelligence, dynamical systems, and network science. These communities have worked on varied problems such as data extraction, parsing, disambiguation, clustering, filtering, and visualization. Likewise, the graphic semiology used in this software draws on work in cartography and design. Thus, to produce relevant, robust, and readable visualizations, social science researchers can turn to communities of researchers and practitioners already structured around specific problems and methods.

Our seminar aims to orchestrate encounters and foster exchange between these communities (researchers in the social sciences and the "exact" sciences, engineers, and designers) through the reading, presentation, and discussion of academic and non-academic texts, guests' work, and tools. Attached you will find the roadmap (in English) that we have drawn up to reflect the many questions and ways of working that /budding/ cartographers of social complexity face.
*Session program*

*Schedule*

*27.02.14 // ENSCI -- Espace Viénot // 14:00-17:30*
Introduction
Richard Rogers (University of Amsterdam)

*06.03.14 // ENSCI -- Espace Viénot // 14:00-17:30*
Mapping science and technology through structured data
Jean-Philippe Cointet (INRA)

*27.03.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Digitized archives and distant reading
Claire Lemercier (Sciences Po)

*10.04.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Natively digital data mapping
Nortje Marres (Goldsmiths University of London)

*24.04.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Transformative interactions: web effects on social dynamics
/Invitations in progress/

*15.05.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Visualizing complexity
Donato Ricci (Sciences Po)

*05.06.14 // ENSCI -- Espace Viénot // 14:00-17:30*
Spatializing data
Jacques Levy (EPFL)

*19.06.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Activism, journalism, decision-making: DMT's in practice
Benjamin Ooghe-Tabanou (Regards Citoyens) & data journalist [to be confirmed]

*10.07.14-11.07.14 // ENSCI -- Espace Viénot [to be confirmed] // 9:30-17:30*
BarCamp
With the participation of the teams of the IFRIS CorTexT platform and the Sciences Po médialab

From hamon at LIMSI.FR Tue Feb 18 21:31:19 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:31:19 +0100
Subject: Conf: TOTh 2014,
disputatio, 12-13 June 2014
Message-ID:

Date: Mon, 17 Feb 2014 09:32:04 +0100
From: Luc Damas
Message-ID:
X-url: http://porphyre.org/toth/toth-2014/disputatio

------------------------------------------------------
TOTh 2014 - Disputatio
12-13 June 2014
www.porphyre.org/toth/
http://porphyre.org/toth/toth-2014/disputatio

"Harris et les immunologistes : un exemple à suivre ?" (Harris and the immunologists: an example to follow?)
Pr. Pierre Lerat - Université Paris XIII

Zellig Harris based his discourse analyses primarily on a universal logico-linguistic mechanism: predication. He obtained particularly significant results in his last works (1988), which deal with reports by immunologists. The reciprocal constraints between the types of predicative expressions and the types of referential expressions required by scientific and technical discourse are strong. The interest of this approach raises two questions: for terminology, that of the predictability of specialized words; for ontologies, that of working on predicative schemas other than in terms of "associative relations" among others.
---------------------------------------------------

From hamon at LIMSI.FR Tue Feb 18 21:52:28 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:52:28 +0100
Subject: Job: Research Assistant position on OBDA at the Free Univ.
of Bozen-Bolzano, Italy
Message-ID:
Date: Tue, 18 Feb 2014 21:00:09 +0100
From: Diego Calvanese
Message-ID:

------------------------------------------------------------------------
Research Assistant position at the Free University of Bozen-Bolzano/Italy
------------------------------------------------------------------------

The KRDB Research Centre for Knowledge and Data at the Faculty of Computer Science of the Free University of Bozen-Bolzano (FUB), Italy is looking for a talented Research Assistant to work full-time on

*** SPARQL Query Answering for Ontology Based Data Access and Ontology Based Data Integration ***

The position will be financed by the Large Scale Integrating Project (IP) "Scalable End-user Access to Big Data" (Optique) [2], within the EU 7th Framework Programme (FP7). The position concerns the design and implementation of techniques for optimized SPARQL query answering over virtual RDF graphs, obtained through R2RML mappings over relational databases.

Requirements for this position are an MSc or preferably a PhD in Computer Science, fluency in English, and strong Java programming skills. Some background in SPARQL, SQL, and OWL is also required.

During this project the applicant will have the opportunity to apply the results of their research in challenging industrial use-cases involving reasoning on Big Data, as provided by the Optique project and by the user base of the -ontop- [1] framework for Ontology Based Data Access.

The position is to start in 2014 as early as possible. The position has an initial contract duration of 1 year, and will be extended for up to 3 years in case of mutual agreement. Candidates interested in pursuing a PhD in Computer Science that combines applied and foundational research may also apply for enrollment in the PhD program of the Faculty of Computer Science at FUB, starting in Winter 2014.

The position comes with a great location, Bolzano [3,4].
The Free University of Bozen-Bolzano is located right in the center of one of the most beautiful cities in the Alps, and consistently ranks among the top locations in Italy for its living standard. The region, South Tyrol, is among the top destinations for hikers, mountaineers, rock climbers, skiers, etc. And last, as one would expect from any Italian location, food in Bolzano is great.

Interested applicants should get in contact immediately with Prof. Diego Calvanese (calvanese at inf.unibz.it), Dr. Martin Rezk (mrezk at inf.unibz.it), or Dr. Guohui Xiao (xiao at inf.unibz.it). Please attach a detailed CV including a statement of interest, project experience (if any) and up to 2 references (reference letters will be requested directly from the referees).

[1] http://ontop.inf.unibz.it/
[2] http://www.optique-project.eu/
[3] http://en.wikipedia.org/wiki/Bolzano/
[4] https://www.google.com/search?q=bolzano+italy&prmd=imvns&tbm=isch

From hamon at LIMSI.FR Tue Feb 18 21:47:08 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:47:08 +0100
Subject: Job: Speech Data Evaluator, Google Speech Research (Multiple positions)
Message-ID:
Date: Tue, 18 Feb 2014 12:49:33 +0000
From: Hadrien Gelas
Message-ID:

Hello,

Please consider the announcement below and, if possible, especially if you are involved in teaching programmes, circulate this message to students or other interested people.

Thank you in advance,
Best regards,
Hadrien Gelas
________________________________

Speech Data Evaluator for French

Job title: Speech Data Evaluator for French (Multiple positions)

Linguistic Field(s):
- Phonetics/Phonology
- Semantics
- Discourse Analysis

Job description: As a Speech Data Evaluator and a native-level speaker of French, you will be part of a team based in Dublin, processing large amounts of linguistic data and carrying out a number of tasks to improve the quality of Google’s speech synthesis and speech recognition in your own language.
This includes: - classifying and annotating linguistic data - transcription - labeling text for disambiguation, expansion, and text normalization - providing phonetic transcription of lexicon entries according to given standards and using in-house tools Job requirements: - native-level speaker of French (with good command of the standard dialect) and fluent in English - passion for language with good knowledge of orthography and grammar in the target language - a degree in a language-related field such as linguistics, language teaching, translation, editing, writing, proofreading, or similar - keen interest in technology and computer-literate (should feel comfortable using in-house tools and should have an interest in current speech, mobile and online technology) - attention to detail and good organizational skills Project duration: 6-11 months (with potential for extension) **This is not a permanent position but a contract position.** For immediate consideration, please email your CV and cover letter in English (PDF format preferred) with "Speech Data Evaluator [French]" in the subject line. 
Email Address for applications: DataOpsMan at gmail.com
Contact information: Ara Kim
Closing date: open until filled

From hamon at LIMSI.FR Tue Feb 18 21:36:13 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:36:13 +0100
Subject: Appel: IC 2014, Posters et demonstrations
Message-ID:
Date: Mon, 17 Feb 2014 17:06:24 +0100
From: Catherine Faron Zucker
Message-ID: <53023380.10006 at i3s.unice.fr>
X-url: http://www.irit.fr/IC2014

*Call for submissions for the Posters and Demonstrations session of the*
25èmes Journées Francophones d'Ingénierie des Connaissances:
/Pervasive knowledge: users immersed in knowledge-based systems and knowledge-based systems immersed in the world/
http://www.irit.fr/IC2014
12 to 16 May 2014, Clermont-Ferrand
------------------------------------------------------------------------
*Important dates:*
Submission deadline: Friday 28 February 2014
Notification to authors: Monday 17 March 2014
------------------------------------------------------------------------
The Ingénierie des Connaissances conference celebrates its 25th anniversary this year! It is the annual meeting of the French and French-speaking community interested in knowledge engineering issues.
Academics and industry practitioners meet there to exchange ideas and reflect on research problems arising in knowledge acquisition, representation and management.

The rise of Information and Communication Sciences and Technologies, and of Web technologies in particular, throughout society is bringing about changes in individual and collective practices. Knowledge Engineering accompanies this evolution by inventing the models, methods and tools that make it possible to integrate knowledge, and reasoning over that knowledge, into computing environments.

As a result, knowledge is now ubiquitous, while at the same time the research communities in Artificial Intelligence are opening up to one another to address the problems that arise. Thus, building knowledge-based systems draws on Knowledge Engineering, natural language processing, machine learning and data mining; knowledge processing combines Knowledge Engineering with information retrieval, operations research and data analysis; modelling computing environments and their human or software actors brings together Knowledge Engineering and multi-agent systems; finally, Knowledge Engineering touches on themes that are classic in the human-computer interaction and computer-based human learning communities, in order to place the user at the centre of systems.

By placing the user at the heart of computing systems, Knowledge Engineering is called upon to process the mass of data available on the web while helping the user to visualize, contextualize, explain, recommend, classify, predict, approximate, secure and make these data reliable, and to compensate for their imprecision or even inconsistency.
------------------------------------------------------------------------
*Submissions*

/*Posters:*/ Posters cover work in progress or work at the stage of preliminary research or applied results. They are intended to stimulate discussion within the community so as to best support the start of the work presented. They will be displayed in a discussion area.

/*Demonstrations:*/ One or more demonstration sessions will be organized so that participants can present their developments in a discussion area. These may be mock-ups, prototypes or finished tools, as well as industrial and operational applications. Demonstration proposals may complement a paper.

Results may be submitted as both a poster AND a demonstration. A poster and/or demonstration submission may also complement a (long or short) paper accepted at the conference, or a presentation in the special session "Logiciels, Projets, Industrie" (Software, Projects, Industry).

The texts accompanying demonstrations and posters must not exceed 3 pages and must follow the document styles provided for the conference: http://www.irit.fr/IC2014. Authors are invited to submit their contributions in PDF format on EasyChair: https://www.easychair.org/conferences/?conf=ic2014. The papers for accepted posters and demonstrations will be included in the IC 2014 proceedings. For accepted posters, the presentation format will be A0 portrait.
------------------------------------------------------------------------
The Ingénierie des Connaissances conference is held under the aegis of the IC college of AFIA
------------------------------------------------------------------------
*Programme committee chair:* Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)
*Organizing committee chair:* Catherine Roussey (IRSTEA)

From hamon at LIMSI.FR Tue Feb 18 21:59:02 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:59:02 +0100
Subject: Appel: Computerm 2014, COLING 2014 Workshop
Message-ID:
Date: Tue, 18 Feb 2014 22:56:44 +0100
From: Thierry Hamon
Message-ID: <87ppmk84pv.fsf at smtp-out.limsi.fr>
X-url: http://perso.limsi.fr/hamon/Computerm2014/

4th International Workshop on Computational Terminology
CompuTerm 2014
COLING 2014 Workshop
23rd or 24th August 2014, Dublin, Ireland
http://perso.limsi.fr/hamon/Computerm2014/
Submission deadline: 2nd May 2014

Computational Terminology covers an increasingly important aspect in Natural Language Processing areas such as text mining, information retrieval, information extraction, summarisation, textual entailment, document management systems, question-answering systems, ontology building, etc. Terminological information is paramount for knowledge mining from texts for scientific discovery and competitive intelligence.
Scientific needs in fast growing domains (such as biomedicine, chemistry and ecology) and the overwhelming amount of textual data published daily demand that terminology is acquired and managed systematically and automatically; while in well established domains (such as law, economy, banking and music) the demand is on fine-grained analyses of documents for knowledge description and acquisition. Moreover, capturing new concepts leads to the acquisition and management of new knowledge. The aim of this fourth CompuTerm workshop is to bring together Natural Language Processing researchers to discuss recent advances in computational terminology and its impact in many NLP applications. The topics addressed in this workshop are wide ranging: - term extraction, recognition and filtering, which is the core of the terminological activity that lays basis for other terminological topics and tasks; - event recognition and extraction, that extends the notion of the terminological entity from terms meaning static units up to terms meaning procedural and dynamic processes; - acquisition of semantic relations among terms, which is also an important research topic as the acquisition of semantic relationships between terms finds applications such as the population and update of existing knowledge bases, definition of domain specific templates in information extraction and disambiguation of terms; - term variation management, that helps to deal with the dynamic nature of terms, their acquisition from heterogeneous sources, their integration, standardisation and representation for a large range of applications and resources, is also increasingly important, as one has to address this research problem when working with various controlled vocabularies, thesauri, ontologies and textual data. Term variation is also related to their paraphrases and reformulations, due to historical, regional, local or personal issues. 
Besides, the discovery of synonym terms or term clusters is equally beneficial to many NLP applications;
- definition acquisition, which covers important research and aims to provide precise and unambiguous descriptions of terminological entities. Such definitions may contain elements necessary for the formal description of terms and concepts within ontologies;
- consideration of user expertise, which is becoming a new issue in terminological activity and takes into account the fact that specialized domains contain notions and terms that are often not understandable to non-experts or laymen (such as patients in the medical area, or bank clients in the banking and economy areas). This aspect, although related to specialized areas, provides a direct link between specialized languages and general language;
- systematic terminology management and updating of domain-specific dictionaries and thesauri, which are important aspects of maintaining existing terminological resources. These aspects become crucial because the number of existing terminological resources is constantly increasing and because their long-term and efficient use depends on their maintenance and updating, while their re-acquisition is costly and often non-reproducible;
- monolingual and multilingual resources, which open the possibility of developing cross-lingual and multilingual applications, and require specific corpora, methods and tools whose design and evaluation are challenging issues;
- robustness and portability of methods, which make it possible to apply methods developed in one given context to other contexts (corpora, domains, languages, etc.)
and to share the research expertise among them;
- social networks and modern media processing, which attract an increasing number of researchers and provide challenging material to be processed;
- utilization of terminologies in various NLP applications, as they are a necessary component of any NLP system dealing with domain-specific literature, which is another novel and challenging research direction.

The workshop submissions are open to different approaches, ranging from term extraction in various languages (using verb co-occurrence, information theoretic approaches, machine learning, etc.) and the extraction of translation pairs from bilingual corpora based on terminology, up to semantics-oriented approaches and theoretical aspects of terminology.

Besides, experiments on the evaluation of terminological methods and tools are also encouraged, since they provide interesting and useful evidence of the utility of terminological resources:
- direct evaluation may concern the efficiency of the terminological methods and tools in capturing terminological entities and relations, as well as various kinds of related information;
- indirect evaluation may concern the use of terminological resources in various NLP applications and the impact these resources have on the performance of the automatic systems. In this case, research and competition tracks (such as TREC, BioCreative, CLEF, CLEF-eHealth, I2B2, *SEM, and other shared tasks) provide particularly fruitful evaluation contexts and have proved very successful in identifying key problems in terminology such as term variation and ambiguity.

We encourage authors to submit their research work related to the various aspects of computational terminology mentioned in this call. The workshop authors will be invited to submit an extended version of their work to a special issue of an international journal or to a book collection.
Important dates:
- 1st workshop CFP: 17th February 2014
- Paper due date: 2nd May 2014
- Notification of acceptance: 6th June 2014
- Camera-ready deadline: 27th June 2014
- Workshop: 23rd or 24th August 2014

Submission instructions: Authors are invited to submit full papers on original, unpublished work in the topic area of this workshop. Submissions should follow the COLING 2014 instructions for authors (http://www.coling-2014.org/call-for-papers.php) and be formatted using the COLING 2014 style files for LaTeX, MS Word or LibreOffice (http://www.coling-2014.org/doc/coling2014.zip). They should be anonymized for blind review and must not exceed 8 pages plus two extra pages for references. The PDF files are to be submitted electronically at https://www.softconf.com/coling2014/WS-9/

Organisers:
- Patrick Drouin, Observatoire de linguistique Sens-Texte, Université de Montréal, Montréal, Canada
- Natalia Grabar, CNRS UMR 8163 STL, Université Lille 1&3, Villeneuve d'Ascq
- Thierry Hamon, LIMSI-CNRS, Orsay, France & Université Paris 13, Sorbonne Paris Cité, Villetaneuse, France
- Kyo Kageura, Library and Information Science Laboratory, University of Tokyo, Tokyo, Japan

Program Committee
- Sophia Ananiadou, National Centre for Text Mining, UK
- Olivier Bodenreider, NLM, USA
- Beatrice Daille, IRIN, France
- Éric Gaussier, LIG, Université Joseph Fourier, France
- Gregory Grefenstette, Clairvoyance Corp, France
- Marie-Claude L'Homme, University of Montréal, Canada
- Philippe Langlais, RALI, Canada
- John McNaught, UMIST & National Centre for Text Mining, UK
- Rogelio Nazar, University Pompeu Fabra, Spain
- Goran Nenadic, UMIST & National Centre for Text Mining, UK
- Jorge Vivaldi Palatresi, University Pompeu Fabra, Spain
- Selja Seppälä, University at Buffalo, USA
- Karine Verspoor, NICTA, Australia
- Pierre Zweigenbaum, LIMSI, France

From hamon at LIMSI.FR Tue Feb 18 21:36:50 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:36:50 +0100
Subject: Appel: Organisation de la Plate-forme IA de l'AFIA 2015 (date limite reponse 28 mars 2014)
Message-ID:
Date: Mon, 17 Feb 2014 23:19:35 +0100
From: Secretaire AFIA
Message-Id: <87A5A11D-FEFD-4CB4-AC31-63CAF45F1382 at afia.asso.fr>
X-url: http://www.afia.asso.fr/tiki-download_file.php?fileId=146

--- Please forward this message to anyone who may be interested ---

AFIA is looking for an organizer for its 2015 Platform (period: between 29 June and 10 July; maximum registration fee: 300 euros for a 5-day event; capacity to host up to 4 conferences in parallel; expected number of participants: 300).

If you are interested in organizing such an event, please contact the AFIA board (bureau AT afia.asso.fr) for further specifications, bearing in mind that the deadline for a consolidated response is 28 March 2014.

For more information, see: http://www.afia.asso.fr/tiki-download_file.php?fileId=146

The AFIA board of directors

From hamon at LIMSI.FR Tue Feb 18 21:47:46 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:47:46 +0100
Subject: Conf: Online Registration to LREC 2014 now open!
Message-ID:
Date: Tue, 18 Feb 2014 17:11:38 +0100
From: ELRA ELDA Information
Message-ID: <5303863A.4070703 at elda.org>
X-url: http://lrec2014.lrec-conf.org/en/registration/
X-url: http://www.lrec-conf.org/lrec2014

[Apologies for multiple postings]

Online registration for the main conference, the workshops and the tutorials is now open at http://lrec2014.lrec-conf.org/en/registration/.
Contact: registration at lrec-conf.org
www.lrec-conf.org/lrec2014
Follow us on Twitter: @LREC2014 (https://twitter.com/LREC2014)

From hamon at LIMSI.FR Tue Feb 18 21:41:53 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:41:53 +0100
Subject: Appel: Special Track Computational Intelligence (ACM-MEDES 2014)
Message-ID:
Date: Mon, 17 Feb 2014 18:12:18 -0500
From: "Ismail Biskri"
Message-ID: <009501cf2c35$b43f7460$1cbe5d20$@Biskri at uqtr.ca>
X-url: http://sigappfr.acm.org/MEDES/14/index.php?option=com_content&view=article&id=23&Itemid=24

http://sigappfr.acm.org/MEDES/14/index.php?option=com_content&view=article&id=23&Itemid=24

Description

What we call here Computational Intelligence is the study of data mining, information extraction, knowledge processing, language processing, discourse processing, and any area that relates to the intelligent processing of language, information and knowledge. Until recently, work in this area was carried out by experts in computer science or in artificial intelligence. In fact, there were many computer applications in Computational Intelligence without any theoretical support. Today, people working in such areas often work as members of pluridisciplinary and interdisciplinary teams, including, of course, computer scientists and experts in artificial intelligence but also mathematicians, statisticians, linguists, language experts, philosophers, logicians, cognitive scientists, neuroscientists, psycholinguists, anthropologists, etc.

Therefore, Computational Intelligence must become more connected to areas such as cognitive science, mathematics, statistics, linguistics, philosophy, logic, neuroscience, etc. through the development of methods, strategies and algorithms for all aspects of language processing, discourse processing, knowledge processing, information processing, etc. These methods and strategies can be assessed according to their theoretical or practical significance.
The aim of this track is to present original works related to computational intelligence. These works should focus either on practical applications or on theoretical foundations of the discipline. We therefore invite authors to submit original papers on any work related to theoretical or practical aspects of Computational Intelligence.

Topics of Interest

Submissions are invited in, but not limited to, the following areas:
* Philosophy of language and discourse
* Cognitive science
* Non standard logics
* Natural language processing
* Computational linguistics (lexicology; morphology; syntax; semantics)
* Information extraction
* Ontologies
* Knowledge processing
* Data Mining, Text Mining, Web Mining
* Information retrieval
* Web Semantics

Paper Submission

Papers should be submitted in electronic form (PDF) using EasyChair (https://www.easychair.org/conferences/?conf=medes2014). Papers may be up to 6 pages long and must be formatted according to the ACM proceedings format, which can be found here: http://www.acm.org/sigs/publications/proceedings-templates. Papers that fail to comply with the length limit will be rejected.

Submissions will be peer-reviewed by at least 3 reviewers. After the preliminary notification date, authors may respond to the reviewers' questions and comments with evidence and arguments. Based on this rebuttal, reviewers notify authors of the final decision. Selection criteria will include: relevance, significance, impact, originality, technical soundness, and quality of presentation. Preference will be given to submissions that take strong or challenging positions on important emergent topics related to Digital Ecosystems. At least one author should attend the conference to present the paper.

The conference Proceedings will be published by ACM and indexed by the ACM Digital Library and DBLP.
Important dates
* Full Papers Submission: May 12, 2014
* Notification of Paper Acceptance: July 13, 2014
* Camera-ready papers: August 07, 2014
* Special Track days: September 15-17, 2014

Special Track chairs

Ismaïl Biskri
Université du Québec à Trois-Rivières
Canada
ismail.biskri at uqtr.ca

Adel Jebali
University of Concordia
Canada
adel.jebali at concordia.ca

Programme Committee (to be completed)

Ismail Biskri, Universite de Québec à Trois Rivières, Canada
Jean-Pierre Desclés, Université de Paris-Sorbonne, France
Al Moatasem Alrahabi, Université Paris-Sorbonne Abu-Dhabi
Rim Faiz, IHEC de Carthage, Université du 7 novembre à Carthage, Tunisia
Adel Jebali, University Concordia, Montreal, Canada
Jean-Guy Meunier, Université de Québec à Montréal, Canada
Anca Pascu, Université de Bretagne Occidentale, France
Christophe Jouis, Université Paris 6, France
Malek Mouhoub, University of Regina, Canada
Juan Manuel Torres Moreno, Université d’Avignon, France
Agata Jackiewicz, Université Paris-Sorbonne, France
Marc Bertin, Université de Québec à Montréal, Canada
Iana Atanassova, University Concordia, Canada
Florence Le Priol, Université Paris-Sorbonne, France
Sonia El Euch, Université du Québec à Trois-Rivières, Canada
Maria Antonietta Pinto, Universita di Roma, Italy
Roger Nkambou, Université du Québec à Montréal, Canada
Daniel Vanderveken, Université du Québec à Montréal, Canada
Candida De Sousa Melo, Universidade Federal do Rio Grande do Norte, Brazil

From hamon at LIMSI.FR Tue Feb 18 21:48:48 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:48:48 +0100
Subject: Info: Prix de la these ATALA 2014
Message-ID:
Date: Tue, 18 Feb 2014 17:15:57 +0100
From: pap
Message-ID: <5303873D.9000903 at limsi.fr>
X-url: http://www.atala.org/_Prix-de-la-these_

ATALA 2014 thesis prize

Since 2011, ATALA (Association pour le Traitement Automatique des Langues) has awarded a thesis prize to reward outstanding French-language work in the field of natural language processing (NLP). This prize reflects ATALA's wish to support young French-speaking researchers and to help disseminate their work to the whole NLP community. Unlike in previous years, the 2014 prize carries no monetary award.

All doctors who defended their doctoral thesis at a French-speaking university between 1 September 2011 and 31 December 2013, and who have never previously applied for the ATALA thesis prize, are eligible.

A jury of about fifteen researchers and academics covering all areas of NLP research will examine the applications in order to select the winner.

The winner will receive the prize during the TALN 2014 conference, to be held at the Faculté Saint-Charles in Marseille from 1 to 4 July 2014. On that occasion, the winner will present his or her work in a plenary session.

Application deadline: 15 March 2014.

The application consists of a CV (2 pages), the pre-defence and defence reports, and the thesis.
Steps
- Fill in the form at http://www.atala.org/_Prix-de-la-these_
- Send an archive containing the CV, the reports and the thesis (or a URL from which the documents can be retrieved) to: pap at limsi.fr
- Results announced in early May 2014
- Prize awarded at TALN 2014

Jury: (being formed)
Co-chaired by: Frédérique Segond (Groupe Viseo), Patrick Paroubek (LIMSI-CNRS) and Charles Teissèdre (Université Paris-Sorbonne).

From hamon at LIMSI.FR Tue Feb 18 21:30:17 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:30:17 +0100
Subject: These: Dhouha Bouamor, Constitution de ressources linguistiques multilingues
Message-ID:
Date: Sun, 16 Feb 2014 13:54:56 +0000
From: SEMMAR Nasredine 202247
Message-ID: <50CC12289F0C854FBFF134F270030CD71C49B57B at EXDAG0-B2.intra.cea.fr>

Hello,

I am pleased to invite you to the defence of my thesis, carried out at the Laboratoire Vision et Ingénierie des Contenus (LVIC) and the Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI), entitled: "Constitution de ressources linguistiques multilingues à partir de corpus de textes parallèles et comparables".

The defence will take place on Friday 21 February 2014 at 14:00 at LIMSI.

You are also invited to the reception that will follow.
Best regards, Dhouha Bouamor ************************************************************************ The thesis committee will be composed of: Mr. Reinhard Rapp, Professor at the University of Mainz, Reviewer; Mr. Eric Gaussier, Professor at Université Joseph Fourier, Reviewer; Mr. Philippe Langlais, Professor at Université de Montréal, Examiner; Mr. François Yvon, Professor at Université Paris-Sud, Examiner; Mr. Pierre Zweigenbaum, Senior Researcher (Directeur de recherche) at LIMSI-CNRS, Thesis supervisor; Mr. Nasredine Semmar, Researcher at LVIC - CEA LIST, Co-supervisor ************************************************************************ Summary: Bilingual lexicons are particularly useful resources for machine translation and cross-lingual information retrieval. Building them manually requires strong expertise in both languages involved and is a costly process. Several automatic methods have been proposed as an alternative, but they are available for only a limited number of languages and their performance still falls far short of the quality of manual translations. Our work addresses the extraction of such bilingual lexicons from parallel and comparable text corpora. Working from parallel corpora, we present an approach to processing multiword expressions, from their automatic acquisition to their integration into a statistical machine translation system. For comparable corpora, we propose two new approaches that aim to extract specialized bilingual lexicons in the domains of corporate finance, breast cancer, wind energy and mobile technology. The first approach extends the distributional approach with a word sense disambiguation process. The second approach relies on Wikipedia and explicit semantic analysis. 
The originality of this approach lies in the fact that, instead of using the space of words in a corpus to represent the words to be translated, those words are represented in the space of Wikipedia article titles. Both approaches compare favorably to existing methods in most configurations. Keywords: bilingual lexicon extraction, parallel corpus, comparable corpus, alignment, statistical machine translation. ************************************************************************ Abstract: Bilingual lexicons are central components of machine translation and cross-lingual information retrieval systems. Their manual construction requires extensive expertise in both languages involved and is a costly process. Several automatic methods have been proposed as an alternative, but they often rely on resources available in a limited number of languages and their performance is still far behind the quality of manual translations. Our work concerns bilingual lexicon extraction from multilingual parallel and comparable corpora. Based on parallel corpora, we present an approach that focuses on processing multiword expressions, ranging from their automatic acquisition to their integration into a statistical machine translation system. For comparable corpora, we propose two new approaches that aim at extracting bilingual lexicons specialized in the corporate finance, breast cancer, wind energy and mobile technology domains. The first approach augments the distributional approach by proposing a word sense disambiguation process that keeps only the words that are more likely to give the best representation of a word to be translated. Our second approach is based on Wikipedia and explicit semantic analysis. The main originality of this approach is in the way words are represented: instead of being represented in the word space of a corpus, they are represented in the space of Wikipedia titles. 
The newly introduced approaches compare favorably to existing methods in almost all configurations. Keywords: bilingual lexicon extraction, parallel corpus, comparable corpus, alignment, statistical machine translation. ------------------------------------------------------------------------- From hamon at LIMSI.FR Tue Feb 18 21:06:14 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 18 Feb 2014 22:06:14 +0100 Subject: Appel: 7th BUCC Workshop at LREC, extension to 23/2 and JNLE special issue Message-ID: Date: Fri, 14 Feb 2014 22:41:02 +0100 From: Pierre Zweigenbaum Message-ID: <52FE8D6E.8050403 at limsi.fr> X-url: http://comparable.limsi.fr/bucc2014/ Apologies for multiple postings --- please forward to interested colleagues ============================================================ DEADLINE EXTENSION AND JOURNAL SPECIAL ISSUE ============================================================ 7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA Building Resources for Machine Translation Research http://comparable.limsi.fr/bucc2014/ May 27, 2014 Co-located with LREC 2014 Harpa Conference Centre, Reykjavik (Iceland) EXTENDED DEADLINE FOR PAPERS: February 23, 2014 https://www.softconf.com/lrec2014/BUCC2014/ *** INVITED SPEAKER *** Chris Callison-Burch (University of Pennsylvania) ============================================================ MOTIVATION In the language engineering and the linguistics communities, research in 
comparable corpora has been motivated by two main reasons. In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora. The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages. Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. 
However, beyond a few language pairs such as English-French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web. To exemplify such issues in a practical setting, this year's special focus will be on: Building Resources for Machine Translation Research. This special topic aims to address the need for: (1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and (2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora. TOPICS We solicit contributions including but not limited to the following topics: Topics related to the special theme: * Methods and tools for collecting and processing MT data, including crowdsourcing * Methods and tools for quality control * Tools for efficient annotation * Bilingual term and named entity collections * Multilingual treebanks, wordnets, propbanks, etc. * Comparable corpora with parallel units annotated * Comparable corpora for under-resourced languages and specific domains * Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc. * Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc. 
* Legal issues with collecting and redistributing data and generating derivatives Building comparable corpora: * Human translations * Automatic and semi-automatic methods * Methods to mine parallel and non-parallel corpora from the Web * Tools and criteria to evaluate the comparability of corpora * Parallel vs non-parallel corpora, monolingual corpora * Rare and minority languages, across language families * Multi-media/multi-modal comparable corpora Applications of comparable corpora: * Human translations * Language learning * Cross-language information retrieval & document categorization * Bilingual projections * Machine translation * Writing assistance Mining from comparable corpora: * Extraction of parallel segments or paraphrases from comparable corpora * Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc. IMPORTANT DATES February 23, 2014 Deadline for submission of papers (extended) March 10, 2014 Notification of acceptance March 27, 2014 Camera-ready papers due May 27, 2014 Workshop date SUBMISSION INFORMATION Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF-file via the START workshop manager at https://www.softconf.com/lrec2014/BUCC2014/ Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages. Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings. 
Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc. JOURNAL SPECIAL ISSUE Authors of selected papers will be encouraged to submit substantially extended versions of their manuscripts to an upcoming special issue on "Machine Translation Using Comparable Corpora" of the Journal of Natural Language Engineering. ORGANISERS Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France) Ahmet Aker, University of Sheffield (UK) Serge Sharoff, University of Leeds (UK) Stephan Vogel, QCRI (Qatar) Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France) CONTACT Pierre Zweigenbaum: pz (at) limsi (dot) fr SCIENTIFIC COMMITTEE * Ahmet Aker, University of Sheffield (UK) * Srinivas Bangalore (AT&T Labs, US) * Caroline Barrière (CRIM, Montréal, Canada) * Chris Biemann (TU Darmstadt, Germany) * Hervé Déjean (Xerox Research Centre Europe, Grenoble, France) * Kurt Eberle (Lingenio, Heidelberg, Germany) * Andreas Eisele (European Commission, Luxembourg) * Éric Gaussier (Université Joseph Fourier, Grenoble, France) * Gregory Grefenstette (INRIA, Saclay, France) * Silvia Hansen-Schirra (University of Mainz, Germany) * Hitoshi Isahara (Toyohashi University of Technology) * Kyo Kageura (University of Tokyo, Japan) * Adam Kilgarriff (Lexical Computing Ltd, UK) * Natalie Kübler (Université Paris Diderot, France) * Philippe Langlais (Université de Montréal, Canada) * Michael Mohler (Language Computer Corp., US) * Emmanuel 
Morin (Université de Nantes, France) * Dragos Stefan Munteanu (Language Weaver, Inc., US) * Lene Offersgaard (University of Copenhagen, Denmark) * Ted Pedersen (University of Minnesota, Duluth, US) * Reinhard Rapp (Université Aix-Marseille, France) * Sujith Ravi (Google, Mountain View, US) * Serge Sharoff (University of Leeds, UK) * Michel Simard (National Research Council Canada) * Richard Sproat (OGI School of Science & Technology, US) * Tim Van de Cruys (IRIT-CNRS, Toulouse, France) * Stephan Vogel (QCRI, Qatar) * Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France) * Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France) ------------------------------------------------------------------------- From hamon at LIMSI.FR Tue Feb 18 21:11:05 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 18 Feb 2014 22:11:05 +0100 Subject: Job: Resident Academic Post in Human Language Technology, Malta Message-ID: Date: Sat, 15 Feb 2014 11:06:40 +0100 From: François Portet Message-ID: <52FF3C30.9070000 at imag.fr> X-url: http://www.um.edu.mt/hrmd/vacancies X-url: http://www.um.edu.mt/__data/assets/pdf_file/0004/204259/Post_in_Human_Language_Technology.pdf The University of Malta has just announced a resident academic post in Human Language Technology within the Institute of Linguistics. This is a tenured position. Applications will be received until February 28th, 2014. This call is associated with the recently developed B.Sc. 
and M.Sc. programme in Computational Linguistics and Human Language Technology, and is part of an ongoing effort to strengthen the research and teaching in this area within the Institute. Full details are available from the following URL: http://www.um.edu.mt/hrmd/vacancies (see: Resident Academic Post in Human Language Technology, http://www.um.edu.mt/__data/assets/pdf_file/0004/204259/Post_in_Human_Language_Technology.pdf). Enquiries can be directed to Albert Gatt (albert.gatt at um.edu.mt) or the Institute of Linguistics (linguistics at um.edu.mt). ------------------------------------------------------------------------- From hamon at LIMSI.FR Tue Feb 18 21:18:10 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 18 Feb 2014 22:18:10 +0100 Subject: Conf: LATA 2014, Madrid, Spain, March 10-14, 2014 Message-ID: Date: Sun, 16 Feb 2014 11:59:41 +0100 From: "GRLMC" Message-ID: X-url: http://grammars.grlmc.com/lata2014/ ************************************************************************ 8th INTERNATIONAL CONFERENCE ON LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS LATA 2014 Madrid, Spain March 10-14, 2014 http://grammars.grlmc.com/lata2014/ ************************************************************************ PROGRAM Monday, March 10: 9:30 - 10:30 Registration 10:30 - 10:40 Opening 10:40 - 11:30 Helmut Seidl: Interprocedural Information Flow Analysis of XML Processors - Invited Lecture 11:30 - 11:45 Break 11:45 - 13:00 
Alberto Dennunzio, Enrico Formenti, Luca Manzoni: Extremal Combinatorics of Reaction Systems Fernando Arroyo, Sandra Gómez Canaval, Victor Mitrana, Ștefan Popescu: Networks of Polarized Evolutionary Processors are Computationally Complete Liang Ding, Abdul Samad, Xingran Xue, Xiuzhen Huang, Russell L. Malmberg, Liming Cai: Stochastic k-Tree Grammar and its Application in Biomolecular Structure Modeling 13:00 - 14:30 Lunch 14:30 - 15:45 Béatrice Bérard, Olivier Carton: Channel Synthesis Revisited Daniel Průša: Weight-reducing Hennie Machines and Their Descriptional Complexity Manfred Droste, Stefan Dück: Weighted Automata and Logics for Infinite Nested Words 15:45 - 16:00 Break 16:00 - 17:15 Friedrich Otto, František Mráz: Extended Two-Way Ordered Restarting Automata for Picture Languages Sang-Ki Ko, Yo-Sub Han, Kai Salomaa: Top-Down Tree Edit-Distance of Regular Tree Languages Bertram Felgenhauer, René Thiemann: Reachability Analysis with State-Compatible Automata 17:15 - 17:45 Coffee Break 17:45 - 18:35 Leslie A. Goldberg: The Complexity of Approximate Counting - Invited Lecture Tuesday, March 11: 9:00 - 9:50 Sanjeev Khanna: Matchings, Random Walks, and Sampling - Invited Lecture 9:50 - 10:00 Break 10:00 - 11:15 Niko Beerenwinkel, Stefano Beretta, Paola Bonizzoni, Riccardo Dondi, Yuri Pirola: Covering Pairs in Directed Acyclic Graphs Eike Best, Raymond Devillers: Characterisation of the State Spaces of Live and Bounded Marked Graph Petri Nets María Martos-Salgado, Fernando Rosa-Velardo: Expressiveness of Dynamic Networks of Timed Petri Nets 11:15 - 11:45 Coffee Break 11:45 - 13:00 Bireswar Das, Patrick Scharpfenecker, Jacobo Torán: Succinct Encodings of Graph Isomorphism Matthias Gallé, Matías Tealdi: On Context-Diverse Repeats and their Incremental Computation Marcella Anselmo, Dora Giammarresi, Maria Madonia: Picture Codes with Finite Deciphering Delay 13:00 - 14:30 Lunch 14:30 - 15:45 Paul Tarau: Computing with Catalan Families Joan Boyar, Shahin Kamali, Kim S. 
Larsen, Alejandro López-Ortiz: On the List Update Problem with Advice Rob Gysel: Minimal Triangulation Algorithms for Perfect Phylogeny Problems 15:45 - 16:00 Break 16:00 - 17:15 Grégoire Laurence, Aurélien Lemay, Joachim Niehren, Sławek Staworko, Marc Tommasi: Learning Sequential Tree-to-word Transducers Dariusz Kalociński: On Computability and Learnability of the Pumping Lemma Function Slimane Bellaouar, Hadda Cherroun, Djelloul Ziadi: Efficient List-based Computation of the String Subsequence Kernel Wednesday, March 12: 9:00 - 9:50 Oscar H. Ibarra: On the Parikh Membership Problem for FAs, PDAs, and CMs - Invited Lecture 9:50 - 10:00 Break 10:00 - 11:15 Marius Konitzer, Hans Ulrich Simon: DFA with a Bounded Activity Level Shenggen Zheng, Jozef Gruska, Daowen Qiu: On the State Complexity of Semi-Quantum Finite Automata Vojtěch Vorel: Complexity of a Problem Concerning Reset Words for Eulerian Binary Automata 11:15 - 11:45 Coffee Break 11:45 - 13:00 Pascal Caron, Marianne Flouret, Ludovic Mignot: (k,l)-Unambiguity and Quasi-Deterministic Structures: an Alternative for the Determinization Zuzana Bednárová, Viliam Geffert: Two Double-Exponential Gaps for Automata with a Limited Pushdown Parosh Aziz Abdulla, Mohamed Faouzi Atig, Jari Stenman: Computing Optimal Reachability Costs in Priced Dense-Timed Pushdown Automata 13:00 - 14:30 Lunch 15:45 - 17:45 Sightseeing in Madrid by Bus Thursday, March 13: 9:00 - 9:50 Javier Esparza: A Brief History of Strahler Numbers (I) - Invited Tutorial 9:50 - 10:00 Break 10:00 - 11:15 Zeinab Mazadi, Ziyuan Gao, Sandra Zilles: Distinguishing Pattern Languages with Membership Examples Francine Blanchet-Sadri, Andrew Lohr, Sean Simmons, Brent Woodhouse: Computing Depths of Patterns Alexandre Blondin Massé, Sébastien Gaboury, Sylvain Hallé, Michaël Larouche: Solving Equations on Words with Morphisms and Antimorphisms 11:15 - 11:45 Coffee Break 11:45 - 13:00 Anton Cerný: Solutions to the Multi-Dimensional Equal Powers Problem Constructed 
by Composition of Rectangular Morphisms Enrico Formenti, Markus Holzer, Martin Kutrib, Julien Provillard: ω-rational Languages: High Complexity Classes vs. Borel Hierarchy Thomas Weidner: Probabilistic ω-Regular Expressions 13:00 - 14:30 Lunch 14:30 - 15:45 Haizhou Li, François Pinet, Farouk Toumani: Probabilistic Simulation for Probabilistic Data-aware Business Processes Daniel Bundala, Jakub Závodný: Optimal Sorting Networks Etienne Dubourg, David Janin: Algebraic Tools for the Overlapping Tile Product 15:45 - 16:00 Break 16:00 - 17:15 Luca Breveglieri, Stefano Crespi Reghizzi, Angelo Morzenti: Shift-Reduce Parsers for Transition Networks Gadi Aleksandrowicz, Andrei Asinowski, Gill Barequet, Ronnie Barequet: Formulae for Polyominoes on Twisted Cylinders Alexandre Blondin Massé, Amadou Makhtar Tall, Hugo Tremblay: On the Arithmetics of Discrete Figures Friday, March 14: 9:00 - 9:50 Javier Esparza: A Brief History of Strahler Numbers (II) - Invited Tutorial 9:50 - 10:00 Break 10:00 - 11:15 Hanna Klaudel, Maciej Koutny, Zhenhua Duan: Interval Temporal Logic Semantics of Box Algebra Pierre Ganty, Ahmed Rezine: Ordered Counter-Abstraction: Refinable Subword Relations for Parameterized Verification Matthew Gwynne, Oliver Kullmann: On SAT Representations of XOR Constraints 11:15 - 11:45 Coffee Break 11:45 - 13:00 Bernd Finkbeiner, Hazem Torfah: Counting Models of Linear-time Temporal Logic Claudia Carapelle, Shiguang Feng, Oliver Fernández Gil, Karin Quaas: Satisfiability for MTL and TPTL over Non-Monotonic Data Words Joachim Klein, David Müller, Christel Baier, Sascha Klüppelholz: Are Good-for-games Automata Good for Probabilistic Model Checking? 
13:00 Closing ------------------------------------------------------------------------- From hamon at LIMSI.FR Tue Feb 18 21:12:40 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 18 Feb 2014 22:12:40 +0100 Subject: Appel: Special session Interspeech 2014, Speech technologies for Ambient Assisted Living Message-ID: Date: Mon, 17 Feb 2014 17:44:06 +0100 From: François Portet Message-ID: <53023C56.6040902 at imag.fr> X-url: http://www.interspeech2014.org/public.php?page=special_sessions.html#speech-technologies-ambient Dear colleagues, Please find below the announcement of the Interspeech 2014 special session : Speech technologies for Ambient Assisted Living. Special session at Interspeech 2014 Submission deadline: 24th March 2014 Singapore, 14-18 September 2014 http://www.interspeech2014.org/public.php?page=special_sessions.html#speech-technologies-ambient This special session focuses on the use of speech technologies for ambient assisted living, the creation of smart spaces and intelligent companions that can preserve independence and executive function, social communication and security of people with special needs. Currently, speech input for assistive technologies remains underutilized despite its potential to deliver highly informative data and serve as the primary means of interaction with the home. Speech interfaces could replace or augment obtrusive and sometimes outright inaccessible conventional computer interfaces. 
Moreover, the smart home context can support speech communication by providing a number of concurrent information sources (e.g., wearable sensors, home automation sensors, etc.), enabling multimodal communication. In practice, its use remains limited due to challenging real-world conditions, and because conventional speech interfaces can have difficulty with the atypical speech of many users. This, in turn, can be attributed to the lack of abundant speech material, and the limited adaptation to the user of these systems. Taking up the challenges of this domain requires a multidisciplinary approach to define the user's needs, record corpora in realistic usage conditions, develop speech interfaces that are robust to both environment and user's characteristics and are able to adapt to specific users. This special session aims at bringing together researchers in speech and audio technologies with people from the ambient assisted living and assistive technologies communities to meet and foster awareness between members of either community, discuss problems, techniques and datasets, and perhaps initiate common projects. 
Topics of the session will include: * Assistive speech technology * Applications of speech technology (ASR, dialogue, synthesis) for ambient assisted living * Understanding, modelling, or recognition of aged and atypical speech * Multimodal speech recognition (context-aware ASR) * Multimodal emotion recognition * Audio scene and smart space context analysis * Assessment of speech and language processing within the context of assistive technology * Speech synthesis and speech recognition for physical or cognitive impairments * Symbol languages, sign languages, nonverbal communication * Speech and NLP applied to typing interface applications * Language modelling for Augmentative and Alternative Communication text entry and speech generating devices * Deployment of speech and NLP tools in the clinic or in the field * Linguistic resources; corpora and annotation schemes * Evaluation of systems and components. Submission instructions: Researchers who are interested in contributing to this special session are invited to submit a paper according to the regular submission procedure of INTERSPEECH 2014, and to select "Speech technologies for Ambient Assisted Living" in the special session field of the paper submission form. Please feel free to contact the organisers if you have any questions regarding the special session. Organizers: Michel Vacher michel.vacher [at] imag.fr Laboratoire d'Informatique de Grenoble, François Portet francois.portet [at] imag.fr Laboratoire d'Informatique de Grenoble, Frank Rudzicz frank [at] cs.toronto.edu University of Toronto, Jort F. 
Gemmeke jgemmeke [at] amadana.nl KU Leuven, Heidi Christensen h.christensen [at] dcs.shef.ac.uk University of Sheffield ------------------------------------------------------------------------- From hamon at LIMSI.FR Tue Feb 18 21:13:09 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 18 Feb 2014 22:13:09 +0100 Subject: Job: Professor of Computer Science, Aix-Marseille University Message-ID: Date: Sat, 15 Feb 2014 16:10:21 +0100 From: Alexis Nasr Message-ID: <52FF835D.1070901 at lif.univ-mrs.fr> A professorship in computer science (CNU section 27) will open for the next academic year at Aix-Marseille University, in the Laboratoire d'Informatique Fondamentale (CNRS UMR 7279). The successful candidate may either strengthen one of the laboratory's existing research themes by joining one of its seven teams, among them the Traitement Automatique du Langage Ecrit et Parlé (written and spoken language processing) team, or bring in a new theme, provided it is compatible with the research carried out at LIF. A candidate's ability to enrich the technological dimension of the research conducted at LIF will be considered with interest: although most of this research belongs to fundamental computer science, each team also develops more technology-oriented projects in collaboration with industrial partners, an orientation that the laboratory supports. 
Research contacts: jean-marc.tablot at lif.univ-mrs.fr (laboratory director), alexis.nasr at lif.univ-mrs.fr (head of the TALEP team) Teaching contact: denis.lugiez at lif.univ-mrs.fr ------------------------------------------------------------------------- From hamon at LIMSI.FR Sun Feb 23 10:15:10 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sun, 23 Feb 2014 11:15:10 +0100 Subject: Q: Acoustic model, SRAP Message-ID: Date: Wed, 19 Feb 2014 14:59:41 +0100 From: Sahar G Message-ID: Hello everyone, I am looking for information (course material) on acoustic models, in particular the training steps, the data required and the techniques. I would appreciate your help. 
Thanks :) ------------------------------------------------------------------------- From hamon at LIMSI.FR Sun Feb 23 10:14:22 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sun, 23 Feb 2014 11:14:22 +0100 Subject: Job: Junior Linguists (German) for NLP tasks Message-ID: Date: Tue, 18 Feb 2014 23:44:39 +0100 From: Thomas Meyer Message-ID: <-1243038235167481879 at unknownmsgid> ------------------------------------------------------------------------ Apologies for cross-posting ------------------------------------------------------------------------ Junior Linguists [German] Job description: As a Junior Linguist, you will be annotating and reviewing linguistic data in German. You will also contribute to a number of other tasks to improve natural language processing at Google. There are five full-time contract positions available, at 40 hours per week for a period of 12 months. You will be mostly working from home, but in close proximity to our office in *Munich* for training and subsequent intermediary meetings. 
The tasks include: - annotating and reviewing linguistic data - labeling text with semantic, syntactic and morphological features - annotating lexicon entries according to guidelines - deriving NLP data for new and on-going projects Requirements: - native-level speaker of German and fluent in English - degree in Linguistics, Computational Linguistics or similar, with knowledge of semantics, syntax, morphology and/or lexicography - ability to quickly grasp technical concepts; learn in-house tools - keen interest in technology and computer-literate - should have an interest in natural language processing, linguistics, linguistic resources - excellent oral and written communication skills - attention to detail and good organizational skills - ability to work independently from home with reliable internet connectivity Project duration: 12 months For immediate consideration, please e-mail your CV and a cover letter in English (in PDF format) with “Junior Linguists [German]” in the subject line to the following address: E-mail address for applications: AdvancedLinguistics at gmail.com Application deadline: open until filled Contact information: Linne Ha ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN ------------------------------------------------------------------------- From hamon at LIMSI.FR Sun Feb 23 10:27:31 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sun, 23 Feb 2014 11:27:31 +0100 Subject: Seminaire: Olivier Cure, Traces Digitales, 6 mars 2014, Universite de Paris Est 
Marne la Vallee
Message-ID:

Date: Thu, 20 Feb 2014 19:39:19 +0100
From: Nicolas Turenne
Message-ID: <53064BD7.6010103 at yahoo.fr>
X-url: http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

Dear colleagues,

For the sixteenth session of the CorText group's 'traces digitales' (digital traces) seminar, we are pleased to welcome Olivier Curé (Université Paris-Est Marne la Vallée), who will speak about ontologies. The session will take place on Thursday 6 March from 10:00 to 11:30, in meeting room C219 of the Latts (Université Paris-Est Marne la Vallée, rue Galilée, Bois de l’Etang building, 2nd floor, wing C).

See http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

10:00-11:30
Olivier Curé (Maître de Conférences HDR, LIGM, Université Paris-Est Marne-la-Vallée)
Using ontologies in a medical application for the general public (original title: Exploitation d’ontologies dans le cadre d’une application médicale pour le grand public)

Abstract: In this talk, I present recent work carried out during the development of a self-medication application. The application aims to inform the general public about non-prescription drugs and the symptoms associated with them. It offers many features built around several ontologies that exploit Semantic Web technologies, e.g., querying and reasoning. The presentation will highlight our contributions, in particular the development and enrichment of ontologies by induction, and support for a new class of integrity constraints that helps improve data quality.
From hamon at LIMSI.FR Sun Feb 23 10:32:24 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:32:24 +0100
Subject: Seminaire: Cours de Ted Briscoe (Evolutionary linguistics), mars 2014, ENS Ulm, Paris
Message-ID:

Date: Fri, 21 Feb 2014 16:49:00 +0100
From: Thierry Poibeau
Message-Id: <85397E93-8406-46B0-B830-BD838EA30343 at ens.fr>
X-url: http://transfers.ens.fr/index.php/activites/professeurs-invites/495

Dear colleagues,

The Lattice laboratory (http://www.lattice.cnrs.fr/), with the support of the labex Transfers (http://transfers.ens.fr/), is hosting Ted Briscoe (http://www.cl.cam.ac.uk/~ejb1/) of the University of Cambridge for a series of lectures to be given in March at the Ecole normale supérieure (45 rue d’Ulm) on modelling the evolution of languages ("evolutionary linguistics").

The lectures will take place on the following dates and in the following rooms:
- Tuesday 11 March, 10:30 to 12:30: salle Celan
- Thursday 13 March, 10:30 to 12:30: salle R
- Tuesday 18 March, 10:30 to 12:30: salle IHMC
- Thursday 20 March, 10:30 to 12:30: salle R

The programme is detailed below and at the following address: http://transfers.ens.fr/index.php/activites/professeurs-invites/495

Everyone is welcome, subject to available seating.
Best regards,
Thierry Poibeau

From hamon at LIMSI.FR Sun Feb 23 10:25:05 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:25:05 +0100
Subject: Seminaire: Julia Hirschberg, 18 mars 2014, BLRI, Aix-en-Provence
Message-ID:

Date: Thu, 20 Feb 2014 16:28:47 +0100
From: Nadéra Bureau
Message-ID: <00c401cf2e50$72fc1c20$58f45460$@bureau at lpl-aix.fr>

18 March 2014, 11:00
Conference room B011, building B
5 avenue Pasteur, Aix-en-Provence (Labex BLRI)

Julia Hirschberg (Columbia University)
Prosodic and Social Dimensions of Entrainment in Dialogue

Abstract: When people speak together, they often adapt aspects of their speaking style to the style of their conversational partner. This phenomenon goes by many names, including adaptation, alignment, and entrainment, inter alia. In this talk, I will describe experiments on English and Mandarin prosodic entrainment in the Columbia Games Corpus and in the Tongji Games Corpus, large corpora of speech recorded from subjects playing a series of computer games. I will discuss how prosodic entrainment is related to turn-taking behaviors and to several measures of task and dialogue success. I will also discuss experiments relating entrainment to several social dimensions, including likeability and dominance.
This is joint work with Stefan Benus, Agustín Gravano, Ani Nenkova, Rivka Levitan, Laura Willson, and Zhihua Xia.

From hamon at LIMSI.FR Sun Feb 23 10:29:39 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:29:39 +0100
Subject: Job: TEMIS, Poste Consultant en solutions
Message-ID:

Date: Fri, 21 Feb 2014 10:11:50 +0000
From: Sylvie Guillemin-Lanne
Message-ID: <0195B854C753DA4CAE5056A849136D8861119D at EXMBX01.hosting.local>

TEMIS, a software vendor specializing in text mining, is looking for a Text-Mining Solutions consultant for its Projets Innovants (Innovative Projects) team: a computational linguist.

One-year fixed-term contract (CDD), renewable.

You will work within the Projets Innovants team on partnership projects co-funded by the French government or the European Commission. You will spend 50% of your time, under the supervision of the R&D team, on the new features of our products, which you will test and customize for the needs of the Projets Innovants team.

You have a solid linguistics background in document analysis, textual semantics, and text data mining. You have Python/Perl or Java skills that allow you to transform textual data for use in different tools. You enjoy discovering and learning new techniques and tools.
You are independent, enthusiastic, and curious, and you naturally reach out to other people to work out how to carry out your projects. You have a very good level of English.

Please send your CV and a cover letter to Sylvie Guillemin-Lanne, sylvie.guillemin-lanne at temis.com

From hamon at LIMSI.FR Sun Feb 23 10:30:20 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:30:20 +0100
Subject: Soft: L'analyseur multilingue LIMA sous licence libre
Message-ID:

Date: Fri, 21 Feb 2014 13:31:58 +0100
From: Gaël de Chalendar
Message-ID: <1445036.1DoyL0GLaY at brezhoneg.intra.cea.fr>

The multilingual analyzer LIMA released under a free license

CEA LIST announces the release of its linguistic analyzer LIMA under a free license. It is now available under the Affero GPL v3 license (https://github.com/aymara/lima/wiki).

LIMA (Libre Multilingual Analyzer) is a multilingual linguistic analyzer developed over more than 10 years by the LVIC laboratory (Laboratoire Vision et Ingénierie des Contenus) of CEA LIST, under a proprietary model. The software is now available under a dual-licensing model.
LIMA covers all stages of linguistic analysis (morphology, syntax, semantics, entities, coreference) in a modular, highly configurable architecture, with high performance in both speed and analysis quality. LIMA also has the advantage of having been designed from the outset for integration into industrial-grade solutions.

The Libre (free) version is complete, with processing modules and resources for analyzing texts in English and French. In accordance with the Affero GPL v3 license, LIMA may be used freely as long as the software linked to it, whether distributed or run over a network, is itself free software under a compatible license.

The commercial version adds modules and resources for analyzing texts in many additional languages (German, Arabic, Chinese, Spanish, Hungarian, Italian, ...). It is available directly from CEA LIST within R&D partnerships, or from its partner ANT'inno, which offers packages that include software support and the adaptation of linguistic resources to specific needs.

With this free release of its software, CEA LIST aims to provide a linguistic analysis solution that can be used both by researchers in the field and as a text preprocessing tool in open source applications. Adoption of LIMA by a broad community will enable faster and more robust improvement, from both a software and a scientific standpoint.

For users of the Libre version, the benefit is an industrial-grade tool whose open source development is backed by a long-lived institution (CEA). As for the commercial version, users can freely test LIMA and adapt it to their needs before acquiring a commercial license.
Contact: Gaël de Chalendar gael.de-chalendar at cea.fr

From hamon at LIMSI.FR Sun Feb 23 10:28:52 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:28:52 +0100
Subject: Appel: VarDial Workshop, COLING 2014
Message-ID:

Date: Thu, 20 Feb 2014 16:49:25 -0300
From: Marcos Zampieri
Message-ID:
X-url: http://corporavm.uni-koeln.de/vardial/DSL
X-url: http://corporavm.uni-koeln.de/vardial/sharedtask.html

Call For Papers

1st Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects (VarDial) and DSL Shared Task at COLING 2014 in Dublin, Ireland.

VarDial Workshop: http://corporavm.uni-koeln.de/vardial/
DSL Shared Task: http://corporavm.uni-koeln.de/vardial/sharedtask.html

The interest in language resources and computational models for the study of similar languages, varieties and dialects has been growing substantially in the last few years. The VarDial workshop aims to provide a forum for scholars working on a range of topics related to the study of linguistic variation.
We welcome papers dealing with one or more topics of the VarDial workshop:

1) Language Resources and Tools for Varieties and Dialects
- Corpus compilation (comparable, parallel)
- Adaptation of tools (taggers, parsers) for similar languages, varieties and dialects
- Evaluation of LR and tools applied to language varieties
- Reusability of LR in NLP applications (e.g. machine translation)
- Corpus-driven studies in dialectology and language variation

2) NLP Tasks and Applications
- Identification of lexical variation and lexicon induction
- Automatic classification of language varieties
- Domain adaptation between language varieties
- Identification of false friends
- Machine translation between varieties and dialects

Along with the VarDial workshop we will also be organizing the Discriminating between Similar Languages (DSL) shared task. We will release a specific call for participation soon. More information: http://corporavm.uni-koeln.de/vardial/sharedtask.html

Instructions for Authors

Papers should contain a maximum of 8 pages + 2 additional pages for references. The templates are available at the COLING website. For better presentation we strongly recommend that papers be prepared using LaTeX. The review process will be double-blind; submissions should be anonymous.

Important Dates

Paper Submission: May 2nd, 2014
Notification of Acceptance: June 6th, 2014
Camera-Ready Version: June 27th, 2014
Workshop: August 23rd or 24th, 2014

Program Committee

Željko Agić (University of Potsdam, Germany)
Jorge Baptista (University of Algarve and INESC-ID, Portugal)
Francis Bond (Nanyang Technological University, Singapore)
Aoife Cahill (Educational Testing Service, United States)
Paul Cook (University of Melbourne, Australia)
Liviu Dinu (University of Bucharest, Romania)
Stefanie Dipper (Ruhr University Bochum, Germany)
Sascha Diwersy (University of Cologne, Germany)
Tomaž Erjavec (Jozef Stefan Institute, Slovenia)
Mikel L.
Forcada (University of Alicante, Spain)
Binyam Gebrekidan Gebre (Max Planck Institute for Psycholinguistics, Holland)
Nitin Indurkhya (University of New South Wales, Australia)
Jeremy Jancsary (Nuance Communications, Austria)
Marco Lui (University of Melbourne, Australia)
Preslav Nakov (Qatar Computing Research Institute, Qatar)
Santanu Pal (Saarland University, Germany)
Sebastian Padó (University of Stuttgart, Germany)
Reinhard Rapp (University of Mainz, Germany and University of Aix-Marseille, France)
Felipe Sánchez Martínez (University of Alicante, Spain)
Kevin Scannell (Saint Louis University, USA)
Yves Scherrer (University of Geneva, Switzerland)
Serge Sharoff (Leeds University, United Kingdom)
Kiril Simov (Bulgarian Academy of Sciences, Bulgaria)
Elke Teich (Saarland University, Germany)
Joel Tetreault (Nuance Communications, USA)
Francis Tyers (University of Alicante, Spain)
Cristina Vertan (University of Hamburg, Germany)
Torsten Zesch (University of Duisburg-Essen, Germany)

Organizers

Marcos Zampieri (Saarland University, Germany)
Liling Tan (Saarland University, Germany)
Nikola Ljubešić (University of Zagreb, Croatia)
Jörg Tiedemann (Uppsala University, Sweden)

Contact

Workshop: vardialworkshop at gmail.com
Shared Task: dsl.sharedtask at gmail.com

From hamon at LIMSI.FR Sun Feb 23 10:24:01 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb
2014 11:24:01 +0100 Subject: Appel: #di2014 - DIGITAL INTELLIGENCE, Sept. 17-19, Nantes, France Message-ID: Date: Thu, 20 Feb 2014 15:25:32 +0100 From: Frederic Benhamou & Milad Doueihi - #di2014 Co-Chairs Message-Id: <20140220142552.BF4081010443 at smtp.univ-nantes.fr> ============== #di2014 | CALL FOR PAPERS ============= CALL FOR PAPERS – #di2014 International Conference on Digital Intelligence Sept. 17-19, 2014 - Nantes, France www.di2014.org - @DIConf2014 Digital Intelligence 2014 (#di2014) is a new international scientific and interdisciplinary conference dedicated to digital society and cultures. The challenge is to bring together researchers from various disciplines in order to discuss and contribute to shape a new scientific and cultural paradigm. #di2014 will be jointly organized with Scopitone, the main French Festival on digital arts and electronic music (Scopitone 2014, Sept. 16-21, 2014). The program will be built around four main components: plenary invited talks, contributed talks in parallel sessions, artistic sessions, performances and works shared with the Scopitone festival and finally sessions and events related to the (real) economy of digital cultures. 
Program co-chairs: Milad Doueihi, Université Laval (Canada) / Frédéric Benhamou, Université de Nantes (France) Conference co-chairs: Francky Trichet, Université de Nantes (France) / Stéphane Roche, Université Laval (Canada) ---------------------------- INVITED SPEAKERS ---------------------------- - Serge Abiteboul, Académie des Sciences / INRIA / Collège de France, France - Gérard Berry, Académie des Sciences / INRIA / Collège de France, France - Caterina Fake, Co-Founder of Flickr, Findery & Hunch, USA - Bruno Latour, MediaLab, Science Po, France - Carlo Ratti, MIT Senseable City Lab, Massachusetts Institute of Technology, USA - Jeffrey Schnapp, MetaLAB, Harvard University, USA ---------------------------- IMPORTANT DATES ---------------------------- - Submission deadline: April 30, 2014 - Notification of acceptance/rejection: June 16, 2014 - Conference: Sept. 17-19, 2014 ---------------------------- SUBMISSION GUIDELINES ------------------ Papers should be submitted online using this form: https://www.easychair.org/conferences/?conf=di2014 Papers should not exceed 5 pages in LNCS format (http://www.springer.de/comp/lncs/authors.html) and should be submitted in PDF format. The submitted papers (technical or position papers, surveys) must be unpublished and must not be under review elsewhere. 
---------------------------- TOPICS ------------------------------------------- Main areas of interest include, but are not limited to: - Art & Culturally-Aware Information Technology: Digital Art, IT for cultural, natural and scientific Heritage, In-museum Innovative Technology and uses - Data: Big Data, Open Data, Linked Data, Data Journalism, Data Visualization - Digital Organizations: Applications of new technologies to e-Business and e-Administration, e-Commerce, e-Marketing, m-Commerce, m-Marketing, Organizational and management issues - Social Web: Social Network Analysis, Communities of practice / interest in social media and mobile devices, Collective intelligence, Collaborative production and Social Computing, Knowledge Ecosystems, Digital Ecosystems, Economics and social innovation on Digital Ecosystems - Digital Identity: e-reputation (human, product, enterprise, government, etc.), Digital traces and memories, Privacy, Trust, Security and Personal Data Management Systems, Human factors (culture, affect, motivation, cognition) and user centered design in digital technology - e-Learning: Social impact and cultural issues in e-Learning, Policy and organizational issues in e-Learning, e-Universities, e-Schools, e-Learning technologies and tools, e-Learning standards (Open Course Ware, Open Access), open e-Learning : OER, MOOCs, virtual mobility - Green Digital Economy: Green computing, IT for Sustainability, Smart Cities and Homes - Digital Trends and Emerging Practices: Cloud Computing, Emotional Computing, Serious Games, Makers, FabLabs, MediaLabs, LivingLabs, ArtLab, innovative Digital Solution in Health, Tourism, Internet of Things, Digital Addiction Studies - Human-Robot Interaction: Ethical and social issues of HRI, Bio-inspired robotics, Humanoid robotics, Socially intelligent robots, assistive (health & personal care) robotics, Transhumanism - Human-Computer Interaction: Virtual, Augmented and Mixed Reality, Design, User Experience and Usability - 
Digital humanities: Philosophy of the Web, Digital Literature, Digital Literacy

---------------------------- PROCEEDINGS ---------------------------------

The accepted papers will be included in the electronic conference proceedings. Authors of selected conference papers will be invited to submit an extended version of their work to scientific journals or a compilation volume of articles.

---------------------------- NANTES DIGITAL WEEK ---------------------

#di2014 is at the heart of the Nantes Digital Week (Sept. 13-21, 2014), which brings together conferences, workshops, demonstrations, performances and concerts related to Art, Science and Economy. Festive, innovative and hybrid, Nantes Digital Week appeals to a wide audience and will be organized around major events: Startup Weekend, Scopitone festival on Digital Art and Electronic Cultures, FabLab Day, Robotic Day, etc.

From hamon at LIMSI.FR Sun Feb 23 10:18:00 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:18:00 +0100
Subject: Appel: CICE-14
Message-ID:

Date: Thu, 20 Feb 2014 10:05:10 +0000 (GMT)
From: "CICE, cfp"
Message-ID: <998870958.12733.1392890710902.open-xchange at email.1and1.co.uk>
X-url: http://www.ciceducation.org

Apologies for cross-postings. Kindly email this call for papers to your colleagues, faculty members and postgraduate students.
CALL FOR ABSTRACTS, EXTENDED ABSTRACTS, PAPERS, POSTERS AND WORKSHOPS! ============================================ Canada International Conference on Education (CICE-2014) June 16-19, 2014 Cape Breton University, Nova Scotia, Canada www.ciceducation.org ============================================ The CICE is an international refereed conference dedicated to the advancement of the theory and practices in education. The CICE promotes collaborative excellence between academicians and professionals from Education. The aim of CICE is to provide an opportunity for academicians and professionals from various educational fields with cross-disciplinary interests to bridge the knowledge gap, promote research esteem and the evolution of pedagogy. The CICE-2014 invites research papers that encompass conceptual analysis, design implementation and performance evaluation. All the accepted papers will appear in the proceedings and modified version of selected papers will be published in special issues peer reviewed journals. 
The topics in CICE-2014 include but are not confined to the following areas: *Art Education * Music Education * Writing Education * Imaginative Education * Language Education * History *Adult Education * Competitive Skills * Continuing Education * Higher Education * Vocational Education * Transferring Disciplines *Business Education * Educational Administration * Human Resource Development * Academic Advising and Counselling * Education Policy and Leadership * Industrial Cooperation * Life-long Learning Experiences * Workplace Learning and Collaborative Learning * Work Employability * Educational Institution Government Partnership * Patent Registration and Technology Transfer * University Spin-Off Companies *Course Management * Accreditation and Quality Assurance * Academic Experiences and Best Practice Contributions * Copy-right * Digital Libraries and Repositories * Digital Rights Management * Evaluation and Assessment * E-content Management and Development * Open Content * e-Portfolios * Grading Methods * Knowledge Management * Quality processes at National and International level * Security and Data Protection * Student Selection Criteria in Interdisciplinary Studies * User-Generated Content *Curriculum, Research and Development * Acoustics in Education Environment * APD/Listening * Counsellor Education * Courses, Tutorials and Labs * Curriculum Design * ESL/TESL *Educational Foundations * Early Childhood Education * Elementary Education * Geographical Education * Health Education * Home Education * Rural Education * Science Education * Secondary Education * Second life Educators * Social Studies Education * Special Education *Learning / Teaching Methodologies and Assessment * Simulated Communities and Online Mentoring * e-Testing and new Test Theories * Supervising and Managing Student Projects * Pedagogy Enhancement with e-Learning * Educating the Educators * Immersive Learning * Blended Learning * Computer-Aided Assessment * Metrics and Performance 
Measurement * Assessment Software Tools * Assessment Methods in Blended Learning Environments *Global Issues In Education and Research * Education, Research and Globalization * Barriers to Learning (ethnicity, age, psychosocial factors, ...) * Women and Minorities in Science and Technology * Indigenous and Diversity Issues * Government Policy issues * Organizational, Legal and Financial Aspects * Digital Divide * Increasing Affordability and Access to the Internet * Ethical issues in Education * Intellectual Property Rights and Plagiarism *Pedagogy * Teacher Education * Cross-disciplinary areas of Education * Educational Psychology * Education practice trends and issues * Indigenous Education * Kinesiology and Leisure Science * K12 * Life-long Learning Education * Mathematics Education * Physical Education (PE) * Reading Education * Religion and Education Studies *Research Management * Research Methodologies * Academic Research Projects * Joint-research programmes * Research on Technology in Education * Research Centres * Links between Education and Research * New Challenges in Education * ECTS experiences * The Bologna Process and its implementation * Joint-Degree Programmes * Erasmus and Exchange experiences in universities * Students and Teaching staff Exchange programmes *Ubiquitous Learning * Accessibility to Disabled Users * Animation, 3D, and Web 3D Applications * Context Dependent Learning * Distance Education * E-Learning * E-Manufacturing * Educational Technology * Educational Games and Software * Human Computer Interaction * ICT Education * Internet technologies * Learning Management Systems (LMS) * Mobile Applications and Learning (M-learning) * Multi-Virtual Environment * Standards and Interoperability * Technology Enhanced Learning * Technology Support for Pervasive Learning *Ubiquitous Computing * Videos for Learning and Educational Multimedia * Virtual and Augmented Reality * Virtual Learning Environments (VLE) * Web 2.0, Social Networking, Blogs 
and Wikis * Wireless Applications *Research In Progress

Important dates

Research Paper, Extended Abstract, Case Study, Work in Progress and Report Submission Deadline: March 10, 2014
Notification of Paper, Extended Abstract, Case Study, Work in Progress and Report Acceptance Date: March 25, 2014
Final Paper Submission Deadline for Conference Proceedings Publication: April 30, 2014
Workshop Proposal Submission Deadline: March 20, 2014
Notification of Workshop Proposal Acceptance/Rejection: March 30, 2014
Poster/Demo Proposal Submission: March 20, 2014
Notification of Poster/Demo Acceptance: March 30, 2014
Participant(s) Registration (Open): December 01, 2013 to June 15, 2014
Early Bird Registration (Authors and Participants): January 30 to April 15, 2014
Late Bird Registration (Authors only): April 16 to May 18, 2014
Conference Dates: June 16-19, 2014

For further information please visit CICE-2014 at www.ciceducation.org

From hamon at LIMSI.FR Wed Feb 26 09:03:50 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Wed, 26 Feb 2014 10:03:50 +0100
Subject: Appel: Seven SIGIR’14 Workshops on emerging areas in IR
Message-ID:

Date: Wed, 26 Feb 2014 05:00:45 +0000
From: Richi Nayak
Message-ID: <032189CFEE6B094A8CB9DB0838786ADF052740 at ex10mb1.qut.edu.au>
X-url: http://sigir.org/sigir2014/

[Apologies if you receive this more than once]

The workshop program of the SIGIR’14: 37th Annual ACM SIGIR Conference, Gold Coast, Australia, 6-11 July, 2014 will host seven attractive workshops covering novel ideas and emerging areas in IR:

* ERD’14: Entity Recognition and Disambiguation Challenge
http://web-ngram.research.microsoft.com/ERD2014/

The Entity Recognition and Disambiguation Workshop will be organized as a challenge, where participants submit working systems that identify the entities mentioned in text. The challenge will have two tracks, focusing on long and short texts.
All submissions will be evaluated on shared datasets; part of the data will be withheld, to be used for the final evaluation of all submitted systems to determine the winners. Each participating team will be offered a spot at the workshop to present their system. David Carmel, Yahoo! Research Ming-Wei Chang, Microsoft Research Evgeniy Gabrilovich, Google Bo-June (Paul) Hsu, Microsoft Research Kuansan Wang, Microsoft Research * GEAR’14: Gathering Efficient Assessments of Relevance Workshop https://sites.google.com/site/sigirgear/ Evaluation is a fundamental part of Information Retrieval, and in the conventional Cranfield evaluation paradigm, sets of relevance assessments are a fundamental part of test collections. In this workshop, we wish to revisit how relevance assessments can be efficiently created. Potential themes include methods for generating assessments, the process of assessment, effort involved in assessing different materials, exploration of the concept of relevance etc. A discussion and exploration of this issue will be facilitated through the presentation of results based papers and position papers on the topic, as well as a group design activity. Martin Halvey, Glasgow Caledonian University Robert Villa, University of Sheffield Paul Clough, University of Sheffield * MedIR’14: Medical Information Retrieval Workshop http://medir.dcu.ie/ Medical information is accessible from diverse sources including the general web, social media, journal articles, and hospital records; users include patients and their families, researchers, practitioners and clinicians. Challenges in medical information retrieval include: diversity of users and user ability; variations in the format, reliability, and quality of biomedical and medical information; the multimedia nature of data; and the need for accuracy and reliability. 
The aim of the workshop is to bring together researchers interested in medical information search with the goal of identifying specific challenges that need to be addressed to advance the state-of-the-art.
Eiji Aramaki, Kyoto University, Japan
Lorraine Goeuriot, Dublin City University, Ireland
Gareth JF Jones, Dublin City University, Ireland
Liadh Kelly, Dublin City University, Ireland
Henning Müller, University of Applied Sciences Western Switzerland
Justin Zobel, University of Melbourne, Australia

* PIR’14: Privacy-Preserving IR Workshop - When Information Retrieval Meets Privacy and Security http://www.cs.georgetown.edu/~huiyang/sigir2014-pir-workshop/ Information retrieval and information privacy/security are two fast-growing computer science disciplines. There are many synergies and connections between these two disciplines. However, there have been very limited efforts to connect the two. At the same time, owing to the lack of mature techniques in privacy-preserving IR, concerns about privacy and security have become serious obstacles that prevent valuable user data from being used in IR research, such as studies of query logs, social media, tweets, sessions, and medical record retrieval. This privacy-preserving IR workshop aims to spur research that brings together the fields of IR and privacy/security, and to mitigate privacy threats in information retrieval by exploring novel algorithms and tools.
Luo Si (Purdue University, USA)
Grace Hui Yang (Georgetown University, USA)

* SMIR’14: Semantic Matching in Information Retrieval http://smir2014.noahlab.com.hk Recently, significant progress has been made in research on what we call semantic matching (SM), in Web search, question answering, online advertisement, cross language information retrieval, multimedia retrieval, and other tasks. Let us take Web search as an example of the problem.
When comparing the textual content of query and documents, the simple term-based approaches can fail when searcher and author use different terms. A more realistic approach beyond bag-of-words, referred to as semantic matching (SM), is to conduct deeper query and document analysis to encode text with richer representations and then perform query-document matching with such representations. The main purpose of the workshop is to bring together IR and NLP researchers working on or interested in semantic matching, to share latest research results, express opinions on the related issues, and discuss future directions. Julio Gonzalo, UNED, Spain Hang Li, Noah's Ark Lab, Huawei, Hong Kong Alessandro Moschitti, Qatar Computing Research Institute, Qatar Jun Xu, Noah's Ark Lab, Huawei, Hong Kong * SoMeRA’14: Social Media Retrieval and Analysis Workshop http://www.cp.jku.at/conferences/SoMeRA2014/ The SoMeRA 2014 workshop will present and discuss cutting edge research on all topics of retrieval, recommendation, and browsing in social media, as well as on the analysis of user's multifaceted traces in social media. In particular, novel methods and ideas that address challenges such as large quantity and noisiness of user-generated multimedia data, user biases, cold-start problem, or integrating contextual aspects into retrieval and recommendation techniques are highly welcome. The workshop will further foster the exchange of ideas between different communities, in particular it aims at better connecting the multimedia and recommender systems communities with the information retrieval community. The workshop will feature both oral presentations (full papers) and poster/demo presentations (short papers). 
Markus Schedl, Johannes Kepler University, Austria Peter Knees, Johannes Kepler University, Austria Jialie Shen, Singapore Management University, Singapore * TAIA’14: Temporal, social and spatially Aware Information Access Workshop http://research.microsoft.com/en-us/people/milads/taia2014.aspx Users provide an unprecedented volume of detailed, and continuously updated information about where they are, what they are doing, who they are with, and what they are thinking and feeling about their activities. The provision of this stream creates an informal contract between the user and the information access application in which the user will provide the information, but the application must provide results that are contextually relevant. In this workshop we explore spatial and temporal context in dynamic geotagged collections, such as Wikipedia, and traditional news sources, as well as social media sites such as Twitter, Foursquare, Facebook and Flickr. To ground the workshop, and provide a locus for discussion of the two aspects of user context, we focus on event detection and recommendation. Events are a natural theme around which to center discussions of spatial and temporal context because events are defined by their time and place. Fernando Diaz, Microsoft Research Claudia Hauff, Delft University of Technology Vanessa Murdock, Microsoft Maarten de Rijke, University of Amsterdam Milad Shokouhi, Microsoft Please look at the individual websites for the calls, and deadlines — and participate in the discussion on the SIGIR’14 workshop day, on Friday 11 July 2014, in the beautiful scenery of Gold Coast, Queensland, Australia. 
From hamon at LIMSI.FR Wed Feb 26 08:40:10 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:40:10 +0100 Subject: Sujet de these: Humanites numeriques, a pourvoir immediatement, Lattice Message-ID: Date: Tue, 25 Feb 2014 18:28:36 +0100 From: Thierry Poibeau Message-Id: <7C7AC0DD-14E0-4F33-AF35-3021D7B1EE8C at ens.fr>

Hello,

Following a late withdrawal, Lattice has funding available for a three-year doctoral research grant starting in April 2014.

The research topic falls within the "digital humanities" and the automatic analysis of large volumes of textual data (possible topics: moving paper documents to a digital medium in the field of archaeology, natural language processing for the analysis of literary documents, the contribution of natural language processing to the humanities, etc.). The thesis will consider one or two specific case studies: starting from these case studies, the aim is to assess what digital methods can contribute to the humanities and social sciences. Wherever possible, the work will draw on large volumes of data.

Eligibility requirements:
- hold a Master's degree with honours (Master 2 already awarded)
- never have been enrolled in a PhD programme

Desired skills:
- command of natural language processing tools
- computer science skills
- interest in the humanities and social sciences
- good command of French and English
- writing ability

To apply, contact Thierry Poibeau (prenom.nom at ens.fr) by email as soon as possible (and in any case before 4 March 2014), with a brief description of the project / motivation, a CV and a transcript of Master's grades.

Best regards,
Thierry Poibeau
http://www.lattice.cnrs.fr/Thierry-Poibeau

From hamon at LIMSI.FR Wed Feb 26 08:45:19 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:45:19 +0100 Subject: Appel: Atelier FondamenTAL, TALN 2014 Message-ID: Date: Sun, 23 Feb 2014 22:37:58 +0100 From: Marie-Hélène Stéfanini Message-ID: <530A6A36.1010502 at lif.univ-mrs.fr> X-url: http://talep.lif.univ-mrs.fr/FondamenTAL

TALN 2014, 21st conference on Natural Language Processing (Traitement Automatique des Langues Naturelles)

CALL FOR CONTRIBUTIONS

FondamenTAL Workshop
The lexical resources of Jean Dubois and Françoise Dubois-Charlier
Experience reports and projects, and a tribute to Paul Sabatier
1 July 2014, Faculté Saint-Charles, Marseille

Since 2007, the FondamenTAL website (http://talep.lif.univ-mrs.fr/FondamenTAL), created by Paul Sabatier, has allowed the public to learn about the full set of digitized lexical resources of Jean Dubois and Françoise Dubois-Charlier. Many researchers and doctoral students in linguistics and NLP have asked the site http://www.modyco.fr/Dictionnaires to send them the various dictionaries. Others have gone to http://rali.iro.umontreal.ca/ Dubois to use a lookup interface for the Dictionnaire des Verbes Français (LVF). Les Verbes Français (LVF) has already been cited many times in linguistics publications and theses. In addition, two journal issues (Langue Française no. 153 and Langages no. 179-180) and a book chapter (Ressources Lexicales 2013, Gala & Zock eds.) help make the work of Dubois and Dubois-Charlier known. An informal community of users of these resources has formed, and we hope this workshop will bring part of it together.

The workshop has three aims:
- to bring together researchers who have made significant use of the Dubois and Dubois-Charlier resources available so far, and to take stock of their research with them;
- to inform the community of the release of previously unpublished lexical resources, namely: 1) a version of Les Verbes Français (LVF, 26,000 entries) revised by Paul Sabatier; 2) a previously unpublished resource, Locutions Verbales (LVF, 3,510 entries), revised by Paul Sabatier; 3) another previously unpublished resource, the Dictionnaire Electronique des Mots (DEM; 145,000 entries);
- to pay tribute to Paul Sabatier by recalling his last work, which concerned the latest electronic dictionaries of Dubois and Dubois-Charlier.

The Dictionnaire Electronique des Mots (DEM, 145,000 entries) is a lexical database of French that synthesizes these authors' already known dictionaries of simple verbs, verbal locutions and adjectives, extended with all other word categories, both single words and multiword expressions. The authors presented its outline in 2010 in Langages no. 179-180. Because it is exhaustive, rigorously systematic and syntactic-semantic in nature, it should interest many NLP researchers, all the more so since it will have been made available in several formats convenient for automatic processing.

The workshop will therefore be an opportunity not only to discuss work on the already published resources of Dubois and Dubois-Charlier, but also to present the DEM and to detail its properties and modes of use. In addition, the important role Paul Sabatier played in revising these resources will be discussed at length.

Workshop chairs
Denis Le Pesant, Université Paris-Ouest Nanterre La Défense & MoDyCo (CNRS)
Marie-Hélène Stéfanini, Laboratoire d'Informatique Fondamentale (LIF), Aix-Marseille Université & CNRS

Programme committee
Françoise Dubois-Charlier, Université d'Aix-Marseille
Guy Lapalme, Université de Montréal, Laboratoire Recherche Appliquée en Linguistique Informatique (RALI)
Danielle Leeman, Université Paris Ouest Nanterre & ICAR
Denis Le Pesant, MoDyCo (CNRS) & Université Paris Ouest Nanterre La Défense
Max Silberztein, Université de Franche-Comté
Marie-Hélène Stéfanini, Laboratoire d'Informatique Fondamentale (LIF), Aix-Marseille Université & CNRS
Agnès Tutin, LIDILEM & Université Stendhal (Grenoble)

Submission procedure
Step 1: Submission of an abstract of the talk (one page maximum). Abstracts should be sent to denis.lepesant at orange.fr and marie-helene.stefanini at lif.univ-mrs.fr before 21 March 2014. The length of the talks will be announced later and will depend on the number of accepted talks.
Step 2: Submission of a paper by the authors of accepted talks

Important dates
20 March 2014: submission of abstracts
10 April 2014: notification of acceptance or rejection of the talk
1 May 2014: first version of the papers sent to denis.lepesant at orange.fr and marie-helene.stefanini at lif.univ-mrs.fr
13 May 2014: result of the review of the papers
30 May 2014: receipt of the final version of the papers

Papers, in French or English, must follow the format of a TALN 2014 short paper (6 to 8 pages). A LaTeX style file and LibreOffice and Word templates are available on the conference website (http://www.taln2014.org/site/soumission/).

Workshop website: http://www.taln2014.org/site/ressources-lexicales-et-tal-vue-densemble-sur-les-dictionnaires-electroniques-de-jean-dubois-et-francoise-dubois-charlier/

For information: TALN 2014 website, 21st conference on Natural Language Processing (Traitement Automatique des Langues Naturelles): www.taln2014.org

From hamon at LIMSI.FR Wed Feb 26 08:39:57 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:39:57 +0100 Subject: Appel: Natural Language and Computer Science, July 17-18, Vienna Message-ID: Date: Sun, 23 Feb 2014
19:00:06 +0100 From: retore Message-Id: X-url: http://www.indiana.edu/~iulg/nlcs.html Workshop on NATURAL LANGUAGE AND COMPUTER SCIENCE (NLCS '14) 17-18 July, 2014 Vienna, Austria http://www.indiana.edu/~iulg/nlcs.html Endorsed by the Association for Computational Linguistics Special Interest Group on Computational Semantics. Vienna Summer of Logic http://vsl2014.at/ Computer Science Logic - Logic In Computer Science 2014 http://lii.rwth-aachen.de/lics/csl-lics14/ Paper submission deadline: April 1, 2014 AIMS AND SCOPE Formal tools coming from logic, category theory, are important for natural language processing and especially for computational semantics. Moreover, work on these tools borrows heavily from all areas of theoretical computer science. In the other direction, applications having to do with natural language have inspired developments on the formal side. The workshop invites papers on both topics and their applications, as well as on the combination between logical and statistical methods. 
Specific topics include, but are not limited to: * linguistic, computational and logical aspects of the interface between syntax and semantics * logical aspects of linguistic theories * logic for semantics of lexical items, sentences, discourse and dialog * continuations in natural language semantics * formal tools in textual inference, such as logics for natural language inference * applications of category theory in semantics * linear logic in semantics * formal approaches to unifying data-driven (quantitative, statistical) and declarative (logical) approaches to semantics * natural language processing tools using some logic IMPORTANT DATES Paper submission deadline: April 1, 2014 Author notification: May 1, 2014 Electronic versions of papers due: May 15, 2014 Workshop: July 17, 2014 INVITED SPEAKERS TO INCLUDE Anne Abeillé (to be confirmed) Université Paris Diderot Aarne Ranta Chalmers University of Technology and University of Gothenburg SUBMISSIONS Please submit extended abstracts of 4-10 pages using EasyChair: https://www.easychair.org/conferences/?conf=nlcs14 ORGANIZERS Valeria de Paiva, Nuance Larry Moss, Indiana University Christian Retoré, Université de Bordeaux PROGRAM COMMITTEE Robin Cooper, University of Gothenburg Valeria de Paiva, Nuance Christophe Fouqueré, Université Paris 13 Larry Moss, Indiana University Ian Pratt-Hartmann, University of Manchester, UK Christian Retoré, Université de Bordeaux Wlodek Zadrozny, UNC, Charlotte ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la 
liste LN ------------------------------------------------------------------------- From hamon at LIMSI.FR Wed Feb 26 08:49:09 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:49:09 +0100 Subject: Job: Postdoctoral fellowship on computational analysis of parliamentary text Message-ID: Date: Tue, 25 Feb 2014 14:52:58 -0500 From: Graeme Hirst Message-Id: <25687A62-1ED0-40B7-88EB-41F134143A65 at cs.toronto.edu> X-url: http://www.sgs.utoronto.ca/postdoctoralfellows/Pages/Current-Postdocs.aspx The Departments of Political Science and Computer Science of the University of Toronto seek to hire a postdoctoral researcher for a two-year term, beginning 1 July 2014, or as soon as possible. The successful applicant will join an international consortium of researchers focused on the development of computational tools for the analysis of political ideology in the recorded histories of parliamentary debates in Canada, the Netherlands, and the United Kingdom. The successful applicant will have a Ph.D. in a relevant field of specialization (e.g., political science, computational linguistics), as well as experience with the computational analysis of large datasets, especially large corpora of text. Fluency in French and/or Dutch is an asset, but not a requirement. The stipend for this position is $50 000 (CDN) per year, plus an additional 10% benefits package, which is outlined at: http://www.sgs.utoronto.ca/postdoctoralfellows/Pages/Current-Postdocs.aspx. The University of Toronto is situated in downtown Toronto, Ontario, Canada, a large multicultural city in the center of an urban area with a population of more than 5.5 million people. The University of Toronto is within walking distance of all of the amenities that a major metropolitan center provides. 
Applicants should forward a cover letter, a one-page description of their research interests, a curriculum vitae, and the contact information for three referees to both Professor Christopher Cochrane (christopher.cochrane at utoronto.ca) and Professor Graeme Hirst (gh at cs.toronto.edu). Consideration of applications will begin on 1 April 2014, and continue until the position is filled. The University of Toronto is strongly committed to diversity within its community and especially welcomes applications from visible minority group members, women, Aboriginal persons, persons with disabilities, members of sexual minority groups, and others who may contribute to the further diversification of ideas. From hamon at LIMSI.FR Wed Feb 26 08:48:12 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:48:12 +0100 Subject: Appel: COLING 2014 Message-ID: Date: Tue, 25 Feb 2014 11:39:51 +0000 From: John Judge Message-ID: <530C8107.4040402 at computing.dcu.ie> X-url: http://www.coling-2014.org ********** Apologies for cross-posting ********** Second (Main) Call for Papers - Coling 2014 COLING 2014 Dublin, Ireland, 23-29 August, 2014 The International Committee on Computational Linguistics (ICCL) is pleased to announce the 25th International Conference on Computational Linguistics (Coling 2014), at Dublin City University (DCU, Dublin, Ireland). DCU is a young, dynamic and ambitious university with a mission to transform lives and societies through education, research and innovation. Most of the local organisers are from CNGL, Ireland's Centre for Global Intelligent Content (formerly the Centre for Next Generation Localisation), which embodies the leading position of Ireland in the global localisation/internationalisation business, a strong focus on language technologies including machine translation, computational linguistics and natural language processing, as well as on intelligent management, search, retrieval, transformation and adaptation of content. 
Coling will cover a broad spectrum of technical areas related to natural language and computation. The conference will include full papers (presented as oral presentations or posters), demonstrations, tutorials, and workshops. TOPICS OF INTEREST Coling 2014 solicits papers and demonstrations on original and unpublished research on the following topics, including, but not limited to: - pragmatics, semantics, syntax, grammars and the lexicon; - cognitive, mathematical and computational models of language processing; - models of communication by language; - lexical semantics and ontologies; - word segmentation, tagging and chunking; - parsing, both syntactic and deep; - generation and summarisation; - paraphrasing, textual entailment and question answering; - speech recognition, text-to-speech and spoken language understanding; - multimodal and natural language interfaces and dialogue systems; - information retrieval, information extraction and knowledge base linking; - machine learning for natural language; - modelling of discourse and dialogue; - sentiment analysis, opinion mining and social media; - multilingual processing, machine translation and translation aids; - applications, tools and language resources; - system evaluation methodology and metrics. In all relevant areas, we encourage authors to include analysis of the influence of theories (intuitions, methodologies, insights), to technologies (computational algorithms, methods, tools, data), and/or contributions of technologies to theory development. In technologically oriented papers, we encourage in-depth analysis and discussion of errors made in the experiments described, if possible linking them to the presence or absence of linguistically-motivated features. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome. 
PAPER REQUIREMENTS Papers should describe original work; they should emphasise completed work or well-advanced ongoing research rather than intended work, and should indicate clearly the state of completion of the reported results. Wherever appropriate, concrete evaluation results should be included. Submissions will be judged on correctness, originality, technical strength, significance and relevance to the conference, and interest to the attendees. Submissions presented at the conference should mostly contain new material that has not been presented at any other meeting with publicly available proceedings. Papers that are being submitted in parallel to other conferences or workshops must indicate this on the title page, as must papers that contain significant overlap with previously published work. REVIEWING Reviewing will be double blind. It will be managed by an international Conference Program Committee consisting of Program Chairs, members of the Scientific Advisory Board and Area Chairs, who will be assisted by invited reviewers. Important Notice [1] In order to allow participants to be acquainted with the published papers ahead of time which in turn should facilitate discussions at Coling 2014, we have set the official publication date two weeks before the conference, i.e., on August 11, 2014. On that day, the papers will be available online for all participants to download, print and read. If your employer is taking steps to protect intellectual property related to your paper, please inform them about this timing. [2] While submissions are anonymous, we strongly encourage authors to plan for depositing language resources and other data as well as tools used and/or developed for the experiments described in the papers, if the paper is accepted. 
In this respect, we encourage authors then to deposit resources and tools to available open-access repositories of language resources and/or repositories of tools (such as META-SHARE, Clarin, ELRA, LDC or AFNLP/COCOSDA for data, and github, sourceforge, CPAN and similar for software and tools) and refer to them instead of submitting them with the paper, even though it will also be an open possibility (through the START system). The details will be given in the submission site for camera-ready versions of accepted papers. [3] There will be a separate call for demonstrations. Accepted papers on demonstrations will also be included in the proceedings. INSTRUCTIONS FOR AUTHORS For Coling 2014, there will be one category of research papers only. All of the papers will be included in conference proceedings, this time in electronic form only. The maximum submission length is 8 pages (A4), plus two extra pages for references. Authors of accepted papers will be given additional space in the camera-ready version to reflect space needed for changes stemming from reviewers comments. Authors can indicate their preference for presentation mode (i.e. oral or poster presentation) in the submission form, and the reviewers will recommend an appropriate mode of presentation to the program committee which will then decide. There will be no distinction in the proceedings between research papers presented orally vs. as posters. Papers shall be submitted in English, anonymised with regard to the authors and/or their institution (no author-identifying information on the title page nor anywhere in the paper), including referencing style as usual. Authors should also ensure that identifying meta-information is removed from files submitted for review. Papers must conform to official Coling 2014 style guidelines, which are available here http://www.coling-2014.org/doc/coling2014.zip. Submission and reviewing will be managed online by the START system. 
The only accepted format for submitted papers is in Adobe's PDF. Submissions must be uploaded on the START system (https://www.softconf.com/coling2014/main/) by the submission deadlines; submissions after that time will not be reviewed. To minimise network congestion, we request authors to upload their submissions as early as possible. IMPORTANT DATES February 2014: Opening of the submission website https://www.softconf.com/coling2014/main/ March 21, 2014: Paper submission deadline May 9-12, 2014: Author response period May 23, 2014: Author notification June 6, 2014: Camera-ready PDF due August 11, 2014: Official paper publication date August 25-29, 2014: Main conference From hamon at LIMSI.FR Wed Feb 26 08:46:30 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:46:30 +0100 Subject: Appel: ESSLLI 2014 Student Session Message-ID: Date: Tue, 25 Feb 2014 11:01:40 +0100 From: Ronald de Haan Message-ID: X-url: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ Please forward to students. Apologies for the multiple messages. *2nd Call for Papers* *ESSLLI 2014 STUDENT SESSION* Held during the 26th European Summer School in Logic, Language and Information Tübingen, Germany, August 11-22, 2014 *Deadline for submissions: April 1st, 2014* https://www.easychair.org/conferences/?conf=essllistus2014 *ABOUT:* The Student Session of the 26th European Summer School in Logic, Language, and Information (ESSLLI) will take place in Tübingen, Germany on August 11-22, 2014. We invite submissions of original, unpublished work from students in any area at the intersection of Logic & Language, Language & Computation, or Logic & Computation. Submissions will be reviewed by several experts in the field, and accepted papers will be presented orally or as posters and will appear in the student session proceedings by Springer. This is an excellent opportunity to receive valuable feedback from expert readers and to present your work to a diverse audience. 
*SEPARATE POSTER SESSION:* Note that there are two separate kinds of submissions, one for the oral presentations and one for the posters. This means that papers can be directly submitted as posters. Reviewing and ranking will be done separately. We particularly encourage submissions for posters, as they offer an excellent opportunity to present research in progress. *SUBMISSION GUIDELINES:* All authors must be students, and submissions may be singly or jointly authored. Submissions should not be longer than 8 pages for an oral presentation and 4 pages for a poster presentation (including examples and references). Submissions must be anonymous, without any identifying information. More detailed guidelines regarding submission can be found on the Student Session website: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ . *FURTHER INFORMATION:* Please direct inquiries about submission procedures or other matters relating to the Student Session to dehaan at kr.tuwien.ac.at. ESSLLI 2014 will feature a wide range of foundational and advanced courses and workshops in all areas of Logic, Language, and Computation. For further information, including registration information and course listings, and for general inquiries about ESSLLI 2014, please consult the main ESSLLI 2014 page: http://www.esslli2014.info/. 
Kind regards,

The ESSLLI 2014 Student Session Organization Committee

Chair:
- Ronald de Haan (Technische Universität Wien)

LoCo (Logic and Computation) co-chairs:
- Zoé Christoff (Universiteit van Amsterdam)
- Aybüke Özgün (Université de Lorraine)

LoLa (Logic and Language) co-chairs:
- Philip Schulz (Universiteit van Amsterdam)
- Thomas Brochhagen (Universität Düsseldorf)

LaCo (Language and Computation) co-chairs:
- Miriam Kaeshammer (Universität Düsseldorf)
- Ramon Ziai (Universität Tübingen)

From hamon at LIMSI.FR Thu Feb 27 09:38:39 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Thu, 27 Feb 2014 10:38:39 +0100
Subject: Appel: Journee commune IA et TAL, 25 mars 2014
Message-ID:

Date: Thu, 27 Feb 2014 10:31:47 +0100
From: Pierre Zweigenbaum
Message-ID: <530F0603.9050606 at limsi.fr>
X-url: http://www.afia.asso.fr/tiki-index.php?page=Journée+commune+AFIA+-+ATALA+2014

Call for submissions
Call for participation

Artificial intelligence and natural language processing
Language, machine learning, and data mining

Joint AFIA - ATALA day (Journée commune AFIA - ATALA)
Association française pour l'intelligence artificielle
Association pour le traitement automatique des langues

http://www.afia.asso.fr/tiki-index.php?page=Journée+commune+AFIA+-+ATALA+2014

Paris, 25 March 2014
INALCO auditorium

Invited talk: François Yvon
Méthodes statistiques en traitement des
langues: État des lieux et perspectives

THEME

Natural language processing (NLP), which aims to perform tasks that draw on human cognition, has long-standing ties with artificial intelligence (AI). The goal of this day is to strengthen the existing links between the two fields, with an emphasis on two particularly active topics:

* On the one hand, machine learning has come to play a large part in NLP research, as shown by the papers published at the field's major conferences and by survey volumes such as the book by Gaussier and Yvon (2011, 2012). Most of this work relies on supervised learning, with renewed interest in unsupervised learning and in the exploitation of large unannotated corpora.

* On the other hand, data mining is often used to explore corpora and to acquire linguistic knowledge and other resources for NLP; it is also a consumer of NLP in application-oriented text mining.

Both can help meet the challenge of the massive textual data that make up a large share of "big data" and, just as relevantly, help develop methods that advance the field.

We invite talks that present a state of the art or a tutorial on machine learning and data mining in relation to natural language processing. Presentations of this kind on other topics at the interface between NLP and AI will also be considered.
DAY COORDINATORS

Pierre Zweigenbaum & Brigitte Grau (LIMSI-CNRS)

SUBMISSION PROCEDURE

Talk proposals should take the form of a one-page abstract in plain text, sent to: pz at limsi.fr

Submission deadline: 4 March 2014
Notification date: 10 March 2014

The body of the message must contain the following information:

Title of the talk:
Type of talk: state of the art, tutorial
Authors:
Keywords:
Abstract or outline of the talk:

PROGRAMME COMMITTEE (still being completed)

Alexandre Allauzen (LIMSI)
Thierry Charnois (LIPN)
Vincent Claveau (IRISA)
Gaël Dias (GREYC)
Éric Gaussier (LIG)
Brigitte Grau (LIMSI)
Alexis Nasr (LIF)
Isabelle Tellier (LATTICE)
Yannick Toussaint (LORIA)
Mathieu Valette (INALCO)
Pierre Zweigenbaum (LIMSI)

From hamon at LIMSI.FR Sat Feb 1 08:25:24 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:25:24 +0100
Subject: Appel: 15th International Workshop on Non-Monotonic Reasoning (NMR 2014)
Message-ID:

Date: Thu, 30 Jan 2014 10:59:24 +0900
From: Sébastien Konieczny
Message-Id:
X-url: http://www.kr.tuwien.ac.at/nmr14/

CALL FOR PAPERS

15th International Workshop on Non-Monotonic Reasoning (NMR 2014)
http://www.kr.tuwien.ac.at/nmr14/

Vienna, Austria, July 17-19, 2014

Co-located with KR 2014 [http://kr.org/KR2014/], DL 2014 [http://www.dbai.tuwien.ac.at/dl2014/], FLoC 2014, and Logic Colloquium 2014.
NMR 2014 is part of the Vienna Summer of Logic [http://vsl2014.at/]
______________________________________________________________________

* Aims and Scope *

The NMR workshop series is the premier specialized forum for researchers in non-monotonic reasoning and related areas. This will be the 15th workshop in this series.
Its aim is to bring together active researchers in the broad area of non-monotonic reasoning, including belief revision, reasoning about actions, argumentation, declarative programming, preferences, non-monotonic reasoning for ontologies, uncertainty, and other related topics.

NMR will share a joint session with the 27th International Workshop on Description Logics (DL 2014).

* Topics *

NMR 2014 welcomes the submission of papers broadly centered on issues and research in non-monotonic reasoning. We welcome papers of either a theoretical or a practical nature. Topics of interest include (but are not limited to):

- abductive reasoning and diagnosis,
- algorithms and complexity analysis,
- argumentation and dialog,
- answer-set programming,
- belief revision, belief update, and belief merging,
- benchmarks for non-monotonic reasoning,
- declarative programming for non-monotonic reasoning,
- default reasoning,
- empirical studies of reasoning strategies,
- foundations of non-monotonic reasoning,
- hybrid approaches (non-monotonic reasoning combined with other computing paradigms),
- inconsistency handling,
- implementations and systems,
- non-monotonic logics in multi-agent interaction, including negotiation and dispute resolution,
- non-monotonic reasoning for ontologies,
- reasoning and decision making under uncertainty,
- reasoning with preferences,
- representing actions and planning,
- causal reasoning, and
- similarity-based reasoning.

* Tracks *

To focus the different topics of submissions, the workshop comprises the following thematic tracks:

1. Actions, Causality, and Belief Change;
2. Declarative Programming;
3. Argumentation and Dialog;
4. Preferences, Norms, and Trust;
5. NMR and Uncertainty;
6. Commonsense and NMR for Ontologies;

as well as the following special tracks:

7. Systems and Applications;
8. Benchmarks for NMR.
* Systems and Applications Track *

Recent years have witnessed the development of mature solver technology for some NMR-based formalisms and, accordingly, successful real-world applications. This track welcomes papers describing implemented NMR systems as well as papers presenting applications of NMR formalisms and systems. Topics of interest include pure system descriptions (providing information on the basic functionality and usability of the respective systems), the comparison and evaluation of NMR systems, NMR applications in industry and academia, software engineering and modeling methodology aspects, and reports from the field.

* Benchmarks for NMR special track *

The aim of the Benchmarks for NMR special track is to discuss the construction of benchmarks for NMR. Benchmarks have proved useful in a variety of domains for developing efficient algorithms and methods. They are currently insufficiently developed for the main NMR areas. We want to discuss this issue at NMR 2014. Typical questions of interest could be:

- How to obtain benchmarks from real application cases?
- How to build sensible random benchmarks?
- How to export existing benchmarks in some formalism into other domains or formalisms?
- Etc.

Papers related to these issues, descriptions of existing systems of benchmarks, etc., are welcome.

* Submissions *

Papers should be between 4 and 10 pages in AAAI style (http://www.aaai.org/Publications/Author/author.php), including references, figures, and appendixes if any. System descriptions can typically be at the lower end of the page range. Paper submission will be handled electronically through the EasyChair system. Papers must be submitted in PDF only. The submission page is available at

http://www.easychair.org/conferences/?conf=nmr2014

Papers already published at other conferences that may be of interest to an NMR audience are welcome at NMR 2014, provided that the initial publication is mentioned in a footnote on the first page.
Each submission must indicate which of the tracks listed above it is intended for, and whether it constitutes new research or recently published research.

* Proceedings *

There are no formal proceedings for NMR. The accepted papers will be published as a technical report and will be made available in the CoRR Computing Research Repository, see http://arxiv.org/corr/home. The copyright of the papers lies with the authors, and as far as NMR is concerned, they are free to submit to other conferences and workshops as well. Similarly, papers already published can be submitted (but this has to be indicated in the submission).

* Important Dates *

Submission deadline: February 14, 2014
Notification: April 2, 2014
Camera-ready articles due: April 25, 2014
NMR 2014: July 17-19, 2014

* Location *

NMR 2014 will be held at the Vienna University of Technology and is part of the Vienna Summer of Logic, http://vsl2014.at/, which will probably be the largest scientific logic event in known history.

* Workshop Chairs *

Sébastien Konieczny (CNRS, Université d'Artois, France)
Hans Tompits (Vienna University of Technology, Austria)

* Track Chairs *

1. Actions, Causality, and Belief Change
   Renata Wasserman (Universidade de São Paulo, Brasil)
2. Declarative Programming
   Tomi Janhunen (Aalto University, Finland)
3. Argumentation and Dialog
   Paul E. Dunne (University of Liverpool, UK)
4. Preferences, Norms, and Trust
   Mehdi Dastani (Utrecht University, Netherlands)
5. NMR and Uncertainty
   Lluis Godo (Universitat Autonoma de Barcelona, Spain)
6. Commonsense and NMR for Ontologies
   Guilin Qi (Southeast University, China)
7. Systems and Applications
   Esra Erdem (Sabanci University, Turkey)
8. Benchmarks for NMR
   Sébastien Konieczny (CNRS, Université
d'Artois, France)

Email: nmr14 [at] kr [dot] tuwien [dot] ac [dot] at

* Homepage *

http://www.kr.tuwien.ac.at/nmr14/

From hamon at LIMSI.FR Sat Feb 1 08:33:57 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:33:57 +0100
Subject: Stage: Offre de stage SNCF, Extraction d'informations sur les pratiques de mobilite
Message-ID:

Date: Fri, 31 Jan 2014 09:52:57 +0100
From: "REUTENAUER Coralie (SNCF / DIRECTION SI2R / DIR I&R PRODUCTION)"
Message-ID: <17936745830A554B9A3D9540F41B312D02DF5B26 at se3p08bm045.COMMUN.AD.SNCF.FR>

*******************
Title: Information extraction on mobility practices

The Innovation and Research Department (Direction Innovation et Recherche) of SNCF is looking for an intern to work on a project studying passenger mobility through the analysis of textual data.

*Internship activities*
------------------------------
Building a platform for the semantic tagging of textual data, for the analysis of mobility practices.

*Topic*
------------------------------
Over the past few years, society has seen major changes in mobility practices, driven by other forms of work organisation, the emergence of new modes of transport, and so on. Travellers write on the social web about their journeys, both under normal conditions and during disruptions.
These messages contain information about travellers' activities, their sociological characteristics, and their motivations. A semantic analysis based on such criteria is likely to yield better knowledge of behaviours, needs, and expectations. It allows a nuanced and differentiated understanding of mobility.

The aim of the internship is to contribute to setting up a data analysis platform for extracting information on mobility practices.

*Description*
------------------------------
The intern will:
- become familiar with the context of the internship (SNCF, the Innovation & Research Department, the objectives and working framework of the internship, the current situation and the contacts for the topics concerned, ...)
- survey the state of the art of the tools available on the market, analysing their capabilities, advantages, and drawbacks
- set up a management interface for the database (collection, structuring, and interface)
- define and implement methods for the semantic tagging of the data, according to a typology that will be specified beforehand
- exploit the annotation results and propose directions for improvement

Presentations and reports:
- a start-of-internship presentation at SNCF (after one month): context of the internship, work plan, and first results
- a complete final internship report covering the methodology adopted, the work carried out, the results obtained, and the problems encountered...

Two end-of-internship defences: one at the student's school and one at SNCF. Presentations inside or outside SNCF may also be given.

*Profile sought*
------------------------------
Level: five years of higher education (Bac+5) in Natural Language Processing or Computer Science (engineering degree or second-year master's).

Desired skills:
- Analytical, writing, and summarising skills
- Autonomy, interpersonal skills, and presentation skills (oral and written)
- Handling and testing of NLP tools
- Knowledge of NLP and linguistics
- Good computing skills (programming, database management)
- Knowledge of statistics will be appreciated

Bonus:
- Command of at least one programming language (preferably PHP)
- Command of regular expressions and SQL

*Terms of the position*
------------------------------
- Duration: 4 months
- Planned remuneration: internship allowance + SNCF travel pass
- Start: from June 2014
- Location: Paris

Please send a CV and cover letter to Coralie Reutenauer at the following email address: coralie.reutenauer at sncf.fr

From hamon at LIMSI.FR Sat Feb 1 08:21:11 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:21:11 +0100
Subject: Seminaire: Conferences Nick Riemer, Universite Paris Diderot
Message-ID:

Date: Wed, 29 Jan 2014 18:07:32 +0100
From: Thierry Poibeau
Message-Id: <2BD46089-6E29-4586-B4B5-650ADC89F01E at ens.fr>
X-url: http://www.linguist.univ-paris-diderot.fr/linglunch.html

Dear colleagues,

Nick Riemer, Professor at the University of Sydney and visiting researcher at Univ. Paris Diderot, will be staying at the university from 22 January to 22 February 2014, at the invitation of the UFR de Linguistique and the Laboratoire d'Histoire des Théories Linguistiques (UMR 7597, CNRS/Paris Diderot/Sorbonne Nouvelle).

He will give three talks:

- Tuesday 4 February (14:00-16:00): "L'invariance, l'expressivité et le sens : vers une typologie de la signification lexicale"
  talk organised by HTL/UMR7597 and LaTTiCe/UMR8094, ODG room 209

- Thursday 6 February (12:00-13:00): "Six reasons not to believe in the descriptive/expressive distinction"
  Linglunch seminar, UFR Linguistique, ODG room 533
  http://www.linguist.univ-paris-diderot.fr/linglunch.html

- Friday 14 February (13:00-15:00): "La sémantique mentaliste : enjeux épistémologiques"
  HTL/UMR7597 and Labex EFL seminar "Linguistique et théories de l'esprit avant et après l'émergence de la psychologie expérimentale", ODG room 131

Venue: Université Paris Diderot, Bât. Olympe de Gouges (ODG), rue Einstein, 75013 Paris

Best regards,

Thierry Poibeau and Jacqueline Léon

From hamon at LIMSI.FR Sat Feb 1 08:40:44 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:40:44 +0100
Subject: Appel: 4th Workshop on Controlled Languages (CNL 2014)
Message-ID:

Date: Fri, 31 Jan 2014 13:29:44 +0100
From: Camilo Thorne
Message-ID:
X-url: http://attempto.ifi.uzh.ch/site/cnl2014/

Second Call for Papers

*** FOURTH WORKSHOP ON CONTROLLED NATURAL LANGUAGE (CNL 2014) ***

Co-located with COLING 2014
20-22 August 2014
Galway, Ireland

http://attempto.ifi.uzh.ch/site/cnl2014/

Proceedings in Springer LNCS.

---------------------------------------------------------------------

This workshop on controlled natural language (CNL) has a broad scope and embraces all approaches that are based on natural language and apply restrictions on vocabulary, grammar, and/or semantics. This includes (but is certainly not limited to) approaches that have been called simplified language, plain language, formalized language, processable language, fragments of language, phraseologies, conceptual authoring, language generation, and guided natural language interfaces.

Some CNLs are designed to improve communication among humans, especially for non-native speakers of the respective natural language. In other cases, the restrictions on the language are supposed to make it easier for computers to analyze such texts in order to improve computer-aided, semi-automatic, or automatic translations into other languages. A third group of CNLs aims to enable reliable automated reasoning and formal knowledge representation from seemingly natural texts. All these types of CNL are covered by this workshop.
Topics ------ Possible topics for CNL 2014 include: - CNL for knowledge representation - CNL for query interfaces - CNL for specifications - CNL for business rules - CNL for dialogue systems - CNL for machine translation - CNL for improved understandability of texts - CNL for natural language generation - design of CNLs - CNL applications - CNL evaluation - usability and acceptance of CNL - CNL grammars and lexica - multilingual CNLs - reasoning in CNL - spoken CNL - CNL in the context of the Semantic Web and Linked Open Data - CNL in the government - CNL in industry - CNL use cases - theoretical properties of CNL Important Dates --------------- Submission deadline: 14 March 2014 Notification of acceptance: 12 May 2014 Deadline for revised papers: 9 June 2014 Workshop: 20-22 August 2014 Submissions and Proceedings --------------------------- We invite researchers to submit papers with novel contributions in the area of CNL. These research papers should be formatted according to the Springer LNCS format and should not exceed 10 pages. Papers should be submitted via the EasyChair conference system: http://www.easychair.org/conferences/?conf=cnl2014 Accepted papers will be included in the workshop proceedings to be published by Springer in their LNCS/LNAI series and indexed in all major citation databases including ISI Web of Science and Scopus. Venue ----- The workshop takes place in Ireland at INSIGHT at NUI Galway (formerly DERI). 
Organization Committee
----------------------

- Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Kaarel Kaljurand (University of Zurich, Switzerland)
- Tobias Kuhn (ETH Zurich, Switzerland)

Program Committee
-----------------

- Krasimir Angelov (Chalmers University, Sweden)
- Johan Bos (University of Groningen, Netherlands)
- Paul Buitelaar (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Olga Caprotti (University of Helsinki, Finland)
- Rogan Creswick (Galois, USA)
- Danica Damljanovic (University of Sheffield, UK)
- Brian Davis (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Ronald Denaux (iSOCO, Spain)
- Vania Dimitrova (University of Leeds, UK)
- Ramona Enache (Chalmers University, Sweden)
- Esra Erdem (Sabanci University, Turkey)
- Sébastien Ferré (University Rennes 1, France)
- Norbert E. Fuchs (University of Zurich, Switzerland)
- Normunds Grūzītis (University of Latvia)
- Siegfried Handschuh (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Stefan Höfler (University of Zurich, Switzerland)
- Kaarel Kaljurand (University of Zurich, Switzerland)
- Peter Koepke (University of Bonn, Germany)
- Tobias Kuhn (ETH Zurich, Switzerland)
- Hans Leiß
(University of Munich, Germany)
- Reinhard Muskens (Tilburg University, Netherlands)
- Adegboyega Ojo (INSIGHT at NUI Galway, formerly DERI, Ireland)
- Gordon Pace (University of Malta)
- Richard Power (The Open University, UK)
- Laurette Pretorius (University of South Africa)
- Allan Ramsay (University of Manchester, UK)
- Mike Rosner (University of Malta)
- Aarne Ranta (University of Gothenburg, Sweden)
- Rolf Schwitter (Macquarie University, Australia)
- Geoff Sutcliffe (University of Miami, USA)
- Irina Temnikova (University of Wolverhampton, UK)
- Allan Third (The Open University, UK)
- Camilo Thorne (Free University of Bozen-Bolzano, Italy)
- Silvie Spreeuwenberg (LibRT, Netherlands)
- Uta Schwertel (imc, Germany)
- Jeroen Van Grondelle (Be Informed, Netherlands)
- Yorick Wilks (University of Sheffield, UK)
- Adam Wyner (University of Aberdeen, UK)

From hamon at LIMSI.FR Sat Feb 1 08:43:19 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:43:19 +0100
Subject: Appel: IC 2014, 2nd appel a ateliers et tutoriels
Message-ID:

Date: Fri, 31 Jan 2014 18:51:08 +0100
From: Catherine Faron Zucker
Message-ID: <52EBE28C.7050606 at i3s.unice.fr>
X-url: http://www.irit.fr/IC2014

*2nd Call for Workshops and Tutorials of the*

25èmes Journées Francophones d'Ingénierie des Connaissances:
/Pervasive knowledge: users immersed in knowledge-based systems and knowledge-based systems immersed in the world/

http://www.irit.fr/IC2014

12 to 16 May 2014, Clermont-Ferrand

------------------------------------------------------------------------
*Important dates:*

Deadline for submitting workshop and tutorial proposals: 7 February 2014
Response from the programme committee: 11 February 2014
------------------------------------------------------------------------

Workshops are an opportunity to bring together interest groups on a specific subject, to launch a working group on a new theme, and to compare, discuss, and debate different approaches to a shared problem.

Tutorials are an opportunity to train members of the knowledge engineering community in models, methods, techniques, or tools.

The organisation of the IC 2014 conference allows workshops and tutorials to be held on the afternoon of Monday 12 May and on Tuesday 13 May. Several workshops may run in parallel. A workshop may last half a day or a full day.

------------------------------------------------------------------------
*Submissions*

Workshop and tutorial proposals will be submitted to the IC2014 programme committee for its opinion. They must be sent by email to (faron at i3s.unice.fr).

The following information must be provided:
- Title of the workshop
- Full name of the person in charge (contact)
- Email address of the person in charge (contact)
- Steering committee (surname, first name, email address, and web page for each person)
- Text of the workshop's call for papers
- Equipment required
- Duration of the workshop: half a day or one full day
- Preferred slot(s): Monday afternoon / Tuesday morning / Tuesday afternoon

The person in charge of a workshop or tutorial will be responsible for its communication, in particular for creating a website or a page on the conference site, circulating a call for papers, organising a review process for submissions, notifying authors, collecting the final versions of accepted papers, and compiling the workshop proceedings. They must make sure that the organisers and authors register for the workshop and must encourage registration for the conference.

As an indication, the workshop schedule could be:
- Circulation of the call for papers: 12 February 2014
- Submission deadline: 10 March 2014
- Response to authors: 31 March 2014
- Final versions of contributions: 14 April 2014

Keeping to this schedule will make it possible to distribute the workshop proceedings on the USB key given to participants.

------------------------------------------------------------------------
The knowledge engineering conference is held under the aegis of the IC college of AFIA
------------------------------------------------------------------------

*Programme committee chair:*
Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)

*Organising committee chair:*
Catherine Roussey (IRSTEA)

From hamon at LIMSI.FR Sat Feb 1 08:40:00 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:40:00 +0100
Subject: Soft: Public release of the OpenDial toolkit
Message-ID:

Date: Fri, 31 Jan 2014 12:43:38 +0100
From: Pierre Lison
Message-Id: <7D2E2BBB-45C5-4FC9-AED8-60D1ED025321 at ifi.uio.no>
X-url: http://opendial.googlecode.com

Dear all,

I'm pleased to announce the first public release of the OpenDial toolkit!

OpenDial is a Java-based software toolkit that can be used to develop robust and adaptive dialogue systems for various domains. Dialogue understanding, management and generation models are expressed in OpenDial through probabilistic rules encoded in a simple XML format.

You can find more information about the toolkit on the project website:
http://opendial.googlecode.com

The current release contains a completely refactored code base, a large set of unit tests and a full-fledged graphical interface. The website also offers practical examples of dialogue domains and step-by-step documentation on how to use the toolkit.

Feel free to let me know if you have any comments or suggestions,

Pierre Lison (PhD Research Fellow)
Department of Informatics, University of Oslo
Mobile: +47.967.998.12
Web: http://folk.uio.no/plison

From hamon at LIMSI.FR Sat Feb 1 08:23:13 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 1 Feb 2014 09:23:13 +0100
Subject: Appel: Final Call for SIGIR'14 Workshop Proposals (due Feb 3)
Message-ID:

Date: Thu, 30 Jan 2014 09:15:25 +1000
From: Richi Nayak
Message-ID:
X-url: http://sigir.org/sigir2014/

[Apologies if you receive this more than once]

This is the Final Call for SIGIR'14 Workshop Proposals:
- Deadline Monday 3 February 2014 (any time-zone)
- On any topic, in particular novel ideas and emerging areas in IR
- In a workshop format (not just presenting papers)
- Proposals in pdf, ACM style, max. 4 pages
- Submit at (https://www.easychair.org/conferences/?conf=sigir2014workshops)

Full details:

--

Call for Workshop Proposals
37th Annual ACM SIGIR Conference, SIGIR 2014
Gold Coast, Australia, 6-11 July, 2014
http://sigir.org/sigir2014/

* SIGIR 2014 Workshop Program

SIGIR is the major international forum for the presentation of new research results and for the demonstration of new systems and techniques in the broad field of information retrieval (IR). Proposals are solicited for workshops to be held at ACM SIGIR 2014 in Gold Coast, Australia. Workshops will last for one day and will be held on Friday 11 July 2014.

Workshop topics will typically match those identified in the SIGIR 2014 general call for contributions, but proposals concerned with other areas of IR are welcome. SIGIR 2014 workshops will provide a platform for presenting novel ideas and emerging areas in IR, in a less formal and possibly more focused way than the conference itself.

The format of each workshop is to be determined by the organizers, but it is expected that workshops will contain ample time for general discussion and engagement by all participants -- not just those presenting papers.
Workshops that foster collaboration, discussion, group problem-solving and community building initiatives are particularly encouraged. Workshops that only involve the presentation of papers in a "mini conference" format are discouraged. Researchers and practitioners from all areas of IR are invited to submit proposals for review. The organizers of approved workshops will be expected to define the workshop's focus, gather and review submissions, and decide upon final program content. Organizers (including co-organizers) are expected to attend their entire workshop and to provide an article for ACM SIGIR Forum summarizing the event. * Submission Requirements Workshop proposals should include the following information: 1. Title 2. Motivation for the workshop and appropriateness to SIGIR. 3. Theme and purpose of the workshop. 4. Format and planned activities and a tentative schedule of events. 5. Selection process for participants and/or presenters and maximum number of participants (if limited). 6. List of organisers with a short biographical sketch of each organizer, describing relevant qualifications and experience. 7. Names of potential program committee members. 8. Related workshops (if applicable). If the workshop has been held previously at SIGIR or another conference, then the organizers should indicate this and describe briefly past attendance and outcomes, and why another workshop is needed. Proposals should be submitted via the easychair workshop submission webpage (https://www.easychair.org/conferences/?conf=sigir2014workshops) in pdf format and should not exceed 4 pages in ACM style. * Important Dates - Monday 3 February 2014: Workshop proposals due - Monday 10 March 2014: Notification of workshop acceptance [We try to notify 1-2 weeks early!] 
- Sunday 6-11 July, 2014: SIGIR Conference - Friday 11 July, 2014: SIGIR Workshops day * SIGIR Workshop Co-Chairs Jaap Kamps, University of Amsterdam, Email: sigir2014workshops at easychair.org Gabriella Kazai, Microsoft Research and Bing, Email: sigir2014workshops at easychair.org
From hamon at LIMSI.FR Sat Feb 1 08:31:09 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 1 Feb 2014 09:31:09 +0100 Subject: Seminaire: Alpage, Eric de la Clergerie, 7 fevrier 2014 Message-ID: Date: Fri, 31 Jan 2014 08:33:43 +0100 From: Marie Candito Message-ID: ************** Alpage team seminar ********* This is the computational linguistics research seminar organized by Alpage, a joint INRIA - Paris Diderot team specializing in natural language processing. Friday 7 February, 11:00 to 12:30, room 267, Bâtiment Olympe de Gouges, rue Albert Einstein, 75013 Paris (this is the last building on rue Albert Einstein, on the recently inaugurated "place Paul Ricoeur"). Anyone interested is welcome. *********************************************************** Eric de la Clergerie (Alpage, INRIA / Université Paris Diderot) Title: Statistical and hybrid syntactic parsing: two experiments Abstract: I present two parsing experiments whose only common point is that both rely on the DyALog system. The first experiment, begun as part of the SPMRL 2013 evaluation campaign, concerns the rapid development of a statistical Shift-Reduce parser (in the style of MALT), but with beam search and dynamic programming techniques. Its other original feature is that it can take word lattices as input. The second experiment is part of the long-term development of FRMG, a parser based on a wide-coverage symbolic TAG grammar for French. Using weakly supervised learning techniques on the French TreeBank (FTB) has substantially improved the performance of FRMG's disambiguation module. The results of this symbolic/statistical hybridization are on a par with those obtained by purely statistical parsers trained and evaluated on the FTB.
From hamon at LIMSI.FR Sat Feb 1 08:28:11 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 1 Feb 2014 09:28:11 +0100 Subject: Revue: Journal of Language Modelling 1(2) Message-ID: Date: Thu, 30 Jan 2014 10:34:49 +0100 From: Adam Przepiorkowski Message-ID: <87zjmdhkxy.fsf at bach.ipipan.waw.pl> X-url: http://jlm.ipipan.waw.pl/ It is our pleasure to announce the publication of issue 1(2) of the Journal of Language Modelling, a free open-access peer-reviewed journal aiming to bridge the gap between theoretical, formal and computational linguistics: http://jlm.ipipan.waw.pl/ (click on "CURRENT" to see this issue). TABLE OF CONTENTS: "Populating a multilingual ontology of proper names from open sources" Agata SAVARY, Leszek MANICKI, Małgorzata BARON "Grammatical typology and frequency analysis: number availability and number use" Dunstan BROWN, Greville G.
CORBETT, Sebastian FEDDEN, Andrew HIPPISLEY, Paul MARRIOTT "Reactivation of antecedents by overt vs null pronouns: Evidence from Persian" Niloofar KESHTIARI, Shravan VASISHTH "Syntax-driven semantic frame composition in Lexicalized Tree Adjoining Grammars" Laura KALLMEYER, Rainer OSSWALD "External reviewers 2012–2013" The current make-up of the JLM Editorial Board is enclosed below. Best regards, Adam Przepiórkowski (for JLM editors) ====================================================================== EDITORIAL BOARD: Steven Abney, University of Michigan, USA Ash Asudeh, Carleton University, CANADA; University of Oxford, UNITED KINGDOM Chris Biemann, Technische Universität Darmstadt, GERMANY Igor Boguslavsky, Technical University of Madrid, SPAIN; Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, RUSSIA António Branco, University of Lisbon, PORTUGAL David Chiang, University of Southern California, Los Angeles, USA Greville Corbett, University of Surrey, UNITED KINGDOM Dan Cristea, University of Iași, ROMANIA Jan Daciuk, Gdańsk University of Technology, POLAND Mary Dalrymple, University of Oxford, UNITED KINGDOM Darja Fišer, University of Ljubljana, SLOVENIA Anette Frank, Universität Heidelberg, GERMANY Claire Gardent, CNRS/LORIA, Nancy, FRANCE Jonathan Ginzburg, Université Paris-Diderot, FRANCE Stefan Th.
Gries, University of California, Santa Barbara, USA Heiki-Jaan Kaalep, University of Tartu, ESTONIA Laura Kallmeyer, Heinrich-Heine-Universität Düsseldorf, GERMANY Jong-Bok Kim, Kyung Hee University, Seoul, KOREA Kimmo Koskenniemi, University of Helsinki, FINLAND Jonas Kuhn, Universität Stuttgart, GERMANY Alessandro Lenci, University of Pisa, ITALY Ján Mačutek, Comenius University in Bratislava, SLOVAKIA Igor Mel'čuk, University of Montreal, CANADA Glyn Morrill, Technical University of Catalonia, Barcelona, SPAIN Stefan Müller, Freie Universität Berlin, GERMANY Reinhard Muskens, Tilburg University, NETHERLANDS Mark-Jan Nederhof, University of St Andrews, UNITED KINGDOM Petya Osenova, Sofia University, BULGARIA David Pesetsky, Massachusetts Institute of Technology, USA Maciej Piasecki, Wrocław University of Technology, POLAND Christopher Potts, Stanford University, USA Louisa Sadler, University of Essex, UNITED KINGDOM Ivan A. Sag†, Stanford University, USA Agata Savary, Université François Rabelais Tours, FRANCE Sabine Schulte im Walde, Universität Stuttgart, GERMANY Stuart M.
Shieber, Harvard University, USA Mark Steedman, University of Edinburgh, UNITED KINGDOM Stan Szpakowicz, Electrical Engineering and Computer Science, University of Ottawa, CANADA; Institute of Computer Science, Polish Academy of Sciences, Warsaw, POLAND Shravan Vasishth, Universität Potsdam, GERMANY Zygmunt Vetulani, Adam Mickiewicz University, Poznań, POLAND Aline Villavicencio, Federal University of Rio Grande do Sul, Porto Alegre, BRAZIL Veronika Vincze, University of Szeged, HUNGARY Yorick Wilks, Florida Institute of Human and Machine Cognition, USA Shuly Wintner, University of Haifa, ISRAEL Zdeněk Žabokrtský, Charles University in Prague, CZECH REPUBLIC ======================================================================
From hamon at LIMSI.FR Sat Feb 1 08:46:21 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 1 Feb 2014 09:46:21 +0100 Subject: Appel: Medias et bien-etre, discours et representations Message-ID: date: Sat, 01 Feb 2014 07:52:32 +0100 from: "Pascale Vergely" message-id: <612-52ec9980-1-6511a680 at 8997611> Call for papers Médias et bien-être : discours et représentations (Media and well-being: discourse and representations) Following the international conference "Médias et santé publique" organized by the SPH laboratory of the Université Bordeaux in October 2012, the University of Bologna (DIT, Department of Interpreting and Translation, Forlì), in partnership with the universities of Bordeaux and Rome 3 and in collaboration with Dorif Università, wishes to continue the discussion begun there by focusing on the notion of well-being. At once physical, psychological and social, the notion of well-being is central in our complex societies. Its dictionary definition stresses that it is as much a "pleasant sensation produced by the satisfaction of physical needs" and by the "absence of psychological tension" as a "material situation that allows the needs of existence to be met". In this conference we intend to focus on the health-centred dimensions of well-being, whether in nutrition (diets, organic products and cooking, natural products…), sport (fitness gymnastics or physical training) or therapies (spas, unconventional medicine, alternative medicine, paramedical products…). The normative ideal or social injunction to seek a state of "well-being" will also be examined. The conference, to be held in Forlì on 16 and 17 October 2014, aims to bring together researchers from different backgrounds (Linguistics and Language Sciences, Translation Studies, Information and Communication Sciences, Sociology, Philosophy…), as well as health, well-being and media professionals, to compare their thinking on texts and discourse related to well-being. Papers, in French or Italian, may address the following themes (non-exhaustive list): Translation and translation studies: - Translating texts centred on well-being: theoretical questions and translation strategies. - Translation and text genres relating to well-being: what strategies for a wide variety of genres, from scientific texts written by specialists to promotional or popularizing texts?
- Translating texts centred on well-being, between the hard sciences and the humanities: from terminological problems to stylistic issues. - Localization of websites devoted to well-being in different languages and cultures. - What training for translators of texts of such great linguistic, discursive and cultural variety and heterogeneity? Discourse analysis: - Discourse(s) and texts about well-being: what relations hold between the situation of enunciation, interdiscourse and sociodiscursive formation? Which rhetorical strategies are used? - How should the heterogeneity of the media carrying texts and discourse about well-being (advertisements, placards, walls, screens, audiovisual and internet documents…) be handled, and what relations do they have with their surrounding environment? - What links exist between the axis of intentionality (production) and the axis of acceptability (reception-interpretation) for these discourses and texts? - Do scientific discourses transform subjectivities? Popularization between discourse analysis and text linguistics: - Popularization as intralingual translation: what linguistic and semiotic markers (reformulations, qualifying designations, infographics, didactic boxes and illustrations, etc.) appear in corpora about well-being? - Texts on well-being at the crossroads of popularization and promotion: ethos, polyphony and enunciative issues. - Terminological issues and reformulations in popularization texts. - Popularization and text typologies (the different forms in which well-being is presented in the media, specialized publications, magazines, information brochures of spas or alternative medicine centres, etc.) - Popularization strategies and age groups: what differences between texts for adults and those for children?
Schedule 31 March 2014: Deadline for receipt of paper proposals (300-400 words), accompanied by a short bio-bibliographical note. Proposals (in French or Italian) should be sent to: Roberta Pederzoli: r.pederzoli at unibo.it Licia Reggiani: licia.reggiani at unibo.it Laura Santone: laura.santone at uniroma3.it End of May 2014: Notification of acceptance 16 and 17 October 2014: Conference Scientific committee Josiane Boutet (Paris 7), Bruna Donatelli (Un. Roma Tre), Enrica Galazzi (Un. Cattolica, Milan), Chiara Elefante (Un. Bologna), Danielle Londei (Un. Bologna), Olivier Laught (Un. Bordeaux), Sophie Moirand (Un. Paris 3), Paola Paissa (Un. of Turin), Roberta Pederzoli (Un. Bologna), Sandrine Reboul-Touré (Un. Paris 3), Laura Santone (Un. Roma Tre), Pascale Vergely (Un. Bordeaux).
From hamon at LIMSI.FR Sat Feb 1 08:44:46 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 1 Feb 2014 09:44:46 +0100 Subject: Appel: Semantic Sentiment Analysis 2014, workshop ESWC 2014 Message-ID: Date: Fri, 31 Jan 2014 23:44:41 +0100 From: Diego Reforgiato Message-ID: <52EC2759.5050501 at istc.cnr.it> X-url: http://ontologydesignpatterns.org/wiki/SemanticSentimentAnalysis2014 ** apologies for cross-posting ** ============================================ Call for Workshop: Semantic Sentiment Analysis 2014 Semantic Web and Sentiment Analysis ============================================ http://ontologydesignpatterns.org/wiki/SemanticSentimentAnalysis2014 Dates: May 25, 2014 Hashtag: #ssa2014 Venue: Anissaras, Crete, Greece Jointly with the 11th ESWC 2014 (http://2014.eswc-conferences.org) Workshop Chairs: - Aldo Gangemi (U. Paris Nord France/ISTC-CNR Rome Italy) - Harith Alani (KMI-OU Milton Keynes UK) - Malvina Nissim (University of Bologna, Italy) - Erik Cambria (NUS Singapore) - Diego Reforgiato Recupero (ISTC-CNR Catania Italy) Mining opinions and sentiments from natural language is a difficult task as opinions and sentiments are often conveyed implicitly through latent semantics, which make approaches based on surface lexical hints less effective in those cases. To this end, semantic sentiment analysis may go beyond word-level analysis of text, and provide novel approaches to opinion mining and sentiment analysis that allow a more efficient passage from (unstructured) textual information to (structured) machine-processable data, in potentially any domain. Semantic sentiment analysis may benefit from the use of web ontologies (e.g. linked data, multilingual linked lexica, linked vocabularies), or graph-based quasi-ontologies (e.g. ConceptNet, SenticNet, Nell, OIE, etc.)
which enable the aggregation of conceptual and affective information associated with natural language opinions. This workshop focuses on the introduction, presentation, and discussion of novel approaches to semantic sentiment analysis. Special focus will be given to semantic methods, models, and tools that exploit common-sense knowledge bases to perform multi- or open-domain sentiment analysis. The intended audience of the workshop includes researchers from academia and industry as well as professionals and industrial practitioners, who will discuss and exchange positions on new hybrid techniques that use semantics for sentiment analysis. The expected number of participants ranges between 20 and 40. * Topics of Interest * Topics include, but are not limited to: - Ontologies and knowledge bases for sentiment analysis - Topic and entity based sentiment analysis - Evolution of sentiment within and across social media systems and topics - Semantic processing of social media for sentiment analysis - Contextualised sentiment analysis - Comparison of semantic approaches for sentiment analysis - Personalised sentiment analysis and monitoring - Prediction of sentiment towards events, people, organisations, etc. - Baselines and datasets for semantic sentiment analysis - Concept-level sentiment analysis - Biologically-inspired opinion mining * Submission Guidelines * Submissions must comply with the Springer LNCS style and will be made using EasyChair at the following link https://www.easychair.org/conferences/?conf=ssa14. Authors are invited to submit full papers (up to 8 pages, e.g. original work) or short papers (up to 4 pages, e.g. pilot experiments, position statements, short communications). Accepted papers will be published by CEUR-WS. Moreover, selected expanded versions of workshop papers will be invited for a special issue of Springer Cognitive Computation (http://springer.com/journal/12559).
At least one author of each accepted paper must register for both the main conference and the workshop for the paper to be included in the workshop proceedings. * Important Dates * Submission Deadline: March 6, 2014 Notification of Acceptance: April 1, 2014 Camera-Ready Paper: April 15, 2014
From hamon at LIMSI.FR Mon Feb 3 09:08:06 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Mon, 3 Feb 2014 10:08:06 +0100 Subject: Appel: NLDB'2014, Montpellier, France (Extended deadline) Message-ID: Date: Mon, 3 Feb 2014 05:32:23 +0100 From: Mathieu Roche Message-Id: <00AB2166-CA0A-472B-9333-3F5B6CD90005 at lirmm.fr> X-url: http://www.nldb.org/ ============== CALL FOR PAPERS - NLDB'2014 19th International Conference on Application of Natural Language to Information Systems 18-20 June 2014 - Montpellier, France http://www.nldb.org/ Submission deadline: February 10, 2014 [EXTENDED DEADLINE] ============== Since 1995, the NLDB conference has aimed to bring together researchers, people from industry and potential users interested in various applications of Natural Language in the Database and Information Systems field. The integration of databases and natural language was a utopia for many years. However, progress has been made and this is now an established field thanks to developments in Natural Language and technologies that made the storage and manipulation of large electronic dictionaries possible. As Information Systems are now evolving into the communication area, the term databases should be considered in the broader sense of information and communication systems. The use of Natural Language in Software Engineering has contributed to improving both the development process from the viewpoint of developers (improving the process of conceptual modeling, validation, etc.) and the usability of applications by users (natural language query interfaces, semantic webs, etc.). NLDB'2014 will take place in Montpellier (France).
The conference invites researchers from academia and industry to submit papers for oral or poster presentations on recent, unpublished research that addresses theoretical aspects, algorithms, applications, architectures for applied and integrated NLP, resources for applied NLP, and other aspects of NLP, as well as review and discussion papers. Topics of interest include but are not limited to: - Applications of NLP in Information Systems: Multilingual Information Systems, NLP in Requirement Engineering, NLP in Knowledge Management, Semantic Data Integration and Data Cleaning - Social Media and Web Data: Corpus analysis, Language identification, Text normalization, Robust NLP for social media, Text classification, Information Extraction and Sentiment Analysis for social media - Big Data and Natural Language - Semantic Web Open Linked Data: Ontology Learning and Alignment, Populating ontologies, Querying Ontologies and linked data, Semantic tagging and classification, Ontology-driven NLP - Question Answering (QA): NL interfaces to databases, QA using web data, multi-lingual QA, Non-factoid QA (how/why/opinion questions, lists), geographical QA, QA corpora and training sets - Natural language and Ubiquitous Computing: Pervasive Computing, Embedded, Robotic and Mobile Applications. - Natural Language in Conceptual Modeling: Analysis of Natural Language Descriptions, Terminological Ontologies, Consistency Checking, Metadata Creation and Harvesting, Ontology-driven Systems Integration, Ontology Management - NLP Applications: Business Intelligence, Subjectivity and Sentiment Analysis, QA systems, Event Detection, Named Entity and Event Detection, Information Extraction, Summarization, NLP for Data Mining, NLP for Data Warehouses, Plagiarism detection, Identity detection ============== Submission guidelines: Authors should follow the LNCS format and submit their manuscripts in pdf. 
Submissions can be full papers (12 pages maximum including references), short papers (6 pages) or papers for a poster presentation (4 pages) : https://www.easychair.org/conferences/?conf=nldb2014 The programme committee may decide to accept some full papers as short papers or poster contributions. The Proceedings of NLDB 2014 will be published by Springer LNCS Series. We plan to publish extended versions of a selection of the best papers after the conference in the 'Data and Knowledge Engineering Journal' as a special issue. ============== Conference Chair: - Elisabeth Métais, Conservatoire National des Arts et Métiers, Paris, France Programme Committee Chairs: - Mathieu Roche, Cirad, TETIS, France - Maguelonne Teisseire, Irstea, TETIS, France ============== Important dates: - Paper submission: February 10, 2014 [extended deadline] - Notification of acceptance: March 24, 2014 - Camera-ready papers: April 7, 2014 - Conference: 18-20 June 2014
From hamon at LIMSI.FR Tue Feb 4 22:04:20 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 23:04:20 +0100 Subject: Appel: TSD 2014 Message-ID: Date: Tue, 04 Feb 2014 22:30:47 +0100 From: TSD 2014 Message-Id: X-url: http://www.tsdconference.org/ ********************************************************* TSD 2014 - FIRST CALL FOR PAPERS ********************************************************* Seventeenth International Conference on TEXT, SPEECH and DIALOGUE (TSD 2014) Brno, Czech Republic, 8-12 September 2014 http://www.tsdconference.org/ The conference is organized by the Faculty of Informatics, Masaryk University, Brno, and the Faculty of Applied Sciences, University of West Bohemia, Pilsen. The conference is supported by International Speech Communication Association. Venue: Brno, Czech Republic THE SUBMISSION DEADLINES: March 15 2014 ............ Submission of abstracts March 22 2014 ............ Submission of full papers Submission of abstract serves for better organization of the review process only - for the actual review a full paper submission is necessary. TSD SERIES TSD series evolved as a prime forum for interaction between researchers in both spoken and written language processing from all over the world. Proceedings of TSD form a book published by Springer-Verlag in their Lecture Notes in Artificial Intelligence (LNAI) series. TSD Proceedings are regularly indexed by Thomson Reuters Conference Proceedings Citation Index. Moreover, LNAI series are listed in all major citation databases such as DBLP, SCOPUS, EI, INSPEC or COMPENDEX.
TOPICS Topics of the conference will include (but are not limited to): Corpora and Language Resources (monolingual, multilingual, text and spoken corpora, large web corpora, disambiguation, specialized lexicons, dictionaries) Speech Recognition (multilingual, continuous, emotional speech, handicapped speaker, out-of-vocabulary words, alternative way of feature extraction, new models for acoustic and language modelling) Tagging, Classification and Parsing of Text and Speech (morphological and syntactic analysis, synthesis and disambiguation, multilingual processing, sentiment analysis, credibility analysis, automatic text labeling, summarization, authorship attribution) Speech and Spoken Language Generation (multilingual, high fidelity speech synthesis, computer singing) Semantic Processing of Text and Speech (information extraction, information retrieval, data mining, semantic web, knowledge representation, inference, ontologies, sense disambiguation, plagiarism detection) Integrating Applications of Text and Speech Processing (machine translation, natural language understanding, question-answering strategies, assistive technologies) Automatic Dialogue Systems (self-learning, multilingual, question-answering systems, dialogue strategies, prosody in dialogues) Multimodal Techniques and Modelling (video processing, facial animation, visual speech synthesis, user modelling, emotions and personality modelling) Papers on processing of languages other than English are strongly encouraged. 
PROGRAM COMMITTEE Hynek Hermansky, USA (general chair) Eneko Agirre, Spain Genevieve Baudoin, France Paul Cook, Australia Jan Cernocky, Czech Republic Simon Dobrisek, Slovenia Karina Evgrafova, Russia Darja Fiser, Slovenia Radovan Garabik, Slovakia Alexander Gelbukh, Mexico Louise Guthrie, GB Jan Hajic, Czech Republic Eva Hajicova, Czech Republic Yannis Haralambous, France Ludwig Hitzenberger, Germany Jaroslava Hlavacova, Czech Republic Ales Horak, Czech Republic Eduard Hovy, USA Maria Khokhlova, Russia Daniil Kocharov, Russia Ivan Kopecek, Czech Republic Valia Kordoni, Germany Steven Krauwer, The Netherlands Siegfried Kunzmann, Germany Natalija Loukachevitch, Russia Vaclav Matousek, Czech Republic Diana McCarthy, United Kingdom France Mihelic, Slovenia Hermann Ney, Germany Elmar Noeth, Germany Karel Oliva, Czech Republic Karel Pala, Czech Republic Nikola Pavesic, Slovenia Fabio Pianesi, Italy Maciej Piasecki, Poland Adam Przepiorkowski, Poland Josef Psutka, Czech Republic James Pustejovsky, USA German Rigau, Spain Leon Rothkrantz, The Netherlands Anna Rumshisky, USA Milan Rusko, Slovakia Mykola Sazhok, Ukraine Pavel Skrelin, Russia Pavel Smrz, Czech Republic Petr Sojka, Czech Republic Stefan Steidl, Germany Georg Stemmer, Germany Marko Tadic, Croatia Tamas Varadi, Hungary Zygmunt Vetulani, Poland Pascal Wiggers, The Netherlands Yorick Wilks, GB Marcin Wolinski, Poland Victor Zakharov, Russia KEYNOTE SPEAKERS Ralph Grishman, New York University, USA Bernardo Magnini, FBK - Fondazione Bruno Kessler, Italy Salim Roukos, IBM, USA FORMAT OF THE CONFERENCE The conference program will include presentation of invited papers, oral presentations, and poster/demonstration sessions. Papers will be presented in plenary or topic oriented sessions. Social events including a trip in the vicinity of Brno will allow for additional informal interactions. SUBMISSION OF PAPERS Authors are invited to submit a full paper not exceeding 8 pages formatted in the LNCS style (see below). 
Those accepted will be presented either orally or as posters. The decision about the presentation format will be based on the recommendation of the reviewers. The authors are asked to submit their papers using the on-line form accessible from the conference website. Papers submitted to TSD 2014 must not be under review by any other conference or publication during the TSD review cycle, and must not be previously published or accepted for publication elsewhere. As reviewing will be blind, the paper should not include the authors' names and affiliations. Furthermore, self-references that reveal the author's identity, e.g., "We previously showed (Smith, 1991) ...", should be avoided. Instead, use citations such as "Smith previously showed (Smith, 1991) ...". Papers that do not conform to the requirements above are subject to be rejected without review. The authors are strongly encouraged to write their papers in TeX or LaTeX formats. These formats are necessary for the final versions of the papers that will be published in the Springer Lecture Notes. Authors using a WORD compatible software for the final version must use the LNCS template for WORD and within the submit process ask the Proceedings Editors to convert the paper to LaTeX format. For this service a service-and-license fee of CZK 1500 will be levied automatically. The paper format for review has to be either PDF or PostScript file with all required fonts included. Upon notification of acceptance, presenters will receive further information on submitting their camera-ready and electronic sources (for detailed instructions on the final paper format see http://www.springer.de/comp/lncs/authors.html#Proceedings, Sample File typeinst.zip). Authors are also invited to present actual projects, developed software or interesting material relevant to the topics of the conference. The presenters of demonstrations should provide an abstract not exceeding one page. 
The demonstration abstracts will not appear in the conference proceedings.

IMPORTANT DATES
March 15 2014 ............ Submission of abstracts
March 22 2014 ............ Submission of full papers
May 15 2014 .............. Notification of acceptance
May 31 2014 .............. Final papers (camera ready) and registration
August 3 2014 ............ Submission of demonstration abstracts
August 10 2014 ........... Notification of acceptance for demonstrations sent to the authors
September 8-12 2014 ...... Conference date

Submission of abstracts serves only to organize the review process better; the actual review requires a full paper submission. The accepted conference contributions will be published in proceedings that will be made available to participants at the time of the conference.

OFFICIAL LANGUAGE
The official language of the conference is English.

ACCOMMODATION
The organizing committee will arrange discounts on accommodation in the 4-star hotel at the conference venue. The current accommodation prices will be available on the conference website.

ADDRESS
All correspondence regarding the conference should be addressed to
Ales Horak, TSD 2014
Faculty of Informatics, Masaryk University
Botanicka 68a, 602 00 Brno, Czech Republic
phone: +420-5-49 49 18 63
fax: +420-5-49 49 18 20
email: tsd2014 at tsdconference.org
The official TSD 2014 homepage is: http://www.tsdconference.org/

LOCATION
Brno is the second largest city in the Czech Republic, with a population of almost 400,000, and is the country's judiciary and trade-fair center. Brno is the capital of South Moravia, which is located in the south-east part of the Czech Republic and is known for a wide range of cultural, natural, and technical sights. South Moravia is a traditional wine region. Brno has been a Royal City since 1347 and, with its six universities, forms a cultural center of the region.
Brno can be reached easily by direct flights from London, Moscow, and Eindhoven, and by trains or buses from Prague (200 km) or Vienna (130 km). For participants with some extra time, nearby places may also be of interest. Local ones include: Brno Castle, now called Spilberk; Veveri Castle; the Old and New City Halls; the Augustine Monastery with St. Thomas Church and the crypt of the Moravian Margraves; the Church of St. James; the Cathedral of St. Peter & Paul; the Carthusian Monastery in Kralovo Pole; and the famous Villa Tugendhat designed by Mies van der Rohe, along with other important buildings of interwar Czech architecture. For those willing to venture out of Brno, the following are all within easy reach: the Moravian Karst with the Macocha Chasm and Punkva caves, the battlefield of the Battle of the Three Emperors (Napoleon, Alexander of Russia and Franz of Austria - the Battle of Austerlitz), the Chateau of Slavkov (Austerlitz), Pernstejn Castle, Buchlov Castle, Lednice Chateau, Buchlovice Chateau, Letovice Chateau, Mikulov with one of the largest Jewish cemeteries in Central Europe, Telc - a town on the UNESCO heritage list, and many others.
From hamon at LIMSI.FR Tue Feb 4 21:36:02 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:36:02 +0100 Subject: Appel: KI 2014 Message-ID: Date: Mon, 3 Feb 2014 14:39:45 +1100 From: Michael Thielscher Message-Id: <1F4275CC-271B-4C6E-82C8-B0B40944A797 at cse.unsw.edu.au> X-url: http://www.easychair.org/conferences/?conf=ki2014 **** Please accept our apologies if you receive multiple copies of this call **** ====================================================================== KI 2014 - The 37th German Conference on Artificial Intelligence Stuttgart, September 22--26, 2014 ====================================================================== www.ki2014.de KI 2014 is the 37th edition of the German Conference on Artificial Intelligence, which traditionally brings together academic and industrial researchers from all areas of AI, providing a premier forum for exchanging news and research results on theory and applications of intelligent system technology. The technical program of KI 2014 will comprise paper and poster presentations and a variety of workshops and tutorials. KI 2014 will be co-located with Informatik 2014 (Annual Conference of the German Informatics Society) and MATES 2014 (The 12th German Conference on Multi-Agent System Technologies). The conference invites original research papers from all areas of AI, its fundamentals, its algorithms, its history and its applications.
You are invited to submit original research and application papers on all aspects of AI research, including but not limited to the following: - Agent-based and multiagent systems - AI applications and innovations - Cognitive modeling - Commonsense reasoning - Computer vision - Constraint satisfaction, search, and optimization - Evolutionary computation - Game playing and interactive entertainment - Information retrieval, integration, and extraction - Knowledge acquisition and ontologies - Knowledge representation and reasoning - Machine learning and data mining - Multidisciplinary AI - Natural language processing - Planning and scheduling - Robotics - Uncertainty in AI - Web and information systems We especially welcome application papers that provide novel insights on the interplay of AI and the real world, as well as papers that bring useful computational technologies from other areas of computer science into AI. ======== Important Dates ======== Workshop/Tutorial submission: March 15 Full/Short Paper submission: May 1 Acceptance notification: June 23 Final version due: July 4 KI Workshops and Conference: September 22-26 ======== Workshops, Tutorials, Doctoral Consortium ======== Together with the main conference, we plan to organize a small number of high-quality workshops and tutorials. We especially encourage workshops organized by AI Special Interest Groups (GI-Fachgruppen), workshops that bring together researchers from different disciplines, and workshops that highlight emerging topics of AI research. Tutorials should target a large percentage of conference participants, including graduate students as well as experienced researchers, and practitioners. The technical program of KI2014 is also complemented by a doctoral consortium that invites participation by PhD students at any stage and from any subject area within AI. 
The KI doctoral consortium is organized as a joint event with the doctoral program of Informatik 2014, and includes a mentoring program, a PhD workshop, and poster presentations. For details on submitting workshops and tutorials and on applying to the doctoral consortium, please see the KI2014 web page. ======== Submission ======== Submitted papers, which have to be in English, must not exceed 12 pages in Springer LNCS style for full technical contributions and 6 pages for short papers. Full technical papers are expected to report on new research that makes a substantial technical contribution to the field. Short papers are also included in the main proceedings and can report on new research or other issues of interest to the AI community. Examples of work suitable for short paper submissions include: novel ideas whose scope is not large enough for a full paper; important implementation techniques; novel interesting benchmark problems; short experimental studies; interesting applications that are not yet completely solved or analyzed; position or challenge papers; etc. Short paper submissions are especially invited for software demonstrations or PhD work in progress. Submission will be through the EasyChair conference management system: http://www.easychair.org/conferences/?conf=ki2014 Papers will be subject to blind peer review. All papers will be reviewed based on the standard criteria of relevance, significance of results, originality of ideas, soundness, and quality of the presentation. All accepted papers will be published in the main conference proceedings, and will be presented at the conference. At least one author of each accepted paper must register for the conference and present the contribution. The KI 2014 proceedings will be published by Springer as a volume of the LNAI (Lecture Notes in Artificial Intelligence) series.
======== Main Organizers ======== Program Chairs * Carsten Lutz (Universität Bremen) * Michael Thielscher (University of New South Wales) Workshop and Tutorial Chair * Frieder Stolzenburg (Hochschule Harz) Doctoral Consortium Chair * Markus Krötzsch (TU Dresden) For enquiries, please contact ki2014 at easychair.org.
-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA décline toute responsabilité concernant le contenu des messages diffusés sur la liste LN
-------------------------------------------------------------------------
From hamon at LIMSI.FR Tue Feb 4 21:39:13 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:39:13 +0100 Subject: Appel: RECITAL 2014 Message-ID: Date: Mon, 03 Feb 2014 11:52:07 +0100 From: Nuria Gala Message-ID: <52EF74D7.3000409 at lif.univ-mrs.fr>

RECITAL 2014: second call for papers
------------------------------------

RECITAL 2014, 16th Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (meeting of student researchers in computer science for natural language processing). Faculté Saint Charles, Aix Marseille Université (Marseille), July 1-4, 2014.

Important dates
---------------
- Submission deadline: Saturday, April 5, 2014
- Notification to authors: Saturday, May 3, 2014
- Final version: Saturday, May 17, 2014

Overview
--------
RECITAL 2014, the annual young researchers' conference associated with TALN, will take place in Marseille on July 1-4, 2014. RECITAL gives young researchers in natural language processing (NLP; in French, TAL) the opportunity to present their work and compare their approaches.
The conference is open to students (master's and doctoral) and to young researchers who obtained their doctorate less than a year ago. Building on the success of previous years, we encourage the submission of work even at a preliminary stage, thesis projects, and work from the first months of research (state of the art, first leads, etc.). The primary goal of RECITAL is to support the work of young NLP researchers and to ease their integration into our community. To that end, we aim for:
- instructive reviews: authors must be able to understand any errors they have made so that they can correct them and improve the quality of their work;
- positive reviews: it is never necessary to discourage a young researcher; the watchwords should be to encourage and to guide;
- direct exchange: reviews will be sent to the authors signed, and therefore non-anonymous. Authors (or, better, reviewers) are free to discuss them informally at the conference.
The RECITAL best paper award will be presented at the closing ceremony.

Main topics
-----------
Submissions may address the usual NLP topics:
- Analysis and generation in the following areas:
  + Phonetics
  + Phonology
  + Morphology
  + Syntax
  + Semantics
  + Discourse
- Development of linguistic resources for NLP:
  + Databases containing morphological, syntactic, semantic, and/or phonological information
  + Grammars
  + Lexicons
  + Ontologies
  + Corpus linguistics
- NLP applications:
  + Sentiment or opinion analysis
  + Automatic categorization or classification
  + Word sense disambiguation
  + Natural-language human-machine dialogue
  + Computer-assisted teaching
  + Automatic indexing
  + Information retrieval and extraction
  + Automatic summarization
  + Anaphora resolution
  + Question answering systems
  + Machine translation
  + Semantic Web
- Approaches:
  + Formal linguistic approaches intended to support automatic processing
  + Symbolic
  + Logical
  + Statistical
  + Based on machine learning
This list is not exhaustive; the program committee will judge whether a proposed paper fits the conference.

Selection criteria
------------------
Authors must be students or young PhD holders who defended their thesis less than a year ago. Papers co-authored with established researchers (which includes thesis supervisors) must be submitted to TALN, not to RECITAL. Authors are invited to submit original research that has not been published previously. Submissions will be reviewed by at least two specialists in the field. The following will be considered in particular:
- Correctness of the scientific and technical content
- Positioning of the work within international research
- Organization and clarity of the presentation
- Relevance to the conference topics
Selected papers will be published in the conference proceedings. Depending on the program committee's recommendation, presentations will be given either orally or as posters.

Submission procedure
--------------------
Papers should be written in French by French speakers, and in English by those who are not proficient in French. Papers must be 8 to 14 pages long. A LaTeX style file, a Word template and a LibreOffice template will be available on the conference website (www.taln2014.org).
Contact: nuria.gala(arobase)lif.univ-mrs.fr and/or klim.peshkov(arobase)lpl-aix.fr

From hamon at LIMSI.FR Tue Feb 4 21:37:17 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:37:17 +0100 Subject: Appel: 19th Conference on Formal Grammar, FG-2014 Message-ID: Date: Mon, 3 Feb 2014 11:42:11 +0100 (CET) From: Frank Richter Message-ID: X-url: http://fg.phil.hhu.de/ FINAL CALL FOR PAPERS FG-2014: The 19th Conference on Formal Grammar Tuebingen, Germany, August 16-17 2014 http://fg.phil.hhu.de/ Collocated with the European Summer School in Logic, Language and Information Background FG-2014 is the 19th conference on Formal Grammar, to be held in conjunction with the European Summer School in Logic, Language and Information, which takes place in 2014 in Tuebingen, Germany. Previous Formal Grammar meetings were held in Barcelona (1995), Prague (1996), Aix-en-Provence (1997), Saarbruecken (1998), Utrecht (1999), Helsinki (2001), Trento (2002), Vienna (2003), Nancy (2004), Edinburgh (2005), Malaga (2006), Dublin (2007), Hamburg (2008), Bordeaux (2009), Copenhagen (2010), Ljubljana (2011), Opole (2012) and Duesseldorf (2013). Aims and Scope FG provides a forum for the presentation of new and original research on formal grammar, mathematical linguistics and the application of formal and mathematical methods to the study of natural language.
Themes of interest include, but are not limited to, * formal and computational phonology, morphology, syntax, semantics and pragmatics; * model-theoretic and proof-theoretic methods in linguistics; * logical aspects of linguistic structure; * constraint-based and resource-sensitive approaches to grammar; * learnability of formal grammar; * integration of stochastic and symbolic models of grammar; * foundational, methodological and architectural issues in grammar and linguistics; * mathematical foundations of statistical approaches to linguistic analysis. Previous conferences in this series have welcomed papers from a wide variety of frameworks. Submission Details We invite **electronic** submissions of original, 16-page papers (including references and possible technical appendices). Authors are encouraged to use the Springer-Verlag LNCS style: http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0 The submission deadline is **February 23 2014**. Papers must be submitted electronically at https://www.easychair.org/conferences/?conf=fg2014 Papers should report original work which was not presented in other conferences. However, simultaneous submission is allowed, provided that the authors indicate other conferences to which the work was submitted in a footnote. Note that accepted papers can only be presented in one of the venues. Submissions will be reviewed anonymously by at least three reviewers. Accepted papers will be published as a volume in the Springer LNCS series, under the FoLLI subline, either separately or jointly with the papers from FG-2015, depending on the number of accepted papers. Important Dates * February 23, 2014: Deadline for paper submission * April 20, 2014: Notification of acceptance * June 1, 2014: Camera ready copies due * August 16-17, 2014: Conference dates Invited Speakers Thomas Icard (Stanford University, USA) Christian Retoré (Université Bordeaux 1, France) Program Committee Alexander Clark (King's College London) Benoit Crabbé
(Université Paris 7, France) Berthold Crysmann (CNRS - LLF, France) Denys Duchier (Université d'Orleans, France) Annie Foret (IRISA - IFSIC, France) Nissim Francez (Technion, Israel) Philippe de Groote (Inria Nancy) Laura Kallmeyer (University of Düsseldorf, Germany) Makoto Kanazawa (National Institute of Informatics, Japan) Greg Kobele (University of Chicago, USA) Valia Kordoni (Humboldt-Universität Berlin, Germany) Wolfgang Maier (University of Düsseldorf, Germany) Stefan Müller (Freie Universität Berlin, Germany) Gerald Penn (University of Toronto, Canada) Christian Retoré (Université Bordeaux 1, France) Manfred Sailer (Goethe University Frankfurt, Germany) Ed Stabler (UCLA, USA) Jesse Tseng (CNRS - CLLE-ERSS, France) Oriol Valentín (Universitat Politècnica de Catalunya, Spain) Standing committee * Glyn Morrill (Universitat Politècnica de Catalunya, Spain) * Reinhard Muskens (Tilburg University, The Netherlands) * Rainer Osswald (Heinrich-Heine-Universitaet Duesseldorf, Germany) * Frank Richter (Heinrich-Heine-Universitaet Duesseldorf, Germany)

From hamon at LIMSI.FR Tue Feb 4 21:39:50 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:39:50 +0100 Subject: Appel: QALD-4 training phase starts now!
Message-ID: Date: Mon, 3 Feb 2014 13:51:53 +0100 (CET) From: Elena Cabrio Message-ID: <743639560.1084915.1391431913412.JavaMail.root at inria.fr> X-url: http://nlp.uned.es/clef-qa/ X-url: http://www.sc.cit-ec.uni-bielefeld.de/qald Dear all, We are very happy to announce QALD-4, the fourth in a series of evaluation campaigns on multilingual question answering over linked data, this time with a strong emphasis on interlinked data and hybrid question answering. In the context of the question answering lab at CLEF 2014 (http://nlp.uned.es/clef-qa/), we offer three open challenges: Task 1: Multilingual question answering over DBpedia Task 2: Biomedical question answering over interlinked data Task 3: Hybrid question answering All relevant information about training and test phase as well as the lab can be found on our website: http://www.sc.cit-ec.uni-bielefeld.de/qald The training phase starts now, and the further schedule is as follows: Release of test data: April 1, 2014 Submission deadline: May 1, 2014 Release of results: May 8, 2014 Workshop: September, 2014 We cordially invite everyone working on question answering over linked data to participate, and look forward to another exciting challenge! With best regards, Christina Unger, Vanessa Lopez, Philipp Cimiano, Elena Cabrio, Axel Ngonga, Sebastian Walter, Corina Forascu From hamon at LIMSI.FR Tue Feb 4 21:40:34 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:40:34 +0100 Subject: Conf: TOTh 2014, Opening Conference Message-ID: Date: Mon, 3 Feb 2014 14:08:44 +0100 From: Luc Damas Message-ID: ------------------------------------------------------ TOTh 2014 - Opening Conference 12-13 June 2014 - www.porphyre.org/toth/ "Ontology meets natural language: What can linguists learn?" 
Christiane Fellbaum - Princeton University The creation of large lexical databases for Natural Language Processing has renewed interest in the structure of the lexicon and challenged many linguists' theories of systemic regularities in the face of empirical investigation. The need for mapping lexicons across languages and research in automatic inferencing and reasoning have resulted in fruitful collaboration between lexicology and the neighboring discipline of ontology and have afforded new perspectives on the lexicon. My talk will address the following questions and provide some answers based on recent research. * To what extent can a formal ontology be mapped onto the lexicon of a natural language and reflect both its surface and its hidden structure? * How much lexical semantics can ontology capture, and how useful is an ontological perspective of the lexicon for word sense disambiguation? * How can ontology inform the construction of lexical databases for automatic reasoning, e.g., by introducing categorial distinctions such as between types and roles? --------------------------------------------------- From hamon at LIMSI.FR Tue Feb 4 21:42:59 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:42:59 +0100 Subject: Job: Research Assistant in Text Simplification, UPF/TALN/, Barcelona Message-ID: Date: Mon, 3 Feb 2014 18:22:51 +0100 From: Horacio Saggion Message-ID: X-url: http://www.dtic.upf.edu/~hsaggion/ TALN Research Group, Department of Information and Communication Technologies (DTIC), Universitat Pompeu Fabra (UPF), Barcelona, Spain Applications are invited for a 12-month research position in the area of text simplification. Funding for the position arises from a new collaborative European project. The research will be carried out at the Natural Language Processing research group (TALN) in Barcelona under the supervision of Dr. Horacio Saggion as PI.
*Brief Description of Work* The work to be carried out involves the development/adaptation of a text simplification system for the English language and the improvement of technology already developed in the Simplext project. *Qualifications:* Applicants should have: *** an MSc or PhD in computer science, artificial intelligence, or computational linguistics *** Knowledge of machine-learning and rule-based approaches *** Knowledge of Java is a plus *Additionally:* *** Knowledge of the GATE system would be appreciated *** Highly motivated individual with experience or interest in the area of automatic text simplification and/or summarization. *Language requirements:* *** good knowledge of English *Starting date:* The position is expected to start in early March/April 2014. *Educational level required:* MSc or PhD. *Salary and duration of the contract* We offer a gross salary of EUR 20,000 - EUR 22,000, which can be complemented with teaching if desired. The duration of the contract is 12 months. The position can be extended depending on available funds. *How to apply:* Send a cover letter, curriculum vitae, and the names and contact information of at least two referees to horacio.saggion at upf.edu. Use "Research Position Simplification/NLP" as the subject of your message. *Selection process* The selection process will start immediately.
From hamon at LIMSI.FR Tue Feb 4 21:41:37 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:41:37 +0100 Subject: Appel: FLAIRS-27, Call for Poster Abstracts (Submissions Due February 10, 2014) Message-ID: Date: Mon, 3 Feb 2014 14:49:45 +0100 From: Christophe Jouis Message-Id: <1050D157-C8B8-43C7-9F45-F6DC9D33C4E7 at lip6.fr> X-url: http://www.FLAIRS-27.info ======================================================================= The 27th International Conference of the Florida Artificial Intelligence Research Society (FLAIRS-27) Poster abstract submission deadline: February 10, 2014 www.FLAIRS-27.info May 21-23, 2014 Pensacola Beach, Florida, USA Hilton Pensacola Beach Gulf Front http://hiltonpensacolabeach.com/ ======================================================================= FLAIRS-27 continues a tradition of presenting and discussing state of the art artificial intelligence and related research in a sociable atmosphere within a beautiful setting. Events will include invited speakers, special tracks, and presentations of papers and posters. As always, there will be a Best Paper award and a Best Poster award. In addition, because FLAIRS has a rich tradition of encouraging student authors, there will be a Best Student Paper award for the best paper written primarily by a student. A second submission round of poster abstracts (250 words) for presentation at the conference is now invited. 
Topics of interest are in all areas of (or related to) artificial intelligence, including, but not limited to Foundations: * Cognitive modeling * Knowledge representation * Learning * Perception * Reasoning and programming * Search Architectures: * Agents and distributed AI * Information retrieval * Intelligent user interfaces * Natural language systems * Robotics Applications: * Aviation and aerospace * Education * Entertainment * Management and manufacturing * Medicine * World Wide Web Implications: * Evaluation of AI systems * Philosophical foundations * Social impact and ethics * Teaching AI Special Areas: * AI and Cyber Security * Culture-Aware AI Systems * AI in Business * Semantics, Logic and Information Extraction in AI * Affective Computing * Intelligent Tutoring Systems * Games and Multimedia * Data Mining * Case-Based Reasoning * Uncertain Reasoning * Intelligent Autonomous Systems * Robotics * Healthcare Informatics * Applied Natural Language Processing ============= Location ============= The 27th International Conference of FLAIRS takes place on the beachfront in the center of everything on Pensacola Beach, Florida! Enjoy captivating views of the emerald waters of the Gulf of Mexico and 52 miles of snow white sand beaches from your private Gulf Front balcony. Our hotel is within walking distance from restaurants, vibrant nightlife and recreational activities such as charter fishing, paddle boarding, sailing, kayaking, biking, and miniature golf. We are just a quick drive to historic downtown Pensacola and regional attractions such as the National Naval Aviation Museum, Gulf Islands National Seashore and Fort Pickens, Pensacola Lighthouse, Blue Wahoos Baseball Stadium, Veterans Memorial Park, Pensacola Civic Center, and the Saenger Theater. 
============= Important Dates ============= February 10, 2014 Second submission (Abstract only) February 17, 2014 Abstract only notification February 24, 2014 Camera ready version due ============= Submissions ============= We are offering a second round for submission of poster abstracts (up to 250 words) to give new and developing research an outlet for peer feedback. Each submitted poster abstract will be subject to an editorial review to determine acceptability. Editors can make spelling, grammatical, and formatting changes to the abstracts. All posters will be presented according to space and time availability. An author of each accepted poster abstract is required to register, attend, and present the poster. It is a requirement of FLAIRS that there will be at least one early regular author registration per submission. Submissions must be original, and not submitted concurrently to another conference. Abstract papers must be in AAAI format, and submitted as PDF through the EasyChair conference system, which can be accessed through the main conference web site (www.FLAIRS-27.info). The proceedings of FLAIRS-27 will be published by the AAAI. Authors of accepted papers will be required to sign a form transferring copyright of their contribution to AAAI. An author of each accepted paper is required to register, attend, and present the poster at FLAIRS. All submissions and registrations must comply with FLAIRS requirements as posted on the conference website. ============= ORGANIZATION ============= Conference Chair: G. 
Michael Youngblood, PARC, a Xerox company Program Chairs: Bill Eberle, Tennessee Technological University Chutima Boonthum-Denecke, Hampton University Special Tracks Coordinator: Ingrid Russell, University of Hartford Contact Bill Eberle: weberle at tntech.edu

From hamon at LIMSI.FR Tue Feb 4 21:46:39 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:46:39 +0100 Subject: Appel: Hackathon Participation, WASSA 2014@ACL 2014 Message-ID: Date: Tue, 04 Feb 2014 11:33:06 +0100 From: Alexandra Balahur Dobrescu Message-id: <52F0C1E2.4060806 at jrc.ec.europa.eu> X-url: http://optima.jrc.it/wassa2014/

Apologies for cross-postings

5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA 2014)
http://optima.jrc.it/wassa2014/
- Endorsed by SIGNLL and SIGSEM -

Sentiment analysis systems hackathon

Given that many different sentiment analysis systems have been proposed and showcased in recent years, we feel there is a growing need to familiarize other researchers and users with these systems and to have them use the systems to build an end application. The word "hackathon" stands for "Hacking Marathon"; its purpose is to introduce a system and have users employ it to build an application for a real-life task. We would like to invite researchers who have a working prototype of their system to announce their availability to participate in the WASSA 2014 Hackathon.
The systems can be oriented to the following tasks:
- The use of semantic resources and methods (knowledge bases, semantic representations, inference mechanisms) for subjectivity, sentiment and emotion analysis
- Opinion retrieval, extraction, categorization, aggregation and summarization
- Trend detection in social media using subjectivity and sentiment analysis techniques
- Data linking through social networks based on affect-related NLP methods
- Mass opinion estimation based on NLP and statistical models
- Online reputation management
- Subjectivity, sentiment and emotion detection in social networks
- Classification of stance in dialogues
- Subjectivity and sentiment analysis in social media
To participate with a system or as a "hacker" of one of the participating systems, please send an e-mail to: Alexandra.Balahur at jrc.ec.europa.eu In order to speed up the process, we will put the "hackers" in contact with the system authors, so that the applications can be explored prior to the WASSA 2014 workshop. We will also invite papers describing the results of these experiments as part of the final workshop. We are looking forward to your participation! On behalf of the WASSA 2014 Organizers, Alexandra Balahur

From hamon at LIMSI.FR Tue Feb 4 21:48:07 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:48:07 +0100 Subject: Job: Recherche un ingenieur d'etudes, ELDA Message-ID: Date: Tue, 04 Feb 2014 16:29:09 +0100 From: ELRA ELDA Information Message-ID: <52F10745.8010207 at elda.org> X-url: http://www.elra.info X-url: http://www.elda.org

ELDA (European Language Resources Distribution Agency) is seeking to fill a research and development engineer position immediately. He/she will be in charge of studies and technical development in the field of language engineering, in particular speech, multimedia and multimodal technologies.
Au sein du d?partement R&D, il/elle travaillera sur des projets de d?veloppement d'outils pour la production, la validation et le management de ressources linguistiques dans le cadre de projets collaboratifs et internationaux. Ce poste est une excellente opportunit? pour des candidats cr?atifs et motiv?s, qui souhaitent participer activement ? la construction du domaine de l'ing?nierie de la langue dans l'Union europ?enne. Profil : * De formation sup?rieure (Ing?nieur ou Docteur) en traitement automatique de la parole ou du multim?dia/multimodales (reconnaissance/synth?se de la parole, traitement d'images/vid?os, analyse de documents/OCR, etc.). * Exp?rience et/ou connaissance des programmes de recherche en Europe, aux Etats-Unis ou au Japon. * Exp?rience en gestion de projet, Implication dans la gestion de projets europ?ens serait un plus significatif. * bonne maitrise de l'environnement de programmation Django, de Django-CMS, Python, PhP serait un plus. * Exp?rience et/ou connaissance des questions relatives aux ressources linguistiques et ? l'?valuation des technologies de la langue. * Aptitude ? travailler tant de fa?on ind?pendante qu'au sein d'une ?quipe, en particulier aptitude ? superviser les membres d'une ?quipe multidisciplinaire. * Anglais courant exig?. Salaire: en fonction des qualifications et exp?riences. Toute candidature sera ?tudi?e jusqu'? ce que le poste soit pourvu. Le poste est bas? ? Paris. Les candidats doivent avoir la citoyennet? d'un pays de l'Union europ?enne ou ?tre en possession d'un titre de s?jour. Les candidatures doivent ?tre adress?es par courriel (lettre de motivation et curriculum vitae) ? : Khalid CHOUKRI ELRA / ELDA 55-57, rue Brillat Savarin 75013 Paris FRANCE Fax : +33 1 43 13 33 30 courriel : job at elda.org ELDA, unit? 
op?rationnelle de l'association ELRA, est charg?e de promouvoir le d?veloppement de ressources linguistiques sous toutes les formes ?lectroniques utilisables, en particulier sous la forme de corpus oraux et ?crits, de lexiques et de bases terminologiques. Depuis sa cr?ation en 1995, ELDA s'est affirm?e comme un centre unique en Europe pour la distribution de ressources linguistiques, capable de r?pondre aux divers besoins des d?veloppeurs de technologie. Ses activit?s se d?veloppent maintenant vers de nouveaux types de ressources linguistiques (donn?es de type multimodal/multim?dia). Certaines ressources linguistiques sont con?ues au cours de projets (co-)financ?s par ELDA. Celles-ci sont ensuite compil?es sous la forme d'un catalogue de ressources linguistiques. ELDA est impliqu?e dans un certain nombre de projets europ?ens et nationaux. ELDA s'int?resse ?galement aux probl?mes juridiques en rapport avec les ressources linguistiques, r?alise r?guli?rement des ?tudes de march? sur les besoins des utilisateurs, et travaille ? l'am?lioration des proc?dures de validation des ressources. Pour de plus amples renseignements concernant ELRA/ELDA, voir: http://www.elra.info ou www.elda.org ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA d?cline toute responsabilit? 
concernant le contenu des messages diffus?s sur la liste LN ------------------------------------------------------------------------- From hamon at LIMSI.FR Tue Feb 4 21:44:50 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 4 Feb 2014 22:44:50 +0100 Subject: Appel: Roadmapping Workshop on Linked Data for Language Technology - #LiderEU Message-ID: Date: Tue, 4 Feb 2014 09:24:51 +0100 From: Maud Ehrmann Message-ID: X-url: http://2014.data-forum.eu/ X-url: http://lider-project.eu/ X-url: http://www.multilingualweb.eu/ X-url: http://nlp2rdf.org X-url: http://linguistics.okfn.org/ X-url: http://dbpedia.org Dear all, I would like to draw your attention to the Linked Data for Language Technology (LD4LT) workshop (http://www.w3.org/community/ld4lt/wiki/LD4LT_Group_Kick-Off_and_Roadmap_Meeting) that will be organized as part of the European Data Forum 2014 (http://2014.data-forum.eu/). The goal of the workshop is to gather use cases and requirements as a basis to develop a European R&D road-map and agenda for content analytics solutions based on Linked Data. It is a free community event - there is no admission fee for participants, but registration is required (https://www.w3.org/2002/09/wbs/1/lider-rmws-20140321/). You are encouraged to provide a title for a position statement in your registration form. This is a simple, short statement that summarizes your ideas / technologies / use cases related to Linked Data and Language Technology. The meeting is supported by the LIDER (http://lider-project.eu/) project, the MultilingualWeb community (http://www.multilingualweb.eu/), the NLP2RDF project (http://nlp2rdf.org), the Working Group for Open Data in Linguistics (http://linguistics.okfn.org/) as well as the DBpedia Project (http://dbpedia.org). As input to the discussion and the work of the LD4LT group, you may also want to fill in the first LIDER survey (https://www.w3.org/2002/09/wbs/1/ld4lt-wbs1/) . We hope to see you at the workshop. 
Best Regards,
Maud Ehrmann, on behalf of the LIDER consortium.

From hamon at LIMSI.FR Sat Feb 8 11:10:30 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:10:30 +0100
Subject: Stage: Development of linguistic resources to improve an information extraction tool
Message-ID:

Date: Thu, 6 Feb 2014 15:28:38 +0100
From: Dhouha Bouamor
Message-ID:

Trooclick France is a company that specializes in the development of web applications for the automatic processing of information. Our goal is to create services that rebuild the user's trust in digital content. Up to now, Web players were able to enhance the relevance of this content; we go a step further and contribute to improving its reliability.

Trooclick was created in November 2012. Just a few months later, in April 2013, it received financial support from the BPI (French public investment bank), and in June 2013 the French government granted it the status of "Young Innovative Company" (JEI), recognizing its innovative nature. It now counts twelve committed and passionate members in its tight-knit team. The company carries out R&D projects in search of technical solutions in the Artificial Intelligence field. Due to its growth, Trooclick is now looking for candidates for a 6-month internship at its office in Paris (17th arrondissement).

Missions:
As a member of the technical team, you will benefit from ongoing training and you will help us design and build our information extraction framework based on advanced NLP technologies. You will turn ideas into well-documented and reliable linguistic resources (both dictionaries and extraction rules) to ensure efficiency, quality, performance and scalability. A great team player, you will interact with other departments to understand and fine-tune specifications. You will carry out unit testing, create and maintain our test validation corpus and participate in editing technical documents. All developments will be done in English.

Qualifications:
- BSc/MSc
- Experience with NLP tools such as GATE, TreeTagger, NooJ, Stanford for linguistic annotation, named entity recognition, relationship and fact extraction, sentiment analysis, etc.
- Experience in scripting languages such as Perl or Python, as well as the XML format, to be autonomous in completing some technical tasks.
- Experience with basic database management operations (SQL language). Knowledge of Semantic Web technologies (RDF, OWL, SKOS, etc.) will be a plus.
- Excellent communication skills in English and French
- We are open to new ideas that will significantly contribute to our success. Our friendly team will provide the opportunity for valuable collaboration.
- We offer you career prospects in a young and dynamic company with an interesting and diversified scope of duties at the cutting edge of research.

We welcome applications from highly motivated individuals able to learn new techniques and share knowledge and experience with the team. Interested? Then send your application to jobs at trooclick.com!

From hamon at LIMSI.FR Sat Feb 8 10:54:13 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 11:54:13 +0100
Subject: Appel: INTERSPEECH 2014
Message-ID:

Date: Thu, 06 Feb 2014 09:56:31 +0800
From: "Organization @ Interspeech 2014"
Message-ID: <52F2EBCF.5070801 at interspeech2014.org>
X-url: http://www.interspeech2014.org

--- INTERSPEECH 2014 - SINGAPORE
--- September 14-18, 2014
--- http://www.interspeech2014.org

INTERSPEECH is the world's largest and most comprehensive conference on issues surrounding the science and technology of spoken language processing, both in humans and in machines. The theme of INTERSPEECH 2014 is

--- Celebrating the Diversity of Spoken Languages ---

INTERSPEECH 2014 emphasizes an interdisciplinary approach covering all aspects of speech science and technology spanning basic theories to applications. In addition to regular oral and poster sessions, the conference will also feature plenary talks by internationally renowned experts, tutorials, special sessions, show & tell sessions, and exhibits. A number of satellite events will take place immediately before and after the conference. Please follow the details of these and other news at the INTERSPEECH website www.interspeech2014.org.
--------- SCOPE ----------- We invite you to submit original papers in any related area, including but not limited to: 1: Speech Perception and Production 2: Prosody, Phonetics, Phonology, and Para-/Non- Linguistic Information 3: Analysis of Speech and Audio Signals 4: Speech Coding and Enhancement 5: Speaker and Language Identification 6: Speech Synthesis and Spoken Language Generation 7: Speech Recognition - Signal Processing, Acoustic Modelling, Robustness, and Adaptation 8: Speech Recognition - Architecture, Search & Linguistic Components 9: LVCSR and Its Applications, Technologies and Systems for New Applications 10: Spoken Language Processing - Dialogue, Summarization, Understanding 11: Spoken Language Processing -Translation, Info Retrieval 12: Spoken Language Evaluation, Standardization and Resources A detailed description of these areas is accessible at: http://www.interspeech2014.org/public.php?page=conference_areas.html ------- SUBMISSIONS ------- Papers for the INTERSPEECH 2014 proceedings should be up to 4 pages of text, plus one page (maximum) for references only. Paper submissions must conform to the format defined in the paper preparation guidelines and provided in the Authors' kit, on the INTERSPEECH 2014 website, along with the Call for Papers. Optionally, authors may submit additional files, such as multimedia files, which will be included in the official conference proceedings USB drive. Authors must declare that their contributions are original and are not being submitted for publication elsewhere (e.g. another conference, workshop, or journal). Papers must be submitted via the online paper submission system, which will be opened in February 2014. The conference will be conducted in English. 
Information on the paper submission procedure is available at: http://www.interspeech2014.org/public.php?page=submission_procedure.html

----- IMPORTANT DATES -----
24 March, 2014 Full Paper submission deadline
10 June, 2014 Notification of acceptance
20 June, 2014 Camera-ready paper due
10 July, 2014 Early registration deadline
14-18 September, 2014 Conference dates

We look forward to welcoming you to INTERSPEECH 2014 in Singapore!
- The Organizing team of INTERSPEECH 2014 -
Email: tpc at interspeech2014.org

From hamon at LIMSI.FR Sat Feb 8 11:21:37 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:21:37 +0100
Subject: Appel: TOTh 2014, Extension de la date limite
Message-ID:

Date: Fri, 7 Feb 2014 17:52:33 +0100
From: Luc Damas
Message-ID:
X-url: http://www.porphyre.org/toth/
X-url: http://www.porphyre.org/toth/toth-2014/appel-a-communications

The deadline for abstract submissions for the TOTh 2014 Conference has been extended to Monday 24 February.

-----------------------------------------------------
TOTh 2014: CALL FOR PAPERS
Terminology & Ontology: Theories and Applications
Chambéry - France

24 February 2014: abstract submission deadline (2 to 3 pages)
Conference: 12 & 13 June 2014
Training: 10 & 11 June 2014

http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2014/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf

Submission via EasyChair:
-----------------------------------
Extended abstracts (2 to 3 pages) must be submitted exclusively via EasyChair: http://www.easychair.org/conferences/?conf=toth2014

Presentation:
------------------
The 8th edition of the TOTh international conferences will take place on Thursday 12 and Friday 13 June 2014 at the Université de Savoie (Chambéry, France), with the support of the French Ministry of Culture (DGLFLF) and the Association Européenne de Terminologie. The TOTh conferences (Terminology & Ontology: Theories and Applications) aim to bring together researchers, practitioners and, more generally, anyone whose concerns relate to language and knowledge engineering. In addition to the accepted papers, the programme will include an invited talk, a disputatio and a poster session. A day-and-a-half training session on the analysis of texts for onomasiological purposes will precede the conference on Tuesday 10 and Wednesday 11 June. A "Young Researcher" prize is awarded by the programme committee during the conference.

Conference topics (non-exhaustive list):
--------------------------------------------------------------
- Languages for special purposes and specialized lexicography,
- Translation,
- Corpus processing,
- Lexicons, dictionaries, terminologies,
- Language planning and terminology planning,
- Knowledge representation, extraction and management,
- Conceptual systems,
- Ontologies,
- Thesauri,
- Applications: information retrieval, Web of data, digital libraries, development environments, terminology banks,
- Standards, exchange formats.

TOTh 2014
http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2014/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2014-appel-a-communications.pdf

From hamon at LIMSI.FR Sat Feb 8 11:02:25 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:02:25 +0100
Subject: Job: CDI, Web-Miner, MarketScience
Message-ID:

Date: Thu, 06 Feb 2014 11:40:20 +0100
From: Aude Robert
Message-ID: <52F36694.8060503 at marketscience.fr>

Title: CDI, Web-Miner, MarketScience
Company: MarketScience
Department: IT Software
City: Orléans (France, 45)
Job Type: Full time / CDI (permanent contract)
Languages: English and French

IDEAL CANDIDATE:
We are looking for a candidate who will work on our Media Mining module. The module will aggregate and store information extracted from the Internet that will be integrated into a financial market Bloomberg application.

REQUIREMENTS:
Autonomous and very ambitious
Good experience in Python or PHP, and HTML is necessary.
Web crawler/information retrieval knowledge
Database management (MySQL/SQL Server) knowledge
Focused on code quality
Knowledge in Natural Language Processing, Text Mining or Information Retrieval
Very good English writing and communication skills
Planning and long-term thinking

CAREER LEVEL: 3 Year Exp.

RESPONSIBILITIES:
Develop and maintain a web mining system.
Develop and maintain a storage solution for the extracted information.
Process extracted information (Natural Language Processing)
Ensure long-term scaling of the solution.

Excellent compensation for an excellent, ambitious candidate.

DESCRIPTION OF COMPANY:
MarketScience is a French/Romanian start-up in Information Technology/Finance. Salary will be defined according to profile and experience.
For more information, please contact aude.robert at marketscience.fr To submit an application, please send your CV and cover letter to both nicolasboitout at hotmail.com

From hamon at LIMSI.FR Sat Feb 8 10:43:02 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:43:02 +0100 Subject: Appel: CBMI2014, Submission deadline 16 Feb, Workshop on Content-Based Multimedia Indexing Message-ID: Date: Wed, 05 Feb 2014 09:59:19 +0100 From: Bernard Merialdo Message-ID: <52F1FD67.1080806 at eurecom.fr> X-url: http://cbmi2014.itec.aau.at/ (our apologies for duplicate postings) ************************************************************************* CALL FOR PAPERS Paper submission deadline: February 16, 2014 C B M I 2014 12th International Workshop on Content-Based Multimedia Indexing 18-20 June 2014, Klagenfurt, Austria http://cbmi2014.itec.aau.at/ in cooperation with IEEE CSS and ACM SIGMM ************************************************************************* The 12th International Content Based Multimedia Indexing Workshop aims to bring together the various communities involved in all aspects of content-based multimedia indexing, retrieval, browsing and presentation.
Following the eleven successful previous events of CBMI (Toulouse 1999, Brescia 2001, Rennes 2003, Riga 2005, Bordeaux 2007, London 2008, Chania 2009, Grenoble 2010, Madrid 2011, Annecy 2012, and Veszprem 2013), CBMI 2014 will take place in Klagenfurt, in the very south of Austria, from June 18th to June 20th 2014. CBMI 2014 is organized in cooperation with the IEEE Circuits and Systems Society and ACM SIG Multimedia. Topics of the workshop include but are not limited to visual indexing, audio and multi-modal indexing, multimedia information retrieval, and multimedia browsing and presentation. Additional special sessions are planned in the fields of endoscopic videos and images and multimedia metadata.

Paper submission: Authors are invited to submit full-length and special session papers of 6 pages and short (poster) and demo papers of 4 pages maximum. All peer-reviewed, accepted and registered papers will be published in the CBMI 2014 workshop proceedings, to be indexed and distributed by IEEE Xplore. Submissions are peer reviewed in a single-blind process; the language of the workshop is English. Authors of selected papers will be invited to submit extended versions of their contributions to a special issue of the Multimedia Tools and Applications journal (MTAP).

* Important dates:
Paper submission deadline: February 16, 2014
Notification of acceptance: March 30, 2014
Camera-ready papers due: April 14, 2014
Author registration: April 14, 2014
Early registration: May 25, 2014

* Topics of interest include, but are not limited to: * Visual Indexing (image, video, graphics) * Visual content extraction * Identification and tracking of semantic regions * Identification of semantic events * Audio and Multi-modal Indexing * Audio indexing (audio, speech, music) * Audio content extraction * Multi-modal and cross-modal indexing * Multimedia fusion * Metadata generation, coding and transformation * Multimedia Information Retrieval (image, audio, video, ...)
* Matching and similarity search * Content-based search * Multimedia data mining * Multimedia recommendation * Large scale multimedia database management * Multimedia Browsing and Presentation * Summarization, browsing and organization of multimedia content * Personalization and content adaptation * User interaction and relevance feedback * Multimedia interfaces, presentation and visualization tools * Contact For more information please visit http://cbmi2014.itec.aau.at/ and for additional questions, remarks, or clarifications please contact cbmi2014 at itec.aau.at From hamon at LIMSI.FR Sat Feb 8 10:49:09 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:49:09 +0100 Subject: Appel: LRE-Rel2 Religious Text Analytics Workshop, LREC 2014 Message-ID: Date: Wed, 5 Feb 2014 15:06:58 +0000 (GMT) From: Eric Atwell Message-ID: X-url: http://www.comp.leeds.ac.uk/arabic/lre-rel2.html 2nd Workshop on Language Resources and Evaluation for Religious Texts (LRE-Rel2) 31st May 2014 (afternoon session) Harpa Conference Center, Reykjavik, Iceland http://www.comp.leeds.ac.uk/arabic/lre-rel2.html Second Call for Papers, deadline: 19.02.2014 After a successful launch at LREC 2012, we are organising a second workshop on Language Resources and Evaluation for Religious Texts hosted by LREC 2014 in Reykjavik, Iceland. This is an inclusive workshop title, aimed at researchers with a generic interest in religious texts to raise awareness of different perspectives and practices, and to identify some common themes. Our first workshop attracted a range of scholarship, particularly on Arabic and Islamic Studies, and this year we are keen to extend this range to canonical texts from other languages and religions - Christian, Jewish, Sikh, Buddhist, and other - and, given the Icelandic venue, to iconic texts from historical religions such as The Prose Edda of Norse mythology. 
We are also keen to foster inter-faith corpus studies, tracing similarities as well as differences in religious texts, where this genre includes: the faith-defining religious canon; authoritative interpretations and commentary; sermons, liturgy, prayers, poetry, and lyrics.

Workshop Attendance:
Prospective authors should note that attendance at the workshop is necessary for their paper to be published in the workshop proceedings, though for papers with multiple authors, only one author will be required to attend. Therefore, prospective authors are strongly encouraged to check the LREC website and other sources for information on visa restrictions, and likely travel and hotel costs.

Workshop Topics:
Submissions are invited on (but not limited to) the following topics:
- measuring semantic relatedness between multiple religious texts and corpora from different religions;
- analysis of ceremonial, liturgical, and ritual speech; recitation styles; speech decorum; discourse analysis for religious texts;
- formulaic language and multi-word expressions in religious texts;
- suitability of modal and other logic types for knowledge representation and inference in religious texts;
- issues in, and evaluation of, machine translation in religious texts;
- text-mining, stylometry, and authorship attribution for religious texts;
- corpus query languages and tools for exploring religious corpora;
- dictionaries, thesauri, Wordnet, and ontologies for religious texts;
- (new) corpora and rich and novel annotation schemes for religious texts;
- annotation and analysis of religious metaphor;
- genre analysis for religious texts;
- application in other disciplines (e.g. theology, classics, philosophy, literature) of computer-mediated methods for analysing religious texts.
Important Dates: 19.02.2014 Deadline for paper submissions 05.03.2014 Notification of acceptance 21.03.2014 Camera-ready copies due from authors 06.04.2014 LREC deadline for camera-ready copy of Workshop Proceedings 31.05.2014 LRE-Rel2 Workshop (afternoon session) Submission of Papers: Please submit your papers via the LRE-Rel2 submission page in LREC's START Manager. The URL for this is: https://www.softconf.com/lrec2014/LRE-Rel2/ Authors should submit a full paper, not an abstract. They should also follow LREC guidelines for length of papers, and are advised to use the LREC template for paper submissions. This template will be published by LREC in due course. In the meantime, authors can use the LREC 2012 template: http://www.lrec-conf.org/lrec2012/?Authors-Kit Request from LREC Organisers: "...When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc..." 
LRE-Rel2 Organising Committee: Eric Atwell and Claire Brierley: Computing, University of Leeds, LEEDS, LS2 9JT, UK E.S.Atwell at leeds.ac.uk; C.Brierley at leeds.ac.uk Majdi Sawalha and Bassam Hammo: Computer Information Systems, University of Jordan, AMMAN 11942, Jordan sawalha.majdi at leeds.ac.uk; LRE-Rel2 Programme Committee (provisional): Eric Atwell, School of Computing, University of Leeds, UK Claire Brierley, School of Computing, University of Leeds, UK Majdi Sawalha, Computer Information Systems, University of Jordan, Jordan Bassam Hammo, Computer Information Systems, University of Jordan, Jordan Sane Yagi, Department of Linguistics, University of Jordan, Jordan Dag Haug, Department of Philosophy, History of Art and Ideas, University of Oslo, Norway Moshe Koppel, Department of Computer Science, Bar-Ilan University, Israel Nils Reiter: Department of Computational Linguistics, Heidelberg University, Germany Andrew Wilson, Department of Linguistics and English Language, University of Lancaster, UK Claudia Resch, Institute for Corpus Linguistics and Text Technology, Austrian Academy of Sciences, Austria Nadeem Obaid, Computer Information Systems, University of Jordan, Jordan Muhammad A.M. 
Abushariah, Computer Information Systems, University of Jordan, Jordan
Mortaza Rezaee, Islamic College, London, UK
Janet Watson, Arabic and Middle Eastern Studies, and Linguistics and Phonetics, University of Leeds, UK
John Lee, Halliday Centre for Intelligent Applications of Language Studies, City University of Hong Kong, (HK)
Mohamed Menacer, Taibah University, Saudi Arabia
Deryle Lonsdale, Department of Linguistics and English Language, Brigham Young University, US
Bob MacDonald, Research and Development, Anthony Macauley Associates, Canada
Kais Dukes, School of Computing, University of Leeds, UK
Aida Mustapha, Department of Computer Science and Information Technology, Putra University, Malaysia
Liviu Dinu, Centre for Computational Linguistics, University of Bucharest, Romania
Behrooz Minaei, School of Computer Engineering, Iran University of Science and Technology, Iran
Gurpreet Singh, Centre for Language and Communication Studies, Trinity College Dublin, Ireland
Azzeddine Mazroui, Mathematics and Computer Science, Mohammed 1st University, Morocco

From hamon at LIMSI.FR Sat Feb 8 11:03:30 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:03:30 +0100
Subject: These: Rima Harastani, Alignement lexical en corpus comparables
Message-ID:

Date: Thu, 6 Feb 2014 12:40:10 +0100
From: Rima Harastani
Message-ID:

Hello,

I am pleased to invite you to the defense of my PhD thesis, carried out at the Laboratoire d'Informatique de Nantes Atlantique (LINA) and entitled: "Alignement lexical en corpus comparables : le cas des composés savants et des adjectifs relationnels" (Lexical alignment in comparable corpora: the case of neoclassical compounds and relational adjectives).

The defense will take place on Monday 10 February 2014 at 2:00 pm in room 3 at LINA (building 11, Faculté des Sciences et des Techniques). You are also invited to the reception that will follow.

Kind regards,
Rima Harastani

***************************
The thesis committee will be composed of:
Mr. Hervé BLANCHON, Maître de conférences, Université Pierre Mendès - Reviewer
Mr. Ulrich HEID, Professor, University of Stuttgart - Reviewer
Mr. Vincent CLAVEAU, Research Scientist, CNRS-IRISA Rennes - Examiner
Mr. Holger SCHWENK, Professor, Université du Maine - Examiner
Ms. Béatrice DAILLE, Professor, Université de Nantes - Thesis co-supervisor
Mr. Emmanuel MORIN, Professor, Université de Nantes - Thesis supervisor
***************************
Abstract:
Our work concerns the automatic extraction of a list of terms aligned with their translations (i.e. a specialized bilingual lexicon) from comparable corpora belonging to a specific domain. Comparable corpora include texts written in two languages which are not translations of each other but belong to the same domain. This thesis contributes to improving the quality of an extracted bilingual lexicon. We propose methods dedicated to the translation of two types of terms that have common characteristics among many languages or that cause specific problems for translation due to their nature. These types of terms are the neoclassical compounds (terms containing at least one root borrowed from Greek or Latin) and the terms composed of one noun and one relational adjective. We also propose a method that exploits contexts rich in domain-specific terms to re-rank the candidate translations provided in a bilingual lexicon for a given term. The experiments are performed using two specialized comparable corpora, the first in the domain of breast cancer and the second in the domain of renewable energy, on the languages French, English, German and Spanish.

Keywords: comparable corpora, specialized language, multilingual alignment, neoclassical compounds, relational adjectives.

From hamon at LIMSI.FR Sat Feb 8 10:44:31 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 11:44:31 +0100
Subject: Appel: Journal of Language Modelling, Special issue on Grammar Engineering (deadline extension)
Message-ID:

Date: Wed, 5 Feb 2014 10:53:22 +0100
From: Yannick Parmentier
Message-ID: <20140205095322.GB1924 at haku>
X-url: http://jlm.ipipan.waw.pl/

**Apologies for cross-posting. Please post or distribute.**

*DEADLINE EXTENSION: February 23rd*

CALL FOR PAPERS
Journal of Language Modelling
SPECIAL ISSUE ON High-level Methodologies for Grammar Engineering
http://jlm.ipipan.waw.pl/

Special Issue Description
-------------------------
Many grammatical frameworks have been proposed over the last decades to describe the syntax (and semantics) of natural language. Among the most widely used, one may cite (in chronological order) Tree-Adjoining Grammar (TAG) [Joshi et al., 1975], Lexical-Functional Grammar (LFG) [Bresnan and Kaplan, 1982], Combinatory Categorial Grammar (CCG) [Steedman, 1987], or Head-driven Phrase Structure Grammar (HPSG) [Pollard and Sag, 1994]. These frameworks are of both theoretical and practical interest. From a theoretical point of view, they provide a formal device for the linguist to experiment with her/his theories. From a practical point of view, they make it possible to automatically process natural language in applications such as dialog systems, machine translation, etc. They differ in their expressivity and complexity. Some prove more suitable for the description of a given language than others.
Still, for many of these frameworks, large resources (e.g., grammars) have been designed, at first by hand, and later via dedicated tools (e.g., integrated grammar environments such as XLE for LFG [King et al., 2000]).

In this Special Issue, we are concerned with this complex task of grammar engineering, keeping in mind the two above-mentioned theoretical and practical interests.

Topics of Interest
------------------

This special issue welcomes contributions on the following topics (without being limited to them):

1. New or updated work on the traditional topics of GEAF workshops

   The traditional topics in the series of "Grammar Engineering Across Frameworks" workshops, "Formal Grammar" conferences and / or the "High-level Methodologies for Grammar Engineering" ESSLLI 2013 workshop are appropriate. The submitted paper must clearly deal with language modelling. An extended version of a workshop/conference paper can be submitted, provided that the contribution has been substantially updated. The forums for any preliminary versions of the paper must be indicated.

2. Study of new fundamental and / or applied results related to grammar development, maintenance and enhancement
   - dedicated description languages for grammar engineering
   - semi-automatic acquisition of grammars
   - development of grammars for under-resourced languages
   - cross-framework grammar engineering
   - cross-language grammar engineering
   - seamless construction of grammars from dynamically changing linguistic descriptions
   - designs of weighted grammars
   - efficient and dynamic construction of structures from updated rules or logical formulas

3. Study of new methods for grammar engineering and their applications
   - debugging environments for grammar design
   - systems based on large-scale grammars

4. In addition, the topics of interest include reports on implementations
   - new kinds of implementations of grammar compilers, libraries and on-demand operations
   - benchmarks suitable for evaluation of grammars
   - methods that construct, minimize or decompose grammars

Important Dates
---------------

- Deadline for submissions: 23 February 2014
- First decision: 12 May 2014
- Submission of revised version: 17 July 2014
- Final decision: 22 September 2014
- Submission of camera-ready versions: 19 October 2014

Submission
----------

Articles submitted to this special issue must adhere to the JLM instructions for contributors and use the JLM's style files, see:
http://jlm.ipipan.waw.pl/about/submissions#authorGuidelines

Paper submissions should be made through JLM's electronic submission system:
http://jlm.ipipan.waw.pl/about/submissions#onlineSubmissions

All submitted articles should be anonymous and any obvious self-references should be avoided. The contribution of the paper must be substantial. All articles will be evaluated by at least 3 reviewers, including at least one member of JLM's Editorial Board, and at least one member of the issue's Guest Editorial Board.

Editorial Board
---------------

Guest Editors
- Denys Duchier, Université d'Orléans, FRANCE
- Yannick Parmentier, Université d'Orléans, FRANCE

Guest Editorial Board
- Emily Bender, University of Washington, USA
- Philippe Blache, CNRS, Université d'Aix-en-Provence, FRANCE
- Eric De La Clergerie, INRIA Paris, FRANCE
- Benoît Crabbé, Université Diderot Paris 7, FRANCE
- Berthold Crysmann, CNRS Paris, FRANCE
- Denys Duchier, Université d'Orléans, FRANCE
- Claire Gardent, CNRS/LORIA, Nancy, FRANCE
- Joseph Le Roux, Université Paris Nord, FRANCE
- Timm Lichte, Universität Düsseldorf, GERMANY
- Montserrat Marimon, Universitat de Barcelona, SPAIN
- Yusuke Miyao, National Institute of Informatics, Tokyo, JAPAN
- Stefan Müller, Freie Universität Berlin, GERMANY
- Yannick Parmentier, Université
d'Orléans, FRANCE

Contact
-------

Yannick Parmentier
Department of Computer Science, University Institute of Technology
University of Orléans
firstname (dot) lastname (at) univ-orleans (dot) fr

From hamon at LIMSI.FR Sat Feb 8 11:13:20 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:13:20 +0100
Subject: Seminar: Textometric exploration of themes in a corpus, 13 February 2014, Université Paris-Est Marne-la-Vallée

Date: Thu, 06 Feb 2014 16:12:20 +0100
From: Nicolas Turenne
Message-ID: <52F3A654.3000909 at yahoo.fr>
X-url: http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

[apologies for multiple postings]

Dear colleagues,

For the fifteenth session of the CorText group's 'traces digitales' (digital traces) seminar, we are pleased to welcome Philippe Gambette (associate professor, Université Paris-Est Marne-la-Vallée) and Mathieu Brugidou (senior researcher (HDR), EDF), who will speak about the statistical analysis of textual data.

The session will take place on Thursday 13 February from 10:00 to 13:00 in meeting room C219 of the Latts (Université Paris-Est Marne-la-Vallée, rue Galilée, Bois de l'Etang building, 2nd floor, wing C).
See http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

10:00-11:30
Philippe Gambette (associate professor, LIGM, Université Paris-Est Marne-la-Vallée)
Textometric exploration of themes in a corpus of research projects on biodiversity

Abstract: From a corpus of more than 4000 abstracts of projects funded by some fifteen agencies in various European countries, we draw up an overview of the temporal and geographical trends in biodiversity research funded in Europe between 2004 and 2011. To this end, factor analyses, assessments of vocabulary specificity and tree cloud visualizations are combined in an exploratory analysis enriched by the perspective of a domain expert. The strengths and limitations of tree cloud visualization will be discussed in particular, and explored further through other use cases.

11:30-13:00
Mathieu Brugidou (senior researcher (HDR), EDF)
The Grenelle de l'environnement: a morphological and formal approach using the textual data analysis software Alceste and Tropes.

Abstract: The hypothesis favoured in this work is to regard the Grenelle de l'environnement as a collective writing mechanism. It is not only a mechanism for stakeholder participation and consultation, strongly marked by a negotiation dimension, as the reuse of the term "Grenelle" indicates. This consultation process also turns out to be a collective writing mechanism intended to produce syntheses and public policy measures. Two types of approach are favoured in this analysis. The first, morphological, is oriented towards analysing how problems are represented; the second, formal [Lassegue, 2010], is devoted to analysing how measures are formulated:

The morphological perspective seeks to bring out the thematic structures of the different corpora and their dynamics. We hypothesize that the Grenelle process, and in particular its design in phases, produces maps of the problems, and that these problems can only be represented relationally. For the Grenelle participants, defining the problems involves sorting the issues, first between those that do or do not fall within the Grenelle de l'environnement, then between those defining the different problems identified during the Grenelle. Textual data analysis methods are favoured here, in particular those in the tradition of Benzécri, such as the Alceste method developed by M. Reinert, which are geared towards an empirical approach to structures;

A second perspective can be described as formal insofar as it focuses on the effects of the writing mechanisms on the form of the measures. The aim is to describe how the measures are given form, on the hypothesis that, beyond their content, they have linguistic characteristics of their own that follow a grammar [Lemieux, 2009] defining their pragmatic properties. This second, more analytical approach favours diachronic studies that follow the phasing of the process. In particular, morphosyntactic analyses are carried out here with the Tropes software. These analyses are conducted on corpora built according to different parameters of time (the phases of the negotiation) and of discourse space (the different writing mechanisms, which correspond to the phases of the Grenelle but also to sub-spaces of the reports, for example the contrast between synthesis and annexes, etc.).
From hamon at LIMSI.FR Sat Feb 8 11:25:34 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:25:34 +0100
Subject: Internship: Research master's, Dynamic semantic annotation, LIPN, Université Paris 13

Date: Fri, 7 Feb 2014 18:49:22 +0100
From: Adeline Nazarenko
Message-Id: <300A46BB-69F2-47EC-9292-B3576E7A713E at lipn.univ-paris13.fr>

(English version below)

Research master's internship proposal
Dynamic semantic annotation

Keywords: Natural Language Processing, semantic annotation, content management, knowledge engineering, semantic web
Supervisors: Adeline Nazarenko and François Lévy (LIPN, Université Paris 13 - Sorbonne Paris Cité & CNRS)
Duration: 4 to 6 months (spring-summer 2014)
Stipend: 430 €/month (Labex EFL)

Problem statement

The semantic annotation of documents now plays a key role in many textual content management applications (text navigation, semantic information retrieval, document restructuring, etc.). Semantic annotation consists in attaching to a text information, or metadata, whose semantics is carried by a formal semantic model (for example an indexing language, a thesaurus or an ontology) [13, 6, 14].
This associates a formal semantic representation with the text, so that search engines or software agents can exploit both the textual content (full-text search, distributional computations) and the formal semantics associated with it. First-generation annotation tools are rather crude: they often merely link the mentions of named entities identified in the texts to existing instances, or to new instances, of concepts in an ontology [10, 4]. The development of specialized content management applications and the rise of the web of data now call for a rethinking of semantic annotation methods: we need methods and tools that offer richer annotation expressiveness (e.g. annotating not only instances of concepts but also concepts and relations) while being robust, generic and adaptable to different domains and contexts of use.

Objective of the internship

The internship will propose a semantic annotation method that incorporates measures of annotation quality and allows the annotation to be revised dynamically. The semantic model is assumed to be ontological. If we consider that an annotation system S = (O, T, A) is composed of an ontology O, a text T and a set A of annotations or links associating segments of T with entities of O, then the system S must be revised if one of its components is updated (the text is modified, the ontology is enriched or restructured) or when inconsistencies or coverage gaps are detected. The internship will consist in studying the different situations in which such an annotation system must be revised and in proposing a dynamic annotation method that integrates revision processes.
The dynamic annotation method must 1) integrate consistency criteria and coverage measures to identify when an annotation system needs revising, 2) propose revision procedures suited to the different situations and 3) control the convergence of the overall revision process. Starting with the simplest types of annotation (e.g. a text annotated with the instances and concepts of an ontology), the intern will propose a dynamic annotation method. He or she will be able to draw on the RCLN team's existing semantic annotation tools, on the expertise of the team members and on real use cases to assess what this dynamic approach to annotation contributes. Ideally the proposed method will be integrated directly into an existing annotation tool, but it may also be tested in simulation if integration proves too costly.

Description of the work

The internship will comprise several parts: 1) state of the art on semantic annotation and survey of existing tools (from the RCLN team or elsewhere); 2) description, modelling and implementation of the dynamic annotation process (for the simplest types of annotation, building on existing tools and/or semantic technologies); 3) analysis, testing and evaluation of the proposed approach on simplified real use cases provided by the RCLN team. In addition, and with a view to continuing with a PhD, the intern may seek to specify a richer semantic annotation method that takes a wider range of annotation types into account.

Context and practical information

The internship will be supervised by Adeline Nazarenko and François Lévy. The intern will join the RCLN team and benefit from its expertise in natural language processing, textual knowledge engineering and the semantic web.
In particular, the RCLN team has solid experience in semantic annotation (manual annotation [2, 3] or automatic annotation by machine learning [8], formalisms and resources for annotation [9, 12]) and in building ontologies from texts [1]. It also has experience of integrating these annotation and acquisition methods into content analysis tools [7, 5, 11].

The intern will work at LIPN (Université Paris 13 - Sorbonne Paris Cité & CNRS), where he or she will be given a desk. He or she will have access to all the technical resources and data needed for the work. The internship is planned to last 6 months. It should start in spring 2014. It will be funded as part of a research operation of the "Computational semantic analysis" strand of the Labex "Empirical Foundations of Linguistics".

Applications should be sent to François Lévy (francois.levy at lipn.univ-paris13.fr) before 7 March 2014: send a cover letter, a CV and master's transcripts.

References

[1] N. Aussenac-Gilles, S. Després, and S. Szulman. "The TERMINAE Method and Platform for Ontology Engineering from texts". In Paul Buitelaar and Philipp Cimiano, editors, Bridging the Gap between Text and Knowledge - Selected Contributions to Ontology Learning and Population from Text, pages 199-223. IOS Press, January 2008.
[2] K. Fort. Les ressources annotées, un enjeu pour l'analyse de contenu : vers une méthodologie de l'annotation manuelle de corpus. PhD thesis in computer science, Université Paris 13 - Sorbonne Paris Cité, Villetaneuse, France, 2012.
[3] K. Fort, A. Nazarenko, S. Rosset. "Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis". In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, December 2012.
[4] C. Giuliano, A. Gliozzo. "Instance-based ontology population exploiting named-entity substitution".
In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pages 265-272, Manchester, August 2008.
[5] A. Guissé, F. Lévy, A. Nazarenko. Un moteur sémantique pour explorer des textes réglementaires. In Actes des 22èmes journées francophones d'Ingénierie des Connaissances, Chambéry, 2011.
[6] A. Kiryakov, B. Popov, I. Terziev, D. Manov, and D. Ognyanoff. "Semantic annotation, indexing, and retrieval". Journal of Web Semantics, 2(1):49-79, 2004.
[7] F. Lévy, A. Nazarenko, A. Guissé. "Annotation, indexation et parcours de documents numériques". Revue des Sciences et Technologies de l'Information, 13(3/2010):121-152, 2010.
[8] Y. Ma, F. Lévy, A. Nazarenko. Annotation sémantique pour des domaines spécialisés et des ontologies riches. In Actes de la 20ème conférence du Traitement Automatique du Langage Naturel (TALN 2013), pp. 464-478, 17-21 June 2013, Les Sables d'Olonne.
[9] Y. Ma, A. Nazarenko, L. Audibert. "Formal description of resources for ontology-based semantic annotation". In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2010), Malta, May 2010. ELRA.
[10] B. Magnini, A. Pianta, O. Popescu, M. Speranza. "Ontology population from textual mentions: Task definition and benchmark". In Proceedings of the OLP2 workshop on Ontology Population and Learning, Sydney, Australia, 2006.
[11] A. Nazarenko, A. Guissé, F. Lévy, N. Omrane, S. Szulman. "Integrating Written Policies in Business Rule Management Systems". In Rule-Based Reasoning, Programming, and Applications, volume 6826 of Lecture Notes in Computer Science, pages 99-113, Barcelona, Spain, 2011.
[12] N. Omrane, A. Nazarenko, P. Rosina, S. Szulman, C. Westphal. "Lexicalized ontology for a business rules management platform: An automotive use case". In Proceedings of the 5th International Symposium on Rules, International Business Rules Forum (RuleML@BRF), Ft Lauderdale, Florida, USA, November 2011.
[13] B. Popov, A. Kiryakov, D.
Ognyanoff, D. Manov, A. Kirilov. "Kim - a semantic platform for information extraction and retrieval". Natural Language Engineering, 10(3-4):375-392, 2004.
[14] V. Uren, P. Cimiano, J. Iria, S. Handschuh, M. Vargas-Vera, E. Motta, F. Ciravegna. "Semantic annotation for knowledge management: Requirements and a survey of the state of the art". Journal of Web Semantics, 4, 2006.

-------------------------------------------------------
(French version above)

Proposal for a master's internship
Dynamic semantic annotation

Keywords: Natural Language Engineering, Semantic Annotation, Content Management, Knowledge Engineering, Semantic Web
Supervision: Adeline Nazarenko and François Lévy (LIPN, Université Paris 13 - Sorbonne Paris Cité & CNRS)
Duration: 4-6 months (spring-summer 2014)
Stipend: 430 €/month (Labex EFL)

Problem

The semantic annotation of documents plays a key role in many textual content management applications (e.g. navigation, semantic information retrieval, publication). Semantic annotation consists in enriching a text with metadata whose semantics is given by a formal semantic model (e.g. indexing language, thesaurus, ontology) [13, 6, 14]. A formal semantic representation is thus associated with the text, so that search engines or software agents can jointly exploit the textual content (plain text search, distributional measures) and the formal semantics associated with it.

First-generation annotation tools are quite simple. They often merely bind references to named entities identified in the texts to existing instances or new instances of concepts in an ontology [10, 4]. However, the development of specialized content management applications and of linked data calls for renewed methods of semantic annotation: we need methods and tools that provide richer annotation expressiveness (e.g. annotation with respect to concepts and relations, not only instances) while being robust, generic and adaptable to different domains and use cases.
Goal

The goal of the internship is to design a semantic annotation method that incorporates annotation quality measures and enables the dynamic revision of annotations, assuming that the semantic model is ontological. If we consider that an annotation system S = (O, T, A) consists of an ontology O, a text T and a set A of annotations or links associating segments of T with entities of O, the system S must be revised if one of its components is updated (the text is modified, the ontology is enriched or restructured) or when inconsistencies or gaps in coverage are detected. The Master's student will study the different scenarios requiring the revision of such an annotation system and propose a dynamic annotation method integrating such a revision process.

The dynamic annotation method must 1) integrate consistency criteria and coverage metrics to identify when the revision of an annotation system is necessary, 2) propose revision procedures adapted to different use scenarios and 3) control the convergence of the overall revision process. Starting with the simplest types of annotation (e.g. a text annotated with instances and concepts of an ontology), the student will provide a method for dynamic annotation. The student will rely on existing semantic annotation tools, on the expertise of RCLN team members and on real use cases to assess the contribution of this dynamic annotation. The proposed method will be directly integrated into an existing annotation tool, or tested through simulation if integration is too complex.

Description of work

The work will include several parts:
- state of the art on semantic annotation and review of existing tools;
- description, modeling and implementation of the dynamic annotation process (for the simplest types of annotations and based on existing tools and/or semantic technologies);
- analysis, test and evaluation of the proposed approach on simple but real use cases provided by the RCLN team.
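
As an illustration only, the annotation system S = (O, T, A) described under "Goal" might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the RCLN team's tools or method: the names `Annotation`, `AnnotationSystem`, `inconsistencies`, `coverage` and `needs_revision` are hypothetical, the ontology O is reduced to a set of entity identifiers, coverage is measured as the share of characters of T that are annotated, and the 0.5 threshold is arbitrary.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class Annotation:
    """One link of A: a segment of T associated with an entity of O."""
    start: int   # offset of the first character of the segment in T
    end: int     # offset just past the last character of the segment
    entity: str  # identifier of the ontology entity (hypothetical naming)


@dataclass
class AnnotationSystem:
    """Hypothetical model of S = (O, T, A)."""
    ontology: set[str]  # O, simplified to a set of entity identifiers
    text: str           # T
    annotations: list[Annotation] = field(default_factory=list)  # A

    def inconsistencies(self) -> list[Annotation]:
        """Annotations whose entity is not in O or whose span is not inside T."""
        return [
            a for a in self.annotations
            if a.entity not in self.ontology
            or not (0 <= a.start < a.end <= len(self.text))
        ]

    def coverage(self) -> float:
        """Fraction of the characters of T covered by at least one annotation."""
        if not self.text:
            return 0.0
        covered: set[int] = set()
        for a in self.annotations:
            covered.update(range(max(a.start, 0), min(a.end, len(self.text))))
        return len(covered) / len(self.text)

    def needs_revision(self, min_coverage: float = 0.5) -> bool:
        """Revision is triggered by any inconsistency or by a coverage gap."""
        return bool(self.inconsistencies()) or self.coverage() < min_coverage


system = AnnotationSystem(
    ontology={"BreastCancer", "Disease"},
    text="Breast cancer is a disease.",
    annotations=[Annotation(0, 13, "BreastCancer"), Annotation(19, 26, "Disease")],
)
before = system.needs_revision()    # consistent and sufficiently covered
system.ontology.discard("Disease")  # the ontology is restructured
after = system.needs_revision()     # one annotation now points outside O
print(before, after)
```

In this toy setting, updating one component (here the ontology) is enough to flag the system for revision; the revision procedures themselves and the control of their convergence, which are the actual subject of the internship, are deliberately left out.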
In addition, and in anticipation of a PhD follow-up, the student may start to specify a richer semantic annotation method taking into account a wider range of annotation types.

Context and Practical Information

The work will be supervised by Prof. Adeline Nazarenko and Prof. François Lévy. The intern will be integrated in the RCLN team and benefit from its expertise in natural language processing, knowledge engineering and the semantic web. In particular, RCLN has solid experience in semantic annotation (manual annotation [2, 3] or based on machine learning [8], formalisms and resources for annotation [9, 12]) and text-based ontology design [1]. It also knows how to integrate those methods of acquisition and annotation into content analysis tools [7, 5, 11].

The intern will work at LIPN (University Paris 13 - Sorbonne Paris Cité & CNRS), where he/she will be assigned a desk. He/she will have access to local facilities and data resources. The internship is for a period of 6 months. It should start in spring 2014. It will be funded by the Labex "Empirical Foundations of Linguistics" (research strand "computational semantic analysis").

Applications should be addressed to François Lévy (francois.levy at lipn.univ-paris13.fr) before March 7, 2014: send a cover letter, a CV and transcripts.

From hamon at LIMSI.FR Sat Feb 8 10:47:33 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 11:47:33 +0100
Subject: Training day: TXM software training (textometry), dates for March-April 2014

Date: Wed, 05 Feb 2014 15:53:08 +0100
From: Serge Heiden
Message-ID: <52F25054.2030106 at ens-lyon.fr>
X-url: http://textometrie.ens-lyon.fr

Hello,

The open-source textometry software TXM, whose development is coordinated by the ICAR laboratory in Lyon, supports documentary (qualitative) and statistical (quantitative) analysis of textual data (including transcribed speech), for personal research corpora or for publishing corpora online: http://textometrie.ens-lyon.fr

The next TXM training workshops will take place in Lyon, at the ENS Descartes site, on the following days (8:30/9:30-17:30/18:00):
- Monday 7 April: Introduction to TXM;
- Tuesday 8 April: Corpus preparation and import into TXM.
(The February workshops are full.)

An "Advanced TXM" workshop may also be organized if we can gather a small group of experienced users (already running analyses with TXM on their own data).

If you are interested in one of these sessions, you must register by sending an email to textometrie ens-lyon fr (registration is required even if you replied to the doodle poll).

Full information on the workshops: https://groupes.renater.fr/wiki/txm-users/public/ateliers_txm

We hope to see you at one of these days,

Serge Heiden, for the TXM team.

From hamon at LIMSI.FR Sat Feb 8 10:46:34 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 11:46:34 +0100
Subject: Call: LREC 2014 Workshop on Building and Using Comparable Corpora (7th BUCC)

Date: Wed, 5 Feb 2014 14:06:23 +0100
From: "Reinhard Rapp"
Message-ID: <231BF614593D4F2A923714DCB28EF473 at ASUSPC>
X-url: http://comparable.limsi.fr/bucc2014/

We apologize for multiple postings
Please distribute to interested colleagues

============================================================
Final Call for Papers

7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Building Resources for Machine Translation Research

http://comparable.limsi.fr/bucc2014/

May 27, 2014
Co-located with LREC 2014
Harpa Conference Centre, Reykjavik (Iceland)

DEADLINE FOR PAPERS: February 10, 2014
https://www.softconf.com/lrec2014/BUCC2014/

*** INVITED SPEAKER ***
Chris Callison-Burch (University of Pennsylvania)
============================================================

MOTIVATION

In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons. In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves, since they make inter-linguistic discoveries and comparisons possible. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions.
We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora.

The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages.

Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. However, beyond a few language pairs such as English-French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web.

To exemplify such issues in a practical setting, this year's special focus will be on

Building Resources for Machine Translation Research

This special topic aims to address the need for:
(1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and
(2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora.
TOPICS

We solicit contributions including but not limited to the following topics:

Topics related to the special theme:
* Methods and tools for collecting and processing MT data, including crowdsourcing
* Methods and tools for quality control
* Tools for efficient annotation
* Bilingual term and named entity collections
* Multilingual treebanks, wordnets, propbanks, etc.
* Comparable corpora with parallel units annotated
* Comparable corpora for under-resourced languages and specific domains
* Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc.
* Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc.
* Legal issues with collecting and redistributing data and generating derivatives

Building comparable corpora:
* Human translations
* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the Web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages, across language families
* Multi-media/multi-modal comparable corpora

Applications of comparable corpora:
* Human translations
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual projections
* Machine translation
* Writing assistance

Mining from comparable corpora:
* Extraction of parallel segments or paraphrases from comparable corpora
* Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc.
IMPORTANT DATES

February 10, 2014    Deadline for submission of full papers
March 10, 2014       Notification of acceptance
March 27, 2014       Camera-ready papers due
May 27, 2014         Workshop date

SUBMISSION INFORMATION

Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF file via the START workshop manager at https://www.softconf.com/lrec2014/BUCC2014/

Contributions can be short or long papers. Short paper submissions must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages.

Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings.

Double submission policy: Parallel submission to other meetings or publications is possible, but the workshop organizers must be notified immediately.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc.

JOURNAL SPECIAL ISSUE

Authors of selected papers will be encouraged to submit substantially extended versions of their manuscripts to an upcoming special issue on "Machine Translation Using Comparable Corpora" of the Journal of Natural Language Engineering.
ORGANISERS

Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France)
Ahmet Aker, University of Sheffield (UK)
Serge Sharoff, University of Leeds (UK)
Stephan Vogel, QCRI (Qatar)
Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France)

FURTHER INFORMATION

Pierre Zweigenbaum: pz (at) limsi (dot) fr

SCIENTIFIC COMMITTEE

* Ahmet Aker, University of Sheffield (UK)
* Srinivas Bangalore (AT&T Labs, US)
* Caroline Barrière (CRIM, Montréal, Canada)
* Chris Biemann (TU Darmstadt, Germany)
* Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
* Kurt Eberle (Lingenio, Heidelberg, Germany)
* Andreas Eisele (European Commission, Luxembourg)
* Éric Gaussier (Université Joseph Fourier, Grenoble, France)
* Gregory Grefenstette (INRIA, Saclay, France)
* Silvia Hansen-Schirra (University of Mainz, Germany)
* Hitoshi Isahara (Toyohashi University of Technology)
* Kyo Kageura (University of Tokyo, Japan)
* Adam Kilgarriff (Lexical Computing Ltd, UK)
* Natalie Kübler (Université Paris Diderot, France)
* Philippe Langlais (Université de Montréal, Canada)
* Michael Mohler (Language Computer Corp., US)
* Emmanuel Morin (Université de Nantes, France)
* Dragos Stefan Munteanu (Language Weaver, Inc., US)
* Lene Offersgaard (University of Copenhagen, Denmark)
* Ted Pedersen (University of Minnesota, Duluth, US)
* Reinhard Rapp (Université Aix-Marseille, France)
* Sujith Ravi (Google, Mountain View, US)
* Serge Sharoff (University of Leeds, UK)
* Michel Simard (National Research Council Canada)
* Richard Sproat (OGI School of Science & Technology, US)
* Tim Van de Cruys (IRIT-CNRS, Toulouse, France)
* Stephan Vogel (QCRI, Qatar)
* Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France)
* Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)

From hamon at LIMSI.FR Sat Feb 8 11:20:43 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:20:43 +0100
Subject: Seminaire: Programme of the Theoretical Linguistics Seminar, Université de Bourgogne
Message-ID:

Date: Fri, 7 Feb 2014 16:12:55 +0100 (CET)
From: Philippe Monneret
Message-ID: <447312548.2154087.1391785975799.JavaMail.root at u-bourgogne.fr>

2014 SEMINAR
QUESTIONS IN THEORETICAL LINGUISTICS
Philippe Monneret
Université de Bourgogne / GReLiSC (EA 4178 CPTC)
Droit-Lettres building, 10:00-12:00
------------------------------------------------------------------------

Thursday 23 January (room 105 PEG)
Presentation and introduction of the seminar
« Analogie et similarité en sciences du langage »
Philippe Monneret (Université de Bourgogne)

Thursday 6 February (room 155)
« Pourquoi tant de logiques modales ? »
Guillaume Coqui (Université de Bourgogne)

Thursday 13 February (Conseil Lettres room)
« Remarques sur la détermination en français »
Pierre-André Buvet (Université de Paris 13)

Thursday 20 February
Conference « Vers une histoire des théories phonosymboliques »
Organized by Luca Nobile (Université de Bourgogne)

Thursday 6 March (room 155)
« Sur le possible en linguistique »
Thomas Verjans (Université de Bourgogne)

Thursday 13 March (Conseil Lettres room)
« Mon nom est PERSONNE : qui est JE, qui est TU, qui est IL ? »
Georges Kleiber (Université de Strasbourg)

Thursday 20 March (Conseil Lettres room)
« L'expression de la finalité en français »
Gaston Gross (Université de Paris 13)

Friday 28 March (Conseil Droit room)
« Philosophie du langage, philosophie de la linguistique, philosophie des langues »
Christian Puech (Université de Paris 3)

Thursday 3 April (room 155)
« Sur la disparition de cuidier »
Thierry Ponchon (Université de Reims, Champagne-Ardenne)

Friday 11 April (Conseil Lettres room)
« Essai de typologie de l'iconicité linguistique »
Luca Nobile (Université de Bourgogne)

Thursday 17 April (room 155)
« Penser sans langage. Le cas des enfants en période pré-linguistique »
Anna Ciaunica (Mind, Language and Action Lab - University of Porto)

From hamon at LIMSI.FR Sat Feb 8 11:01:30 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:01:30 +0100
Subject: Appel: SIGDIAL 2014 Conference, June 18th-20th, Philadelphia, USA
Message-ID:

Date: Thu, 6 Feb 2014 09:47:00 +0000
From: Helen Hastie
Message-ID:
X-url: http://www.sigdial.org/workshops/conference15

*SIGDIAL 2014 CONFERENCE*
*Wednesday, June 18 to Friday, June 20, 2014*

The 15th Annual Meeting of the Special Interest Group on Discourse and Dialog will be co-located with the 8th International Conference on Natural Language Generation (INLG 2014, http://inlg2014.wordpress.com/) in Philadelphia, PA, USA, and will immediately precede ACL 2014.

http://www.sigdial.org/workshops/conference15

*CALL FOR PAPERS*
The 2014 SIGDIAL conference continues a series of fourteen conferences, providing a regular forum for the presentation of cutting-edge research across the areas of discourse and dialog and attracting a diverse set of participants from academia and industry. The conference is sponsored by the SIGdial organization, which serves as the Special Interest Group on discourse and dialog for both ACL and ISCA.

*KEYNOTE SPEAKERS*
We are pleased to announce that our keynote speakers will be Prof. Steve Young from the University of Cambridge and Prof. Lillian Lee from Cornell University.
*TOPICS OF INTEREST*
We welcome formal, corpus-based, system-building or analytical work on discourse and dialog including but not restricted to the following themes and topics:
- Discourse Processing and Dialog Systems
- Corpora, Tools and Methodology
- Pragmatic and/or Semantic Modeling
- Dimensions of Interaction
- Open Domain Dialog
- Style, Voice and Personality in Spoken Dialog and Written Text
- Applications of Dialog and Discourse Processing Technology
- Novel Methods for Generation Within Dialog for a joint special session with INLG

*SPECIAL SESSIONS*
There will be one special session co-located with INLG on the morning of June 20th. There will be a second special session on the Dialogue State Tracking Challenge (DSTC) (http://camdial.org/~mh521/dstc/). Proposals for additional special sessions are also still welcome. Special session submissions will undergo the regular SIGdial review process.

*IMPORTANT DATES*
Special Session Proposal Deadline: Sunday, 9 February 2014 (23:59, GMT-11)
Special Session Notification: Monday, 17 February 2014
Long, Short and Demonstration Paper Submission Deadline: Sunday, 9 March 2014 (23:59, GMT-11)
Paper Notification: Friday, 18 April 2014
Final Paper Due:
- For papers accepted subject to receiving mentoring: Wednesday, 14 May 2014
- For accepted papers: Friday, 23 May 2014
Conference: Wednesday, June 18, 2014 to Friday, June 20, 2014

*SUBMISSIONS*

*Special Session Proposals*
The SIGdial organizers welcome the submission of special session proposals. A SIGDIAL special session is the length of a regular session at the conference; may be organized as a poster session, a poster session with panel discussion, or an oral presentation session; and will be held on the last day of the conference. Special sessions may, at the discretion of the SIGdial organizers, be held as parallel sessions.
Those wishing to organize a special session should prepare a two-page proposal containing: a summary of the topic of the special session; a list of organizers and sponsors; a list of people who may submit and participate; and a requested format (poster/panel/oral session). These proposals should be sent to conference[a]sigdial.org (http://sigdial.org) by the special session proposal deadline. Special session proposals will be reviewed jointly by the general and program co-chairs.

*Papers*
The program committee welcomes the submission of long papers, short papers, and demonstration descriptions. All accepted submissions will be published in the conference proceedings.

Long papers may, at the discretion of the technical program committee, be accepted for oral or poster presentation. They must be no longer than 8 pages, including title, content, and examples. Two additional pages are allowed for references and appendices, which may include extended example discourses or dialogs, algorithms, graphical representations, etc.

Short papers will be presented as posters. They should be no longer than 4 pages, including title and content. One additional page is allowed for references and appendices.

Demonstration papers should be no longer than 3 pages, including references. A separate one-page document should be provided to the program co-chairs for demonstration descriptions, specifying furniture and equipment needed for the demo.

Authors of a submission may designate their paper to be considered for a SIGDIAL special session, which would highlight a particular area or topic. All papers will undergo regular peer review.

Papers that have been or will be submitted to other meetings or publications must provide this information (see submission format). A paper accepted for presentation at SIGDIAL 2014 must not have been presented at any other meeting with publicly available proceedings.
Any questions regarding submissions can be sent to the program co-chairs at program-chairs[at]sigdial.org.

Authors are encouraged to submit additional supportive material such as video clips or sound clips and examples of available resources for review purposes.

Submission is electronic using paper submission software.

*FORMAT*
All long, short, and demonstration submissions should follow the two-column ACL 2014 format. We strongly recommend the use of ACL LaTeX style files or Microsoft Word style files tailored for the ACL 2014 conference. Submissions must conform to the official ACL 2014 style guidelines (http://www.cs.jhu.edu/ACL2014/CallforPapers.htm), and they must be electronic in PDF. As in most previous years, submissions will not be anonymous.

*MENTORING SERVICE*
For several years, the SIGDIAL conference has offered a mentoring service. Submissions with innovative core ideas that may need language (English) or organizational assistance will be flagged for "mentoring" and conditionally accepted with a recommendation to revise with a mentor. An experienced mentor who has previously published in the SIGDIAL venue will then help the authors of these flagged papers prepare their submissions for publication. Any questions about the mentoring service can be addressed to the mentoring chair at mentoring[at]sigdial.org.

*STUDENT SUPPORT*
SIGdial also offers a limited number of scholarships for students presenting a paper accepted to the conference. Application materials will be posted on the conference website.

*BEST PAPER AWARDS*
In order to recognize significant advancements in dialog and discourse science and technology, SIGDIAL will present two best paper awards. A selection committee consisting of prominent researchers in the fields of interest will select the recipients of the awards.

*SPONSORSHIP*
SIGDIAL offers a number of opportunities for sponsors. For more information, email the conference organizers at sponsor-chair[at]sigdial.org.
*DIALOG AND DISCOURSE*
SIGDIAL authors are encouraged to submit their research to the journal Dialog and Discourse, which is endorsed by SIGdial.

*ORGANIZING COMMITTEE*

General Co-Chairs
Kallirroi Georgila, University of Southern California, USA
Matthew Stone, Rutgers, The State University of New Jersey, USA

Technical Program Co-Chairs
Helen Hastie, Heriot-Watt University, Edinburgh, UK
Ani Nenkova, University of Pennsylvania, USA

Mentoring Chair
Svetlana Stoyanchev, AT&T Research Labs, USA

Local Chair
Keelan Evanini, Educational Testing Service, USA

Sponsorships Chair
Giuseppe Di Fabbrizio, Amazon.com, USA

SIGdial President
Amanda Stent, Yahoo! Labs, USA

SIGdial Vice President
Jason Williams, Microsoft Research, USA

SIGdial Secretary/Treasurer
Kristiina Jokinen, University of Helsinki, Finland

From hamon at LIMSI.FR Sat Feb 8 11:16:21 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:16:21 +0100
Subject: Conf: WWW'14, Tutorial on Concept-Level Sentiment Analysis, Seoul, Korea
Message-ID:

Date: Fri, 7 Feb 2014 17:43:30 +0800
From: Erik Cambria
Message-ID: <6BBCDA75-D5AB-4402-BC54-3E4965361F62 at nus.edu.sg>

Apologies for cross-posting,

Participants are invited to the WWW'14 tutorial on Concept-Level Sentiment Analysis, which will be held within the World Wide Web conference this April in Seoul, Korea. The tutorial aims to give its participants the means to efficiently design models, techniques, tools, and services for concept-level sentiment analysis and their commercial realizations. The tutorial draws on insights resulting from the recent IEEE Intelligent Systems special issues on Concept-Level Opinion and Sentiment Analysis and the IEEE CIM special issue on Computational Intelligence for Natural Language Processing. The tutorial includes a hands-on session to illustrate how to build a concept-level opinion-mining engine step-by-step, from semantic parsing to concept-level reasoning.
BACKGROUND AND MOTIVATIONS

As the Web rapidly evolves, Web users are evolving with it. In an era of social connectedness, people are becoming increasingly enthusiastic about interacting, sharing, and collaborating through social networks, online communities, blogs, Wikis, and other online collaborative media. In recent years, this collective intelligence has spread to many different areas, with particular focus on fields related to everyday life such as commerce, tourism, education, and health, causing the size of the Social Web to expand exponentially.

The distillation of knowledge from such a large amount of unstructured information, however, is an extremely difficult task, as the contents of today's Web are perfectly suitable for human consumption, but remain hardly accessible to machines. The opportunity to capture the opinions of the general public about social events, political movements, company strategies, marketing campaigns, and product preferences has raised growing interest both within the scientific community, leading to many exciting open challenges, and in the business world, due to the remarkable benefits to be had from marketing and financial market prediction.

Mining opinions and sentiments from natural language, however, is an extremely difficult task as it involves a deep understanding of most of the explicit and implicit, regular and irregular, syntactical and semantic rules of a language. Existing approaches mainly rely on parts of text in which opinions and sentiments are explicitly expressed, such as polarity terms, affect words and their co-occurrence frequencies. However, opinions and sentiments are often conveyed implicitly through latent semantics, which makes purely syntactical approaches ineffective.
Concept-level sentiment analysis focuses on a semantic analysis of text through the use of web ontologies or semantic networks, which allow the aggregation of conceptual and affective information associated with natural language opinions. By relying on external knowledge, such approaches step away from blind use of keywords and word co-occurrence counts, and instead rely on the implicit features associated with natural language concepts. Unlike purely syntactical techniques, concept-based approaches can also detect sentiments that are expressed in a subtle manner, e.g., through the analysis of concepts that do not explicitly convey any emotion, but which are implicitly linked to other concepts that do so.

The bag-of-concepts model can represent semantics associated with natural language much better than bags-of-words. In the bag-of-words model, in fact, a concept such as cloud computing would be split into two separate words, disrupting the semantics of the input sentence (in which, for example, the word cloud could wrongly activate concepts related to weather).

The analysis at concept-level allows for the inference of semantic and affective information associated with natural language text and, hence, enables comparative fine-grained feature-based sentiment analysis. Rather than gathering isolated opinions about a whole item (e.g., iPhone5), users are generally more interested in comparing different products according to specific features (e.g., iPhone5's vs Galaxy S3's touchscreen), or even sub-features (e.g., fragility of iPhone5's vs Galaxy S3's touchscreen). In this context, the construction of comprehensive common and common-sense knowledge bases is key for feature-spotting and polarity detection, respectively. Common-sense, in particular, is necessary to properly deconstruct natural language text into sentiments, for example, to appraise the concept small room as negative for a hotel review and small queue as positive for a post office, or the concept go read the book as positive for a book review but negative for a movie review.

TUTORIAL PROGRAM

- Introduction (5 mins)
- New Avenues in Sentiment Analysis Research
- From Heuristics to Discourse Structure (5 mins)
- From Coarse to Fine-Grained Analysis (5 mins)
- From Keywords to Concepts (10 mins)
- Concept-Level Models
- Knowledge acquisition models (10 mins)
- Emotion categorization models (10 mins)
- Vector space models (10 mins)
- Concept-Level Techniques
- Analogical reasoning (10 mins)
- Parallel analogy (10 mins)
- Spreading activation (10 mins)
- Concept-Level Tools
- Sentiment resources (15 mins)
- Common knowledge repositories (15 mins)
- Aspect mining and polarity detection (10 mins)
- Building a Concept-Level Opinion-Mining Engine
- Semantic parsing (15 mins)
- Sentic API (15 mins)
- Application Samples (20 mins)
- Conclusion (5 mins)

IMPACT AND RELEVANCE

The World Wide Web Conference is a global event bringing together key researchers, innovators, decision-makers, technologists, and business experts trying to make meaning out of Web data. Within this research and business area, opinion mining and sentiment analysis have become increasingly important subtasks in recent years. However, there are still many challenges, including social information understanding and integration, that need to be addressed. For these reasons, a tutorial on concept-level sentiment analysis is strongly relevant to WWW'14.

TARGET AUDIENCE AND PREREQUISITES

The target audience includes researchers and professionals in the fields of sentiment analysis, Web data mining, and related areas. The tutorial also aims to attract researchers from the industry community, as it covers research efforts for the development of applications in fields such as commerce, tourism, education, and health.
The audience is expected to have basic computer science skills, but psychologists and sociologists are also very welcome. The tutorial not only covers state-of-the-art approaches to concept-level sentiment analysis, but also provides information about techniques and tools to be used for practical opinion mining.

ABOUT THE TUTOR

Erik Cambria received his BEng and MEng with honors in Electronic Engineering from the University of Genova, in 2005 and 2008 respectively. In 2011, he was awarded a PhD in Computing Science and Mathematics, following the completion of an industrial Cooperative Awards in Science and Engineering (CASE) research project, funded by the UK Engineering and Physical Sciences Research Council (EPSRC), which was born from the collaboration between the University of Stirling and the MIT Media Laboratory. Today, Erik is the lead investigator of a MINDEF-funded project on Commonsense Knowledge Representation & Reasoning at the National University of Singapore (Temasek Laboratories) and an associate researcher at the MIT Media Laboratory (Synthetic Intelligence Project). His interests include AI, Semantic Web, KR, NLP, opinion mining and sentiment analysis, affective and cognitive modeling, intention awareness, HCI, and e-health. Erik also chairs several international conferences, e.g., Extreme Learning Machines (ELM), and workshop series, e.g., ICDM SENTIRE. He is on the editorial board of Springer Cognitive Computation and he is a guest editor of many other leading AI journals. Erik is also a fellow of the Brain Sciences Foundation, the Chinese Academy of Sciences, National Taiwan University, Microsoft Research Asia, and HP Labs India.
------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ATALA d?cline toute responsabilit? concernant le contenu des messages diffus?s sur la liste LN ------------------------------------------------------------------------- From hamon at LIMSI.FR Sat Feb 8 10:51:47 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Sat, 8 Feb 2014 11:51:47 +0100 Subject: Stage: Stage de Master 2, Analyse des citations et des enumerations dans les fora de sante, Toulouse Message-ID: Date: Wed, 05 Feb 2014 16:43:23 +0100 From: Mustapha Mojahid Message-ID: <52F25C1B.2050808 at irit.fr> X-url: http://www.msh-m.fr/programmes/programmes-2013/parlons-de-nous Titre : Analyse des citations et des ?num?rations dans les fora de sant?. *Lieu : *Laboratoire IRIT (Institut de Recherche en Informatique de Toulouse), Universit? Paul Sabatier, Equipe ELIPSE (Etude de L'Interaction Personne Syst?mE). Contexte et probl?matique Ce sujet du stage se situe dans le cadre d'un projet national ? Parlons de nous ? (http://www.msh-m.fr/programmes/programmes-2013/parlons-de-nous) o? l'on ?tudie des fora de sant? pour tenter de r?pondre ? la question ? ? quoi pensent les patients ? ? et du projet interMSH qui lui fait suite. Les forums de sant? sont des lieux o? les patients ?changent de nombreux points de vues, conseils, o? ils s'interrogent et discutent, et ce dans un contexte bien diff?rent des t?tes ? t?tes men?es avec les professionnels de sant?, qui ont lieu dans une dur?e et un lieu contraint. 
Le relatif anonymat des ?changes, l'implication r?currente de certains dans les forums, les questions d?battues peuvent ?tre r?v?latrices de points de vue, de connaissances ou m?connaissances d'informations m?dicales, d'alertes, en provenance du grand public. Dans ce contexte ?minemment subjectif, la caract?risation et la compr?hension des perceptions dans les fils de discussions des forums est difficile, mais aussi particuli?rement int?ressante et instructive dans une perspective d'am?lioration des programmes de sant? publique. *Objectif du projet* Un des objectifs du projet est de d?velopper une plateforme pour aider des chercheurs (linguistes, sociologues et psychologues) et des m?decins ? observer certains comportements dans les fils de discussions dans des fora de sant?. Dans le cadre d'un travail de th?se en informatique lanc? cette ann?e sur le sujet du contexte et des informations m?dicales, nous avons initi? cette plateforme. La plateforme vise ? proposer une interface qui permet de repr?senter et visualiser sch?matiquement les fils discussions (ou extraits de ces fils de discussions) de forums au travers de traits et crit?res que les chercheurs vont choisir. On esp?re ainsi pouvoir associer ? certains sch?mas de discussions une qualit? informationnelle du fil ?tudi? (exemple : une discussion qui diverge du th?me initial et se recentre entre 2 personnes habitu?es devient peut ?tre un apart? hors sujet). Les crit?res actuellement pris en compte sont de nature contextuelle (les profils des utilisateurs (?ge, sexe), le temps, nombres d'interventions, la longueur des ?changes, les micro-?changes...). Nous souhaitons travailler sur l'exploration d'autres crit?res qui prendront en consid?ration des traitements linguistiques des discussions afin de disposer d'un jeu de vues sur les discussions. Nous souhaitons ?valuer si ces informations permettent de r?pondre ? notre hypoth?se de caract?risation des fils de discussion. 
*Internship objective*

The internship has two objectives:

1. Explore linguistic leads already identified as promising, such as:
   - studying the use of quotations: forum users quote and reply to one another a great deal over the course of a discussion, and visualizing these interconnections within a thread might give us a broader understanding of it;
   - identifying enumerations in discussion threads (which discursive cues and which cues of material layout can be detected, and what can be inferred from them?);
   - analysing, with existing medical terminologies, the semantic proximity between the different posts of a thread.
2. Devise strategies for coupling text-architecture cues with context-related ones (user profile, topic addressed, status of the message in the discussion, etc.). This work will be carried out in collaboration with the PhD student.

By manipulating gauges built from these criteria in the platform, we want to observe and characterize how answers are constructed. Are answers provided by regulars or by novices? Can some discussions be set aside (vulgarity, scattered focus...) or, conversely, can content that is informative or even crucial be anticipated? What twists occur? What refocusing? We will concentrate on a subset of these needs. The planned "configurable" platform must allow researchers and physicians to observe stereotyped behaviours and "discussion histories".

*Prospects*

Two PhD theses are possible as a continuation of this Master's topic: (1) on the study of enumerations in school history and geography textbooks; (2) on improving textual accessibility for blind people.

*Internship arrangements*

Supervisors:
Lydia-Mai Ho-Dac, CLLE-ERSS, Université Toulouse le Mirail
Nathalie Souf, IRIT-ELIPSE, Université
Paul Sabatier and ISIS Castres
Mustapha Mojahid, IRIT-ELIPSE, Université Paul Sabatier

Duration: 5-6 months.
Stipend: the amount set by regulation, namely 436.05 EUR per month.
Start: from March-April 2014.

*Candidate profile*

The candidate must be enrolled in a Master 2 programme in natural language processing.

*Required skills*

- skills in natural language processing and/or corpus linguistics;
- basic computing skills and, ideally, command of corpus analysis tools and/or programming languages such as Perl and Python.

*How to apply*

Send a CV (with details of courses and grades for both years of the Master's) and a cover letter to: Mustapha.Mojahid at irit.fr

From hamon at LIMSI.FR Sat Feb 8 11:27:00 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 12:27:00 +0100
Subject: Thesis: Inga Gheorghita, Automatic construction of semantic hierarchies from the TLFi
Message-ID:

Date: Fri, 07 Feb 2014 23:24:56 +0100
From: Inga Gheorghita
Message-ID:

Hello,

I am pleased to invite you to the defence of my doctoral thesis in computational linguistics, entitled "Automatic construction of semantic hierarchies from the Trésor de la langue française informatisé (TLFi): application to image indexing and retrieval".
The thesis will be presented and defended on Monday 17 February 2014 at 9:30 at the Université de Lorraine, Campus Lettres et Sciences Humaines et Sociales, CNRS-ATILF building, Imbs room. The defence will be followed by the traditional reception, to which you are all warmly invited.

=======================
Jury:
=======================
Béatrice DAILLE, Professor at the Université de Nantes (reviewer)
Brigitte GRAU, Professor at ENSIIE (reviewer)
Cyril MARCH, engineer, Managing Director of Xilopix (invited member)
Jean-Marie PIERREL, Professor at the Université de Lorraine (thesis supervisor)
Alain POLGUÈRE, Professor at the Université de Lorraine (examiner)
Yannick TOUSSAINT, Research Scientist (HDR) at INRIA (examiner)

==========
Abstract:
==========
The main objective of this thesis is to show that lexical information drawn from a general-language dictionary such as the Trésor de la langue française informatisé (TLFi) can improve image indexing and retrieval. The difficulty in using such a resource is that it is not formalized enough to be exploited directly in this application domain. To address this, we first propose an approach for automatically building semantic hierarchies from the TLFi. We define a quantitative (measurable) and comparable characteristic of the nouns occurring in lexicographic definitions, by means of a weighting formula that selects the noun of maximal weight as the hypernym of a given TLFi lexeme, and then propose an algorithm for automatically building semantic hierarchies for the lexemes of the TLFi's vocables. Once the approach has been validated through manual evaluations, we show, in a second step, that the semantic hierarchies obtained from the TLFi can be used to enrich a manually built thesaurus and to index images automatically from their associated textual descriptions. We also show that exploiting such a resource in image retrieval improves search precision by structuring the results according to the domains to which the concepts of the query may refer. A prototype allowed us to evaluate and validate the proposed approaches.

==========
Keywords:
==========
semantic hierarchies, lexical resources, weighting, general-language dictionary, TLFi, image indexing and retrieval, thesaurus

Looking forward to seeing you there,
Inga Gheorghita

From hamon at LIMSI.FR Sat Feb 8 10:53:09 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sat, 8 Feb 2014 11:53:09 +0100
Subject: Call: ACFAS Colloquium 635, Natural Languages, Computer Science and Cognitive Sciences
Message-ID:

Date: Wed, 5 Feb 2014 12:04:51 -0500
From: Fatiha Sadat
Message-ID:
X-url: http://www.acfas.ca/evenements/congres/programme_preliminaire/82/600/635/C

Call for
communications - SUBMISSION DEADLINE EXTENDED TO 14 February 2014

This colloquium deals with computational linguistics, and more particularly with natural language processing (NLP; TALN/TAL in French) and its interaction with the cognitive sciences. The field lies at the crossroads of linguistics, artificial intelligence and the cognitive sciences in their research on natural language. The colloquium targets the various applications of NLP in monolingual, bilingual or multilingual settings, such as the development of the methodologies needed to build and enrich linguistic resources. All aspects of NLP are considered, in particular theoretical work aiming to describe linguistic phenomena with computational models; the development of software that supports linguists in their research, such as corpus studies; and the development of software and resources for using, processing, organizing and preserving written or spoken linguistic data. Presenters will be asked to highlight the aspects related to natural language processing, whatever the topic addressed, as well as their relevance to the cognitive sciences.

More specifically, we propose the following topics:
- computational models of natural languages,
- large-scale semantic networks,
- cognitive and psychological models of natural languages,
- evolution of natural languages,
- discourse processing,
- pragmatics, cognitive and social processes of languages,
- automatic summarization and information extraction,
- interfaces and dialogue systems,
- machine translation,
- processing of multilingual resources,
- construction and enrichment of NLP tools and resources,
- corpus linguistics,
- data mining.
Each submitted proposal must include the speaker's surname and first name, position, affiliation and e-mail address. In addition, an abstract of at most 1500 characters (including spaces), with a title of no more than 180 characters (including spaces), is required. Proposals must be sent as a Word or RTF document.

*SCHEDULE*
Submission deadline: 14 February 2014 (extended)
Notification to authors: 25 February 2014
Colloquium date: 13 May 2014

*SUBMISSION PROCEDURE*
Papers must be written in French and submitted through the free EasyChair conference system at the following link: https://www.easychair.org/conferences/?conf=lnisc2014

Organizers:
Fatiha SADAT, UQAM - Université du Québec à Montréal
Alexandre BLONDIN MASSÉ, UQAC - Université du Québec à Chicoutimi
Stevan HARNAD, UQAM - Université du Québec à Montréal

This call is available at: http://www.acfas.ca/evenements/congres/programme_preliminaire/82/600/635/C

From hamon at LIMSI.FR Tue Feb 11 20:50:18 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:50:18 +0100
Subject: Call: YSIP
Message-ID:

Date: Sat, 08 Feb 2014 17:45:54 +0100
From: Bertram Fronhöfer
Message-ID: <52F65F42.8080406 at tu-dresden.de>
X-url: http://infocom.iitit.ru/
X-url: http://ysip.computational-logic.org/

The Institute for Information Processing and Telecommunication of the North-Caucasus Federal University and the International Center for Computational Logic of the Technische Universität Dresden are jointly organizing the Young Scientists' International Workshop on Trends in Information Processing (YSIP) at the North-Caucasus Federal University in Stavropol from April 22 until April 25, as part of the International Conference on Infocommunicational Technologies in Science, Production and Education (INFOCOM-6).

We invite young scientists (typically master or PhD students) to submit new scientific results (for example, obtained in their bachelor thesis, project work, master thesis, or PhD project) in all areas of Information Processing.
In particular, papers are welcome in, but are not limited to, the following areas:
* Semantic Methods
* Semantic Web
* Languages for Semantic Information Processing
* Applications of Semantic Information Processing
* Formal Concept Analysis
* Big Data
* Information Retrieval
* Question-Answering Systems
* Information Security
* Problem Solving
* Knowledge Representation and Reasoning
* Common Sense Reasoning
* Machine Learning
* Pattern Recognition

Important Dates:
28 February 2014: Deadline of Submission
15 March 2014: Information about Acceptance or Rejection
20 March 2014: Registration
31 March 2014: Submission of Final Version
22-25 April 2014: Workshop

For more information see: http://ysip.computational-logic.org/

From hamon at LIMSI.FR Tue Feb 11 20:41:48 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:41:48 +0100
Subject: Call: TKE 2014, extended deadline
Message-ID:

Date: Sat, 8 Feb 2014 15:15:44 +0100
From: Laurent Romary
Message-Id:
X-url: http://tke2014.coreon.com

+++ Extended deadline +++

TKE 2014: Ontology, terminology & text mining - Call for papers and workshop proposals

The 11th international conference on Terminology and Knowledge Engineering, TKE 2014, is organised by GTW and DIN Deutsches Institut für Normung e. V. in cooperation with Inria, Coreon, Cologne University of Applied Sciences and Copenhagen Business School.
The theme of TKE 2014 will be "Ontology, terminology & text mining".

Venue: DIN Deutsches Institut für Normung e. V., Am DIN-Platz, Burggrafenstr. 6, 10787 Berlin

Dates:
- Main conference: 19th-20th June 2014
- Post-conference workshops: 21st June 2014

See more information on the conference web site: http://tke2014.coreon.com.

Important dates:
[NEW] 1st March 2014: Submission of papers and workshop proposals
31st March 2014: Notification of acceptance of papers and workshop proposals
15th May 2014: Final versions for the proceedings

Submissions should be anonymous and should be made on the web site: http://tke2014.sciencesconf.org. See information on format for papers on http://tke2014.coreon.com.

From hamon at LIMSI.FR Tue Feb 11 20:54:36 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:54:36 +0100
Subject: Call: Workshop Affect, Compagnon Artificiel, Interaction (WACAI) 2014
Message-ID:

Date: Mon, 10 Feb 2014 14:33:10 +0100
From: Magalie Ochs
Message-ID: <52F8D516.8050508 at telecom-paristech.fr>
X-url: http://wacai14.litislab.fr/

*Call for contributions: French-language workshop Affect, Compagnon Artificiel, Interaction (WACAI) 2014, 30 June and 1 July,
Rouen*
http://wacai14.litislab.fr/

The aim of the WACAI 2014 workshop is to bring together ongoing research and development on /Embodied Conversational Agents (Agents Conversationnels Animés, ACA)/ and /social robots/. The meeting focuses in particular on the study, modelling, development and evaluation of the interaction of interactive systems with their partners (human or artefactual). At the interface between engineering sciences and the humanities and social sciences, WACAI's ambition is to grasp person-system interaction in all its complexity (biological, linguistic, social, cultural and emotional).

After the previous biennial editions of the WACA workshop, held successively in Grenoble (2005), Toulouse (2006), Paris (2008) and Lille (2010), and WACAI in Grenoble (2012), this new edition will take place in Rouen on 30 June and 1 July 2014.

Contributions (in French or English) are expected in three forms:
- scientific papers (max. 6 pages);
- review or state-of-the-art papers (max. 8 pages), in particular on the links between the shared issues and the specific features of the ACA and robotics communities;
- descriptions of ongoing implementations or experiments (2 pages);
in the areas of /Affective Computing/, /Social Signal Processing/, /Embodied Conversational Agents/ and /Social Robots/.
*Important dates*
*Paper submission: 23 April 2014*
Notification to authors: 14 May 2014
Final version: 28 May 2014

From hamon at LIMSI.FR Tue Feb 11 20:59:29 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:59:29 +0100
Subject: Call: ESWC 2014, Challenge Semantic Publishing
Message-ID:

Date: Mon, 10 Feb 2014 15:22:49 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20140210142309.BD95211973C at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemPub

** apologies for cross-posting **

==== Second Call for Challenge: Semantic Publishing ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemPub
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemPub

** NEWS: ** an open in-use task has been added!
11th Extended Semantic Web Conference (ESWC) 2014
Dates: May 25 - 29, 2014
Venue: Anissaras, Crete, Greece
Hashtag: #eswc2014
Feed: @eswc_conf
Site: http://2014.eswc-conferences.org
General Chair: Valentina Presutti (STLab, ISTC-CNR, IT)
Challenge Coordinator: Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)
Challenge Chairs:
- Angelo Di Iorio (Department of Computer Science and Engineering, University of Bologna, IT)
- Christoph Lange (Enterprise Information Systems, University of Bonn / Fraunhofer IAIS, DE)

MOTIVATION AND OBJECTIVES

Scholarly publishing is increasingly enabling a new wave of applications that better support researchers in disseminating, exploiting and evaluating their results. The potential of publishing scientific papers enriched with semantic information is huge and raises interesting and challenging issues. Semantic Web technologies play a central role in this context, as they can help publishers to make scientific results available in an open format that the whole research community can benefit from.

The Semantic Publishing Challenge 2014 is intended to be the first in a series of events at ESWC for producing and exploiting semantic publishing data. The main focus this year is on extracting information and using this information to assess the quality of scientific productions.

Linked open datasets about scientific production exist (e.g. DBLP), but they usually cover only basic bibliographic information, which is not sufficient to assess quality. Quality-related information is often hidden and not yet available as LOD. There is also a growing interest in alternative forms of publishing scientific data as (semantic) datasets that can be more easily shared, linked to each other, and reasoned on. Alternative metrics for scientific impact are also gaining relevance. We are seeking the most innovative and impactful applications in this emerging context.

TARGET AUDIENCE

The Challenge is open to everyone from industry and academia.
TASKS

The Challenge includes three tasks. Participants can take part in as many tasks as they like.

= Extraction Tasks =

We ask challengers to automatically annotate a set of multi-format and multi-source input documents and to produce a Linked Open Dataset that fully describes these documents, their context, and relevant parts of their content. The evaluation will run a set of queries against the produced dataset to assess its correctness and completeness.

The input dataset will be split in two parts: a training/testing part and an evaluation part, which will be disclosed a few days before the submission deadline. Participants will be asked to run their tool on the evaluation dataset and to produce the final Linked Open Dataset.

== Task 1: Extraction and assessment of workshop proceedings information ==

Participants are required to extract information from a set of HTML tables of contents of selected computer science workshop proceedings published with the CEUR-WS.org open access service. These tables of contents partly include microformat and RDFa annotations but are not necessarily valid HTML. The extracted information is expected to answer queries about the quality of these workshops, for instance by measuring their growth, longevity, connection with other events, and distribution of papers and authors.

== Task 2: Extraction and characterization of citations ==

Participants are required to extract information about the citations in scientific journals and their relevance. Input documents are in XML JATS and TaxPub, an official extension of JATS customized for taxonomic treatments, and are selected from the PubMedCentral Open Access Subset and the Pensoft Biodiversity Data Journal and ZooKeys archive. The extracted information is expected to be used for assessing the value of citations, for instance by considering their position in the paper, their co-location with other citations or their purpose.
== In-use Task 3: Semantic technologies in improving scientific production ==

Participants are asked to submit demos that showcase the potential of Semantic Web technology for enhancing and assessing the quality of scientific production. The task has a completely open structure and is, in particular, independent from tasks 1 and 2: participants are free to decide which tool to show and which dataset to use.

The evaluation will be different from other tasks and will consist of two phases: after a first round of review, a number of submissions will be invited to demo their work at ESWC. The final decision will be taken at the Conference by a jury formed of PC members present at the event and other invited experts. Further details are available at: http://challenges.2014.eswc-conferences.org/index.php/SemPub/Task3

EVALUATION

= Extraction Tasks 1 and 2 =

Participants will be requested to submit the LOD that their tool produces from the evaluation dataset, as well as a paper that describes their approach. They will also be given a set of queries in natural language form and will be asked to translate those queries into a SPARQL form that works on their LOD. The results of the queries on the produced LOD will be compared with the expected output, and precision and recall will be measured to identify the best performing approach. Separately, the most original approach will be selected by the Program Committee.

= In-use Task 3 =

Participants are required to submit a paper describing their approach, as for tasks 1 and 2, and a demo version of the tool (open source appreciated but not mandatory). The evaluation will consist of two phases: after a first round of review, a number of submissions will be invited to demo their work at ESWC. The final decision will be taken at the Conference by a jury formed of PC members present at the event and other invited experts.
The winner will be selected according to its potential impact, originality, breakthrough, the quality of the demo, and the appropriateness for ESWC.

Further details about the evaluation are provided on the challenge wiki.

FEEDBACK AND DISCUSSION

A discussion group is open for participants to ask questions and to receive updates about the challenge (see link at bottom). Participants are invited to subscribe to this group as soon as possible and to communicate their intention to participate. They are also invited to use this channel to discuss problems in the input dataset and to suggest changes.

JUDGING AND PRIZES

The Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work. Submissions accepted for presentation will receive constructive reviews from the Program Committee, will be included in the Springer LNCS post-proceedings of ESWC, and will also have a presentation slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a revised and extended paper to a dedicated Semantic Web Journal special issue.

Five winners will be selected. For each of Tasks 1 and 2 we will select:
* best performing tool, given to the paper which will get the highest score in the evaluation
* most original approach, selected by the Challenge Committee with the reviewing process

The winner of Task 3 will be selected by the jury according to its potential impact, originality, breakthrough, the quality of the demo, and the appropriateness for ESWC.

Winners will be selected only for tasks with at least 3 participants. In any case all submissions will be reviewed and, if accepted, published in ESWC post-proceedings.

An amount of 700 Euro has already been secured for the final prize. We are currently working on securing further funding.
HOW TO PARTICIPATE

Participants are required to submit:
* Abstract: no more than 200 words.
* Description: It should explain the details of the automated annotation system, including why the system is innovative, how it uses Semantic Web technology, what features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. An outlook towards how the data could be consumed is appreciated but not strictly required. Papers must be submitted in PDF format, following the style of Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and must not exceed 5 pages in length.

Submissions for task 1 and task 2 also have to include:
* The Linked Open Dataset produced by the tool on the evaluation dataset (as a file or as a URL, in Turtle or RDF/XML).
* A set of SPARQL queries that work on that LOD and correspond to the natural language queries provided as input.
* Participants will also be asked to submit their tool (source and/or binaries, or a link these can be downloaded from, or a web service URL) for verification purposes.

Submissions for the in-use task 3 have to include:
* A demo version of the tool. The demo must be made available along with the paper submission, but participants are allowed to refine it until the presentation at ESWC-14.

All paper submissions should be provided via EasyChair: https://www.easychair.org/conferences/?conf=eswc2014-challenges

MAILING LIST

We invite potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge.
https://lists.sti2.org/mailman/listinfo/eswc2014-sempub-challenge

IMPORTANT DATES

* December 3, 2013: Publication of the full description of the extraction tasks 1 and 2, rules and queries; publication of the training/testing dataset
* January 31, 2014, 23:59 CET: Deadline for making remarks to the task 1 and 2 training/testing datasets
* February 5, 2014: Publication of the final task 1 and 2 training/testing datasets
* March 7, 2014, 23:59 CET: Abstract submission
* March 11, 2014: Publication of the task 1 and 2 evaluation dataset
* March 14, 2014, 23:59 CET: Submission due
* April 9, 2014, 23:59 CET: Notification of acceptance
* May 27-29, 2014: Demo at ESWC-14, and winner selection

PROGRAM COMMITTEE

Soren Auer (University of Bonn / Fraunhofer IAIS, DE) (supervisor)
Chris Bizer (University of Mannheim, DE)
Sarven Capadisli (University of Leipzig, DE)
Alexander Constantin (University of Manchester, UK)
Jeremy Debattista (University of Bonn / Fraunhofer IAIS, DE)
Alexander Garcia Castro (Florida State University, US)
Leyla Jael Garcia Castro (Bundeswehr University of Munich, DE)
Paul Groth (VU University of Amsterdam, Netherlands)
Rinke Hoekstra (VU University of Amsterdam, Netherlands)
Aidan Hogan (DCC, Universidad de Chile)
Evangelos Milios (Dalhousie University, CA)
Lyubomir Penev (Pensoft Publishers, BG)
Robert Stevens (University of Manchester, UK)
Jun Zhao (Lancaster University, UK)

We are inviting further members.
From hamon at LIMSI.FR Tue Feb 11 21:11:22 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:11:22 +0100
Subject: Call: ECML PKDD 2014 (Reminder)
Message-ID:

Date: Mon, 10 Feb 2014 21:42:19 +0100 (CET)
From: Amedeo Napoli
Message-ID: <1665645114.3459231.1392064939712.JavaMail.root at loria.fr>
X-url: http://www.ecmlpkdd2014.org/

Calls -- ECML PKDD 2014 (Reminder)

Apologies for multiple postings

-----------------------------------
ECML PKDD 2014
September 15-19 2014, Nancy France
http://www.ecmlpkdd2014.org/
-----------------------------------

-----
CALLS
-----

Call For Workshops (workshops at ecmlpkdd2014.org)
  Workshop proposal deadline: 2 March 2014
  Workshop acceptance notification: 21 March 2014

Call For Tutorials (tutorials at ecmlpkdd2014.org)
  Tutorial proposal deadline: 7 March 2014
  Tutorial acceptance notification: 28 March 2014

Call For Papers (pcchairs at ecmlpkdd2014.org)
  Abstract submission deadline: 12 April 2014
  Paper submission deadline: 16 April 2014

Call For Demos (demos at ecmlpkdd2014.org)
  Demo submission deadline: 16 May 2014
  Notification of acceptance: 15 June 2014

Call For Nectar Track (nectar at ecmlpkdd2014.org)
  Submission deadline: 14 June 2014
  Notifications of acceptance: 28 June 2014

From hamon at LIMSI.FR Tue Feb 11 21:16:27 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:16:27 +0100
Subject: Call: COLING 2014 Update, Tutorials & Workshops Announced, Paper Submission
Message-ID:
Date: Tue, 11 Feb 2014 12:25:34 +0000 From: John Judge Message-ID: <52FA16BE.5000801 at computing.dcu.ie> X-url: http://www.coling-2014.org

********** Apologies for cross-posting **********

COLING 2014
Dublin, Ireland, 23-29 August, 2014

COLING 2014 (http://www.coling-2014.org), the 25th International Conference on Computational Linguistics, will be organised by CNGL (Centre for Global Intelligent Content) at the Helix Convention Centre at Dublin City University (DCU) from 23 to 29 August 2014. The COLING conference is organised under the auspices of the International Committee on Computational Linguistics (ICCL).

Accepted Tutorials and Workshops

We are pleased to announce that the workshops and tutorials for COLING 2014 have been finalised. In total there will be 6 tutorials and 18 workshops covering a wide range of topics. You will be able to register for tutorials and workshops when the main conference registration opens.

Accepted Tutorials:
- Dependency Parsing: Past, Present, and Future - Wenliang Chen, Zhenghua Li and Min Zhang
- Using Neural Networks for Modeling and Representing Natural Languages - Tomas Mikolov
- Multilingual Word Sense Disambiguation and Entity Linking - Roberto Navigli and Andrea Moro
- Selection Bias, Label Bias, and Bias in Ground Truth - Anders Søgaard, Barbara Plank and Dirk Hovy
- Automated Grammatical Error Correction for Language Learners - Joel Tetreault and Claudia Leacock
- Biomedical/clinical NLP - Ozlem Uzuner and Meliha Yetisgen

Accepted Workshops:
- Workshop on Lexical and Grammatical Resources for Language Processing
- First Joint Workshop on Statistical Parsing of Morphologically Rich Languages and Syntactic Analysis of Non-Canonical Languages (SPMRL-SANCL)
- Celtic Language Technology Workshop (CLTW)
- 5th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP)
- Synchronic and Diachronic Approaches to Analyzing Technical Language (SADAATL)
- The 3rd Workshop on Vision and Language (VL'14)
- First Joint Workshop on Multidisciplinary Approaches to Big Social Data Analysis and Social NLP (MABSDA-SocialNLP)
- First Joint Workshop on Open Infrastructures for HLT Resource Processing and Development and Text Analysis Frameworks
- Cognitive Aspects of the Lexicon (CogALex-IV)
- Semantic Web and Information Extraction (SWAIE)
- VarDial: Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects (VarDial)
- SemEval-2014: Semantic Evaluation Exercises - International Workshop on Semantic Evaluation (SemEval)
- The AHA!-Workshop on Information Discovery in Text (AHA!)
- CompuTerm 2014. 4th International Workshop on Computational Terminology (CompuTerm)
- The First Workshop on Computational Approaches to Compound Analysis (ComAComA)
- Automatic Text Simplification - Methods and Applications in the Multilingual Society (ATS-MA)
- The 8th Linguistic Annotation Workshop (The LAW VIII)
- The third Joint Conference on Lexical and Computational Semantics (*SEM-2014)

Call for Papers:

The first call for papers for the main conference is open. The paper submission system is now open at https://www.softconf.com/coling2014/main/. Papers should be prepared using the paper templates available here http://www.coling-2014.org/doc/coling2014.zip. We invite authors to submit papers in all relevant areas and encourage authors to include analysis of the influence of theories (intuitions, methodologies, insights) on technologies (computational algorithms, methods, tools, data), and/or contributions of technologies to theory development. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome. For more information and instructions for authors see the preliminary call for papers here http://www.coling-2014.org/call-for-papers.php.
Sponsorship and Promotion Opportunities With the tremendous impact and growth of COLING over many years, we have set out a comprehensive offering of corporate sponsorship and support opportunities for industry participants. The COLING 2014 organising committee is committed to working in close partnership with corporate supporters to maximise the benefits of sponsorship and provide opportunities for sponsors to engage appropriately with COLING delegates. With this in mind, if you wish to explore additional options for support or means of engagement with the COLING 2014 community please feel free to contact sponsorship at coling-2014.org.
From hamon at LIMSI.FR Tue Feb 11 20:57:26 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 11 Feb 2014 21:57:26 +0100 Subject: Appel: ESWC 2014, Challenge Linked Open Data-enabled Message-ID: Date: Mon, 10 Feb 2014 15:09:59 +0100 (CET) From: speroni at cs.unibo.it Message-Id: <20140210141019.5E85E119754 at vina.cines.fr> X-url: http://challenges.2014.eswc-conferences.org/RecSys ** apologies for cross-posting ** = Second Call for Challenge: Linked Open Data-enabled Recommender Systems = Challenge Website: http://challenges.2014.eswc-conferences.org/RecSys Call Web page: http://2014.eswc-conferences.org/important-dates/call-RecSys 11th Extended Semantic Web Conference (ESWC) 2014 Dates: May 25 - 29, 2014 Venue: Anissaras, Crete, Greece Hashtag: #eswc2014 Feed: @eswc_conf Site: http://2014.eswc-conferences.org General Chair: Valentina Presutti (STLab, ISTC-CNR, IT) Challenge Coordinator: Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR) Challenge Chairs: - Tommaso Di Noia (Polytechnic University of Bari, IT) - Ivan Cantador (Universidad Autonoma de Madrid, ES) MOTIVATION AND OBJECTIVES People generally need more and more advanced tools that go beyond those implementing the canonical search paradigm for seeking relevant information. A new search paradigm is emerging, where the user perspective is completely reversed: from finding to being found. Recommender systems may help to support this new perspective, because they have the effect of pushing relevant objects, selected from a large space of possible options, to potentially interested users. To achieve this result, recommendation techniques generally rely on data referring to three kinds of objects: users, items and their relations.
Recent developments in the Semantic Web community offer novel strategies to represent data about users, items and their relations that might improve the current state of the art of recommender systems, moving towards a new generation of recommender systems that fully understand the items they deal with. More and more semantic data are published following the Linked Data principles, which make it possible to set up links between objects in different data sources by connecting information in a single global data space: the Web of Data. Today, the Web of Data includes different types of knowledge represented in a homogeneous form: sedimentary knowledge (encyclopedic, cultural, linguistic, common-sense) and real-time knowledge (news, data streams, ...). These data might be useful to interlink diverse information about users, items, and their relations and to implement reasoning mechanisms that can support and improve the recommendation process.

The primary goal of this challenge is twofold. On the one hand, we want to create a link between the Semantic Web and the Recommender Systems communities. On the other hand, we aim to show how Linked Open Data (LOD) and semantic technologies can boost the creation of a new breed of knowledge-enabled and content-based recommender systems.

TARGET AUDIENCE

The target audience is all members of the Semantic Web and Recommender Systems communities, both academic and industrial, who are interested in personalized information access with a particular emphasis on Linked Open Data. During the last ACM RecSys conference more than 60% of participants were from industry. This attests to the current interest in recommender systems for industrial applications ready to be released on the market.

TASKS

* Task 1: Rating prediction in cold-start situations

This task deals with the rating prediction problem, in which a system is requested to estimate the value of unknown numeric scores (a.k.a.
ratings) that a target user would assign to available items, indicating whether she likes or dislikes them. In order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims at predicting ratings in cold-start situations, that is, predicting ratings for users who have only a few past ratings, and predicting ratings of items that have been rated by only a few users. The dataset to use in the task - DBbook - relates to the book domain. It contains explicit numeric ratings assigned by users to books. For each book we provide the corresponding DBpedia URI. Participants will have to exploit the provided ratings as training sets, and will have to estimate unknown ratings in a non-provided evaluation set. Recommendation approaches will be evaluated on the evaluation set by means of a metric that measures the differences between real and estimated ratings, namely the Root Mean Square Error (RMSE).

* Task 2: Top-N recommendation from binary user feedback

This task deals with the top-N recommendation problem, in which a system is requested to find and recommend a limited set of N items that best match a user profile, instead of correctly predicting the ratings for all available items. Similarly to Task 1, in order to favor the proposal of content-based, LOD-enabled recommendation approaches, and limit the use of collaborative filtering approaches, this task aims to generate ranked lists of items for which no graded ratings are available, but only binary ones. Also in this case, the DBbook dataset is used. In this task, the accuracy of recommendation approaches will be evaluated on an evaluation set using the F-measure.

* Task 3: Diversity

A very interesting aspect of content-based recommender systems, and hence of LOD-enabled ones, is that they make it possible to evaluate the diversity of recommended items in a straightforward way.
This is a very popular topic in content-based recommender systems, which usually suffer from over-specialization. In this task, the evaluation will be made by considering a combination of both the accuracy (F-measure) of the recommendation list and the diversity (Intra-List Diversity) of the items belonging to it. Also for this task, the DBbook dataset is used. Given the domain of books, diversity with respect to the two properties http://dbpedia.org/ontology/author and http://purl.org/dc/terms/subject will be considered.

DATASET

* DBbook dataset

This dataset relies on user data and preferences retrieved from the Web. The books available in the dataset have been mapped to their corresponding DBpedia URIs. The mapping contains 8170 DBpedia URIs. These mappings can be used to extract semantic features from DBpedia or other LOD repositories to be exploited by the recommendation approaches proposed in the challenge. The dataset is split into a training set and an evaluation set. In the former, user ratings are provided to train a system, while in the latter, ratings have been removed and will be used in the eventual evaluation step.

The mapping file is available at:
http://sisinflab.poliba.it/semanticweb/lod/recsys/2014challenge/DBbook_Items_DBpedia_mapping.tsv.zip
It contains a tab-separated values file where each line has the following format: DBbook_ItemID \t name \t DBpedia_URI. We suggest extracting semantic descriptions for all the items present in this mapping file by starting from the DBpedia URIs.

The training sets are available at:

* Task 1: http://sisinflab.poliba.it/semanticweb/lod/recsys/2014challenge/DBbook_train_ratings.zip
The archive contains a tab-separated values file containing the training data and a README describing its content. Each line in the file is composed of: userID \t itemID \t rating. The ratings are on a 0-5 scale. The training set contains 75559 ratings. There are 6181 users and 6166 items which have been rated by at least one user.
* Task 2 and Task 3: http://sisinflab.poliba.it/semanticweb/lod/recsys/2014challenge/DBbook_train_binary.zip
The archive contains a tab-separated values file containing the training data and a README describing its content. Each line in the file is composed of: userID \t itemID \t rating. The ratings are on a binary scale: 1 means that the item is relevant for the user, 0 means irrelevant. The training set contains 72372 ratings. There are 6181 users and 6733 items which have been rated by at least one user.

ADDITIONAL DATASETS

Although not used in the challenge, two additional rating datasets linked to DBpedia are provided, namely the well-known MovieLens10M dataset and the Last.fm dataset published at the HetRec'11 workshop.
http://sisinflab.poliba.it/semanticweb/lod/recsys/datasets/
We encourage participants to use these datasets for testing the developed recommendation approaches on several domains.

JUDGING AND PRIZES

After a first round of reviews, the Program Committee and the chairs will select a number of submissions that satisfy the challenge requirements; these will have to be presented at the conference. Submissions accepted for presentation will receive constructive reviews from the Program Committee, and will be included in post-proceedings. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC'14, and will be invited to submit a paper to a dedicated Semantic Web Journal special issue.

For each task we will select:
* the best performing tool, awarded to the paper that achieves the highest score in the evaluation
* the most original approach, selected by the Challenge Program Committee through the reviewing process

An amount of 700 Euro has already been secured for the final prize. We are currently working on securing further funding. Winners will be selected only for tasks with at least 3 participants.
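
As an aside on the training files and metrics described above: the following is a minimal sketch, not the challenge's evaluation service, of how the `userID \t itemID \t rating` rows might be parsed and how RMSE (Task 1) and an F-measure over a top-N list (Task 2) could be computed locally. The function names `parse_ratings`, `rmse` and `f_measure_at_n` are hypothetical, the sample rows are invented, and the F-measure is assumed here to be the balanced F1 for a single user; the official service may aggregate differently.

```python
import math


def parse_ratings(lines):
    """Parse 'userID \\t itemID \\t rating' rows into (user, item, rating) tuples."""
    rows = []
    for line in lines:
        fields = line.strip().split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        try:
            rating = float(fields[2])
        except ValueError:
            continue  # assumption: a non-numeric rating means a header row
        rows.append((fields[0].strip(), fields[1].strip(), rating))
    return rows


def rmse(actual, predicted):
    """Root Mean Square Error over the (user, item) keys of `actual`."""
    if not actual:
        raise ValueError("no ratings to evaluate")
    squared = [(predicted[key] - value) ** 2 for key, value in actual.items()]
    return math.sqrt(sum(squared) / len(squared))


def f_measure_at_n(recommended, relevant, n):
    """Balanced F1 of the first n recommended items against the relevant set."""
    top_n = recommended[:n]
    hits = sum(1 for item in top_n if item in relevant)
    if hits == 0:
        return 0.0
    precision = hits / len(top_n)
    recall = hits / len(relevant)
    return 2 * precision * recall / (precision + recall)


# Invented sample data, for illustration only.
sample = ["u1\tb1\t4", "u1\tb2\t2", "u2\tb1\t5"]
actual = {(user, item): rating for user, item, rating in parse_ratings(sample)}
predicted = {("u1", "b1"): 3.0, ("u1", "b2"): 2.0, ("u2", "b1"): 5.0}
print(rmse(actual, predicted))
print(f_measure_at_n(["b1", "b3", "b2"], {"b1", "b2", "b4"}, n=2))
```

Keying the ratings by (user, item) pairs keeps the comparison independent of row order, which seems convenient when predictions are produced separately from the file that lists the pairs to score.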
In any case, all submissions will be reviewed and, if accepted, published at ESWC post-proceedings. HOW TO PARTICIPATE 1. Make your result submission * Register your group using the registration web form available at: http://193.204.59.20:8181/eswc2014lodrecsys/signup.html * Choose one or more tasks among Task 1, Task 2 and Task 3 * Build your recommender system using the provided training data. * Evaluate your approach by submitting your results using the evaluation service. * Your final score will be the one computed with respect to the last result submission made before March 7, 2014, 23:59 CET. 2. Submit your paper The following information has to be provided: * Abstract: no more than 200 words. * Description: It should contain the details of the system, including why the system is innovative, how it uses Semantic Web, which features or functions the system provides, what design choices were made, and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. Papers must be submitted in PDF format, following the style of the Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and not exceeding 5 pages in length. All submissions should be provided via EasyChair: https://www.easychair.org/conferences/?conf=eswc2014-challenges MAILING LIST We invite the potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge. 
https://lists.sti2.org/mailman/listinfo/eswc2014-recsys-challenge IMPORTANT DATES * March 7, 2014, 23:59 CET: Result submission due * March 14, 2014, 23:59 CET: Paper submission due * April 9, 2014, 23:59 CET: Notification of acceptance * May 27-29, 2014: The Challenge takes place at ESWC'14 EVALUATION COORDINATOR * Vito Claudio Ostuni (Polytechnic University of Bari, IT) PROGRAM COMMITTEE (to be completed) * Pablo Castells, Universidad Autonoma de Madrid, Spain * Oscar Corcho, Universidad Politecnica de Madrid, Spain * Marco de Gemmis, University of Bari Aldo Moro, Italy * Frank Hopfgartner, Technische Universitat Berlin, Germany * Andreas Hotho, Universitat Wurzburg, Germany * Dietmar Jannach, TU Dortmund University, Germany * Pasquale Lops, University of Bari Aldo Moro, Italy * Valentina Maccatrozzo, VU University Amsterdam, The Netherlands * Roberto Mirizzi, Polytechnic University of Bari, Italy * Alexandre Passant, seevl.fm, Ireland * Francesco Ricci, Free University of Bozen-Bolzano, Italy * Giovanni Semeraro, University of Bari Aldo Moro, Italy * David Vallet, NICTA, Australia * Manolis Wallace, University of Peloponnese, Greece * Markus Zanker, Alpen-Adria-Universitaet Klagenfurt, Austria * Tao Ye, Pandora Internet Radio, USA From hamon at LIMSI.FR Tue Feb 11 21:08:25 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 11 Feb 2014 22:08:25 +0100 Subject: Job: Open Position for Graduate Fellowship in Semantic Web Technologies, CNR-STLAB Message-ID: Date: Mon, 10 Feb 2014 16:48:46 +0100 (CET) From: speroni at cs.unibo.it Message-Id: <20140210154907.4FE7D119754 at vina.cines.fr> X-url: http://istc.cnr.it/vacancy/assegno-di-ricerca-n%C2%B0-2092014-tecnologie-semantiche-e-open-data-legovernment-e-la-fruizione- TITLE: Open Position for Graduate Fellowship in Semantic Web Technologies at CNR-STLAB Important notice: send by February 19th to stlab at cnr.it your CV, a motivational statement, and the contact of at least one referee (or a recommendation letter). 
The official procedure for being admitted to the selection is described on the official application post (see below) and the deadline is February 27th. Topic: Theories and methods for knowledge extraction and representation at a web scale and their application to cultural heritage and eGovernment. Type of Grant: Graduate Fellowship Employer: Institute of Cognitive Science and Technologies of CNR Salary: EUR 19.367,00 (nineteen-thousand-three-hundred-sixtyseven/00) net of expenses in charge of CNR. Starting from: April 2014 Duration: 12 months Location: Rome, Italy Official application deadline: 27 February, 2014 Link to the official application post: http://istc.cnr.it/vacancy/assegno-di-ricerca-n%C2%B0-2092014-tecnologie-semantiche-e-open-data-legovernment-e-la-fruizione- Scientific responsible: Dr. Valentina Presutti Contact person (administrative issues): stefania.capotosti at istc.cnr.it Type of Grant: Graduate Fellowship There will be a public selection procedure, based on qualifications and an interview, for the assignment of n. 1 (one) - Graduate Fellowship in order to conduct research related to the Scientific Area Information Sciences AND Computer Sciences at the Institute of Cognitive Sciences and Technologies, CNR, in the scope of the projects: eGovernment, Digital Libraries and Hermes, under the scientific responsibility of Dr. Valentina Presutti. 
The selection is open to individuals who, whatever their nationality or age, meet the following requirements on the date of expiry of the deadline for submission of applications:

a) Degree in Computer Science or Engineering or Literature and Philosophy in accordance with the legislation in force before DM 509/99, or Degree in Computer Science, Engineering, Linguistics or Science of Language (or equivalent) in accordance with the regulations referred to in DM 509/99, or Master's Degree in Computer Science, Engineering, Linguistics or Science of Language (or equivalent) in accordance with the regulations referred to in DM 270/04, with a professional resume suitable for the conduct of research according to the specifications given in the following points (the candidate is responsible, on penalty of exclusion, for demonstrating the equivalence of degree qualifications);

b) All qualifications obtained abroad (bachelor's degree, doctorate, and any other qualification) shall be previously recognized in Italy in accordance with current legislation (information on the website of the Ministry of University and Scientific Research: www.miur.it). The equivalence of diplomas obtained abroad that have not already been recognized in Italy through the formal procedure above will be evaluated, solely for the purposes of the present selection, by the Examining Committee constituted according to art. 6, paragraph 1 of the Regulations;

c) Documented experience of research, development and application of semantic technologies. In particular, expertise is required in at least one of the following areas: ontology design and open data, knowledge representation, and natural language processing;

d) Excellent knowledge of OWL, RDF, and SPARQL;

e) Knowledge of mobile application development platforms;

f) Knowledge and documented experience of Java development; preference will be given to candidates who also know other programming languages;

g) English proficiency.
THE ENGLISH CALL ON THE WEB-SITE DOES NOT HAVE LEGAL VALUE IN ITSELF, AND THUS DOES NOT SUPERSEDE THE ITALIAN VERSION OF THE CALL ANNOUNCEMENT (BANDO).

From hamon at LIMSI.FR Tue Feb 11 20:39:46 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Tue, 11 Feb 2014 21:39:46 +0100 Subject: Appel: INTERSPEECH 2014, Special Sessions Message-ID: Date: Mon, 10 Feb 2014 12:20:47 +0800 From: "Organization @ Interspeech 2014" Message-ID: <52F8539F.8050905 at interspeech2014.org> X-url: http://www.INTERSPEECH2014.org

--- INTERSPEECH 2014 - SINGAPORE
--- September 14-18, 2014
--- http://www.INTERSPEECH2014.org

INTERSPEECH is the world's largest and most comprehensive conference on issues surrounding the science and technology of spoken language processing, both in humans and in machines. The theme of INTERSPEECH 2014 is

--- Celebrating the Diversity of Spoken Languages ---

INTERSPEECH 2014 includes a number of special sessions covering interdisciplinary topics and/or important new emerging areas of interest related to the main conference topics. Special sessions proposed for the forthcoming edition are:

- A Re-evaluation of Robustness
- Deep Neural Networks for Speech Generation and Synthesis
- Exploring the Rich Information of Speech Across Multiple Languages
- INTERSPEECH 2014 Computational Paralinguistics ChallengE (ComParE)
- Multichannel Processing for Distant Speech Recognition
- Open Domain Situated Conversational Interaction
- Phase Importance in Speech Processing Applications
- Speaker Comparison for Forensic and Investigative Applications
- Text-dependent for Short-duration Speaker Verification
- Tutorial Dialogues and Spoken Dialogue Systems
- Visual Speech Decoding

A description of each special session is given below. For paper submission, please follow the main conference procedure and choose the Special Session track when selecting your paper area.
The paper submission procedure is described at: http://www.INTERSPEECH2014.org/public.php?page=submission_procedure.html

For more information, feel free to contact the Special Session Chair, Dr. Tomi H. Kinnunen, at email tkinnu [at]cs.uef.fi

------------------------------------------------------------------------
Special Session Description
------------------------------------------------------------------------

A Re-evaluation of Robustness

The goal of the session is to facilitate a re-evaluation of robust speech recognition in the light of recent developments. It's a re-evaluation at two levels:
- A re-evaluation in perspective, brought about by breakthroughs in performance obtained with Deep Neural Networks, which leads to a fresh questioning of the role and contribution of robust feature extraction.
- A literal re-evaluation on common databases, to be able to present and compare the performance of different algorithms and system approaches to robustness.

Paper submissions are invited on the theme of noise-robust speech recognition; authors are required to report results on the Aurora 4 database to facilitate cross-comparison of performance between different techniques. Recent developments raise interesting research questions that the session aims to help progress by bringing focus to, and exploration of, these issues. For example:
1. What role is there for signal processing to create feature representations to use as inputs to Deep Learning, or can deep learning do all the work?
2. What feature representations can be automatically learnt in a deep learning architecture?
3. What other techniques can give great improvement in robustness?
4. What techniques don't work and why?

The session organizers wish to encourage submissions that bring insight and understanding to the issues highlighted above. Authors are requested not only to present absolute performance of the whole system but also to highlight the contribution made by various components in a complex system.
Authors of papers accepted for the session are encouraged to also evaluate their techniques on new test data sets (available in July) and submit their results at the end of August.

Session organization

The session will be structured as a combination of
1. Invited talks
2. Oral paper presentations
3. Poster presentations
4. Summary of contributions and results on newly released test sets
5. Discussion

Organizers:
David Pearce, Audience, dpearce [at]audience.com
Hans-Guenter Hirsch, Niederrhein University of Applied Sciences, hans-guenter.hirsch [at]hs-niederrhein.de
Reinhold Haeb-Umbach, University of Paderborn, haeb [at]nt.uni-paderborn.de
Michael Seltzer, Microsoft, mseltzer [at]microsoft.com
Keikichi Hirose, The University of Tokyo, hirose [at]gavo.t.u-tokyo.ac.jp
Steve Renals, University of Edinburgh, s.renals [at]ed.ac.uk
Sim Khe Chai, National University of Singapore, simkc [at]comp.nus.edu.sg
Niko Moritz, Fraunhofer IDMT, Oldenburg, niko.moritz [at]idmt.fraunhofer.de
K K Chin, Google, kkchin [at]google.com

Deep Neural Networks for Speech Generation and Synthesis

This special session aims to bring together researchers who work actively on deep neural networks for speech research, particularly in generation and synthesis, to promote and better understand state-of-the-art DNN research in statistical learning, and to compare results with the parametric HMM-GMM model, which is well established for speech synthesis, generation, and conversion. A DNN, with its neuron-like structure, can simulate the human speech production system in a layered, hierarchical, nonlinear and self-organized network. It can transform linguistic text information into intermediate semantic, phonetic and prosodic content and finally generate speech waveforms. Many possible neural network architectures or topologies exist, e.g. feed-forward NN with multiple hidden layers, stacked RBM or CRBM, and Recurrent Neural Net (RNN), which have been used for speech/image recognition and other applications.
We would like to use this special session as a forum to present updated results in the research frontiers, algorithm development and application scenarios. Particular areas of focus will be parametric TTS synthesis, voice conversion, speech compression, de-noising and speech enhancement.

Organizers:
Yao Qian, Microsoft Research Asia, yaoqian [at]microsoft.com
Frank K. Soong, Microsoft Research Asia, frankkps [at]microsoft.com

Exploring the Rich Information of Speech Across Multiple Languages

Spoken language is the most direct means of communication between human beings. However, speech communication often demonstrates language-specific characteristics because of, for instance, linguistic differences (e.g., tonal vs. non-tonal, monosyllabic vs. multisyllabic) across languages. Our knowledge of the diversity of speech science across languages is still limited, including speech perception, linguistic and non-linguistic (e.g., emotion) information, etc. This knowledge is of great significance for the future design of language-specific applications of speech techniques (e.g., automatic speech recognition, assistive hearing devices). This special session will provide an opportunity for researchers from various communities (including speech science, medicine, linguistics and signal processing) to stimulate further discussion and new research in the broad cross-language area, and present their latest research on understanding the language-specific features of speech science and their applications in the speech communication of machines and human beings. This special session encourages contributions from all fields of speech science, e.g., production and perception, but with a focus on presenting the language-specific characteristics and discussing their implications to improve our knowledge of the diversity of speech science across multiple languages. Topics of interest include, but are not limited to: 1.
characteristics of acoustic, linguistic and language information in speech communication across multiple languages; 2. diversity of linguistic and non-linguistic (e.g., emotion) information among multiple spoken languages; 3. language-specific speech intelligibility enhancement and automatic speech recognition techniques; and 4. comparative cross-language assessment of speech perception in challenging environments. Organizers: Junfeng Li, Institute of Acoustics, Chinese Academy of Sciences, junfeng.li.1979 [at]gmail.com Fei Chen, The University of Hong Kong, feichen1 [at]hku.hk INTERSPEECH 2014 Computational Paralinguistics ChallengE (ComParE) The INTERSPEECH 2014 Computational Paralinguistics ChallengE (ComParE) is an open Challenge dealing with speaker characteristics as manifested in their speech signal's acoustic properties. This year, it introduces new tasks by the Cognitive Load Sub-Challenge, the Physical Load Sub-Challenge, and a Multitask Sub-Challenge: For these Challenge tasks, the COGNITIVE-LOAD WITH SPEECH AND EGG database (CLSE), the MUNICH BIOVOICE CORPUS (MBC), and the ANXIETY-DEPRESSION-EMOTION-SLEEPINESS audio corpus (ADES) with high diversity of speakers and different languages covered (Australian English and German) are provided by the organizers. All corpora provide fully realistic data in challenging acoustic conditions and feature rich annotation such as speaker meta-data. They are given with distinct definitions of test, development, and training partitions, incorporating speaker independence as needed in most real-life settings. Benchmark results of the most popular approaches are provided as in the years before. Transcription of the train and development sets will be known. All Sub-Challenges allow contributors to find their own features with their own machine learning algorithm. However, a standard feature set will be provided per corpus that may be used. 
Participants will have to stick to the definition of training, development, and test sets. They may report on results obtained on the development set, but have only five trials to upload their results on the test sets, whose labels are unknown to them. Each participation will be accompanied by a paper presenting the results that undergoes peer-review and has to be accepted for the conference in order to participate in the Challenge. The results of the Challenge will be presented in a Special Session at INTERSPEECH 2014 in Singapore. Further, contributions using the Challenge data or related to the Challenge but not competing within the Challenge are also welcome. More information is given also on the Challenge homepage: http://emotion-research.net/sigs/speech-sig/is14-compare

Organizers:
Björn Schuller, Imperial College London / Technische Universität München, schuller [at]IEEE.org
Stefan Steidl, Friedrich-Alexander-University, stefan.steidl [at]fau.de
Anton Batliner, Technische Universität München / Friedrich-Alexander-University, batliner [at]cs.fau.de
Jarek Krajewski, Bergische Universität Wuppertal, krajewsk [at]uni-wuppertal.de
Julien Epps, The University of New South Wales / National ICT Australia, j.epps [at]unsw.edu.au

Multichannel Processing for Distant Speech Recognition

Distant speech recognition in real-world environments is still a challenging problem: reverberation and dynamic background noise represent major sources of acoustic mismatch that heavily decrease ASR performance, which, on the contrary, can be very good in close-talking microphone setups.
In this context, a particularly interesting topic is the adoption of distributed microphones for the development of voice-enabled automated home environments based on distant-speech interaction: microphones are installed in different rooms and the resulting multichannel audio recordings capture multiple audio events, including voice commands or spontaneous speech, generated in various locations and characterized by a variable amount of reverberation as well as possible background noise. The focus of the proposed special session will be on multichannel processing for automatic speech recognition (ASR) in such a setting. Unlike other robust ASR tasks, where static adaptation or training with noisy data sensibly ameliorates performance, the distributed microphone scenario requires full exploitation of multichannel information to reduce the highly variable dynamic mismatch. To facilitate better evaluation of the proposed algorithms the organizers will provide a set of multichannel recordings in a domestic environment. The recordings will include spoken commands mixed with other acoustic events occurring in different rooms of a real apartment. The data is being created in the frame of the EC project DIRHA (Distant speech Interaction for Robust Home Applications) which addresses the challenges of speech interaction for home automation. The organizers will release the evaluation package (datasets and scripts) on February 17; the participants are asked to submit a regular paper reporting speech recognition results on the evaluation set and comparing their performance with the provided reference baseline. 
Further details are available at: http://dirha.fbk.eu/INTERSPEECH2014

Organizers:
Marco Matassoni, Fondazione Bruno Kessler, matasso [at]fbk.eu
Ramon Fernandez Astudillo, Instituto de Engenharia de Sistemas e Computadores, ramon.astudillo [at]inesc-id.pt
Athanasios Katsamanis, National Technical University of Athens, nkatsam [at]cs.ntua.gr

Open Domain Situated Conversational Interaction

Robust conversational systems have the potential to revolutionize our interactions with computers. Building on decades of academic and industrial research, we now talk to our computers, phones, and entertainment systems on a daily basis. However, current technology typically limits conversational interactions to a few narrow domains/topics (e.g., weather, traffic, restaurants). Users increasingly want the ability to converse with their devices over broad web-scale content. Finding something on your PC or the web should be as simple as having a conversation. A promising approach to address this problem is situated conversational interaction. The approach leverages the situation and/or context of the conversation to improve system accuracy and effectiveness. Sources of context include visual content being displayed to the user, geo-location, prior interactions, multi-modal interactions (e.g., gesture, eye gaze), and the conversation itself. For example, while a user is reading a news article on their tablet PC, they initiate a conversation to dig deeper on a particular topic. Or a user is reading a map and wants to learn more about the history of events at mile marker 121. Or a gamer wants to interact with a game's characters to find the next clue in their quest. All of these interactions are situated: rich context is available to the system as a source of priors/constraints on what the user is likely to say. This special session will provide a forum to discuss research progress in open domain situated conversational interactions.
Topics of the session will include:
- Situated context in spoken dialog systems
- Visual/dialog/personal/geo situated context
- Inferred context through interpretation and reasoning
- Open domain spoken dialog systems
- Open domain spoken/natural language understanding and generation
- Open domain semantic interpretation
- Open domain dialog management (large-scale belief state/policy)
- Conversational Interactions
- Multi-modal inputs in situated open domains (speech/text + gesture, touch, eye gaze)
- Multi-human situated interactions

Organizers:
Larry Heck, Microsoft Research, larry [at]ieee.org
Dilek Hakkani-Tür, Microsoft Research, dilek [at]ieee.org
Gokhan Tur, Microsoft Research, gokhan [at]ieee.org
Steve Young, Cambridge University, sjy [at]eng.cam.ac.uk

Phase Importance in Speech Processing Applications

In past decades, the amplitude of the speech spectrum has been considered the most important feature in different speech processing applications, and the phase of the speech signal has received less attention. Recently, several findings have justified the importance of phase in the speech and audio processing communities. The importance of phase estimation along with amplitude estimation in speech enhancement, complementary phase-based features in speech and speaker recognition, and phase-aware acoustic modeling of the environment are the most prominent reported works, scattered across different communities of speech and audio processing. These examples suggest that incorporating phase information can push the limits of the state-of-the-art phase-independent solutions long employed in different aspects of audio and speech signal processing. This Special Session aims to explore recent advances and methodologies that exploit knowledge of signal phase information in different aspects of speech processing.
Without a dedicated effort to bring together researchers from different communities, rapid progress in investigating the usefulness of phase in speech processing applications is difficult to achieve. Therefore, as a first step in this direction, we aim to promote "phase-aware speech and audio signal processing" and to form a community of researchers to organize the next steps. Our initiative is to unify these efforts to better understand the pros and cons of using phase and the feasibility of phase estimation/enhancement in different areas of speech processing, including speech enhancement, speech separation, speech quality estimation, speech and speaker recognition, voice transformation, and speech analysis and synthesis. The goal is to promote phase-based signal processing, to study its importance, and to share interesting findings from different speech processing applications.

Organizers:
Pejman Mowlaee, Graz University of Technology, pejman.mowlaee [at]tugraz.at
Rahim Saeidi, University of Eastern Finland, rahim.saeidi [at]uef.fi
Yannis Stylianou, Toshiba Labs Cambridge UK / University of Crete, yannis [at]csd.uoc.gr

Speaker Comparison for Forensic and Investigative Applications

In speaker comparison, speech/voice samples are compared by humans and/or machines for use in investigation or in court to address questions that are of interest to the legal system. Speaker comparison is a high-stakes application that can change people's lives and it demands the best that science has to offer; however, methods, processes, and practices vary widely. These variations are not necessarily for the better and, though recognized, are not generally appreciated and acted upon. Methods, processes, and practices grounded in science are critical for the proper application (and non-application) of speaker comparison to a variety of international investigative and forensic applications.
This special session will contribute to scientific progress through:
1) understanding speaker comparison for investigative and forensic application (e.g., describe what is currently being done and critically analyze performance and lessons learned);
2) improving speaker comparison for investigative and forensic applications (e.g., propose new approaches/techniques, understand the limitations, and identify challenges and opportunities);
3) improving communications between communities of researchers, legal scholars, and practitioners internationally (e.g., directly address some central legal, policy, and societal questions such as allowing speaker comparisons in court, requirements for expert witnesses, and requirements for specific automatic or human-based methods to be considered scientific);
4) using best practices (e.g., reduction of bias and presentation of evidence);
5) developing a roadmap for progress in this session and future sessions; and
6) producing a documented contribution to the field.

Some of these objectives will need multiple sessions to fully achieve and some are complicated due to differing legal systems and cultures. This special session builds on previous successful special sessions and tutorials in forensic applications of speaker comparison at INTERSPEECH beginning in 2003. Wide international participation is planned, including researchers from the ISCA SIGs for the Association Francophone de la Communication Parlée (AFCP) and the Speaker and Language Characterization (SpLC).

Organizers:
Joseph P. Campbell, PhD, MIT Lincoln Laboratory, jpc [at]ll.mit.edu
Jean-François Bonastre, l'Université d'Avignon, jean-francois.bonastre [at]univ-avignon.fr

Text-dependent for Short-duration Speaker Verification

In recent years, speaker verification engines have reached maturity and have been deployed in commercial applications.
The ergonomics of such applications are especially demanding and impose a drastic limitation on speech duration during authentication. A well-known tactic to address the lack of data caused by short duration is text-dependency. However, recent breakthroughs in the accuracy and robustness of text-independent speaker verification do not benefit text-dependent applications. Indeed, the large development data required by the recent approaches is not available in the text-dependent context. The purpose of this special session is to gather research efforts from both academia and industry toward the common goal of establishing a new baseline and exploring new directions for text-dependent speaker verification. The focus of the session is on robustness with respect to duration and modeling of lexical information. To support the development and evaluation of text-dependent speaker verification technologies, the Institute for Infocomm Research (I2R) has recently released the RSR2015 database, including 150 hours of data recorded from 300 speakers. Papers submitted to the special session are encouraged, but not required, to provide results based on the RSR2015 database in order to enable comparison of algorithms and methods. For this purpose, the organizers strongly encourage the participants to report performance on the protocol delivered with the database in terms of EER and minimum cost (in the sense of NIST 2008 Speaker Recognition evaluation). To get the database, please contact the organizers. Further details are available at: http://www1.i2r.a-star.edu.sg/~kalee/is2014/tdspk.html

Organizers:
Anthony LARCHER (alarcher [at]i2r.a-star.edu.sg), Institute for Infocomm Research
Hagai ARONOWITZ (hagaia [at]il.ibm.com), IBM Research - Haifa
Kong Aik LEE (kalee [at]i2r.a-star.edu.sg), Institute for Infocomm Research
Patrick KENNY (patrick.kenny [at]crim.ca), CRIM - Montréal

Tutorial Dialogues and Spoken Dialogue Systems

The growing interest in educational applications that use spoken interaction and dialogue technology has boosted research and development of interactive tutorial systems. Over recent years, advances have been achieved in both the spoken dialogue community and the education research community, with sophisticated speech and multi-modal technology that allows functionally suitable and reasonably robust applications to be built. The special session combines spoken dialogue research, interaction modeling, and educational applications, and brings together the two INTERSPEECH SIG communities: SLaTE and SIGdial. The session focuses on methods, problems and challenges that are shared by both communities, such as sophistication of speech processing and dialogue management for educational interaction, integration of the models with theories of emotion, rapport, and mutual understanding, as well as application of the techniques to novel learning environments, robot interaction, etc. The session aims to survey issues related to the processing of spoken language in various learning situations, modeling of the teacher-student interaction in MOOC-like environments, as well as evaluating tutorial dialogue systems from the point of view of natural interaction, technological robustness, and learning outcome. The session encourages interdisciplinary research and submissions related to the special focus of the conference, "Celebrating the Diversity of Spoken Languages". For further information, see http://junionsjlee.wix.com/INTERSPEECH

Organizers:
Maxine Eskenazi, max+ [at]cs.cmu.edu
Kristiina Jokinen, kristiina.jokinen [at]helsinki.fi
Diane Litman, litman [at]cs.pitt.edu
Martin Russell, M.J.RUSSELL [at]bham.ac.uk

---

Visual Speech Decoding

Speech perception is a bi-modal process that takes into account both the acoustic (what we hear) and visual (what we see) speech information.
It has been widely acknowledged that visual cues play a critical role in automatic speech recognition (ASR), especially when the audio is corrupted by, for example, background noise or voices from untargeted speakers, or is even inaccessible. Decoding visual speech is extremely important if ASR technologies are to be widely deployed and to realize truly natural human-computer interaction. Despite the advances in acoustic ASR, visual speech decoding remains a challenging problem. The special session aims to attract more effort to tackle this important problem. In particular, we would like to encourage researchers to focus on some critical questions in the area. We propose the following four questions as a starting point:
1. How to deal with the speaker dependency in visual speech data?
2. How to cope with the head-pose variation?
3. How to encode temporal information in visual features?
4. How to automatically adapt the fusion rule when the quality of the two individual (audio and visual) modalities varies?

Researchers and participants are encouraged to raise more questions related to visual speech decoding. We expect the session to draw a wide range of attention from both the speech recognition and machine vision communities to the problem of visual speech decoding.

Organizers:
Ziheng Zhou, University of Oulu, ziheng.zhou [at]ee.oulu.fi
Matti Pietikäinen, University of Oulu, matti.pietikainen [at]ee.oulu.fi
Guoying Zhao, University of Oulu, gyzhao [at]ee.oulu.fi

From hamon at LIMSI.FR Tue Feb 11 21:12:49 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:12:49 +0100
Subject: Job: Deep neural networks for statistical machine translation, several positions available (Postdoc, PhD, engineers, etc), France
Message-ID:

Date: Mon, 10 Feb 2014 22:06:10 +0100
From: Holger Schwenk
Message-ID: <52F93F42.7090004 at lium.univ-lemans.fr>

In recent years, there have been several breakthroughs in the use of neural networks for natural language processing, in particular using deep architectures. The computer science laboratory of the University of Le Mans (LIUM) has been working for many years on statistical machine translation (SMT), and we were among the first researchers to successfully use neural networks, for instance continuous space language and translation models. We want to substantially increase our research efforts in this area, hoping to achieve significant advances in SMT. Our goal is to build state-of-the-art large scale SMT systems using deep neural networks.

In this major research effort, we have openings at different levels:
- postdoc positions
- PhD positions
- short term visits with a well focused research project
- engineers

The candidates are expected to have demonstrated knowledge in at least one of the following fields:
- neural networks (feed-forward, recurrent NN, deep learning, etc).
- statistical machine translation
- efficient implementation of machine learning algorithms (GPU, MPI, etc)

The positions are immediately available. Applications are accepted until the positions are filled. Initial appointment is for one year, renewable for up to three years. Competitive salaries are available, including health care and other social benefits, travel support, etc. The working language is English or French.
LIUM is participating in several international projects, financed by the European Commission, DARPA and the French government. We collaborate with leading research groups in the USA and Europe. A large computer cluster is available to support the research (700 CPU cores with a total of 6 TBytes of memory and more than 250 TBytes of RAID disk space). We also own a cluster with 30 Tesla K20 and K40 GPU cards, connected by a fast Infiniband network.

Le Mans is located between Paris and the Atlantic Ocean. Both can be reached in about 1 hour by high speed train. The Loire valley with many wineries and other attractions is just a short drive away ...

Applications should include a CV, a list of publications and the names of two references. We will invite interesting candidates for further discussions. For more information, please contact Holger Schwenk by email: Holger.Schwenk at lium.univ-lemans.fr

From hamon at LIMSI.FR Tue Feb 11 20:51:33 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 21:51:33 +0100
Subject: Internship: M2 internship, automatic analysis of medical consultation reports, IRIT
Message-ID:

Date: Mon, 10 Feb 2014 13:19:07 +0100
From: philippe muller
Message-ID: <52F8C3BB.8060604 at irit.fr>

M2(R) internship proposal

Title: Automatic analysis of medical consultation reports

Keywords: natural language processing, information extraction, machine learning, medicine

Summary: This research M2 internship targets the automatic analysis of medical consultation reports in order to extract information needed to carry out an epidemiological study. The reports consist of free text summarizing the sociodemographic data, the diagnosis, the symptoms, and the results of any tests performed on these patients. There are two main objectives: first, to classify the reports according to whether or not a pathology is present; second, to extract a number of precise pieces of information related to these diseases. The intern will evaluate and compare classical information extraction strategies from natural language processing in specialized domains, and the application of machine learning methods, for both objectives. The data used are a corpus of de-identified consultation reports, in French.

Supervision: The internship will take place at IRIT (Institut de Recherche en Informatique de Toulouse), in cooperation with INSERM, and will be co-supervised by Philippe Muller (IRIT, MELODI team, "MEthodes et ingénierie des Langues, des Ontologies et du DIscours") and Virginie Gardette ("Vieillissement et maladie d'Alzheimer" team of UMR INSERM 1027 "Epidémiologie et analyses en santé publique").
http://www.irit.fr/-Equipe-MELODI-
http://www.u1027.inserm.fr/42537678/0/fiche___pagelibre/&RH=1303915788348

Required skills: The intern should be enrolled in an M2 in computer science, ideally with skills in machine learning and/or natural language processing.

Duration: 4-6 months, starting in March or April.

Stipend: 436.05 €/month, in accordance with regulations.

Application: Send a CV (with recent transcripts) and a cover letter to philippe.muller at irit.fr and virginie.gardette at univ-tlse3.fr

From hamon at LIMSI.FR Tue Feb 11 21:03:16 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:03:16 +0100
Subject: Call: ESWC 2014, Challenge Concept-Level Sentiment Analysis
Message-ID:

Date: Mon, 10 Feb 2014 15:55:04 +0100 (CET)
From: speroni at cs.unibo.it
Message-Id: <20140210145524.577BFDBA76 at vina.cines.fr>
X-url: http://challenges.2014.eswc-conferences.org/SemSA

** apologies for cross-posting **

==== Second Call for Challenge: Concept-Level Sentiment Analysis ====

Challenge Website: http://challenges.2014.eswc-conferences.org/SemSA
Call Web page: http://2014.eswc-conferences.org/important-dates/call-SemSA

11th Extended Semantic Web Conference (ESWC) 2014
Dates: May 25 - 29, 2014
Venue: Anissaras, Crete, Greece
Hashtag: #eswc2014
Feed: @eswc_conf
Site: http://2014.eswc-conferences.org
General Chair: Valentina Presutti (STLab, ISTC-CNR, IT)
Challenge Coordinator: Milan Stankovic (Sepage & Universite Paris-Sorbonne, FR)
Challenge Chairs:
- Erik Cambria (National University of Singapore, SG)
- Diego Reforgiato (STLab, ISTC-CNR, IT)

MOTIVATION AND OBJECTIVES

Mining opinions and sentiments from natural language is an extremely difficult task as it involves a deep understanding of most of the explicit and implicit, regular and irregular, syntactical and semantic rules proper to a language. Existing approaches mainly rely on parts of text in which opinions and sentiments are explicitly expressed such as polarity terms, affect words and their co-occurrence frequencies. However, opinions and sentiments are often conveyed implicitly through latent semantics, which makes purely syntactical approaches ineffective.
To this end, concept-level sentiment analysis aims to go beyond a mere word-level analysis of text and provide novel approaches to opinion mining and sentiment analysis that allow a more efficient passage from (unstructured) textual information to (structured) machine-processable data, in potentially any domain. Concept-level sentiment analysis focuses on a semantic analysis of text through the use of web ontologies or semantic networks, which allow the aggregation of conceptual and affective information associated with natural language opinions. By relying on large semantic knowledge bases, concept-level sentiment analysis steps away from blind use of keywords and word co-occurrence count, but rather relies on the implicit features associated with natural language concepts. This Challenge focuses on the introduction, presentation, and discussion of novel approaches to concept-level sentiment analysis. Participants will have to design a concept-level opinion-mining engine that exploits common-sense knowledge bases, e.g., SenticNet, and/or Linked Data and Semantic Web ontologies, e.g., DBPedia, to perform multi-domain sentiment analysis. The main motivation for the Challenge, in particular, is to go beyond a mere word-level analysis of natural language text and provide novel concept-level tools and techniques that allow a more efficient passage from (unstructured) natural language to (structured) machine-processable data, in potentially any domain. Systems must have a semantics flavor (e.g., by making use of Linked Data or known semantic networks within their core functionalities) and authors need to show how the introduction of semantics can be used to obtain valuable information, functionality or performance. Existing natural language processing methods or statistical approaches can be used too as long as the semantics plays a main role within the core approach (engines based merely on syntax/word-count will be excluded from the competition). 
TARGET AUDIENCE The Challenge is open to everyone from industry and academia. TASKS The Concept-Level Sentiment Analysis Challenge is defined in terms of different tasks. The first task is elementary whereas the others are more advanced. The input units of each task are sentences. Sentences are assumed to be in grammatically correct American English and have to be processed according to the input format specified at http://sentic.net/challenge/sentence. * Elementary Task: Polarity Detection The main goal of the task is polarity detection. The proposed systems will be assessed according to precision, recall and F-measure of detected binary polarity values (1=positive; 0=negative) for each input sentence of the evaluation dataset, following the same format as in http://sentic.net/challenge/task0. The problem of subjectivity detection is not addressed within this Challenge, hence participants can assume that there will be no neutral sentences. Participants are encouraged to use the Sentic API or further develop and apply sentic computing tools. * Advanced Task #1: Aspect-Based Sentiment Analysis The output of this task will be a set of aspects of the reviewed product and a binary polarity value associated to each of such aspects, in the format specified at http://sentic.net/challenge/task1. So, for example, while for the Elementary task an overall polarity (positive or negative) is expected for a review about a mobile phone, this task requires a set of aspects (such as 'speaker', 'touchscreen', 'camera', etc.) and a polarity value (positive OR negative) associated with each of such aspects. Systems will be assessed according to both aspect extraction and aspect polarity detection. * Advanced Task #2: Semantic Parsing As suggested by the title, the Challenge focuses on sentiment analysis at concept-level. This means that the proposed systems are not supposed to work at word/syntax level but rather work with concepts/semantics. 
Hence, this task will evaluate the capability of the proposed systems to deconstruct natural language text into concepts, following the same format as in http://sentic.net/challenge/task2. SenticNet will be taken as a reference to test the efficiency of the proposed parsers, but extracted concepts won't necessarily have to match SenticNet concepts. The proposed systems, for example, are supposed to be able to extract a multi-word expression like 'buy christmas present' from sentences such as 'Today I bought a lot of very nice Christmas presents'. The number of extracted concepts per sentence will be assessed through precision, recall and F-measure against the evaluation dataset.

* Advanced Task #3: Topic Spotting
Input sentences will be about four different domains, namely: books, DVDs, electronics, and kitchen appliances. This task focuses on the automatic classification of sentences into one of such domains, in the format specified at http://sentic.net/challenge/task3. All sentences are assumed to belong to only one of the above-mentioned domains. The proposed systems are supposed to exploit the extracted concepts to infer which domain each sentence belongs to. Classification accuracy will be evaluated in terms of precision, recall and F-measure against the evaluation dataset.

EVALUATION DATASET

Systems will be evaluated against a testing dataset which will be revealed and released after the first round of evaluation during the Conference. The dataset will be made public on the challenge website. Participants are advised to train and/or test their own systems using the Blitzer Dataset. The testing dataset will be constructed in the same way and from the same sources as the Blitzer dataset.

EVALUATION

The evaluation will be performed by the members of the Program Committee. For systems that can be tuned with different parameters, please indicate a range of up to 4 sets of settings. Settings with the best F-measures will be considered for judgment.
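
As an illustration of the measures named above, here is a minimal sketch of how precision, recall and F-measure are commonly computed for binary polarity labels (1=positive; 0=negative), and of how the best of up to 4 settings could be picked by F-measure. The function names precision_recall_f1 and best_setting are hypothetical and are not part of any Challenge tooling; the official evaluation may differ, for example in which class is treated as positive or in how scores are averaged.

```python
from typing import Dict, List, Tuple


def precision_recall_f1(
    gold: List[int], pred: List[int], positive: int = 1
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for one class of a binary labelling.

    Assumption: the class given by `positive` is the one being scored.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    # Count true positives, false positives and false negatives.
    tp = sum(1 for g, p in zip(gold, pred) if g == positive and p == positive)
    fp = sum(1 for g, p in zip(gold, pred) if g != positive and p == positive)
    fn = sum(1 for g, p in zip(gold, pred) if g == positive and p != positive)

    # Guard against division by zero when a class is never predicted or never occurs.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return precision, recall, f1


def best_setting(gold: List[int], runs: Dict[str, List[int]]) -> str:
    """Return the name of the run with the highest F1.

    Assumption: `runs` maps a hypothetical setting name to its predictions,
    with at most 4 settings as stated in the call.
    """
    if not runs or len(runs) > 4:
        raise ValueError("expected between 1 and 4 sets of settings")
    return max(runs, key=lambda name: precision_recall_f1(gold, runs[name])[2])


if __name__ == "__main__":
    gold_labels = [1, 0, 1, 1, 0]
    system_output = [1, 1, 1, 0, 0]
    print(precision_recall_f1(gold_labels, system_output))
```

Scoring the positive class only is a design choice made for brevity; a macro-average over both polarity classes would be an equally plausible reading of the call.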
For each system, reviewers will give a numerical score within the range [1-10] and details motivating their choice. The scores will be given to the following aspects:
1. Use of common-sense knowledge and semantics;
2. Precision, recall, and F-measure with respect to the selected task;
3. Computational time;
4. Innovative nature of the approach.

JUDGING AND PRIZES

After a first round of review, the Program Committee and the chairs will select a number of submissions conforming to the challenge requirements that will be invited to present their work. Submissions accepted for presentation will be included in post-proceedings and will receive constructive reviews from the Program Committee. All accepted submissions will have a slot in a poster session dedicated to the challenge. In addition, the winners will present their work in a special slot of the main program of ESWC and will be invited to submit a paper to a dedicated Semantic Web Journal special issue. For the Concept-Level Sentiment Analysis Challenge there will be two awards for each task:
* Quantitative: the system with the highest average score in items 1-3 above;
* Innovative: the system with the highest score in item 4 above.

There will be a board of judges at the conference who will evaluate the systems again in more detail. The judges will then meet in private to discuss the entries and to determine the winners. The same system may compete for both awards. Winners will be selected only for tasks with at least 3 participants. In any case all submissions will be reviewed and, if accepted, published in ESWC post-proceedings. An amount of 700 euros has already been secured for the first task, as far as the first of the evaluation aspects is concerned. We are currently working on securing further funding.

HOW TO PARTICIPATE

The following information has to be provided:
* Abstract: no more than 200 words.
* Description: It should contain the details of the system, including why the system is innovative, how it uses the Semantic Web, which features or functions the system provides, what design choices were made and what lessons were learned. The description should also summarize how participants have addressed the evaluation tasks. Papers must be submitted in PDF format, following the style of Springer's Lecture Notes in Computer Science (LNCS) series (http://www.springer.com/computer/lncs/lncs+authors), and must not exceed 5 pages in length.
* Web Access: The application can either be accessible via the web or downloadable. If the application is not publicly accessible, a password must be provided. A short set of instructions on how to use the application should be provided as well.

All submissions should be provided via EasyChair:
https://www.easychair.org/conferences/?conf=eswc2014-challenges

MAILING LIST

We invite potential participants to subscribe to our mailing list in order to be kept up to date with the latest news related to the challenge.
https://lists.sti2.org/mailman/listinfo/eswc2014-semsa-challenge

IMPORTANT DATES

* March 14, 2014, 23:59 CET: Submission due
* April 9, 2014, 23:59 CET: Notification of acceptance
* May 27-29, 2014: The Challenge takes place at ESWC-14

PROGRAM COMMITTEE

* Newton Howard, MIT Media Laboratory (USA)
* Cheng Xiang Zhai, University of Illinois at Urbana-Champaign (USA)
* Rada Mihalcea, University of North Texas (USA)
* Ping Chen, University of Houston-Downtown (USA)
* Yongzheng Zhang, LinkedIn Inc. (USA)
* Giuseppe Di Fabbrizio, Amazon Inc. (USA)
* Rui Xia, Nanjing University of Science and Technology (China)
* Rafal Rzepka, Hokkaido University (Japan)
* Amir Hussain, University of Stirling (UK)
* Alexander Gelbukh, National Polytechnic Institute (Mexico)
* Bjoern Schuller, Technical University of Munich (Germany)
* Amitava Das, Samsung Research India (India)
* Dipankar Das, National Institute of Technology (India)
* Carlo Strapparava, Fondazione Bruno Kessler (Italy)
* Stefano Squartini, Marche Polytechnic University (Italy)
* Cristina Bosco, University of Torino (Italy)
* Paolo Rosso, Technical University of Valencia (Spain)

From hamon at LIMSI.FR Tue Feb 11 21:23:29 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:23:29 +0100
Subject: Job: Professeur en informatique, UFR Mathematiques et Informatique, Universite de Lorraine
Message-ID:

Date: Tue, 11 Feb 2014 18:07:36 +0100
From: smaili
Message-ID: <52FA58D8.1040706 at loria.fr>

The UFR Mathématiques et Informatique and LORIA at the Université de Lorraine (UL) are offering a full professor position in computer science; the teaching and research profiles are given below. In addition, on March 19 we are organizing a presentation day at the UFR MI, where candidates who wish to do so can present their applications to the teaching teams of the UFR Mathématiques et Informatique.

Teaching profile:
The professor is expected to join the teaching team of the bachelor's (licence) and master's programs in cognitive science. Cognitive science is concerned with understanding artificial, animal, or human thought. The skills sought therefore range from computer science to language sciences, by way of biology. The UFR will recruit a professor with as broad a set of skills related to cognitive science as possible, who could teach courses on: cognitive aspects of computational linguistics, neuroscience, knowledge representation, artificial intelligence, collective intelligence, etc.
The teacher must have solid skills and experience in this field in order to be operational from the start of the 2014-2015 academic year, when our programs will have significant needs. The candidate will join the teaching team and contribute to the creation of innovative projects. On the administrative side, the candidate must be able to take on heavy responsibilities very quickly: setting up new degrees, writing accreditation (habilitation) applications, heading degree programs or the UFR. The recruited professor will have to be a driving force in the future projects we are currently developing.

Department/UFR: Mathématiques et Informatique
UFR contact: K. Smaïli
Teaching keywords: cognitive science, natural language processing, neuroscience, artificial intelligence

Research profile:
The recruited professor will conduct research at the Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA) and will have to join Department 4 (Traitement des langues et des connaissances, i.e. natural language and knowledge processing). All teams in the department are concerned. This department works on the three closely related notions of document, language, and knowledge. Broadly speaking, one can speak of computational semantics. Indeed, in document analysis, the characters and symbols in a document can be regarded as low-level cues and combined to build much more complex entities. Speech recognition and sentence analysis study the semantics of language. And, at a higher level, representations of discourse or knowledge require the study of logical formalisms, their expressive power, the mapping from one formalism to another, reasoning, and proof... The teams concerned are primarily Cello, Orpailleurs, Parole (Multispeech and SMarT), Qgar, Read, Semagramme, and Synalp.

Laboratory 1 name: LORIA
Laboratory 1 unit number: UMR 7503
Research keywords: computational semantics, natural language processing, computational logic, document analysis, pattern recognition, stochastic modeling, text/data processing, Bayesian networks, knowledge discovery, knowledge representation, knowledge systems, systems biology, molecular docking ("accueil moléculaire"), linear logic, categorial grammars, syntax, speech recognition, articulatory modeling, handwriting recognition, incremental classifiers

From hamon at LIMSI.FR Tue Feb 11 21:17:55 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:17:55 +0100
Subject: Soft: Talismane, analyseur syntaxique robuste en dependances
Message-ID:

Date: Tue, 11 Feb 2014 16:27:03 +0100
From: Assaf Urieli
Message-ID:
X-url: http://redac.univ-tlse2.fr/applications/talismane.html

Talismane: a robust dependency parser
*************************************

Talismane is a free, complete dependency parsing tool, ready to use for French.

Talismane is available for download at:
http://redac.univ-tlse2.fr/applications/talismane.html

Talismane was developed by Assaf URIELI as part of his PhD thesis in the NLP (TAL) group of CLLE-ERSS (UMR 5263), Université
de Toulouse II le Mirail.
- Thesis: http://w3.erss.univ-tlse2.fr/textes/pagespersos/urieli/URIELI-thesis-2013.pdf
- CLLE-ERSS laboratory: http://w3.erss.univ-tlse2.fr/

It is free (open source) software written in Java, distributed under the Affero GPL v3 license:
http://www.gnu.org/licenses/agpl-3.0.html

For French, Talismane performs:
- sentence segmentation;
- word segmentation (tokenization) + lemmatization;
- part-of-speech tagging;
- dependency parsing.

Talismane was trained on the French Treebank (Abeillé et al, 2003) for sentence and word segmentation as well as for POS tagging, and on the French Treebank converted to dependencies (Candito et al, 2010) for parsing. It uses LeFFF as its default lexicon (Sagot 2010). It is fully configurable (training, features, rules, tagset, lexicon...) and can be retrained for other languages.

In addition, Talismane lets you:
- analyze text quickly (2 million words per hour in the basic configuration);
- parse XML or HTML easily, by adding filters that indicate which parts to analyze;
- add rules to force or prevent the system from choosing a given segmentation, tag, or dependency link, depending on context;
- choose between a higher-quality analysis (wide beam) and a faster one (narrow beam);
- keep track of the exact position of each analyzed token in the original file;
- report the system's confidence in each decision made.

A user manual is available on the site.

Best regards,
Assaf URIELI
Laboratoire CLLE-ERSS, UMR 5263
Université
de Toulouse II le Mirail

From hamon at LIMSI.FR Tue Feb 11 21:28:45 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:28:45 +0100
Subject: Appel: 7th BUCC Workshop at LREC: Deadline extension until Feb. 23 and Journal Special Issue
Message-ID:

Date: Tue, 11 Feb 2014 21:29:13 +0100
From: "Reinhard Rapp"
Message-ID: <5441EDC688574124BDEA8066E6719EDA at ASUSPC>
X-url: http://comparable.limsi.fr/bucc2014/

We apologize for multiple postings
Please distribute to interested colleagues

============================================================
DEADLINE EXTENSION AND JOURNAL SPECIAL ISSUE
============================================================

7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA

Building Resources for Machine Translation Research

http://comparable.limsi.fr/bucc2014/

May 27, 2014

Co-located with LREC 2014
Harpa Conference Centre, Reykjavik (Iceland)

EXTENDED DEADLINE FOR PAPERS: February 23, 2014
https://www.softconf.com/lrec2014/BUCC2014/

*** INVITED SPEAKER ***
Chris Callison-Burch (University of Pennsylvania)

============================================================

MOTIVATION

In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons.
In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves because they make inter-linguistic discoveries and comparisons possible. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora.

The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages.

Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies.
However, beyond a few language pairs such as English-French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web.

To exemplify such issues in a practical setting, this year's special focus will be on

Building Resources for Machine Translation Research

This special topic aims to address the need for:
(1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and
(2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora.

TOPICS

We solicit contributions including but not limited to the following topics:

Topics related to the special theme:

* Methods and tools for collecting and processing MT data, including crowdsourcing
* Methods and tools for quality control
* Tools for efficient annotation
* Bilingual term and named entity collections
* Multilingual treebanks, wordnets, propbanks, etc.
* Comparable corpora with parallel units annotated
* Comparable corpora for under-resourced languages and specific domains
* Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc.
* Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc.
* Legal issues with collecting and redistributing data and generating derivatives

Building comparable corpora:

* Human translations
* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the Web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages, across language families
* Multi-media/multi-modal comparable corpora

Applications of comparable corpora:

* Human translations
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual projections
* Machine translation
* Writing assistance

Mining from comparable corpora:

* Extraction of parallel segments or paraphrases from comparable corpora
* Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc.

IMPORTANT DATES

February 23, 2014   Deadline for submission of papers (extended)
March 10, 2014      Notification of acceptance
March 27, 2014      Camera-ready papers due
May 27, 2014        Workshop date

SUBMISSION INFORMATION

Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF file via the START workshop manager at

https://www.softconf.com/lrec2014/BUCC2014/

Contributions can be short or long papers. Short paper submissions must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages.

Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings.
Double submission policy: Parallel submission to other meetings or publications is possible, but the workshop organizers must be notified immediately.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc.

JOURNAL SPECIAL ISSUE

Authors of selected papers will be encouraged to submit substantially extended versions of their manuscripts to an upcoming special issue on "Machine Translation Using Comparable Corpora" of the Journal of Natural Language Engineering.

ORGANISERS

Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France)
Ahmet Aker, University of Sheffield (UK)
Serge Sharoff, University of Leeds (UK)
Stephan Vogel, QCRI (Qatar)
Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France)

CONTACT

Pierre Zweigenbaum: pz (at) limsi (dot) fr

SCIENTIFIC COMMITTEE

* Ahmet Aker, University of Sheffield (UK)
* Srinivas Bangalore (AT&T Labs, US)
* Caroline Barrière (CRIM, Montréal, Canada)
* Chris Biemann (TU Darmstadt, Germany)
* Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
* Kurt Eberle (Lingenio, Heidelberg, Germany)
* Andreas Eisele (European Commission, Luxembourg)
* Éric Gaussier (Université Joseph Fourier, Grenoble, France)
* Gregory Grefenstette (INRIA, Saclay, France)
* Silvia Hansen-Schirra (University of Mainz, Germany)
* Hitoshi Isahara (Toyohashi University of Technology)
* Kyo Kageura (University of Tokyo, Japan)
* Adam Kilgarriff (Lexical Computing Ltd, UK)
* Natalie Kübler (Université Paris Diderot, France)
* Philippe Langlais (Université
de Montréal, Canada)
* Michael Mohler (Language Computer Corp., US)
* Emmanuel Morin (Université de Nantes, France)
* Dragos Stefan Munteanu (Language Weaver, Inc., US)
* Lene Offersgaard (University of Copenhagen, Denmark)
* Ted Pedersen (University of Minnesota, Duluth, US)
* Reinhard Rapp (Université Aix-Marseille, France)
* Sujith Ravi (Google, Mountain View, US)
* Serge Sharoff (University of Leeds, UK)
* Michel Simard (National Research Council Canada)
* Richard Sproat (OGI School of Science & Technology, US)
* Tim Van de Cruys (IRIT-CNRS, Toulouse, France)
* Stephan Vogel (QCRI, Qatar)
* Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France)
* Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)

From hamon at LIMSI.FR Tue Feb 11 21:10:05 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:10:05 +0100
Subject: Appel: RJCIA 2014 (15 jours avant la date-butoir)
Message-ID:

Date: Mon, 10 Feb 2014 17:19:02 +0100
From: Grégory Bonnet
Message-ID: <52F8FBF6.8060809 at unicaen.fr>
X-url: http://rjcia2014.greyc.fr

-- Please circulate this call as widely as possible, in particular among PhD student associations and the mailing lists of your doctoral schools and laboratories. Apologies for multiple postings. --

**********************************************************************
Call for papers
RJCIA 2014: 12èmes Rencontres des Jeunes Chercheurs en Intelligence Artificielle
(12th Meeting of Young Researchers in Artificial Intelligence)
June 30 - July 1, 2014, Rouen, France
http://rjcia2014.greyc.fr
**********************************************************************

Overview

The twelfth meeting of young researchers in Artificial Intelligence (RJCIA 2014) will take place from June 30 to July 1, 2014 in Rouen. The meeting is hosted by RFIA'14 (Reconnaissance des Formes et l'Intelligence Artificielle), organized by the LITIS laboratory and its parent institutions, INSA de Rouen, Université de Rouen and Université du Havre, on behalf of the Association Française d'Intelligence Artificielle (AFIA) and the Association Française de Reconnaissance et d'Interprétation des Formes (AFRIF).

RJCIA 2014 is intended for young researchers in Artificial Intelligence: PhD students or those who obtained their doctorate less than one year ago. The event has two goals:

- to allow young researchers preparing a thesis in Artificial Intelligence, or who have recently defended one, to meet and present their work, and thereby to build contacts with other young researchers and broaden their perspectives by exchanging with specialists from other areas of Artificial Intelligence;
- to train young researchers in preparing a paper, revising it to take the program committee's comments into account, and presenting it to an audience of specialists, so that they obtain feedback from researchers in their own or related fields.

For any further information, contact Grégory Bonnet (gregory.bonnet[at]unicaen.fr), program committee chair.

Call for papers

RJCIA 2014 aims to complement the overview of French-language research in Artificial Intelligence offered by RFIA'14 by presenting work by young researchers in this field.
Any contribution related to Artificial Intelligence is therefore welcome. The indicative list of topics below is not exhaustive.

- Machine learning (supervised, unsupervised, reinforcement)
- Large-scale data mining, mining of large graphs, text mining
- Knowledge extraction and use by and for (massive) data mining
- Constraint satisfaction
- Knowledge representation, reasoning models, reasoning, planning
- Autonomous agents and multi-agent systems (individual and collective decision-making, interactions)
- Affective computing, socio-emotional virtual agents, human-agent interaction
- Autonomous robotics, intelligent control, decision support in robotic systems
- Knowledge management: modeling, organization, optimization, and processing
- Natural language processing
- Linked data, open data, data publication
- Big data, Web of data, Semantic Web, ontologies
- Information extraction, semantic content indexing
- Data and knowledge visualization
- Validation and evaluation of Artificial Intelligence systems
- Applications of Artificial Intelligence (medicine, transport, agronomy, law)

Submission guidelines

Authors are invited to submit papers of at most 6 pages in double-column format, following the style provided by RFIA'14, in one of the following two categories:

Survey papers: RJCIA 2014 is an opportunity for first-year PhD students to present their thesis topic through a survey paper. Such a paper can present the state of the art, identify research problems, and/or propose initial directions of work. Authors are encouraged to describe the context of their work clearly and to highlight the problems encountered.

Research papers: RJCIA 2014 also invites more experienced young researchers to present original research that is theoretical, experimental, or application-oriented. Papers must be written rigorously, in both form and scientific content. Young researchers who wish to do so are also invited to submit their paper to the AI session of RFIA'14.

Authors are invited to indicate at submission time the category to which their paper belongs. The first author of the paper must be a PhD student or have held a doctorate for less than one year (thesis in progress, or defended in 2013). No exclusivity is required: we would like papers submitted to international conferences or workshops to be eligible for RJCIA 2014 as well. The language of the conference is preferably French, otherwise English.

Each submission will be reviewed by at least three members of the program committee. Review criteria include the originality of the ideas, technical quality, the significance of the results, and the quality of the presentation.

Important dates

Submission site opens: January 6, 2014
Submission deadline: February 24, 2014
Notification to authors: April 25, 2014
Final versions due: May 16, 2014
RJCIA 2014: Monday, June 30 and Tuesday, July 1, 2014

Paper submission

The submission format is available on the conference website: http://rjcia2014.greyc.fr
Submissions are made via EasyChair: https://www.easychair.org/conferences/?conf=rjcia2014

Program committee

Nicolas Béchet, IRISA, Université de Bretagne-Sud
Lamia Belouaer, IRENav, Ecole Navale
Grégory Bonnet, GREYC, Université de Caen Basse-Normandie
Elise Bonzon, LIPADE, Université Paris Descartes
Gauvain Bourgne, LIP6, Université Pierre et Marie Curie
Sylvain Bouveret, LIG, INP Grenoble
Amélie Cordier, LIRIS, Université Lyon 1
Alice Hermann, LORIA, Université de Lorraine
Abir Béatrice Karami, LIRIS, Université
Lyon 1
Jean-Marie Lagniez, CRIL, Université d'Artois
Robin Lamarche-Perrin, Max Planck Institute for Mathematics in the Sciences
Marie Lefevre, LIRIS, Université Lyon 1
Cyrille Martin, DAI-Labor (CC NGS), Technischen Universität Berlin
Benoit Mathern, LIRIS, Université Lyon 1
Laëtitia Matignon, LIRIS, Université Lyon 1
Alexandre Niveau, GREYC, Université de Caen Basse-Normandie
Antoine Nongaillard, DISP, Université Lyon 2
Alexandre Pauchet, LITIS, INSA Rouen
Cédric Pralet, Onera
Mathieu Roche, TETIS, Cirad
Julien Rossit, LIPADE, Université Paris Descartes
Stéphanie Roussel, CRIL
Karima Sedki, Lim&Bio, Université Paris 13

Practical information

See the RFIA'14 website (http://rfia2014.univ-rouen.fr/) for practical information on registering and getting to LITIS.

On behalf of the RJCIA 2014 program committee,
Grégory Bonnet

From hamon at LIMSI.FR Tue Feb 11 21:21:26 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 11 Feb 2014 22:21:26 +0100
Subject: Stage: Master 2e annee, Creation de la maquette numerique d'une serie de documents traitant d'archeologie
Message-ID:

Date: Tue, 11 Feb 2014 16:37:08 +0100
From: Thierry Poibeau
Message-Id: <46F083B2-0D77-4D14-8508-E665CCD78F4A at ens.fr>

Internship offer, second-year Master's level: creating a digital prototype of a series of documents on archaeology

Description:
The internship falls within the field of digital humanities, as part of the EITAB project (PEPS PSL-CNRS), a collaboration between the Lattice laboratory (UMR8094, linguistic analysis and natural language processing, http://www.lattice.cnrs.fr/) and the AOROC laboratory (UMR 8546, archaeology, http://www.archeo.ens.fr/). Both laboratories are part of the Ecole normale supérieure and collaborate within the TransferS laboratory of excellence (http://www.transfers.ens.fr/).

The internship aims to create a prototype ebook from a text document on archaeology. The goal is to provide relevant and user-friendly access to a specialized text, based on data extracted from the text (terms organized into indexes, etc.). Particular attention will therefore be paid to tools for structuring the data (database, links between terms, etc.) and for formatting it (xml, html, css). Close collaboration with the contacts on the archaeology side is essential. LATTICE has a term extractor, and standard tools for data structuring and ontology creation may be used.

The internship focuses mainly on the interaction between text and index. Depending on the skills of the selected candidate, the work may also cover the interaction between text and geographical maps (to visualize the main archaeological sites, create links between objects of study and geographical areas, and answer questions such as: "at which sites in the Touraine region have flanged tiles been found?"). If time allows, adapting the final rendering to the device (computer, tablet, etc.) may also be addressed. The internship is above all practical in purpose: the aim is to build a working prototype showing a concrete example of what can be achieved. Archaeology lends itself particularly well to multimedia enrichment of digital text.

Internship tasks
Create a digital prototype of a series of documents on archaeology.
- ensure data consistency (correct and enrich the existing indexes)
- structure the text document
- create indexes and dynamic queries.

Required skills
- knowledge of electronic publishing tools
- knowledge of document structuring languages (xml, html, css, etc.)
- interest in building application solutions and taking user needs into account
- knowledge of databases
- good writing skills in French and English
- an interest in archaeology would be a plus

Further information on EITAB: http://www.archeo.ens.fr/spip.php?article586

Conditions
The internship will take place at the Lattice laboratory (in Montrouge, http://www.lattice.cnrs.fr/) for 6 months, starting in April 2014, in close collaboration with the AOROC laboratory (at ENS, 45 rue d'Ulm, Paris). The internship is paid according to the rules in force, thanks to a site PEPS project co-funded by CNRS and PSL.

Location
LATTICE laboratory in Montrouge.

Supervisors
Frédérique Mélanie Becquet and Thierry Poibeau for LATTICE, and Katherine Gruel for AOROC.

How to apply
Send a CV and a cover letter to Thierry Poibeau and Frédérique Mélanie (first name followed by last name, separated by a dot, at ens.fr) as soon as possible and, in any case, before February 20, 2014.

From hamon at LIMSI.FR Fri Feb 14 21:03:27 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:03:27 +0100
Subject: Appel: Arabic natural dialect processing, Workshop ICCTIM2014
Message-ID:

Date: Wed, 12 Feb 2014 17:19:40 +0100
From: smaili
Message-ID: <52FB9F1C.4020401 at loria.fr>
X-url: http://sdiwc.net/conferences/2014/icctim2014/workshops/

We apologize for multiple postings, please distribute to interested colleagues

DEADLINE *March 10, 2014*

Workshop at ICCTIM2014, Dubai, April 10, 2014
http://sdiwc.net/conferences/2014/icctim2014/workshops/

Title: Arabic natural dialect processing

Summary

Modern Standard Arabic (MSA) is the language of more than 250 million people. It is used mainly in writing and in formal speech. However, most Arab people do not use MSA in their daily conversations; as a result, different Arabic dialects are spoken across more than twenty countries. In fact, MSA is not acquired as a mother tongue; rather, it is learned as a second language at school and through exposure to formal broadcast programs (such as the daily news), religious practice, and newspapers.
Spoken Arabic is often referred to as colloquial Arabic, dialects, or vernaculars. It is a mixed form with many variations, often strongly influenced by local languages (from before the introduction of Arabic) and by the languages of the countries that occupied the Arabic-speaking region. Differences between the variants of spoken Arabic can be large enough to make them incomprehensible to Arabic speakers from different regions. Hence, given these large differences, we can consider them as disparate languages or, more precisely, as different dialects depending on the geographical area in which they are spoken: Morocco, Algeria, Egypt, ...

Because these dialects are generally not written, corpora are not available. Such corpora are essential for mining texts and for developing applications based on statistical models, such as speech recognition or machine translation. The only existing corpora, which have not yet been explored, are those found in social networks; they cannot be used easily because of the multiplicity of formats, the number of foreign words, the mixing of dialects with French or English, and so on.

This workshop is an opportunity for the NLP community to focus on this challenging topic, to develop new Arabic dialect resources and Arabic dialect corpus processing tools, and to highlight the difficulties of processing Arabic dialects, especially those that use many foreign words adapted lexically and grammatically to Arabic.

The workshop topics include, but are not limited to:

1. Collecting Arabic dialect corpora
2. Diacritization of Arabic dialects
3. Mining Arabic social networks
4. Language modelling
5. Arabic dialect morphology
6. Development of mobile Arabic dialect applications: speech recognition, machine translation, ...
7. Tagging Arabic dialect corpora
8. Maghreb Arabic dialects versus Orient Arabic dialects: linguistic study.
Format and duration: a full-day workshop will be held on April 10, 2014. The language of the workshop is English, and submissions should follow the ICCTIM2014 paper submission instructions. All papers will be peer reviewed. Papers must be submitted electronically in PDF format as soon as possible and before March 10, 2014. When you submit through the OpenConf management system, please select "others" in the proposed topics and enter "Arabic dialect" in the keywords. In all cases, when you submit, please also send an email to the chairman of the workshop: smaili at loria.fr

From hamon at LIMSI.FR Fri Feb 14 20:45:15 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 21:45:15 +0100
Subject: Revue: Texto ! Textes & Cultures
Message-ID:

Date: Wed, 12 Feb 2014 14:40:39 +0100
From: Carine Duteil-Mougel
Message-Id:
X-url: http://www.revue-texto.net

Hello,

Below you will find the table of contents of the new issue of the journal Texto ! Textes et Cultures.
http://www.revue-texto.net

Enjoy your reading!

/////

TEXTO ! Volume XIX - n°1 (2014). Edited by Régis Missire

Dits et inédits

Dénier, théoriser, éliminer : le « travail » de Duch
Sur L'élimination de Rithy Panh avec Christophe Bataille
ELISE LAMY-RESTED

À la découverte du sens historique. Alfred Baeumler et la Nouvelle vision du monde
LÉONORE BAZINEK

L'expérience de la parole : le thème du sujet parlant
ANTONINO BONDI

Le discours de Dakar - Représentations et stéréotypes dans un discours en Afrique sur l'Afrique
MICHELLE LECOLLE

Parutions et trésors

Pour une conception intégrale de la création lexicale
CHRISTOPHE GÉRARD

Constantes et variations du rituel politique: l'exemple des vœux présidentiels (Démonstration Lexico 3/TextObserver)
JEAN-MARC LEBLANC

Sémantique du site web de la restauration: étude thématique et expressive d'un genre polysémiotique
ERIC TRUDEL

Outils et méthodes pour appliquer le savoir-faire des sémanticiens aux traitements des masses documentaires
MATHIEU VALETTE, EGLE EENSOO, EVELYNE BOURION & MONIQUE SLODZIAN

Note de lecture. Gaston Gross. Manuel d'analyse linguistique : Approche sémantico-syntaxique du lexique.
PASCAL VAILLANT

Dynamiques du changement sémantique. Détection, analyse et modélisation du changement sémantique en corpus en diachronie courte - Présentation de thèse
ARMELLE BOUSSIDAN

Repères pour l'étude

La sémantique interprétative - Du mot au corpus et du sème aux formes sémantiques
ALI BELGHANEM

À la recherche du temps perdu, ou comment rendre l'espace immatériel
THIERRY MÉZAILLE

Saussurismes et sémiotique

Triple articulation de la langue et articulation herméneutique du langage - Quand De l'essence double du langage réinterprète les textes saussuriens
SIMON BOUQUET

From hamon at LIMSI.FR Fri Feb 14 21:15:38 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:15:38 +0100
Subject: Appel: COLING 2014, February 2014 Newsletter
Message-ID:

Date: Thu, 13 Feb 2014 17:07:23 +0000
From: ???
Message-ID: <01648087DD5D4349A12B758B6F475B842CD7C9EF at SejongSRV.sejong.knu.ac.kr>
X-url: http://www.coling-2014.org

We apologize if you received multiple copies.

COLING 2014
Dublin, Ireland, 23-29 August, 2014

COLING 2014 (http://www.coling-2014.org), the 25th International Conference on Computational Linguistics, will be organised by CNGL (Centre for Global Intelligent Content) at the Helix Convention Centre at Dublin City University (DCU) from 23 to 29 August 2014. The COLING conference is organised under the auspices of the International Committee on Computational Linguistics (ICCL).

Accepted Tutorials and Workshops

We are pleased to announce that the workshops and tutorials for COLING 2014 have been finalised. In total there will be 6 tutorials and 18 workshops covering a wide range of topics. You will be able to register for tutorials and workshops when the main conference registration opens.
Accepted Tutorials:

- Dependency Parsing: Past, Present, and Future - Wenliang Chen, Zhenghua Li and Min Zhang
- Using Neural Networks for Modeling and Representing Natural Languages - Tomas Mikolov
- Multilingual Word Sense Disambiguation and Entity Linking - Roberto Navigli and Andrea Moro
- Selection Bias, Label Bias, and Bias in Ground Truth - Anders Søgaard, Barbara Plank and Dirk Hovy
- Automated Grammatical Error Correction for Language Learners - Joel Tetreault and Claudia Leacock
- Biomedical/clinical NLP - Ozlem Uzuner and Meliha Yetisgen

Accepted Workshops:

- Workshop on Lexical and Grammatical Resources for Language Processing
- First Joint Workshop on Statistical Parsing of Morphologically Rich Languages and Syntactic Analysis of Non-Canonical Languages (SPMRL-SANCL)
- Celtic Language Technology Workshop (CLTW)
- 5th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP)
- Synchronic and Diachronic Approaches to Analyzing Technical Language (SADAATL)
- The 3rd Workshop on Vision and Language (VL'14)
- Second Workshop on Natural Language Processing for Social Media (SocialNLP 2014)
- First Joint Workshop on Open Infrastructures for HLT Resource Processing and Development and Text Analysis Frameworks
- Cognitive Aspects of the Lexicon (CogALex-IV)
- Semantic Web and Information Extraction (SWAIE)
- VarDial: Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects (VarDial)
- SemEval-2014: Semantic Evaluation Exercises - International Workshop on Semantic Evaluation (SemEval)
- The AHA!-Workshop on Information Discovery in Text (AHA!)
- CompuTerm 2014. 4th International Workshop on Computational Terminology (CompuTerm)
- The First Workshop on Computational Approaches to Compound Analysis (ComAComA)
- Automatic Text Simplification - Methods and Applications in the Multilingual Society (ATS-MA)
- The 8th Linguistic Annotation Workshop (The LAW VIII)
- The third Joint Conference on Lexical and Computational Semantics (*SEM-2014)

Call for Papers:

The first call for papers for the main conference has been issued, and the paper submission system is now open at https://www.softconf.com/coling2014/main/. Papers should be prepared using the paper templates available here: http://www.coling-2014.org/doc/coling2014.zip. We invite authors to submit papers in all relevant areas and encourage them to include analysis of the influence of theories (intuitions, methodologies, insights) on technologies (computational algorithms, methods, tools, data), and/or of the contributions of technologies to theory development. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome. For more information and instructions for authors, see the preliminary call for papers here: http://www.coling-2014.org/call-for-papers.php.

Sponsorship and Promotion Opportunities

Given the tremendous impact and growth of COLING over many years, we have set out a comprehensive offering of corporate sponsorship and support opportunities for industry participants. The COLING 2014 organising committee is committed to working in close partnership with corporate supporters to maximise the benefits of sponsorship and to provide opportunities for sponsors to engage appropriately with COLING delegates. If you wish to explore additional options for support or means of engagement with the COLING 2014 community, please feel free to contact sponsorship at coling-2014.org.
From hamon at LIMSI.FR Fri Feb 14 20:49:03 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 21:49:03 +0100
Subject: Stage: Parametrage et evaluation d'un systeme de classification automatique de rapports de securite, Toulouse
Message-ID:

Date: Wed, 12 Feb 2014 16:03:13 +0100
From: Ludovic Tanguy
Message-ID: <52FB8D31.4090104 at univ-tlse2.fr>

Master 2 / engineering school internship: tuning and evaluation of an automatic classification system for safety reports.

The company: CFH (Conseil en facteurs humains)/SafetyDATA is an SME specializing in natural language processing for the safety domain. It works in collaboration with CLLE-ERSS, a linguistics laboratory, and more specifically with its natural language processing team (TAL, Traitement automatique des langues). The internship will be co-supervised by CFH and the CLLE laboratory.

Context: CFH has designed a natural language processing system that analyzes incident reports in order to propose one or more categories for indexing them in a database. The system is currently deployed and analyzes several hundred documents each month (in French and English), using automatically learned rules based on spotting certain terms in the reports. More details at http://www.safety-data-analysis.com/

The internship aims to develop this classification system further, in particular by considering the use of a statistical supervised learning system. It has two objectives:
1/ Identify and quantify the gain obtained by using a statistical model (SVM, logistic regression, Bayesian network, etc.) compared with the current system;
2/ Measure the impact on system performance of the various linguistic processing steps applied to documents before analysis (error correction, normalization of surface forms, identification of complex expressions, use of semantic classes, etc.).

Profile sought: second-year master's student or final-year engineering school student, in computer science or natural language processing.

Required skills: automatic classification systems based on machine learning. The student must understand how these systems work and be autonomous in using and evaluating them (scripts, handling large volumes of data, etc.). The student is also expected to be able to examine data closely and to fit into an interdisciplinary environment.

Details:
- paid internship under a formal agreement, based in Toulouse
- duration: 4 to 6 months, starting in March

Contact:
Céline Raynal raynal at conseil-fh.fr
Ludovic Tanguy tanguy at univ-tlse2.fr

From hamon at LIMSI.FR Fri Feb 14 20:55:23 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 21:55:23 +0100
Subject: Appel: SEPLN 2014
Message-ID:

Date: Wed, 12 Feb 2014 16:28:44 +0100
From: Horacio Saggion
Message-ID:
X-url: http://www.taln.upf.edu/pages/sepln2014/es/index.html
X-url: http://www.taln.upf.edu/pages/sepln2014/en/index.html

CALL FOR PAPERS: 30th CONFERENCE OF THE SPANISH SOCIETY FOR NATURAL LANGUAGE PROCESSING (SEPLN 2014)
September 17-19, 2014
Universitat de Girona
http://www.taln.upf.edu/pages/sepln2014/es/index.html
--------------------------------------------------------------------

INTRODUCTION
-----------------------

The 30th edition of the Annual Conference of the Spanish Society for Natural Language Processing (SEPLN) will take place at the Universitat de Girona, Girona, Spain, on 17-19 September 2014. We also expect to organize associated workshops.

The huge amount of information available in digital format and in different languages calls for systems that let us access this vast library in an increasingly structured way. In the same area, there is renewed interest in improving information accessibility and information exploitation in multilingual environments. Many of the formal foundations for dealing appropriately with these needs have been, and are still being, established in the area of Natural Language Processing and its many branches: information extraction and retrieval, question answering systems, machine translation, automatic analysis of textual content, text summarization, text generation, and speech recognition and synthesis.

The aim of the conference is to provide a forum for discussion and communication where the latest research work and developments in the field of Natural Language Processing (NLP) can be presented by the scientific and business communities.
The conference also aims to showcase new possibilities for real applications and R&D projects in this field. Moreover, as in previous editions, it intends to identify future directions for basic research and foreseen software applications, in order to compare them against market needs. Finally, the conference intends to be an appropriate forum for helping new professionals become active members of this field.

TOPICS
-----------

Researchers and companies are encouraged to send communications, project abstracts or demonstrations related to any language technology topic, including but not limited to the following:

* Linguistic, mathematical and psycholinguistic models of language.
* Machine learning in NLP.
* Computational lexicography and terminology.
* Corpus linguistics.
* Development of linguistic resources and tools.
* Grammars and formalisms for morphological and syntactic analysis.
* Semantics, pragmatics and discourse.
* Lexical ambiguity resolution.
* Monolingual and multilingual text generation.
* Machine translation.
* Speech synthesis and recognition.
* Dialogue systems.
* Audio indexing.
* Monolingual and multilingual information extraction and retrieval.
* Question answering systems.
* Evaluation of NLP systems.
* Automatic textual content analysis.
* Sentiment analysis and opinion mining.
* Plagiarism detection.
* Negation and speculation processing.
* Text mining in the blogosphere and social networks.
* Text summarization.
* Image retrieval.
* NLP in the biomedical domain.
* NLP-based generation of teaching resources.
* NLP for languages with limited resources.
* NLP industrial applications.
CONTACT
--------------

All information related to the conference can be found on the web: http://www.taln.upf.edu/pages/sepln2014/en/index.html

STRUCTURE OF THE CONFERENCE
--------------------------------------------------

The conference will last three days and will consist of sessions devoted to presenting papers, posters, tutorials, ongoing research projects and prototype or product demonstrations connected with the topics addressed in the conference. We also expect to organize associated workshops.

SUBMISSION OF CONTRIBUTIONS
-------------------------------------------------

Authors are encouraged to send theoretical or application-oriented proposals related to NLP. The proposals must meet the following requirements:

* The title of the communication must be included.
* An abstract in English and Spanish (maximum 150 words) and a list of keywords must be included.
* The paper can be written in Spanish or English. Its overall maximum length is 8 pages, including references.
* The documents must not include headers or footers.
* Papers should NOT include the names of the authors.

Submitted papers will be reviewed by at least three reviewers and can be accepted for presentation either as posters or as communications, depending on the needs of the program. However, no distinction will be made between communications and posters in the printed version of the SEPLN journal.

*** IMPORTANT NOTE ON CAMERA READY ****
The final version of the paper (camera ready) should be submitted together with a cover letter explaining how the suggestions of the reviewers were implemented in the final version.
**********************************************

Please send your proposals using the following link: http://www.sepln.org/myreview-sepln53/

The format of the SEPLN journal must be followed: http://www.sepln.org/?page_id=1285&lang=en

In addition, all proposals will have to comply with the following requirements, depending on whether they are papers, demos or projects.
PROJECTS AND DEMOS
----------------------------------

As in previous editions, the organizers encourage participants to give oral presentations of R&D projects and demos of systems or tools related to the NLP field.

For oral presentations on R&D projects to be accepted, the following information must be included:

* Project title.
* Name, affiliation, address, e-mail and phone number of the project director.
* Funding institutions.
* Groups participating in the project.
* Abstract (4 pages maximum, including references).

For demonstrations to be accepted, the following information is mandatory:

* Demo title.
* Name, affiliation, e-mail and phone number of the authors.
* Abstract (4 pages maximum, including references).
* Time estimation for the whole presentation.

**** SEE NOTE ON CAMERA READY ABOVE ****

IMPORTANT DATES
----------------------------

Deadline for full papers, demos, and projects: 10th April 2014
Notifications: 26th May 2014
Camera Ready: 7th June 2014

From hamon at LIMSI.FR Fri Feb 14 21:01:10 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:01:10 +0100
Subject: Job: Professeur en informatique, UFR Mathematiques et Informatique, Universite de Lorraine, Modification du profil
Message-ID:

Date: Wed, 12 Feb 2014 17:12:37 +0100
From: smaili
Message-ID: <52FB9D75.9050402 at loria.fr>
X-url: http://www.loria.fr/la-recherche/departements/traitement-automatique-des-langues-et-des-connaissances

The UFR Mathématiques et Informatique and LORIA at the Université de Lorraine (UL) are offering a position of professor of computer science; the teaching and research profiles are given below. In addition, on 19 March we are organizing a presentation day at the UFR MI, where candidates who wish to do so can present their applications to the teaching teams of the UFR Mathématiques et Informatique.

Teaching profile: The professor must join the teaching team of the bachelor's (licence) and master's programs in cognitive science.
Cognitive science is concerned with understanding artificial, animal, or human thought. The skills sought therefore range from computer science to language sciences, by way of biology. The UFR will recruit a professor with as many skills related to cognitive science as possible, who could teach courses on: cognitive aspects of computational linguistics, neuroscience, knowledge representation, artificial intelligence, collective intelligence, etc. The teacher must have solid competence and experience in this field in order to be operational from the start of the 2014-2015 academic year, when the needs of our programs are substantial. The candidate will join the teaching team and contribute to the creation of innovative projects. On the administrative side, the candidate must be able to take on heavy responsibilities very quickly: setting up new degree programs, writing accreditation applications, directing degree programs or the UFR. The recruited professor will be expected to be a driving force in the future projects we are currently developing.

Department/UFR: Mathématiques et Informatique
UFR contact: K. Smaïli
Teaching keywords: cognitive science, natural language processing, neuroscience, artificial intelligence

Research profile: The recruited professor will conduct research at the Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), within department 4 (Traitement des langues et des connaissances, i.e. language and knowledge processing): http://www.loria.fr/la-recherche/departements/traitement-automatique-des-langues-et-des-connaissances. All teams in the department are concerned. This department studies three closely related areas: natural languages, knowledge, and documents.
Language-related topics cover a broad spectrum, including speech (analysis, recognition, synthesis, and perception), syntax (using symbolic or statistical methods), semantics, and discourse representation. In the knowledge area, work addresses, among other things, knowledge representation and the formalization of reasoning, knowledge discovery using symbolic or numerical data mining methods, learning methods, and classification. Regarding documents, the main topics are pattern recognition, symbol recognition, and handwriting modeling. The teams concerned are Cello, Orpailleur, Parole (Multispeech and SMarT), Qgar, Read, Sémagramme, Synalp.

Keywords: natural language processing, computational linguistics, formal grammars, machine translation, speech analysis/synthesis, speech recognition, articulatory modeling of speech, audio signal processing, sound source separation, probabilistic models, complex data mining, symbolic and numerical learning, knowledge representation and reasoning, classification, pattern recognition, handwriting modeling

From hamon at LIMSI.FR Fri Feb 14 21:19:39 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:19:39 +0100
Subject: Appel: Traitement automatique des langues africaines, TALAf 2014
Message-ID:

Date: Fri, 14 Feb 2014 15:34:37 +0100
From: Mathieu Mangeot
Message-Id: <9E5A0F68-F283-4A20-BF7C-6F42C360D4B0 at imag.fr>
X-url: http://jibiki.univ-savoie.fr/~mangeot/TALAf/2014/

Call for papers

TALAf 2014: Traitement automatique des langues africaines (natural language processing for African languages, written and spoken)
TALN 2014 workshop - Marseille, 1 July 2014

HTML version: http://jibiki.univ-savoie.fr/~mangeot/TALAf/2014/
PDF version: http://jibiki.univ-savoie.fr/~mangeot/TALAf/2014/atelierTALAf2014v5.pdf

PRESENTATION

Following the first TALAf workshop, held on 8 June 2012 in Grenoble during the JEP-TALN-RECITAL 2012 conference (see the proceedings: http://aclweb.org/anthology//W/W12/#1300), we are proposing a new edition of the workshop at the TALN 2014 conference on 1 July in Marseille. We welcome work on all under-resourced languages of Africa. Work on the dialectal Arabic of North Africa (Maghrebi) is also welcome.

Research on natural language processing for African languages is currently on the threshold of major developments. Efforts to recognize national languages and to standardize their alphabets are beginning to bear fruit. In Niger, for example, the alphabets of Fulfulde, Hausa, Kanuri, Songhai-Zarma and Tamajaq were defined by government decrees in 1999. In addition, a number of colleagues trained in countries of the North are returning to their home countries determined to continue their work on local languages.
There are also diasporas with the technological means to contribute directly online on a volunteer basis. Nevertheless, the national languages of most African countries are under-resourced ("langues-π"): the available electronic resources are scarce, poorly distributed, or nonexistent. Only editing and printing functions are available, which makes these languages difficult to work with. At a time when there are plans to introduce them into the education system, to create standardized and stable writing norms and, above all, to develop their written and spoken use in administration and daily life, developing these languages is a vital necessity.

Developing natural language processing for African languages requires building resources that will serve as the foundation for more elaborate processing. It appears essential first to build annotated written and spoken corpora that are as large as possible. From such corpora, examples can be extracted to help compile dictionaries, and language models can be developed for speech recognition. However, building such corpora remains a delicate undertaking for under-resourced languages, because transcriptions suffer from the lack of standardization of the language and corpus enrichment remains very costly. Developing speech-based applications can be considered a priority in regions with an oral tradition. Moreover, the widespread use of mobile phones suggests that such applications could be deployed quickly. Dictionaries are also needed to build basic tools such as spell checkers (which can in turn be used to correct written corpora) or to assist in transcribing spoken corpora.
Bilingual dictionaries pairing the official language with a national language sometimes exist. For example, in Mali, Father Charles Bailleul is the author of a Bambara-French dictionary; in Niger, the SOUTÉBA education project created five bilingual dictionaries for primary school children. But these exist only on paper or as word-processor files (.doc format). The first step is to computerize these dictionaries (convert them to a format usable by natural language processing tools) and to complete them with missing information: pronunciation, morphological inflection rules, examples and translations drawn from corpora, etc.

Workarounds can sometimes be devised to compensate for the lack of resources. For example, if no spoken corpora with transcriptions exist, a spoken corpus of read texts can be built.

Finally, the socio-economic constraints on the speaker population must be taken into account: economic resources are limited, qualified human resources are scarce, research is sporadic and isolated, and results are little known and fragmentary. It is therefore necessary to define methodologies that are economical in software purchase costs and in skilled working time, and that aim to produce results that are lasting, shared, and easy to extend.
The building of linguistic resources in general, and even more so for African languages, should therefore respect a number of principles: use of open-source tools, definition and use of standards (ISO, Unicode), knowledge transfer between colleagues from countries of the North and South, availability of resources under open licenses (Creative Commons), etc.

The aim of this workshop is to take stock of work on building basic linguistic resources (dictionaries, spoken and written corpora), to develop simple and economical methodologies for building resources, to exchange on techniques for doing without certain nonexistent resources, and to set out a number of principles for future work in the field.

The workshop will last half a day or a full day, depending on the submissions and on the final decision of the TALN program committee.

The TALAf workshops are supported by the LTT association (Lexicologie Terminologie Traduction).

TYPES OF COMMUNICATION

TOPICS

The workshop is open to presentations of research on the following topics:

Resources:
- building written corpora (monolingual, aligned bilingual, or comparable)
- building spoken corpora (including transcription)
- developing lexicons and dictionaries (monolingual, bilingual)
- evaluating resource quality

Tools:
- morphological analyzers, spell checkers
- syntactic parsers, grammar checkers
- machine translation systems (statistical or rule-based)
- speech recognition
- speech synthesis

SELECTION CRITERIA

Submissions will be reviewed by at least two specialists in the field. For research work, the following will be considered in particular:
- relevance to the workshop topics,
- importance and originality of the contribution,
- correctness of the scientific and technical content,
- organization and clarity of the presentation.

SUBMISSION PROCEDURE

Papers should be written in French by French speakers, and in English by those who are not proficient in French. The exact submission formats are available for Word and LaTeX on the TALN 2014 website: http://www.taln2014.org/site/soumission/

Paper proposals must be submitted in PDF format at the following address: https://www.easychair.org/conferences/?conf=talaf20140

PROGRAM COMMITTEE

Laurent Besacier (LIG, Grenoble, France)
Philippe Bretier (Voxygen, Pleumeur-Bodou, France)
Khalid Choukri (ELDA, Paris, France)
Mame Thierno Cissé (ARCIV, Université Cheikh Anta Diop, Dakar, Senegal)
Denys Duchier (Université d'Orléans, Orléans, France)
Chantal Enguehard (LINA, Nantes, France)
Gil Francopoulo (Tagmatica, Paris, France)
Mathieu Mangeot (LIG, Grenoble, France)
Chérif Mbodj (Centre de Linguistique Appliquée de Dakar, Senegal)
Kamal Naït-Zerrad (INALCO, Paris, France)
Pascal Nocera (Université d'Avignon, France)
Francois Pellegrino (DDL, Lyon, France)
Fatiha Sadat (UQAM, Montréal, Canada)
Mamadou Lamine Sanogo (INSS, Ouagadougou, Burkina-Faso)
Emmanuel Schang (Université d'Orléans, Orléans, France)
Gilles Sérasset (LIG, Grenoble, France)
Valentin Vydrin (LLACAN-INALCO, Paris, France)

SCHEDULE

- Submission deadline: 26 April 2014
- Notification to authors: 24 May 2014
- Deadline for final versions: 15 June 2014
- Workshop: 1 July 2014

From hamon at LIMSI.FR Fri Feb 14 21:21:54 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:21:54 +0100
Subject: Appel: Atelier IC&Sante, associe a la conference IC 2014
Message-ID:

Date: Fri, 14 Feb 2014 16:49:13 +0100
From: Sandra Bringay
Message-Id: <428D889D-DA62-4520-BD59-81DEE14FB069 at lirmm.fr>
X-url: https://www.lirmm.fr/ic-sante/

Atelier Ingénierie des connaissances et Santé (Knowledge Engineering and Health workshop)
https://www.lirmm.fr/ic-sante/

In association with the 25th Journées francophones d'Ingénierie des Connaissances, Clermont-Ferrand, 12-16 May 2014 (http://www.irit.fr/IC2014)

In recent years, the volume of medical data has grown steadily as health professionals increasingly adopt information technology. These data too often remain under-exploited. Knowledge engineering methods offer relevant solutions for handling the full potential richness of these databases, even though questions of ethics, deployment of solutions, acquisition, and security remain open.
The workshop will be an opportunity to discuss the obstacles involved in modelling, structuring, extracting, searching and mining health data of all kinds. It will showcase innovative research, methods, technologies and demonstrators developed in academia, in laboratories affiliated with health-care institutions, and by companies working in knowledge engineering. The aim of the workshop is to provide a forum for exchange between researchers, industry and users interested in the latest scientific advances in knowledge engineering for health.

Topics of interest include, for example:

* Medical terminologies and ontologies: interoperability of health information systems; terminological indexing of the medical literature, of patient records and of clinical narratives; document annotation; representation formalisms; ontology alignment.
* Use of Web 2.0 in health: collaborative networks of health practitioners, patients and families; bibliographic networks in the medical domain; social annotation of medical resources.
* Multimodal information processing: processing of text, images and sound; classification and integration of multimodal, multi-source medical content.
* Knowledge extraction from medical content: anonymisation; extraction of named entities, variables and/or pathological indicators; correlation discovery; medical data mining.
* Health information systems: representation and processing of, and access to, health information and knowledge; decision-support health information systems; patient record management; personalisation and health.

We invite researchers and experts in the field to present their work.
We also encourage young researchers and industry practitioners interested in these questions to take part in the event and to make themselves known. Each submission will be evaluated by several reviewers from the programme committee. The workshop is open to work at different stages of progress.

Workshop website:
--------------------------
https://www.lirmm.fr/ic-sante/

Submissions:
--------------------------
Texts (submission and final version) must be written in French (in English for non-French speakers), be anonymous, and run to at most 6 pages including all references and figures, in the format of the Ingénierie des connaissances conference (http://www.irit.fr/IC2014/node/16). Papers are to be submitted via https://www.easychair.org/conferences/?conf=icsant2014

Important dates:
--------------------------
* Submission deadline: 10 March 2014
* Notification to authors: 31 March 2014
* Final versions of contributions: 14 April 2014

Organisers:
---------------------
* Sandra Bringay, LIRMM-UM3, http://www.lirmm.fr/~bringay/
* Nathalie Souf, IRIT-UPS, http://www.irit.fr/~Nathalie.Souf
* Lynda Tamine-Lechani, IRIT-UPS, http://www.irit.fr/~Lynda.Tamine-Lechani/

Programme committee:
-------------------------------
Jérome Azé, LIRMM, Montpellier
Audrey Baneyx, Sciences Politiques, Paris
Catherine Berrut, LIG, Grenoble
Sandra Bringay, LIRMM, Montpellier
Jean Charlet, AP-HP & INSERM U1142, Paris
Adrien Coulet, LORIA, Nancy
Olivier Cure, LIGM, Paris
Cedrick Fairon, Université de Louvain
Natalia Grabar, STL, Lille
Brigitte Grau, LIMSI, Paris
Marie-Christine Jaulent, INSERM U1142, Paris
Clément Jonquet, LIRMM, Montpellier
Vanda Luengo, LIG, Grenoble
Fleur Mougin, ERIAS - ISPED, Université
de Bordeaux
Pascal Poncelet, LIRMM, Montpellier
Mathieu Roche, CIRAD, Montpellier
Lina Soualmia, LITIS, Rouen
Nathalie Souf, IRIT, Toulouse
Lynda Tamine Lechani, IRIT, Toulouse
Maguelonne Teisseire, IRSTEA, Montpellier
Mouna Torjmen Khemakhem, Université de Sfax, Tunisia
Pierre Zweigenbaum, LIMSI, Paris

From hamon at LIMSI.FR Fri Feb 14 21:20:31 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:20:31 +0100
Subject: Job: CDI linguiste-informaticien H/F, Eptica-Lingway
Message-ID:

Date: Fri, 14 Feb 2014 16:36:55 +0100
From: Hugues de Mazancourt
Message-Id: <8111AB85-F54E-4A54-9DDB-7CF69EA3FBE0 at eptica.com>

Permanent position (CDI)
Location: Boulogne-Billancourt
Salary: depending on experience

Eptica Lingway, a subsidiary of the Eptica group, develops and markets the LEA product range for e-recruitment. The range comprises LEA CV for automatic CV analysis, LEA Search for semantic search, notably over job offers and CVs, and LEA Matching for matching job offers with CVs. These products rely on analysis grammars, dictionaries and a multilingual semantic network tailored to the recruitment sector. They also implement specific document-retrieval strategies, notably for offer-CV matching. The LEA range is available in several European and Asian languages.
As part of the development of the LEA offering, Eptica-Lingway is looking for a computational linguist on a permanent contract (CDI). Within the technical team, the successful candidate will:
- take part in the design, development and maintenance of the linguistic features of the LEA products,
- provide expertise on our customers' technical issues,
- carry out corpus analyses,
- provide second-level support to customers and partners,
- attend project meetings when needed,
- extend and run product quality tests.

Required skills:
- Natural Language Processing (corpus studies, search engines, local grammars, machine learning techniques),
- knowledge of programming/scripting (Java, Groovy, Perl, ...) and databases,
- fluency in English,
- fluency in other European languages is a plus,
- good analytical skills,
- ability to work in a team.

Send a CV and cover letter to hugues.de-mazancourt at eptica.com

From hamon at LIMSI.FR Fri Feb 14 21:04:25 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:04:25 +0100
Subject: Appel: Call for Demos, NLDB'2014
Message-ID:

Date: Wed, 12 Feb 2014 18:16:32 +0100
From: Mathieu Roche
Message-ID: <74038619aa63fbdf9b4901f47d8e716a at lirmm.fr>
X-url: http://www.nldb.org/

*******************************************
Call for Demos - NLDB'2014
18-20 June 2014 - Montpellier, France
http://www.nldb.org/
*******************************************

The 19th International Conference on Application of Natural Language to Information Systems (NLDB'2014) invites submissions of demonstrations of state-of-the-art research or industrial prototypes related to all aspects of Natural Language in the Database and Information Systems field.

Topics of interest include but are not limited to:
- Applications of NLP in Information Systems
- Social Media and Web Data
- Big Data and Natural Language
- Semantic Web and Open Linked Data
- Question Answering (QA)
- Natural Language and Ubiquitous Computing
- Natural Language in Conceptual Modeling
- NLP Applications (Opinion Mining, Information Extraction, ...)

Demo submissions will be handled online via the EasyChair conference management system: https://www.easychair.org/conferences/?conf=nldb2014demonstratio

Demonstration paper submissions should be 4 pages long (LNCS format). Developers should outline the design of their system and provide enough detail to allow its validity, quality, originality, and relevance to NLP in Information Systems to be evaluated.

The accepted demo papers will be included in the conference proceedings, to be published by Springer Verlag in the "Lecture Notes in Computer Science" (LNCS) series. The demos will be presented in a special demonstration session.
At least one of the demo submitters must register for the conference and perform the demo on site.

Important Dates:
- Demo submission deadline (firm): March 13, 2014
- Notification of acceptance: March 28, 2014
- Camera-ready paper due: April 7, 2014

From hamon at LIMSI.FR Fri Feb 14 21:06:20 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:06:20 +0100
Subject: Appel: Atelier TALN SemDis 2014
Message-ID:

Date: Thu, 13 Feb 2014 09:42:24 +0100
From: Cécile Fabre
Message-ID: <52FC8570.2090902 at univ-tlse2.fr>
X-url: http://www.irit.fr/semdis2014/

CALL FOR PARTICIPATION
SemDis2014 Workshop (Distributional Semantics)
Workshop organised as part of the TALN 2014 conference
Marseille, 1 July 2014
************************************************************
Chairs: Cécile Fabre (CLLE-ERSS) and Tim Van de Cruys (IRIT-MELODI)
Website: www.irit.fr/semdis2014/
Contact: semdis2014 at gmail.com
************************************************************

SemDis 2014 is the 2nd edition of the workshop devoted to NLP approaches based on distributional semantic analysis. This year the workshop is organised around two tasks on the processing of French:

1. a competitive lexical substitution task, a French adaptation of the SemEval 2007 task (McCarthy & Navigli 2010). Given a target word in a complete sentence, the goal is to propose one or more substitutes that do not alter the overall meaning of the utterance. The choice of substitute is free and will be compared with the answers given by human annotators. The evaluation set comprises 300 sentences from the frWaC corpus. This task will produce a ranking of the competing systems. Participants in the task must submit their results and a paper describing the system used.

2.
an exploratory task on a specialised corpus, the TALN corpus, a selection of French-language papers from the TALN and RECITAL conferences over the period 2007 to 2013. We invite participants to apply to this corpus one or more distributional analysis techniques of their choice, according to their own objectives. To ease discussion around shared examples, a selection of words is proposed (calculer, complexe, précis, fréquence, graphe, méthode, sémantique, trait). This is not a competitive task; participants must submit a paper describing the work carried out on this corpus.

The full description of the tasks and data is available on the workshop website: www.irit.fr/semdis2014/

Important dates:
*****************
Participation in task 1:
10 March 2014: release of the test data
17 March 2014: deadline for participants to submit results
7 April 2014: announcement of results

Paper submission for both tasks:
22 April 2014: paper submission
13 May: notification
30 May 2014: final versions due

Types of papers:
***********************
Papers are to be written in French by French speakers, and in English by those who are not fluent in French. They must follow the TALN 2014 format. Papers submitted for task 1 will be 6 to 12 pages long. Papers submitted for task 2 will be 12 to 14 pages long. A LaTeX style file and LibreOffice and Word templates are available on the conference website (http://www.taln2014.org/site/soumission/). The length of the talks will be announced later and will depend on the number of accepted papers.

Submission procedure:
**********************
Papers, in PDF format, must be uploaded at: https://www.easychair.org/conferences/?conf=semdis2014

Organising committee:
*******************
The workshop is organised
jointly by the CLLE-ERSS and IRIT-MELODI teams (Toulouse).
Cécile Fabre (CLLE-ERSS)
Nabil Hathout (CLLE-ERSS)
Mai Ho-Dac (CLLE-ERSS)
François Morlane-Hondère (CLLE-ERSS)
Philippe Muller (IRIT-MELODI)
Franck Sajous (CLLE-ERSS)
Ludovic Tanguy (CLLE-ERSS)
Tim Van de Cruys (IRIT-MELODI)

Programme committee:
********************
Stergos Afantenos - IRIT, Toulouse
Ann Bertels - ILT-QLVL, Leuven
Yves Bestgen - UCL/CECL, Louvain-La-Neuve
Marie Candito - ALPAGE, Paris
Eric de la Clergerie - ALPAGE, Paris
Cécile Fabre - CLLE, Toulouse
Olivier Ferret - CEA-LIST, Fontenay-aux-Roses
Nabil Hathout - CLLE, Toulouse
Olivier Kraif - LIDILEM, Grenoble
Philippe Langlais - RALI, Montréal
Emmanuel Morin - LINA, Nantes
Philippe Muller - IRIT, Toulouse
Adeline Nazarenko - LIPN, Paris
Pascale Sébillot - IRISA, Rennes
Ludovic Tanguy - CLLE, Toulouse
Tim Van de Cruys - IRIT, Toulouse

From hamon at LIMSI.FR Fri Feb 14 21:17:19 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:17:19 +0100
Subject: Stage: Pretraitement de donnees textuelles pour un systeme d'analyse semantique, Strasbourg
Message-ID:

Date: Fri, 14 Feb 2014 12:50:12 +0400
From: Yuliya <4arly at bk.ru>
Message-ID: <1392367811.166452886 at f389.i.mail.ru>
X-url: http://macaon.lif.univ-mrs.fr/

Internship topic in computer science
Preprocessing of textual data for a semantic analysis system

Company: Rebuz SAS, Strasbourg
Duration: 2-5 months
Level: M1-M2
Pay: 436.05 euros/month
Contact: Ms Yuliya Goncharova, 4arly at bk.ru

Details:
Rebuz is a company specialising in text analysis for business intelligence. Its original system is based on semantic analysis supported by principles from cognitive linguistics. The internship will focus on improving the existing preprocessing module. The topic will be of particular interest to students wishing to learn more about Natural Language Processing (NLP) and Information Retrieval (IR). Tasks will be adapted to the level and preferences of the selected candidate.

Internship objectives
* Review of the current module (written in Java)
* Code clean-up and optimisation
* Integration of the MACAON part-of-speech tagger [1]
* Test series and refinement

Skills sought:
- fluency in Java programming (a code sample will be requested)
- good knowledge of unit testing (JUnit)
- good teamwork skills
- rigour
- sense of responsibility
- autonomy

[1] http://macaon.lif.univ-mrs.fr/

From hamon at LIMSI.FR Fri Feb 14 21:23:35 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Fri, 14 Feb 2014 22:23:35 +0100
Subject: Appel: Colloque international Langage et Analogie. Figement. Polysemie
Message-ID:

Date: Fri, 14 Feb 2014 18:52:51 +0100
From: Pierre-André Buvet
Message-ID: <52FE57F3.9070204 at ldi.univ-paris13.fr>

International conference
LANGAGE ET ANALOGIE. FIGEMENT. POLYSÉMIE (Language and Analogy. Fixedness. Polysemy)
Université de Grenade, Faculté de Traduction et d'Interprétariat
17-19 September 2014

The vitality of a discipline is measured by its capacity to integrate the sciences and technologies of its time. In linguistics, a large share of theoretical innovation develops in contact with two particularly productive fields: the cognitive sciences and computer science. The former bring to light a number of constraints, not specifically linguistic, that linguistic models cannot ignore without giving up any claim to realism in the psychological sense of the term. The social dimension of languages itself can no longer be thought of independently of the fact that languages manifest themselves and develop through the activity of the individual brains of the speakers who speak and understand them.
Moreover, on the technological side, advances in computing make it possible to explore ever larger quantities of linguistic data in ever more numerous corpora, and hence to hope not only for descriptions that tend towards exhaustiveness but also for an automatic processing of language that approaches, at least in its aims, its natural processing.

The concepts of analogy and fixedness, to which this conference is devoted, play an essential role in linking the language sciences with cognitive psychology and computational tools. The central role of analogy in human cognition is now well documented: human beings have a far more developed capacity than other animal species to perceive, conceive and process a multiplicity of relevant similarities in their environment, thanks to which that environment appears to them at every moment as structured or, in other words, categorised and conceptualised.

At the linguistic level, analogical processes can involve similarities on at least three fundamental planes. At the level of content, similarities between extra-linguistic entities produce concepts that give rise either to a semiotic reification in the form of denominations, or to a procedural unification by means of grammatical words with instructional meaning. At the formal level, similarities between signifiers produce the regularities found wherever a paradigm exists. Finally, similarity between forms and contents is reflected in the motivated character of linguistic signs, whether through internal motivation, i.e. iconicity in the strict sense (imagic iconicity, sound symbolism, etc.), or through external or syntagmatic motivation (Saussurean relative motivation, diagrammatic iconicity in the Peircean sense, or the iconicity of contemporary cognitive linguistics, which shows up in cognitive theories of metaphor and, more broadly, of embodiment).

Yet most of the concepts at play in analogical linguistics (denomination, motivation, iconicity, polysemy, etc.) have been profoundly renewed under the influence of developments in phraseology research. It is also well known that fixed expressions remain, in computational linguistics, a technological bottleneck that constitutes a major challenge for contemporary research on natural language processing. The aim of this biennial conference, whose first session took place in Tunisia in October 2012, is therefore to bring together research in analogical linguistics and research in phraseology, in order to open up new perspectives whose innovative character rests on the plurality of approaches.

The 2014 session will pay particular attention to polysemy, from the perspectives of analogical linguistics and of the linguistics of fixedness. According to the former, polysemy is an intermediate situation between the configuration of isomorphism (one form, one meaning) and the disjunctive configurations of homonymy (one signifier for two distinct signifieds) and polymorphism (one signified for two distinct signifiers). The main field of investigation therefore concerns the qualitative analysis of the similarity between the senses of polysemous words and the cognitive processing of the analogies conditioned by that similarity (relevance of a basic sense, existence of a metaphorical feeling, etc.). As for multiword units, it is established that their polysemy is restricted compared with that of single-word units, since fixed expressions are made up of units saturated in their immediate context, so that the contextual variation linked to polysemous variation can only operate on the expression as a whole.
It remains that some fixed expressions are polysemous (e.g. perdre pied), and moreover that the question of the link between the literal meaning and the phraseological meaning of fixed expressions tests the limits of the concept of polysemy.

In the general systematics of languages, a tension exists between two major principles of cognitive economy: quantitative economy and procedural economy. The latter is maximal in situations of isomorphism (diagrammatic analogy) and iconicity (imaginal analogy). Semiotic systems are indeed all the simpler to operate the more regular they are, and all the easier to memorise the more motivated they are. However, these principles are opposed by the tendency towards quantitative economy, which aims to reduce the number of signs to be memorised. This tendency results either in polysemy (analogy between signs based on the similarity of signifieds), which increases the cognitive yield of a sign by giving it several uses, or in fixedness, which makes it possible to build a new meaning from lexical units already available, the fixed sequence being most often motivated (analogy between form and content). One of the aims of this conference will therefore be to better understand how this tension is resolved across the languages and discourses considered.

More broadly, the conference welcomes any contribution that sheds light on the relations between at least two of the three concepts under examination: analogy, fixedness and polysemy.

Invited speakers
Xavier Blanco (Université autonome de Barcelone)
Mirella Conenna (Université Aldo Moro, Bari)
Christian Cuxac (Université de Paris VIII)
Maribel González Rey (USC, Saint Jacques de Compostelle)
Jean-René Ladmiral (Université de Paris X)
Pedro Mogorron (Université d'Alicante)
Emmanuel Sander (Université de Paris VIII)
Olivier Soutet (Université de Paris IV)
Mark Turner (Case Western Reserve University)

Honorary committee
Francisco Lodeiro González, Rector of the Université de Grenade
Angela Collado Ais, Dean of the Faculty, Université de Grenade
Antonio Chicharro Chamorro, Head of the Department of General Linguistics and Literary Theory, Université de Grenade
Sylvie Laigneau-Fontaine, Director of the CPTC Laboratory (EA 4178), Université de Bourgogne
Eric Heilmann, Dean of the Faculty of Letters and Philosophy, Université de Bourgogne

Scientific committee
Didier Bottineau (Université de Paris X)
Samir Bajric (Université de Paris IV)
Carmen Mellado Blanco (Université de St Jacques de Compostelle)
Pierre-André Buvet (Université de Paris XIII)
Gaston Gross (Université de Paris XIII)
Georges Kleiber (Université de Strasbourg)
Juan de Dios Luque Durán (Université de Grenade)
Lucía Luque Nadal (Université de Cordoue)
Angel López García (Universidad de Valencia)
Aleksandar Mijatovic (Université de Rijeka)
Rosemeire Monteiro-Plantin (Université Fédérale du Ceará)
Julia Sevilla Muñoz (Université Complutense de Madrid)
Franck Neveu (Université de Paris IV)
Luca Nobile (Université de Bourgogne)
José Manuel Pazos Bretaña (Université de Grenade)
Thierry Ponchon (Université de Reims)
Christian Puech (Université de Paris III)
Dan Savatovsky (Université de Paris III)
Jean-Pierre Thibaud (Université de Bourgogne)
Thomas Verjans (Université de Bourgogne)

Proposals are to be submitted in French. They must be sent to the following address: colloqueanalogie at gmail.com. Further information is available on the conference website at https://sites.google.com/site/colloquelangageetanalogie/accueil-grenade-2014-1

Submission deadline (5000 characters + bibliographic references): 5 May 2014
Response from the scientific committee: 30 June 2014
Papers due: 31 January 2015 at the latest.
Publication of the proceedings: 2015

From hamon at LIMSI.FR Mon Feb 17 14:25:22 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Mon, 17 Feb 2014 15:25:22 +0100
Subject: Appel: TALN 2014, Extension de date limite
Message-ID:

Date: Mon, 17 Feb 2014 15:05:16 +0100
From: Philippe Blache
Message-Id: <0403127F-6FE2-4B9D-9767-C60B58ED1DC1 at lpl-aix.fr>
X-url: http://www.taln2014.org

CALL FOR PAPERS

TALN-2014
21st conference on Traitement Automatique des Langues Naturelles (Natural Language Processing)
http://www.taln2014.org
1-4 July 2014
Marseille

--> Submission deadline extended
--> New date: Thursday 27 February 2014
---------------------

SCHEDULE
==========
1. Long papers
- Submission deadline: 20 February 2014
- Notification to authors: 14 April 2014
- Deadline for final versions: 30 May 2014

2.
Short papers
- Submission deadline: 24 April 2014
- Notification to authors: 22 May 2014
- Deadline for final versions: 30 May 2014

3. Demonstrations
- Submission deadline: 21 April 2014
- Notification to authors: 10 May 2014
- Deadline for final versions: 30 May 2014

PRESENTATION
============
Organised by the LPL (Laboratoire Parole et Langage) and the LIF (Laboratoire d'Informatique Fondamentale), the TALN conference (Traitement Automatique des Langues Naturelles) will be held, for its 20th anniversary, in Marseille (Faculté Saint-Charles, Université d'Aix-Marseille) from 1 to 4 July 2014.

The TALN'2014 conference, organised under the aegis of ATALA, will be held jointly with the 16th edition of the Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL'2014).

TALN'2014 will include oral presentations of research work and position papers, poster presentations, invited talks and demonstrations.

The official language of the conference is French. Papers in English are accepted from non-French-speaking participants.

TYPES OF PAPERS
=======================
Two paper formats are planned: long papers (12 to 14 pages) and short papers (6 to 8 pages). Authors are invited to submit two types of papers:
- papers presenting original research
- position papers presenting a point of view on the state of research in NLP, grounded in solid experience of the field

Papers must present original work that makes a substantial contribution relative to other work that may already have been published. Translations of papers published elsewhere in another language are not accepted.
Long papers will be presented as oral talks, short papers as posters. Papers may address any topic in NLP.

SELECTION CRITERIA
=====================
Submissions will be reviewed by at least two specialists in the field. For research papers, the following will be considered in particular:
- relevance to the conference topics
- importance and originality of the contribution
- correctness of the scientific and technical content
- critical discussion of the results, in particular with respect to other work in the field
- positioning of the work in the context of international research
- organisation and clarity of the presentation

For position papers, preference will be given to:
- breadth of view and coverage of the state of the art
- originality and impact of the point of view presented

Selected papers will be published in the conference proceedings. The programme committee will select one of the accepted papers (TALN prize) to be recommended for publication, in an extended version, in the journal Traitement Automatique des Langues (revue TAL).

SUBMISSION PROCEDURE
=======================
Papers are to be written in French by French speakers, and in English by those who are not fluent in French. A LaTeX style file and a Word template are available on the conference website: http://www.taln2014.org

ORGANISING COMMITTEE
=====================
Philippe Blache
Frédéric Béchet
Carine André
Brigitte Bigi
Sébastien Bermond
Cyril Deniaud
Nadéra Bureau
Benoît Favre
Stéphanie Desous
Joëlle Lavaud
Nuria Gala
Frédéric Lefevre
Grégoire Montcheuil
Alexis Nasr
Catherine Perrot
Klim Peshkov
Laurent Prévot
Carlos Ramisch
Stéphane Rauzy
Claudia Starke

CONTACTS
========
philippe.blache[arobas]lpl-aix.fr
nadera.bureau[arobas]lpl-aix.fr

From hamon at LIMSI.FR Tue Feb 18 21:50:56 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:50:56 +0100
Subject: Seminaire: Data, digital methods and mapping social complexity, ENSCI, Paris
Message-ID:

Date: Tue, 18 Feb 2014 18:35:45 +0100
From: Audrey Baneyx - Sciences Po
Message-ID: <530399F1.6080804 at sciencespo.fr>
X-url: http://www.digitalmethods-seminar.org

Dear colleague,

We are pleased to invite you to take part in the new joint research seminar of the médialab (Sciences Po) and the CorTexT platform (IFRIS), "*Data, digital methods and mapping social complexity: Visualizing social and semantic dynamics in the social sciences*". The seminar will be hosted by ENSCI in Paris and led by a team of researchers from Sciences Po and IFRIS from February to July 2014. It is intended for an audience of PhD students, researchers, engineers and designers.
Our meetings will take place on eight Thursday afternoons and two full days in the Espace Viénot at ENSCI, on the ground floor of 48 rue Saint Sabin, Paris. The working language will be French, except for sessions with invited English-speaking researchers. The presentation of the seminar and the programme of sessions are given below. We would be glad to count you among us. Please let us know that you will attend by registering directly on our website: www.digitalmethods-seminar.org.

Best regards,
/The Organising Committee/

--------------------------------------------------

Data, digital methods and mapping social complexity: Visualizing social and semantic dynamics in the social sciences

*Presentation of the seminar*

Maps are visual representations of space. They are codified images that signal the presence of elements in a given territory, such as objects, zones or flows. The art of cartography has a long and rich history. Initially designed to assist the movement of people and goods, the map aims to represent geographical space by positioning graphical elements on a two-dimensional surface. As a figurative tool, the map quickly became an instrument of research and management. Cartographic practice has thus strongly shaped the ways we represent, navigate, understand, organise and contest the spaces we inhabit.

The recent advent of computing and satellites has profoundly transformed cartographic practice. Geographic Information Systems (GIS) now make it possible to capture, store, manipulate, analyse and manage spatial data in support of science and decision-making. Satellites make it possible to position and track movements across the entire globe, while millions of users access interactive maps from their computers or mobile phones.

In parallel with these developments, a growing number of social science researchers are taking an interest in the representation of non-spatial data. These representations are regularly presented as maps of unknown territories, able to depict the emergence, stabilisation and destabilisation of complex social phenomena. By drawing on tools for the extraction, automated processing and visualisation of data from heterogeneous sources (document corpora, databases, web data), these researchers hope to map social phenomena dynamically at different levels of granularity. The technical and epistemological transition opened up by these tools is sometimes called /Digital Humanities/.

These developments seem particularly attractive to researchers interested in the quantitative study of social phenomena. These tools could bridge the methodological gap that separates the analysis of specific interactions from that of global structures. Given the constant expansion in the production and circulation of digital data, it seems essential for social science researchers to examine these new data sources as well as the tools and methods for analysing them. Among the latter, network analysis makes it possible to position actors by visualising their interactions. It complements statistical analyses (regression, correspondence analysis, etc.), offering a new angle from which to study society both /made/ and /in the making/. Such an approach opens new perspectives for many disciplinary fields in the social sciences, from the sociology of science and technology to political science.

But where do these tools come from?
The development of data processing and visualisation software such as Gephi, the CorTexT Manager, Science Scape and Hyphe is the fruit of interdisciplinary work at the intersection of computational linguistics, data mining, artificial intelligence, dynamical systems and network science. These communities have worked on varied problems such as data extraction, parsing, disambiguation, clustering, filtering and visualisation. Likewise, the graphic semiology used in this software draws on work in cartography and design. Thus, to produce relevant, robust and readable visualisations, social science researchers can turn to communities of researchers and practitioners already organised around specific problems and methods.

Our seminar aims to bring these communities together and foster exchange between them (researchers in the social sciences and the "exact" sciences, engineers and designers) through the reading, presentation and discussion of academic and non-academic texts, guest speakers' work and tools. Attached you will find the roadmap (in English) that we have drawn up to reflect the many questions and ways of working that /budding/ cartographers of social complexity face.
*Programme of sessions*

*Calendar*

*27.02.14 // ENSCI -- Espace Viénot // 14:00-17:30*
Introduction
Richard Rogers (University of Amsterdam)

*06.03.14 // ENSCI -- Espace Viénot // 14:00-17:30*
Mapping science and technology through structured data
Jean-Philippe Cointet (INRA)

*27.03.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Digitized archives and distant reading
Claire Lemercier (Sciences Po)

*10.04.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Natively digital data mapping
Noortje Marres (Goldsmiths University of London)

*24.04.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Transformative interactions: web effects on social dynamics
/Invitations in progress/

*15.05.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Visualizing complexity
Donato Ricci (Sciences Po)

*05.06.14 // ENSCI -- Espace Viénot // 14:00-17:30*
Spatializing data
Jacques Levy (EPFL)

*19.06.14 // ENSCI -- Espace Viénot // 14:30-18:00*
Activism, journalism, decision-making: DMT's in practice
Benjamin Ooghe-Tabanou (Regards Citoyens) & a data journalist [to be confirmed]

*10.07.14-11.07.14 // ENSCI -- Espace Viénot [to be confirmed] // 9:30-17:30*
BarCamp
With the participation of the teams of the IFRIS CorTexT platform and the Sciences Po médialab

From hamon at LIMSI.FR Tue Feb 18 21:31:19 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:31:19 +0100
Subject: Conf: TOTh 2014, disputatio, 12-13 June 2014
Message-ID:

Date: Mon, 17 Feb 2014 09:32:04 +0100
From: Luc Damas
Message-ID:
X-url: http://porphyre.org/toth/toth-2014/disputatio

------------------------------------------------------
TOTh 2014 - Disputatio
12-13 June 2014
www.porphyre.org/toth/
http://porphyre.org/toth/toth-2014/disputatio

"Harris and the immunologists: an example to follow?"
Pr. Pierre Lerat - Université Paris XIII

Zellig Harris based his discourse analyses primarily on a universal logico-linguistic mechanism: predication. He obtained particularly significant results in his last works (1988), which deal with reports written by immunologists. The mutual constraints between the types of predicative expressions and the types of referential expressions required by scientific and technical discourse are strong. The interest of this approach raises two questions: for terminology, that of the predictability of specialised words; for ontologies, that of working on predicative schemas other than as "associative relations" among others.
---------------------------------------------------

From hamon at LIMSI.FR Tue Feb 18 21:52:28 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:52:28 +0100
Subject: Job: Research Assistant position on OBDA at the Free Univ. of Bozen-Bolzano, Italy
Message-ID:

Date: Tue, 18 Feb 2014 21:00:09 +0100
From: Diego Calvanese
Message-ID:

------------------------------------------------------------------------
Research Assistant position at the Free University of Bozen-Bolzano/Italy
------------------------------------------------------------------------

The KRDB Research Centre for Knowledge and Data at the Faculty of Computer Science of the Free University of Bozen-Bolzano (FUB), Italy is looking for a talented Research Assistant to work full-time on

*** SPARQL Query Answering for Ontology Based Data Access and Ontology Based Data Integration ***

The position will be financed by the Large Scale Integrating Project (IP) "Scalable End-user Access to Big Data" (Optique) [2], within the EU 7th Framework Programme (FP7). The position concerns the design and implementation of techniques for optimized SPARQL query answering over virtual RDF graphs, obtained through R2RML mappings over relational databases.

Requirements for this position are an MSc or preferably a PhD in Computer Science, fluency in English, and strong Java programming skills. Some background in SPARQL, SQL, and OWL is also required.

During this project the applicant will have the opportunity to apply the results of their research in challenging industrial use-cases involving reasoning on Big Data, as provided by the Optique project and by the user base of the -ontop- [1] framework for Ontology Based Data Access.

The position is to start in 2014, as early as possible. It has an initial contract duration of 1 year, and will be extended for up to 3 years in case of mutual agreement.
Candidates interested in pursuing a PhD in Computer Science that combines applied and foundational research may also apply for enrollment in the PhD program of the Faculty of Computer Science at FUB, starting in Winter 2014.

The position comes with a great location, Bolzano [3,4]. The Free University of Bozen-Bolzano is located right in the center of one of the most beautiful cities in the Alps, and consistently ranks among the top locations in Italy for its living standard. The region, South Tyrol, is among the top destinations for hikers, mountaineers, rock climbers, skiers, etc. And last, as one would expect from any Italian location, food in Bolzano is great.

Interested applicants should get in contact immediately with Prof. Diego Calvanese (calvanese at inf.unibz.it), Dr. Martin Rezk (mrezk at inf.unibz.it), or Dr. Guohui Xiao (xiao at inf.unibz.it). Please attach a detailed CV including a statement of interest, project experience (if any) and up to 2 references (reference letters will be requested directly from the referees).

[1] http://ontop.inf.unibz.it/
[2] http://www.optique-project.eu/
[3] http://en.wikipedia.org/wiki/Bolzano/
[4] https://www.google.com/search?q=bolzano+italy&prmd=imvns&tbm=isch

From hamon at LIMSI.FR Tue Feb 18 21:47:08 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:47:08 +0100
Subject: Job: Speech Data Evaluator, Google Speech Research (Multiple positions)
Message-ID:

Date: Tue, 18 Feb 2014 12:49:33 +0000
From: Hadrien Gelas
Message-ID:

Hello,

Please consider the announcement below and, if possible, especially if you are involved in teaching programmes, circulate this message to students or other interested people.
Thank you in advance,
Best regards,
Hadrien Gelas
________________________________

Speech Data Evaluator for French

Job title: Speech Data Evaluator for French (Multiple positions)

Linguistic Field(s):
- Phonetics/Phonology
- Semantics
- Discourse Analysis

Job description:
As a Speech Data Evaluator and a native-level speaker of French, you will be part of a team based in Dublin, processing large amounts of linguistic data and carrying out a number of tasks to improve the quality of Google's speech synthesis and speech recognition in your own language. This includes:
- classifying and annotating linguistic data
- transcription
- labeling text for disambiguation, expansion, and text normalization
- providing phonetic transcription of lexicon entries according to given standards and using in-house tools

Job requirements:
- native-level speaker of French (with good command of the standard dialect) and fluent in English
- passion for language with good knowledge of orthography and grammar in the target language
- a degree in a language-related field such as linguistics, language teaching, translation, editing, writing, proofreading, or similar
- keen interest in technology and computer-literate (should feel comfortable using in-house tools and should have an interest in current speech, mobile and online technology)
- attention to detail and good organizational skills

Project duration: 6-11 months (with potential for extension)

**This is not a permanent position but a contract position.**

For immediate consideration, please email your CV and cover letter in English (PDF format preferred) with "Speech Data Evaluator [French]" in the subject line.
Email Address for applications: DataOpsMan at gmail.com
Contact information: Ara Kim
Closing date: open until filled

From hamon at LIMSI.FR Tue Feb 18 21:36:13 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:36:13 +0100
Subject: Appel: IC 2014, Posters et demonstrations
Message-ID:

Date: Mon, 17 Feb 2014 17:06:24 +0100
From: Catherine Faron Zucker
Message-ID: <53023380.10006 at i3s.unice.fr>
X-url: http://www.irit.fr/IC2014

*Call for submissions for the Posters and Demonstrations session of the*
25th Journées Francophones d'Ingénierie des Connaissances:
/Pervasive knowledge: users immersed in knowledge-based systems and knowledge-based systems immersed in the world/
http://www.irit.fr/IC2014
12-16 May 2014, Clermont-Ferrand
------------------------------------------------------------------------
*Important dates:*
Submission deadline: Friday 28 February 2014
Notification to authors: Monday 17 March 2014
------------------------------------------------------------------------

The Ingénierie des Connaissances (knowledge engineering) conference celebrates its 25th anniversary this year! It is the annual meeting of the French and French-speaking community interested in knowledge engineering issues.
Academics and industry practitioners meet there to exchange ideas and reflect on research problems arising in knowledge acquisition, representation and management.

The rise of information and communication science and technology, and of web technologies in particular, throughout society is changing individual and collective practices. Knowledge engineering accompanies this evolution by inventing the models, methods and tools that allow knowledge, and reasoning over that knowledge, to be integrated into computing environments. As a result, knowledge is now omnipresent, while research communities in artificial intelligence are opening up to one another to address the problems that arise. Thus, building knowledge-based systems draws on knowledge engineering, natural language processing, machine learning and data mining; knowledge processing combines knowledge engineering with information retrieval, operations research and data analysis; modelling computing environments and their human or software actors combines knowledge engineering with multi-agent systems; finally, knowledge engineering touches on classic themes of the human-computer interaction community and of the community working on computer environments for human learning, in order to place the user at the centre of systems.

By placing the user at the heart of computing systems, knowledge engineering is called upon to handle the mass of data available on the web while helping the user to visualise, contextualise, explain, recommend, classify, predict, approximate, secure and make reliable these data, so as to compensate for their imprecision or even inconsistency.
------------------------------------------------------------------------
*Submissions*

/*Posters:*/ Posters concern work in progress or work at the stage of preliminary results, in research or applications. They are intended to stimulate discussion within the community so as to best support the start of the work presented. They will be displayed in a discussion area.

/*Demonstrations:*/ One or more demonstration sessions will be organised so that participants can present their developments in a discussion area. These may be mock-ups, prototypes or finished tools, as well as industrial and operational applications. Demonstration proposals may complement a paper.

The same results may be submitted as both a poster AND a demonstration. A poster and/or demonstration submission may also complement a (long or short) paper accepted at the conference or a presentation in the special session "Logiciels, Projets, Industrie" (Software, Projects, Industry).

Texts accompanying demonstrations and posters must not exceed 3 pages and must follow the document styles provided for the conference: http://www.irit.fr/IC2014. Authors are invited to submit their contributions in PDF format on EasyChair: https://www.easychair.org/conferences/?conf=ic2014. The papers for accepted posters and demonstrations will be included in the IC 2014 proceedings. Accepted posters are to be presented in portrait A0 format.
------------------------------------------------------------------------
The Ingénierie des Connaissances conference is held under the aegis of the IC college of AFIA
------------------------------------------------------------------------
*Programme committee chair:* Catherine Faron Zucker (I3S, Université Nice Sophia Antipolis)
*Organising committee chair:* Catherine Roussey (IRSTEA)

From hamon at LIMSI.FR Tue Feb 18 21:59:02 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:59:02 +0100
Subject: Appel: Computerm 2014, COLING 2014 Workshop
Message-ID:

Date: Tue, 18 Feb 2014 22:56:44 +0100
From: Thierry Hamon
Message-ID: <87ppmk84pv.fsf at smtp-out.limsi.fr>
X-url: http://perso.limsi.fr/hamon/Computerm2014/

4th International Workshop on Computational Terminology
CompuTerm 2014
COLING 2014 Workshop
23rd or 24th August 2014, Dublin, Ireland
http://perso.limsi.fr/hamon/Computerm2014/

Submission deadline: 2nd May 2014

Computational Terminology covers an increasingly important aspect in Natural Language Processing areas such as text mining, information retrieval, information extraction, summarisation, textual entailment, document management systems, question-answering systems, ontology building, etc. Terminological information is paramount for knowledge mining from texts for scientific discovery and competitive intelligence.
Scientific needs in fast-growing domains (such as biomedicine, chemistry and ecology) and the overwhelming amount of textual data published daily demand that terminology be acquired and managed systematically and automatically, while in well-established domains (such as law, economy, banking and music) the demand is for fine-grained analyses of documents for knowledge description and acquisition. Moreover, capturing new concepts leads to the acquisition and management of new knowledge.

The aim of this fourth CompuTerm workshop is to bring together Natural Language Processing researchers to discuss recent advances in computational terminology and its impact on many NLP applications. The topics addressed in this workshop are wide-ranging:

- term extraction, recognition and filtering, which is the core of terminological activity and lays the basis for other terminological topics and tasks;

- event recognition and extraction, which extends the notion of the terminological entity from terms denoting static units to terms denoting procedural and dynamic processes;

- acquisition of semantic relations among terms, which is also an important research topic, as it finds applications such as the population and update of existing knowledge bases, the definition of domain-specific templates in information extraction and the disambiguation of terms;

- term variation management, which helps to deal with the dynamic nature of terms, their acquisition from heterogeneous sources, and their integration, standardisation and representation for a large range of applications and resources, is also increasingly important, as one has to address this research problem when working with various controlled vocabularies, thesauri, ontologies and textual data. Term variation is also related to paraphrases and reformulations of terms, due to historical, regional, local or personal issues.
Besides, the discovery of synonymous terms or term clusters is equally beneficial to many NLP applications;

- definition acquisition, which covers important research and aims to provide precise and unambiguous descriptions of terminological entities. Such definitions may contain elements necessary for the formal description of terms and concepts within ontologies;

- consideration of user expertise, which is becoming a new issue in terminological activity, takes into account the fact that specialized domains contain notions and terms that are often not understandable to non-experts or laymen (such as patients in the medical area, or bank clients in the banking and economy areas). This aspect, although related to specialized areas, provides a direct link between specialized languages and general language;

- systematic terminology management and updating of domain-specific dictionaries and thesauri, which are important aspects of maintaining existing terminological resources. These aspects become crucial because the number of existing terminological resources is constantly increasing and because their lasting and efficient use depends on their maintenance and updating, while their re-acquisition is costly and often non-reproducible;

- monolingual and multilingual resources, which open the possibility of developing cross-lingual and multilingual applications, require specific corpora, methods and tools whose design and evaluation are challenging issues;

- robustness and portability of methods, which makes it possible to apply methods developed in one given context to other contexts (corpora, domains, languages, etc.)
and to share the research expertise among them;

- social networks and modern media processing, which attracts an increasing number of researchers and provides challenging material to be processed;

- use of terminologies in various NLP applications, as they are a necessary component of any NLP system dealing with domain-specific literature, is another novel and challenging research direction.

The workshop is open to submissions on different approaches, ranging from term extraction in various languages (using verb co-occurrence, information-theoretic approaches, machine learning, etc.) and terminology-based extraction of translation pairs from bilingual corpora, up to semantically oriented approaches and theoretical aspects of terminology.

Besides, experiments on the evaluation of terminological methods and tools are also encouraged, since they provide interesting and useful evidence of the utility of terminological resources:

- direct evaluation may concern the efficiency of terminological methods and tools in capturing terminological entities and relations, as well as various kinds of related information;

- indirect evaluation may concern the use of terminological resources in various NLP applications and the impact these resources have on the performance of automatic systems. In this case, research and competition tracks (such as TREC, BioCreative, CLEF, CLEF-eHealth, I2B2, *SEM, and other shared tasks) provide particularly fruitful evaluation contexts and have proved very successful in identifying key problems in terminology such as term variation and ambiguity.

We encourage authors to submit their research work related to the various aspects of computational terminology mentioned in this call.

Workshop authors will be invited to submit an extended version of their work to a special issue of an international journal or to a book collection.
Important dates:

- 1st workshop CFP: 17th February 2014
- Paper due date: 2nd May 2014
- Notification of acceptance: 6th June 2014
- Camera-ready deadline: 27th June 2014
- Workshop: 23rd or 24th August 2014

Submission instructions:

Authors are invited to submit full papers on original, unpublished work in the topic area of this workshop. Submissions should follow the COLING 2014 instructions for authors (http://www.coling-2014.org/call-for-papers.php) and be formatted using the COLING 2014 style files for LaTeX, MS Word or LibreOffice (http://www.coling-2014.org/doc/coling2014.zip), anonymised for blind review and not exceeding 8 pages plus two extra pages for references.

The PDF files are to be submitted electronically at https://www.softconf.com/coling2014/WS-9/

Organisers:

- Patrick Drouin, Observatoire de linguistique Sens-Texte, Université de Montréal, Montréal, Canada
- Natalia Grabar, CNRS UMR 8163 STL, Université Lille 1&3, Villeneuve d'Ascq
- Thierry Hamon, LIMSI-CNRS, Orsay, France & Université Paris 13, Sorbonne Paris Cité, Villetaneuse, France
- Kyo Kageura, Library and Information Science Laboratory, University of Tokyo, Tokyo, Japan

Program Committee

- Sophia Ananiadou, National Centre for Text Mining, UK
- Olivier Bodenreider, NLM, USA
- Beatrice Daille, IRIN, France
- Éric Gaussier, LIG, Université
Joseph Fourier, France
- Gregory Grefenstette, Clairvoyance Corp, France
- Marie-Claude L'Homme, University of Montréal, Canada
- Philippe Langlais, RALI, Canada
- John McNaught, UMIST & National Centre for Text Mining, UK
- Rogelio Nazar, University Pompeu Fabra, Spain
- Goran Nenadic, UMIST & National Centre for Text Mining, UK
- Jorge Vivaldi Palatresi, University Pompeu Fabra, Spain
- Selja Seppälä, University at Buffalo, USA
- Karine Verspoor, NICTA, Australia
- Pierre Zweigenbaum, LIMSI, France

From hamon at LIMSI.FR Tue Feb 18 21:36:50 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:36:50 +0100
Subject: Appel: Organisation de la Plate-forme IA de l'AFIA 2015 (date limite reponse 28 mars 2014)
Message-ID:

Date: Mon, 17 Feb 2014 23:19:35 +0100
From: Secretaire AFIA
Message-Id: <87A5A11D-FEFD-4CB4-AC31-63CAF45F1382 at afia.asso.fr>
X-url: http://www.afia.asso.fr/tiki-download_file.php?fileId=146

--- Please forward this message to anyone who may be interested ---

AFIA is looking for an organiser for its 2015 Plate-forme (period: between 29 June and 10 July; maximum registration fee: 300 euros for a 5-day event; hosting of up to 4 parallel conferences; expected number of participants: 300).
If you are interested in organising such an event, please contact the AFIA board (bureau AT afia.asso.fr) for further specifications; the deadline for a consolidated reply is 28 March 2014.

For more information, see: http://www.afia.asso.fr/tiki-download_file.php?fileId=146

The AFIA Board of Directors

From hamon at LIMSI.FR Tue Feb 18 21:47:46 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:47:46 +0100
Subject: Conf: Online Registration to LREC 2014 now open!
Message-ID:

Date: Tue, 18 Feb 2014 17:11:38 +0100
From: ELRA ELDA Information
Message-ID: <5303863A.4070703 at elda.org>
X-url: http://lrec2014.lrec-conf.org/en/registration/
X-url: http://www.lrec-conf.org/lrec2014

[Apologies for multiple postings]

Online registration for the main conference, the workshops and the tutorials is now open at http://lrec2014.lrec-conf.org/en/registration/.
Contact: registration at lrec-conf.org
www.lrec-conf.org/lrec2014
Follow us on Twitter: @LREC2014 (https://twitter.com/LREC2014)

From hamon at LIMSI.FR Tue Feb 18 21:41:53 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:41:53 +0100
Subject: Appel: Special Track Computational Intelligence (ACM-MEDES 2014)
Message-ID:

Date: Mon, 17 Feb 2014 18:12:18 -0500
From: "Ismail Biskri"
Message-ID: <009501cf2c35$b43f7460$1cbe5d20$@Biskri at uqtr.ca>
X-url: http://sigappfr.acm.org/MEDES/14/index.php?option=com_content&view=article&id=23&Itemid=24

http://sigappfr.acm.org/MEDES/14/index.php?option=com_content&view=article&id=23&Itemid=24

Description

What we call here Computational Intelligence is the study of data mining, information extraction, knowledge processing, language processing, discourse processing, and any area that relates to the intelligent processing of language, information and knowledge. Until recently, work in this area was carried out by experts in computer science or artificial intelligence. In fact, there were many computer applications in Computational Intelligence without any theoretical support. Today, people working in such areas often work as members of pluridisciplinary and interdisciplinary teams, including, of course, computer scientists and experts in artificial intelligence but also mathematicians, statisticians, linguists, language experts, philosophers, logicians, cognitive scientists, neuroscientists, psycholinguists, anthropologists, etc.

Therefore, Computational Intelligence must become more connected to areas such as cognitive science, mathematics, statistics, linguistics, philosophy, logic, neuroscience, etc. through the development of methods, strategies and algorithms for all aspects of language processing, discourse processing, knowledge processing, information processing, etc. These methods and strategies can be assessed according to their theoretical or practical significance.
The aim of this track is to present original work related to computational intelligence. This work should focus either on practical applications or on the theoretical foundations of the discipline. We therefore invite authors to submit original papers on any work related to theoretical or practical aspects of Computational Intelligence.

Topics of Interest

Submissions are invited in, but not limited to, the following areas:

* Philosophy of language and discourse
* Cognitive science
* Non standard logics
* Natural language processing
* Computational linguistics (lexicology; morphology; syntax; semantics)
* Information extraction
* Ontologies
* Knowledge processing
* Data Mining, Text Mining, Web Mining
* Information retrieval
* Web Semantics

Paper Submission

Papers should be submitted in electronic form (PDF) using EasyChair (https://www.easychair.org/conferences/?conf=medes2014). Papers may be up to 6 pages long and must be formatted according to the ACM proceedings format, which can be found here: http://www.acm.org/sigs/publications/proceedings-templates. Papers that fail to comply with the length limit will be rejected.

Each submission will be reviewed by at least 3 peer reviewers. After the preliminary notification date, authors may rebut reviewers' questions and comments with evidence and arguments. Based on the rebuttal, reviewers notify authors of the final decision. Selection criteria will include: relevance, significance, impact, originality, technical soundness, and quality of presentation. Preference will be given to submissions that take strong or challenging positions on important emergent topics related to Digital Ecosystems. At least one author should attend the conference to present the paper. The conference proceedings will be published by ACM and indexed by the ACM Digital Library and DBLP.

Important dates

* Full Papers Submission: May 12, 2014
* Notification of Paper Acceptance: July 13, 2014
* Camera-ready papers: August 07, 2014
* Special Track days: September 15-17, 2014

Special Track chairs

Ismaïl Biskri, Université du Québec à Trois-Rivières, Canada, ismail.biskri at uqtr.ca
Adel Jebali, University of Concordia, Canada, adel.jebali at concordia.ca

Programme Committee (to be completed)

Ismail Biskri, Université de Québec à Trois Rivières, Canada
Jean-Pierre Desclés, Université de Paris-Sorbonne, France
Al Moatasem Alrahabi, Université Paris-Sorbonne Abu-Dhabi
Rim Faiz, IHEC de Carthage, Université du 7 novembre à Carthage, Tunisia
Adel Jebali, University Concordia, Montreal, Canada
Jean-Guy Meunier, Université de Québec à Montréal, Canada
Anca Pascu, Université de Bretagne Occidentale, France
Christophe Jouis, Université Paris 6, France
Malek Mouhoub, University of Regina, Canada
Juan Manuel Torres Moreno, Université d'Avignon, France
Agata Jackiewicz, Université Paris-Sorbonne, France
Marc Bertin, Université de Québec à Montréal, Canada
Iana Atanassova, University Concordia, Canada
Florence Le Priol, Université Paris-Sorbonne, France
Sonia El Euch, Université du Québec à Trois-Rivières, Canada
Maria Antonietta Pinto, Universita di Roma, Italy
Roger Nkambou, Université du Québec à Montréal, Canada
Daniel Vanderveken, Université du Québec à Montréal, Canada
Candida De Sousa Melo, Universidade Federal do Rio Grande do Norte, Brazil

From hamon at LIMSI.FR Tue Feb 18 21:48:48 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:48:48 +0100
Subject: Info: Prix de la these ATALA 2014
Message-ID:

Date: Tue, 18 Feb 2014 17:15:57 +0100
From: pap
Message-ID: <5303873D.9000903 at limsi.fr>
X-url: http://www.atala.org/_Prix-de-la-these_

ATALA Thesis Prize 2014

Since 2011, ATALA (Association pour le Traitement Automatique des Langues) has awarded a thesis prize to recognise excellent French-language work in natural language processing (NLP). The prize reflects ATALA's wish to support young French-speaking researchers and to help disseminate their work to the whole NLP community. Unlike in previous years, the 2014 prize carries no monetary award.

Eligible candidates are all PhD holders who defended their doctoral thesis at a French-speaking university between 1 September 2011 and 31 December 2013 and who have never previously applied for the ATALA thesis prize.

A jury of about fifteen researchers and faculty members covering all NLP research areas will examine the applications and select the winner.

The winner will receive the prize during the TALN 2014 conference, to be held at the Faculté Saint-Charles in Marseille from 1 to 4 July 2014, and will present his or her work there in a plenary session.

Application deadline: 15 March 2014.

The application consists of a CV (2 pages), the pre-defence and defence reports, and the thesis.

Steps
- Fill in the form at http://www.atala.org/_Prix-de-la-these_
- Send an archive containing the CV, the reports and the thesis (or a URL from which the documents can be retrieved) to
: pap at limsi.fr
- Results announced in early May 2014
- Prize awarded at TALN 2014

Jury (being assembled), co-chaired by: Frédérique Segond (Groupe Viseo), Patrick Paroubek (LIMSI-CNRS) and Charles Teissèdre (Université Paris-Sorbonne).

From hamon at LIMSI.FR Tue Feb 18 21:30:17 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:30:17 +0100
Subject: These: Dhouha Bouamor, Constitution de ressources linguistiques multilingues
Message-ID:

Date: Sun, 16 Feb 2014 13:54:56 +0000
From: SEMMAR Nasredine 202247
Message-ID: <50CC12289F0C854FBFF134F270030CD71C49B57B at EXDAG0-B2.intra.cea.fr>

Hello,

I am pleased to invite you to the defence of my thesis, carried out at the Laboratoire Vision et Ingénierie des Contenus (LVIC) and the Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI), entitled: "Constitution de ressources linguistiques multilingues à partir de corpus de textes parallèles et comparables".

The defence will take place on Friday 21 February 2014 at 14:00 at LIMSI. You are also invited to the reception that follows.

Best regards,
Dhouha Bouamor

************************************************************************
The thesis committee will consist of:

Mr Reinhard Rapp, Professor at the University of Mainz, reviewer
Mr Eric Gaussier, Professor at Université Joseph Fourier, reviewer
Mr Philippe Langlais, Professor at Université de Montréal, examiner
Mr François Yvon, Professor at Université Paris-Sud, examiner
Mr Pierre Zweigenbaum, Senior Researcher at LIMSI-CNRS, thesis supervisor
Mr Nasredine Semmar, Researcher at LVIC - CEA LIST, co-supervisor
************************************************************************

Summary: Bilingual lexicons are particularly useful resources for machine translation and cross-lingual information retrieval. Building them by hand requires strong expertise in both languages concerned and is a costly process. Several automatic methods have been proposed as an alternative, but they are available for only a limited number of languages and their performance still falls well short of the quality of manual translations. Our work addresses the extraction of such bilingual lexicons from corpora of parallel and comparable texts. Starting from parallel corpora, we present an approach to handling multiword expressions, from their automatic acquisition to their integration into a statistical machine translation system. For comparable corpora, we propose two new approaches aimed at extracting bilingual lexicons specialised in the domains of corporate finance, breast cancer, wind energy and mobile technology. The first approach extends the distributional approach with a word sense disambiguation process. The second approach relies on Wikipedia and explicit semantic analysis. Its originality lies in the fact that, instead of using the space of a corpus's words to represent the words to be translated, it represents them in the space of Wikipedia article titles.
These two approaches compare favourably with existing methods in most configurations.

************************************************************************
Abstract: Bilingual lexicons are central components of machine translation and cross-lingual information retrieval systems. Their manual construction requires extensive expertise in both languages involved and is a costly process. Several automatic methods have been proposed as an alternative, but they often rely on resources available in a limited number of languages and their performance is still far behind the quality of manual translations. Our work concerns bilingual lexicon extraction from multilingual parallel and comparable corpora. Based on parallel corpora, we present an approach that focuses on processing multiword expressions, ranging from their automatic acquisition to their integration into a statistical machine translation system. For comparable corpora, we propose two new approaches that aim at extracting bilingual lexicons specialized in the corporate finance, breast cancer, wind energy and mobile technology domains. The first approach augments the distributional approach by proposing a word sense disambiguation process that keeps only the words that are more likely to give the best representation of a word to be translated. Our second approach is based on Wikipedia and its explicit semantic analysis. The main originality of this approach is in the way words are represented: instead of representing the words in a corpus word space, they are represented in the space of Wikipedia titles. The newly introduced approaches compare favorably to existing methods in almost all configurations.

Keywords: bilingual lexicon extraction, parallel corpus, comparable corpus, alignment, statistical machine translation.
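For readers unfamiliar with the distributional approach that the abstract above takes as its starting point, the following is a minimal Python sketch of the textbook baseline, not the method of the thesis. It assumes that each word is described by counts of its neighbouring words, that source-language context counts are carried into the target language through a small seed dictionary, and that candidates are ranked by cosine similarity. The function names, the window size and the toy corpora are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def context_vectors(sentences, window=2):
    """Map each word to counts of the words seen within `window` positions of it."""
    vectors = {}
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            ctx = vectors.setdefault(word, Counter())
            for j in range(lo, hi):
                if j != i:
                    ctx[tokens[j]] += 1
    return vectors


def project(vector, seed_dict):
    """Carry a source-language context vector into the target language.

    Context words missing from the seed dictionary are simply dropped.
    """
    projected = Counter()
    for word, count in vector.items():
        for translation in seed_dict.get(word, ()):
            projected[translation] += count
    return projected


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


def rank_translations(word, src_vectors, tgt_vectors, seed_dict):
    """Return target-language words sorted from most to least similar context."""
    query = project(src_vectors.get(word, Counter()), seed_dict)
    scores = [(cosine(query, vec), cand) for cand, vec in tgt_vectors.items()]
    return [cand for _, cand in sorted(scores, key=lambda p: (-p[0], p[1]))]


# Invented toy "comparable corpora": same topic, not translations of each other.
french = [
    ["la", "turbine", "produit", "énergie"],
    ["le", "vent", "entraîne", "la", "turbine"],
]
english = [
    ["the", "turbine", "produces", "energy"],
    ["the", "wind", "drives", "the", "turbine"],
    ["the", "bank", "lends", "money"],
]
# Hypothetical seed dictionary; note that "turbine" itself is not in it.
seed = {
    "la": ["the"], "le": ["the"], "produit": ["produces"],
    "énergie": ["energy"], "vent": ["wind"], "entraîne": ["drives"],
}

src = context_vectors(french)
tgt = context_vectors(english)
ranking = rank_translations("turbine", src, tgt, seed)
```

With these toy inputs, `ranking` lists the English words in order of contextual similarity to the French word "turbine". The two refinements described in the abstract (filtering context words by word sense disambiguation, and representing words in the space of Wikipedia titles) would replace `project` and `context_vectors` respectively; neither is attempted here.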
From hamon at LIMSI.FR Tue Feb 18 21:06:14 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:06:14 +0100
Subject: Appel: 7e Atelier BUCC a LREC, extension au 23/2 et numero special de JNLE
Message-ID:

Date: Fri, 14 Feb 2014 22:41:02 +0100
From: Pierre Zweigenbaum
Message-ID: <52FE8D6E.8050403 at limsi.fr>
X-url: http://comparable.limsi.fr/bucc2014/

Apologies for multiple postings --- please forward to interested colleagues

============================================================
DEADLINE EXTENSION AND JOURNAL SPECIAL ISSUE
============================================================

7th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA

Building Resources for Machine Translation Research

http://comparable.limsi.fr/bucc2014/

May 27, 2014
Co-located with LREC 2014
Harpa Conference Centre, Reykjavik (Iceland)

EXTENDED DEADLINE FOR PAPERS: February 23, 2014
https://www.softconf.com/lrec2014/BUCC2014/

*** INVITED SPEAKER ***
Chris Callison-Burch (University of Pennsylvania)
============================================================

MOTIVATION

In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons.
In language engineering, on the one hand, it is chiefly motivated by the need to use comparable corpora as training data for statistical Natural Language Processing applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora. The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages. Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. 
However, beyond a few language pairs such as English-French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web.

To exemplify such issues in a practical setting, this year's special focus will be on

Building Resources for Machine Translation Research

This special topic aims to address the need for: (1) Machine Translation training and testing data such as spoken or written monolingual, comparable or parallel data collections, and (2) methods and tools used for collecting, annotating, and verifying MT data such as Web crawling, crowdsourcing, tools for language experts and for finding MT data in comparable corpora.

TOPICS

We solicit contributions including but not limited to the following topics:

Topics related to the special theme:

* Methods and tools for collecting and processing MT data, including crowdsourcing
* Methods and tools for quality control
* Tools for efficient annotation
* Bilingual term and named entity collections
* Multilingual treebanks, wordnets, propbanks, etc.
* Comparable corpora with parallel units annotated
* Comparable corpora for under-resourced languages and specific domains
* Multilingual corpora with rich annotations: POS tags, NEs, dependencies, semantic roles, etc.
* Data for special applications: patent translation, movie subtitles, MOOCs, meetings, chat-rooms, social media, etc.
* Legal issues with collecting and redistributing data and generating derivatives

Building comparable corpora:

* Human translations
* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the Web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages, across language families
* Multi-media/multi-modal comparable corpora

Applications of comparable corpora:

* Human translations
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual projections
* Machine translation
* Writing assistance

Mining from comparable corpora:

* Extraction of parallel segments or paraphrases from comparable corpora
* Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc.

IMPORTANT DATES

February 23, 2014   Deadline for submission of papers (extended)
March 10, 2014      Notification of acceptance
March 27, 2014      Camera-ready papers due
May 27, 2014        Workshop date

SUBMISSION INFORMATION

Papers should follow the LREC main conference formatting details (to be announced on the conference website http://lrec2014.lrec-conf.org/en/ ) and should be submitted as a PDF-file via the START workshop manager at https://www.softconf.com/lrec2014/BUCC2014/

Contributions can be short or long papers. Short paper submissions must describe original and unpublished work without exceeding six (6) pages. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding ten (10) pages.

Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings.
Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones, etc.

JOURNAL SPECIAL ISSUE

Authors of selected papers will be encouraged to submit substantially extended versions of their manuscripts to an upcoming special issue on "Machine Translation Using Comparable Corpora" of the Journal of Natural Language Engineering.

ORGANISERS

Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France)
Ahmet Aker, University of Sheffield (UK)
Serge Sharoff, University of Leeds (UK)
Stephan Vogel, QCRI (Qatar)
Reinhard Rapp, Universities of Mainz (Germany) and Aix-Marseille (France)

CONTACT

Pierre Zweigenbaum: pz (at) limsi (dot) fr

SCIENTIFIC COMMITTEE

* Ahmet Aker (University of Sheffield, UK)
* Srinivas Bangalore (AT&T Labs, US)
* Caroline Barrière (CRIM, Montréal, Canada)
* Chris Biemann (TU Darmstadt, Germany)
* Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
* Kurt Eberle (Lingenio, Heidelberg, Germany)
* Andreas Eisele (European Commission, Luxembourg)
* Éric Gaussier (Université Joseph Fourier, Grenoble, France)
* Gregory Grefenstette (INRIA, Saclay, France)
* Silvia Hansen-Schirra (University of Mainz, Germany)
* Hitoshi Isahara (Toyohashi University of Technology)
* Kyo Kageura (University of Tokyo, Japan)
* Adam Kilgarriff (Lexical Computing Ltd, UK)
* Natalie Kübler (Université Paris Diderot, France)
* Philippe Langlais (Université de Montréal, Canada)
* Michael Mohler (Language Computer Corp., US)
* Emmanuel Morin (Université de Nantes, France)
* Dragos Stefan Munteanu (Language Weaver, Inc., US)
* Lene Offersgaard (University of Copenhagen, Denmark)
* Ted Pedersen (University of Minnesota, Duluth, US)
* Reinhard Rapp (Université Aix-Marseille, France)
* Sujith Ravi (Google, Mountain View, US)
* Serge Sharoff (University of Leeds, UK)
* Michel Simard (National Research Council Canada)
* Richard Sproat (OGI School of Science & Technology, US)
* Tim Van de Cruys (IRIT-CNRS, Toulouse, France)
* Stephan Vogel (QCRI, Qatar)
* Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France)
* Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)

From hamon at LIMSI.FR Tue Feb 18 21:11:05 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:11:05 +0100
Subject: Job: Resident Academic Post in Human Language Technology, Malta
Message-ID:

Date: Sat, 15 Feb 2014 11:06:40 +0100
From: François Portet
Message-ID: <52FF3C30.9070000 at imag.fr>
X-url: http://www.um.edu.mt/hrmd/vacancies
X-url: http://www.um.edu.mt/__data/assets/pdf_file/0004/204259/Post_in_Human_Language_Technology.pdf

The University of Malta has just announced a resident academic post in Human Language Technology within the Institute of Linguistics. This is a tenured position.
Applications will be received until February 28th, 2014.

This call is associated with the recently developed B.Sc. and M.Sc. programme in Computational Linguistics and Human Language Technology, and is part of an ongoing effort to strengthen the research and teaching in this area within the Institute.

Full details are available from the following URL: http://www.um.edu.mt/hrmd/vacancies (see: Resident Academic Post in Human Language Technology, http://www.um.edu.mt/__data/assets/pdf_file/0004/204259/Post_in_Human_Language_Technology.pdf).

Enquiries can be directed to Albert Gatt (albert.gatt at um.edu.mt) or the Institute of Linguistics (linguistics at um.edu.mt).

From hamon at LIMSI.FR Tue Feb 18 21:18:10 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:18:10 +0100
Subject: Conf: LATA 2014, Madrid, Spain, March 10-14, 2014
Message-ID:

Date: Sun, 16 Feb 2014 11:59:41 +0100
From: "GRLMC"
Message-ID:
X-url: http://grammars.grlmc.com/lata2014/

************************************************************************
8th INTERNATIONAL CONFERENCE ON LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS

LATA 2014

Madrid, Spain
March 10-14, 2014

http://grammars.grlmc.com/lata2014/
************************************************************************

PROGRAM

Monday, March 10:

9:30 - 10:30 Registration

10:30 - 10:40 Opening

10:40 - 11:30 Helmut Seidl: Interprocedural Information Flow Analysis of XML Processors - Invited Lecture

11:30 - 11:45 Break

11:45 - 13:00
Alberto Dennunzio, Enrico Formenti, Luca Manzoni: Extremal Combinatorics of Reaction Systems
Fernando Arroyo, Sandra Gómez Canaval, Victor Mitrana, Ștefan Popescu: Networks of Polarized Evolutionary Processors are Computationally Complete
Liang Ding, Abdul Samad, Xingran Xue, Xiuzhen Huang, Russell L. Malmberg, Liming Cai: Stochastic k-Tree Grammar and its Application in Biomolecular Structure Modeling

13:00 - 14:30 Lunch

14:30 - 15:45
Béatrice Bérard, Olivier Carton: Channel Synthesis Revisited
Daniel Průša: Weight-reducing Hennie Machines and Their Descriptional Complexity
Manfred Droste, Stefan Dück: Weighted Automata and Logics for Infinite Nested Words

15:45 - 16:00 Break

16:00 - 17:15
Friedrich Otto, František Mráz: Extended Two-Way Ordered Restarting Automata for Picture Languages
Sang-Ki Ko, Yo-Sub Han, Kai Salomaa: Top-Down Tree Edit-Distance of Regular Tree Languages
Bertram Felgenhauer, René Thiemann: Reachability Analysis with State-Compatible Automata

17:15 - 17:45 Coffee Break

17:45 - 18:35 Leslie A. Goldberg: The Complexity of Approximate Counting - Invited Lecture

Tuesday, March 11:

9:00 - 9:50 Sanjeev Khanna: Matchings, Random Walks, and Sampling - Invited Lecture

9:50 - 10:00 Break

10:00 - 11:15
Niko Beerenwinkel, Stefano Beretta, Paola Bonizzoni, Riccardo Dondi, Yuri Pirola: Covering Pairs in Directed Acyclic Graphs
Eike Best, Raymond Devillers: Characterisation of the State Spaces of Live and Bounded Marked Graph Petri Nets
María Martos-Salgado, Fernando Rosa-Velardo: Expressiveness of Dynamic Networks of Timed Petri Nets

11:15 - 11:45 Coffee Break

11:45 - 13:00
Bireswar Das, Patrick Scharpfenecker, Jacobo Torán: Succinct Encodings of Graph Isomorphism
Matthias Gallé, Matías Tealdi: On Context-Diverse Repeats and their Incremental Computation
Marcella Anselmo, Dora Giammarresi, Maria Madonia: Picture Codes with Finite Deciphering Delay

13:00 - 14:30 Lunch

14:30 - 15:45
Paul Tarau: Computing with Catalan Families
Joan Boyar, Shahin Kamali, Kim S. Larsen, Alejandro López-Ortiz: On the List Update Problem with Advice
Rob Gysel: Minimal Triangulation Algorithms for Perfect Phylogeny Problems

15:45 - 16:00 Break

16:00 - 17:15
Grégoire Laurence, Aurélien Lemay, Joachim Niehren, Sławek Staworko, Marc Tommasi: Learning Sequential Tree-to-word Transducers
Dariusz Kalociński: On Computability and Learnability of the Pumping Lemma Function
Slimane Bellaouar, Hadda Cherroun, Djelloul Ziadi: Efficient List-based Computation of the String Subsequence Kernel

Wednesday, March 12:

9:00 - 9:50 Oscar H. Ibarra: On the Parikh Membership Problem for FAs, PDAs, and CMs - Invited Lecture

9:50 - 10:00 Break

10:00 - 11:15
Marius Konitzer, Hans Ulrich Simon: DFA with a Bounded Activity Level
Shenggen Zheng, Jozef Gruska, Daowen Qiu: On the State Complexity of Semi-Quantum Finite Automata
Vojtěch Vorel: Complexity of a Problem Concerning Reset Words for Eulerian Binary Automata

11:15 - 11:45 Coffee Break

11:45 - 13:00
Pascal Caron, Marianne Flouret, Ludovic Mignot: (k,l)-Unambiguity and Quasi-Deterministic Structures: an Alternative for the Determinization
Zuzana Bednárová, Viliam Geffert: Two Double-Exponential Gaps for Automata with a Limited Pushdown
Parosh Aziz Abdulla, Mohamed Faouzi Atig, Jari Stenman: Computing Optimal Reachability Costs in Priced Dense-Timed Pushdown Automata

13:00 - 14:30 Lunch

15:45 - 17:45 Sightseeing in Madrid by Bus

Thursday, March 13:

9:00 - 9:50 Javier Esparza: A Brief History of Strahler Numbers (I) - Invited Tutorial

9:50 - 10:00 Break

10:00 - 11:15
Zeinab Mazadi, Ziyuan Gao, Sandra Zilles: Distinguishing Pattern Languages with Membership Examples
Francine Blanchet-Sadri, Andrew Lohr, Sean Simmons, Brent Woodhouse: Computing Depths of Patterns
Alexandre Blondin Massé, Sébastien Gaboury, Sylvain Hallé, Michaël Larouche: Solving Equations on Words with Morphisms and Antimorphisms

11:15 - 11:45 Coffee Break

11:45 - 13:00
Anton Cerný: Solutions to the Multi-Dimensional Equal Powers Problem Constructed by Composition of Rectangular Morphisms
Enrico Formenti, Markus Holzer, Martin Kutrib, Julien Provillard: ω-rational Languages: High Complexity Classes vs. Borel Hierarchy
Thomas Weidner: Probabilistic ω-Regular Expressions

13:00 - 14:30 Lunch

14:30 - 15:45
Haizhou Li, François Pinet, Farouk Toumani: Probabilistic Simulation for Probabilistic Data-aware Business Processes
Daniel Bundala, Jakub Závodný: Optimal Sorting Networks
Etienne Dubourg, David Janin: Algebraic Tools for the Overlapping Tile Product

15:45 - 16:00 Break

16:00 - 17:15
Luca Breveglieri, Stefano Crespi Reghizzi, Angelo Morzenti: Shift-Reduce Parsers for Transition Networks
Gadi Aleksandrowicz, Andrei Asinowski, Gill Barequet, Ronnie Barequet: Formulae for Polyominoes on Twisted Cylinders
Alexandre Blondin Massé, Amadou Makhtar Tall, Hugo Tremblay: On the Arithmetics of Discrete Figures

Friday, March 14:

9:00 - 9:50 Javier Esparza: A Brief History of Strahler Numbers (II) - Invited Tutorial

9:50 - 10:00 Break

10:00 - 11:15
Hanna Klaudel, Maciej Koutny, Zhenhua Duan: Interval Temporal Logic Semantics of Box Algebra
Pierre Ganty, Ahmed Rezine: Ordered Counter-Abstraction: Refinable Subword Relations for Parameterized Verification
Matthew Gwynne, Oliver Kullmann: On SAT Representations of XOR Constraints

11:15 - 11:45 Coffee Break

11:45 - 13:00
Bernd Finkbeiner, Hazem Torfah: Counting Models of Linear-time Temporal Logic
Claudia Carapelle, Shiguang Feng, Oliver Fernández Gil, Karin Quaas: Satisfiability for MTL and TPTL over Non-Monotonic Data Words
Joachim Klein, David Müller, Christel Baier, Sascha Klüppelholz: Are Good-for-games Automata Good for Probabilistic Model Checking?

13:00 Closing

From hamon at LIMSI.FR Tue Feb 18 21:12:40 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:12:40 +0100
Subject: Appel: Special session Interspeech 2014, Speech technologies for Ambient Assisted Living
Message-ID:

Date: Mon, 17 Feb 2014 17:44:06 +0100
From: François Portet
Message-ID: <53023C56.6040902 at imag.fr>
X-url: http://www.interspeech2014.org/public.php?page=special_sessions.html#speech-technologies-ambient

Dear colleagues,

Please find below the announcement of the Interspeech 2014 special session: Speech technologies for Ambient Assisted Living.

Special session at Interspeech 2014
Submission deadline: 24th March 2014
Singapore, 14-18 September 2014
http://www.interspeech2014.org/public.php?page=special_sessions.html#speech-technologies-ambient

This special session focuses on the use of speech technologies for ambient assisted living, the creation of smart spaces and intelligent companions that can preserve independence and executive function, social communication and security of people with special needs. Currently, speech input for assistive technologies remains underutilized despite its potential to deliver highly informative data and serve as the primary means of interaction with the home. Speech interfaces could replace or augment obtrusive and sometimes outright inaccessible conventional computer interfaces.
Moreover, the smart home context can support speech communication by providing a number of concurrent information sources (e.g., wearable sensors, home automation sensors, etc.), enabling multimodal communication. In practice, its use remains limited due to challenging real-world conditions, and because conventional speech interfaces can have difficulty with the atypical speech of many users. This, in turn, can be attributed to the lack of abundant speech material, and the limited adaptation to the user of these systems. Taking up the challenges of this domain requires a multidisciplinary approach to define the user's needs, record corpora in realistic usage conditions, develop speech interfaces that are robust to both environment and user's characteristics and are able to adapt to specific users. This special session aims at bringing together researchers in speech and audio technologies with people from the ambient assisted living and assistive technologies communities to meet and foster awareness between members of either community, discuss problems, techniques and datasets, and perhaps initiate common projects. 
Topics of the session will include:
- Assistive speech technology
- Applications of speech technology (ASR, dialogue, synthesis) for ambient assisted living
- Understanding, modelling, or recognition of aged and atypical speech
- Multimodal speech recognition (context-aware ASR)
- Multimodal emotion recognition
- Audio scene and smart space context analysis
- Assessment of speech and language processing within the context of assistive technology
- Speech synthesis and speech recognition for physical or cognitive impairments
- Symbol languages, sign languages, nonverbal communication
- Speech and NLP applied to typing interface applications
- Language modelling for Augmentative and Alternative Communication text entry and speech generating devices
- Deployment of speech and NLP tools in the clinic or in the field
- Linguistic resources; corpora and annotation schemes
- Evaluation of systems and components.

Submission instructions:
Researchers who are interested in contributing to this special session are invited to submit a paper according to the regular submission procedure of INTERSPEECH 2014, and to select "Speech technologies for Ambient Assisted Living" in the special session field of the paper submission form. Please feel free to contact the organisers if you have any questions regarding the special session.

Organizers:
Michel Vacher, michel.vacher [at] imag.fr, Laboratoire d'Informatique de Grenoble
François Portet, francois.portet [at] imag.fr, Laboratoire d'Informatique de Grenoble
Frank Rudzicz, frank [at] cs.toronto.edu, University of Toronto
Jort F. Gemmeke, jgemmeke [at] amadana.nl, KU Leuven
Heidi Christensen, h.christensen [at] dcs.shef.ac.uk, University of Sheffield

From hamon at LIMSI.FR Tue Feb 18 21:13:09 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Tue, 18 Feb 2014 22:13:09 +0100
Subject: Job: Professeur en Informatique, Universite Aix Marseille
Message-ID:

Date: Sat, 15 Feb 2014 16:10:21 +0100
From: Alexis Nasr
Message-ID: <52FF835D.1070901 at lif.univ-mrs.fr>

A professorship in computer science (CNU section 27) will open for the next academic year at Aix-Marseille University, in the Laboratoire d'Informatique Fondamentale (CNRS UMR 7279). The successful candidate may either strengthen one of the laboratory's existing research themes by joining one of its seven teams, which include the Traitement Automatique du Langage Ecrit et Parlé team (written and spoken language processing), or bring in a new theme, provided it is compatible with the research carried out at LIF. A candidate's ability to enrich the technological dimension of the research conducted at LIF will be considered with interest: although most of this research falls within fundamental computer science, each team also develops more technology-oriented projects in collaboration with industrial partners, an orientation that the laboratory supports.
Research contacts:
jean-marc.tablot at lif.univ-mrs.fr (laboratory director)
alexis.nasr at lif.univ-mrs.fr (head of the TALEP team)
Teaching contact:
denis.lugiez at lif.univ-mrs.fr

From hamon at LIMSI.FR Sun Feb 23 10:15:10 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:15:10 +0100
Subject: Q: Modele acoustique, SRAP
Message-ID:

Date: Wed, 19 Feb 2014 14:59:41 +0100
From: Sahar G
Message-ID:

Hello everyone,

I am looking for information (course material) on acoustic models, in particular the training steps, the data required, and the techniques. I would appreciate your help.

Thank you :)

From hamon at LIMSI.FR Sun Feb 23 10:14:22 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:14:22 +0100
Subject: Job: Junior Linguists (German) for NLP tasks
Message-ID:

Date: Tue, 18 Feb 2014 23:44:39 +0100
From: Thomas Meyer
Message-ID: <-1243038235167481879 at unknownmsgid>

------------------------------------------------------------------------
Apologies for cross-posting
------------------------------------------------------------------------

Junior Linguists [German]

Job description:
As a Junior Linguist, you will be annotating and reviewing linguistic data in German. You will also contribute to a number of other tasks to improve natural language processing at Google. There are five full-time contract positions available, at 40 hours per week for a period of 12 months. You will be mostly working from home, but in close proximity to our office in *Munich* for training and subsequent intermediary meetings.
The tasks include:
- annotating and reviewing linguistic data
- labeling text with semantic, syntactic and morphological features
- annotating lexicon entries according to guidelines
- deriving NLP data for new and on-going projects

Requirements:
- native-level speaker of German and fluent in English
- degree in Linguistics, Computational Linguistics or similar, with knowledge of semantics, syntax, morphology and/or lexicography
- ability to quickly grasp technical concepts; learn in-house tools
- keen interest in technology and computer-literate
- should have an interest in natural language processing, linguistics, linguistic resources
- excellent oral and written communication skills
- attention to detail and good organizational skills
- ability to work independently from home with reliable internet connectivity

Project duration: 12 months

For immediate consideration, please e-mail your CV and a cover letter in English (in PDF format) with "Junior Linguists [German]" in the subject line to the following address:

E-mail address for applications: AdvancedLinguistics at gmail.com
Application deadline: open until filled
Contact information: Linne Ha

From hamon at LIMSI.FR Sun Feb 23 10:27:31 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:27:31 +0100
Subject: Seminaire: Olivier Cure, Traces Digitales, 6 mars 2014, Universite de Paris Est Marne la Vallee
Message-ID:

Date: Thu, 20 Feb 2014 19:39:19 +0100
From: Nicolas Turenne
Message-ID: <53064BD7.6010103 at yahoo.fr>
X-url: http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

Dear colleagues,

For the sixteenth session of the CorText group's 'traces digitales' (digital traces) seminar, we are pleased to welcome Olivier Curé (Université Paris-Est Marne-la-Vallée), who will talk about ontologies.

The session will take place on Thursday, 6 March, from 10:00 to 11:30, in meeting room C219 of the Latts (Université Paris-Est Marne-la-Vallée, rue Galilée, Bois de l'Etang building, 2nd floor, wing C).

See http://www.inra-ifris.org/axes-de-recherche-thematique/groupe-plateforme-cortext/seminaire-de-l-axe-traces-digitales-groupe-cortext.html

10:00-11:30
Olivier Curé (Maître de Conférences HDR, LIGM, Université Paris-Est Marne-la-Vallée)
Using ontologies in a medical application for the general public

Abstract: In this talk, I present recent work carried out during the development of a self-medication application. The application aims to inform the general public about over-the-counter medicines and the associated symptoms. It offers many features built around several ontologies that exploit Semantic Web technologies, e.g., querying and reasoning.
This presentation will highlight our contributions, in particular the development and enrichment of ontologies by induction, and support for a new class of integrity constraints that helps improve data quality.

From hamon at LIMSI.FR Sun Feb 23 10:32:24 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:32:24 +0100
Subject: Seminaire: Cours de Ted Briscoe (Evolutionary linguistics), mars 2014, ENS Ulm, Paris
Message-ID:

Date: Fri, 21 Feb 2014 16:49:00 +0100
From: Thierry Poibeau
Message-Id: <85397E93-8406-46B0-B830-BD838EA30343 at ens.fr>
X-url: http://transfers.ens.fr/index.php/activites/professeurs-invites/495

Dear colleagues,

The Lattice laboratory (http://www.lattice.cnrs.fr/), with the support of the labex Transfers (http://transfers.ens.fr/), is hosting Ted Briscoe (http://www.cl.cam.ac.uk/~ejb1/), of the University of Cambridge, for a series of lectures to be given in March at the Ecole normale supérieure (45 rue d'Ulm) on modelling the evolution of languages ("evolutionary linguistics").

The lectures will take place on the following dates and in the following rooms:
- Tuesday 11 March, 10:30 to 12:30: Celan room
- Thursday 13 March, 10:30 to 12:30: room R
- Tuesday 18 March, 10:30 to 12:30: IHMC room
- Thursday 20 March, 10:30 to 12:30: room R

The programme is detailed below and at the following address: http://transfers.ens.fr/index.php/activites/professeurs-invites/495

Everyone is welcome, subject to available seating.

Best regards,
Thierry Poibeau

From hamon at LIMSI.FR Sun Feb 23 10:25:05 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:25:05 +0100
Subject: Seminaire: Julia Hirschberg, 18 mars 2014, BLRI, Aix-en-Provence
Message-ID:

Date: Thu, 20 Feb 2014 16:28:47 +0100
From: Nadéra Bureau
Message-ID: <00c401cf2e50$72fc1c20$58f45460$@bureau at lpl-aix.fr>

18 March 2014, 11:00
Conference room B011, building B
5 avenue Pasteur, Aix-en-Provence (Labex BLRI)

Julia Hirschberg (Columbia University)
Prosodic and Social Dimensions of Entrainment in Dialogue

Abstract: When people speak together, they often adapt aspects of their speaking style based upon the style of their conversational partner. This phenomenon goes by many names, including adaptation, alignment, and entrainment, inter alia. In this talk, I will describe experiments in English and Mandarin prosodic entrainment in the Columbia Games Corpus and in the Tongji Games Corpus, large corpora of speech recorded from subjects playing a series of computer games. I will discuss how prosodic entrainment is related to turn-taking behaviors and to several measures of task and dialogue success.
I will also discuss experiments relating entrainment to several social dimensions, including likeability and dominance. This is joint work with Stefan Benus, Agustín Gravano, Ani Nenkova, Rivka Levitan, Laura Willson, and Zhihua Xia.

From hamon at LIMSI.FR Sun Feb 23 10:29:39 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:29:39 +0100
Subject: Job: TEMIS, Poste Consultant en solutions
Message-ID:

Date: Fri, 21 Feb 2014 10:11:50 +0000
From: Sylvie Guillemin-Lanne
Message-ID: <0195B854C753DA4CAE5056A849136D8861119D at EXMBX01.hosting.local>

TEMIS, a software vendor specializing in text mining, is looking for a Text-Mining Solutions consultant for its Projets Innovants (Innovative Projects) team: a computational linguist.

One-year fixed-term contract (CDD), renewable.

You will work within the Projets Innovants team on partnership projects co-funded by the French government or the European Commission. You will spend 50% of your time, under the supervision of the R&D team, on the new features of our products, which you will test and customize for the needs of the Projets Innovants team.

You have a solid linguistic background in document analysis, text semantics and text data mining. You have Python/Perl or Java skills that allow you to transform textual data for use in various tools. You enjoy discovering and learning new techniques and tools. You are autonomous, passionate and curious, and you naturally reach out to different people to find out how to carry out your projects. You have a very good level of English.

Please send your CV and a cover letter to Sylvie Guillemin-Lanne, sylvie.guillemin-lanne at temis.com

From hamon at LIMSI.FR Sun Feb 23 10:30:20 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:30:20 +0100
Subject: Soft: L'analyseur multilingue LIMA sous licence libre
Message-ID:

Date: Fri, 21 Feb 2014 13:31:58 +0100
From: Gaël de Chalendar
Message-ID: <1445036.1DoyL0GLaY at brezhoneg.intra.cea.fr>

The LIMA multilingual analyzer released under a free license

CEA LIST announces the release under a free license of its linguistic analyzer LIMA, now available under the Affero GPL v3 license here: https://github.com/aymara/lima/wiki

LIMA (Libre Multilingual Analyzer) is a multilingual linguistic analyzer developed over more than 10 years by the LVIC laboratory (Laboratoire Vision et Ingénierie des Contenus) of CEA LIST, under a proprietary model.
The software is now available under a dual-licensing model. LIMA supports all stages of linguistic analysis (morphology, syntax, semantics, entities, coreference) in a modular, highly configurable architecture, with high performance in both speed and analysis quality. LIMA also has the advantage of having been designed from the outset for integration into industrial-grade solutions.

The free version is complete, with processing modules and resources for analyzing texts in English and French. In accordance with the Affero GPL v3 license, LIMA may be used freely as long as the software linked to it, whether distributed or run over the network, is itself free software under a compatible license.

The commercial version is supplemented with modules and resources for analyzing texts in many additional languages (German, Arabic, Chinese, Spanish, Hungarian, Italian, ...). It is available directly from CEA LIST within R&D partnerships, or from its partner ANT'inno, which offers packages including software support and the adaptation of linguistic resources to specific needs.

With this free release of its software, CEA LIST aims to provide a linguistic analysis solution that can be used both by researchers in the field and as a text preprocessing tool in open source applications. Adoption of LIMA by a large community will allow it to improve faster and more robustly, from both a purely software and a scientific point of view.

For users of the free version, the benefit is access to an industrial-grade tool whose open source development is backed by a long-lived organization (CEA). As for the commercial version, users can freely test LIMA and adapt it to their use before acquiring a commercial license.

Contact: Gaël de Chalendar, gael.de-chalendar at cea.fr

From hamon at LIMSI.FR Sun Feb 23 10:28:52 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:28:52 +0100
Subject: Appel: VarDial Workshop, COLING 2014
Message-ID:

Date: Thu, 20 Feb 2014 16:49:25 -0300
From: Marcos Zampieri
Message-ID:
X-url: http://corporavm.uni-koeln.de/vardial/DSL
X-url: http://corporavm.uni-koeln.de/vardial/sharedtask.html

Call For Papers

1st Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects (VarDial) and DSL Shared Task at COLING 2014 in Dublin, Ireland.

VarDial Workshop: http://corporavm.uni-koeln.de/vardial/
DSL Shared Task: http://corporavm.uni-koeln.de/vardial/sharedtask.html

The interest in language resources and computational models for the study of similar languages, varieties and dialects has been growing substantially in the last few years. The VarDial workshop aims to provide a forum for scholars working on a range of topics related to the study of linguistic variation.
We welcome papers dealing with one or more topics of the VarDial workshop:

1) Language Resources and Tools for Varieties and Dialects
- Corpus compilation (comparable, parallel)
- Adaptation of tools (taggers, parsers) for similar languages, varieties and dialects
- Evaluation of LR and tools applied to language varieties
- Reusability of LR in NLP applications (e.g. machine translation)
- Corpus-driven studies in dialectology and language variation

2) NLP Tasks and Applications
- Identification of lexical variation and lexicon induction
- Automatic classification of language varieties
- Domain adaptation between language varieties
- Identification of false friends
- Machine translation between varieties and dialects

Along with the VarDial workshop we will also be organizing the Discriminating between Similar Languages (DSL) shared task. We will release a specific call for participation soon. More information: http://corporavm.uni-koeln.de/vardial/sharedtask.html

Instructions for Authors
Papers should contain a maximum of 8 pages + 2 additional pages for references. The templates are available at the COLING website. For better presentation we strongly recommend that papers be prepared using LaTeX. The review process will be double blind; submissions should be anonymous.

Important Dates
Paper Submission: May 2nd, 2014
Notification of Acceptance: June 6th, 2014
Camera-Ready Version: June 27th, 2014
Workshop: August 23rd or 24th, 2014

Program Committee
Željko Agić (University of Potsdam, Germany)
Jorge Baptista (University of Algarve and INESC-ID, Portugal)
Francis Bond (Nanyang Technological University, Singapore)
Aoife Cahill (Educational Testing Service, United States)
Paul Cook (University of Melbourne, Australia)
Liviu Dinu (University of Bucharest, Romania)
Stefanie Dipper (Ruhr University Bochum, Germany)
Sascha Diwersy (University of Cologne, Germany)
Tomaž Erjavec (Jozef Stefan Institute, Slovenia)
Mikel L. Forcada (University of Alicante, Spain)
Binyam Gebrekidan Gebre (Max Planck Institute for Psycholinguistics, Holland)
Nitin Indurkhya (University of New South Wales, Australia)
Jeremy Jancsary (Nuance Communications, Austria)
Marco Lui (University of Melbourne, Australia)
Preslav Nakov (Qatar Computing Research Institute, Qatar)
Santanu Pal (Saarland University, Germany)
Sebastian Padó (University of Stuttgart, Germany)
Reinhard Rapp (University of Mainz, Germany and University of Aix-Marseille, France)
Felipe Sánchez Martínez (University of Alicante, Spain)
Kevin Scannell (Saint Louis University, USA)
Yves Scherrer (University of Geneva, Switzerland)
Serge Sharoff (Leeds University, United Kingdom)
Kiril Simov (Bulgarian Academy of Sciences, Bulgaria)
Elke Teich (Saarland University, Germany)
Joel Tetreault (Nuance Communications, USA)
Francis Tyers (University of Alicante, Spain)
Cristina Vertan (University of Hamburg, Germany)
Torsten Zesch (University of Duisburg-Essen, Germany)

Organizers
Marcos Zampieri (Saarland University, Germany)
Liling Tan (Saarland University, Germany)
Nikola Ljubešić (University of Zagreb, Croatia)
Jörg Tiedemann (Uppsala University, Sweden)

Contact
Workshop: vardialworkshop at gmail.com
Shared Task: dsl.sharedtask at gmail.com

From hamon at LIMSI.FR Sun Feb 23 10:24:01 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:24:01 +0100
Subject: Appel: #di2014 - DIGITAL INTELLIGENCE, Sept. 17-19, Nantes, France
Message-ID:

Date: Thu, 20 Feb 2014 15:25:32 +0100
From: Frederic Benhamou & Milad Doueihi - #di2014 Co-Chairs
Message-Id: <20140220142552.BF4081010443 at smtp.univ-nantes.fr>

============== #di2014 | CALL FOR PAPERS =============

CALL FOR PAPERS - #di2014
International Conference on Digital Intelligence
Sept. 17-19, 2014 - Nantes, France
www.di2014.org - @DIConf2014

Digital Intelligence 2014 (#di2014) is a new international scientific and interdisciplinary conference dedicated to digital society and cultures. The challenge is to bring together researchers from various disciplines in order to discuss and contribute to shaping a new scientific and cultural paradigm.

#di2014 will be jointly organized with Scopitone, the main French Festival on digital arts and electronic music (Scopitone 2014, Sept. 16-21, 2014). The program will be built around four main components: plenary invited talks; contributed talks in parallel sessions; artistic sessions, performances and works shared with the Scopitone festival; and finally sessions and events related to the (real) economy of digital cultures.

Program co-chairs: Milad Doueihi, Université Laval (Canada) / Frédéric Benhamou, Université de Nantes (France)
Conference co-chairs: Francky Trichet, Université de Nantes (France) / Stéphane Roche, Université
Laval (Canada)

---------------------------- INVITED SPEAKERS ----------------------------
- Serge Abiteboul, Académie des Sciences / INRIA / Collège de France, France
- Gérard Berry, Académie des Sciences / INRIA / Collège de France, France
- Caterina Fake, Co-Founder of Flickr, Findery & Hunch, USA
- Bruno Latour, MediaLab, Sciences Po, France
- Carlo Ratti, MIT Senseable City Lab, Massachusetts Institute of Technology, USA
- Jeffrey Schnapp, MetaLAB, Harvard University, USA

---------------------------- IMPORTANT DATES ----------------------------
- Submission deadline: April 30, 2014
- Notification of acceptance/rejection: June 16, 2014
- Conference: Sept. 17-19, 2014

---------------------------- SUBMISSION GUIDELINES ------------------
Papers should be submitted online using this form: https://www.easychair.org/conferences/?conf=di2014
Papers should not exceed 5 pages in LNCS format (http://www.springer.de/comp/lncs/authors.html) and should be submitted in PDF format. The submitted papers (technical or position papers, surveys) must be unpublished and must not be under review elsewhere.
---------------------------- TOPICS ------------------------------------------- Main areas of interest include, but are not limited to: - Art & Culturally-Aware Information Technology: Digital Art, IT for cultural, natural and scientific Heritage, In-museum Innovative Technology and uses - Data: Big Data, Open Data, Linked Data, Data Journalism, Data Visualization - Digital Organizations: Applications of new technologies to e-Business and e-Administration, e-Commerce, e-Marketing, m-Commerce, m-Marketing, Organizational and management issues - Social Web: Social Network Analysis, Communities of practice / interest in social media and mobile devices, Collective intelligence, Collaborative production and Social Computing, Knowledge Ecosystems, Digital Ecosystems, Economics and social innovation on Digital Ecosystems - Digital Identity: e-reputation (human, product, enterprise, government, etc.), Digital traces and memories, Privacy, Trust, Security and Personal Data Management Systems, Human factors (culture, affect, motivation, cognition) and user centered design in digital technology - e-Learning: Social impact and cultural issues in e-Learning, Policy and organizational issues in e-Learning, e-Universities, e-Schools, e-Learning technologies and tools, e-Learning standards (Open Course Ware, Open Access), open e-Learning : OER, MOOCs, virtual mobility - Green Digital Economy: Green computing, IT for Sustainability, Smart Cities and Homes - Digital Trends and Emerging Practices: Cloud Computing, Emotional Computing, Serious Games, Makers, FabLabs, MediaLabs, LivingLabs, ArtLab, innovative Digital Solution in Health, Tourism, Internet of Things, Digital Addiction Studies - Human-Robot Interaction: Ethical and social issues of HRI, Bio-inspired robotics, Humanoid robotics, Socially intelligent robots, assistive (health & personal care) robotics, Transhumanism - Human-Computer Interaction: Virtual, Augmented and Mixed Reality, Design, User Experience and Usability - 
Digital humanities: Philosophy of the Web, Digital Literature, Digital Literacy

---------------------------- PROCEEDINGS ---------------------------------
The accepted papers will be included in the electronic conference proceedings. Authors of selected conference papers will be invited to submit an extended version of their work to scientific journals or a compilation volume of articles.

---------------------------- NANTES DIGITAL WEEK ---------------------
#di2014 is at the heart of the Nantes Digital Week (Sept. 13-21, 2014), which brings together conferences, workshops, demonstrations, performances and concerts related to Art, Science and Economy. Festive, innovative and hybrid, Nantes Digital Week appeals to a wide audience and will be organized around major events: Startup Weekend, Scopitone festival on Digital Art and Electronic Cultures, FabLab Day, Robotic Day, etc.

From hamon at LIMSI.FR Sun Feb 23 10:18:00 2014
From: hamon at LIMSI.FR (Thierry Hamon)
Date: Sun, 23 Feb 2014 11:18:00 +0100
Subject: Appel: CICE-14
Message-ID:

Date: Thu, 20 Feb 2014 10:05:10 +0000 (GMT)
From: "CICE, cfp"
Message-ID: <998870958.12733.1392890710902.open-xchange at email.1and1.co.uk>
X-url: http://www.ciceducation.org

Apologies for cross-postings. Kindly email this call for papers to your colleagues, faculty members and postgraduate students.
CALL FOR ABSTRACTS, EXTENDED ABSTRACTS, PAPERS, POSTERS AND WORKSHOPS! ============================================ Canada International Conference on Education (CICE-2014) June 16-19, 2014 Cape Breton University, Nova Scotia, Canada www.ciceducation.org ============================================ The CICE is an international refereed conference dedicated to the advancement of the theory and practice of education. The CICE promotes collaborative excellence between academicians and professionals from Education. The aim of CICE is to provide an opportunity for academicians and professionals from various educational fields with cross-disciplinary interests to bridge the knowledge gap and to promote research esteem and the evolution of pedagogy. The CICE-2014 invites research papers that encompass conceptual analysis, design implementation and performance evaluation. All accepted papers will appear in the proceedings, and modified versions of selected papers will be published in special issues of peer-reviewed journals.
The topics in CICE-2014 include but are not confined to the following areas: *Art Education * Music Education * Writing Education * Imaginative Education * Language Education * History *Adult Education * Competitive Skills * Continuing Education * Higher Education * Vocational Education * Transferring Disciplines *Business Education * Educational Administration * Human Resource Development * Academic Advising and Counselling * Education Policy and Leadership * Industrial Cooperation * Life-long Learning Experiences * Workplace Learning and Collaborative Learning * Work Employability * Educational Institution Government Partnership * Patent Registration and Technology Transfer * University Spin-Off Companies *Course Management * Accreditation and Quality Assurance * Academic Experiences and Best Practice Contributions * Copy-right * Digital Libraries and Repositories * Digital Rights Management * Evaluation and Assessment * E-content Management and Development * Open Content * e-Portfolios * Grading Methods * Knowledge Management * Quality processes at National and International level * Security and Data Protection * Student Selection Criteria in Interdisciplinary Studies * User-Generated Content *Curriculum, Research and Development * Acoustics in Education Environment * APD/Listening * Counsellor Education * Courses, Tutorials and Labs * Curriculum Design * ESL/TESL *Educational Foundations * Early Childhood Education * Elementary Education * Geographical Education * Health Education * Home Education * Rural Education * Science Education * Secondary Education * Second life Educators * Social Studies Education * Special Education *Learning / Teaching Methodologies and Assessment * Simulated Communities and Online Mentoring * e-Testing and new Test Theories * Supervising and Managing Student Projects * Pedagogy Enhancement with e-Learning * Educating the Educators * Immersive Learning * Blended Learning * Computer-Aided Assessment * Metrics and Performance 
Measurement * Assessment Software Tools * Assessment Methods in Blended Learning Environments *Global Issues In Education and Research * Education, Research and Globalization * Barriers to Learning (ethnicity, age, psychosocial factors, ...) * Women and Minorities in Science and Technology * Indigenous and Diversity Issues * Government Policy issues * Organizational, Legal and Financial Aspects * Digital Divide * Increasing Affordability and Access to the Internet * Ethical issues in Education * Intellectual Property Rights and Plagiarism *Pedagogy * Teacher Education * Cross-disciplinary areas of Education * Educational Psychology * Education practice trends and issues * Indigenous Education * Kinesiology and Leisure Science * K12 * Life-long Learning Education * Mathematics Education * Physical Education (PE) * Reading Education * Religion and Education Studies *Research Management * Research Methodologies * Academic Research Projects * Joint-research programmes * Research on Technology in Education * Research Centres * Links between Education and Research * New Challenges in Education * ECTS experiences * The Bologna Process and its implementation * Joint-Degree Programmes * Erasmus and Exchange experiences in universities * Students and Teaching staff Exchange programmes *Ubiquitous Learning * Accessibility to Disabled Users * Animation, 3D, and Web 3D Applications * Context Dependent Learning * Distance Education * E-Learning * E-Manufacturing * Educational Technology * Educational Games and Software * Human Computer Interaction * ICT Education * Internet technologies * Learning Management Systems (LMS) * Mobile Applications and Learning (M-learning) * Multi-Virtual Environment * Standards and Interoperability * Technology Enhanced Learning * Technology Support for Pervasive Learning *Ubiquitous Computing * Videos for Learning and Educational Multimedia * Virtual and Augmented Reality * Virtual Learning Environments (VLE) * Web 2.0, Social Networking, Blogs 
and Wikis * Wireless Applications *Research In Progress Important dates: Research Paper, Extended Abstract, Case Study, Work in Progress and Report Submission Deadline: March 10, 2014; Notification of Paper, Extended Abstract, Case Study, Work in Progress and Report Acceptance Date: March 25, 2014; Final Paper Submission Deadline for Conference Proceedings Publication: April 30, 2014; Workshop Proposal Submission Deadline: March 20, 2014; Notification of Workshop Proposal Acceptance/Rejection: March 30, 2014; Poster/Demo Proposal Submission: March 20, 2014; Notification of Poster/Demo Acceptance: March 30, 2014; Participant(s) Registration (Open): December 01, 2013 to June 15, 2014; Early Bird Registration (Authors and Participants): January 30 to April 15, 2014; Late Bird Registration (Authors only): April 16 to May 18, 2014; Conference Dates: June 16-19, 2014. For further information please visit CICE-2014 at www.ciceducation.org From hamon at LIMSI.FR Wed Feb 26 09:03:50 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 10:03:50 +0100 Subject: Appel: Seven SIGIR'14 Workshops on emerging areas in IR Message-ID: Date: Wed, 26 Feb 2014 05:00:45 +0000 From: Richi Nayak Message-ID: <032189CFEE6B094A8CB9DB0838786ADF052740 at ex10mb1.qut.edu.au> X-url: http://sigir.org/sigir2014/ [Apologies if you receive this more than once] The workshop program of SIGIR'14, the 37th Annual ACM SIGIR Conference, Gold Coast, Australia, 6-11 July, 2014, will host seven attractive workshops covering novel ideas and emerging areas in IR: * ERD'14: Entity Recognition and Disambiguation Challenge http://web-ngram.research.microsoft.com/ERD2014/ The Entity Recognition and Disambiguation Workshop will be organized as a challenge, where participants submit working systems that identify the entities mentioned in text. The challenge will have two tracks, focusing on long and short texts.
All submissions will be evaluated on shared datasets; part of the data will be withheld, to be used for the final evaluation of all submitted systems to determine the winners. Each participating team will be offered a spot at the workshop to present their system. David Carmel, Yahoo! Research; Ming-Wei Chang, Microsoft Research; Evgeniy Gabrilovich, Google; Bo-June (Paul) Hsu, Microsoft Research; Kuansan Wang, Microsoft Research * GEAR'14: Gathering Efficient Assessments of Relevance Workshop https://sites.google.com/site/sigirgear/ Evaluation is a fundamental part of Information Retrieval, and in the conventional Cranfield evaluation paradigm, sets of relevance assessments are a fundamental part of test collections. In this workshop, we wish to revisit how relevance assessments can be efficiently created. Potential themes include methods for generating assessments, the process of assessment, effort involved in assessing different materials, exploration of the concept of relevance, etc. A discussion and exploration of this issue will be facilitated through the presentation of results-based papers and position papers on the topic, as well as a group design activity. Martin Halvey, Glasgow Caledonian University; Robert Villa, University of Sheffield; Paul Clough, University of Sheffield * MedIR'14: Medical Information Retrieval Workshop http://medir.dcu.ie/ Medical information is accessible from diverse sources including the general web, social media, journal articles, and hospital records; users include patients and their families, researchers, practitioners and clinicians. Challenges in medical information retrieval include: diversity of users and user ability; variations in the format, reliability, and quality of biomedical and medical information; the multimedia nature of data; and the need for accuracy and reliability.
The aim of the workshop is to bring together researchers interested in medical information search with the goal of identifying specific challenges that need to be addressed to advance the state-of-the-art. Eiji Aramaki, Kyoto University, Japan; Lorraine Goeuriot, Dublin City University, Ireland; Gareth JF Jones, Dublin City University, Ireland; Liadh Kelly, Dublin City University, Ireland; Henning Müller, University of Applied Sciences Western Switzerland; Justin Zobel, University of Melbourne, Australia * PIR'14: Privacy-Preserving IR Workshop - When Information Retrieval Meets Privacy and Security http://www.cs.georgetown.edu/~huiyang/sigir2014-pir-workshop/ Information retrieval and information privacy/security are two fast-growing computer science disciplines. There are many synergies and connections between these two disciplines. However, there have been very limited efforts to connect the two. Moreover, due to the lack of mature techniques in privacy-preserving IR, concerns about privacy and security have become serious obstacles that prevent valuable user data from being used in IR research such as studies of query logs, social media, tweets, sessions, and medical record retrieval. This privacy-preserving IR workshop aims to spur research that brings together the fields of IR and privacy/security, and to mitigate privacy threats in information retrieval by exploring novel algorithms and tools. Luo Si (Purdue University, USA); Grace Hui Yang (Georgetown University, USA) * SMIR'14: Semantic Matching in Information Retrieval http://smir2014.noahlab.com.hk Recently, significant progress has been made in research on what we call semantic matching (SM), in Web search, question answering, online advertisement, cross language information retrieval, multimedia retrieval, and other tasks. Let us take Web search as an example of the problem.
When comparing the textual content of query and documents, the simple term-based approaches can fail when searcher and author use different terms. A more realistic approach beyond bag-of-words, referred to as semantic matching (SM), is to conduct deeper query and document analysis to encode text with richer representations and then perform query-document matching with such representations. The main purpose of the workshop is to bring together IR and NLP researchers working on or interested in semantic matching, to share the latest research results, express opinions on the related issues, and discuss future directions. Julio Gonzalo, UNED, Spain; Hang Li, Noah's Ark Lab, Huawei, Hong Kong; Alessandro Moschitti, Qatar Computing Research Institute, Qatar; Jun Xu, Noah's Ark Lab, Huawei, Hong Kong * SoMeRA'14: Social Media Retrieval and Analysis Workshop http://www.cp.jku.at/conferences/SoMeRA2014/ The SoMeRA 2014 workshop will present and discuss cutting edge research on all topics of retrieval, recommendation, and browsing in social media, as well as on the analysis of users' multifaceted traces in social media. In particular, novel methods and ideas that address challenges such as the large quantity and noisiness of user-generated multimedia data, user biases, the cold-start problem, or integrating contextual aspects into retrieval and recommendation techniques are highly welcome. The workshop will further foster the exchange of ideas between different communities; in particular, it aims at better connecting the multimedia and recommender systems communities with the information retrieval community. The workshop will feature both oral presentations (full papers) and poster/demo presentations (short papers).
Markus Schedl, Johannes Kepler University, Austria; Peter Knees, Johannes Kepler University, Austria; Jialie Shen, Singapore Management University, Singapore * TAIA'14: Temporal, social and spatially Aware Information Access Workshop http://research.microsoft.com/en-us/people/milads/taia2014.aspx Users provide an unprecedented volume of detailed and continuously updated information about where they are, what they are doing, who they are with, and what they are thinking and feeling about their activities. The provision of this stream creates an informal contract between the user and the information access application in which the user will provide the information, but the application must provide results that are contextually relevant. In this workshop we explore spatial and temporal context in dynamic geotagged collections, such as Wikipedia, and traditional news sources, as well as social media sites such as Twitter, Foursquare, Facebook and Flickr. To ground the workshop, and provide a locus for discussion of the two aspects of user context, we focus on event detection and recommendation. Events are a natural theme around which to center discussions of spatial and temporal context because events are defined by their time and place. Fernando Diaz, Microsoft Research; Claudia Hauff, Delft University of Technology; Vanessa Murdock, Microsoft; Maarten de Rijke, University of Amsterdam; Milad Shokouhi, Microsoft. Please look at the individual websites for the calls and deadlines, and participate in the discussion on the SIGIR'14 workshop day, on Friday 11 July 2014, in the beautiful scenery of Gold Coast, Queensland, Australia.
From hamon at LIMSI.FR Wed Feb 26 08:40:10 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:40:10 +0100 Subject: Sujet de these: Humanites numeriques, a pourvoir immediatement, Lattice Message-ID: Date: Tue, 25 Feb 2014 18:28:36 +0100 From: Thierry Poibeau Message-Id: <7C7AC0DD-14E0-4F33-AF35-3021D7B1EE8C at ens.fr> Hello, Following a late withdrawal, Lattice has funding for a three-year research grant starting in April 2014. The research topic falls within the "digital humanities" and the automatic analysis of large volumes of textual data (possible topics: moving paper documents to a digital medium in the field of archaeology, natural language processing for the analysis of literary documents, the contribution of natural language processing to the humanities, etc.). The thesis will address one or two specific case studies: starting from these case studies, the aim will be to consider what digital methods can contribute to the humanities and social sciences. The work will rely as far as possible on large volumes of data. Eligibility requirements: - hold a Master's degree with honours (Master 2 already obtained) - never have been enrolled in a PhD programme
Desired skills: - command of natural language processing tools - computer science skills - interest in the humanities and social sciences - good command of French and English - good writing skills. To apply, contact Thierry Poibeau (prenom.nom at ens.fr) as soon as possible (and in any case before 4 March 2014) by email with a brief description of the project / of your motivation, a CV and a transcript of your Master's grades. Best regards, Thierry Poibeau http://www.lattice.cnrs.fr/Thierry-Poibeau From hamon at LIMSI.FR Wed Feb 26 08:45:19 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:45:19 +0100 Subject: Appel: Atelier FondamenTAL, TALN 2014 Message-ID: Date: Sun, 23 Feb 2014 22:37:58 +0100 From: Marie-Hélène Stéfanini Message-ID: <530A6A36.1010502 at lif.univ-mrs.fr> X-url: http://talep.lif.univ-mrs.fr/FondamenTAL TALN 2014, 21st conference on Natural Language Processing (Traitement Automatique des Langues Naturelles) CALL FOR CONTRIBUTIONS FondamenTAL Workshop: The lexical resources of Jean Dubois and Françoise Dubois-Charlier. Experience reports and projects, and a tribute to Paul Sabatier. 1 July 2014, Faculté Saint-Charles, Marseille. Since 2007, the FondamenTAL site (http://talep.lif.univ-mrs.fr/FondamenTAL), created
by Paul Sabatier, has allowed the public to learn about the full set of digitized lexical resources of Jean Dubois and Françoise Dubois-Charlier. Many researchers and doctoral students in linguistics and NLP have asked the site http://www.modyco.fr/Dictionnaires to send them the various dictionaries, or have gone to http://rali.iro.umontreal.ca/ Dubois to use an interface for consulting the dictionary of French verbs, Les Verbes Français (LVF). Les Verbes Français (LVF) has already been cited many times in linguistics publications and theses. In addition, two journal issues (Langue Française no. 153 and Langages no. 179-180) and a book chapter (Ressources Lexicales 2013, Gala & Zock ed.) help make the work of Dubois and Dubois-Charlier better known. An informal community of users of these resources has formed, and we hope that part of it will be brought together at this workshop. The workshop we propose has three goals. The first is to bring together researchers who have made significant use of the Dubois and Dubois-Charlier resources available so far, in order to take stock of their research with them. The second is to inform the community of the release of previously unpublished lexical resources, namely: 1) a version, revised by Paul Sabatier, of Les Verbes Français (LVF, 26,000 entries); 2) an unpublished work, Locutions Verbales (LVF, 3,510 entries), revised by Paul Sabatier; 3) another unpublished work, the Dictionnaire Electronique des Mots (DEM; 145,000 entries). The third is to pay tribute to Paul Sabatier by recalling his latest work on the most recent electronic dictionaries of Dubois and Dubois-Charlier. The Dictionnaire Electronique des Mots (DEM, 145,000 entries) is a lexical database of French that synthesizes these authors' previously known dictionaries of simple verbs, verbal locutions and adjectives,
extended with all other word categories, both simple words and multiword expressions. The authors presented its outline in 2010 in Langages no. 179-180. Being at once exhaustive, rigorously systematic and syntactico-semantic, it should interest many NLP researchers, all the more so as it will have been made available in several formats convenient for automatic processing. The workshop will therefore be an opportunity not only to discuss work on the already published resources of Dubois and Dubois-Charlier, but also to present the DEM and to detail its properties and modes of use. In addition, the important role that Paul Sabatier played in revising these resources will be discussed at length. Workshop chairs: Denis Le Pesant, Université Paris-Ouest Nanterre La Défense & MoDyCo (CNRS); Marie-Hélène Stéfanini, Laboratoire d'Informatique Fondamentale (LIF), Aix-Marseille Université & CNRS. Programme committee: Françoise Dubois-Charlier, Université d'Aix-Marseille; Guy Lapalme, Université de Montréal, Laboratoire Recherche Appliquée en Linguistique Informatique (RALI); Danielle Leeman, Université Paris Ouest Nanterre & ICAR; Denis Le Pesant, MoDyCo (CNRS) & Université Paris Ouest Nanterre La Défense; Max Silberztein, Université de Franche-Comté; Marie-Hélène Stéfanini, Laboratoire d'Informatique Fondamentale (LIF), Aix-Marseille Université & CNRS; Agnès Tutin, LIDILEM & Université Stendhal (Grenoble). Submission procedure. Step 1: Submission of an abstract of the paper (one page maximum). Abstracts should be sent to denis.lepesant at orange.fr and marie-helene.stefanini at lif.univ-mrs.fr before 21 March 2014. We will announce the length of the talks later; it will depend on the number of accepted papers.
Step 2: Submission of an article by the authors of accepted papers. Important dates: 20 March 2014: submission of abstracts; 10 April 2014: notification of acceptance or rejection of the paper; 1 May 2014: submission of the first version of the articles to denis.lepesant at orange.fr and marie-helene.stefanini at lif.univ-mrs.fr; 13 May 2014: results of the review of the articles; 30 May 2014: receipt of the final version of the articles. Articles, in French or English, must follow the format of a TALN 2014 short paper (i.e. 6 to 8 pages). A LaTeX style file and LibreOffice and Word templates are available on the conference website (http://www.taln2014.org/site/soumission/). Workshop website: http://www.taln2014.org/site/ressources-lexicales-et-tal-vue-densemble-sur-les-dictionnaires-electroniques-de-jean-dubois-et-francoise-dubois-charlier/ For information: website of TALN 2014, the 21st conference on Natural Language Processing (Traitement Automatique des Langues Naturelles): www.taln2014.org
From hamon at LIMSI.FR Wed Feb 26 08:39:57 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:39:57 +0100 Subject: Appel: Natural Language and Computer Science, July 17-18, Vienna Message-ID: Date: Sun, 23 Feb 2014 19:00:06 +0100 From: retore Message-Id: X-url: http://www.indiana.edu/~iulg/nlcs.html Workshop on NATURAL LANGUAGE AND COMPUTER SCIENCE (NLCS '14) 17-18 July, 2014 Vienna, Austria http://www.indiana.edu/~iulg/nlcs.html Endorsed by the Association for Computational Linguistics Special Interest Group on Computational Semantics. Vienna Summer of Logic http://vsl2014.at/ Computer Science Logic - Logic In Computer Science 2014 http://lii.rwth-aachen.de/lics/csl-lics14/ Paper submission deadline: April 1, 2014 AIMS AND SCOPE Formal tools coming from logic and category theory are important for natural language processing and especially for computational semantics. Moreover, work on these tools borrows heavily from all areas of theoretical computer science. In the other direction, applications having to do with natural language have inspired developments on the formal side. The workshop invites papers on both topics and their applications, as well as on the combination of logical and statistical methods.
Specific topics include, but are not limited to: * linguistic, computational and logical aspects of the interface between syntax and semantics * logical aspects of linguistic theories * logic for semantics of lexical items, sentences, discourse and dialog * continuations in natural language semantics * formal tools in textual inference, such as logics for natural language inference * applications of category theory in semantics * linear logic in semantics * formal approaches to unifying data-driven (quantitative, statistical) and declarative (logical) approaches to semantics * natural language processing tools using some logic IMPORTANT DATES Paper submission deadline: April 1, 2014; Author notification: May 1, 2014; Electronic versions of papers due: May 15, 2014; Workshop: July 17, 2014 INVITED SPEAKERS TO INCLUDE Anne Abeillé (to be confirmed), Université Paris Diderot; Aarne Ranta, Chalmers University of Technology and University of Gothenburg SUBMISSIONS Please submit extended abstracts of 4-10 pages using EasyChair: https://www.easychair.org/conferences/?conf=nlcs14 ORGANIZERS Valeria de Paiva, Nuance; Larry Moss, Indiana University; Christian Retoré, Université de Bordeaux PROGRAM COMMITTEE Robin Cooper, University of Gothenburg; Valeria de Paiva, Nuance; Christophe Fouqueré, Université Paris 13; Larry Moss, Indiana University; Ian Pratt-Hartmann, University of Manchester, UK; Christian Retoré, Université de Bordeaux; Wlodek Zadrozny, UNC, Charlotte
From hamon at LIMSI.FR Wed Feb 26 08:49:09 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:49:09 +0100 Subject: Job: Postdoctoral fellowship on computational analysis of parliamentary text Message-ID: Date: Tue, 25 Feb 2014 14:52:58 -0500 From: Graeme Hirst Message-Id: <25687A62-1ED0-40B7-88EB-41F134143A65 at cs.toronto.edu> X-url: http://www.sgs.utoronto.ca/postdoctoralfellows/Pages/Current-Postdocs.aspx The Departments of Political Science and Computer Science of the University of Toronto seek to hire a postdoctoral researcher for a two-year term, beginning 1 July 2014, or as soon as possible. The successful applicant will join an international consortium of researchers focused on the development of computational tools for the analysis of political ideology in the recorded histories of parliamentary debates in Canada, the Netherlands, and the United Kingdom. The successful applicant will have a Ph.D. in a relevant field of specialization (e.g., political science, computational linguistics), as well as experience with the computational analysis of large datasets, especially large corpora of text. Fluency in French and/or Dutch is an asset, but not a requirement. The stipend for this position is $50 000 (CDN) per year, plus an additional 10% benefits package, which is outlined at: http://www.sgs.utoronto.ca/postdoctoralfellows/Pages/Current-Postdocs.aspx. The University of Toronto is situated in downtown Toronto, Ontario, Canada, a large multicultural city in the center of an urban area with a population of more than 5.5 million people. The University of Toronto is within walking distance of all of the amenities that a major metropolitan center provides.
Applicants should forward a cover letter, a one-page description of their research interests, a curriculum vitae, and the contact information for three referees to both Professor Christopher Cochrane (christopher.cochrane at utoronto.ca) and Professor Graeme Hirst (gh at cs.toronto.edu). Consideration of applications will begin on 1 April 2014, and continue until the position is filled. The University of Toronto is strongly committed to diversity within its community and especially welcomes applications from visible minority group members, women, Aboriginal persons, persons with disabilities, members of sexual minority groups, and others who may contribute to the further diversification of ideas. From hamon at LIMSI.FR Wed Feb 26 08:48:12 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:48:12 +0100 Subject: Appel: COLING 2014 Message-ID: Date: Tue, 25 Feb 2014 11:39:51 +0000 From: John Judge Message-ID: <530C8107.4040402 at computing.dcu.ie> X-url: http://www.coling-2014.org ********** Apologies for cross-posting ********** Second (Main) Call for Papers - Coling 2014 COLING 2014 Dublin, Ireland, 23-29 August, 2014 The International Committee on Computational Linguistics (ICCL) is pleased to announce the 25th International Conference on Computational Linguistics (Coling 2014), at Dublin City University (DCU, Dublin, Ireland). DCU is a young, dynamic and ambitious university with a mission to transform lives and societies through education, research and innovation. Most of the local organisers are from CNGL, Ireland's Centre for Global Intelligent Content (formerly the Centre for Next Generation Localisation), which embodies the leading position of Ireland in the global localisation/internationalisation business, a strong focus on language technologies including machine translation, computational linguistics and natural language processing, as well as on intelligent management, search, retrieval, transformation and adaptation of content. 
Coling will cover a broad spectrum of technical areas related to natural language and computation. The conference will include full papers (presented as oral presentations or posters), demonstrations, tutorials, and workshops. TOPICS OF INTEREST Coling 2014 solicits papers and demonstrations on original and unpublished research on the following topics, including, but not limited to: - pragmatics, semantics, syntax, grammars and the lexicon; - cognitive, mathematical and computational models of language processing; - models of communication by language; - lexical semantics and ontologies; - word segmentation, tagging and chunking; - parsing, both syntactic and deep; - generation and summarisation; - paraphrasing, textual entailment and question answering; - speech recognition, text-to-speech and spoken language understanding; - multimodal and natural language interfaces and dialogue systems; - information retrieval, information extraction and knowledge base linking; - machine learning for natural language; - modelling of discourse and dialogue; - sentiment analysis, opinion mining and social media; - multilingual processing, machine translation and translation aids; - applications, tools and language resources; - system evaluation methodology and metrics. In all relevant areas, we encourage authors to include analysis of the influence of theories (intuitions, methodologies, insights), to technologies (computational algorithms, methods, tools, data), and/or contributions of technologies to theory development. In technologically oriented papers, we encourage in-depth analysis and discussion of errors made in the experiments described, if possible linking them to the presence or absence of linguistically-motivated features. Contributions that display and rigorously discuss future potential, even if not (yet) attested in standard evaluation, are welcome. 
PAPER REQUIREMENTS Papers should describe original work; they should emphasise completed work or well-advanced ongoing research rather than intended work, and should indicate clearly the state of completion of the reported results. Wherever appropriate, concrete evaluation results should be included. Submissions will be judged on correctness, originality, technical strength, significance and relevance to the conference, and interest to the attendees. Submissions presented at the conference should mostly contain new material that has not been presented at any other meeting with publicly available proceedings. Papers that are being submitted in parallel to other conferences or workshops must indicate this on the title page, as must papers that contain significant overlap with previously published work. REVIEWING Reviewing will be double blind. It will be managed by an international Conference Program Committee consisting of Program Chairs, members of the Scientific Advisory Board and Area Chairs, who will be assisted by invited reviewers. Important Notice [1] In order to allow participants to be acquainted with the published papers ahead of time which in turn should facilitate discussions at Coling 2014, we have set the official publication date two weeks before the conference, i.e., on August 11, 2014. On that day, the papers will be available online for all participants to download, print and read. If your employer is taking steps to protect intellectual property related to your paper, please inform them about this timing. [2] While submissions are anonymous, we strongly encourage authors to plan for depositing language resources and other data as well as tools used and/or developed for the experiments described in the papers, if the paper is accepted. 
In this respect, we encourage authors then to deposit resources and tools to available open-access repositories of language resources and/or repositories of tools (such as META-SHARE, Clarin, ELRA, LDC or AFNLP/COCOSDA for data, and github, sourceforge, CPAN and similar for software and tools) and refer to them instead of submitting them with the paper, even though it will also be an open possibility (through the START system). The details will be given in the submission site for camera-ready versions of accepted papers. [3] There will be a separate call for demonstrations. Accepted papers on demonstrations will also be included in the proceedings. INSTRUCTIONS FOR AUTHORS For Coling 2014, there will be one category of research papers only. All of the papers will be included in conference proceedings, this time in electronic form only. The maximum submission length is 8 pages (A4), plus two extra pages for references. Authors of accepted papers will be given additional space in the camera-ready version to reflect space needed for changes stemming from reviewers' comments. Authors can indicate their preference for presentation mode (i.e. oral or poster presentation) in the submission form, and the reviewers will recommend an appropriate mode of presentation to the program committee which will then decide. There will be no distinction in the proceedings between research papers presented orally vs. as posters. Papers shall be submitted in English, anonymised with regard to the authors and/or their institution (no author-identifying information on the title page nor anywhere in the paper), including referencing style as usual. Authors should also ensure that identifying meta-information is removed from files submitted for review. Papers must conform to official Coling 2014 style guidelines, which are available here http://www.coling-2014.org/doc/coling2014.zip. Submission and reviewing will be managed online by the START system.
The only accepted format for submitted papers is in Adobe's PDF. Submissions must be uploaded on the START system (https://www.softconf.com/coling2014/main/) by the submission deadlines; submissions after that time will not be reviewed. To minimise network congestion, we request authors to upload their submissions as early as possible. IMPORTANT DATES February 2014: Opening of the submission website https://www.softconf.com/coling2014/main/ March 21, 2014: Paper submission deadline May 9-12, 2014: Author response period May 23, 2014: Author notification June 6, 2014: Camera-ready PDF due August 11, 2014: Official paper publication date August 25-29, 2014: Main conference From hamon at LIMSI.FR Wed Feb 26 08:46:30 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Wed, 26 Feb 2014 09:46:30 +0100 Subject: Appel: ESSLLI 2014 Student Session Message-ID: Date: Tue, 25 Feb 2014 11:01:40 +0100 From: Ronald de Haan Message-ID: X-url: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ Please forward to students. Apologies for the multiple messages. *2nd Call for Papers* *ESSLLI 2014 STUDENT SESSION* Held during the 26th European Summer School in Logic, Language and Information Tübingen, Germany, August 11-22, 2014 *Deadline for submissions: April 1st, 2014* https://www.easychair.org/conferences/?conf=essllistus2014 *ABOUT:* The Student Session of the 26th European Summer School in Logic, Language, and Information (ESSLLI) will take place in Tübingen, Germany on August 11-22, 2014. We invite submissions of original, unpublished work from students in any area at the intersection of Logic & Language, Language & Computation, or Logic & Computation. Submissions will be reviewed by several experts in the field, and accepted papers will be presented orally or as posters and will appear in the student session proceedings by Springer. This is an excellent opportunity to receive valuable feedback from expert readers and to present your work to a diverse audience.
*SEPARATE POSTER SESSION:* Note that there are two separate kinds of submissions, one for the oral presentations and one for the posters. This means that papers can be directly submitted as posters. Reviewing and ranking will be done separately. We particularly encourage submissions for posters, as they offer an excellent opportunity to present research in progress. *SUBMISSION GUIDELINES:* All authors must be students, and submissions may be singly or jointly authored. Submissions should not be longer than 8 pages for an oral presentation and 4 pages for a poster presentation (including examples and references). Submissions must be anonymous, without any identifying information. More detailed guidelines regarding submission can be found on the Student Session website: http://www.kr.tuwien.ac.at/drm/dehaan/stus2014/ . *FURTHER INFORMATION:* Please direct inquiries about submission procedures or other matters relating to the Student Session to dehaan at kr.tuwien.ac.at. ESSLLI 2014 will feature a wide range of foundational and advanced courses and workshops in all areas of Logic, Language, and Computation. For further information, including registration information and course listings, and for general inquiries about ESSLLI 2014, please consult the main ESSLLI 2014 page: http://www.esslli2014.info/. Kind regards, The ESSLLI 2014 Student Session Organization Committee Chair: - Ronald de Haan (Technische Universität Wien) LoCo (Logic and Computation) co-chairs: - Zoé Christoff (Universiteit van Amsterdam) - Aybüke Özgün (Université
de Lorraine) LoLa (Logic and Language) co-chairs: - Philip Schulz (Universiteit van Amsterdam) - Thomas Brochhagen (Universität Düsseldorf) LaCo (Language and Computation) co-chairs: - Miriam Kaeshammer (Universität Düsseldorf) - Ramon Ziai (Universität Tübingen) From hamon at LIMSI.FR Thu Feb 27 09:38:39 2014 From: hamon at LIMSI.FR (Thierry Hamon) Date: Thu, 27 Feb 2014 10:38:39 +0100 Subject: Appel: Journee commune IA et TAL, 25 mars 2014 Message-ID: Date: Thu, 27 Feb 2014 10:31:47 +0100 From: Pierre Zweigenbaum Message-ID: <530F0603.9050606 at limsi.fr> X-url: http://www.afia.asso.fr/tiki-index.php?page=Journée+commune+AFIA+-+ATALA+2014 Call for submissions Call for participation Artificial intelligence and natural language processing Language, machine learning and data mining AFIA - ATALA joint workshop day Association française pour l'intelligence artificielle Association pour le traitement automatique des langues http://www.afia.asso.fr/tiki-index.php?page=Journée+commune+AFIA+-+ATALA+2014 Paris, 25 March 2014 INALCO Auditorium Invited talk: François Yvon, Méthodes statistiques en traitement des langues: état des lieux et perspectives (Statistical methods in language processing: current state and outlook) THEME Natural language processing (NLP; TAL in French), which aims to carry out tasks that draw on human cognition, has long-standing ties with artificial intelligence (AI).
The aim of this day is to strengthen the existing ties between these two fields, with an emphasis on two particularly active themes: * First, machine learning has taken on a large role in NLP research, as shown by the papers published at the field's major conferences and by survey works such as the book by Gaussier and Yvon (2011, 2012). This mostly means supervised learning, with renewed interest in unsupervised learning and in exploiting large unannotated corpora. * Second, data mining is often used to explore corpora and to acquire linguistic knowledge and other resources for NLP; it is also itself a user of NLP in application-oriented text mining. Both can help meet the challenge of the massive textual data that makes up a large share of "big data" and, just as relevantly, help develop methods that advance the field. We invite presentations that offer a state-of-the-art survey or a tutorial on machine learning and data mining in relation to natural language processing. Presentations of this kind on other topics at the interface between NLP and AI will also be considered. WORKSHOP COORDINATORS Pierre Zweigenbaum & Brigitte Grau (LIMSI-CNRS) SUBMISSION PROCEDURE Proposals are expected as a one-page abstract in plain-text format, to be sent to: pz at limsi.fr Submission deadline: 4 March 2014 Notification date: 10 March 2014 The body of the message must contain the following information: Title of the presentation: Type of presentation: state-of-the-art survey, tutorial Authors: Keywords: Abstract or outline of the presentation:
PROGRAM COMMITTEE (still being completed) Alexandre Allauzen (LIMSI) Thierry Charnois (LIPN) Vincent Claveau (IRISA) Gaël Dias (GREYC) Éric Gaussier (LIG) Brigitte Grau (LIMSI) Alexis Nasr (LIF) Isabelle Tellier (LATTICE) Yannick Toussaint (LORIA) Mathieu Valette (INALCO) Pierre Zweigenbaum (LIMSI)