From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:49:35 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Tue, 6 Dec 2011 21:49:35 +0100
Subject: Book: Advanced Language Technologies for Digital Libraries, Springer
Message-ID:

Date: Wed, 30 Nov 2011 17:09:42 +0000
From: Frédérique Segond
Message-ID:
X-url: http://www.springer.com/computer/database+management+%26+information+retrieval/book/978-3-642-23159-9?changeHeader

New volume published by Springer:

"Advanced Language Technologies for Digital Libraries"
International Workshops on NLP4DL 2009, Viareggio, Italy, June 15, 2009 and AT4DL 2009, Trento, Italy, September 8, 2009

Series: Lecture Notes in Computer Science (http://www.springer.com/series/558), Vol. 6699
Subseries: Theoretical Computer Science and General Issues (http://www.springer.com/series/7407)

Bernardi, Raffaella; Chambers, Sally; Gottfried, Bjorn; Segond, Frederique; Zaihrayeu, Ilya (Eds)

For more details: http://www.springer.com/computer/database+management+%26+information+retrieval/book/978-3-642-23159-9?changeHeader

-------------------------------------------------------------------------
Message distributed by the Langage Naturel list
Information, subscription: http://www.atala.org/article.php3?id_article=48
English version:
Archives: http://listserv.linguistlist.org/archives/ln.html
          http://liste.cines.fr/info/ln

The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues)
Information and membership: http://www.atala.org/
-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:51:20 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Tue, 6 Dec 2011 21:51:20 +0100
Subject: Call: EACL 2012 call for demos, last reminder
Message-ID:

Date: Wed, 30 Nov 2011 17:21:38 +0000
From: Frédérique Segond
Message-ID:
X-url: http://eacl2012.org/home/index.html

The Thirteenth Conference of the European Chapter of the
Association for Computational Linguistics
http://www.eacl2012.org
Avignon, France
April 23-27, 2012

Call for demos

EACL 2012 features a special session for demonstrations. Topics of interest cover all aspects of computational linguistics, as outlined in the main conference call for papers. Demos will be presented at specific sessions from the main program of the conference.

The demo track allows researchers and practitioners to demonstrate their new and innovative systems while interacting with their audience in an informal setting. Demo submissions should concern mature systems or prototypes in which computational linguistics or NLP technologies are used to solve practically important problems. Demo papers should make clear which aspects of the system will be demonstrated, and how the system is used to solve practical problems. They should include a discussion of the implementation and case studies of how the system is applied.

Requirements

At least one author of each accepted demo paper must register for and attend the conference, and demonstrate the system during the demo sessions (please see the Conference Policies). Authors of accepted demos may also be required to demonstrate their systems at additional events during the conference. Each accepted demo paper will be allocated four pages in the conference proceedings; no extra pages can be purchased. Demo authors are not required to transfer copyright.

Submission Procedure

Authors of demos should submit an original paper describing their proposed presentation. The paper is limited to 4 pages, excluding references, which are unlimited. The submission should follow the EACL 2012 main conference formatting styles and instructions for authors.

Proposals for demos should provide:
1. an overview of what the demonstration aims to achieve
2. how the demo illustrates novel ideas
3. any URLs that link to screen-shots, live demos, or related information
4.
equipment or facilities required for the system demonstration

Submission site: https://www.softconf.com/eacl2012/EACL-Demos-2012/

Reviewing

Demos will be blind peer-reviewed by two members of the Demos Program Committee, who will judge the originality, significance, quality, and clarity of each submission.

Demos Chair: Frédérique Segond (Objet Direct, France)
Contact: fsegond at objetdirect.com

PC Members

Sophia Ananiadou (University of Manchester, UK)
Jordi Atserias (Yahoo, Spain)
Galia Angelova (Bulgarian Academy of Sciences, Bulgaria)
Raffaella Bernardi (University of Trento, Italy)
Luca Dini (CELI, Italy)
Gregory Grefenstette (Exalead, France)
Michael Matthews (Yahoo, Spain)
Piek Vossen (Vrije Universiteit of Amsterdam, Netherlands)
Stelios Piperidis (ILSP, Greece)
Sebastian Padó (University of Heidelberg, Germany)
Ralf Steinberger (European Commission - Joint Research Centre (JRC), Italy)
Evelyne Viegas (Microsoft, USA)
Claire Waast (EDF, France)
Wlodek Zadrozny (IBM, USA)

Important Dates

* Paper submission deadline: December 4, 2011 (11:59pm Samoa Time (UTC/GMT -11 hours))
* Notification of acceptance: February 6, 2012
* Camera ready copies due: March 9, 2012
* EACL 2012 Conference: April 23-27, 2012

Demo submission site: https://www.softconf.com/eacl2012/EACL-Demos-2012/

From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:53:07 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Tue, 6 Dec 2011 21:53:07 +0100
Subject: Thesis: Loic
Kervajan, Machine translation Fr/LSF
Message-ID:

Date: Wed, 30 Nov 2011 20:39:50 +0100
From: Loïc Kervajan
Message-ID: <006001ccaf97$d42615d0$7c724170$@fr>
X-url: http://www.kervajan.fr

------------------------------------------------------------------------
Dear friends,

I am pleased to announce that I will defend my doctoral thesis on Friday, December 9, 2011 at 2 pm.

The thesis is in linguistics applied to natural language processing. Its subject is:

Contribution à la traduction automatique français/Langue des Signes Française au moyen de personnages virtuels
(Contribution to machine translation from French to French Sign Language by means of virtual characters)

The defense, which is public and interpreted in LSF, will take place at the Université de Sciences Humaines d'Aix-en-Provence (29, av Robert Schuman, 13621 Aix-en-Provence), in room C.212 *

To help with the organization, please confirm your attendance by return email.

Best regards,
Loïc Kervajan

Email: loic at kervajan.fr
Personal site: http://www.kervajan.fr

* To get to C212: Enter through the main hall. Go up the main staircase (on the right as you enter). Continue straight ahead to the end of the corridor. Go up one more floor. Take the corridor on the left; C212 is on the left... it will be signposted!

From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:54:45 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Tue, 6 Dec 2011 21:54:45 +0100
Subject: Call: TextGraphs-7 Workshop associated to ACL 2012
Message-ID:

Date: Thu, 01 Dec 2011 19:39:55 +0100
From: Gaël Dias
Message-ID: <4ED7C9FB.3020206 at unicaen.fr>
X-url: http://www.textgraphs.org/ws12

======================================================================
CALL FOR PAPERS

TextGraphs-7: Graph-based Methods for Natural Language Processing
(http://www.textgraphs.org/ws12)

Special Theme: Understanding and Mining Social Media Using Graphs: Information Propagation in Blogs and Tweets

Workshop of ACL 2012
July 12, 2012, Jeju, Republic of Korea

Submission deadline: March 18, 2012
========================================================================

(1) Workshop Description
(2) Workshop Submissions
(3) Important Dates
(4) Author Instructions
(5) Invited Speaker
(6) Poster Session
(7) Workshop Organisers
(8) Program Committee
(9) Contacts

==================================================
(1) Workshop Description
==================================================

Recent years have shown an increased interest in integrating various aspects of the field of Graph Theory (GT) into Natural Language Processing (NLP).
Many language phenomena can be naturally put into graph-based representations, and in the last 5 years a significant number of NLP applications have adopted efficient and elegant solutions from graph-theoretical frameworks. These applications range from part-of-speech tagging, word sense disambiguation, ontology learning and parsing to information extraction, semantic role assignment, summarization and sentiment analysis, to name but a few.

The emergence of new fields of research focusing on social media such as Twitter and Facebook has brought graph-based methods even more into focus. In particular, graph-based algorithms are used to explore social network connections and the propagation of information in those networks, in addition to exploring the connections between language entities. As a consequence, many new applications have been emerging, such as rumor proliferation, e-reputation, multiple identity detection, language dynamics learning and future events prediction, to name but a few.

TextGraphs-7 aims to bring together researchers interested in GT applied to NLP and provide an environment for further integration of graph-based solutions into NLP tasks. The workshop also encourages discussions about theoretical justifications from GT that explain empirical results obtained in the NLP community. As a consequence, a deeper understanding of new theories of graph-based algorithms is likely to help create new approaches and widen the usage of graphs for NLP applications.

=================================================
(2) Workshop Submissions
=================================================

TextGraphs-7 invites innovative submissions on, but not limited to, the following three main topics:

Topic I: Graph-based methods for Information Retrieval, Information Extraction and Text Mining
- Graph-based methods for word sense disambiguation,
- Graph-based representations for ontology learning,
- Graph-based strategies for semantic relations identification,
- Encoding semantic distances in graphs,
- Graph-based techniques for document navigation and visualization,
- and more subtopics;

Topic II: New graph-based methods for NLP applications
- Random walk methods in graphs,
- Spectral graph clustering,
- Semi-supervised graph-based methods,
- Methods and analyses for statistical networks,
- Small world graphs,
- Dynamic graph representations,
- Topological and pretopological analysis of graphs,
- and more subtopics;

Topic III: Graph-based methods for applications on social networks
- Rumor proliferation,
- E-reputation,
- Multiple identity detection,
- Language dynamics studies,
- Surveillance systems,
- and more subtopics.

======================================================
(3) Important Dates
======================================================

Submission deadline: March 18, 2012
Notification of Acceptance: April 15, 2012
Camera-ready deadline: April 30, 2012
Workshop: July 12, 2012

====================================================
(4) Author Instructions
====================================================

Submissions will consist of (1) regular full papers of maximum 8 pages (for well established research), (2) regular short papers of maximum 4 pages (for on-going research), (3) position papers of maximum 4 pages (for prospective ideas) and (4) demo papers of maximum 4 pages (for software and architectures). One extra page will be accepted for the Reference section for all submission types.
TextGraphs-7 will have a blind review policy. In order to facilitate blind reviewing, the authors should omit their names and affiliations from the paper. Furthermore, all self-references that may reveal the author identity must be avoided. Papers that do not conform to these requirements will be rejected without review.

All submissions must be in PDF format and must be formatted using the ACL 2012 Style Files available at http://www.acl2012.org/call/sub01.asp. Papers should be submitted to the START Conference Manager available at the following url: https://www.softconf.com/acl2012/textgraphs-7/.

=======================================================
(5) Invited Speaker
=======================================================

TextGraphs-7 has the pleasure of inviting Lada Adamic to present her work on information propagation in blogs and tweets.

Lada Adamic is an associate professor in the School of Information at the University of Michigan Ann Arbor. She is also affiliated with the Center for the Study of Complex Systems and the Electrical Engineering and Computer Science Department. Her research interests center on information dynamics in networks: how information diffuses, how it can be found, and how it influences the evolution of a network's structure. She worked previously at Hewlett-Packard's Information Dynamics Lab. Her projects have included identifying expertise in online question and answer forums, studying the dynamics of viral marketing, and characterizing the structure in blogs and other online communities. She recently received the 10-year best paper award from the ACM/IEEE/WIC Conference 2011.

==================================================
(6) Poster Session
==================================================

Authors of accepted papers and demos will be able to present their work on a poster in addition to the oral presentation.
The posters will be displayed in the workshop room all day and there will be a short poster session in the workshop schedule.

==================================================
(7) Workshop Organisers
==================================================

Ahmed Hassan
Microsoft Research, Redmond, WA, United States
Phone: +14257222849
Email: hassanam at microsoft.com

Gaël Dias
University of Caen Basse-Normandie, Caen, France
Phone: +33231567487
Email: gael.dias at unicaen.fr

Irina Matveeva
Dieselpoint Inc., Chicago, United States
Phone: +17737047852
Email: imatveeva at dieselpoint.com

==================================================
(8) Program Committee
==================================================

Aitor Soroa, University of the Basque Country, Spain
Alessandro Moschitti, University of Trento, Italy
Animesh Mukherjee, ISI Foundation, Italy
Antoine Widlöcher, University of Caen Basse-Normandie, France
Arzucan Özgür, Istanbul University, Turkey
Bruno Crémilleux, University of Caen Basse-Normandie, France
Chris Biemann, Darmstadt University of Technology, Germany
Dragomir Radev, University of Michigan, USA
Fabio Massimo Zanzotto, University of Rome, Italy
Gina-Anne Levow, University of Washington, USA
Guillaume Cleuziou, University of Orléans, France
Michael Gamon, Microsoft Research, USA
Patrick Pantel, Microsoft Research, USA
Paul Tarau, University of North Texas, USA
Philippe Muller, Paul Sabatier University, France
Rada Mihalcea, University of North Texas, USA
Swapna Somasundaran, Siemens Corporate Research, USA
Torsten Zesch, Darmstadt University of Technology, Germany
Uwe Quasthoff, University of Leipzig, Germany
Zornitsa Kozareva, ISI, University of Southern California, USA

====================================================
(9) Workshop Contacts
====================================================

For any questions, please email us at textgraphs2012 at gmail.com or visit the web site http://www.textgraphs.org/ws12

From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:58:15 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Tue, 6 Dec 2011 21:58:15 +0100
Subject: Job: Post-doc in textometry, SDL
Message-ID:

Date: Thu, 1 Dec 2011 12:08:21 -0800 (PST)
From: Georgeta Cislaru
Message-ID: <1322770101.8412.YahooMailNeo at web39424.mail.mud.yahoo.com>
X-url: http://www.univ-paris3.fr/anr-ecritures/

Call for applications
Post-doctoral contract

Position title: "Longitudinal textometric approach to aligned drafts"

Location: Université Sorbonne nouvelle Paris 3, France, 19 rue des Bernardins and 46 rue Saint-Jacques, 75005 Paris.

Contract start: January 1, 2012 or February 1, 2012

Duration: 12 months

Salary: approximately 1900 euros net per month (the salary may vary slightly depending on the candidate's experience and profile).

Project: ANR ECRITURES "Brouillons d'écrits sociaux : approche génétique, discursive et textométrique de l'écriture professionnelle" (Drafts of Social Workers' Writings: a Genetic, Discursive and Textometric Study of Professional Writing)
Website: www.univ-paris3.fr/anr-ecritures/

Affiliation: EA 2290 SYLED (CLESTHIA federation), Université Sorbonne nouvelle Paris 3, France.

Candidate profile: the candidate holds a doctorate in genetic linguistics, corpus linguistics or natural language processing, has mastered the basics of natural language processing (preferably with a textometric approach), and is interested in developing textometric methods and tools. An interest in qualitative analysis methods is also expected (discourse analysis, semantics, lexicology, syntax...).

Post-doc's mission: automatic processing of the corpus of drafts, both from a genetic perspective and with a view to developing a longitudinal textometric method for aligned drafts (genetic linguistics and natural language processing); participation in collecting and annotating the corpus. The post-doctoral researcher will contribute to the development of the tools and the method and will take charge of part of the longitudinal analyses. The post-doctoral researcher will disseminate this work through an international publication and will take part in the overall management of the project.

Summary of the ECRITURES project

Work programme funded by the ANR (Agence nationale de la Recherche), from April 1, 2011 to March 31, 2014.

The ECRITURES project proposes a linguistic approach to language performance. It aims to account for the impact of linguistic, discursive, social and cognitive constraints on the written production of social workers in the field of child protection.

Fields concerned: discourse analysis, corpus linguistics, textual genetics, social cognition, psycholinguistics.

Objectives:
- Identify, through discursive and genetic analyses of social workers' writings, the linguistic aspects of a writing dynamic that is socially and institutionally framed;
- Contribute analytical and methodological reflection towards the development of writing-tracking software; develop a longitudinal textometric method for aligned drafts;
- Propose a grid and criteria for teaching developments aimed at training in professional writing.

Corpus:
- a sub-corpus of drafts of social workers' writings (SAFE de Caen) produced with a standard word processor and recording the successive versions of the educational reports;
- a sub-corpus produced with writing-tracking software (Inputlog);
- a "control" corpus (academic reports, other genres possible).

Impact of the project beyond the discipline:
Beyond other disciplines such as psychology or sociology, the results of this project will also be of interest to social workers, in collaboration with whom it is being carried out.

To apply, please send the following to Serge Fleury (serge.fleury at univ-paris3.fr) and Georgeta Cislaru (georgeta.cislaru at univ-paris3.fr):
- a cover letter;
- a detailed CV;
- the thesis defense report and a summary of the thesis.

Deadline: December 15, 2011

From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:59:45 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Tue, 6 Dec 2011 21:59:45 +0100
Subject: Call: NAACL 2012 Workshop on Computational Linguistics for Literature
Message-ID:

Date: Fri, 02 Dec 2011 15:26:52 -0500
From: Stan Szpakowicz
Message-ID: <4ED9348C.4060409 at eecs.uottawa.ca>
X-url: http://www.naaclhlt2012.org/conference/conference.php

Workshop on Computational Linguistics for Literature

Co-located with
The 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

June 7 or 8, 2012
Montréal, Québec, Canada

All information, including announcements and updates, can be found on the workshop's Web site: https://sites.google.com/site/clfl2012/

MOTIVATION AND SCOPE

The amount of literary material available on-line keeps growing rapidly. Not only are there machine-readable texts in libraries, collections and e-book stores, but there is also more and more “live” literature – e-zines, blogs, self-published e-books and so on. There is a need for tools to help users navigate, visualize and appreciate the high volume of available literature.

Literary texts are quite different from technical and formal documents, which have been the focus of NLP research thus far. Most forms of statistical language processing rely on lexical information in one way or another.
In literature, the primary mode is narrative rather than exposition. Stories may be cognitively easier to read than certain expository genres, such as scientific documents, but narrative is a challenging form of discourse for NLP tools and methods. For instance, literary prose lacks the overt lexical clues and structural markers typically leveraged in the processing of more structured genres. Also, even conventional literary texts exhibit far less unity of time, space and topic than most formal discourse. Learning to handle these challenges in literary data may help move past heavy reliance on surface clues in general.

Literature also differs from other genres because of the needs of its typical audience. For instance, reading, searching or browsing literature online is a different task than searching for the latest news on a particular topic. Search criteria would be rather abstract: not a keyword, but a literary style, similarity to another work, point of view and so on. When looking for a summary or a digest, a reader may prefer to know or visualize a text's broad characteristics rather than facts which summarize the plot.

We invite papers that touch upon these areas, but also welcome other ideas which promote the processing of literary narrative or related forms of discourse.

TOPICS OF INTEREST

Note: Papers on other closely related topics will also be considered

* the needs of the readers and how those needs translate into meaningful NLP tasks;
* searching for literature;
* recommendation systems for literature;
* computational modelling of narratives, computational narratology;
* summarization of literature;
* differences between literature and other genres as relevant to computational linguistics;
* discourse structure in literature;
* emotion analysis for literature;
* profiling and authorship attribution;
* identification and analysis of literature genres;
* building and analysing social networks of characters;
* generation of literary narrative, dialogue or poetry;
* modelling dialogue literary style for generation.

SUBMISSION

We invite submission of long and short papers, describing completed or ongoing research on systems, studies, theories and models which can inform the area of computational linguistics for literature. Long papers should be at most 8 pages, plus unlimited space for references. Short papers should be at most 4 pages plus references, and can be appropriate for either oral or poster presentation. Accepted long papers, and perhaps selected short papers, will be presented as talks. In addition, we encourage submission of position papers -- mapping out research ideas and programs -- of up to 6 pages plus references.

There will be double-blind review: submissions must be anonymized.

Style files and sample PDFs are available on this page: http://www.naaclhlt2012.org/conference/conference.php

Submission page: please visit later

IMPORTANT DATES (all deadlines 11:59 pm
Hawaii Time)

Submission deadline: February 20, 2012
Notification of acceptance: March 23, 2012
Camera-ready version due: April 10, 2012
Workshop: June 7 or June 8, 2012

PROGRAM COMMITTEE

* Cecilia Ovesdotter Alm (Rochester Institute of Technology)
* Nicholas Dames (Columbia University)
* Hal Daumé III (University of Maryland)
* Anna Feldman (Montclair State University)
* Mark Finlayson (MIT)
* Pablo Gervás (Universidad Complutense de Madrid)
* Roxana Girju (University of Illinois at Urbana-Champaign)
* Amit Goyal (University of Maryland)
* Katherine Havasi (MIT Media Lab)
* Matthew Jockers (Stanford University)
* James Lester (North Carolina State University)
* Inderjeet Mani (Children's Organization of Southeast Asia)
* Kathy McKeown (Columbia University)
* Saif Mohammad (National Research Council, Canada)
* Vivi Nastase (HITS gGmbH)
* Rebecca Passonneau (Columbia University)
* Livia Polanyi (LDM Associates)
* Owen Rambow (Columbia University)
* Michaela Regneri (Saarland University)
* Reid Swanson (University of California, Santa Cruz)
* Marilyn Walker (University of California, Santa Cruz)
* Janice Wiebe (University of Pittsburgh)

CO-ORGANIZERS

* David Elson (Google)
* Anna Kazantseva (University of Ottawa)
* Rada Mihalcea (University of North Texas)
* Stan Szpakowicz (University of Ottawa)

CONTACT INFORMATION

Send general inquiries to clfl.workshop at gmail.com

From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:03:05 2011
From: thierry.hamon at
UNIV-PARIS13.FR (Thierry Hamon)
Date: Tue, 6 Dec 2011 22:03:05 +0100
Subject: School: WSLST 2012
Message-ID:

Date: Sun, 4 Dec 2011 11:03:06 +0100
From: "GRLMC"
Message-ID: <8426A6D3003445D1A0850B9DFC44555F at Carlos1>
X-url: http://grammars.grlmc.com/wslst2012/

*********************************************************************
2012 INTERNATIONAL WINTER SCHOOL IN LANGUAGE AND SPEECH TECHNOLOGIES

WSLST 2012

(formerly International PhD School in Language and Speech Technologies)

Tarragona, Spain
January 23-27, 2012

Organized by:
Research Group on Mathematical Linguistics (GRLMC)
Rovira i Virgili University

http://grammars.grlmc.com/wslst2012/
*********************************************************************

AIM:

WSLST 2012 offers a broad and intensive series of lectures on language and speech technologies at different levels. The students choose their preferred courses according to their interests and background. Instructors are top names in their respective fields. The School intends to help students initiate and foster their research career.

The previous event in this series was SSLST 2011 (http://grammars.grlmc.com/sslst2011/).

ADDRESSED TO:

Undergraduate and graduate students from around the world. The most appropriate degrees are Computer Science and Linguistics. Other students (for instance, from Mathematics, Electrical Engineering, Philosophy, or Cognitive Science) are welcome too.

The School is also appropriate for people more advanced in their career who want to keep up to date with developments in the field.

There is no overlap in the class schedule.

COURSES AND PROFESSORS:

- Eneko Agirre (U Basque Country), Semantic Processing of Text: Word Sense Disambiguation, Entity Linking and Semantic Similarity [introductory/intermediate, 8 hours]
- William J.
Byrne (Cambridge), Weighted Finite State Transducers in Statistical Machine Translation [introductory/advanced, 6 hours]
- Marcello Federico (Fondazione Bruno Kessler, Trento), Statistical Language Modeling [introductory/advanced, 8 hours]
- Ralph Grishman (New York), Information Extraction [intermediate, 8 hours]
- John Nerbonne (Groningen), Using Edit-Distance to Understand Linguistic Variation [introductory/advanced, 4 hours]
- Geoffrey K. Pullum (Edinburgh), The Formal Properties of Human Languages: Description with a View to Implementation [introductory/intermediate, 8 hours]
- Jian Su (Institute for Infocomm Research, Singapore), Coreference Resolution and Discourse Relation Recognition [advanced, 4 hours]
- Christoph Tillmann (IBM T.J. Watson Research Center), Simple and Effective Algorithms and Models for Non-hierarchical Statistical Machine Translation [intermediate, 6 hours]
- David R. Traum (U Southern California), Approaches to Dialogue Systems and Dialogue Management [introductory, 8 hours]
- Dekai Wu (Hong Kong U of Science and Technology), Syntactic and Semantic Statistical Machine Translation [introductory/advanced, 8 hours]

REGISTRATION:

It has to be done on line at http://grammars.grlmc.com/wslst2012/Registration.php

FEES:

They are variable, depending on the number of courses each student takes. The rule is:

1 hour =
- 10 euros (for payments until November 13, 2011),
- 15 euros (for payments after November 13, 2011).

PAYMENT PROCEDURE:

The fees must be paid to the School's bank account:

Uno-e Bank (Julian Camarillo 4 C, 28037 Madrid, Spain): IBAN: ES3902270001820201823142 - Swift code: UNOEESM1 (account holder: Carlos Martin-Vide GRLMC; account holder's address: Av. Catalunya, 35, 43002 Tarragona, Spain)

Please mention WSLST 2012 and your full name in the subject. A receipt will be provided on site.

Remarks:

- Bank transfers should not involve any expense for the School.
- People claiming early registration will be requested to prove that they gave the bank transfer order by the deadline.
- Students may be refunded only if a course is cancelled because the instructor is unavailable.

People registering on site at the beginning of the School must pay in cash. For the sake of local organization, however, it is strongly recommended to register earlier.

ACCOMMODATION:

Information about accommodation is available on the website of the School.

CERTIFICATES:

Students will receive a certificate stating the courses attended, their contents, and their duration.

IMPORTANT DATES:

Announcement of the programme: September 26, 2011
Start of registration: September 26, 2011
Early registration deadline: November 13, 2011
Start of the School: January 23, 2012
End of the School: January 27, 2012

QUESTIONS AND FURTHER INFORMATION:

Florentina-Lilica Voicu: florentinalilica.voicu at urv.cat

WEBSITE:

http://grammars.grlmc.com/wslst2012/

POSTAL ADDRESS:

WSLST 2012
Research Group on Mathematical Linguistics (GRLMC)
Rovira i Virgili University
Av.
Catalunya, 35 43002 Tarragona, Spain Phone: +34-977-559543 Fax: +34-977-558386 ACKNOWLEDGEMENTS: Diputació de Tarragona Universitat Rovira i Virgili ------------------------------------------------------------------------- Message distributed by the Langage Naturel list Information, subscription: http://www.atala.org/article.php3?id_article=48 English version : Archives: http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues) Information and membership: http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:05:02 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:05:02 +0100 Subject: These: Juliette Kahn, Performance et confiance en identification biometrique vocale Message-ID: Date: Mon, 05 Dec 2011 09:27:19 +0100 From: Juliette KAHN Message-ID: <20111205092719.18459ficlldm6vs7 at webmail.univ-avignon.fr> Hello everyone, I am pleased to invite you to my thesis defense, which will take place on December 19, 2011 at 2:30 pm in the Blaise Pascal lecture hall of the Centre d'Enseignement et de Recherche en Informatique (CERI) at the Université d'Avignon et des Pays de Vaucluse (339, chemin des Meinajaries-Agroparc BP 91228-84911 AVIGNON Cedex 9-FRANCE). You are also welcome at the reception that will follow the defense. Thesis title Parole de locuteur : performance et confiance en identification biométrique vocale (Speaker's speech: performance and confidence in voice biometric identification) Abstract This thesis explores the biometric use of speech, which has many applications (security, smart environments, forensics, territorial surveillance, and authentication of electronic transactions). Speech is subject to many constraints that depend on the speaker's origins (geographical, social, and cultural) but also on the speaker's performative goals.
The speaker can be considered one factor of speech variation among others. In this work, we present elements of an answer to the following two questions: - Are all speech excerpts from the same speaker equivalent for recognizing that speaker? - How are the different sources of variation that directly or indirectly convey speaker specificity structured? We first build a protocol to evaluate the human ability to discriminate a speaker from a speech excerpt, using data from the NIST-HASR 2010 campaign. The task thus posed is difficult for our listeners, whether naive or more experienced. In this setting, we show that neither the (near-)unanimity of listeners nor their self-assessment of their judgments guarantees the correctness of the submitted answer. Second, we quantify the influence of the choice of speech excerpt on the performance of automatic systems. We used two databases, NIST and BREF, and two automatic speaker recognition (RAL, reconnaissance automatique du locuteur) systems, ALIZE/SpkDet (LIA) and Idento (SRI). The RAL systems, whether based on a UBM-GMM approach or on an i-vector approach, show large performance gaps, measured by a variation rate around the mean EER, Vr (for NIST, VrIdento = 1.41 and VrALIZE/SpkDet = 1.47; for BREF, Vr = 3.11), depending on the training file chosen for each speaker. These very large performance variations show how sensitive automatic systems are to the choice of speech excerpts, a sensitivity that must be measured and reduced to make RAL systems more reliable. To explain the importance of the choice of speech excerpts, we look for the most relevant cues for distinguishing the speakers in our corpora by measuring the effect of the Speaker factor on the variance of the cues.
F0 depends strongly on the Speaker factor, regardless of the vowel. Some phonemes are more speaker-discriminative: nasal consonants, fricatives, nasal vowels, and close-mid to open oral vowels. This work is a first step toward a more precise study of what the speaker is, both for human perception and for automatic systems. While we have shown that there is indeed a cepstral difference that leads to more or less effective models, it remains to be understood how to link the speaker to speech production. Finally, following this work, we wish to explore in more detail the influence of language on speaker recognition. Indeed, although our results indicate that in American English and in French the same categories of phonemes carry the most information about the speaker, this point remains to be confirmed, and the situation in other languages remains to be assessed. Jury Ms. Régine ANDRE-OBRECHT Professor, IRIT, Toulouse Reviewer Mr. Anders ERIKSSON Professor, Université de Göteborg, Göteborg (Sweden) Reviewer Ms. Martine ADDA-DECKER Professor, LPP, Paris Jury chair Mr. Edouard GEOFFROIS Engineer, DGA, Paris Examiner Ms. Solange ROSSATO Maître de Conférences, LIG, Grenoble Co-supervisor Mr.
Jean-François BONASTRE Professor, LIA, Avignon Thesis supervisor Juliette Kahn ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:07:07 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:07:07 +0100 Subject: Conf: Concept Types and Frames 2012, Duesseldorf Message-ID: Date: Mon, 05 Dec 2011 15:43:25 +0100 From: Timm Lichte Message-ID: <4EDCD88D.1000309 at phil.uni-duesseldorf.de> X-url: http://www.sfb991.uni-duesseldorf.de/concept-types-and-frames-in-language-cognition-and-science/ X-url: http://www.sfb991.uni-duesseldorf.de/en/sfb991/ First Announcement. A call for papers will be issued in January. ======================================================================== International Interdisciplinary Conference "Concept Types and Frames in Language, Cognition, and Science" (CTF’12) ======================================================================== Düsseldorf (GER), August 22–24, 2012 http://www.sfb991.uni-duesseldorf.de/concept-types-and-frames-in-language-cognition-and-science/ Organized and financed by DFG Coordinated Research Centre CRC 991/SFB 991 “The Structure of Representations in Language, Cognition, and Science” http://www.sfb991.uni-duesseldorf.de/en/sfb991/ Aims and scope --------------- A sequel to CTF’07 and CTF’09, CTF’12 is the third interdisciplinary conference to contribute to the development of a general frame theory of human concepts.
The conference explores the application of frames in linguistics and other sciences. It discusses foundational issues for the development of a theory of frames from the perspectives of general and computational linguistics, mathematics, cognitive psychology, cognitive neuroscience, ontology, philosophy of mind, and philosophy of science. Invited speakers ----------------- t.b.a. ======================================================================== Frames ------- CTF’12 explores fundamental aspects of a formal theory of frames understood as recursive attribute-value structures with functional attributes. The aspects include representation and model-theoretic interpretation, attributes and value types, and appropriate frame structures for different types of concepts. A special focus will be on dynamic aspects of frame theory: modeling dynamic frame components for event and process frames; exploring operations on frames such as type shifts and frame composition. Cognition ---------- CTF’12 explores empirical evidence for and consequences of the frame approach. It addresses the distinction of types of concepts, in particular relational and functional nominal concepts. Major topics include the grounding of concepts in the sensory-motor system and dynamic aspects of concept formation. Language --------- CTF’12 explores the application of a formalized theory of frames at various levels of linguistic description: lexical semantics of the different conceptual types of nouns and verbs, deep lexical decomposition, semantics of word formation, the syntax-semantics interface, syntactic and semantic composition, and anaphora and coherence in discourse.
Science -------- CTF’12 explores the applications of the frame approach in various fields of institutional categorization: e.g., the evolution of the frame notion in the history of philosophy; scientific frames and paradigm shifts; frames for legal concepts; frames in applied ontology; a frame model of the mind; and frame description of psychiatric disorders. ======================================================================== Important dates ---------------- Submission of abstracts (1000 words): March 15, Notification of acceptance: May 30 Programme committee -------------------- Sebastian Löbner (general chair), Rainer Osswald, Gottfried Vosgerau Organizational committee ------------------------- Dorothea Brenner, David Hommen, Timm Lichte, Daniel Schulzek ========================================================================= Topics for the CTF'12 Language --------- noun types and determination - definiteness and indefiniteness - possession - usage - processing - grammaticalization conceptual shifts of noun meanings - shifts and coercion - diachronic shifts decompositional frames for verbs and the syntax-semantics interface - verb classes in terms of frame types - verbal concepts and linking - construction-based decomposition of verb meanings - aspectual composition decompositional frames for nouns - frame analysis of legal terms - affordance attributes in noun frames and qualia syntactic and semantic composition - nominal concept types in the theory of semantic composition - semantic composition in terms of frames - syntax-driven frame composition frames in the semantics of word-formation - nominal compounding as frame combination - deverbal nouns as the result of frame transformation frames in discourse - frames and associative anaphora - the role of frames in discourse coherence Science -------- concepts - emergence of the modern frame notion of concepts - architecture of the mind - the role of function in object categorization frame analysis in 
institutional categorization - frames and paradigm shifts - frame analysis of legal concepts - frame analysis of psychiatric disorders - frame analysis in other scientific fields concept types and frames in ontology - attributes in ontology - relations and functions in ontology - formal approaches to ontology using functional relations Cognition ---------- embodiment of concepts - emergence of the modern frame notion of concepts - grounding of verb meanings in the sensory-motor system - affordances and the grounding of noun meanings in the sensory-motor system - common coding and simulation - frames and conceptual processing types of concepts - empirical evidence for different kinds of concepts - cognitive representation of dynamic concepts, in particular verb meanings neurocognition - frames and conceptual processing - neural correlates of attributes and frames - computational modelling of concept formation Frames ------- elements of frame theory - modelling the dynamics of verb meanings - modelling general operations on frames - modelling frame composition frame-related spaces - mathematical properties of value spaces - mathematical properties of attribute spaces - mathematical properties of frame spaces large frame systems - design and implementation ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:11:08 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:11:08 +0100 Subject: Appel: EACL 2012 Joint Workshop on
Information Retrieval and Hybrid Machine Translation Message-ID: Date: Tue, 6 Dec 2011 11:55:17 +0100 From: "Reinhard Rapp" Message-ID: X-url: http://www-lium.univ-lemans.fr/esirmt-hytra Apologies for multiple postings Please distribute to colleagues ======================================================================== Joint Workshop on EXPLOITING SYNERGIES BETWEEN INFORMATION RETRIEVAL AND MACHINE TRANSLATION (ESIRMT) and HYBRID APPROACHES TO MACHINE TRANSLATION (HyTra) Co-located with EACL 2012 (http://eacl2012.org/) Avignon, France April 23 and 24, 2012 Deadline for paper submissions: February 2, 2012 http://www-lium.univ-lemans.fr/esirmt-hytra ======================================================================== This two-day workshop addresses two specific but related research problems in computational linguistics. The ESIRMT event (1st day) aims at reducing the gap, both theoretical and practical, between information retrieval and machine translation research and applications. Although both fields have been already contributing to each other instrumentally, there is still too much work to be done in relation to solidly framing these two fields into a common ground of knowledge from both the procedural and paradigmatic perspectives. The HyTra event (2nd day) aims at sharing ideas among researchers developing and applying statistical, example-based, or rule-based machine translation systems and who wish to enhance their systems with elements from the other approaches. The joint workshop will provide participants with the opportunity of discussing research related to technology integration and system combination strategies at both the general level of cross-language information access and the specific level of machine translation technologies. Contributions are to be organized into two tracks, corresponding to ESIRMT and HyTra, respectively. 
Topics of interest include, but are not limited to: ESIRMT track: * machine translation and information retrieval hybridization * applications of MT and IR hybrid systems * IR techniques integrated in MT systems * MT techniques integrated in IR systems * any kind of innovative approach exploiting synergies between MT and IR * machine learning techniques for ranking HyTra track: * ways and techniques of MT hybridization * architectures for the rapid development of hybrid MT systems * hybrid systems dealing with underresourced languages and/or with morphologically rich languages * using linguistic information (morphology, syntax, semantics) to enhance statistical MT * bootstrapping rule-based systems from corpora * hybrid methods in spoken language translation * extraction of dictionaries from parallel and comparable corpora * machine learning techniques for hybrid MT * heuristics for limiting the search space in hybrid MT * alternative methods for the fair evaluation of the output of different types of MT systems * system combination approaches * open source tools and free language resources for hybrid MT SUBMISSION INSTRUCTIONS Authors are invited to submit papers on original and previously unpublished work. Formatting should follow the EACL 2012 specifications, using the LaTeX or MS-Word style files; see http://eacl2012.org/information-for-authors/index.html (paper length: up to 9 pages plus an extra page for references). Reviewing of papers will be double-blind, so submissions should not reveal the authors' identity. Submission is electronic, in PDF format, using the START submission system at https://www.softconf.com/eacl2012/ESIRMT-HyTra/ Double submission policy: Parallel submissions to other meetings or publications are possible, but the workshop contact person (see below) must be notified immediately. Authors of accepted papers will be invited to present their research at the workshop. The papers will be published in the electronic workshop proceedings.
These will be part of the EACL proceedings, published by ACL. They will also be made available online via the ACL Anthology. IMPORTANT DATES February 2, 2012: Paper submissions due February 24, 2012: Notification of acceptance March 9, 2012: Camera ready papers due April 23-24, 2012: Workshop in Avignon ORGANIZERS Contact person: Marta R. Costa-jussà (e-mail: marta.ruiz at barcelonamedia.org ) ESIRMT: Marta R. Costa-jussà (Barcelona Media Innovation Center), Patrik Lambert (University of Le Mans), Rafael E. Banchs (Institute for Infocomm Research) HyTra: Reinhard Rapp (Universities of Mainz and Leeds), Bogdan Babych (University of Leeds), Kurt Eberle (Lingenio GmbH), Tony Hartley (Toyohashi University of Technology and University of Leeds), Serge Sharoff (University of Leeds), Martin Thomas (University of Leeds) INVITED SPEAKERS Philipp Koehn (University of Edinburgh) TBA PROGRAMME COMMITTEE Jordi Atserias, Yahoo! Research , Barcelona, Spain Bogdan Babych, University of Leeds, UK Sivaji Bandyopadhyay, Jadavpur University, Kolkata, India Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland Chris Callison-Burch, Johns Hopkins University, Baltimore, USA Michael Carl, Copenhagen Business School, Denmark Oliver Culo, ICSI, University of California, Berkeley, USA Kurt Eberle, Lingenio GmbH, Heidelberg, Germany Andreas Eisele, Directorate-General for Translation, European Commission, Luxembourg Marcello Federico, Fondazione Bruno Kessler, Trento, Italy José A. R. 
Fonollosa, Universitat Politècnica de Catalunya, Barcelona, Spain Mikel Forcada, University of Alicante, Spain Alexander Fraser, Institute for Natural Language Processing (IMS), Stuttgart, Germany Johanna Geiß, Lingenio GmbH, Heidelberg, Germany Mireia Ginesti-Rosell, Lingenio GmbH, Heidelberg, Germany Silvia Hansen-Schirra, FTSK, University of Mainz, Germany Tony Hartley, Toyohashi University of Technology, Japan, and University of Leeds, England Gareth Jones, Dublin City University, Ireland Min-Yen Kan, National University of Singapore Udo Kruschwitz, University of Essex, UK Yanjun Ma, Baidu Inc. Beijing, China Maite Melero, Barcelona Media Innovation Center, Barcelona, Spain Haizhou Li, Institute for Infocomm Research, Singapore Reinhard Rapp, University of Mainz, Germany, and University of Leeds, UK Paul Schmidt, Institut for Applied Information Science, Saarbrücken, Germany Uta Seewald-Heeg, Anhalt University of Applied Sciences, Köthen, Germany Serge Sharoff, University of Leeds, UK Wade Shen, Massachusetts Institute of Technology, Cambridge, USA Fabrizio Silvestri, Istituto di Scienza e Tecnologie dell'Informazione, Pisa, Italy Harold Somers, CNGL, Dublin City University, Ireland Anders Søgaard, University of Copenhagen, Denmark Martin Thomas, University of Leeds, UK Jörg Tiedemann, University of Uppsala, Sweden Zygmunt Vetulani, University of Poznan, Poland ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:14:30 2011 From: thierry.hamon at
UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:14:30 +0100 Subject: Job: 2 opportunities at the Health On the Net (HON) Foundation in Geneva Message-ID: Date: Tue, 6 Dec 2011 21:11:48 +0100 (CET) From: "Natalia GRABAR" Message-ID: X-url: http://www.hon.ch/Global/job_application_NLP.html X-url: http://www.hon.ch/Global/job_application_developer.html Job opportunities at the Health On the Net (HON) Foundation in Geneva: - Research NLP, text processing, information retrieval - Developer Please send your CV, letter of motivation, and salary expectations with the subject header 'JOB NLP' to HONsecretariat at healthonnet.org. =================================== Research NLP, text processing, information retrieval http://www.hon.ch/Global/job_application_NLP.html =================================== Location: Geneva, Switzerland Type: fulltime Job description: The Health On the Net (HON) Foundation, a Geneva-based non-governmental international organization dedicated to improving the quality of on-line health information for patients as well as health professionals is looking for an NLP expert. We are looking for somebody who combines an interest in NLP research with good programming skills. 
Key responsibilities: * Mapping out the research agenda, participating in (EU) research projects, writing deliverables and scientific articles, attending project meetings, giving presentations * Design and development of innovative techniques for improving search quality on text and multimedia formats - document classification, (multi-lingual) IR, automatic readability assessment and detection of ethical criteria in non-structured health web documents * Building fully scalable prototypes to test new algorithms and migrating from the research prototype to delivering the solution into production Desired experience and skills: * Post-graduate degree in Computer Science, Computational Linguistics, or 3+ years experience in R&D * Background in information retrieval, document classification, or question answering * Experience in scripting languages (Perl/Python) * Experience in multi-threaded programming and web-service applications is a great plus * Good command of English; other languages are a plus We offer: * An international research environment * A multi-disciplinary team of NLP researchers, web developers, health professionals etc. * A group active in the international NLP research scene, currently participating in the EU project Khresmoi * The opportunity to develop programming as well as research skills More information on www.healthonnet.org. Please send your CV, letter of motivation, and salary expectations with the subject header 'JOB NLP' to HONsecretariat at healthonnet.org. =================================== Developer http://www.hon.ch/Global/job_application_developer.html =================================== Location: Geneva Switzerland Type: fulltime HON offers a wide range of services to citizens, medical professionals and health care providers. The HON Web development team makes HON services available to millions of users in order to access health websites respecting a set of quality criteria: the HONcode.
The team combines software development and Unix system administration expertise to run multilingual services available in 102 countries. We are looking for a software developer to contribute to major developments in a multidisciplinary team. Key responsibilities: * manage a large scale crawling robot and search engine dedicated to health sites * support NLP team on software development aspects, from research to production * act as software support on many projects (CMS, log analysis etc.) Desired experience and skills * engineering or bachelor's degree in Computer Science with at least 2 years of experience, or provable very strong programming achievements; * applied Agile methodology experience & continuous testing/integration; * programming languages: Perl in large structured projects and J2EE (web application); * JavaScript (jQuery) & CSS; * search engine and web crawling experience is a plus; * knowledge of Solr/Lucene; * any open source contribution is a plus. You are responsible and organized; creative and proactive. You are fluent in French with a very good knowledge of written and spoken English; German is a plus. Post to be filled immediately.
Please submit your CV, letter of motivation, and salary expectations to HONsecretariat at healthonnet.org ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 7 10:57:13 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 7 Dec 2011 11:57:13 +0100 Subject: Appel: EACL 2012 Workshops - Multiple Call For Papers (10 workshops) Message-ID: Date: Mon, 05 Dec 2011 14:45:29 +0100 From: Frederic Bechet Message-ID: <4EDCCAF9.9000606 at lif.univ-mrs.fr> X-url: http://eacl2012.org X-url: http://stp.lingfil.uu.se/atanlp/ X-url: http://www.lingured.info/clw2012/ X-url: http://sites.google.com/site/eaclcogws/ X-url: http://www.chss.montclair.edu/linguistics/DeceptionDetection.html X-url: http://www-lium.univ-lemans.fr/esirmt-hytra X-url: http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/ X-url: http://ilk.uvt.nl/LaTeCH2012/ X-url: http://www.eecs.uottawa.ca/~diana/eacl2012_social_media_workshop.html Call for papers - EACL 2012 workshops - April 23,24 Avignon, France http://eacl2012.org Ten workshops will be held in conjunction with the EACL 2012 conference in Avignon. They are listed in the mail below, with a link to each individual Call for Papers.
---- - EACL 2012 Workshop on Applications of Tree Automata Techniques in Natural Language Processing (ATANLP 2012) * April 24, 2012, Avignon, France * http://stp.lingfil.uu.se/atanlp/ * Submission deadline for papers: January 27, 2012 - Second Workshop on Computational Linguistics and Writing (CL&W 2012): Linguistic and cognitive aspects of document creation and document engineering * April 23 or 24, 2012, France * http://www.lingured.info/clw2012/ * Submission deadline: January 27, 2012 - EACL 2012 Workshop on Computational Models of Language Acquisition and Loss * April 23 or 24, 2012, France * http://sites.google.com/site/eaclcogws/ * Submission deadline: January, 20th, 2012 - EACL 2012 Workshop on Computational Approaches to Deception Detection * April 23, 2012, France * http://www.chss.montclair.edu/linguistics/DeceptionDetection.html * Submission deadline: Jan 27, 2012 - Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra) * April 23 and 24, 2012, Avignon, France * http://www-lium.univ-lemans.fr/esirmt-hytra * Submission deadline: February 2, 2012 - EACL 2012 Workshop on Innovative hybrid approaches to the processing of textual data * April 23 and 24, 2012, Avignon, France * http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/ * Submission deadline: Jan 04, 2012 (Abstract deadline) - EACL 2012 workshop on: Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2012) * April 24, 2012 Avignon, France * http://ilk.uvt.nl/LaTeCH2012/ * Submission deadline: January 20, 2012 - EACL 2012 Joint Workshop of LINGVIS & UNCLH Visualization of Linguistic Patterns and Uncovering Language History from Multilingual Resources * April 23-24, 2012, Avignon, France * https://sites.google.com/site/lingvisunclh/ * Submission deadline: January 30, 2012 - ROBUS-UNSUP: Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP * April 
23-24, 2012, Avignon, France * https://sites.google.com/site/robusunsupeacl2012/home * Submission deadline: Jan 27, 2012 - EACL 2012 Workshop on Semantic Analysis in Social Networks * April 23, 2012, Avignon, France * http://www.eecs.uottawa.ca/~diana/eacl2012_social_media_workshop.html * Submission Deadline: Jan 30, 2012 ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 7 10:59:06 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 7 Dec 2011 11:59:06 +0100 Subject: Appel: AI 2012 Message-ID: Date: Tue, 06 Dec 2011 20:59:04 -0500 From: Leila Kosseim Message-id: <4EDEC868.2080600 at cs.concordia.ca> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Apologies for cross-posting. Please distribute widely. =========================== Call for Papers ========================= AI 2012: The 25th Canadian Conference on Artificial Intelligence www.canadianai.ca May 28th to May 30th, 2012 York University, Toronto, Ontario, Canada ======================================================================== IMPORTANT DATES --------------- Full paper submission deadline: January 16th, 2012 Notification of acceptance: February 20th, 2012 Final papers due: March 5th, 2012 Conference: May 28th to May 30th, 2012 CALL FOR PAPERS --------------- AI'2012, the twenty-fifth Canadian Conference on Artificial Intelligence, invites papers that present original work in all areas of Artificial intelligence, either theoretical or applied.
Topics of interest include, but are not limited to: Agent Systems AI Applications Automated Reasoning Bioinformatics and BioNLP Case-based Reasoning Cognitive Models Constraint Satisfaction Data Mining E-Commerce Evolutionary Computation Games Information Retrieval Knowledge Representation Machine Learning Multi-media Processing Natural Language Processing Neural Nets Planning Robotics Search Smart Graphics Uncertainty User Modeling Web Applications Papers will be reviewed by the Program Committee members according to their originality, technical merit and clarity of presentation. Each accepted paper will be allocated a maximum of 12 pages in the proceedings. All accepted papers, for which one of the authors will have registered and is present at the conference, will be published in the conference proceedings as Lecture Notes in Artificial Intelligence by Springer (www.springer.com/lncs). Papers submitted to AI'2012 must not have been accepted for publication elsewhere or be under review for another conference. BEST PAPER AWARD ---------------- The Best Paper Award will be presented at the conference to the authors of the best paper. SUBMISSIONS ----------- Authors are invited to submit electronically, by January 16th, 2012. Submissions must be uploaded on Easychair: https://www.easychair.org/account/signin.cgi?conf=canadianai2012 Paper format: - Papers must be written in English. - Papers must be up to 12 pages in length. - Papers must be in PDF format. - Papers must be formatted according to Springer LNCS style. The use of the LaTeX2e style file available from Springer is strongly encouraged. ORGANIZERS ---------- AI'2012 is collocated with two cognate conferences: the Canadian Graphical Interface Conference and the Canadian Conference on Computer and Robot Vision. 
General Co-chairs AI/GI/CRV: Chrysanne Dimarco, University of Waterloo, Waterloo John Barron, University of Western Ontario, London Program Co-chairs: Leila Kosseim, Concordia University, Montreal Diana Inkpen, University of Ottawa, Ottawa Industry Chair: Atefeh Farzindar, NLP Technologies Inc. & Université de Montréal Local Arrangement Chair: Yves Lespérance, York University, Toronto Graduate student symposium co-chairs: Ebrahim Bagheri, Athabasca University, Athabasca Jocelyne Faddoul, Concordia University, Montreal Please mark the dates in your calendar. We hope to receive your submissions and see you at the conference. ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:40:07 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sat, 10 Dec 2011 10:40:07 +0100 Subject: These: Ludovic Jean-Louis, Extraction d'information fondee sur une analyse linguistique profonde Message-ID: Date: Wed, 7 Dec 2011 16:39:02 +0100 From: "Ludovic Jean-Louis" Message-ID: <4edf88a6.59030e0a.2507.ffffb884 at mx.google.com> J'ai le plaisir de vous inviter à ma soutenance de thèse, intitulée : "Extraction d'information à partir de textes fondée sur une analyse linguistique profonde" La soutenance aura lieu le Jeudi 15 Décembre 2011 à 14h30 dans la salle du conseil au LIMSI, bâtiment 508 de l'université Paris XI (LIMSI - CNRS B.P. 133 91403 ORSAY) Le jury sera composé de : - M. 
Patrice Bellot, Rapporteur (Université Aix-Marseille, LSIS) - Mme Adeline Nazarenko, Rapporteur (Université Paris Nord, LIPN) - M. Claude de Loupy, Examinateur (Syllabs) - M. Pierre Zweigenbaum, Examinateur (Université Paris Sud, LIMSI) - M. Olivier Ferret, Directeur de thèse (CEA LIST, LVIC) - M. Romaric Besançon, Encadrant (CEA LIST, LVIC) Vous êtes cordialement invités au pot qui suivra. Résumé : La plus grande partie des informations disponibles librement sur le Web se présentent sous une forme textuelle, c'est-à-dire non-structurée. Dans un contexte comme celui de la veille, il est très utile de pouvoir présenter les informations présentes dans les textes sous une forme structurée en se focalisant sur celles jugées pertinentes vis-à-vis du domaine d'intérêt considéré. Néanmoins, lorsque l'on souhaite traiter ces informations de façon systématique, les méthodes manuelles ne sont pas envisageables du fait du volume important des données à considérer. L'extraction d'information s'inscrit dans la perspective de l'automatisation de ce type de tâches en identifiant dans des textes les informations concernant des faits (ou événements) afin de les stocker dans des structures de données préalablement définies. Ces structures, appelées templates (ou formulaires), agrègent les informations caractéristiques d'un événement ou d'un domaine d'intérêt représentées sous la forme d'entités nommées (nom de lieux, etc.). Dans ce contexte, le travail de thèse que nous avons mené s'attache à deux grandes problématiques : - l'identification des informations liées à un événement lorsque ces informations sont dispersées à une échelle textuelle en présence de plusieurs occurrences d'événements de même type ; - la réduction de la dépendance vis-à-vis de corpus annotés pour la mise en oeuvre d'un système d'extraction d'information. Concernant la première problématique, nous avons proposé une démarche originale reposant sur deux étapes. 
La première consiste en une segmentation événementielle identifiant dans un document les zones de texte faisant référence à un même type d'événements, en s'appuyant sur des informations de nature temporelle. Cette segmentation détermine ainsi les zones sur lesquelles le processus d'extraction doit se focaliser. La seconde étape sélectionne à l'intérieur des segments identifiés comme pertinents les entités associées aux événements. Elle conjugue pour ce faire une extraction de relations entre entités à un niveau local et un processus de fusion global aboutissant à un graphe d'entités. Un processus de désambiguïsation est finalement appliqué à ce graphe pour identifier l'entité occupant un rôle donné vis-à-vis d'un événement lorsque plusieurs sont possibles. La seconde problématique est abordée dans un contexte de peuplement de bases de connaissances à partir de larges ensembles de documents (plusieurs millions de documents) en considérant un grand nombre (une quarantaine) de types de relations binaires entre entités nommées. Compte tenu de l'effort représenté par l'annotation d'un corpus pour un type de relations donné et du nombre de types de relations considérés, l'objectif est ici de s'affranchir le plus possible du recours à une telle annotation tout en conservant une approche par apprentissage. Cet objectif est réalisé par le biais d'une approche dite de supervision distante prenant comme point de départ des exemples de relations issus d'une base de connaissances et opérant une annotation non supervisée de corpus en fonction de ces relations afin de constituer un ensemble de relations annotées destinées à la construction d'un modèle par apprentissage. Cette approche a été évaluée à large échelle sur les données de la campagne TAC-KBP 2010. Mots-clés : Extraction d'information, extraction de relations, extraction d'événements Abstract : The major part of the information available on the web is provided in textual form, i.e. in unstructured form. 
In a context such as technology watch, it is useful to present the information extracted from a text in a structured form, reporting only the pieces of information that are relevant to the field of interest. Such processing cannot be performed manually at large scale, given the amount of data available. Automating this task is the goal of information extraction (IE). The purpose of IE is to identify, within documents, pieces of information related to facts (or events) in order to store this information in predefined data structures. These structures, called templates, aggregate the properties of a fact (often represented by named entities) concerning an event or an area of interest. In this context, the research performed in this thesis addresses two problems: - identifying information related to a specific event when the information is scattered across a text and several events of the same type are mentioned in the text; - reducing the dependency on annotated corpora for the implementation of an information extraction system. Concerning the first problem, we propose an original approach that relies on two steps. The first step performs an event-based text segmentation, which identifies within a document the text segments on which the IE process should focus to look for the entities associated with a given event. The second step focuses on template filling and aims at selecting, within the segments identified as relevant by the event-based segmentation, the entities that should be used as fillers, using a graph-based method. This method is based on a local extraction of relations between entities, which are then merged into a relation graph. A disambiguation step is then performed on the graph to identify the best candidates to fill the information template. The second problem is treated in the context of knowledge base (KB) population, using a large collection of texts (several million documents) from which the information is extracted.
This extraction also concerns a large number of relation types (more than 40), which makes manual annotation of the collection too expensive. In this context, we propose a distant supervision approach that makes it possible to use learning techniques for this extraction without the need for a fully annotated corpus. This distant supervision approach uses a set of relations from an existing KB to perform an unsupervised annotation of a collection, from which we learn a model for relation extraction. This approach has been evaluated at large scale on the data from the TAC-KBP 2010 evaluation campaign.

Keywords: Information extraction, relation extraction, event extraction

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:44:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 10:44:49 +0100
Subject: Seminaire: INFOLINGU, Myriam RAKHO, 12 decembre 2011, Universite Marne-la-Vallee
Message-ID:

Date: Wed, 7 Dec 2011 17:12:37 +0100
From: Myriam RAKHO
Message-ID:

************************************************************************
INFOLINGU
The seminar of the Informatique Linguistique (computational linguistics) team of the Laboratoire d'Informatique Gaspard Monge (LIGM)
Université Paris-Est Marne-la-Vallée
************************************************************************
Date: *Monday, December 12, 2011 at 10:30*
Venue: Université Paris-Est Marne-la-Vallée, Bâtiment Copernic, 4th floor, seminar room 4B08R

Toute personne intéressée est la
bienvenue.
************************************************************************

------------------------------------------------------------------------
Speaker:
------------------------------------------------------------------------
*Myriam RAKHO* (LIGM, Université Marne-la-Vallée)

------------------------------------------------------------------------
Title of the talk:
------------------------------------------------------------------------
*A corpus of polysemous verbs semantically annotated with the Lexique-Grammaire*

------------------------------------------------------------------------
Abstract:
------------------------------------------------------------------------
Word sense disambiguation of polysemous words (words that have several possible senses) consists in assigning to such a word, in a given context (sentence, paragraph), the one sense that it instantiates there. For the verb *comprendre*, for example, the task is to automatically recognise the sentence *'Le principe de précaution ne comprend pas la politisation de la science'* as an instance of sense "1 Contenir, renfermer en soi" (to contain, to include) and the sentence *'Vous devez comprendre que nous ne vous laisserons pas faire cela'* as an instance of sense "2 Se rendre compte de, que" (to realise that).

In a multilingual setting, the possible translations of a word in a given target language are used to represent its sense inventory. If the target language is English, the sense labels of the verb *comprendre* are therefore the verbs *understand*, *realise*, *see*, *gather*, *grasp*, *sympathise*, *appreciate*, *include*, *involve*, *imply*, etc. One drawback of this approach is that the correspondence between the senses of a polysemous word and their target-language equivalents is rarely one-to-one.
The same sense may be translated by several target-language equivalents and, conversely, the same target-language equivalent may translate several senses of a word. Until now, the solution to this problem has been to partition the set of possible translations of a word into subsets, each corresponding to one of its senses. For example, sense (1) of the verb *comprendre* would be associated with the class {*involve, imply, include*}, whereas sense (2) would be associated with the class {*understand, realise, see, gather, grasp*}.

We propose a new kind of sense label: the entries of words in the Lexique-Grammaire tables. The Lexique-Grammaire (LG) (Gross, 1975; Boons et al., 1976; Guillet and Leclère, 1992; Leclère, 2004) is a lexicon of French in which the possible senses of words are described in terms of lexical, syntactic and semantic properties: a polysemous word has as many entries in the Lexique-Grammaire tables as it has possible senses.

Our study covers 20 polysemous French verbs selected during the ARCADE campaign for the evaluation of multilingual text alignment systems and used in the Senseval/Romanseval campaigns for the evaluation of word sense disambiguation systems. For each of these verbs, we extracted from the multilingual EuroParl corpus all the French contexts in which it occurs, together with their English translations. We then associated four sense labels with each of these contexts: the translation of the polysemous verb in the English version of the context; the entry of the verb in the Lexique-Grammaire that it represents; a "lexie" of the form entrée_dans_le_LG#traduction (the concatenation of the first two sense labels); and a class of lexies (the set of lexies associated with the same Lexique-Grammaire entry of the word).
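
The four sense labels described above can be illustrated with a minimal Python sketch. It is not the speaker's implementation: the entry identifiers (comprendre_1, comprendre_2), the pairing of each sentence with a particular English verb, and the third example sentence are all assumptions made up for the illustration. Only the lexie format (LG entry, "#", translation) and the grouping of lexies by LG entry come from the abstract.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Context:
    """One aligned occurrence of a polysemous verb."""
    sentence: str      # French context containing the verb
    translation: str   # label 1: English verb in the aligned sentence (assumed here)
    lg_entry: str      # label 2: hypothetical identifier of the Lexique-Grammaire entry


def lexie(ctx: Context) -> str:
    """Label 3: the LG entry and the translation joined by '#'."""
    return f"{ctx.lg_entry}#{ctx.translation}"


def lexie_classes(contexts):
    """Group lexies by LG entry; each group is one class of lexies (label 4)."""
    classes = defaultdict(set)
    for ctx in contexts:
        classes[ctx.lg_entry].add(lexie(ctx))
    return dict(classes)


def annotate(contexts):
    """Return the four sense labels for every context."""
    classes = lexie_classes(contexts)
    return [
        {
            "translation": ctx.translation,
            "lg_entry": ctx.lg_entry,
            "lexie": lexie(ctx),
            "lexie_class": frozenset(classes[ctx.lg_entry]),
        }
        for ctx in contexts
    ]


contexts = [
    # Sentences 1 and 3 are quoted in the abstract; their translations are assumed.
    Context("Le principe de précaution ne comprend pas la politisation de la science",
            "include", "comprendre_1"),
    # Invented sentence, added only so that one class contains two lexies.
    Context("Ce plan comprend des risques", "involve", "comprendre_1"),
    Context("Vous devez comprendre que nous ne vous laisserons pas faire cela",
            "understand", "comprendre_2"),
]
labels = annotate(contexts)
```

In this sketch a class of lexies is simply the set of lexies observed for one LG entry, so the classes grow as more contexts are annotated; whether the actual corpus fixes the classes in advance is not stated in the abstract.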
We will present this corpus together with the first results of our word sense disambiguation experiments.

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:55:37 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 10:55:37 +0100
Subject: Appel: IC 2012
Message-ID:

Date: Thu, 08 Dec 2011 09:33:58 +0100
From: Nathalie Aussenac-Gilles
Message-ID: <4EE07676.4070309 at irit.fr>
X-url: http://ic2012.crc.jussieu.fr/

************************************************************************
*IC 2012 - 23es Journées francophones d’Ingénierie des Connaissances*
Tous les détails sur http://ic2012.crc.jussieu.fr/
Paris au Centre des Cordeliers du 25 juin au 29 juin 2012
************************************************************************

************************************************************************
*Dates importantes*
Date limite de soumission : *17 février 2012*
Date de notification aux auteurs : 31 mars 2012
Version définitive : 2 mai 2012
************************************************************************

Présentation de la conférence :
L'ingénierie des connaissances est une discipline intégrée à l'Intelligence Artificielle, fondée sur les connaissances, leur acquisition, leurs modèles, leurs traitements dans des environnements informatiques.
Elle s'appuie sur d'autres disciplines comme la philosophie, la linguistique, les mathématiques, les sciences cognitives, l'ergonomie pour mettre à la disposition d'activités humaines des méthodes et outils "intelligents" ou tout du moins capables d'aider l'humain dans ses activités. La conférence IC réunit la communauté francophone et est un lieu d'échanges et de réflexions sur cette discipline qui doit prendre en compte l'essor des Sciences et Technologies de l’Information et de la Communication, notamment des technologies liées au Web et leurs retombées sur les pratiques individuelles et collectives. La conférence d'ingénierie des connaissances est le lieu où la communauté francophone expose et confronte ses théories, ses pratiques, ses méthodes et outils. Un des enjeux récents est la mise en place de standards d'évaluation de ses méthodes et outils. Thèmes de la conférence : Les propositions portant sur le thème « Modèles, outils d'évaluation en Ingénierie des connaissance » seront particulièrement bienvenues mais les communications peuvent porter plus largement sur l'un des thèmes suivants dès lors qu'elles sont originales et qu'elles ont une portée théorique, méthodologique ou pratique (liste non exhaustive) : * Modèles, outils d'évaluation en Ingénierie des connaissance * Modèles de connaissances : conception, réutilisation, évolution, évaluation, exploitation * Ontologies : construction, types d’ontologies, intégration et alignement, rôle des ontologies de référence * Gestion de la dynamique des ontologies (construction, association entre ontologies, gestion du temps et de l’espace, …) * Traitements et raisonnements sur les connaissances * Acquisition de connaissances à partir de textes, à partir d'images, à partir de données non structurées * Découverte interactive de connaissances dans les données, dans les historiques, dans les interactions, sur le web… * Web sémantique, web social * Connaissance et réseaux sociaux sur le web * Inférence et règles métier 
: acquisition conception, réutilisation, évolution, évaluation, exploitations * Conception d'interactions et interfaces avec des systèmes à base de connaissances * Visualisation de connaissances * Applications en santé, industrielles, éducation, juridiques, etc. * Activités coopératives, pratiques collectives et distribuées * Ingénierie et Gestion des connaissances pour l’entreprise * Développements théoriques et interdisciplinaires de l'ingénierie des connaissances Soumissions : Les auteurs sont invités à présenter des articles longs, des articles courts, des posters et des démonstrations. Pour chaque type de soumission (article long, article court) un prix du meilleur papier sera décerné par le comité de programme. Le prix du meilleur poster sera attribué par les participants à la conférence à l'issue de la séance de posters. Articles longs : (16 p, présentation orale 20 mm, 10 mn discussion) Sont concernés des travaux aboutis, présentant des résultats validés ou des synthèses fouillées. Une attention particulière sera portée sur l'évaluation des résultats présentés. Articles courts : (8p, présentation orale de 10 mn, 5 mn discussion) Sont concernés des travaux en cours ou au stade de résultats partiels de recherches ou appliqués. Ce type de soumission correspond à des travaux innovants de qualité, mais dont la maturité ne permet pas des élaborations complètes et des conclusions définitives. Posters : (3p, présentation dans un espace de discussion) Sont concernés des travaux qui démarrent, les premiers résultats de jeunes chercheurs. Les posters sont destinés à provoquer la discussion au sein de la communauté pour accompagner au mieux le démarrage des travaux présentés. Les auteurs sont invités à indiquer le thème le plus approprié pour leur soumission. 
La conférence d'Ingénierie des connaissances est sous l'égide du GRACQ (groupe d'acquisition et d'ingénierie des connaissances ) dont le site est http://www.irit.fr/GRACQ/ Présidente du comité de programme : Sylvie Szulman (LIPN -Université Paris 13) Président du comité d'organisation : Jean Charlet (AP-HP-INSERM - Université Pierre et Marie Curie) ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:57:58 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sat, 10 Dec 2011 10:57:58 +0100 Subject: Seminaire: Nouveaux modes de publication scientifique, 14 decembre 2011 Message-ID: Date: Thu, 8 Dec 2011 09:38:41 +0100 From: Centre d'Alembert Message-Id: X-url: http://www.centre-dalembert.u-psud.fr X-url: http://h0.web.u-psud.fr/plan_sciences/satellite.php?bat=338 Séminaire interdisciplinaire 2011-2012 du Centre d'Alembert http://www.centre-dalembert.u-psud.fr Nouvelles pratiques et cultures du numérique dans les sciences et les technologies Le Centre d'Alembert s'interroge cette année sur l'émergence de nouvelles pratiques et cultures dans la production du savoir scientifique, sa validation et sa circulation, sous l'influence des récentes évolutions du numérique en particulier afférentes à Internet. Séance du mercredi 14 décembre 2011 : Nouveaux modes de publication scientifique, de validation et d'accès : quelles avancées pour le mouvement des archives ouvertes, pour la libre diffusion des connaissances scientifiques sur le web ? 
Le libre accès aux publications scientifiques : éléments d'une prospective prudente Jean-Claude GUEDON, Professeur de littérature comparée à l'Université de Montréal. Prix d'excellence de la Society for Digital Humanities. Ancien vice-président de la Fédération canadienne des sciences humaines pour la dissémination de la recherche. Depuis environ dix ans, le mouvement de l'accès libre se déploie dans le monde. Ses progrès sont indéniables, et le sentiment que le libre accès est désormais ancré dans la réalité des communications scientifiques ne semble plus faire de doute : même les grandes maisons d'édition composent ouvertement avec ce modèle pourtant vu comme irréaliste et radical il y a seulement quelques années. Fondé sur les dépôts institutionnels et thématiques, d'une part, et les revues en libre accès d'autre part, le libre accès agit aussi à titre d'indicateur de la numérisation croissante de nos cultures. Il révèle en particulier que la numérisation engendre des fonctions nouvelles pour les documents numériques et le prochain grand défi du libre accès sera de répondre à ces interpellations du numérique : rôle des données par rapport aux publications, convergences des médias d'expression, algorithmes "embarqués" dans les documents s'ajoutent aux questions plus traditionnelles des formats adéquats pour la publication électronique en libre accès. En parallèle, la création de la valeur symbolique en science mérite d'être revisitée au travers du prisme qu'offre le libre accès. Actuellement complètement dominé, voire perverti, par l'obsession des classements et la concurrence tous azimuts, les publications scientifiques actuelles révèlent l'existence d'une confusion fâcheuse entre qualité et excellence. Relocaliser le rôle de l'excellence au sein d'une quête de la qualité semble urgent, et il n'existe pas de meilleur moyen d'atteindre cet objectif que le libre accès. 
Practical information:
The sessions take place at the Faculté des Sciences d'Orsay, bâtiment des colloques (338), from 13:30 to 15:30 (free admission).
Access: http://h0.web.u-psud.fr/plan_sciences/satellite.php?bat=338
Detailed programme and contact:
Web: http://www.centre-dalembert.u-psud.fr
E-mail: centre.dalembert at u-psud.fr
Social media: http://twitter.com/centredalembert

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 10:04:06 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 11:04:06 +0100
Subject: Info: MULTI-2, Erasmus Mundus Action 2 Mobility Grants Europe - South East Asia
Message-ID:

Date: Thu, 8 Dec 2011 16:30:15 +0100
From: Laurent Prévot
Message-ID:
X-url: http://eacea.ec.europa.eu/erasmus_mundus/index_en.php
X-url: http://www.em-multi.eu/

****************************************
****Apologies for cross-postings****
****************************************

The MULTI Erasmus Mundus project offers attractive mobility grants to Asian and European students and researchers (master's students, PhD students, young PhD holders) to study or do research at one of the consortium institutions (see below). The grants cover airfare and living expenses, for a duration of one academic semester to one academic year.
Multilingualism and Multiculturalism (MULTI-2) is an Action 2 Erasmus Mundus project funded by the European Commission to promote scientific exchanges between Europe and the industrialized countries of South-East Asia (Hong Kong, Macao, Taiwan, Singapore, Brunei). It covers both general disciplines such as Linguistics, Area Studies or Social Sciences and more applied fields such as Natural Language Processing or Translation.

To be eligible for a mobility grant in the direction Europe -> South East Asia, the applicant has to be a national of one of the 27 European Union countries and has to register at or be affiliated with one of the European partner universities.

To be eligible for a mobility grant in the direction South East Asia -> Europe, the applicant has to be a national of one of the South East Asian countries participating in MULTI and has to register at or be affiliated with one of the South East Asian partner universities.

Partner institutions in Europe:
  Université de Provence (Coordinator)
  Freie Universität Berlin
  Universität des Saarlandes
  Università di Pisa
  City University London
  Universidad de Sevilla
  Charles University in Prague

Partner institutions in South East Asia:
  The Hong Kong Polytechnic University (SEA Coordinator)
  University of Macao
  Nanyang Technological University
  National Taiwan University
  Universiti Brunei Darussalam

The application deadline is *Jan. 20, 2012*, and the results will be posted around mid-March.
For more information, please check MULTI website: http://www.em-multi.eu/ Email Contact (remove + and spaces): erasmusmundus ++ at ++ lpl-aix.fr ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 10:08:37 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sat, 10 Dec 2011 11:08:37 +0100 Subject: Job: Stage Postdoctoral, Traduction automatique, LIMSI, Orsay Message-ID: Date: Thu, 08 Dec 2011 23:28:18 +0100 From: Francois Yvon Message-ID: <4EE13A02.5050504 at limsi.fr> X-url: http://www.limsi.fr/Scientifique/tlp/mt [apologies for cross-posting] [and for sending this message only in English] **************************************************************** POST-DOC OPENING IN STATISTICAL MACHINE TRANSLATION AT LIMSI-CNRS, FRANCE **************************************************************** The "Spoken Language Processing" team at LIMSI-CNRS, Orsay (25 km south of Paris) is seeking qualified postdoctoral researchers in the field of Statistical Machine Translation (SMT). The main task of the candidate will be to design, implement and evaluate innovative models and associated training and inference procedures for components of statistical machine translation systems. These models will be used to improve LIMSI's machine translation toolbox. Other areas in which we have a strong interest are model adaptation, confidence measures and automatic evaluation metrics. 
The successful candidate will join a dynamic research team working on various aspects of SMT. See http://www.limsi.fr/Scientifique/tlp/mt for information regarding our activities. Candidates are expected to hold an Engineering BSc/MSc degree and a PhD in Computer Science. Knowledge of statistical or example-based approaches for Speech or Natural Language Processing is required; the candidate is also expected to have strong programming skills, to be familiar with statistical machine learning techniques, and to have a good publication record in the field. Experience with statistical machine translation systems and/or participation in large-scale open evaluations (NIST, IWSLT etc.) will be a strong advantage. The minimum remuneration will be EUR 2000 (net income) per month. A contract of one year will be awarded. Interested candidates should send a short cover letter stating motivation and interests, along with their CV (in .pdf format only), to the email address given below as soon as possible and by December 31st, 2011 at the latest. Questions or requests for clarification should be directed to the same address.

Application Deadline: 31-dec-2011 [the position(s) will remain open until filled]
Email Address for Applications: yvon at limsi dot fr
Contact Information: F.
Yvon Phone: +331-69858066 ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 10:12:45 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sat, 10 Dec 2011 11:12:45 +0100 Subject: Journee: Journee d'etude, Qu'est-ce qu'apprendre une langue? Universite Paris Ouest, 14 decembre 2012 Message-ID: Date: Fri, 9 Dec 2011 09:26:45 +0100 From: "Didier Bottineau" Message-ID: <975E4DBFA15D4FA8BADFC75C71940DD3 at R12B0> X-url: http://www.modyco.fr/ ANNONCE D’UNE JOURNEE D’ETUDE Qu’est-ce qu’apprendre une langue ? Mercredi 14 décembre 2011, Université Paris Ouest, Nanterre, bâtiment G, salle G 614, 9h-18h ACCES : RER A, arrêt Nanterre Université Journée scientifique organisée le mercredi 14 décembre 2011 par Gilles Louÿs (EA 1586 Centre des Sciences de la Littérature française, F.ET.E : Français pour ETudiants Etrangers, Paris Ouest Nanterre La Défense) avec le concours de l’Union des Professeurs de Langues des Grandes Ecoles, du Centre des Sciences de la Littérature française (Paris Ouest) et des écoles doctorale Lettres Langues Spectacles et Connaissance, Langage, Modélisation (Paris Ouest). 
Lieu : bâtiment G, salle G 614 Comité scientifique Alain Cazade (EA 4405 CICLaS, Paris Dauphine) Jörg Eschenauer (Union des Professeurs de Langues des Grandes Ecoles / Ecole des Ponts, ParisTech) Danièle Flament (UMR 7114 MoDyCo / Université Paris Ouest Nanterre la Défense) Jacques Prévot (EA 1586 Centre des Sciences de la Littérature française / Université Paris Ouest Nanterre La Défense) Jean-Louis Chiss (EA 2288 ERADLEC, Université Paris 3) Cette journée vise à explorer et définir l’expérience du sujet qui s’exprime et communique dans une langue autre que la sienne et, corrélativement, à s’interroger sur la manière dont ce savoir-faire s’acquiert à travers une gamme de pratiques pédagogiques diverses empruntées à des domaines d’application multiples. Le point de vue adopté ici est de considérer que parler dans une autre langue, c'est passer d’un « corps parlant » à un autre, pour autrui comme pour soi-même, se faire devenir momentanément un autre individu, avec les effets cognitifs, sémantiques, émotionnels, relationnels et culturels que cela suppose. L’hypothèse défendue est que l’observation de cette démarche dépasse la simple prise en compte du "para-verbal" (gestes, mimiques, postures). Par exemple, articuler des sons, tons ou mélodies en japonais et en espagnol oblige le parleur à placer différemment les divers muscles mobilisés par l'expression de ce qu'il dit, ce qui lui donne littéralement un visage et une voix distincts selon la langue qu'il parle, produisant de lui-même, consciemment ou non, une réalisation subjective et relationnelle distincte de sa manière d’être dans sa langue maternelle. Cet aspect fondamental de l’action langagière humaine est relativement peu décrit, et sa mise en œuvre n’est guère systématisée ni explicitée en didactique des langues. 
In this context, the aim of the day is to give form and substance to this point of view and to explore its pedagogical consequences, which may become the site of an epistemological break:
a) from the theoretical point of view: because research on language (or discourse) generally focuses precisely on so-called "linguistic" units without concern for what lies upstream (the reality of the conditions), of which those units are the downstream product (the result and its often disembodied formalization);
b) from the practical point of view: because, in general, teaching methods depend on the fields usually called upon in language learning, such as grammar, lexicon, phonetics, semantics, pragmatics and culture (representations, stereotypes…), and lack the means to refer to a different (and more global) understanding of what the experience of speech is.

This conference sets out to highlight current theories that give full weight to the learner's bodily experience in language comprehension or production (taking into account their relations to the relevant currents in linguistics, psychology and phenomenology), and to link them to teaching practices that call on the "speaking body" (theatre, gesture, dance, music, singing, speech therapy…), which illustrate their didactic relevance in the field.
Programme

9:00 Opening and presentation of the day: Christophe Martin (Paris Ouest, director of EA 1586, co-director of UFR PHILLIA)

The experience of speech; chair: Robert Damien (Paris Ouest, director of ED 139)
9:15-10:00 Invited speaker: Didier Bottineau (CNRS, UMR 7114 Modyco & Paris Ouest): « Comment l'expérience de la parole constitue le sujet psychologiquement, intersubjectivement, cognitivement, culturellement, et ce à travers l'interaction, par la motricité et la sensorialité »
10:00-10:15 Discussion with the audience
10:15-10:45 Break

The speaking body; chair: Alain Cazade (EA 4405 CICLaS, Paris Dauphine)
10:45-11:05 Philippe Turchet (researcher in synergology, Canada): « Et si l'apprentissage d'une langue passait aussi par l'intégration non consciente d'un langage corporel associé ? »
11:05-11:25 Catherine Garitte (EA 4431 Psychologie des processus et conduites complexes, Paris Ouest) and Larisa Olteanu: « L’apprentissage des gestes conventionnels en roumain et en français chez les enfants bilingues de 7 à 11 ans »
11:25-11:45 Jean-Rémi Lapaire, Université Michel de Montaigne Bordeaux 3: « Corps pensant, corps parlant, corps en mouvement »
11:45-12:00 Discussion with the audience
12:00-13:30 Lunch

Didactic perspectives 1; chair: Thomas Gomez (Paris Ouest, director of ED 138)
13:30-13:50 Vera Dickman (Télécom ParisTech): « Médiation du corps dans les apprentissages langagiers et interculturels »
13:50-14:10 Jörg Eschenauer (Ecole des Ponts ParisTech): « Apprendre une langue en tandem »
14:10-14:30 Amina Bensallah-Poulain (UMR 7114 MoDyCo, CNRS & Paris Ouest): « L’ancrage corporel du langage – Apport des pratiques orthophonistes à l’enseignement d’une langue orale « étrangère » »
14:30-15:00 Discussion with the audience
15:00-15:30 Break

Didactic perspectives 2; chair: Jörg Eschenauer (ParisTech & UPLEGESS)
15:30-15:50 Joëlle Aden, Université du Maine (INEDUM/CREN, EA 2661): « Empathie et approche théâtrale en didactique des langues »
15:50-16:10 Graça Dos Santos (EA 369 Etudes romanes, Paris Ouest): « Je est un autre. L'expérience du bilinguisme et du théâtre pour enseigner et apprendre une langue. »
16:10-16:30 Stephen Scott Brewer (Centre de recherches Education Formation: EA 1589 Université Paris Ouest): « Apprendre une langue : l’enjeu du ‘jeu intérieur’ »
16:30-17:00 Discussion with the audience

17:00-18:00 Round table chaired by Danièle Flament-Boistrancourt (UMR 7114 MoDyCo, Paris Ouest), Boris Sautereau and Emmanuelle Sauvage (EA 1586, Paris Ouest, FETE), Sylvaine Hughes (EA 370 Centre de recherches anglophones, Paris Ouest), Idelette Muzart (EA 369 Etudes romanes, Paris Ouest), Jean-Marc Chamot (6th dan Aikido, EA 370, Paris Ouest).
18:00 Closing cocktail reception

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:41:57 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 14 Dec 2011 09:41:57 +0100
Subject: Appel: ECAI 2012, Last call for workshops
Message-ID:

Date: Sat, 10 Dec 2011 15:11:39 +0100
From: ECAI Publicity
Message-ID: <4EE3689B.8030609 at lirmm.fr>
X-url: http://www2.lirmm.fr/ecai2012/

Last call for ECAI 2012 Workshops
www.lirmm.fr/ecai2012
Deadline: 6 Jan 2012

ECAI, the biennial European Conference on Artificial Intelligence organized by the European Coordinating Committee for Artificial Intelligence (ECCAI), is the premier forum for presenting Artificial Intelligence results in Europe.
The twentieth conference in this series, ECAI 2012, will take place in the beautiful city of Montpellier (France), August 27-31. ECAI 2012 will gather researchers and practitioners of Artificial Intelligence from all over the world; they will present the latest results and identify new trends and challenges in all subfields of Artificial Intelligence as well as demonstrate innovative applications and uses of advanced Artificial Intelligence technology. In addition to the main technical track, the ECAI 2012 program will feature keynote and invited talks, invited tutorials, an elaborate workshop program, and a system-demonstration track.

Workshops
-----------
All workshop proposals are welcome on any topic related to AI, ranging from robotics to games, from knowledge modelling to reasoning to constraints, from machine learning to planning and scheduling, from uncertainty to perception to language, from multi-agents to interactions and user modelling (non-exhaustive list). The goal of the workshops is to provide an informal forum for researchers to discuss focused and/or emerging research questions and challenges. Workshops can come in several flavors:
* presentations of peer-reviewed papers (standard workshop format), including discussions and possibly panels;
* AI competitions or challenges;
* position papers and discussion on open or controversial problems;
* any other format aimed at investigating new AI trends.

What a workshop should *not* be: a mini-conference packed with short talks and with no time for discussion and interaction. We therefore encourage workshop organizers to include invited talks, short tutorials, panel discussions and representation of alternative viewpoints in their workshop program in addition to the regular submissions. The typical duration for a workshop is one day; two-day workshops can also be accommodated.
For any questions, or to signal an intention to submit, please contact Jérôme Lang and Michèle Sebag (lang at lamsade.dauphine.fr, sebag at lri.fr). The deadline for workshop proposals is January 6, 2012.

Dates:
6 Jan 2012: workshop proposals due (early submissions are welcome!)
1 Feb 2012: workshop proposal notifications sent
19 Feb 2012: 1st call for papers sent; workshop websites ready; announcement of all ECAI-2012 workshops at the main conference site
28 May 2012 (suggested): workshop paper submission deadline
28 Jun 2012 (suggested, and subject to modification): notification on workshop paper submissions
22 Jul 2012: workshop proceedings to be sent to workshop chair

Note that the workshop submission deadline should preferably fall after the ECAI notification (21 May 2012), and workshop paper notifications *must* be sent before the ECAI early registration deadline (which will be announced later).

Workshop proposal submission:
Please submit your proposal at http://www.easychair.org/conferences/?conf=ecai2012ws

Each workshop proposal (pdf file) must include:
(1) Title and acronym.
(2) Names, affiliations, and contact details of all organizers (please indicate a primary contact person to whom correspondence should be directed).
(3) Short description (workshop format and duration, tentative call for papers, tentative invited speakers and members of the program committee).
(4) Audience: research groups working in the field; (if applicable) details about previous editions of the workshop; related workshops in other venues; expected number of submissions.
(5) One paragraph about each organizer (scientific profile, previous events you have organized).
(6) Any other information you deem relevant.

Policy:
* All workshop participants, including the organizers, are required to register for both the workshop and the main conference. For up to three invited speakers per workshop the workshop registration will be waived, though they must still register for the main conference.
Unfortunately, ECAI cannot provide any other support for these speakers.
* For budget reasons, a workshop might be cancelled if there are not enough submissions or registrations.
* Proceedings: The organizers will provide a single pdf file to be posted on the website.

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:46:33 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 14 Dec 2011 09:46:33 +0100
Subject: Ecole: SSFLA 2012
Message-ID:

Date: Sun, 11 Dec 2011 11:24:50 +0100
From: "GRLMC"
Message-ID:
X-url: http://grammars.grlmc.com/ssfla2012/

*********************************************************************
2012 INTERNATIONAL SPRING SCHOOL IN FORMAL LANGUAGES AND APPLICATIONS
SSFLA 2012
(formerly International PhD School in Formal Languages and Applications)
Tarragona, Spain
April 2-6, 2012
Organized by: Research Group on Mathematical Linguistics (GRLMC), Rovira i Virgili University
http://grammars.grlmc.com/ssfla2012/
*********************************************************************

AIM:
SSFLA 2012 offers a broad and intensive series of lectures at different levels on selected topics in language and automata theory and their applications. The students choose their preferred courses according to their interests and background. Instructors are top names in their respective fields. The School intends to help students initiate and foster their research career.
The previous event in this series was FSFLA 2011 (http://grammars.grlmc.com/fsfla2011/). ADDRESSED TO: Undergraduate and graduate students from around the world. Most appropriate degrees include: Computer Science and Mathematics. Other students (for instance, from Linguistics, Electrical Engineering, Molecular Biology or Logic) are welcome too provided they have a good background in discrete mathematics. The School is appropriate also for people more advanced in their career who want to keep themselves updated on developments in the field. There will be no overlap in the class schedule. COURSES AND PROFESSORS: - Eric Allender (Rutgers), Circuit Complexity: Recent Progress in Lower Bounds [introductory/advanced, 8 hours] - Amihood Amir (Bar-Ilan), Periodicity and Approximate Periodicity in Pattern Matching [introductory, 6 hours] - Ahmed Bouajjani (Paris 7), Automated Verification of Concurrent Boolean Programs [introductory/advanced, 10 hours] - Andrea Corradini (Pisa), Computational Models Based on Rewriting [introductory, 12 hours] - Bruno Courcelle (Bordeaux), Automata for Monadic Second-order Model Checking [intermediate, 8 hours] - Manfred Droste (Leipzig), Weighted Automata and Weighted Logic [introductory/advanced, 8 hours] - Jörg Flum (Freiburg), The Halting Problem for Turing Machines [introductory/advanced, 6 hours] - Aart Middeldorp (Innsbruck), Termination of Rewrite Systems [introductory/intermediate, 8 hours] REGISTRATION: It has to be done on line at http://grammars.grlmc.com/ssfla2012/Registration.php FEES: They are variable, depending on the number of courses each student takes. The rule is: 1 hour = - 10 euros (for payments until January 22, 2012), - 15 euros (for payments after January 22, 2012). 
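As a worked illustration of the fee rule above, here is a minimal Python sketch. The function name `ssfla_fee` and the constants are hypothetical, and it assumes that "until January 22, 2012" includes that day; the announcement does not say so explicitly.

```python
from datetime import date

# Values taken from the announcement; the names are illustrative only.
EARLY_DEADLINE = date(2012, 1, 22)  # assumed inclusive ("payments until January 22, 2012")
EARLY_RATE_EUR = 10                 # euros per course hour, early payment
LATE_RATE_EUR = 15                  # euros per course hour, later payment


def ssfla_fee(total_hours: int, payment_date: date) -> int:
    """Return the fee in euros for the given number of course hours."""
    if total_hours < 0:
        raise ValueError("total_hours must be non-negative")
    rate = EARLY_RATE_EUR if payment_date <= EARLY_DEADLINE else LATE_RATE_EUR
    return total_hours * rate


# Hypothetical selection: Allender (8 hours) plus Amir (6 hours) = 14 hours.
hours = 8 + 6
print(ssfla_fee(hours, date(2012, 1, 20)))  # 140
print(ssfla_fee(hours, date(2012, 2, 1)))   # 210
```

Under these assumptions, the same 14 hours cost 140 euros when paid by the early deadline and 210 euros afterwards.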
PAYMENT PROCEDURE: The fees must be paid to the School's bank account: Uno-e Bank (Julian Camarillo 4 C, 28037 Madrid, Spain): IBAN: ES3902270001820201823142 – Swift/BIC code: UNOEESM1 (account holder: Carlos Martin-Vide GRLMC; account holder's address: Av. Catalunya, 35, 43002 Tarragona, Spain) Please mention SSFLA 2012 and your name in the subject. A receipt will be provided on site. Remarks: - Bank transfers should not involve any expense for the School. - People claiming early registration will be requested to prove that the bank transfer order was carried out by the deadline. - Students may be refunded only in the case when a course gets cancelled due to the unavailability of the instructor. People registering on site at the beginning of the School must pay in cash. For the sake of local organization, however, it is much recommended to do it earlier. ACCOMMODATION: Information about accommodation is available on the website of the School. CERTIFICATE: Students will be delivered a certificate stating the courses attended, their contents, and their duration. IMPORTANT DATES: Announcement of the programme: December 10, 2011 Starting of the registration: December 10, 2011 Early registration deadline: January 22, 2012 Starting of the School: April 2, 2012 End of the School: April 6, 2012 QUESTIONS AND FURTHER INFORMATION: Lilica Voicu: florentinalilica.voicu at urv.cat WEBSITE: http://grammars.grlmc.com/ssfla2012/ POSTAL ADDRESS: SSFLA 2012 Research Group on Mathematical Linguistics (GRLMC) Rovira i Virgili University Av. 
Catalunya, 35
43002 Tarragona, Spain
Phone: +34-977-559543
Fax: +34-977-558386

ACKNOWLEDGEMENTS:
Diputació de Tarragona
Universitat Rovira i Virgili

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:49:28 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 14 Dec 2011 09:49:28 +0100
Subject: Appel: FOIS 2012
Message-ID:

Date: Sun, 11 Dec 2011 23:56:35 +0100
From: Nathalie Aussenac-Gilles
Message-ID: <4EE53523.3020508 at irit.fr>
X-url: http://purl.org/icbofois2012

(apologies for cross-posting) ... (please distribute)
-----------------------------------------------------------------
CALL FOR PAPERS | FOIS 2012 | Jul 24-27, Graz, Austria
-> DEADLINE: Conference paper submission: Jan 31, 2012 (in ca. 8 weeks)
[1] http://purl.org/icbofois2012

------------------------------------------------
TITLE AND CO-LOCATION
------------------------------------------------
Seventh International Conference on Formal Ontologies in Information Systems (FOIS 2012)
July 24-27, 2012 in Graz, Austria
held together with the Third International Conference on Biomedical Ontology (ICBO 2012)

------------------------------------------------
DEFINITION AND SCOPE
------------------------------------------------
Ontology, originally a fundamental part of philosophical enquiry, is concerned with the analysis and categorization of what exists.
In recent years, however, a complementary focus of ontological inquiry gained significant momentum fueled by the advent of complex information systems which rely on robust and coherent representations of their subject matter. The systematic study of such representations, their reasoning techniques and their relations to reality, are at the center of the modern discipline of formal ontology. Formal ontology in this modern sense is now a research focus in such diverse domains as conceptual modeling, database design, software engineering, organizational modeling, artificial intelligence, computational linguistics, the life sciences, bioinformatics, geographic information science, knowledge engineering, information retrieval, and the semantic web. Researchers in all these areas increasingly recognize the need for serious engagement with ontology, understood as a general theory of the types of entities and relations making up their respective domains of enquiry, in providing a solid foundation for their work. The FOIS conference is designed to provide a meeting point for researchers from all disciplines with an interest in formal ontology. The conference encourages submission of high quality articles on both theoretical issues and concrete applications. As in previous years, FOIS 2012 is intended as a nexus of interdisciplinary research and communication [1]. The FOIS conference series [2] began with the first meeting in Trento, Italy in June 1998 followed by meetings in 2001, 2004, 2006, 2008, and 2010. The seventh FOIS conference will be held in Graz, Austria July 24-27, 2012, in conjunction with the third International Conference on Biomedical Ontology (ICBO 2012) [1]. 
FOIS is the flagship conference of the International Association for Ontology and its Applications (IAOA) [3], which is a non-profit organization the purpose of which is to promote interdisciplinary research and international collaboration at the intersection of philosophical ontology, linguistics, logic, cognitive science, and computer science, as well as in the applications of ontological analysis to conceptual modeling, knowledge engineering, knowledge management, information-systems development, library and information science, scientific research, and semantic technologies in general. ------------------------------------------------ TOPICS OF INTEREST ------------------------------------------------ We seek high-quality papers on a wide range of topics. While authors may focus on fairly narrow and specific issues, all papers should emphasize the relevance of the work described to formal ontology and to information systems. Papers that completely ignore one or the other of these aspects will be considered as lying outside the scope of the meeting. Topic areas of particular interest to the conference are: Foundational Issues * Kinds of entity: particulars/universals, continuants/occurrents, abstracta/concreta, dependent/independent entities, natural objects/artifacts * Formal relations: parthood, identity, connection, dependence, constitution, subsumption, instantiation * Vagueness and granularity * Identity and change * Formal comparison among ontologies * Ontology of physical reality (matter, space, time, motion, ...) * Ontology of biological reality (genes, proteins, cells, organisms, ...) * Ontology of artifacts, functions and roles * Ontology of mental reality and agency (beliefs, intentions, emotions, ...) * Ontology of social reality (institutions, organizations, norms, social relationships, artistic expressions, ...) * Ontology of the information society (information, communication, meaning negotiation, ...) 
* Ontology and Natural Language Semantics, Ontology and Cognition Methodologies and Applications * Top-level vs application ontologies * Ontology integration and alignment; role of reference ontologies * Ontology-driven information systems design * Ontological foundations for conceptual modeling * Ontology-based application systems * Requirements engineering * Knowledge engineering * Knowledge management and organization * Knowledge representation; Qualitative modeling * Computational lexicons; Terminology * Information retrieval; Question-answering * Semantic Web; Web services; Grid computing * Domain-specific ontologies, especially for: Biomedical science, E-business, Enterprise integration, Engineering, Geography, Law, Library science, Linguistics, ... ------------------------------------------------ KEYNOTE SPEAKERS ------------------------------------------------ Steven R. Ray (Carnegie Mellon Silicon Valley, California, USA) Laure Vieu (CNRS-IRIT, Toulouse, France) Edward N. Zalta (Stanford University, California, USA) ------------------------------------------------ IMPORTANT DATES ------------------------------------------------ We are calling for papers to be considered for inclusion in FOIS 2012. Conference paper submission: Jan 31, 2012<-- Conference paper notification: Mar 08, 2012 Camera-ready conference papers: Mar 24, 2012 Workshop paper submission: Mar 15, 2012 Workshop paper notification: Apr 15, 2012 Camera-ready workshop papers: Jun 30, 2012 Conference dates: Jul 24-27, 2012 ------------------------------------------------ AUTHOR INSTRUCTIONS FOR CONFERENCE PAPERS ------------------------------------------------ Submitted papers must not exceed 14 pages, including the bibliography and an abstract of no more than 300 words. Papers should be submitted electronically as PDF files prepared in accordance with the IOS formatting guidelines [4]. 
Submissions should be made via Easychair at the following address: [5] https://www.easychair.org/account/signin.cgi?conf=fois2012 Proceedings will be published by IOS Press and will be available at the time of the conference. Please note that at least one author must register for the conference in order for an accepted paper to be published in the proceedings. ------------------------------------------------ CONFERENCE ORGANISATION ------------------------------------------------ Conference chair: Michael Gruninger (University of Toronto, Canada) Program chairs: Maureen Donnelly (University at Buffalo, USA) Giancarlo Guizzardi (Fed. Univ. of Espírito Santo, Brazil) Local organization: Stefan Schulz (Graz University, Austria) ------------------------------------------------ PROGRAM COMMITTEE ------------------------------------------------ Achille Varzi (Columbia University, New York, USA) Alan Rector (University of Manchester, UK) Albert Goldfain (University at Buffalo, USA) Aldo Gangemi (ISTC-CNR, Rome, Italy) Alessandro Lenci (University of Pisa, Italy) Antony Galton (University of Exeter, UK) Barry Smith (University at Buffalo, USA) Bill Andersen (Highleet, Inc., USA) Brandon Bennett (University of Leeds, UK) Carola Eschenbach (University of Hamburg, Germany) Chris Menzel (Texas A&M University, USA) Chris Welty (IBM Research, Hawthorne, USA) Christiane Fellbaum (Princeton University, USA) Claudio Masolo (ISTC-CNR, Trento, Italy) Colin Batchelor (Royal Society of Chemistry, UK) Daniel Oberle (SAP Research, CEC Karlsruhe, Germany) David Mark (University at Buffalo, USA) Decio Krause (Fed. Univ. of Santa Catarina (UFSC), Brazil) Fabian Neuhaus (NIST, USA) Florian Probst (SAP, Germany) Frank Loebe (University of Leipzig, Germany) Fred Freitas (Fed. Univ. 
of Pernambuco, Recife, Brazil) Frederico Fonseca (Penn State University, USA) Gerd Wagner (Brandenburg University of Technology, Germany) Heinrich Herre (University of Leipzig, Germany) Jan Dietz (Delft University of Technology, The Netherlands) Janna Hastings (University of Geneva, Switzerland) Jeffrey Parsons (Memorial University of Newfoundland, Canada) Jerome Euzenat (INRIA, Grenoble, France) Jerry Hobbs (University of Southern California, USA) Johanna Seibt (University of Aarhus, Denmark) John Bateman (University of Bremen, Germany) John Mylopoulos (University of Toronto, Canada) John Sowa (Vivomind Intelligence Inc., USA) Joost Breuker (University of Amsterdam, The Netherlands) Ken Kaneiwa (National Institute of Informatics, Tokyo, Japan) Krzysztof Janowicz, (UCLA, USA) Laure Vieu (CNRS-IRIT, Toulouse, France) Leo Obrst (The Mitre Corporation, USA) Leonardo Lesmo (University of Torino, Italy) Maria Luiza Machado Campos (Fed. Univ. of Rio de Janeiro, Brazil) Mark Musen (Stanford University, USA) Martin Doerr (Foundation for Research and Technology, Greece) Mathias Brochhausen (Univ. of Arkansas for Medical Sciences, USA) Mathias Samwald (Medical University of Vienna, Austria) Matteo Cristani (University of Verona, Italy) Mauricio Barcelos de Almeida (Fed. Univ. of Minas Gerais, Brazil) Michael F. 
Uschold (Semantic Arts, Canada) Michael Gruninger (University of Toronto, Canada) Michael Sintek (DFKI GmbH, Kaiserslautern, Germany) Michel Dumontier (Carleton University, Canada) Natasha Noy (Stanford University, USA) Nicola Guarino (ISTC-CNR, Trento, Italy) Olivier Bodenreider (National Library of Medicine, USA) Oscar Pastor (Polytechnic University of Valencia, Spain) Paul Buitelaar (National University of Ireland, Galway) Peter Rittgen (University College Boras, Sweden) Pierdaniele Giaretta (University of Verona, Italy) Pierre Grenon (Open University, UK) Ricardo Falbo (Federal University of Espírito Santo, Brazil) Riichiro Mizoguchi (University of Osaka, Japan) Robert Hoehndorf (University of Cambridge, UK) Roberta Ferrario (ISTC-CNR, Trento, Italy) Roberto Casati (CNRS-EHSS, Paris, France) Stefano Borgo (ISTC-CNR, Trento, Italy) Terry Halpin (LogicBlox, Australia) Terry Langendoen (University of Arizona, USA) Thomas Bittner (University at Buffalo, USA) Tony Cohn (School of Computing, University of Leeds, UK) Ulrike Sattler (University of Manchester, UK) Veda Storey (Georgia State University, USA) Werner Ceusters (University at Buffalo, USA) Werner Kuhn (University of Munster, Germany) William McCarthy (Michigan State University, USA) Yair Wand (University of British Columbia, Canada) ------------------------------------------------ LINKS ------------------------------------------------ [1] http://purl.org/icbofois2012 == http://www.kr-med.org/icbofois2012 FOIS and ICBO 2012 website [2] http://www.formalontology.org/ FOIS conference series [3] http://www.iaoa.org International Association for Ontology and Its Applications [4] http://www.iospress.nl/authco/instruction_crc.html IOS Press formatting guidelines [5] https://www.easychair.org/account/signin.cgi?conf=fois2012 Submission at EasyChair ----------------------------------------------------------------- ------------------------------------------------------------------------- Message diffuse par la liste 
Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ -------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:51:09 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 14 Dec 2011 09:51:09 +0100
Subject: Revue: CORELA, vol. 9, num. 2
Message-ID:

Date: Mon, 12 Dec 2011 16:35:07 +0100
From: Gilles Col
Message-Id:
X-url: http://corela.edel.univ-poitiers.fr/
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2127
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2314
X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2298

Dear colleagues,

The editorial team of the journal CORELA - Cognition / Représentation / Langages (http://corela.edel.univ-poitiers.fr/) is pleased to announce the publication of its latest issue (vol. 9, no. 2).

Submissions for the next issue of the journal (vol. 10, no. 1) should be sent to Gilles Col (gilles.col at univ-poitiers.fr) before 13 April 2012.

Contents of the issue

Jean-Baptiste Guignard (2011). "Linguistique Cognitive et modèles catégoriaux : Quelques considérations épistémologiques".
http://corela.edel.univ-poitiers.fr/index.php?id=2127

Isabelle Haïk (2011). "Kissing goodbye".
http://corela.edel.univ-poitiers.fr/index.php?id=2314

Bénédicte Guillaume (2011). "Will dans les subordonnées en if."
http://corela.edel.univ-poitiers.fr/index.php?id=2298

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:52:55 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 14 Dec 2011 09:52:55 +0100
Subject: These: Tantely Harinjaka Ravelonjatovo, Systematic analysis of Malagasy terms
Message-ID:

Date: Tue, 13 Dec 2011 12:10:05 +0000
From: Tantely Harinjaka RAVELONJATOVO
Message-ID:

Dear researchers,

You are invited to attend the doctoral thesis defence of RAVELONJATOVO Tantely, entitled « CONTRIBUTION A LA METHODOLOGIE D’ANALYSE SYSTEMATIQUE DES TERMES MALGACHES. CAS DU DOMAINE DE L’ENVIRONNEMENT » (Contribution to the methodology of systematic analysis of Malagasy terms: the case of the environmental domain), which will take place on 22 December 2011 at 8:30 in Amphi 24 of the Faculté des Lettres et Sciences Humaines, Université d’Antananarivo. After the defence, refreshments will be offered in the Siméon Rajaona room.
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:56:25 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 14 Dec 2011 09:56:25 +0100
Subject: Appel: CAp 2012
Message-ID:

Date: Tue, 13 Dec 2011 15:52:27 +0100
From: Armelle Brun
Message-Id:
X-url: http://cap2012.loria.fr/

------------------------------------------------------------------------
First call for papers
CAp 2012
Conférence Francophone sur l'Apprentissage Automatique
LORIA, Nancy, 23-25 May 2012
http://cap2012.loria.fr/
CAP'12 is organized with the support of INRIA and AFIA.
------------------------------------------------------------------------

The machine learning conference CAp (Conférence sur l'Apprentissage automatique) is now an essential annual meeting for researchers in the French-speaking machine learning community. As the boundary between symbolic learning and numerical learning becomes ever less tangible, CAp offers anyone working in machine learning an opportunity to present their work and exchange ideas with other members of the community. It is also an occasion to start or continue multidisciplinary scientific exchanges between doctoral students, researchers and industry practitioners.
The conference is also an opportunity for doctoral students to gain a broad overview of their research field and to have a first contact, both rigorous and supportive, with all the activities involved in scientific communication: writing a paper, gathering and taking into account external feedback, giving an oral presentation, etc.

Important dates
----------------------
Full paper submission: 1 February 2012, midnight
Notification to authors: 15 March 2012
Final versions due: 1 April 2012
Conference: 23-25 May 2012

The CAP'12 programme committee encourages submissions of original work on topics related to machine learning, including (but not limited to):
* Symbolic and algebraic learning: decision trees, inductive logic programming, multi-instance learning, Galois lattices, frequent pattern mining...
* Numerical or statistical learning: neural networks, Bayesian networks, support vector machines, semi-supervised learning, co-training...
* Hybrid approaches: evolutionary algorithms, case-based reasoning, analogy, cognitive approaches, kernel methods...
* Reformulation: feature selection, instance selection, propositionalization, abstraction...
* Formal study of learning: learnability, evaluation of learning systems, links with the expert and human-computer interface aspects (bias, visualization, expert modelling, etc.), quality measures for data mining...
* Learning from structured and semi-structured data, from texts (text mining, web mining...), from genomes, from images...
* Graphical models: Markov models, Bayesian networks, dimensionality reduction, variable selection, clustering...
* Applications: data mining, text mining, bioinformatics, medicine, chemistry...
* ...
In addition to the paper presentation sessions (oral and poster), several renowned researchers in machine learning, or in its application to various fields, will be invited to present their work.

We therefore invite you to submit papers on your recent machine learning research. This research may be of various kinds: methodological, theoretical, or experimental work, or the development of algorithms, tools, systems, or applications. Papers may be submitted in French or English.

Templates and Styles
-----------------------
To ensure overall consistency, only papers submitted in PDF format will be accepted (16 pages maximum, including bibliography, figures, and tables). The LaTeX, Word, and Pages templates to be used are available on the conference website (http://cap2012.loria.fr). Please do not alter the layout or the fonts defined in the styles.

Committees
----------
Organizing committee: Frédéric Alexandre, Laurent Bougrain (chair), Armelle Brun, Olivier Buffet, Anne-Lise Charbonnier, Yannick Toussaint
Program committee: [being finalized]
Contact: bougrain at loria.fr

Announcements
------------
The program committee will award prizes for the best papers presented by a student.

Note that this year CAp is organized jointly with JFPDA (Journées Francophones Planification, Décision, Apprentissage pour la conduite de systèmes) (http://jfpda2012.loria.fr), and will therefore be an opportunity to broaden multidisciplinary exchanges by opening up to the planning, decision-making, and reinforcement learning community.

Mark your calendars; we look forward to seeing you in Nancy in 2012!
-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:58:03 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 14 Dec 2011 09:58:03 +0100
Subject: Job: Post-Doc LIF Marseille
Message-ID:

Date: Tue, 13 Dec 2011 15:54:28 +0100
From: "alexis.nasr"
Message-ID: <4EE76724.8070004 at lif.univ-mrs.fr>

------------------------------------------------------------------------
[english version below]

The Laboratoire d'Informatique Fondamentale de Marseille is offering a one-year post-doctoral position on the linguistic processing of spontaneous English speech. The work consists of applying various processing steps to the output of an automatic speech transcription system, in particular named entity recognition, part-of-speech tagging, syntactic parsing, word sense disambiguation, and semantic role identification. The successful candidate will join the Traitement Automatique du Langage Ecrit et Parlé team.

Profile sought:
- good command of English
- training in NLP or machine learning
- command of the C or C++ programming language.
Contact:
Frédéric Béchet (frederic.bechet at lif.univ-mrs.fr)
Alexis Nasr (alexis.nasr at lif.univ-mrs.fr)

------------------------------------------------------------------------

The Laboratoire d'Informatique Fondamentale de Marseille has an opening for a one-year post-doctoral position in the field of Natural Language Processing, more precisely on the linguistic processing of the output of English Automatic Speech Recognition. Several processing modules will be developed or adapted: Named Entity Recognition, Part-of-Speech Tagging, syntactic parsing, Word Sense Disambiguation, and Semantic Role Labeling. The candidate will join the Natural Language Processing team (Traitement Automatique du Langage Ecrit et Parlé).

Applicants are expected to have finished, or be about to finish, their Ph.D. degrees. They must have a background in Natural Language Processing and a good command of the C or C++ programming language, as well as of the English language.

Contact:
Frédéric Béchet (frederic.bechet at lif.univ-mrs.fr)
Alexis Nasr (alexis.nasr at lif.univ-mrs.fr)

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:13:39 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:13:39 +0100
Subject: Thesis: Mohamed Belgacem, Reconnaissance automatique de la parole et ALAO : Vers un systeme d'apprentissage de l'arabe oral
Message-ID:

Date: Tue, 13 Dec 2011 17:26:58 +0100 (CET)
From: mohamed belgacem
Message-ID:
<2079155664.1004631323793618371.JavaMail.www at wwinf7143>

Hello everyone,

I am pleased to invite you to my thesis defense, which will take place on December 16, 2011 at 2:30 pm in the Jacques Cartier room, Maison des langues et des cultures, Université Stendhal, Grenoble, France.

You are also welcome at the reception that will follow the defense.

Thesis title: Reconnaissance automatique de la parole et ALAO : Vers un système d'apprentissage de l'arabe oral (Automatic speech recognition and CALL: towards a system for learning spoken Arabic)

Abstract

The phonetic component is not given enough attention in foreign language learning, even though it is one of the sources of the variety of accents observed among non-native speakers. The speech signal conveys important information about the speaker. This information tells us about the speaker's language level and any pronunciation problems they may have. Analyzing the characteristics of learners' speech can help improve the foreign language learning process. Such an analysis can support the development of technological applications such as automatic speech recognition (ASR), speaker verification and identification, and computer-assisted language learning (CALL). These applications have seen major advances in recent years thanks to progress in computing technologies. Based on resources from natural language processing (NLP), our prototype 3AO, developed in this thesis, aims to open new avenues for computer-assisted learning of Arabic. One of its main goals is to give learners of Arabic tools suited to their pedagogical needs while relegating the computing aspects to the background. In this thesis, we present tools for computer-assisted learning of the Arabic language.
Among these tools is automatic recognition of Arabic speech, which is based on statistical acoustic modeling and on hidden Markov models (HMMs), which are used today in a very large number of automatic speech recognition systems. The other tools include the speech corpus, the Arabic pronunciation dictionary, the phonetizer for Arabic, the forced alignment technique, automatic identification of Arabic dialects, etc. Finally, we present our prototype for computer-assisted learning of Arabic (Apprentissage de l'Arabe Assisté par Ordinateur, 3AO), the results of our experiments, and the correction procedure that allows students to identify and correct their pronunciation errors.

Jury
- Mr. CHANIER Thierry, Professor at Université Blaise Pascal, Clermont-Ferrand 2, France, Reviewer.
- Mr. ZRIGUI Mounir, Associate Professor (HDR) at Université de Monastir, Tunisia, Reviewer.
- Mr. BESACIER Laurent, Professor at Université Joseph Fourier, Grenoble, France, Examiner.
- Mr. JERRAYA Ahmed, Research Director, CEA-LETI, MINATEC, Grenoble, France, Examiner.
- Mr. ANTONIADIS Georges, Associate Professor (HDR) at Université Stendhal, Grenoble, France, Thesis supervisor.
- Ms. FRANJIE Lynne, Associate Professor at Université Stendhal, Grenoble, France, Examiner.
Mohamed Belgacem

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:16:02 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:16:02 +0100
Subject: Job: Research Assistant in Natural Language Processing, UPF, Barcelona
Message-ID:

Date: Thu, 15 Dec 2011 09:43:02 +0100
From: Horacio Saggion
Message-ID:
X-url: http://www.taln.upf.edu/content/news/362

http://www.taln.upf.edu/content/news/362

The Natural Language Processing Research group (TALN) at the Department of Communication and Information Technologies, Universitat Pompeu Fabra, has a 12-month open position in the area of automatic text simplification in Spanish, in the context of the research project Simplext.

We are looking for candidates with some experience in natural language processing (NLP). Candidates with interest and/or experience in automatic text summarization, text simplification, text-to-text generation, and development of NLP applications are particularly welcome. We are looking for a candidate willing to stay in our group after the 12-month period to pursue a PhD in Natural Language Processing.

We require:
* A degree in Computer Science or related field.
* Experience in NLP, Machine Learning, or Artificial Intelligence
* Experience in programming and willingness to learn Java

We offer:
* A 12-month research contract
* Dynamic and collaborative working environment
* A maximum gross salary of 19,400 euros depending on experience
* The position is available immediately.

Universitat Pompeu Fabra (UPF) is recognised internationally for the quality of its research; it is also one of the top 5 Spanish universities in terms of scientific production and academic efficiency. The NLP group is situated in the Campus de la Comunicación in 22@, the innovation district of Barcelona.

If you are interested in working with us, please send the following documentation:
* Cover letter including your contact details
* A CV and list of publications
* The names and contact details of 2 people who can act as your references

Send the documentation by e-mail to Dr. Horacio Saggion: horacio dot saggion at upf dot edu. The subject of your e-mail should be "Simplext-TALN-RA".

If you have any informal query, please contact Dr. Horacio Saggion: horacio dot saggion at upf dot edu.
-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:19:58 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:19:58 +0100
Subject: Call: LREC 2012, Workshop NLP4ITA, Natural Language Processing for Improving Textual Accessibility
Message-ID:

Date: Sat, 17 Dec 2011 09:35:39 +0100
From: Horacio Saggion
Message-ID:
X-url: http://www.taln.upf.edu/nlp4ita

********************************************************************
First Call for Papers
LREC 2012 workshop on:
Natural Language Processing for Improving Textual Accessibility (NLP4ITA)
May 27, 2012 (Morning Session)
Istanbul, Turkey
http://www.taln.upf.edu/nlp4ita
********************************************************************

=== Introduction ===

This workshop aims to bring together researchers focused on tools and resources for making textual information more accessible to people with special needs, including people with diverse hearing and sight disabilities, people with cognitive disabilities, elderly people, low-literacy readers, and adults learning to read and write, among others.

=== Motivation and Topics of Interest ===

In recent years there has been an increasing interest in accessibility and usability issues. This interest is mainly due to the greater importance of the Web and the need to provide equal access and equal opportunity to people with diverse disabilities.
The role of assistive technologies based on language processing has gained importance, as can be observed from the growing number of efforts (United Nations declarations on universal access to information or WAI guidelines related to content) and research in conferences and workshops (W4A, ICCHP, ASSETS, SPLAT, etc.). However, language resources and tools to develop assistive technologies are still scarce. This workshop will give an opportunity for individuals from different communities to present research findings, discover future challenges, and discuss potential collaboration.

We welcome papers describing tools, resources, models, techniques and evaluation from all areas of natural language processing tailored to accessibility and assistive technologies. These include, but are not limited to, the following:

- Resources: corpora of inaccessible documents, aligned corpora, gold-standards and annotation schemes.
- Evaluation, complexity and readability methodologies and metrics.
- Novel modeling and machine learning techniques for improving accessibility.
- Natural language generation for improving accessibility.
- NLP for Web accessibility.
- Text adaptation, elaboration, simplification and summarization.
- User studies on systems and components.
- NLP tools for accessibility

=== Submission Information ===

We encourage contributions in the form of full papers (6 to 8 pages long). Papers should follow the LREC format which is available at the LREC 2012 Web Site. The submission process will be online using the START conference system at https://www.softconf.com/lrec2012/NLP4ITA

All accepted papers will be presented orally and published in the workshop proceedings.
=== Important dates ===

Feb 24, 2012     Paper submission deadline
March 23, 2012   Notification of acceptance
March 30, 2012   Camera-ready version
May 27, 2012     LREC 2012 Workshops

=== LRE Map 2012 ===

The LRE Map 2012 is an initiative promoted for monitoring the production, use and evaluation of language resources and tools. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.

=== Organizers ===

Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!)
Paloma Moreda (Universidad de Alicante)
Luz Rello (Universidad Pompeu Fabra)
Horacio Saggion (Universidad Pompeu Fabra)
Lucia Specia (University of Wolverhampton)

=== Program Committee ===

Sandra Aluisio (University of Sao Paulo)
Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!)
Delphine Bernhard (University of Strasbourg)
Nadjet Bouayad-Agha (Universidad Pompeu Fabra)
Richard Evans (University of Wolverhampton)
Caroline Gasperin (TouchType Ltd)
Pablo Gervás (Universidad Complutense de Madrid)
Simon Harper (University of Manchester)
David Kauchak (Middlebury College)
Guy Lapalme (University of Montreal)
Paloma Martínez (Universidad Carlos III de Madrid)
Aurelien Max (Paris 11)
Kathleen F. McCoy (University of Delaware)
Ornella Mich (Fondazione Bruno Kessler)
Ruslan Mitkov (University of Wolverhampton)
Paloma Moreda (Universidad de Alicante)
Constantin Orasan (University of Wolverhampton)
Luz Rello (Universidad Pompeu Fabra)
Horacio Saggion (Universidad Pompeu Fabra)
Advaith Siddharthan (University of Aberdeen)
Lucia Specia (University of Wolverhampton)
Juan Manuel Torres Moreno (University of Avignon)
Markel Vigo (University of Manchester)
Leo Wanner (Universidad Pompeu Fabra)
Yeliz Yesilada (Middle East Technical University Northern Cyprus Campus)

=== Contact Address ===

For further information please contact us at: luz.rello at upf.edu or horacio.saggion at upf.edu

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:21:36 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:21:36 +0100
Subject: Info: ELDA Distribution Campaign 2011
Message-ID:

Date: Thu, 15 Dec 2011 11:00:25 +0100
From: ELRA ELDA Information
Message-ID: <4EE9C539.7080605 at elda.org>
X-url: http://catalog.elra.info
X-url: http://universal.elra.info
X-url: http://www.elra.info/LRs-Announcements.html

[Our apologies if you have received multiple copies of this announcement.]
*****************************************************************
ELDA Distribution Campaign 2011
*****************************************************************

ELDA is launching a special distribution campaign offering very favorable conditions for acquiring language resources, including discounts on public prices, from the ELRA Catalogue of Language Resources (see http://catalog.elra.info). This offer will be open until the end of December 2011.

For more information on this offer, please contact Valérie Mapelli (mapelli at elda.org)

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:24:20 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:24:20 +0100
Subject: Job: Assistant Project Manager (M/F), ELDA
Message-ID:

Date: Thu, 15 Dec 2011 11:25:55 +0100
From: ELRA ELDA Information
Message-ID: <4EE9CB33.4060708 at elda.org>
X-url: http://www.elda.org/

ELDA (Evaluation and Language resources Distribution Agency, www.elda.org) is mainly engaged in the distribution and production of language resources and in the evaluation of language technologies.

As part of its production activities, ELDA is offering one position of assistant project manager (M/F).
Context

ELDA is taking part in a project to evaluate systems for the automatic analysis of written documents. The competing systems must automatically answer the following questions:
* How is the document structured (text zones, images...)?
* Which kinds of writing are present, with their type (handwritten/typed) and language (French, English, Arabic, other)?
* What is the key information in the document (author, recipient, subject, date...)?

To provide data for evaluating the automatic analysis systems, original documents are being collected. Written by (paid) volunteers under a fictitious identity assigned to them, these documents are based on fictitious scenarios and document templates created by ELDA (forms, order forms, catalogue pages, political or commercial leaflets, greeting cards, letterheads...). Once collected, the documents are manually described so that the systems' automatic analysis can be compared with human performance.

Mission

Working with the project manager, the assistant project manager will supervise both the collection of the fictitious documents produced by the 1000 remote volunteers and their description by a team of annotators:
* Managing volunteers and validating documents via the dedicated website
* Training annotators: getting started with the software, the production guide...
* Monitoring production: schedule, tasks, corrections, validation...
Profile sought
* University or engineering degree
* Versatility, rigor, good interpersonal skills, autonomy
* Computer proficiency (office tools, web; experience with PHP/MySQL would be a plus)
* A good level of English and experience in transcribing written documents would be appreciated

Duration: 12-month full-time fixed-term contract (CDD), with possible extension

This position, based in Paris, 13th arrondissement (RER Cité universitaire), is to be filled quickly. The salary is negotiable depending on the position and the candidate's profile.

Applications (CV, cover letter, salary expectations) should be sent to Matthieu Carré (carre at elda.org). They will be reviewed starting in January 2012.

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:27:57 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:27:57 +0100
Subject: Internship: Design of illustrated text documents (M/F), ELDA
Message-ID:

Date: Thu, 15 Dec 2011 11:27:44 +0100
From: ELRA ELDA Information
Message-ID: <4EE9CBA0.6000905 at elda.org>
X-url: http://www.elda.org/

ELDA (Evaluation and Language resources Distribution Agency, www.elda.org) is mainly engaged in the distribution and production of language resources and in the evaluation of language technologies.

As part of its production activities, ELDA is offering one internship as a designer of illustrated text documents (M/F).
Context

ELDA is taking part in a project to evaluate systems for the automatic analysis of written documents. The competing systems must automatically answer the following questions:
* How is the document structured (text zones, images...)?
* Which kinds of writing are present, with their type (handwritten/typed) and language (French, English, Arabic, other)?
* What is the key information in the document (author, recipient, subject, date...)?

To provide data for evaluating the automatic analysis systems, original documents are being collected. Written by (paid) volunteers under a fictitious identity assigned to them, these documents are based on fictitious scenarios and document templates created by ELDA (forms, order forms, catalogue pages, political or commercial leaflets, greeting cards, letterheads...). Once collected, the documents are manually described so that the systems' automatic analysis can be compared with human performance.

Mission

Under the responsibility of the project manager, the candidate will produce the document templates needed for writing the documents to be collected. The templates may be:
* image files that volunteers will print and then fill in by hand;
* .doc or .ppt files to be filled in electronically (typed) before printing;
* sample letters in image format to guide the writing of handwritten or typed letters.

Depending on their skills, the intern may also work on other aspects of the project, such as maintaining the collection website (PHP/MySQL), validating the collected documents, or annotating them (i.e., describing the documents using dedicated software).
Profile sought
* University or engineering degree
* Proficiency in editing with Word and PowerPoint
* Image processing skills (Gimp/Photoshop, handling of formats)
* Writing skills: spelling, grammar, inventiveness (devising the fictitious scenarios)
* A good level of English and/or Arabic is a plus (for producing or adapting the templates in the project's 2 other languages, French being the main language).

Duration: Long internship (4 months minimum)

This internship, based in Paris, 13th arrondissement (RER Cité universitaire), is to be filled before spring 2012.

Applications (CV, cover letter) should be sent to Matthieu Carré (carre at elda.org). They will be reviewed starting in January 2012.

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:30:34 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:30:34 +0100
Subject: Call: Third Workshop on Building and Evaluating Resources for Biomedical Text Mining, LREC 2012 Workshops
Message-ID:

Date: Fri, 16 Dec 2011 18:14:06 +0100
From: ELRA ELDA Information
Message-ID: <4EEB7C5E.2070106 at elda.org>
X-url: http://www.nactem.ac.uk/biotxtm2012/

[Apologies for cross-postings]

First Call for Papers

========================================================================
THIRD WORKSHOP ON BUILDING AND EVALUATING RESOURCES FOR BIOMEDICAL TEXT MINING

Saturday 26th May 2012
organised in conjunction with LREC2012 (21-27 May 2012, Lütfi Kirdar Istanbul
Exhibition and Congress Centre, Turkey)

http://www.nactem.ac.uk/biotxtm2012/
========================================================================

Over the past decade, biomedical text mining has received a large amount of interest. Faced with the rapidly increasing volume of biomedical literature, domain experts have an ever-increasing need for tools that can help them locate and isolate relevant nuggets of information from this deluge in a timely and efficient manner. The response to such issues by the natural language processing community can be clearly evidenced in the biomedical natural language processing workshops that have been held over the past 10 years, in conjunction with ACL or NAACL meetings, to report progress in the field, as well as in the founding of an ACL special interest group.

Biomedical text mining applications are reliant on high quality resources. These include databases and ontologies (e.g., Biothesaurus, UMLS Metathesaurus, MeSH and the Gene Ontology) and dictionaries/computational lexicons (e.g., the BioLexicon and the UMLS SPECIALIST lexicon). Recent years have also evidenced a large increase in the number of freely-available corpora (e.g., GENIA, GREC, AIMED, BioInfer, CRAFT, BioDRB) annotated with an expanding range of information types. These now include not only named entities and simple relations that hold between them, but also more complex event structures and coreference, as well as higher level information about how events are to be interpreted (e.g., facts, analyses, speculations, etc.) and discourse structure. Community shared tasks and challenges (e.g., JNLPBA, LL05, Biocreative I/II/III, BioNLP'09, BioNLP 2011, i2b2, etc.) also normally involve the production of annotated corpora (on which the participating systems are trained and evaluated) as well as helping to steer research efforts to focus on open research problems.
Following on from the success of two previous workshops, the workshop aims to bring together researchers who make use of biomedical text mining resources such as the above in their applications, or who are working on the development of new resources. The workshop will allow an assessment of the current state of the art of resources, and will provide a forum for the discussion of current problems, questions and open issues, which will be useful in guiding further research in this area.

Such topics are very much relevant to META-NET (a Network of Excellence consisting of 54 research centres from 33 countries), which is dedicated to building the technological foundations of a multilingual European information society. META-NET aims to push forward research to allow a rapid expansion of language technologies; such efforts can only be achieved if appropriate resources are available. Since META-NET is concerned with enhancing information access for all European citizens, submissions concerning biomedical resources for languages other than English are particularly welcome. A further vital consideration to allow rapid building of new applications is that of interoperability and reuse. As a step towards this, several annotated corpora have been made UIMA-compliant, and are available in the U-Compare system, which allows easy construction of NLP workflows and evaluation against gold standard corpora.

Some specific questions that the workshop will aim to answer include the following:

* Among the available resources, which are the most used? What makes a good resource? How easily can resources be employed for different purposes? What efforts have been made to make resources reusable or interoperable? To what extent have these efforts been successful?
* Which resources are underused and why? What could be done to improve or extend them to improve their utility?
* Which types of resources are still lacking and what is needed urgently?
  Are any resources planned or in development to address such gaps? Are any resources available that cover languages other than English?
* To what extent do the existing resources support processing of text in different biomedical subdomains? How easily can they be adapted to deal with different domains?

We invite papers reporting on resources that facilitate biomedical text mining, and the process of designing, building, updating, delivering, using and evaluating them. The workshop will focus both on the lexical and knowledge repositories themselves (e.g., terminologies, ontologies, controlled vocabularies, factual databases, annotated corpora, etc.) as well as on issues relating to their usability (e.g., design guidelines, standards for building resources, storage and exchange formats, interoperability issues, etc).

Topics of interest include but are not limited to:

* Building biomedical resources: controlled vocabularies, terminologies, ontologies, corpora
* Guidelines and annotation schemas, tools, challenges, interoperability
* Reengineering existing biomedical or general language resources
* Update, evolution, extension or enrichment of resources
* Adapting resources to new sub-domains
* Interoperability of resources and standards
* Lightly annotated and noisy resources
* Tools for exploration of resources
* Data exchange formats
* Evaluation, comparison and critical assessment of resources / evaluation metrics
* Test suites

ORGANISATION

* Sophia Ananiadou, National Centre for Text Mining, University of Manchester, UK
* Kevin Bretonnel Cohen, Computational Bioscience Program, University of Colorado School of Medicine, USA
* Dina Demner-Fushman, National Library of Medicine, USA
* Paul Thompson, National Centre for Text Mining, University of Manchester, UK

DATES

February 10, 2012   Paper submissions due
March 14, 2012      Paper notification of acceptance
March 28, 2012      Camera-ready papers due
May 26, 2012        Workshop

SUBMISSIONS

Papers must describe original, completed or in
progress, and unpublished work. Each submission will be reviewed by two program committee members. Accepted papers will be given up to 8 pages in the workshop proceedings, and will be presented either as an oral presentation or poster. Papers should be formatted according to the stylesheet, which will be provided on the LREC 2012 website (http://www.lrec-conf.org/lrec2012/). Please submit papers in PDF format to: https://www.softconf.com/lrec2012/BiomedTextMining2012/ Paper review will be blind, so papers should not include authors' names and affiliations. Accepted papers will be published in the workshop proceedings. When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012. 
PC MEMBERS Jari Björne, University of Turku, Finland Olivier Bodenreider, National Library of Medicine, USA Wendy Chapman, University of Pittsburgh, USA Hongfang Liu, Mayo Clinic, USA Naoaki Okazaki, Tohoku University, Japan Sampo Pyysalo, University of Manchester, UK Andrey Rzhetsky, University of Chicago, USA Stefan Schulz, Medical University Graz, Austria Lucy Vanderwende, Microsoft, USA Karin Verspoor, NICTA, Australia John Wilbur, NCBI, NLM, NIH, USA Stephen Wu, Mayo Clinic, USA Pierre Zweigenbaum, LIMSI, France Workshop contact person: Paul.Thompson at manchester.ac.uk National Centre for Text Mining, School of Computer Science, University of Manchester, UK ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:33:36 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:33:36 +0100 Subject: Appel: Workshop on Language Technology for Patent Data: Language Resources and Evaluation, LREC 2012 Workshops Message-ID: Date: Fri, 16 Dec 2011 18:25:42 +0100 From: ELRA ELDA Information Message-ID: <4EEB7F16.9070301 at elda.org> X-url: http://workshops.elda.org/ltpd2012/ [Apologies for cross-postings] *CALL FOR PAPERS* Workshop on Language Technology for Patent Data: Language Resources and Evaluation To be held in conjunction with the 8th International Language Resources and Evaluation Conference (LREC 2012) 27 May 2012 (afternoon) Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey 
http://workshops.elda.org/ltpd2012/ *Workshop Description* In the last few years, the automatic processing of patents has attracted growing interest in the NLP community, particularly in the context of Machine Translation (MT) and Cross-Lingual Information Retrieval (CLIR). This has now become a major topic and, besides the development of the technology itself, some key questions remain regarding the resources available and the way to evaluate the quality of the technology. A large number of language resources are already available to the community, but the development of systems, in particular statistical ones, always requires more data. As there is growing interest in patents and their processing, a workshop that gathers all those involved in the different aspects of the topic is a good opportunity to move forward. The patent domain itself is growing and the amount of potential material keeps increasing. It is this potential material that gives the community hope of improving the systems. For instance, in China, the number of patents has tripled in 5 years and now exceeds 1 million published documents per year. EPO (the European Patent Office) uses more than 150 translation pairs per day. Every patent office receives more and more patents every day and needs to use automatic tools daily to translate the documents, look for existing patents and their translations, manage complex content, etc. As we can see, this is a domain in considerable demand and, since the content of patents is technical and requires strong expertise in a specific domain, providing documents that are sufficiently understandable to end users is very complex. This is a real challenge for all NLP developers. Above all, this challenge is about corpora and their management. The main topic concerns their acquisition and how to collect useful data. 
For most researchers, this consists in harvesting web pages, cleaning them, extracting the content useful for a specific task, aligning the sentences, etc. Acquisition may also be done by applying OCR tools to PDF files. Monolingual corpora are easier to retrieve (e.g. from databases) than parallel corpora. However, parallel translations exist, as do aligned corpora or corpora that could easily be aligned. Following the question of acquiring such documents, there is that of database management. One could say that these questions are not specific to patent data; however, this workshop would like to focus on this particular domain and make some effort to improve things. Currently, the corpora are mainly used for MT. For a technical end-user in a patent office, the end goal is to understand the content of a document. This may not require a very high quality translation since this person only needs to grasp the relevance of the document. However, in MT, we still need to measure the performance of the systems quantitatively. This is basically done using automatic and/or human measures, and most system developers use typical automatic metrics such as BLEU to obtain their results. Even if the drawbacks of such metrics are well known, they can still be relevant, for instance, to compare different versions of a system. However, even when using BLEU, the content of patent documents is very particular, which implies that different kinds of linguistic specificity need to be tackled: these include the expected terminological level, but also a syntactic level and a semantic one, and even the structure of the documents may differ from that of other documents (for instance, patents typically comprise a title, an abstract, a technical description of the invention, and a list of novel claims). 
Human measures may also be difficult to apply, as patent documents are written in a way that makes them difficult for the layman to read. Furthermore, both automatic and human evaluations should allow a deep analysis of the results, which is not trivial when working with patents. However, given the often formulaic nature of the text found in patents -- which is imposed on the author by legal constraints -- there may be opportunities to exploit this for evaluation. For instance, claims are constructed as a single sentence with an introductory phrase and a body linked by frequently occurring terms such as "in a certain embodiment", "consisting essentially of", and clauses and lists introduced using colons, e.g. "comprising: ..." The use of patents in CLIR suffers from the same kind of issues, both for the evaluation of systems and for the collection of corpora. Sentence alignment may also have specific issues related to the content of the documents, and many other types of tools may face their own difficulties when applied to patents. Across all these technologies, one can see that their use implies several challenges, such as the integration of tools into patent information applications. The different tools should help end-users to search, examine or classify patent documents, most of the time from translations of documents that are not available in English. Web services should also be an extension of the tools, and they should be connected through workflows, helping end-users in their daily work. Covering all the topics mentioned above, we would like to contribute to improving the challenging patent field by sharing knowledge across the whole community. 
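As an illustration of the formulaic claim structure described above, here is a minimal Python sketch that splits a single-sentence claim into its introductory phrase, the linking term, and the listed elements. It is only a sketch under stated assumptions: the function name split_claim, the inclusion of "consisting of" in the phrase list, and the sample claim are all hypothetical and do not come from the call itself.

```python
import re

# "comprising" and "consisting essentially of" are quoted in the call;
# "consisting of" is an extra phrase assumed here for illustration.
TRANSITIONS = ["consisting essentially of", "consisting of", "comprising"]

# One alternation so that the leftmost linking term in the claim is found.
_TRANSITION_RE = re.compile(
    r"\b(" + "|".join(re.escape(t) for t in TRANSITIONS) + r")\b\s*:?",
    re.IGNORECASE,
)


def split_claim(claim):
    """Return (preamble, transition, elements) for a single-sentence claim."""
    match = _TRANSITION_RE.search(claim)
    if match is None:
        # No known linking term: return the claim unchanged, with no body.
        return claim.strip(), None, []
    preamble = claim[:match.start()].strip(" ,")
    body = claim[match.end():].strip(" .")
    # Elements are assumed to be separated by semicolons, optionally
    # followed by "and" before the last element.
    elements = [p.strip() for p in re.split(r";\s*(?:and\s+)?", body) if p.strip()]
    return preamble, match.group(1).lower(), elements


# Hypothetical sample claim, written for this example only.
sample = "A translation device comprising: a tokenizer; an aligner; and a decoder."
result = split_claim(sample)
print(result)
```

A segmentation of this kind could, for example, let an evaluation compare the preamble and each claim element separately instead of scoring the whole sentence at once; real claims are far less regular than this toy pattern assumes.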
Topics addressed during the workshop include (but are not limited to): - Corpora aspects: collecting data, cleaning, alignment, parallel corpora, etc.; - Evaluation of technologies: definition of metrics, patent specificity; - Integration of patent applications: web services, end-user applications; - IPR issues and licensing. *Organising committee* Heidi Depraetere (Crosslang, Belgium) Olivier Hamon (ELDA -- Evaluations and Language resources Distribution Agency, France) John Tinsley (PLUTO -- Patent Language Translations Online, Ireland) *Programme committee* Victoria Arranz (ELDA -- Evaluations and Language resources Distribution Agency, France) Alexandru Ceasusu (PLUTO - Patent Language Translations Online, Ireland) Khalid Choukri (ELDA, France) Terumasa Ehara (Yamanashi Eiwa College, Japan) Cristina España-Bonet (UPC, Spain) Mihai Lupu (IRF and ESTeam, Austria) Bertrand Le Chapelain (EPO, Netherlands) Bente Maegaard (University of Copenhagen, Denmark) Bruno Pouliquen (World Intellectual Property Organization, Switzerland) Lucia Specia (University of Sheffield, United Kingdom) Gregor Thurmair (Linguatec, Germany) Dan Wang (China Patent Information Center, China) Shoichi Yokoyama (Yamagata University, Japan) More to follow... *Important dates* Deadline for submission: Friday 24 February 2012 Notification of acceptance: Friday 23 March 2012 Final version due: Friday 30 March 2012 Workshop: 27 May 2012 (afternoon) *Submission Format* Full papers up to 8 pages should be formatted according to LREC 2012 guidelines and be submitted through the online submission form (https://www.softconf.com/lrec2012/PATENT2012/) on START. For further queries, please contact Olivier Hamon at hamon_at_elda_dot_org. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) 
that have been used for the work described in the paper or are a new result of your research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LREMap-2012. ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:36:10 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:36:10 +0100 Subject: Appel: Describing Language Resources with Metadata: Towards Flexibility and Interoperability in the Documentation of Language Resources LREC 2012 Workshops Message-ID: Date: Fri, 16 Dec 2011 18:37:36 +0100 From: ELRA ELDA Information Message-ID: <4EEB81E0.1030306 at elda.org> X-url: http://workshops.elda.org/metadata2012/ *CALL FOR PAPERS* Workshop on *Describing Language Resources with Metadata: * *Towards Flexibility and Interoperability in the Documentation of Language Resources* ** To be held in conjunction with the 8^th International Language Resources and Evaluation Conference (LREC 2012) *22 May 2012, **Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey* http://workshops.elda.org/metadata2012/ /Deadline for submission: 19 February 2012/** *Description* The description of Language Resources (LRs) continues to be a crucial point in the life cycle of LRs, and more particularly, in their sustainable exchange. 
This has been so for a number of existing repositories and LR distribution centres (ELRA, GSK, LDC, OLAC, TST-Centrale, BAS, among others), which house LR catalogues following some proprietary metadata schema. A number of projects and initiatives have also focused over the past few years on the sharing of LRs (ENABLER, CLARIN, FLaReNet, PANACEA, META-SHARE), for example for Language Technology (LT). Based on these initiatives, a consensus is emerging around a number of requirements for standardized metadata: 1. There should be a common publication channel for the LR descriptions in the world. 2. This channel should allow users to carry out easy and efficient LR data discovery and possible subsequent retrieval of LRs. 3. Expert knowledge is required to create the data model for the metadata description. 4. Subject matter experts (both researchers and LR/LT providers and developers) are required to provide the content for the data model. 5. The data model needs to be clear, expressive, flexible, customizable and interoperable. 6. Metadata have to provide for different user groups, ranging from providers to consumers (both individuals and organisations). This applies both to the information contained in the metadata and to the supporting tool infrastructure for creating, maintaining, distributing, harvesting and searching the metadata. Currently several initiatives focus on metadata. Work done within initiatives like ENABLER and CLARIN gave rise to the Component MetaData Infrastructure (CMDI, ISO TC 37 SC 4 work item for ISO 24622), which allows standard data categories (for example from ISO 12620, isocat.org) to be combined into components, which are in turn combined into metadata profiles. Early versions of this model have been operational in repositories such as ELRA's, which complied with the work done within INTERA. 
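The layering described above (data categories combined into components, components combined into profiles) can be pictured with a small Python sketch. This is a toy model only, not the actual CMDI schema, which is XML-based; every class name, field name, and registry identifier below is a hypothetical placeholder chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class DataCategory:
    """A single, registry-defined metadata concept (hypothetical model)."""
    name: str
    registry_id: str  # placeholder identifier, not a real registry entry


@dataclass
class Component:
    """A reusable group of data categories."""
    name: str
    categories: List[DataCategory] = field(default_factory=list)


@dataclass
class Profile:
    """A metadata profile assembled from components."""
    name: str
    components: List[Component] = field(default_factory=list)

    def category_names(self) -> List[str]:
        """List each data category once, in order of first appearance."""
        seen: List[str] = []
        for component in self.components:
            for category in component.categories:
                if category.name not in seen:
                    seen.append(category.name)
        return seen


# Illustrative data only: names and identifiers are made up.
title = DataCategory("resourceTitle", "DC-hypothetical-1")
language = DataCategory("languageCode", "DC-hypothetical-2")
licence = DataCategory("licence", "DC-hypothetical-3")

general = Component("GeneralInfo", [title, language])
access = Component("Access", [licence, language])
profile = Profile("TextCorpusProfile", [general, access])

print(profile.category_names())
```

Because the same data category can appear in several components, two profiles built from different components can still be compared through the categories they share, which is the interoperability argument the call makes.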
FLaReNet, as the result of a permanent and cyclical consultation, has issued a set of main recommendations in which a global infrastructure of uniform and interoperable metadata sets appears among the Top Priorities for the field of LRs. For use within HLT, META-SHARE provides a fully-fledged schema for the description of LRs, in the framework of the component model, covering all the current resource types and media types in use, in all the stages of a resource's life-cycle. Our aim is to learn from one another's experiences and plans in this area. Making resources available to others and putting them to a second use in other projects has never been more widely accepted as a sensible, efficient way to avoid wasting effort and resources. However, when it comes to the details, there is still a vast number of problems. This workshop will be a forum to address issues and challenges in the concrete work with metadata for LRs, not restricted to a single initiative for archiving LRs. The current state of the art for metadata provision allows for a very flexible approach, catering for the needs of different archives and communities, and referring to common data category registries that describe the meaning of a data category at least to authors of metadata. Component models for metadata provision are used, for example, by CLARIN and META-SHARE, but there is also increased flexibility in other metadata schemas such as Dublin Core, which is usually not seen as appropriate for a meaningful description of language resources. Topics of interest are: 1. Infrastructures for creating components and profiles for metadata 2. Editing and creating metadata 3. Porting legacy metadata 4. Metadata as a resource 5. Maintenance of metadata 6. Classification of language resources 7. Providing metadata concepts 8. Creating components and profiles 9. Services harvesting and interpreting metadata 10. 
Experience from the large LR data center catalogues: LDC, ELRA, BAS, and how to interoperate with them 11. Controlled vocabularies, terminology and metadata description 12. Formal models for metadata representation and standardized models of serialisation 13. Customization and reuse of metadata schemas 14. Plans or experiences with emerging metadata infrastructures, for example from CLARIN & META-SHARE 15. Experiences with component-based metadata infrastructures 16. Integration and conversion of multiple repositories: experiences from META-SHARE, CESAR, METANET4U and META-NORD, etc. 17. Standardization issues for metadata We invite submissions for full papers and system demonstrations that address these questions and other related issues relevant to the workshop. *Workshop Programme and Audience Addressed* This full-day workshop aims to bring together technology-oriented working groups on metadata modeling or schema creation and both researchers and producers who create metadata in the course of their work. Those interested in using metadata in their projects should gain insights and come away with a clear idea of how to either describe their LRs or convert their schema. Those who have recently developed a model can share their experience, and those who have specific concerns about the interoperability of metadata schemas developed by the various initiatives can open the discussion in search of joint solutions. Tools and tool infrastructures should also be part of the discussion, given that the initiatives also provide editors, mappings, search interfaces, and component and profile registries. 
*Organising Committee* Victoria Arranz (ELDA/ELRA, Paris, France, arranz at elda.org) Daan Broeder (MPI, Nijmegen, The Netherlands, daan.broeder at mpi.nl) Bertrand Gaiffe (ATILF, Nancy, France, Bertrand.Gaiffe at atilf.fr) Maria Gavrilidou (Athena Research and Innovation Center, Athens, Greece, maria at ilsp.athena-innovation.gr) Monica Monachini (CNR-ILC, Pisa, Italy, monica.monachini at ilc.cnr.it) Thorsten Trippel (University of Tübingen, Tübingen, Germany, thorsten.trippel at uni-tuebingen.de) *Programme Committee* Helen Aristar-Dry (Michigan State University, USA) Núria Bel (UPF, Barcelona, Spain) Antonio Branco (University of Lisbon, Portugal) Lars Borin (Språkbanken, Sweden) Khalid Choukri (ELDA/ELRA, Paris, France) Thierry Declerck (DFKI, Germany) Matej Durco (Austrian Academy of Sciences, Austria) Gil Francopoulo (CNRS-LIMSI-IMMI + TAGMATICA, Paris, France) Francesca Frontini (CNR-ILC, Pisa, Italy) Erhard Hinrichs (Universität Tübingen, Germany) Penny Labropoulou (ILSP-Athena, Athens, Greece) Valérie Mapelli (ELDA/ELRA, Paris, France) Jan Odijk (Universiteit Utrecht, The Netherlands) Elena Pierazzo (Kings College, London, UK) Laurent Romary (INRIA, France) Mike Rosner (University of Malta, Malta) Andreas Witt (IDS, Germany) Peter Wittenburg (MPI, The Netherlands) Tamás Varadi (Hungarian Academy of Sciences, Hungary) Marta Villegas (UPF, Barcelona, Spain) Sue Ellen Wright (Kent State University, USA) *Important dates* Submission of full papers: Sunday 19 February 2012 Notification of acceptance of papers and demonstrations: Thursday 22 March 2012 Submission of final version: Saturday 31 March 2012 Final programme available: Friday 13 April 2012 Workshop: Tuesday 22 May 2012 *Submission* Authors should use the START system accessible from https://www.softconf.com/lrec2012/Metadata2012 and the LREC author's kit for submitting a two-column article of 4 to 8 pages. 
For further queries, please contact Victoria Arranz at arranz at elda.org or Thorsten Trippel at thorsten.trippel at uni-tuebingen.de. When submitting a paper through START, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012. ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:42:01 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:42:01 +0100 Subject: Appel: Rencontres Jeunes Chercheurs 2012 ED 268 Paris 3 Message-ID: Date: Thu, 15 Dec 2011 14:44:29 +0100 From: RJC 2012 Message-ID: X-url: http://www.univ-paris3.fr/rjc2012 Hello, Please find attached the call for papers for RJC 2012. Please circulate it as widely as possible within your laboratory. We apologise for any multiple postings. 
Best regards, Jiayin Gao Organising Committee -------------------------------------------------------------------- Dear doctoral students, Doctoral School 268 *"Langage et langues : description, théorisation, transmission"* (Language and languages: description, theorisation, transmission) of Université Sorbonne Nouvelle - Paris 3 is organising the 15th Rencontres Jeunes Chercheurs (Young Researchers' Meeting, RJC 2012) on 15 and 16 June 2012. The call for papers follows. Please visit the RJC website, http://www.univ-paris3.fr/rjc2012, for more information. *École Doctorale 268 "Langage et langues : description, théorisation, transmission"* *15th Rencontres Jeunes Chercheurs* Established in 1998, the Rencontres Jeunes Chercheurs of the Doctoral School "Langage et langues" (ED 268, Université Paris III) give young researchers enrolled in a doctoral or research Master's programme the opportunity to present their work as an oral presentation or a poster. At the crossroads of disciplines: inter- and transdisciplinarity in the language sciences Since the second half of the 19th century, a process of fragmentation of knowledge, both scientific and political, has helped to shape the current research landscape. New disciplines, such as psychology and sociology, have become institutionalised. At the same time, the various fields of knowledge have become increasingly specialised, in the humanities as in the other sciences. In recent years, a sense of an "identity crisis" (Bouvier, 2004) among disciplines has shown itself in the growing number of research programmes that aim to promote collaboration or confrontation between different specialities, in order to study phenomena as a whole. The study of linguistic facts requires multiple points of view (the study of sound, structure, meaning, etc.) and borrowings from logic, the cognitive sciences, anthropology, history, etc. 
Thus, the language sciences are emblematic of these exchanges between fields of knowledge. Taking an interest in the relations between disciplines means questioning the notion of "discipline" itself, and analysing the relations of inclusion ("sub-disciplines", "components") and the internal boundaries of a given discipline. A discipline cannot be delimited without appealing to criteria that are at once epistemological (definition of the object of study, methodology, formulation of a coherent set of explanations, etc.), social and historical (institutionalisation). One must also define the possible modes of interaction between disciplines. Following Frédéric Darbellay (2005) and Patrick Charaudeau (2010), we distinguish three: "pluridisciplinarity", "interdisciplinarity" and "transdisciplinarity". A research approach is "pluridisciplinary" when each discipline retains its autonomy, methods and tools; it is "a juxtaposition of points of view, each of which contributes particular knowledge about the phenomenon studied" (Charaudeau, 2010). An approach is "interdisciplinary" when it strives to bring several disciplines into dialogue on the same theme, thereby seeking to "establish genuine connections between concepts, analytical tools and modes of interpretation" (*ibid.*). Finally, an approach is "transdisciplinary" when it transcends disciplines, achieving an "integration of knowledge [...] such that a *sui generis* discourse emerges, constructing its own place of thought" (*ibid.*). Inter- and transdisciplinarity are not merely an "intellectual fashion". They can be seen as a "precautionary principle" that makes it possible to keep at a distance "the visions and divisions of scientific work that disciplines impose". 
As Nicolas Freymond *et al.* (2003) point out: "forgetting that it is the point of view that creates the object leads to confining scientific practice within limits that have no theoretical relevance". Critical recourse to several (sub-)disciplines or disciplinary currents therefore appears necessary. In this perspective, RJC 2012 invites participants to reflect on the possibilities of exchange (of concepts, methods, results, etc.), confrontation, or even fusion between different specialities. We will favour in particular papers that address the following questions: - What is a "discipline"? What are the relations between "disciplines" and "knowledge"? How did today's disciplines and sub-disciplines come to be constituted historically? - How have the theories and practices of inter- and transdisciplinarity evolved socio-historically? - Is monodisciplinarity still possible in the language sciences? - What prospects does inter- and transdisciplinary research offer? - What are the risks of breaking down disciplinary barriers? What are its limits and constraints? - What bridges can really be built between disciplines? - How do the different specialities position themselves with respect to inter- and transdisciplinary exchanges? - How is interdisciplinarity built within research teams? *Selected bibliography* - BOUTET, Josiane & MAINGUENEAU, Dominique (2005) "Approches interdisciplinaires des pratiques langagières et discursives", *Langage et société*, 114, Paris, MSH. 
- BOUVIER, Pierre (2004) "Interdisciplinarité, monodisciplinarité, transdisciplinarité" [online], *Socio-anthropologie*, 14, URL: http://socio-anthropologie.revues.org/index372.html - CHARAUDEAU, Patrick (2010) "Pour une interdisciplinarité « focalisée » dans les sciences humaines et sociales" [online], *Questions de Communication*, 17, URL: http://www.patrick-charaudeau.com/Pour-une-interdisciplinarite.html - CHISS, Jean-Louis & PUECH, Christian (1999) *Le Langage et ses disciplines*, Paris, Duculot. - DARBELLAY, Frédéric (2005) *Interdisciplinarité et trans-disciplinarité en analyse des discours. Complexité des textes, intertextualité et transtextualité*, Geneva, Slatkine. - FREYMOND, Nicolas *et al.* (2003) "Ce qui donne sens à l'interdisciplinarité" [online], *A contrario*, 1, URL: www.cairn.info/revue-a-contrario-2003-1-page-3.htm - KOURILSKY, François (ed.) (1990) *Carrefour des sciences*, Proceedings of the CNRS colloquium "Inter-disciplinarité", Paris, Éditions du CNRS. - ORIGGI, Gloria & DARBELLAY, Frédéric (eds.) (2010) *Repenser l'interdisciplinarité*, Geneva, Slatkine. 
- RAMADIER, Thierry (2011) "L'interdisciplinarité au service de la complexité" [online], *Articulo - Journal of Urban Research*, Book Reviews, URL: http://articulo.revues.org/1731 *Invited speakers:* Dan SAVATOVSKY, Professor, Université de Bourgogne *Scientific Committee:* Violaine BIGOT Maria CANDEA Francine CICUREL Jeanne-Marie DEBAISIEUX Cécile FOUGERON Jean-Marie FOURNIER Emmanuel FRAISSE Florentina FREDET Cédric GENDROT Kim GERDES Daniel GILE Luca GRECO Jean-Patrick GUILLAUME Pierre HALLÉ Kkka KABORE Dominique KLINGLER Marie-Christine LALA Florence LEFEUVRE Jean-Léo LÉONARD Marc-Antoine MAHIEU Aliyah MORGENSTERN Jean-Paul NARCY-COMBES Gilles PHILIPPE Claire PILLOT-LOISEAU Konstantin POZDNIAKOV Christian PUECH Nicolas QUINT Sandrine REBOUL-TOURÉ Annie RIALLAND Rachid RIDOUANE Anne SALAZAR ORVIG Pollet SAMVELIAN Dan SAVATOVSKY Jacqueline VAISSIÈRE Daniel VÉRONIQUE Martine VERTALIER *Organising Committee:* Marion CARNIS, Alejandro DIAZ VILLALBA, Aurélia ELALOUF, Raphaëlle FOUILLET, Jiayin GAO, Maximilien GUÉRIN, Marine LE MENÉ, Laura NICOLAS, Marie REETZ, Marie RIVIÈRE, Camille SIMON, Marie VIAIN *The conference is open to all: Master's students, doctoral students, researchers...* *Free admission, subject to available seats.* *A certificate of attendance will be given to participants.* *Submission* Proposals for oral presentations should be written in Times New Roman 12, single-spaced, in the form of an abstract of at most 1000 words (references included), and poster proposals in the form of an abstract of at most 500 words (references included). For phonetic transcriptions, please use the downloadable SILDoulos font. Proposals should be sent to the Organising Committee by email ( rjc.ed268.2012 at gmail.com) in two copies in .rtf format: the first will be named « anon_nom-de-l-auteur_rjc2012. 
rtf » (for example "anon_DUPONT_rjc2012.rtf", where nom-de-l-auteur stands for the author's surname) and will contain: *The title* *5 keywords* *The discipline(s)* *The abstract* The type of presentation (oral presentation or poster) The second, named "nom-de-l-auteur_rjc2012.rtf" (for example "DUPONT_rjc2012.rtf"), must include the following information in addition to the above: *Contact details (surname, first name, email and postal address)* *Affiliation (name of the university, name of the laboratory)* *Level of study (Master's / doctorate / post-doc; specify the year of study for the doctorate)* *Research supervisor* *Only one submission per participant will be considered.* Oral presentations and posters may be given in English or French. Presentations will last 20 minutes, followed by 10 minutes of discussion. *Contact and submission:* rjc.ed268.2012 at gmail.com *Schedule* Submission deadline: 15 February 2012 Notification and opening of registration: 30 March 2012 Deadline for registration and submission of the abstract: 15 April 2012 Conference: *15 & 16 June 2012* ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:09:26 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 15:09:26 +0100 Subject: Appel: Ireland International Conference on Education (IICE-2012), Reminder Message-ID: Date: Thu, 15 Dec 2011 23:16:35 +0000 (GMT) From: Linda Woods Message-ID: <37988060.493992.1323990995212.JavaMail.open-xchange at 
email.1and1.co.uk X-url: http://www.iicedu.org/ Apologies for cross-postings. Kindly email this call for papers to your colleagues, faculty members and postgraduate students. CALL FOR PAPERS ********************************************************** Ireland International Conference on Education (IICE-2012), April 16-18, 2012, Dublin, Ireland (www.iicedu.org) ********************************************************** The IICE is an international refereed conference dedicated to the advancement of the theory and practices in education. The IICE promotes collaborative excellence between academicians and professionals from Education. The aim of IICE is to provide an opportunity for academicians and professionals from various educational fields with cross-disciplinary interests to bridge the knowledge gap and to promote research esteem and the evolution of pedagogy. The IICE 2012 invites research papers that encompass conceptual analysis, design implementation and performance evaluation. All accepted papers will appear in the proceedings, and modified versions of selected papers will be published in special issues of peer-reviewed journals. 
The topics in IICE-2012 include but are not confined to the following areas: * Academic Advising and Counselling * Art Education * Adult Education * APD/Listening and Acoustics in Education Environment * Business Education * Counsellor Education * Curriculum, Research and Development * Competitive Skills * Continuing Education * Distance Education * Early Childhood Education * Educational Administration * Educational Foundations * Educational Psychology * Educational Technology * Education Policy and Leadership * Elementary Education * E-Learning * E-Manufacturing * ESL/TESL * E-Society * Geographical Education * Geographic information systems * Health Education * Higher Education * History * Home Education * Human Computer Interaction * Human Resource Development * Indigenous Education * ICT Education * Internet technologies * Imaginative Education * Kinesiology & Leisure Science * K12 * Language Education * Mathematics Education * Mobile Applications * Multi-Virtual Environment * Music Education * Pedagogy * Physical Education (PE) * Reading Education * Writing Education * Religion and Education Studies * Research Assessment Exercise (RAE) * Rural Education * Science Education * Secondary Education * Second life Educators * Social Studies Education * Special Education * Student Affairs * Teacher Education * Cross-disciplinary areas of Education * Ubiquitous Computing * Virtual Reality * Wireless applications * Other Areas of Education Important Dates: * Extended Abstract (Work in Progress) Submission Date: January 15, 2012 * Research Paper, Student Paper, Case Study, Report Submission Date: January 20, 2012 * Proposal for Workshops: December 15, 2011 * Notification of Workshop Acceptance/Rejection: December 20, 2011 * Poster Proposal Submission Date: January 10, 2012 * Notification of Poster Acceptance/Rejection: January 15, 2012 * Notification of Extended Abstract (Work in Progress) Acceptance/Rejection: January 25, 2012 * Notification of Research Paper, Student 
Paper, Case Study, Report Acceptance/Rejection: January 30, 2012
* Camera Ready Paper Due: February 10, 2012
* Early Bird Registration Deadline (Authors and Participants): February 15, 2012
* Late Bird Registration Deadline (Authors and Participants): March 10, 2012
* Conference Dates: April 16-18, 2012

For further information please visit IICE-2012 at www.iicedu.org

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:10:56 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:10:56 +0100
Subject: Appel: ESSLLI 2012 Student Session
Message-ID:
Date: Fri, 16 Dec 2011 09:29:47 +0100
From: ESSLLI 2012 Student Session
Message-ID: <4EEB017B.6020905 at loriweb.org>
X-url: http://loriweb.org/ESSLLI2012StuS/
X-url: http://esslli2012.pl/

** APOLOGIES FOR MULTIPLE POSTINGS **
*** PLEASE DISTRIBUTE TO STUDENTS ***

First Call for Papers
*ESSLLI 2012 STUDENT SESSION*
Held during the 24th European Summer School in Logic, Language and Information
Opole, Poland, August 6-17, 2012
Deadline for submissions: March 20, 2012
http://loriweb.org/ESSLLI2012StuS/

*ABOUT:*
The Student Session of the 24th European Summer School in Logic, Language, and Information (ESSLLI) will take place in Opole, Poland on August 6-17, 2012. We invite submissions of original, unpublished work from students in any area at the intersection of Logic & Language, Language & Computation, or Logic & Computation.
Submissions will be reviewed by several experts in the field, and accepted papers will be presented orally or as posters and will appear in the student session proceedings. This is an excellent opportunity to receive valuable feedback from expert readers and to present your work to a diverse audience.

ESSLLI 2012 will feature a wide range of foundational and advanced courses and workshops in all areas of Logic, Language, and Computation. Consult the main ESSLLI website (link below) for further information.

*INSTRUCTIONS FOR AUTHORS:*
Authors must be students, i.e., may not have received the Ph.D. degree before August 2012. All submissions must be in PDF format and be submitted to the conference EasyChair website. Submissions may be singly or jointly authored. No one may submit more than one singly and one jointly authored paper.

There are two types of papers. Long papers of up to 8 pages will be considered for both oral presentation and the poster session. Short papers of up to 4 pages will be considered as submissions for the poster session.

Submissions must be anonymous, without any identifying information, and must be received by *March 20, 2012*. More detailed guidelines regarding submission can be found on the Student Session website: http://loriweb.org/ESSLLI2012StuS/ Links to previous years' proceedings are also available there.

Please direct inquiries about submission procedures or other matters relating to the Student Session to esslli2012stus at loriweb.org. For general inquiries about ESSLLI 2012, please consult the main ESSLLI 2012 page, http://esslli2012.pl/.

Kind regards,
The ESSLLI 2012 Student Session Organization Committee,
Rasmus K.
Rendsvig (Roskilde University) (chair)
Anders Johannsen (University of Copenhagen)
Dominik Klein (Tilburg University)
Margot Colinet (Université de Paris 7)
Matthijs Westera (Universiteit van Amsterdam)
Maxim Haddad (University of Osnabrück)

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:12:00 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:12:00 +0100
Subject: Appel: LACL 2012
Message-ID:
Date: Fri, 16 Dec 2011 11:34:51 +0100
From: Denis Bechet
Message-Id: <201112161134.51185.bechet at info.univ-nantes.fr>
X-url: http://lacl.gforge.inria.fr/lacl-2012

LACL 2012
Logical Aspects of Computational Linguistics
July 2-4, 2012
LINA, Nantes, France
http://lacl.gforge.inria.fr/lacl-2012

PRESENTATION
LACL'2012 is the 7th edition of a series of international conferences on logical and formal methods in computational linguistics. This conference addresses the use of type theoretic, proof theoretic and model theoretic methods for describing natural language syntax and semantics, as well as the implementation of natural language processing software relying on such models. It will be held at the LINA, Nantes, France.

Scope: Computer scientists, linguists, mathematicians and philosophers are invited to present their work on the use of logical methods in computational linguistics and natural language processing, in natural language analysis, generation or acquisition.
Topics of either theoretical or applied interest include, but are not limited to:
* logical foundation of syntactic formalisms
  o categorial grammars
  o minimalist grammars
  o dependency grammars
  o tree adjoining grammars
  o model theoretic syntax
  o formal language theory for natural language processing
  o data-driven approaches
* logics for semantics of lexical items, sentences, discourse and dialog
  o discourse theories
  o Montague semantics
  o compositionality
  o dynamic logics
  o game semantics
  o situation semantics
  o generative lexicon
  o categorical semantics
* applications of these models to natural language processing
  o software for natural language analysis
  o software for acquiring linguistic resources
  o software for natural language generation
  o software for information extraction
  o inference tasks
  o evaluation
  o scalability

SUBMISSIONS and PROCEEDINGS
Articles should be written in the LaTeX format of LNCS by Springer (see authors' instructions at http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0) and should not exceed 16 pages (including figures, bibliography, possible appendices). It is expected that each accepted paper be presented at the meeting by one of its authors. Papers must be submitted electronically in PDF format at http://www.easychair.org/conferences/?conf=lacl2012

PROCEEDINGS
Accepted papers will be published in advance of the meeting as a volume of the FoLLI subline of Lecture Notes in Computer Science (LNCS) by Springer (http://www.springer.com/lncs).

INVITED SPEAKERS
To be announced.

PREVIOUS EDITIONS
A selection of the 1995 articles appeared in a special issue of the Journal of Logic, Language and Information (7:4, 1998). The proceedings of the international conferences LACL'96, LACL'97, LACL'98, LACL'2001, LACL'2005 and LACL'2011 appeared in the series Lecture Notes in Artificial Intelligence (volumes 1328, 1582, 2014, 2099, 3492, 6736) published by Springer.
IMPORTANT DATES
Paper submission deadline: February 5, 2012
Notification of acceptance: March 25, 2012
Camera ready copies due: April 8, 2012
Conference dates: July 2-4, 2012

CONTACTS
Alexandre.Dikovsky at univ-nantes.fr and Denis.Bechet at univ-nantes.fr

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:15:54 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:15:54 +0100
Subject: Appel: TALN'2012 (Pendant JEP-TALN-RECITAL), 4 au 8 juin 2012, Grenoble
Message-ID:
Date: Fri, 16 Dec 2011 16:27:25 +0100
From: Hervé Blanchon
Message-Id: <8B926340-50AC-4A58-B0F1-41F7B2014AC6 at imag.fr>
X-url: http://www.jeptaln2012.org/

Hello,

The TALN submission site is now open. The instructions for authors are at: http://www.jeptaln2012.org/?page_ide and the submission links are at http://www.jeptaln2012.org/?page_idi

Please note that this year the style files differ significantly from those of previous conferences, because they have been optimized for on-screen reading. We hope in this way to reduce the environmental impact of the use of our articles by avoiding printouts caused by an unsuitable format.
In addition, the following invited speakers have confirmed that they will attend JEP-TALN-RECITAL 2012 (titles to be announced):
Ian MADDIESON, University of California, Berkeley
Jacqueline LEON, CNRS, Paris
Yoshinori SAGISAKA, University of Waseda, Tokyo
Hans USZKOREIT, DFKI, Saarbrucken

We hope to receive a great many papers from you!
Happy holidays to all!

------------------------------------------------------------------------
Third CALL FOR PAPERS
TALN'2012
http://www.jeptaln2012.org/
19th conference on Natural Language Processing (Traitement Automatique des Langues Naturelles)
Grenoble, 4 to 8 June 2012

CALENDAR
- LONG papers:
  - Submission deadline: 24 January 2012
  - Notification to authors: 1 March 2012
  - Deadline for final versions: 15 April 2012
- SHORT papers and demonstrations:
  - Submission deadline: 15 March 2012
  - Notification to authors: 16 April 2012
  - Deadline for final versions: 20 April 2012

PRESENTATION
Organized by the GETALP team of the LIG (Laboratoire Informatique de Grenoble), the LIDILEM (Laboratoire de linguistique et didactique des langues étrangères et maternelles) and the DPC department of Gipsa-lab, the TALN'2012 conference will be held from 4 to 8 June 2012 in Grenoble as part of the joint conference JEP-TALN-RECITAL'2012. JEP-TALN-RECITAL'2012 brings together the 29th edition of the Journées d'Étude sur la Parole (JEP'2012), the 19th edition of the conference on Traitement Automatique des Langues Naturelles (TALN'2012) and the 14th edition of the Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL'2012).
For the fourth time, after Nancy in 2002, Fès in 2004 and Avignon in 2008, the AFCP (Association Francophone pour la Communication Parlée) and ATALA (Association pour le Traitement Automatique des Langues) are jointly organizing their main conferences in order to bring together in one place the two communities working on spoken and written language processing. The TALN'2012 conference will include oral and poster presentations, invited talks and workshops. The official language of the conference is French. Papers in English are accepted from non-French-speaking participants.

TYPES OF SUBMISSIONS
Two paper formats are planned: long papers (11 to 14 pages) and short papers (6 to 8 pages). Authors are invited to submit two types of contribution:
- papers presenting original research work,
- position papers presenting a point of view on the state of research in NLP (natural language processing, TAL in French), grounded in solid experience of the field.
Long papers will be presented as talks, short papers as posters.
THEMES
Papers may address any of the usual NLP topics, including, but not limited to:
- Analysis and generation in the following areas:
  - Phonetics
  - Phonology
  - Morphology
  - Syntax
  - Semantics
  - Discourse
- Development of linguistic resources for NLP:
  - Databases containing morphological, syntactic, semantic and/or phonological information
  - Grammars
  - Lexicons
  - Ontologies
  - Corpus linguistics
- NLP applications:
  - Sentiment or opinion analysis
  - Automatic categorization or classification
  - Word sense disambiguation
  - Natural-language human-machine dialogue
  - Computer-assisted teaching
  - Automatic indexing
  - Information retrieval and extraction
  - Automatic summarization
  - Anaphora resolution
  - Question-answering systems
  - Machine translation
  - Semantic Web
- Approaches:
  - Formal linguistics intended to support automatic processing
  - Symbolic
  - Logical
  - Statistical
  - Based on machine learning
- Position papers presenting a point of view on NLP

SELECTION CRITERIA
Submissions will be reviewed by at least two specialists in the field. For research papers, particular attention will be paid to:
- relevance to the conference themes,
- the significance and originality of the contribution,
- the correctness of the scientific and technical content,
- critical discussion of the results, in particular in relation to other work in the field,
- the positioning of the work within international research,
- the organization and clarity of the presentation.
For position papers, preference will be given to:
- breadth of view and consideration of the state of the art,
- the originality and impact of the point of view presented.
Selected papers will be published in the conference proceedings.
The program committee will select one of the accepted papers (TALN prize) to be recommended for publication, in an extended version, in the journal Traitement Automatique des Langues (Revue TAL).

SUBMISSION PROCEDURE
Papers are to be written in French by French speakers, and in English by those who are not proficient in French. A LaTeX style file and a Word template are available on the conference website (http://www.jeptaln2012.org/). Submission is online (see the instructions on the conference website).

Contact: herve.blanchon at imag.fr and Georges.Antoniadis at u-grenoble3.fr

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:22:41 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:22:41 +0100
Subject: Job: Stage M2, Info-linguiste, Scan research
Message-ID:
Date: Fri, 16 Dec 2011 17:54:54 +0100
From: Gilles Achache
Message-ID:
X-url: http://www.scan-research.net

SCAN-research
Internship offer for a computational linguist (Master 2 level)
Internship at an innovative company that studies and analyzes the conversations of Internet users

1. About the company and its sector
Scan-research is a young start-up founded in August 2010, specializing in observing and analyzing the opinions of Internet users.
We are currently hosted by Agoranov, an incubator for innovative companies (founded by UPMC, Paris IX-Dauphine, ENS and Paris tech); we have received support from Oséo and the City of Paris.
- Our clients are large companies in the luxury sector (LVMH…) and the energy sector (EDF, ERDF…), as well as public administrations.
- We have entered into a partnership with major press titles to track how Internet users' conversations evolve during the presidential elections.

We are looking for motivated, curious and autonomous students to assist us with our studies and with our monitoring and analysis assignments on Internet users' conversations, using our proprietary platform. The tasks entrusted to you will include (among others):
- Building the ontologies used to categorize the observed conversations
- Writing the linguistic descriptors for these ontologies according to the methodologies we have developed
- Interacting with the developers of our platform and with the designers of the NLP engine we work with

2. Profile
You are a computational linguist with:
- a good knowledge of the issues in Natural Language Processing
- a knowledge of the problems involved in building ontologies applied to social reality
- a good general education, particularly in sociology, would not hurt; an interest in questions of opinion formation and political semantics will be appreciated
- fluency in English is required
We are looking for candidates who are both rigorous and imaginative, and interested in helping to develop innovative methodologies for applying linguistic techniques to the social sciences and marketing.

3. How to apply
Send your application by email to gilles.achache at scan-research.net

4.
Internship details
Salary: negotiable
Location: Paris (boulevard Raspail)
Duration: 6 months / 12 months (the internship may be converted into a first job if successful)
Start date: 15 January 2012

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:25:09 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:25:09 +0100
Subject: Appel: EACL 2012 Workshop on Computational Models of Language Acquisition and Loss
Message-ID:
Date: Fri, 16 Dec 2011 19:14:32 +0100
From: Thierry Poibeau
Message-Id:
X-url: http://sites.google.com/site/eaclcogws/

==================================================
First Call for Papers
==================================================
EACL 2012 Workshop on Computational Models of Language Acquisition and Loss
http://sites.google.com/site/eaclcogws/
Deadline for Submissions: January 20, 2012
------------------------------------------------------------------------

The past decades have seen a massive expansion in the application of statistical and machine learning methods to speech and natural language processing. This work has yielded impressive results which have generally been viewed as engineering achievements. Recently researchers have begun to investigate the relevance of computational learning methods for research on human language acquisition and loss.
The human ability to acquire and process language has long attracted interest and generated much debate due to the apparent ease with which such a complex and dynamic system is learnt and used in the face of ambiguity, noise and uncertainty. On the other hand, changes in language abilities during aging and eventual losses related to conditions such as Alzheimer's disease and dementia have also attracted considerable investigative efforts. Parallels between acquisition and loss have been raised, and a better understanding of the mechanisms involved in both, and of how the algorithms used to access concepts are affected in pathological cases, can lead to earlier diagnosis and more targeted treatments.

The use of computational modeling is a relatively recent trend boosted by advances in machine learning techniques, and by the availability of resources like corpora of child and child-directed sentences, and data from psycholinguistic tasks by normal and pathological groups. Many of the existing computational models attempt to study language tasks under cognitively plausible criteria (such as memory and processing limitations that humans face), and to explain the developmental stages observed in the acquisition and evolution of the language abilities.

The workshop is targeted at anyone interested in the relevance of computational techniques for understanding first, second and bilingual language acquisition and change or loss in normal and pathological conditions.

Long and short papers are invited on, but not limited to, the following topics:
* Computational learning theory and analysis of language learning
* Computational models of first, second and bilingual language acquisition
* Computational models of language changes in e.g.
dementia and Alzheimer's disease
* Computational models and analysis of factors that influence language acquisition and loss in different age groups and cultures
* Computational models of various aspects of language and their interaction in acquisition and change
* Computational models of the evolution of language
* Data resources and tools for investigating computational models of human language processes
* Empirical and theoretical comparisons of the environment and its impact on acquisition/loss
* Cognitively oriented Bayesian models of language processes
* Computational methods for acquiring various linguistic information (related to e.g. speech, lexicon, syntax, and semantics) and their relevance to research on human language acquisition
* Investigations and comparisons of supervised, unsupervised and weakly-supervised methods for learning (e.g. machine learning, statistical, symbolic, biologically-inspired, active learning, various hybrid models)

SUBMISSIONS
We invite three different submission modalities:
* Regular long papers (8 content pages + 1 page for references): Long papers should report on solid and finished research including new experimental results, resources and/or techniques.
* Regular short papers (4 content pages + 1 page for references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion.
* System demonstration (2 pages): System demonstration papers should describe and document the demonstrated system or resources. We encourage the demonstration of both early research prototypes and mature systems, which will be presented in a separate demo session.

All submissions must be in PDF format and must follow the EACL 2012 formatting requirements (available at http://eacl2012.org/information-for-authors/index.html). We strongly advise the use of the provided Word or LaTeX template files.
For long and short papers, the reported research should be substantially original. The papers will be presented orally or as posters. The decision as to which papers will be presented orally and which as posters will be made by the program committee based on the nature rather than on the quality of the work.

Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well. Papers that do not conform to these requirements will be rejected without review. Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters.

Submission and reviewing will be electronic, managed by the START system: https://www.softconf.com/eacl2012/Cognitive2012/ Submissions must be uploaded onto the START system by the submission deadline: January 20, 2012 (11:59pm Samoa Time; UTC/GMT -11 hours). Please choose the appropriate submission type from the starting submission page, according to the category of your paper.

IMPORTANT DATES
Jan 20, 2012 Paper submission deadline
Feb 20, 2012 Notification of acceptance
Mar 09, 2012 Camera-ready deadline
Apr 23 or 24, 2012 Workshop

PROGRAM COMMITTEE
Afra Alishahi, Tilburg University (Netherlands)
Colin J Bannard, University of Texas at Austin (USA)
Marco Baroni, University of Trento (Italy)
Jim Blevins, University of Cambridge (UK)
Rens Bod, University of Amsterdam (Netherlands)
Antal van den Bosch, Tilburg University (Netherlands)
Alexander Clark, Royal Holloway, University of London (UK)
Robin Clark, University of Pennsylvania (USA)
Matthew W.
Crocker, Saarland University (Germany)
James Cussens, University of York (UK)
Walter Daelemans, University of Antwerp (Belgium) and Tilburg University (Netherlands)
Barry Devereux, University of Cambridge (UK)
Sonja Eisenbeiss, University of Essex (UK)
Afsaneh Fazly, University of Toronto (Canada)
Cynthia Fisher, University of Illinois (USA)
Jeroen Geertzen, University of Cambridge (UK)
Henriette Hendriks, University of Cambridge (UK)
Marco Idiart, Federal University of Rio Grande do Sul (Brazil)
Aravind Joshi, University of Pennsylvania (USA)
Shalom Lappin, King's College London (UK)
Alessandro Lenci, University of Pisa (Italy)
Igor Malioutov, Massachusetts Institute of Technology (USA)
Marie-Catherine de Marneffe, Stanford University (USA)
Fanny Meunier, Lumière Lyon 2 University (France)
Brian Murphy, Carnegie Mellon University (USA)
Maria Alice Parente, Federal University of Rio Grande do Sul (Brazil)
Massimo Poesio, University of Essex (UK)
Brechtje Post, University of Cambridge (UK)
Ari Rappoport, The Hebrew University of Jerusalem (Israel)
Dan Roth, University of Illinois at Urbana-Champaign (USA)
Kenji Sagae, University of Southern California (USA)
Sabine Schulte im Walde, University of Stuttgart (Germany)
Ekaterina Shutova, University of Cambridge (UK)
Maity Siqueira, Federal University of Rio Grande do Sul (Brazil)
Mark Steedman, University of Edinburgh (UK)
Shuly Wintner, University of Haifa (Israel)
Charles Yang, University of Pennsylvania (USA)
Beracah Yankama, Massachusetts Institute of Technology (USA)
Menno van Zaanen, Macquarie University (Australia)
Michael Zock, LIF, CNRS, Marseille (France)

WORKSHOP ORGANIZERS AND CONTACT
Robert Berwick, Massachusetts Institute of Technology (USA)
Anna Korhonen, University of Cambridge (UK)
Thierry Poibeau, LaTTiCe-CNRS (France) and University of Cambridge (UK)
Aline Villavicencio, Federal University of Rio Grande do Sul (Brazil) and Massachusetts Institute of Technology (USA)

For any inquiries regarding the
workshop please send an email to cognitivemodels2012 at gmail.com

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:27:44 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:27:44 +0100
Subject: Appel: MUE 2012
Message-ID:
Date: Sat, 17 Dec 2011 19:25:04 +0100
From: MUE2012
Message-ID: <20111217182504.GA16915 at piojito.arcos.inf.uc3m.es>
X-url: http://www.arcos.inf.uc3m.es/mue2012

Dear Sir or Madam,
(We apologize if you receive multiple copies of this message)

==================================
The 6th FTRA International Conference on Multimedia and Ubiquitous Engineering (MUE 2012)
http://www.arcos.inf.uc3m.es/mue2012
Madrid, Spain, 10-13 July 2012

** Important Dates **
==================================
Papers due: January 15, 2012
Notification: March 15, 2012
Camera ready: April 15, 2012
Conference dates: July 10-13, 2012

Context
=================================
The new multimedia standards (for example, MPEG-21) facilitate the seamless integration of multiple modalities into interoperable multimedia frameworks, transforming the way people work and interact with multimedia data. These key technologies and multimedia solutions interact and collaborate with each other in increasingly effective ways, contributing to the multimedia revolution and having a significant impact across a wide spectrum of consumer, business, healthcare, education, and governmental domains.
Moreover, the emerging mobile computing and ubiquitous networking technologies enable users to access fully broadband mobile applications and new services anytime and everywhere. Continuous efforts have been dedicated to research and development in this wide area, including wireless mobile networks, ad-hoc and sensor networks, smart user devices and advanced sensor devices, mobile and ubiquitous computing platforms, and new applications and services including location-based, context-aware, or social networking services.

This conference provides an opportunity for academic and industry professionals to discuss recent progress in the area of multimedia and ubiquitous environments, including models and systems, new directions, and novel applications associated with the utilization and acceptance of ubiquitous computing devices and systems.

MUE 2012 is the next event in the highly successful series of International Conferences on Multimedia and Ubiquitous Engineering: MUE-11 (Crete, Greece, June 2011), MUE-10 (Cebu, Philippines, August 2010), MUE-09 (Qingdao, China, June 2009), MUE-08 (Busan, Korea, April 2008), and MUE-07 (Seoul, Korea, April 2007).

Topics of interest
=================================
* Ubiquitous Computing and Technology
* Context-Aware Ubiquitous Computing
* Parallel/Distributed/Grid Computing
* Novel Machine Architectures
* Semantic Web and Knowledge Grid
* Smart Home and Generic Interfaces
* AI and Soft Computing in Multimedia
* Computer Graphics and Simulation
* Multimedia Information Retrieval (images, videos, hypertexts, etc.)
* Internet Multimedia Mining
* Medical Image and Signal Processing
* Multimedia Indexing and Compression
* Virtual Reality and Game Technology
* Current Challenges in Multimedia
* Protocols for Ubiquitous Services
* Ubiquitous Database Methodologies
* Ubiquitous Application Interfaces
* IPv6 Foundations and Applications
* Smart Home Network Middleware
* Ubiquitous Sensor Networks / RFID
* U-Commerce and Other Applications
* Databases and Data Mining
* Multimedia RDBMS Platforms
* Multimedia in Telemedicine
* Multimedia Embedded Systems
* Multimedia Network Transmission/Streaming
* Entertainment Industry
* E-Commerce and E-Learning
* Novel Multimedia Applications
* Computer Graphics
* Security in Commerce and Industry
* Security in Ubiquitous Databases
* Key Management and Authentication
* Privacy in Ubiquitous Environment
* Sensor Networks and RFID Security
* Multimedia Information Security
* Forensics and Image Watermarking
* Cyber Security
* Intrusion Detection
* Biometric Security
* New developments in handheld and mobile information appliances
* New paradigms: mobile cloud, personal networks, social and crowd computing, etc.
* Operating systems aspects for personal mobile devices
* New technological advances for personal mobile devices
* End-user interface issues in the design and use of personal technologies
* Enabling technologies for personal multimedia and ubiquitous computing
* Multimedia applications and techniques for personal computing devices
* Usage of personal devices for on-line learning

Submissions
=================================
Submissions should not exceed 8 pages in IEEE CS proceedings paper format, including tables and figures. All paper submissions must represent original and unpublished work. Submission of a paper should be regarded as an undertaking that, should the paper be accepted, at least one of the authors will register for the conference and present the work.
Submissions will be handled electronically through the conference website.

Journal publications
==================================
Distinguished selected papers accepted and presented at MUE 2012 will, after further revisions, be published in issues of several indexed international journals.

Best paper awards
=================================
The Program Committee of MUE 2012 will give two best paper awards: one NVIDIA Best Student Paper Award and one NVIDIA Best Paper Award. Best Paper Awards will be given to the author(s) of a full paper presented at the conference, selected by the Organizing Committee. The Best Paper Award is a Tesla C2075 computing processor, sponsored by NVIDIA.

Steering Chair
=================================
James J. (Jong Hyuk) Park (SeoulTech, Korea)

General Chairs
=================================
Jesus Carretero (University Carlos III of Madrid, Spain)
Sangsoo Yeo (Mokwon University, Korea)

General Vice-Chair
=================================
Seungmin Rho (Korea University, Korea)

Program Chairs
=================================
Jianping Fan (University of North Carolina, USA)
J.
Daniel Garcia (University Carlos III of Madrid, Spain)

Program Vice-Chairs
=================================
Claudia Cannali (University of Modena and Reggio Emilia, Italy)
Benoit Huet (Eurecom, France)
Jiankun Hu (University of New South Wales at the Australian Defence Force Academy, Australia)
Eleni Mangina (University College Dublin, Ireland)
Feng Wu (Microsoft Research Asia, China)
Zhiwen Yu (Northwestern Polytechnical University, China)

Workshop Chairs
=================================
Juan Manuel Dodero (University of Cadiz, Spain)
Abelardo Pardo (University Carlos III of Madrid, Spain)

International Program Committee
=================================
**Please visit the conference web site (http://www.arcos.inf.uc3m.es/mue2012)

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:15:44 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:15:44 +0100
Subject: Job: cherche charge-e-s de cours, Universite Paris III (urgent)
Message-ID:

Date: Mon, 19 Dec 2011 09:21:10 +0100
From: Alice Krieg-Planque
Message-ID: <4EEEF3F6.7050602 at club-internet.fr>

The Department of French and Latin Literature and Linguistics at Université Paris III is looking for a part-time lecturer (chargé-e de cours) in linguistics for the second semester (which begins on 30 January 2012). The course is an introduction to linguistic analysis for first-year students in Lettres Modernes.
The course covers syntax and lexical semantics. The schedule of the tutorial (TD) in question is already set: Tuesdays, 10:00-13:00.

Contact: yana.grinshpun at numericable.fr and gillesphilippe at online.fr

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:19:10 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:19:10 +0100
Subject: Appel: TOTh 2012
Message-ID:

Date: Mon, 19 Dec 2011 11:23:15 +0100
From: Luc Damas
Message-ID:
X-url: http://www.porphyre.org/toth/

TOTh 2012: CALL FOR PAPERS
Terminologie & Ontologie : Théories et applications (Terminology & Ontology: Theories and Applications)

Conference: 7 and 8 June 2012
Training: 5 and 6 June 2012

http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2012/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2012-appel-a-communications.pdf

Important dates:
-------------------------
30 January 2012: Submission deadline
5 March 2012: Notification of acceptance
9 April 2012: Final version

Submission via EasyChair:
-----------------------------------
Papers (full papers) are submitted exclusively through EasyChair: http://www.easychair.org/conferences/?conf=toth2012

Overview:
------------------
By bringing terminology and ontology together, the TOTh conferences emphasize what these two disciplines contribute to each other and open new prospects for development, both theoretical and practical.
Terminology is concerned with the names of concepts and objects in a specific domain, but also with units such as collocations, lexical combinations, and phraseology in use, which are part of specialized vocabularies, discourses, and languages. From a methodological standpoint, terminologists develop practices aimed at designing and creating terminological resources useful for any language-related purpose: translation, language planning, lexicography, teaching languages for specific purposes, etc. Because the quality and rigor of specialized discourse depend directly on the relationship between lexical organization and knowledge organization, joint reflection among terminologists, translators, lexicographers, knowledge engineers, and computer scientists is called for.

Terminology and ontology share similar goals in understanding and describing reality. Ontologies from knowledge engineering are today one of the most promising avenues for modeling the conceptual system of terminologies. At the heart of many applications in the digital world, ontologies make terminologies operational and usable for various purposes: translation support, information management and processing, management of knowledge and multilingual content, the semantic web, etc. Transdisciplinary by nature, terminology and ontology draw on practices and theories from many disciplines, chiefly linguistics, specialized languages, epistemology, knowledge engineering, and information science.
Audience & Disciplines:
----------------------------
The TOTh conferences aim to bring together researchers, teachers, trainers, users, and industry practitioners whose concerns involve both terminology and ontology and, more generally, language and knowledge engineering. They are intended as a forum for exchange and sharing, where problems, solutions, and lessons learned are presented at both the theoretical and applied levels, along with new trends and perspectives in the associated disciplines: terminology, specialized languages, linguistics, artificial intelligence, knowledge engineering, information systems, collaborative engineering, information science, documentation, etc.

"Young Researcher" Prize:
--------------------------------
Since 2011, a "Young Researcher" prize has been awarded by the program committee at the conference.

Conference Topics (non-exhaustive list):
--------------------------------------------------------------
Linguistic approaches:
- specialized languages, translation, language and terminology planning, multilingual specialized communication,
- usage, variation, terminological collocations, lexical combinations,
- terminological products, lexicons, dictionaries, thesauri,
- corpus processing: term extraction, knowledge extraction, corpus building and alignment,
- building and maintaining terminological resources, harmonization of terms, harmonization of concepts, standardization, theories of meaning, the importance and limits of corpora,
- validation of data quality, the role of experts.
Knowledge modeling:
- terminological principles for building notional and conceptual systems,
- knowledge representation and management: acquisition, concept modeling,
- ontologies: construction, evaluation (quality), maintenance, alignment, merging,
- monolingual and multilingual thesauri, document management,
- the contribution of artificial intelligence, linguistics, epistemology, and formal systems (description logics, for example).

Applications:
- software environments: building and maintaining terminologies, thesauri, dictionaries, ontologies,
- software applications based on knowledge and terminological resources: translation support, management of monolingual and multilingual content, semantic web, information retrieval, classification, knowledge management, information systems, etc.
- evaluation of acquisition and manipulation tools, the evolution of norms and standards in the field of linguistic and ontological resources, terminometrics.
http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2012/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2012-appel-a-communications.pdf

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:20:41 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:20:41 +0100
Subject: Job: Offres de stage, LIMSI-CNRS, Orsay
Message-ID:

Date: Mon, 19 Dec 2011 11:44:07 +0100
From: Xavier Tannier
Message-ID: <4EEF1577.1070708 at limsi.fr>
X-url: http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M2_detection_de_reactions.html
X-url: http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M1_phrase_saillante.html
X-url: http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_L_interface_graphique_pour_chronolines.html

Hello,

As part of the Chronolines project, the ILES group at LIMSI is offering the following three internships:

* *Master 2*: Detecting reactions and promises in news articles
  http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M2_detection_de_reactions.html
* *Master 1*: Automatic selection of a passage representative of an event
  http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M1_phrase_saillante.html
* *DUT, first-cycle engineering, L3 computer science*: Building a graphical interface for a tool for semi-automatic composition of chronologies
  http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_L_interface_graphique_pour_chronolines.html

Best regards,
Xavier Tannier.
-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:24:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:24:49 +0100
Subject: Job: Stage conventionne en Text Mining, EDF, La Defense
Message-ID:

Date: Mon, 19 Dec 2011 12:37:13 +0100
From: Anne-Laure GUENET
Message-ID:

Commerce - Information Systems Directorate
Customer Knowledge Analysis Domain
Customer Analysis Department

Operational internship offer in Text Mining

Since 1 July 2007, the electricity market has been fully open to competition, allowing consumers to choose their energy supplier freely. In this context, it has become strategic for EDF to understand its customers' needs, and also to explain and predict their behavior.

Part of the mission of the Customer Knowledge Analysis Domain, within the Information Systems Department of the Commerce Branch, is to analyze EDF's various information systems, in particular the textual data held in its databases. We currently use text mining techniques to automatically analyze comments, in particular customer complaints, coming from our various information systems.
The internship we are offering is operational and will cover several areas:

- Creating knowledge extraction cartridges (Skill Cartridge technology developed by TEMIS) to both improve and identify business concepts, and also to help improve our classification models.
- A study comparing the text mining results obtained on "complaint" comments with the information entered manually by the customer advisor.
- Setting up a lighter industrialization process that analyzes a representative sample of our customers rather than all comment fields.

The internship will last 6 months and will take place in the EDF tower at La Défense during 2012. The intern will be paid.

We are looking for a student at BAC+5 level specializing in Natural Language Processing. Good linguistic and computing skills are essential. Knowledge of statistics or of the tools developed by TEMIS would be a welcome plus.

Applications (CV + cover letter) should be sent to anne-laure.guenet at edf.fr.
-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:30:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:30:49 +0100
Subject: Appel: Colloque international de linguistique francaise, Le Caire
Message-ID:

Date: Mon, 19 Dec 2011 13:12:59 +0000 (GMT)
From: EL KHAMISSY RIHAM
Message-ID: <1324300379.1649.YahooMailClassic at web29506.mail.ird.yahoo.com>

DEADLINE EXTENDED TO 23 DECEMBER 2011

Please circulate this call

Linguistics conference of the Faculty of Languages (AL ALSUN)
Ain Shams University (Cairo, Egypt)

Femme, je n'écris pas ton nom. Elle au carrefour des sciences du langage
(Woman, I do not write your name. Elle at the crossroads of the language sciences)

The French Department of the Faculty of Languages (AL Alsun) is organizing an international conference on French linguistics, to be held at Ain Shams University (Cairo, Egypt), 29-30 April 2012.

Deadline for receipt of proposal abstracts (200 to 400 words) + keywords: 19 December 2011

Elle as personal pronoun, elle as non-person, elle as the feminine of il, Elle as symbol of woman. Under the name of gender, Elle permeates the body of language with its presence.
The treatment of the category of sex in language affects the third person in grammar while forcing every speaker, if the speaker is a woman, to proclaim her physical (sociological) sex, that is, to appear in language represented in a concrete form by the various marks of the feminine, and not in the abstract form that generalization requires, the one that every male speaker has the unquestionable right to use. The presence of elle, by manifesting gender in its subjective form, manages to realize it through an excess of feminine presence (and of feminization) covering a wide territory of the linguistic domain. Elle thus invites an examination of problems tied to the French grammatical tradition, a syntactic analysis, and a study of the role of these constructions in the dynamics of the text and in the structuring of the field of reference and the communicational field.

Moreover, Elle is not limited to expressing the third person in the feminine. In linguistics, Elle is not only the feminine, femininity, and other essentializing fetishes. To say "elle" of a woman and refrain from mentioning her first name, when one knows it, is not devoid of meaning.

Furthermore, Elle can carry the heritage of philosophical discourse. Taking up in turn the "nominalist postulate" described by Michel de Certeau with regard to Montaigne, elle, within the apparatus of writing, signifies "the thing" (as opposed to the name), the one "who is the stranger, [the one who] is never where the word says she is."

Finally, N. Sarraute writes: "against her, nothing can be done" (Elle est là), signaling the obstinacy and the strength, so often underestimated, of ELLE.

Whatever the linguistic approach, elle has a wide variety of uses and remains a territory for exploration rich in insights.
Contributions should relate to the following areas of linguistics:
· Grammar
· Text grammar
· Enunciation
· Pragmatics
· Argumentation
· Syntax
· Semantics
· Stylistics
· Contrastive and comparative approaches (permitted language pairs: French-Arabic, French-English).

NB. Any interdisciplinary approach, or one combining more than one discipline, is welcome, but this must be stated when you submit your abstract.

Conference language: French.
Language of contributions: French (NB. for contrastive approaches, examples may be written in English or Arabic).

Organizing committee:
Dr. Racha EL KHAMISSY
Dr. Riham EL KHAMISSY
Dr. Yomna SAFWAT

Scientific committee:
Prof. Dr. Mona ABDEL AZIZ
Prof. Dr. Lorine Zikri
Prof. Dr. Elweya AL HAKIM
Prof. Dr. Camélia SOBHI
Prof. Dr. Dalia EL TOUKHY

Schedule:
· Deadline for receipt of abstracts (200 to 400 words) + 4 keywords: 19 December 2011. Extended to 23 December 2011
· Deadline for notification of acceptance or rejection of proposals: 1 January 2012
· Deadline for receipt of the full text of the paper: 19 March 2012
· Guidelines for the final version: early April 2012
· Conference dates: 29-30 April 2012

How to apply:
Proposals must be anonymous, meaning that at no point should reviewers be able to identify the author(s).
Abstracts: Times New Roman 12 (200 to 400 words) + 5 keywords + indicative bibliography.
Proposals must be sent to the following email address: conf_alsun2012 at yahoo.com

In the body of the email, please specify:
1. Your personal information
Last name and first name
Nationality
Status (researcher, teacher, teacher-researcher)
Position and affiliation
Title of your contribution
Brief bio-bibliographical note on the author (5 lines max.)
NB.
In the case of co-authorship: for convenience, a single author will be designated as the "corresponding" author. The other author(s) will only be "presenter(s)". The "corresponding" author is the one who will receive the various emails relating to the proposal (evaluation report). Only that author's email address is therefore required; that of any co-author is optional.
2. The topic area under which you wish to submit. The anonymous abstract and the title of your contribution are to be attached to the email as a Word (.doc) file.
3. The type of your contribution:
· Presentation + publication
· Publication only

For more information, see the conference page on the website of the Faculty of Languages (AL ALSUN).

After acceptance
Registration fees by type of participation (Present + Publish / Publish only / Attend only):
For Egyptian teacher-researchers: 550 LE / 350 LE / Free
For foreign teacher-researchers: 330 euros / Free
For Egyptian students: Free

The full article is 15 pages maximum. However, at the conference you will present only a paper of 2,500 characters (not counting spaces or references). A fuller text will appear in the conference proceedings.

Formatting your full article
1. Your article must be sent in .doc format
2. Margins of 3 cm on each side
3. Main title of your contribution in Times Roman 16
4. Subtitles in Times Roman 14
5. Body text: Times Roman 12
6. Paragraph spacing: Auto
7. Single line spacing
8. Paragraphs are justified
9. Titles and subtitles are centered
10. Short quotations are placed in quotation marks within the body text; long quotations (more than three lines) may be set off by a line break and a 1 cm indent. They are written without quotation marks, in Times New Roman 11.
11.
References
References appear in the body of the text (author: date system).
For the bibliography: Times or Times New Roman, font size 10, justified. The first line of each reference (that is, the first line of each paragraph) must have a negative indent of 0.5 cm.
The reference format for articles and books is given below.
Author's last name, First name. (Year of publication). Title of the article. Name of the journal, Issue number, page numbers.
Author's last name, First name. (Year of publication). Title of the book. City: Publisher.
References to websites must state the date of access and, where applicable, the date of publication.
Footnotes, in Times New Roman 10, are to be used only for remarks or clarifications. They should be kept to a minimum.

Please follow the formatting guidelines above.

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:33:38 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:33:38 +0100
Subject: Info: Elections au CoNRS, listes electorales
Message-ID:

Date: Mon, 19 Dec 2011 14:19:25 +0100
From: Anne Vilnat
Message-ID: <4EEF39DD.20609 at limsi.fr>
X-url: http://www.dgdr.cnrs.fr/elections/scn/actualites/default.htm
X-url: http://www.dgdr.cnrs.fr/elections/scn/textes/Liste%20cnSL%5B1%5D.pdf
X-url: http://www.dgdr.cnrs.fr/elections/scn/dispositif/electorat.htm
X-url:
http://liste-electorale.dsi.cnrs.fr/inscriptions/index.php?election=section

Dear colleagues,

The elections to the Comité national de la recherche scientifique (CoNRS) for the 2012-2016 term will take place in 2012. They serve to elect the future members of the committees of the 41 (new) sections of the Comité national.

Ahead of these elections, this message gives information on how to check your registration or to register on the electoral rolls.

*Reminder*: The CoNRS sections perform several key functions within the workings of the CNRS. In particular, they take part in:
- the recruitment process for CR and DR positions
- the evaluation of researchers
- the process for changes of section, grade, etc.
They are consulted during the creation, renewal, or closure of research units, and on any question falling within their field.
(for more details, see: http://www.dgdr.cnrs.fr/elections/scn/actualites/default.htm)

It is essential that researchers in the field of NLP (TAL) be represented on the committee to which they belong: the "new" section 7 for "l'informatique linguistique" and section 34 for "la linguistique informatique" (for more information, see: http://www.dgdr.cnrs.fr/elections/scn/textes/Liste%20cnSL%5B1%5D.pdf).

This is why ATALA wishes to stress the importance of these elections and to encourage all potential voters to take part.

For information, CNRS staff meeting one of the following criteria are eligible to vote:
- civil-servant staff as of 31 January 2012, or personnel governed by decrees no. 80-31 and no. 59-1405
- contract staff under public or private law as of 31 January 2012, under contract for at least one year at the date of the vote (postdocs among others).

For civil-servant staff, registration in a college and a section is automatic.
You can check your status at the following address: http://www.dgdr.cnrs.fr/elections/scn/dispositif/electorat.htm

For contract staff: you must register in a college and a section before 16 January 2012 at the following address: http://liste-electorale.dsi.cnrs.fr/inscriptions/index.php?election=section

To complete this registration, note that:
- you will need to attach, in the field provided, a document including your CV and a list of your most recent publications, to make it easier to place you in the appropriate section and college
- to be formally validated, your request must be sent back to the CNRS by email (elections at cnrs-dir.fr), by fax (0144965391), or by post to CNRS - Organisation des élections - 3 rue Michel Ange - 75016 Paris

In addition, you will need to indicate a college and the section requested. A description of the three colleges A, B, and C is available on the site. As a researcher in the field of NLP (TAL), the section you most likely belong to is section 7 or section 34 (http://www.dgdr.cnrs.fr/elections/scn/textes/Liste%20cnSL%5B1%5D.pdf).
Further information is available on the following site: http://www.dgdr.cnrs.fr/elections/scn/actualites/default.htm

Anne Vilnat
ILES group, LIMSI
Professor, IUT Orsay
BP 133 91403 Orsay Cedex
Tel: 01 69 85 80 02
email: Anne.Vilnat(au)limsi.fr
fax: 01 69 85 80 88
http://www.limsi.fr/Individu/anne

-------------------------------------------------------------------------

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:38:50 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:38:50 +0100
Subject: Appel: Multimodal Corpora: How should multimodal corpora, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 15:33:52 +0100
From: ELRA ELDA Information
Message-ID: <4EEF4B50.10901 at elda.org>
X-url: http://www.multimodal-corpora.org/

[Apologies for cross-postings]

*LREC 2012 Workshop
Multimodal Corpora: How should multimodal corpora deal with the situation?*

1st Call for Papers

22 May 2012, Istanbul, Turkey
http://www.multimodal-corpora.org/

Currently, the creation of a multimodal corpus involves the recording, annotation and analysis of a selection of many possible communication modalities such as speech, hand gesture, facial expression, and body posture. Simultaneously, an increasing number of research areas are moving from focused single-modality research to full-fledged multimodality research. Multimodal corpora are becoming a core research asset and they provide an opportunity for interdisciplinary exchange of ideas, concepts and data.
The increasing interest in multimodal communication and multimodal corpora is evidenced by European Networks of Excellence and integrated projects such as HUMAINE, SIMILAR, CHIL, AMI, CALLAS and SSPNet. The success of recent conferences and workshops dedicated to multimodal communication (ICMI-MLMI, IVA, Gesture, PIT, Nordic Symposium on Multimodal Communication, Embodied Language Processing) and the creation of the Journal of Multimodal User Interfaces also testify to the growing interest in this area and to the general need for data on multimodal behaviours.

In 2012, the 8th Workshop on Multimodal Corpora will again be co-located with LREC. This year, LREC has selected Speech and Multimodal Resources as its special topic. This points to the significance of the workshop's general scope, and the fact that the main conference special topic largely covers the broad scope of the workshop provides us with a unique opportunity to step outside the boundaries and look further into the future.

The workshop follows similar events held at LREC 00, 02, 04, 06, 08, 10, and ICMI 11. All workshops are documented under www.multimodal-corpora.org and complemented by a special issue of the Journal of Language Resources and Evaluation which came out in 2008 and a state-of-the-art book published by Springer in 2009.

*Aims*

As always, we aim for a wide cross-section of the field, with contributions ranging from collection efforts, coding, validation and analysis methods, to tools and applications of multimodal corpora. This year, however, we also want to look ahead and emphasize the fact that a growing segment of research takes a view of spoken language as situated action, where linguistic and non-linguistic actions are intertwined with the dynamic conditions given by the situation and the place in which the actions occur.
In spite of this, most corpora capture little more than the linguistic and meta-linguistic actions per se, and contain little or no information about the situation in which they take place. For this reason, we encourage contributions that raise the question of what the additions to future multimodal corpora will be -- with possibilities ranging from simple dynamic information such as background noise, room temperature, light conditions and room dimensions to more complex models of room contents, external events, scents, or cognitive load modelling including physiological data such as breathing or pulse. We hope that with your help, the workshop will serve to examine the way language is conceived in corpus creation and to spark a discussion of its boundaries and how these should be accounted for in annotations and in interpretation. *Time schedule* The workshop will consist of a morning session and an afternoon session. There will be time for collective discussions. *Topics* The LREC'2012 workshop on multimodal corpora will feature a special session on the collection, annotation and analysis of corpora of situated interaction. Other topics to be addressed include, but are not limited to: * Multimodal corpus collection activities (e.g. direction-giving dialogues, emotional behaviour, human-avatar interaction, human-robot interaction, etc.) and descriptions of existing multimodal resources * Relations between modalities in natural (human) interaction and in human-computer interaction * Multimodal interaction in specific scenarios, e.g. 
group interaction in meetings * Coding schemes for the annotation of multimodal corpora * Evaluation and validation of multimodal annotations * Methods, tools, and best practices for the acquisition, creation, management, access, distribution, and use of multimedia and multimodal corpora * Interoperability between multimodal annotation tools (exchange formats, conversion tools, standardization) * Collaborative coding * Metadata descriptions of multimodal corpora * Automatic annotation, based e.g. on motion capture or image processing, and its integration with manual annotations * Corpus-based design of multimodal and multimedia systems, in particular systems that involve human-like modalities either in input (Virtual Reality, motion capture, etc.) or in output (virtual characters) * Automated multimodal fusion and/or generation (e.g., coordinated speech, gaze, gesture, facial expressions) * Machine learning applied to multimodal data * Multimodal dialogue modelling *Important dates* * Deadline for paper submission (complete paper): 12 February 2012 * Notification of acceptance: 10 March * Final version of accepted paper: 26 March * Final program and proceedings: 20 April * Workshop: 22 May *Submissions* The workshop will consist primarily of paper presentations and discussion/working sessions. Submissions should be 4 pages long, must be in English, and follow the submission guidelines at http://www.lrec-conf.org/lrec2012/ Submissions should be made at: https://www.softconf.com/lrec2012/MMCorpora2012/ Demonstrations of multimodal corpora and related tools are encouraged as well (a demonstration outline of 2 pages can be submitted). *LREC Map of Language Resources, Technologies and Evaluation* When submitting a paper, from the START page authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.)
that either have been used for the work described in the paper or are a new result of their research (contribution to building the LREC2012 Map). *Organizing committee* Jens Edlund, KTH Royal Institute of Technology, Sweden Dirk Heylen, University of Twente, The Netherlands Patrizia Paggio, University of Copenhagen, Denmark/University of Malta, Malta ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:37:00 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:37:00 +0100 Subject: Appel: ColabTKR 2012 - Terminology and Knowledge Representation, LREC 2012 workshop Message-ID: Date: Mon, 19 Dec 2011 15:16:48 +0100 From: ELRA ELDA Information Message-ID: <4EEF4750.9020004 at elda.org> [Apologies for cross-postings] *ColabTKR 2012 - Terminology and Knowledge Representation* Linguistics and ontology studies have a long record of fruitful cooperation. Cross-research in areas such as computational linguistics, natural language processing, information retrieval and ontology development, maintenance and integration has produced a wealth of multidisciplinary theories, methods, models and tools (Roche, 2008) (Staab, 2008) (Pereira et al. 2009) (Costa, 2006). More specifically, the relationship between the lexicon (lexical approaches and resources) and ontology development methods and tools has recently been well explored in research (Huang et al., 2010).
By contrast, the relationship between terminology and ontology studies, in particular as concerns the initial phases of ontology development, has not received as much attention from the scientific communities involved. On the other hand, in diverse professional areas, new challenges are appearing related to information and knowledge management in highly specialised technical domains, under tightly constrained time requirements, unfolding in collaborative networking contexts. Short-term collaborative networking between individuals, groups and organisations is recognised by researchers and practitioners as a possible solution for coping with an increasingly complex social and economic business environment. Moreover, the current demand for continuous innovation leads to a higher heterogeneity in the technical and scientific domains simultaneously involved in collaborative projects and activities (e.g. involving SMEs and research centres) (Camarinha-Matos, 2006). Managing information and knowledge in this context poses new and interesting challenges for terminology and knowledge representation, particularly when these challenges are seen from an integrated terminology/knowledge representation perspective. Terminological or ontological approaches alone are not likely to be enough to meet the needs for precision and detail of the specialised technical domains, just as the research efforts of articulated terminology/ontology approaches are likely to be inadequate in terms of the required resources (time and persons). Thus, these challenges call for more than the setup and configuration of common terminological or ontological resources, particularly when considering the usually accepted time-frames for developing semantic and terminological artifacts. Effective ways to collaboratively construct shared conceptualisations by means of negotiation and representational artifacts, such as semi-formal ontologies, are then required.
*Topics of interest* This workshop intends to join, under a multi-disciplinary tent, specialists in terminology, information/knowledge management, ontology development, and collaboration processes, to debate the interplay between terminology and knowledge representation methods and techniques in contexts of collaborative work. Papers are invited on, but not limited to, the following topics: . The interplay between terminology and conceptualization processes . Interfaces between terminology work and ontology development/maintenance . Collaborative processes in terminology work . Collaborative conceptualization processes and representations of knowledge . Conceptualization processes and semi-formal ontologies . Cognitive semantics and semi-formal ontologies . Knowledge organisation systems and collaboration . Modelling networks of actors and semantic networks (socio-semantic networks) . Theory, methods and tools for conceptual negotiation . Using multimodal corpora for semi-formal ontology development . Design and management of semi-formal ontology libraries . Using terminological resources for semi-formal ontology development . Term extraction and validation in domain-dependent, time-constrained applications . Terminological approaches to support the identification of conceptual relations . The role of conceptual relations in the development of semi-formal ontologies * Type of submission* Extended abstract (1500-2000 words) Extended abstracts must be submitted in https://www.softconf.com/lrec2012/TermKnowledge2012/ *LRE map* When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012. 
*Important dates* . Submission deadline: 1 February 2012 . Acceptance notification: 1 March 2012 . Camera ready full paper: 30 March 2012 . Workshop date: 22 May 2012, afternoon session *Organizing Committee* António Lucas Soares (als at fe.up.pt) - University of Porto and INESC Porto, Portugal Rute Costa (rute.costa at fcsh.unl.pt) - New University of Lisbon, Portugal Carla Pereira (cpereira at estgf.ipp.pt) - IPP/ESTGF and INESC Porto, Portugal Alessandro Oltramari (oltramale at gmail.com) - Carnegie-Mellon University, USA Christophe Roche (christophe.roche at univ-savoie.fr) - University of Savoie, France Anita Nuopponen (atn at UWasa.fi) - University of Vaasa, Finland *Programme Committee* Gerhard Budin - University of Vienna Chiara Ghidini - Bruno Kessler Foundation (FBK) - Trento, Italy Guadalupe Aguado de Cea - Universidad Politécnica de Madrid Hanne Erdman Thomsen - Copenhagen Business School Mustafa Jarrar - University of Birzeit, Palestine António Lucas Soares - University of Porto and INESC Porto, Portugal Rute Costa - Universidade Nova de Lisboa, Portugal Carla Sofia Pereira - Polytechnic Institute of Porto and INESC Porto, Portugal Alessandro Oltramari - Carnegie-Mellon University, USA Christophe Roche - University of Savoie, France Anita Nuopponen - University of Vaasa, Finland Piek Vossen - VU University Amsterdam, Netherlands *References* Chu-Ren Huang, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lenci, Alessandro Oltramari, and Laurent Prévot (eds.). 2010. Ontology and the Lexicon: A Natural Language Processing Perspective. Cambridge University Press. Roche, C. 2008. Terminologie & Ontologie: Théories et Applications. Actes de la deuxième conférence TOTh -- Annecy, 5 et 6 juin. Staab, S. 2008. On understanding the collaborative construction of conceptualisations. International and Interdisciplinary Conference "Processing Text -Technological Resources" at the Center for Interdisciplinary Research, Bielefeld University, 13-15 March. Pereira, C.; Sousa, C.; Soares, A.
2009. A socio-semantic approach to collaborative domain conceptualization. On the Move to Meaningful Internet Systems: OTM 2009 Workshops, Lecture Notes in Computer Science, Springer Berlin / Heidelberg. Costa, R. 2006. Terminology, Corpus Linguistics and Ontology, Contrastive Studies and Valency. Studies in Honor of Hans Ulrich Boas. Petra C. Steiner, Hans C. Boas, Stefan Schierholz [eds.]. Berlin -- Bern: Peter Lang Verlag. Camarinha-Matos, L. 2006. Collaborative networks in industry -- Trends and foundations. In Proc. of DET'06 - 3rd International CIRP Conference in Digital Enterprise Technology. Nuopponen, A. 2011. Methods of concept analysis - tools for systematic concept analysis (part 3 of 3). In: LSP, professional communication, knowledge management and cognition. http://rauli.cbs.dk/index.php/lspcog/index ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:40:53 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:40:53 +0100 Subject: Appel: META-RESEARCH Workshop on Advanced Treebanking, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 15:55:57 +0100 From: ELRA ELDA Information Message-ID: <4EEF507D.9000502 at elda.org> X-url: http://www.meta-net.eu/meta-research/events/lrec2012-treebanking-workshop [Apologies for multiple postings] META-RESEARCH Workshop on Advanced Treebanking (LREC 2012) - Call for papers *The Workshop* Many EU and world-wide projects and research groups are creating, standardizing,
converting and/or using treebanks, thereby often tackling the same issues and reinventing methods and tools. The workshop has been conceived by four projects, namely T4ME, META-NORD, CESAR and META4U, which under the META-NET umbrella project strive to make many treebanks and other LRT available for R&D. However, the workshop is not limited to participants of META-NET, but it is widely open: papers of authors of any origin will be welcome to share their experiences among those working in this area. Also, papers on using treebanks in innovative ways and/or analyzing issues with current treebanks (both in content and format) will be welcome, to provide feedback to treebank developers. The emphasis is not on linguistic theory or grammar, but on treebanks as language resources. *The Call* As part of the LREC'2012 Language Resources Conference, to be held in Istanbul, Turkey, the META-RESEARCH Workshop on Advanced Treebanking (http://www.meta-net.eu/meta-research/events/lrec2012-treebanking-workshop) invites papers on all aspects of treebanking, especially but not limited to - Treebank annotation (manual, semi-automatic, automatic) and related issues such as quality control, searching large collections of treebanks, parallel treebanks - Treebank conversion between different linguistic representations (e.g. dependency vs. phrase-based) and between different formats - Treebank standardization and format unification across languages and styles - Combinations of treebanks and other annotations -- including but not limited to semantic role labeling, named entity annotation, discourse annotation, information structure - Tools and online web services for the management of treebanks, visualization, search, processing, upload and download, etc. - Innovative use of treebanks in all areas of natural language research and applications - Reuse of treebank annotation for non-traditional purposes and in related disciplines (cognitive science, psycholinguistics, etc.) 
- Improving the use of current treebanks in linguistic and computational R&D. Oral presentations will have around 20 minutes each; demonstrations and poster presentations will be encouraged (but optional) in addition to the oral presentations, during the entire workshop in the meeting room to stimulate discussions. Submissions cannot be more than 8 pages long and will be reviewed using the START system in the usual double-blind fashion, with at least three reviews for each submission by the members of the Programme Committee. Authors will be asked to provide details about the resources used when preparing material for the paper (see http://www.lrec-conf.org/lrec2012/?LRE-Map-2012) and to contribute to the Language Library, the new initiative of LREC 2012. *Organizers* Jan Hajic, hajic at ufal.mff.cuni.cz, Charles Univ., Prague, Czech Rep. Koenraad De Smedt, desmedt at uib.no, Univ. of Bergen, Norway Marko Tadic, marko.tadic at ffzg.hr, Univ. of Zagreb, Croatia Antonio Branco, Antonio.branco at di.fc.ul.pt, University of Lisbon, Portugal *Deadlines* Submission: Monday, February 20, 2012 Decision: Monday, March 19, 2012 Camera Ready in START: Wednesday, March 28, 2012 Workshop date: Tuesday, May 22, 2012 *Programme Committee (to be extended)* Antonio Branco, Portugal Silvie Cinkova, Czech Rep. Dan Cristea, Romania Koenraad De Smedt, Norway Rebecca Dridan, Australia Jan Hajic, Czech Rep. Silvia Hansen-Schirra, Germany Nancy Ide, USA Daisuke Kawahara, Japan Valia Kordoni, Germany Sandra Kuebler, USA Krister Linden, Finland Adam Meyers, USA Joakim Nivre, Sweden Stephan Oepen, Norway Marco Passarotti, Italy Adam Przepiorkowski, Poland Eirikur Rognvaldsson, Iceland Victoria Rosen, Norway Marko Tadic, Croatia Barbora Vidova Hladka, Czech Rep. Juergen Wedekind, Denmark Fei Xia, USA Daniel Zeman, Czech Rep.
------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:43:50 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:43:50 +0100 Subject: Appel: 1st CfP Challenges in the management of large corpora, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:00:05 +0100 From: ELRA ELDA Information Message-ID: <4EEF5175.80803 at elda.org> X-url: http://corpora.ids-mannheim.de/cmlc.html [Apologies for multiple postings] Call for Papers: LREC 2012 Workshop Challenges in the management of large corpora We live in an age where the well-known maxim that "the only thing better than data is more data" is something that no longer sets unattainable goals. Creating extremely large corpora is no longer a challenge, given the proven methods that lie behind e.g. applying the Web-as-Corpus approach or utilizing Google's n-gram collection. Indeed, the challenge has now shifted towards dealing with the large amounts of primary data and much larger amounts of annotation data. On the one hand, this challenge concerns finding new (corpus-) linguistic methodologies that can make use of such extremely large corpora e.g. in order to investigate rare phenomena involving multiple lexical items or to find and represent fine-grained sub-regularities; on the other hand, some fundamental technical methods and strategies are being called into question. These include e.g.
successful curation of the data, management of collections that span multiple volumes or that are distributed across several centres, methods to clean the data of non-linguistic intrusions or duplicates, as well as automatic annotation methods or innovative corpus architectures that maximise the usefulness of data or allow it to be searched and analysed efficiently. Among the new tasks are also collaborative manual annotation and methods to manage it as well as new challenges to the statistical analysis of such data and metadata. The half-day workshop on "Challenges in the management of large corpora" aims at gathering the leading researchers in the field of Language Resource creation and Corpus Linguistics, in order to provide for an intensive exchange of expertise, results and ideas. We invite submissions dealing with: 1. building tools for all aspects of management of very large corpora, 2. dealing with large data sets (file system architecture, database architecture), 3. dealing with heavily annotated corpora, 4. managing multiple and concurrent annotation layers, 5. use of annotation standards for large data sets, 6. issues of interoperability and tool-chaining, 7. crowdsourcing for large data sets, 8. quality control of annotations in large data sets, 9. analytic tools used in research infrastructure initiatives, such as, e.g., the Common Language Resource and Technology Infrastructure (CLARIN), 10. dealing with corpora physically distributed over different locations, 11. managing metadata for extremely large corpus collections, 12. efficient user interfaces, 13. effective querying of large corpora with multiple annotation layers, 14. "bringing the code to the data" as the strategy for dealing with IPR restrictions, 15. open-source software and open-data corpora strategies, 16. other issues that arise in the context of management of large datasets.
Current information is available at: http://corpora.ids-mannheim.de/cmlc.html Abstract submission We invite extended abstracts (1500 to 2000 words) for 20+10 minute presentations, as well as posters and demos. All abstracts have to be submitted via the START Conference Manager, available from https://www.softconf.com/lrec2012/LargeCorpora2012/. Please note: when submitting a contribution to the START, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the contribution or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 Important dates Workshop: 22 May 2012, afternoon session. Deadline for submission of extended abstracts: February 15. Notification of acceptance: February 29. Submission of full, camera-ready papers: March 23. Venue The workshop will take place at the Conference venue, the Lütfi Kirdar Istanbul Exhibition and Congress Centre. Further details will be available in due time from conference homepage. 
Organizing Committee The workshop is co-organized by the following three institutions: Institut für Deutsche Sprache, Mannheim: Piotr Bański, Marc Kupietz, Andreas Witt; Institute for Language Information and Technology, Eastern Michigan University: Helen Aristar-Dry, Anthony Aristar, Damir Ćavar; ICAR Laboratory, Lyon University: Serge Heiden Programme committee: Núria Bel (Universitat Pompeu Fabra) Mark Davies (Brigham Young University) Stefanie Dipper (Ruhr-Universität Bochum) Tomaž Erjavec (Jožef Stefan Institute) Stefan Evert (Technische Universität Darmstadt) Alexander Geyken (Berlin-Brandenburgische Akademie der Wissenschaften) Andrew Hardie (University of Lancaster) Nancy Ide (Vassar College) Sandra Kübler (Indiana University) Martin Mueller (Northwestern University) Mark Olsen (University of Chicago) Adam Przepiórkowski (Polish Academy of Sciences, University of Warsaw) Reinhard Rapp (Johannes Gutenberg-Universität Mainz, University of Leeds) Laurent Romary (INRIA, Humboldt-Universität zu Berlin) Serge Sharoff (University of Leeds) Pavel Straňák (Charles University in Prague) Amir Zeldes (Humboldt-Universität zu Berlin) Workshop homepage: http://corpora.ids-mannheim.de/cmlc.html ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:45:51 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:45:51 +0100 Subject: Appel: 1st CfP Language Resource Merging, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:04:59
+0100 From: ELRA ELDA Information Message-ID: <4EEF529B.2080304 at elda.org> X-url: http://www.meta-net.eu/meta-share X-url: http://www.panacea-lr.eu/ [Apologies for multiple postings] **1st Call for Papers** LREC 2012 Workshop on: Language Resource Merging 22 May 2012 -- Afternoon Session *CONTEXT* The availability of adequate language resources has been a well-known bottleneck for most high-level language technology applications, e.g. Machine Translation, parsing, and Information Extraction, for at least 15 years, and the impact of the bottleneck is becoming all the more apparent with the availability of higher computational power and massive storage, since modern language technologies are capable of using far more resources than the community produces. The present landscape is characterized by the existence of numerous scattered resources, many of which have differing levels of coverage, types of information and granularity. Taken singly, existing resources do not have sufficient coverage, quality or richness for robust large-scale applications, and yet they contain valuable information (Monachini et al. 2004 and 2006; Soria et al. 2006; Molinero, Sagot and Nicolas 2009; Necsulescu et al. 2011). Differing technology or application requirements, ignorance of the existence of certain resources, and difficulties in accessing and using them have led to the proliferation of multiple, unconnected resources that, if merged, could constitute a much richer repository of information augmenting either coverage or granularity, or both, and consequently multiplying the number of potential language technology applications. Merging, combining and/or compiling larger resources from existing ones thus appears to be a promising direction to take. The re-use and merging of existing resources is not altogether unknown. For example, WordNet (Fellbaum, 1998) has been successfully reused in a variety of applications.
But this is the exception rather than the rule; in fact, merging, and enhancing existing resources is uncommon, probably because it is by no means a trivial task given the profound differences in formats, formalisms, metadata, and linguistic assumptions. The language resource landscape is on the brink of a large change, however. With the proliferation of accessible metadata catalogues, and resource repositories (such as the new META-SHARE (http://www.meta-net.eu/meta-share) infrastructure), a potentially large number of existing resources will be more easily located, accessed and downloaded. Also, with the advent of distributed platforms for the automatic production of language resources, such as PANACEA (http://www.panacea-lr.eu/), new language resources and linguistic information capable of being integrated into those resources will be produced more easily and at a lower cost. Thus, it is likely that researchers and application developers will seek out resources already available before developing new, costly ones, and will require methods for merging/combining various resources and adapting them to their specific needs. Up to the present day, most resource merging has been done manually, with only a small number of attempts reported in the literature towards (semi-)automatic merging of resources (Crouch & King 2005; Pustejovsky et al. 2005; Molinero, Sagot and Nicolas 2009; Necsulescu et al. 2011). In order to take a further step towards the scenario depicted above, in which resource merging and enhancing is a reliable and accessible first step for researchers and application developers, experience and best practices must be shared and discussed, as this will help the whole community avoid any waste of time and resources. 
* AIMS OF THE WORKSHOP* This half-day workshop is meant to be part of a series of meetings constituting an ongoing forum for sharing and evaluating the results of different methods and systems for the automatic production of language resources (the first one was the LREC 2010 Workshop on Methods for the Automatic Production of Language Resources and their Evaluation Methods). The main focus of this workshop is on (semi-)automatic means of merging language resources, such as lexicons, corpora and grammars. Merging makes it possible to re-use, adapt, and enhance existing resources, alongside new, automatically created ones, with the goal of reducing the manual intervention required in language resource production, and thus ultimately production costs. *WORKSHOP TOPICS* The topics of the workshop are related to best practices, methods, techniques and experimental results regarding the merging of various types of language resources, such as lexicons and corpora, especially in support of language technology applications. In particular, new methods for automatic merging with a view towards reducing human intervention will be most welcome. Topics for submission include, but are not limited to: - Experiments on (semi-)automatic merging of automatically produced resources - Experiments on the merging of two or more existing resources containing the same or different levels of linguistic information - Studies or experiments on merging resources at different levels of granularity (corpora, lexicons, grammars) - Studies or experiments on unifying, mapping or converting encoding formats - Comparison between different resources and mapping algorithms to provide desired merging - Use of linguistic information from different sources in high-level language applications - Use of new, merged language resources in language technology applications *SUBMISSIONS* Interested participants must submit a preliminary paper of about 4-6 pages including references (between 2000-2500 words). 
To submit, please use the online form on the START LREC Conference Manager at: https://www.softconf.com/lrec2012/MergingLR2012/ When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 Papers will be peer-reviewed by the workshop Program Committee. *IMPORTANT DATES* · Deadline for paper submission: 15 February 2012 · Notification of acceptance: 15 March 2012 · Submission of camera-ready version of papers: 31 March 2012 · Workshop date: 22 May 2012 -- Afternoon Session *ORGANIZING COMMITTEE* Núria Bel, UPF, Barcelona, Spain Maria Gavrilidou, ILSP-"Athena", Athens, Greece Monica Monachini, CNR-ILC, Pisa, Italy Valeria Quochi, CNR-ILC, Pisa, Italy Laura Rimell, University of Cambridge, UK Contacts lrec12_workshop_merging at ilc.cnr.it *PROGRAMME COMMITTEE:* Victoria Arranz, ELDA, Paris, France Paul Buitelaar, National University of Ireland, Galway, Ireland Nicoletta Calzolari, CNR-ILC, Pisa, Italy Olivier Hamon, ELDA, Paris, France Aleš Horák, Masaryk University, Brno, Czech Republic Nancy Ide, Vassar College, Mass.
USA Bernardo Magnini, FBK, Trento, Italy Paola Monachesi, Utrecht University, Utrecht, The Netherlands Jan Odijk, Utrecht University, Utrecht, The Netherlands Muntsa Padró, IULA, Barcelona, Spain Karel Pala, Masaryk University, Brno, Czech Republic Thierry Poibeau, University of Cambridge, UK and CNRS, Paris, France Benoît Sagot, INRIA, Paris, France Kiril Simov, Bulgarian Academy of Sciences, Sofia, Bulgaria Claudia Soria, CNR-ILC, Pisa, Italy Maurizio Tesconi, CNR-IIT, Pisa ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:48:24 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:48:24 +0100 Subject: Appel: 1st CfP Workshop on Language technology for normalisation of less-resourced languages, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:33:04 +0100 From: ELRA ELDA Information Message-ID: <4EEF5930.9080206 at elda.org> X-url: http://ixa2.si.ehu.es/saltmil/ X-url: http://AfLaT.org/ [Apologies for multiple postings] Call for Papers: Workshop on "Language technology for normalisation of less-resourced languages" 8th SALTMIL Workshop on Minority Languages and the 4th workshop on African Language Technology (AfLaT2012). A full-day workshop at LREC 2012 Tuesday, 22 May 2012.
Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey

SALTMIL: http://ixa2.si.ehu.es/saltmil/
AfLaT: http://AfLaT.org/
LREC 2012: http://www.lrec-conf.org/lrec2012/
Paper submission: https://www.softconf.com/lrec2012/Less-RessourcedLang2012/

Papers are invited for the above full-day workshop, in the format outlined below. Most submitted papers will be presented in poster form, though some authors may be invited to present in lecture format.

Context and focus

The 8th International Workshop of the ISCA Special Interest Group on Speech and Language Technology for Minority Languages (SALTMIL, http://ixa2.si.ehu.es/saltmil) and the 4th Workshop on African Language Technology (AfLaT2012) will be held as a joint effort in Istanbul, in May 2012, as part of the 2012 International Language Resources and Evaluation Conference (LREC 2012). Entitled "Language technology for normalisation of less-resourced languages", the workshop is intended to continue the series of SALTMIL/LREC workshops on computational language resources for minority languages, held in Granada (1998), Athens (2000), Las Palmas de Gran Canaria (2002), Lisbon (2004), Genoa (2006), Marrakech (2008) and Malta (2010), and the series of AfLaT workshops, held in Athens (EACL2009), Malta (LREC2010) and Addis Ababa (AGIS11).

The Istanbul 2012 workshop aims to share information on tools and best practices, so that isolated researchers will not need to start from scratch. An important aspect will be the forming of personal contacts, which can minimize duplication of effort. There will be a balance between presentations of existing language resources and more general presentations designed to give background information needed by all researchers.
While less-resourced languages and minority languages often struggle to find their place in a digital world dominated by only a handful of commercially interesting languages, a growing number of researchers are working on alleviating this linguistic digital divide, through localisation efforts, the development of BLARKs (basic language resource kits) and practical applications of human language technologies. The joint SALTMIL/AfLaT workshop on "Language technology for normalisation of less-resourced languages" provides a unique opportunity to connect these researchers and set up a common forum to meet and share the latest developments in the field. *Topics* The workshop takes an inclusive approach to the word "normalisation", considering it to include both technologies that help make languages more "normal" in society and everyday life, as well as technologies that normalise languages, i.e. help create or maintain a written standard or support diversity in standards. We particularly focus on the challenges less-resourced and minority languages face in the digital world. 
Papers are invited that describe research and development in technologies for language normalisation, including (but not limited to) topics such as:
* Keyboard layouts and entry methods
* Standardisation in machine readable lexicons/dictionaries
* Computer-aided language learning (CALL)
* Dealing with language variants in NLP
* Automatic identification of varieties, dialects
* Corpus construction and annotation
* Terminology development and management
* MT between varieties of the same language
* Spelling correction/normalisation
* Machine translation (MT)
* Morphological analysers
* Part-of-speech taggers and parsers
* Speech recognition and synthesis
* Information extraction/retrieval
* Localisation efforts
* Mobile phones as a platform for HLT

*Schedule*
09:00 Registration/Opening
09:30 Invited talk 1
11:00 Oral Papers (3x 25+5mins)
12:30 Lunch
14:00 Oral Papers (3x 25+5mins)
16:00 Poster Session
17:00 Panel Discussion
17:30 SALTMIL Assembly
18:30 Closing

*Organisers (SALTMIL and AfLaT)*
* Mikel L. Forcada (SALTMIL): Machine Translation Group, School of Computing, Dublin City University, Dublin, Ireland
* Guy De Pauw (AfLaT): CLiPS - Computational Linguistics Group, University of Antwerp, Antwerp, Belgium
* Gilles-Maurice de Schryver (AfLaT): African Languages and Cultures, TshwaneDJe HLT, South Africa & Ghent University, Belgium
* Kepa Sarasola (SALTMIL): Dept. of Computer Languages, University of the Basque Country
* Francis M. Tyers (SALTMIL): Departament de Llenguatges i Sistemes Informàtics, Universitat d'Alacant, Spain
* Peter Waiganjo Wagacha (AfLaT): School of Computing & Informatics, University of Nairobi, Nairobi, Kenya

*Programme committee*
* Iñaki Alegria, University of the Basque Country
* Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain
* Lars Borin, Göteborgs universitet, Sweden
* Sonja Bosch, University of South Africa, South Africa
* Mikel L. Forcada, Universitat d'Alacant
* Dafydd Gibbon, University of Bielefeld, Germany
* Girish Nath Jha, Jawaharlal Nehru University, India
* Hrafn Loftsson, University of Reykjavik
* Guy De Pauw, CLiPS, Universiteit Antwerpen
* Laurette Pretorius, University of South Africa, South Africa
* Lori Levin, Carnegie Mellon University, USA
* Odetunji Odejobi, Obafemi Awolowo University, Nigeria
* Felipe Sánchez-Martínez, Universitat d'Alacant
* Kepa Sarasola, University of the Basque Country
* Kevin Scannell, Saint Louis University, USA
* Gilles-Maurice de Schryver, Universiteit Gent
* Francis M. Tyers, Universitat d'Alacant
* Peter Waiganjo Wagacha, University of Nairobi

*Submissions*
We expect short papers of max 6,000 words (up to 6 pages) describing research addressing one of the above topics, to be submitted as PDF documents using the LREC2012 START conference management system (URL: https://www.softconf.com/lrec2012/Less-RessourcedLang2012/). Submissions should be anonymized.

When submitting a paper through the START page, authors will be asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012.

Submissions should follow the same style as papers for the main LREC conference (an Author's Kit made of specific guidelines and downloadable templates will be published on the conference web site in due time).

All contributions (including invited papers) will be included in the workshop proceedings (CD). They will also be published on the SALTMIL website.
*Important dates* * 27 February 2012: Deadline for submission * 14 March 2012: Notification * 28 March 2012: Final version * 22 May 2012: Workshop *Registration* Registration details will be announced in due course ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:53:34 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:53:34 +0100 Subject: Appel: 1st CfP @NLP can u tag #user_generated_content ?! via lrec-conf.org, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:47:25 +0100 From: ELRA ELDA Information Message-ID: <4EEF5C8D.5090804 at elda.org> [Apologies for multiple postings] Background and Motivation The Web 2.0 has transferred the authorship of contents from institutions to the people; the web has become a channel where users exchange, explain or write about their lives and interests, give opinions and rate others' opinions. The so-called User Generated Content (UGC) in text form is a valuable resource that can be exploited for many purposes, such as cross-lingual information retrieval, opinion mining, enhanced web search, social science analysis, intelligent advertising, and so on. In order to mine the data from the Web 2.0 we first need to understand its contents. Analysis of UG content is challenging because of its casual language, with plenty of abbreviations, slang, domain specific terms and, compared to published edited text, with a higher rate of spelling and grammar errors. 
Standard NLP techniques, which are used to analyze text and provide formal representations of surface data, have been typically developed to deal with standard language and may not yield the expected results on UGC. For example, shortened or misspelled words, which are very frequent in the Web 2.0 informal style, increase the variability in the forms for expressing a single concept. This workshop aims at providing a meeting point for researchers working in the processing of UGC in textual form in one way or another, as well as developers of UGC-based applications and technologies, both from industry and academia. *Topics of Interest* We are mainly interested in, but not restricted to, the following research questions: - What characterises UGC? Linguistic and textual phenomena that distinguish UGC from standard written text, and may pose a challenge for NLP. - Definition of norm, concept of error, deviation and variation in UGC. - Criteria and standards for the annotation of evaluation corpora in UGC at various levels of linguistic analysis (form, part of speech, constituents, dependencies, speech acts, deviation types, etc.). - How quality of text affects processing tasks (tokenization, POS tagging, chunking, parsing, named-entity detection, etc.) - Architecture and software design for flexible adaptation of NLP processing pipelines to new domains (topic domains and text-genre domains) - Text normalisation vs adaptation of processing tools: - Pros and cons - Task dependent? - Costs and benefits - Hybrid solutions - Approaches to normalisation (text checking, ASR, MT techniques, etc.) - Evaluation issues related to processing and normalising UGC *Intended Audience* The workshop aims at bringing together researchers and developers from academia and industry. 
In particular, perspectives from the following user groups are welcome: - UGC-based application developers, from both research and industry - Researchers from the NLP, IR and IE communities - Ph.D students interested or working in the processing of UGC *Submissions* - Oral papers and posters should follow the main conference formatting requirements (http://www.lrec-conf.org/lrec2012/). - To submit contributions, please follow the instructions at https://www.softconf.com/lrec2012/UGC2012/ - The contributions will undergo a double review by members of the programme committee. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. For further information on this new initiative, please refer to: http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 *Important Dates* February 15: Paper submission deadline March 15: Acceptance notifications March 30: Camera-ready papers May 26: Afternoon Workshop at LREC *Organising Committee* Laura Alonso i Alemany, Universidad Nacional de Córdoba (Argentina) Jordi Atserias, Yahoo! 
Research (Spain) Toni Badia, Universitat Pompeu Fabra (Spain) Maite Melero, Barcelona Media Innovation Center (Spain) Martí Quixal, Barcelona Media Innovation Center (Spain) *Programme Committee* Rafael Banchs, Institute for Infocomm Research - A*Star (Singapore) Steven Bedrick, Oregon Health & Science University Joan Codina, Universitat Pompeu Fabra (Spain) Louise-Amélie Cougnon, Université Catholique de Louvain, ILC, Cental, (Belgium) Jennifer Foster, Dublin City University (Ireland) Michael Gamon, Microsoft Research (USA) Fei Liu, Bosch Research (USA) Ulrike Pado, VICO Research&Consulting GmbH Lluís Padró, Universitat Politècnica de Catalunya (Spain) Alan Ritter, CSE, University of Washington (USA) Roser Saurí, Barcelona Media Innovation Center (Spain) Paul Schmidt, Institut der Gesellschaft zur Förderung der Angewandten Informationsforschung (Germany) L Venkata Subramaniam, IBM Research (India) ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:55:13 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:55:13 +0100 Subject: Appel: 1st Cfp Natural Language Processing for Improving Textual Accessibility (NLP4ITA), LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:53:26 +0100 From: ELRA ELDA Information Message-ID: <4EEF5DF6.4000801 at elda.org> X-url: http://www.taln.upf.edu/nlp4ita [Apologies for multiple postings] ******************************************************************** First Call for Papers LREC 
2012 workshop on: Natural Language Processing for Improving Textual Accessibility (NLP4ITA)
May 27, 2012 (Morning Session)
Istanbul, Turkey
http://www.taln.upf.edu/nlp4ita
********************************************************************

=== Introduction ===
This workshop aims to bring together researchers focused on tools and resources for making textual information more accessible to people with special needs, including people with diverse ranges of hearing and sight disabilities, people with cognitive disabilities, elderly people, low-literacy readers and adults acquiring literacy, among others.

=== Motivation and Topics of Interest ===
In recent years there has been an increasing interest in accessibility and usability issues. This interest is mainly due to the greater importance of the Web and the need to provide equal access and equal opportunity to people with diverse disabilities. The role of assistive technologies based on language processing has gained importance, as can be observed from the growing number of efforts (United Nations declarations on universal access to information or WAI guidelines related to content) and research in conferences and workshops (W4A, ICCHP, ASSETS, SPLAT, etc.). However, language resources and tools to develop assistive technologies are still scarce. This workshop will give an opportunity for individuals from different communities to present research findings, discover future challenges, and discuss potential collaboration.

We welcome papers describing tools, resources, models, techniques and evaluation from all areas of natural language processing tailored to accessibility and assistive technologies. These include, but are not limited to, the following:
- Resources: corpora of inaccessible documents, aligned corpora, gold-standards and annotation schemes.
- Evaluation, complexity and readability methodologies and metrics.
- Novel modeling and machine learning techniques for improving accessibility.
- Natural language generation for improving accessibility.
- NLP for Web accessibility.
- Text adaptation, elaboration, simplification and summarization.
- User studies on systems and components.
- NLP tools for accessibility.

=== Submission Information ===
We encourage contributions in the form of full papers (6 to 8 pages long). Papers should follow the LREC format, which is available at the LREC 2012 Web Site. The submission process will be online using the START conference system at https://www.softconf.com/lrec2012/NLP4ITA

All accepted papers will be presented orally and published in the workshop proceedings.

=== Important dates ===
Feb 24, 2012: Paper submission deadline
March 23, 2012: Notification of acceptance
March 30, 2012: Camera-ready version
May 27, 2012: LREC 2012 Workshops

=== LRE Map 2012 ===
The LRE Map 2012 is an initiative for monitoring the production, use and evaluation of language resources and tools. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.

=== Organizers ===
Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!)
Paloma Moreda (Universidad de Alicante)
Luz Rello (Universidad Pompeu Fabra)
Horacio Saggion (Universidad Pompeu Fabra)
Lucia Specia (University of Wolverhampton)

=== Program Committee ===
Sandra Aluisio (University of Sao Paulo)
Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!)
Delphine Bernhard (University of Strasbourg)
Nadjet Bouayad-Agha (Universidad Pompeu Fabra)
Richard Evans (University of Wolverhampton)
Caroline Gasperin (TouchType Ltd)
Pablo Gervás (Universidad Complutense de Madrid)
Simon Harper (University of Manchester)
David Kauchak (Middlebury College)
Guy Lapalme (University of Montreal)
Paloma Martínez (Universidad Carlos III de Madrid)
Aurelien Max (Paris 11)
Kathleen F. McCoy (University of Delaware)
Ornella Mich (Fondazione Bruno Kessler)
Ruslan Mitkov (University of Wolverhampton)
Paloma Moreda (Universidad de Alicante)
Constantin Orasan (University of Wolverhampton)
Luz Rello (Universidad Pompeu Fabra)
Horacio Saggion (Universidad Pompeu Fabra)
Advaith Siddharthan (University of Aberdeen)
Lucia Specia (University of Wolverhampton)
Juan Manuel Torres Moreno (University of Avignon)
Markel Vigo (University of Manchester)
Leo Wanner (Universidad Pompeu Fabra)
Yeliz Yesilada (Middle East Technical University Northern Cyprus Campus)

=== Contact Address ===
For further information please contact us at: luz.rello at upf.edu or horacio.saggion at upf.edu

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:57:18 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:57:18 +0100
Subject: Appel: 1st CfP Semantic Processing of Legal Texts (SPLeT-2012), LREC 2012 Workshop
Message-ID:
Date: Mon, 19 Dec 2011 17:03:42 +0100
From: ELRA ELDA Information
Message-ID: <4EEF605E.1020005 at
elda.org> [Apologies for multiple postings] LREC 2012 Workshop on Semantic Processing of Legal Texts (SPLeT-2012) CALL FOR PAPERS 27 May 2012, Istanbul *Workshop description* The legal domain represents a primary candidate for web-based information distribution, exchange and management, as testified by the numerous e-government, e-justice and e-democracy initiatives worldwide. The last few years have seen a growing body of research and practice in the field of Artificial Intelligence and Law which addresses a range of topics: automated legal reasoning and argumentation, semantic and cross-language legal information retrieval, document classification, legal drafting, legal knowledge discovery and extraction, as well as the construction of legal ontologies and their application to the law domain. In this context, it is of paramount importance to use Natural Language Processing techniques and tools that automate and facilitate the process of knowledge extraction from legal texts. Since 2008, the SPLeT workshops have been a venue where researchers from the Computational Linguistics and Artificial Intelligence and Law communities meet, exchange information, compare perspectives, and share experiences and concerns on the topic of legal knowledge extraction and management, with particular emphasis on the semantic processing of legal texts. Within the Artificial Intelligence and Law community, there have also been a number of dedicated workshops and tutorials specifically focussing on different aspects of semantic processing of legal texts at conferences such as JURIX-2008, ICAIL-2009, ICAIL-2011, as well as in the International Summer School "Managing Legal Resources in the Semantic Web" (2007, 2008, 2009, 2010, 2011). 
To continue this momentum and to advance research, a 4th Workshop on "Semantic Processing of Legal Texts" is being organized at the LREC-2012 conference to bring to the attention of the broader LR/HLT (Language Resources/Human Language Technology) community the specific technical challenges posed by the semantic processing of legal texts and also share with the community the motivations and objectives which make it of interest to researchers in legal informatics. The outcome of these interactions is expected to advance research and applications and foster interdisciplinary collaboration within the legal domain.

New to this edition of the workshop are two sub-events to provide common and consistent task definitions, datasets, and evaluation for legal-IE systems along with a forum for the presentation of varying but focused efforts on their development.

The first sub-event will be a shared task specifically focusing on dependency parsing of legal texts: although this is not a domain-specific task, it is a task which creates the prerequisites for advanced IE applications operating on legal texts, which can benefit from reliable preprocessing tools. For this year our aim is to create the prerequisites for more advanced domain-specific tasks (e.g. event extraction) to be organized in future SPLeT editions. We strongly believe that this could be a way to attract the attention of the LR/HLT community to the specific challenges posed by the analysis of this type of text and to have a clearer idea of the current state of the art. The languages dealt with will be Italian and English. A specific Call for Participation for the shared task is available on a dedicated page.

The second sub-event will be an online, manual, collaborative, semantic annotation exercise, the results of which will be presented and discussed at the workshop.
The goals of the exercise are: (1) to gain insight into and work towards the creation of a gold standard corpus of legal documents in a cohesive domain; and (2) to test the feasibility of the exercise and to get feedback on its annotation structure and workflow. The corpus to be annotated will be a selection of documents drawn from EU and US legislation, regulation, and case law in a particular domain (e.g. consumer or environmental protection). For this exercise, the language will be English. A specific Call for Participation for this annotation exercise is available on a dedicated page.

The main goals of the workshop and associated events are to provide an overview of the state of the art in legal knowledge extraction and management, to explore new research and development directions and emerging trends, and to exchange information regarding legal language resources and human language technologies and their applications.

*Areas of Interest*
The workshop will focus on the topics of the automatic extraction of information from legal texts and the structural organisation of the extracted knowledge. Particular emphasis will be given to the crucial role of language resources and human language technologies.
Papers are invited on, but not limited to, the following topics: - Construction, extension, merging, customization of legal language resources: terminologies, ontologies - Information retrieval and extraction from legal texts - Semantic annotation of legal textual corpora - Legal text processing - Multilingual aspects of legal text semantic processing - Legal thesauri mapping - Automatic Classification of legal documents - Logical analysis of legal language - Automated parsing and translation of natural language arguments into a logical formalism - Linguistically-oriented XML mark up of legal arguments - Dialogue protocols for argumentation - Legal argument ontology - Computational theories of argumentation that are suitable to natural language - Controlled language systems for law. *Submissions* Submissions are solicited from researchers working on all aspects of semantic processing of legal texts. Authors are invited to submit papers describing original completed work, work in progress, interesting problems, case studies or research trends related to one or more of the topics of interest listed above. The final version of the accepted papers will be published in the Workshop Proceedings. Short or full papers can be submitted. Short papers are expected to present new ideas or new visions that may influence the direction of future research, yet they may be less mature than full papers. While an exhaustive evaluation of the proposed ideas is not necessary, insight and in-depth understanding of the issues is expected. Full papers should be more well developed and evaluated. Short papers will be reviewed the same way as full papers by the Program Committee and will be published in the Workshop Proceedings. Full paper submissions should not exceed 10 pages, short papers 6 pages; both should be typeset using a font size of 11 points. Style files will be made available by LREC for the camera-ready versions of accepted papers. 
Papers should be submitted electronically, no later than February 10, 2012. The only accepted format for submitted papers is Adobe PDF. Submission will be electronic using the START paper submission software available at https://www.softconf.com/lrec2012/SPLeT2012/.

Note that when submitting a paper through the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.

Selected contributions to a Special Issue of AI&Law Journal
After the Workshop a number of selected, revised, peer-reviewed articles will be published in a Special Issue on Semantic Processing of Legal Texts of the AI and Law Journal (Springer).

*Important Dates*
Paper submission deadline: 10 February 2012
Acceptance notification sent: 5 March 2012
Final version deadline: 23 March 2012
Workshop date: 27 May 2012

*Workshop Chairs*
- Enrico Francesconi (Istituto di Teoria e Tecniche dell'Informazione Giuridica of CNR, Florence, Italy)
- Simonetta Montemagni (Istituto di Linguistica Computazionale of CNR, Pisa, Italy)
- Wim Peters (Natural Language Processing Research Group, University of Sheffield, UK)
- Adam Wyner (Department of Computer Science, University of Liverpool, UK)

Address any queries regarding the workshop to: lrec_legalWS at ilc.cnr.it

*Program Committee (tbc)*
Kevin Ashley (Univ of Pittsburgh)
Johan Bos (University of Rome, Italy)
Danièle Bourcier (Humboldt Universität, Berlin, Germany)
Thomas R. Bruce (Cornell Law School, Ithaca, NY, USA)
Pompeu Casanovas (Institut de Dret i Tecnologia, UAB, Barcelona, Spain)
Jack Conrad (Thomson-Reuters)
Matthias Grabmair (University of Pittsburgh, USA)
Carole Hafner (Northeastern Univ.)
Antonio Lazari (Scuola Superiore S.Anna, Pisa, Italy)
Leonardo Lesmo (Dipartimento di Informatica, Università di Torino, Torino, Italy)
Carl Malamud (Public.Resource.Org)
Marie-Francine Moens (Katholieke Universiteit Leuven, Belgium)
Thorne McCarty (Rutgers Univ.)
Raquel Mochales Palau (Catholic University of Leuven, Belgium)
Paulo Quaresma (Universidade de Évora, Portugal)
Robert Richards (Legal Informatics blog)
Tony Russell-Rose (Endeca)
Erich Schweighofer (Universität Wien, Rechtswissenschaftliche Fakultät, Wien, Austria)
Rolf Schwitter (Macquarie Univ)
Manfred Stede (University of Potsdam, Germany)
Daniela Tiscornia (Istituto di Teoria e Tecniche dell'Informazione Giuridica of CNR, Florence, Italy)
Tom van Engers (Leibniz Center for Law, University of Amsterdam, Netherlands)
Giulia Venturi (Scuola Superiore S.Anna, Pisa, Italy)
Vern R. Walker (Hofstra University School of Law, Hofstra University, USA)
Radboud Winkels (Leibniz Center for Law, University of Amsterdam, Netherlands)

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:58:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:58:49 +0100
Subject: Appel: 1st CfP Creating Cross-language Resources for Disconnected Languages and Styles (CREDISLAS), LREC 2012 Workshop
Message-ID:
Date: Mon, 19 Dec 2011 17:06:32 +0100
From: ELRA ELDA Information
Message-ID: <4EEF6108.4030909 at elda.org>
X-url: http://www-lium.univ-lemans.fr/credislas2012

[Apologies for
multiple postings]

========================================================================
Workshop on CREATING CROSS-LANGUAGE RESOURCES FOR DISCONNECTED LANGUAGES AND STYLES
Co-located with LREC 2012 (http://www.lrec-conf.org/lrec2012/)
Istanbul, Turkey
May 27, 2012 (afternoon session)
Deadline for paper submissions: February 26, 2012
http://www-lium.univ-lemans.fr/credislas2012
========================================================================

This half-day workshop aims at developing strategies and sharing experiences on creating resources for reducing the linguistic gap between those language pairs for which cross-language resources are scarce. Although this situation has most commonly been addressed for minority languages, which have scarce resources of their own, it is also an important issue in other situations, such as: majority languages that, because of their cultural, historical and/or geographical disconnection, do not have a significant amount of cross-language resources between them (Chinese and Spanish being one clear example in this category); or single languages in which new communication trends and styles lack cross-language resources linking them to the main formal language (as with chat-speak communication and the formal language).

Current computational and data storage capabilities have favoured the proliferation of data-driven and statistical approaches in natural language processing and computational linguistics. Empirical evidence has demonstrated in a large number of cases and applications how the availability of appropriate datasets can boost the performance of processing methods and analysis techniques. In this scenario, the availability of data has come to play a fundamental role.
On the other hand, both the diversity of languages and the emergence of new communication media and stylistic trends are responsible for the scarcity of resources in the case of some specific tasks and applications. In this sense, this workshop focuses on those specific applications or cases for which data scarcity poses a restrictive problem for data-driven approaches. This includes the following three specific situations:

Minority Languages, for which scarcity of resources is a consequence of the minority nature of the language itself. In this case, attention is focused on the development of both monolingual and cross-lingual resources. Some examples in this category include Basque, Pashto and Haitian Creole, to mention just a few.

Disconnected Languages, for which a large amount of monolingual resources is available, but due to cultural, historical and/or geographical reasons cross-language resources are actually scarce. Some examples in this category include language pairs such as Chinese and Spanish, Russian and Portuguese, and Arabic and Japanese, to mention just a few.

New Language Styles, which represent different communication forms or emerging stylistic trends in languages for which the available resources are practically useless. This case includes the typical examples of tweets and chat speak communications, as well as other informal forms of communication, in many languages.
The main topics of interest for this workshop include, but are not limited to, the following:
* Construction and collection of monolingual resources
* Construction and collection of cross-language resources
* Annotation guidelines and evaluation
* Automatic extraction of linguistic resources
* Automatic annotation of linguistic resources
* Use of crowdsourcing for generating and annotating resources
* Use of pivot languages for bridging unconnected languages
* Methods to adapt existing resources to new domains and styles
* Generation of resources for informal communication styles
* Evaluation of monolingual resources: tasks and protocols
* Evaluation of cross-language resources: tasks and protocols

SUBMISSION INSTRUCTIONS
Authors are invited to submit papers on original and previously unpublished work. Formatting should follow the LREC 2012 specifications, using the LaTeX or MS-Word style files (available soon at the conference website, see http://www.lrec-conf.org/lrec2012/). Submission is electronic in PDF format using the START submission system at https://www.softconf.com/lrec2012/CREDISLAS2012/

Double submission policy: Parallel submission to other meetings or publications is possible, but the workshop contact person (see below) must be notified immediately.

Authors of accepted papers will be invited to present their research at the workshop. The workshop papers will be part of the LREC proceedings and published on the web site of LREC 2012 before the conference.

IMPORTANT DATES
February 26, 2012: Paper submissions due
March 16, 2012: Notification of acceptance
March 30, 2012: Camera ready papers due
May 27, 2012: Workshop in Istanbul (afternoon session)

ORGANIZERS
Contact person: Patrik Lambert (e-mail: patrik.lambert at lium.univ-lemans.fr)
Patrik Lambert (University of Le Mans), Marta R. Costa-jussà (Barcelona Media Innovation Center), Rafael E.
Banchs (Institute for Infocomm Research)

PROGRAMME COMMITTEE
Marianna Apidianaki, LIMSI-CNRS, Orsay, France
Jordi Atserias, Yahoo! Research, Barcelona, Spain
Victoria Arranz, ELDA, Paris, France
Gareth Jones, Dublin City University, Ireland
Min-Yen Kan, National University of Singapore
Philipp Koehn, University of Edinburgh, UK
Udo Kruschwitz, University of Essex, UK
Yanjun Ma, Baidu Inc. Beijing, China
Sara Morrissey, Dublin City University, Ireland
Maja Popovic, DFKI, Berlin, Germany
Paolo Rosso, Universidad de Valencia, Spain
Marta Recasens, Stanford University, USA
Wade Shen, Massachusetts Institute of Technology, Cambridge, USA
Haifeng Wang, Baidu Inc. Beijing, China

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:00:43 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:00:43 +0100
Subject: Appel: 1st CfP Collaborative Resource Development and Delivery, LREC 2012 Workshop
Message-ID:
Date: Mon, 19 Dec 2011 17:13:30 +0100
From: ELRA ELDA Information
Message-ID: <4EEF62AA.9080103 at elda.org>
X-url: http://www.anc.org/Collaborative_Resource_Development
X-url: http://www.anc.org/MASC

[Apologies for multiple postings]

LREC Workshop
Collaborative Resource Development and Delivery
Lütfi Kirdar Istanbul Exhibition and Congress Centre
Istanbul, Turkey
May 27, 2012
http://www.anc.org/Collaborative_Resource_Development

A confluence of needs and activities points to a new emphasis in computational linguistics to address
lexical, propositional, and discourse semantics through corpora. A few examples are:
- the demand for high quality linguistic annotations of corpora representing a wide range of phenomena, especially at the semantic level, to support machine learning and computational linguistics research in general;
- the demand for high quality annotated corpora representing a broad range of genres that are flexible and extensible as need demands;
- the demand for high quality lexical and semantic resources to incorporate into the annotation process, and for the annotation process to produce;
- the need for easy-to-use, open access to all of these resources for everyone.

Such resources can be very costly to produce, due to the need for manual creation or validation to ensure quality. Therefore, to answer the growing need and lower the costs of resource creation and enhancement, there is a movement within the community toward collaborative resource development, including collaborative corpus annotation and collective creation/enhancement of lexical resources and knowledge bases. Collaborative development encompasses both engaging the community in the annotation and development of common resources and crowd-sourcing and similar solutions.

Technological advances now enable the development of web-based environments for collaborative annotation and enhancement of language resources, including annotated corpora, lexicons, and others, as well as platforms to support web services that deliver data, annotations, and other resources together with high-quality automated linguistic annotations of language data. At the same time, crowdsourcing is being explored as a viable means of producing high quality resources. Given the recent advancements in technology plus novel methods to collect manually annotated data, it is important to develop new methods of quality control, ideally ones that permit rapid acquisition and sharing of resources.
This workshop seeks contributions in all dimensions of collaborative resource development and delivery, with a specific focus on case studies and lessons learned. We invite submissions that address but are not limited to the following topics: - Web services and platforms for collaborative resource development and distribution; - Crowd sourcing for resource development, including studies of efficacy; - Strategies and issues for open resource distribution; - Evaluation of collaboratively developed resources; - Position papers outlining issues and proposing solutions for community-based collaborative resource development and/or delivery. Special session --------------- The workshop will include a special session devoted to means and considerations for community-based linguistic annotation, with a special emphasis on the Manually Annotated Sub-Corpus (MASC) (http://www.anc.org/MASC). We invite submissions to this session on the following topics: - position papers concerning any aspect of collaborative resource development, including means to get the community fully invested in such efforts; - case studies describing collaborative development efforts, including assessment of what works and what doesn't; - results obtained using collaboratively developed resources; - the role of standards and best practices in collaboratively developed resources and contributed annotations. Special consideration will be given to contributions that have used MASC data in a way that highlights the benefits of community-based annotation. Submission information ---------------------- Submissions may be long papers or short papers, following the formatting guidelines for submissions to the main conference given at http://www.lrec-conf.org/lrec2012/. All submissions should be made using the START system at https://www.softconf.com/lrec2012/CollaborativeDev2012/. Important Dates --------------- Submissions due: February 15, 2012 Acceptance notification to authors: March 15, 2012
 Camera ready due: April 1, 2012
 Workshop: May 27, 2012

Workshop organizers
-------------------
Nancy Ide, Vassar College, USA
Collin Baker, ICSI/UC Berkeley, USA
Christiane Fellbaum, Princeton University, USA
Rebecca Passonneau, Columbia University, USA

Contact: collaboration-workshop at anc.org

Program Committee Members (tentative)
-------------------------------------
Collin Baker, ICSI/UC Berkeley, USA
Jason Baldridge, University of Texas at Austin, USA
Jordan Boyd-Graber, University of Maryland, USA
Nicoletta Calzolari, ILC/CNR, Italy
Bob Carpenter, Alias-i, Inc., USA
Chris Cieri, LDC, University of Pennsylvania, USA
Mona Diab, Columbia University, USA
Bill Dolan, Microsoft Corp., USA
Christiane Fellbaum, Princeton University, USA
Dan Flickinger, Stanford University, USA
Terry Langendoen, University of Arizona, USA
Eric Nyberg, Carnegie-Mellon University, USA
Rebecca Passonneau, Columbia University, USA
Massimo Poesio, University of Trento, Italy
Sameer Pradhan, BBN Technologies, USA
James Pustejovsky, Brandeis University, USA
Owen Rambow, Columbia University, USA
Manfred Stede, Universität Potsdam, Germany

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:02:20 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:02:20 +0100
Subject: Appel: 1CfP 5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon, LREC 2012 Workshop
Message-ID:
Date: Mon, 19 Dec 2011 17:39:57 +0100
From: ELRA ELDA
Information Message-ID: <4EEF68DD.20002 at elda.org> [Apologies for multiple postings] Call for Papers: 5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon at LREC 2012, Istanbul, May 27th, 2012 Abstracts are invited for a full day workshop on the interplay between sign language corpora and lexicons, to take place preceding/following the 2012 LREC conference. Recent technological developments allow sign language researchers to create relatively large video corpora of sign language use that were unimaginable ten years ago. Several national projects are currently underway, and more are planned. This workshop aims to share experiences from current and past efforts. What are the problems that were encountered and the solutions created, what are the linguistic decisions taken? The special focus of this workshop is on the relation of corpus and lexicon work. For some sign languages, a corpus was built after a dictionary had been compiled, while in other cases lexicographic work and corpus annotation are done hand in hand. In some current projects, a corpus is collected without a dictionary being available, and the creation of such is only planned once the corpus is fully annotated. What are the implications of the different approaches, and what tool support is necessary for the different situations and phases? We invite abstracts for 20-minute oral/signed presentations or posters (with or without demonstrations) on the following topics: . Experiences in building sign language corpora and lexical resources . Proposals for standards for linguistic annotation . Experiences from linguistic research using corpora in combination with lexical resources . Use of (parallel) corpora and lexicons in translation studies . Tool development . Lexicon construction from corpora . Linking corpora and lexicons . 
Integrated presentation of corpus and dictionary contents Papers of both oral/signed presentations and posters (4-8 pages) of this workshop will be published as workshop proceedings that are included in the conference package. Please submit your abstract (max. 1000 words) through the LREC START system at https://www.softconf.com/lrec2012/SignLanguage2012/ not later than Feb 6th. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. *Organising Committee* Onno Crasborn, Radboud U Nijmegen, NL Eleni Efthimiou, ILSP Athens, GR Evita Fotinea, ILSP Athens, GR Thomas Hanke, Hamburg U, DE Jette Kristoffersen, UCC Copenhagen, DK Johanna Mesch, Stockholm U, SE Programme Committee (tbc) Richard Bowden, Guildford GB Penny Boyes Braem, Zürich CH Annelies Braffort, Orsay FR Christophe Collet, Toulouse FR Kearsy Cormier, London GB John Glauert, Norwich GB Alexis Heloir, Saarbrücken DE Jens Heßmann, Magdeburg DE Trevor Johnston, Sydney AU Reiner Konrad, Hamburg DE Lorraine Leeson, Dublin IE Petros Maragos, Athens GR Carol Neidle, Boston US Christian Rathmann, Hamburg DE Adam Schembri, Melbourne AU Meike Vaupel, Zwickau DE Rosalee Wolfe, Chicago US *Interpreting* With financial support from the LREC conference and other institutions, the oral/signed presentations will be interpreted into International Sign. The International Sign interpreters will also be around for the poster sessions to help out where necessary. Deaf participants wanting to bring interpreters for their national sign languages are kindly asked to contact us so that we can plan accordingly. 
*Important Dates*
Deadline for abstracts: February 6th, 2012
Notification of acceptance: February 29th, 2012
Submission of paper (for both oral/signed presentations and posters): March 19th, 2012
Submission of slides for interpreters' preparation (oral/signed presentations only): May 22nd, 2012
Workshop: May 27th, 2012

LREC conference:
May 23rd-May 25th (main conference)
May 21st & May 22nd (pre-conf workshops)
May 26th & May 27th (post-conf workshops)

*More details*
In due time, we will update the workshop website at http://www.sign-lang.unihamburg.de/lrec2012/ with travel advice etc. For further information, please contact the organising committee by email to lrec2012 (at) sign-lang.uni-hamburg.de

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:03:56 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:03:56 +0100
Subject: Appel: 2nd CfP Workshop on Computational Models of Narrative, LREC 2012 Workshop
Message-ID:
Date: Mon, 19 Dec 2011 17:43:47 +0100
From: ELRA ELDA Information
Message-ID: <4EEF69C3.1030506 at elda.org>
X-url: http://narrative.csail.mit.edu/ws12

[Apologies for multiple postings]

2012 Workshop on Computational Models of Narrative
May 26-27, 2012
Lütfi Kirdar Istanbul Exhibition and Congress Centre
Istanbul, Turkey
to be co-located with LREC 2012, the Language Resources and Evaluation Conference

Second CALL FOR PAPERS
Paper submission deadline: February 24, 2012

*Workshop Aims*
Narratives are
ubiquitous in human experience. We use them to communicate, convince, explain, and entertain. As far as we know, every society in the world has narratives, which suggests they are rooted in our psychology and serve an important cognitive function. It is becoming increasingly clear that, to truly understand and explain human intelligence, beliefs, and behaviors, we will have to understand why narrative is universal and explain (or explain away) the function it serves. The aim of this workshop (and its predecessors) is to address key questions about narrative that advance our fundamental understanding of it and our ability to model it computationally.

*Special Focus: Shared Resources*
In addition to fundamental questions, the field has yet to address key needs with regard to shared resources and corpora that could smooth and hasten the way forward. The vast majority of work on narrative uses fewer than four stories in its experiments and rarely re-uses narratives from previous studies. Because NLP technology cannot yet take us all the way to highly accurate formal representations of language semantics, this implies significant amounts of repeated annotation work. The way forward could be catalyzed by a carefully constructed set of shared resources.

This meeting will be an appropriate venue for papers addressing fundamental topics and questions regarding narrative. Moreover, the meeting will have a special focus on the identification, collection, and construction of shared resources and corpora that facilitate the computational modeling of narrative. Papers should focus on issues fundamental to computational modeling and scientific understanding, or issues related to building shared resources to advance the field. Discussing technological applications or motivations is not discouraged, but is not required.

*Illustrative Topics and Questions*
* What kinds of shared resources are required for the computational study of narrative?
* What content and modalities should be put in a "Story Bank"? What formal representations should be used? * What shared resources are available, or how can already-extant resources be adapted to common needs? * What makes narrative different from a list of events or facts? What is special that makes something a narrative? * What are the details of the relationship between narrative and common sense? * How are narratives indexed and retrieved? Is there a "universal" scheme for encoding episodes? * What impact do the purpose, function, and genre of a narrative have on its form and content? * What comprises the set of possible narrative arcs? Is there such a set? How many possible story lines are there? * Are there systematic differences in the formal properties of narratives from different cultures? * What are appropriate representations for narrative? What representations underlie the extraction of narrative schemas? * How should we evaluate computational models of narrative? *Organizing Committee* Mark A. 
Finlayson, Massachusetts Institute of Technology, USA
Pablo Gervás, Universidad Complutense de Madrid, Spain
Deniz Yuret, Koc University, Turkey
Floris Bex, University of Dundee, Scotland

*Sponsors*
A number of travel grants will be available to workshop authors via our sponsors:
* ONR Global
* Office of Naval Research
* Defense Advanced Research Projects Agency

*Contact*
narrative-ws12 at csail.mit.edu
http://narrative.csail.mit.edu/ws12

Note: Workshop dates have changed slightly since the first call

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:05:48 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:05:48 +0100
Subject: Appel: Workshop on Semantic Relations-II. Enhancing Resources and Applications, LREC 2012 Workshop
Message-ID:
Date: Mon, 19 Dec 2011 15:45:20 +0100
From: Helene Mazo
Message-ID: <4EEF4E00.5070609 at elda.org>
X-url: http://www.racai.ro/SemRel2012/

[Apologies for multiple postings]

Call for papers
Workshop on “Semantic Relations-II. Enhancing Resources and Applications”
http://www.racai.ro/SemRel2012/
22nd May, Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey
Held in conjunction with LREC2012

*Workshop description*
Building on the interest in semantic relations shown by the participants in the first workshop, Semantic Relations. Theory and Applications (held in conjunction with LREC2010), we are organizing a second edition, Semantic Relations-II.
Enhancing Resources and Applications, in line with current interests shown by various journals, conferences, international projects, the recent Multilingual European Technological Alliance and the synergic projects MetaNet, MetaNet4u (supporting this workshop), MetaNord and CESAR.

Lexical-semantic relations are organizing principles of the lexicon. Linguists have been more concerned with the definition, characterization and classification of semantic relations, with the focus on their paradigmatic aspects. Corpus linguistics offers the material and tools for grounding theories of semantic relations in the empirical analysis of usage and phraseology, and for investigating the syntagmatic aspects of semantic relations, which are exploited for various tasks in computational linguistics, such as Information Retrieval, Information Extraction, Question Answering, summarization, text generation and others. Recently, semantic relations within compounds, between simplex words and compounds, and between stems and derived words have been the subject of in-depth analyses because of their importance for natural language understanding, rephrasing and generation.

In this workshop we will highlight the interrelation between theoretical and practical aspects in the study of semantic relations, a theme with important benefits for both theoretical and computational linguistics. The intended audience for our workshop comprises both linguists and computer scientists interested in theoretical and practical aspects of semantic relations. The aim is to highlight the various ways in which lexical-semantic relations can enhance language resources and the results of various applications using them.
*Topics*
Topics of the workshop will include, but are not limited to:
* Co-occurrence and semantic relations
* Semantic resources for discourse analysis
* Extraction of semantic relations from various sources
* Exploitation of semantic relations in NLP applications
* Knowledge representation and semantic relations
* Lexical semantics and word sense
* Semantic relations and word formation (compounding and derivation)
* Semantic relations and language learning and acquisition
* Semantic relations in Natural Language Generation applications
* Semantic relations and terminology
* WordNet and WordNet-like resources

*Invited speaker*
Our invited speaker, Professor Patrick Hanks, will deliver a presentation on the Theory of Norms and Exploitations with the title Mapping Semantic Relations onto Pattern of Word Use using Corpus Evidence.

*Submission information*
Papers will be submitted to the workshop via the START LREC Conference Manager, under https://www.softconf.com/lrec2012/SemRel2012/. Authors should submit a PDF file of no more than 10 pages, following the LREC conference formatting details. Papers will be blind reviewed by three members of the Program Committee, so the authors' names and affiliations should not appear in the paper. Accepted papers will be published in the workshop Proceedings.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LREMap-2012.
*Important dates* Paper submission: 2nd March 2012 Notification of acceptance: 23rd March 2012 Camera-ready papers: 30th March 2012 Workshop: 22nd May 2012 *Program Committee* Eduard Barbu, CIMEC (Italy) Antonio Branco, FCL (Portugal) Elena Cabrio, LIMSI (France) Corina Forascu, UAIC (Romania) Nuria Gala, LIF-CNRS (France) Patrick Hanks, UWE (UK) Amac Herdagdelen, New England Complex Systems Institute (UK) Diana Inkpen, University of Ottawa (Canada) Radu Ion, RACAI (Romania) Elisabetta Jezek , PAVIA (Italy) Svetla Koeva, Institute for Bulgarian Language (Bulgaria) Claudia Kunze, Qualisys GmbH (Germany) Gerhard Kremer, Institut fuer Computerlinguistik, Universitaet Heidelberg (Germany) Hristina Kukova, Institute for Bulgarian Language (Bulgaria) Svetlozara Leseva, Institute for Bulgarian Language (Bulgaria) Bernardo Magnini, FBK-IRST (Italy) Emanuela Pianta, FBK-IRST (Italy) Reinhard Rapp, University of Leeds (UK) Didier Schwab, Laboratoire d'Informatique de Grenoble (France) Carlo Strapparava, FBK-IRST (Italy) Sara Tonelli, FBK-IRST (Italy) Dan Tufis, RACAI (Romania) Michael Zock, CNRS&LIF (France) *Contact person* Please email vergi AT racai DOT ro if you have any questions regarding this workshop. 
*Organizing Committee*
Verginica Barbu Mititelu
Romanian Academy Research Institute for Artificial Intelligence
E-mail: verginicabm at gmail.com

Octavian Popescu
Human Language Technology Group, Fondazione Bruno Kessler
E-mail: popescu at fbk.eu

Viktor Pekar
Dictionaries Dept, Academic Division, Oxford University Press
E-mail: v.pekar at gmail.com

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:07:40 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:07:40 +0100
Subject: Conf: Colloque SHESL / HTL 2012, 26-28 janvier 2012, Paris
Message-ID:
Date: Tue, 20 Dec 2011 11:36:54 +0100
From: Christian Puech
Message-Id:
X-url: http://www.shesl.org/

Dear colleagues,

The annual conference of the Société d'Histoire et d'Epistémologie des Sciences du Langage, organised in collaboration with UMR 7597 "Histoire des théories linguistiques", will take place on 26 (am), 27 and 28 January 2012:

"Linguistiques d'intervention - des usages socio-politiques des savoirs sur le langage et les langues" (Interventionist linguistics: on the socio-political uses of knowledge about language and languages)

Pôles des langues et civilisations (Auditorium BULAC INALCO) - 65 rue des Grands Moulins - 75013 - Paris

Programme:
Contact: shesl at neuf.fr
Website: http://www.shesl.org/

Christian PUECH
Université Sorbonne Nouvelle Paris III
President of the SHESL
Centre de Linguistique Française. UFR LLD. Centre Censier. 13, rue de Santeuil. 75005 - Paris.
christian.puech at univ-paris3.fr
UMR CNRS 7597 Paris 7/Paris 3: "Histoire des théories linguistiques"

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:09:16 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:09:16 +0100
Subject: Livre: Les enonces averbaux autonomes entre grammaire et discours
Message-ID:
Date: Tue, 20 Dec 2011 14:41:18 +0100
From: "Annick Deniel"
Message-ID: <31A343C6D09247D1B86593C2A51F1573 at DELLAnnick>

PRESS RELEASE

We are pleased to announce the publication of the following book:

Les énoncés averbaux autonomes entre grammaire et discours

by Florence Lefeuvre and Irmtraud Behr, respectively Maître de Conférences accredited to supervise research (habilitée à diriger les recherches) and Professor at the Université de la Sorbonne Nouvelle. They belong to two research teams at the Sorbonne Nouvelle, EA 2290 in French linguistics and EA 4223 in German linguistics. Both have written numerous books and articles on verbless utterances.

This collection questions the grammars of written and spoken language about how they conceive of autonomous syntactic units, whether these are called sentences, utterances or clauses. Besides syntactic units built on a finite verb, do they accept other types of units, built instead on a verbless segment such as Incroyable ?
The aim of this book is to examine the link between the grammatical framework adopted and the analysis in discourse of these autonomous verbless units. Various types of discourse are examined: literary discourse, press discourse, and also spoken discourse, which is the subject of the most recent research. The book is intended for researchers, teachers and students, as well as anyone who wishes to deepen their knowledge of the verbless sentence considered in discourse.

Contents: Jacqueline Guillemin-Flescher : L'énoncé averbal : repérage et subjectivité / Zsuzsanna Gécseg : Etre ou ne pas être : prédication averbale et localisation spatio-temporelle en hongrois / Frédéric Laurens : Analyse des énoncés averbaux prédicatifs autonomes / Henri-José Deulofeu : Existe-t-il un "impératif averbal" en français ? / Gilles Corminboeuf : Diptyques introduits par des énonciations nominales / Claire Blanche-Benveniste : Enoncés sans verbe en français parlé : un modèle à deux composantes / Mustapha Krazem : Zidane, qui passe à Sganarelle, qui lance à Platini, qui n'a pas vu Géronte… Ou comment des noms propres deviennent phrase averbales / Catherine Chauvin : Existe-t-il des constructions averbales ? Le cas des énoncés évaluatifs avec adjectif de type "Great !" en anglais / Françoise Mignon : Le statut prédicatif du marqueur de négation non en discours / Florence Lefeuvre : Bon à l'oral : une unité averbale autonome ?
/ Yana Grinshpun : Phrase averbale et presse écrite : le cas des constructions en [UN + N + expansion] / Sophie-Anne Brault-Scappini, Sandra Teston-Bonnard : Les fonctionnements du GP à partir de là : entre grammaticalisation et pragmaticalisation / Noalig Tanguy : Les segments averbaux comme unités syntaxiques à l'oral / Irmtraud Behr : Etude d'énoncé averbal à un terme : entre grammaire et discours / Angelika Redder : Enoncés averbaux en série – des segments narratifs spécifiques

Details: 276 pages, 21 € - ISBN 978-2-7080-1320-9

We are confident that this book will interest your readers, and we would be grateful if you could announce it in the bibliography pages of your magazine. If you wish to publish a more in-depth review, we will gladly send you a press copy of the book (adeniel at editionstechnip.com). We simply ask that you remember to send us proof of publication, which is essential for our press file. Thank you for the welcome you will give this title.
Best regards,
Annick DENIEL
Head of Communications ~ adeniel at editionstechnip.com
Direct line: 01 45 78 33 87
Editions OPHRYS
25, rue Ginoux • 75015 Paris
Tel: 33 (0)1 45 78 33 80
Fax: 33 (0)1 45 75 37 11
Website: www.ophrys.fr

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:11:03 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:11:03 +0100
Subject: Appel: Traitement automatique des langues naturelles, analyses et applications, Congres annuel de l'ACFAS
Message-ID:
Date: Tue, 20 Dec 2011 09:28:57 -0500
From: Adel Jebali
Message-ID:
X-url: http://www.acfas.net/membre/compte.php

CALL

Colloquium: natural language processing, analyses and applications
Annual congress of the ACFAS
Montréal, Canada
Friday 11 May 2012

IMPORTANT DATES
Deadline for abstract submission: 5 February 2012
Notification to authors: 10 February 2012
Final programme sent to the ACFAS: 15 February 2012
Colloquium: Friday 11 May 2012
Submission of the article for publication in the proceedings: to be determined.

DURATION
Talks will last 20 minutes, followed by 10 minutes for questions.

TALKS
Talks may address any topic in natural language processing and may concern one or more languages.
Topics that may be addressed fall within, but are not limited to, the following areas:
Foundations
- Phonetics and phonology
- Morphology
- Syntax
- Semantics
- Lexicon
- Lexicometry
- Classification, etc.
Applications
- Computer-assisted teaching
- Information extraction from the Web
- Machine translation
- Text analysis and generation
- Automatic summarization
- Human-machine dialogue, etc.

SUBMISSION GUIDELINES
- Abstracts must not exceed 1500 characters (including spaces). Any characters beyond this limit cannot appear in the program.
- Proposals must be sent as an attachment (doc, rtf or pdf file), with no mention of the author's name or affiliation, to the following address: ajebali at alcor.concordia.ca
- In the subject line of your message, please write « Proposition ACFAS 2012 ».
- Please include the following information in the body of your message: Author's name. Title of the talk. Affiliation.
- Authors whose proposal is accepted are required to open an account and register for the ACFAS annual congress at the following Web address: http://www.acfas.net/membre/compte.php.
- Authors of accepted talks will be invited, in a second call, to write a longer article to be published in a collective volume.
ORGANIZERS
Adel Jebali, Université Concordia: ajebali at alcor.concordia.ca
Ismaïl Biskri, Université du Québec à Trois-Rivières: ismail.biskri at uqtr.ca

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:12:37 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:12:37 +0100
Subject: Job: Poste de PR section 27 en Technologie du Langage Humain, DLU/GREYC - Universite de Caen Basse-Normandie
Message-ID:

Date: Tue, 20 Dec 2011 20:09:24 +0100
From: Gaël Dias
Message-ID: <4EF0DD64.3080905 at unicaen.fr>
X-url: http://www.greyc.fr/dlu

====================================================================
Full Professor (PR) position, section 27, in Human Language Technology
DLU/GREYC - Université de Caen Basse-Normandie
====================================================================
The Université de Caen Basse-Normandie is recruiting a Full Professor (Professeur des Universités), section 27, in the field of Human Language Technology (Challenge 4 of the European Union's "Programme Information and Communication Technologies") for the start of the 2012/2013 academic year. The candidate must demonstrate strong, high-level expertise in one or more of the following areas, among others: Information Retrieval, Information Extraction, Natural Language Processing, Machine Translation, Text Mining, Knowledge Extraction, Text Summarization, and Human-Computer Interaction.
The candidate will join the Document, Langues, Usages team (DLU - http://www.greyc.fr/dlu) within the laboratory Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen (GREYC - UMR 6072 CNRS UCBN/ENSICAEN - http://www.greyc.fr), rated level A. The candidate will be expected to energize the DLU team and to contribute to its internationalization and excellence. The team consists of 13 permanent Associate Professors (Maîtres de Conférences) or Research Scientists (Chargés de Recherche) and one recently recruited Full Professor. DLU builds on nationally and internationally recognized achievements in Digital Documents, Document Engineering, Natural Language Processing, Textual Knowledge Extraction, and Human-Computer Interaction. At DLU, the candidate will find a young and dynamic environment whose research is oriented around promising themes such as the personalization, collaboration, accessibility, adaptability and ubiquity of human language applications in heterogeneous multilingual digital environments such as the Web or social networks.
This position also aims to strengthen Human Language Technologies in the teaching program of the Computer Science Department of the UFR Sciences at the Université de Caen Basse-Normandie. In this context, the candidate will be expected to contribute actively to the success of the new Master Image et Texte (whose first edition starts at the beginning of the 2012/2013 academic year), in which many modules will need to be taught. The candidate's experience will also be valuable across the Computer Science Department, in particular for the upcoming direction of the Master Document.
Candidates interested in a position with major responsibilities in Human Language Technology are invited to contact the heads of the DLU team, Gaël Dias (gael.dias at unicaen.fr) and Nadine Lucas (nadine.lucas at unicaen.fr).
For any further information on the teaching side, candidates should contact the Head of the Computer Science Department, Bruno Crémilleux (bruno.cremilleux at unicaen.fr). Regarding research, candidates may also contact the Directors of GREYC, Mohamed M’Saad (mohammed.msaad at ensicaen.fr) and Frédéric Jurie (frederic.jurie at unicaen.fr).

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:14:44 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:14:44 +0100
Subject: Appel: 2nd CFP, EACL 2012, Joint Workshop on Information Retrieval and Hybrid Machine Translation
Message-ID:

Date: Wed, 21 Dec 2011 08:48:03 +0100
From: Marta Ruiz
Message-ID:
X-url: http://www-lium.univ-lemans.fr/esirmt-hytra

Joint Workshop on EXPLOITING SYNERGIES BETWEEN INFORMATION RETRIEVAL AND MACHINE TRANSLATION (ESIRMT) and HYBRID APPROACHES TO MACHINE TRANSLATION (HyTra)
Co-located with EACL 2012 (http://eacl2012.org/)
Avignon, France
April 23 and 24, 2012
Deadline for paper submissions: February 2, 2012
http://www-lium.univ-lemans.fr/esirmt-hytra
========================================================================
This two-day workshop addresses two specific but related research problems in computational linguistics. The ESIRMT event (1st day) aims at reducing the gap, both theoretical and practical, between information retrieval and machine translation research and applications.
Although the two fields have already been contributing to each other instrumentally, much work remains to be done to frame them solidly within a common ground of knowledge, from both the procedural and the paradigmatic perspectives. The HyTra event (2nd day) aims at sharing ideas among researchers who develop and apply statistical, example-based, or rule-based machine translation systems and who wish to enhance their systems with elements from the other approaches. The joint workshop will give participants the opportunity to discuss research related to technology integration and system combination strategies at both the general level of cross-language information access and the specific level of machine translation technologies.
Contributions are to be organized into two tracks, corresponding to ESIRMT and HyTra, respectively. Topics of interest include, but are not limited to:
ESIRMT track:
* machine translation and information retrieval hybridization
* applications of MT and IR hybrid systems
* IR techniques integrated in MT systems
* MT techniques integrated in IR systems
* any kind of innovative approach exploiting synergies between MT and IR
* machine learning techniques for ranking
HyTra track:
* ways and techniques of MT hybridization
* architectures for the rapid development of hybrid MT systems
* hybrid systems dealing with underresourced languages and/or with morphologically rich languages
* using linguistic information (morphology, syntax, semantics) to enhance statistical MT
* bootstrapping rule-based systems from corpora
* hybrid methods in spoken language translation
* extraction of dictionaries from parallel and comparable corpora
* machine learning techniques for hybrid MT
* heuristics for limiting the search space in hybrid MT
* alternative methods for the fair evaluation of the output of different types of MT systems
* system combination approaches
* open source tools and free language resources for hybrid
MT

SUBMISSION INSTRUCTIONS
Authors are invited to submit papers on original and previously unpublished work. Formatting should follow the EACL 2012 specifications, using the LaTeX or MS-Word style files; see http://eacl2012.org/information-for-authors/index.html (paper length: up to 9 pages plus an extra page for references). Reviewing of papers will be double-blind, so submissions should not reveal the authors' identity. Submission is electronic, in PDF format, using the START submission system at https://www.softconf.com/eacl2012/ESIRMT-HyTra/
Double submission policy: Parallel submissions to other meetings or publications are possible, but the workshop contact person (see below) must be notified immediately.
Authors of accepted papers will be invited to present their research at the workshop. The papers will be published in the electronic workshop proceedings. These will be part of the EACL proceedings, published by ACL. They will also be made available online via the ACL Anthology.

IMPORTANT DATES
February 2, 2012: Paper submissions due
February 24, 2012: Notification of acceptance
March 9, 2012: Camera ready papers due
April 23-24, 2012: Workshop in Avignon

ORGANIZERS
Contact person: Marta R. Costa-jussà (e-mail: marta.ruiz at barcelonamedia.org)
ESIRMT: Marta R. Costa-jussà (Barcelona Media Innovation Center), Patrik Lambert (University of Le Mans), Rafael E. Banchs (Institute for Infocomm Research)
HyTra: Reinhard Rapp (Universities of Mainz and Leeds), Bogdan Babych (University of Leeds), Kurt Eberle (Lingenio GmbH), Tony Hartley (Toyohashi University of Technology and University of Leeds), Serge Sharoff (University of Leeds), Martin Thomas (University of Leeds)

INVITED SPEAKERS
Philipp Koehn (University of Edinburgh)
Christof Monz (University of Amsterdam)

PROGRAMME COMMITTEE
Jordi Atserias, Yahoo!
Research, Barcelona, Spain
Bogdan Babych, University of Leeds, UK
Sivaji Bandyopadhyay, Jadavpur University, Kolkata, India
Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain
Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland
Chris Callison-Burch, Johns Hopkins University, Baltimore, USA
Michael Carl, Copenhagen Business School, Denmark
Oliver Culo, ICSI, University of California, Berkeley, USA
Kurt Eberle, Lingenio GmbH, Heidelberg, Germany
Andreas Eisele, Directorate-General for Translation, European Commission, Luxembourg
Marcello Federico, Fondazione Bruno Kessler, Trento, Italy
José A. R. Fonollosa, Universitat Politècnica de Catalunya, Barcelona, Spain
Mikel Forcada, University of Alicante, Spain
Alexander Fraser, Institute for Natural Language Processing (IMS), Stuttgart, Germany
Johanna Geiß, Lingenio GmbH, Heidelberg, Germany
Mireia Ginesti-Rosell, Lingenio GmbH, Heidelberg, Germany
Silvia Hansen-Schirra, FTSK, University of Mainz, Germany
Tony Hartley, Toyohashi University of Technology, Japan, and University of Leeds, England
Gareth Jones, Dublin City University, Ireland
Min-Yen Kan, National University of Singapore
Udo Kruschwitz, University of Essex, UK
Yanjun Ma, Baidu Inc., Beijing, China
José B.
Mariño, Universitat Politècnica de Catalunya, Spain
Maite Melero, Barcelona Media Innovation Center, Barcelona, Spain
Haizhou Li, Institute for Infocomm Research, Singapore
Reinhard Rapp, University of Mainz, Germany, and University of Leeds, UK
Paul Schmidt, Institut for Applied Information Science, Saarbrücken, Germany
Uta Seewald-Heeg, Anhalt University of Applied Sciences, Köthen, Germany
Nasredine Semmar, CEA LIST, France
Serge Sharoff, University of Leeds, UK
Wade Shen, Massachusetts Institute of Technology, Cambridge, USA
Fabrizio Silvestri, Istituto di Scienza e Tecnologie dell'Informazione, Pisa, Italy
Harold Somers, CNGL, Dublin City University, Ireland
Anders Søgaard, University of Copenhagen, Denmark
Martin Thomas, University of Leeds, UK
Jörg Tiedemann, University of Uppsala, Sweden
Zygmunt Vetulani, University of Poznan, Poland

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:16:55 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:16:55 +0100
Subject: Appel: LREC 2012 Workshop on Language Resources for Public Security
Message-ID:

Date: Wed, 21 Dec 2011 10:32:56 +0000
From: Frédérique Segond
Message-ID:

LREC 2012 Workshop on Language Resources for Public Security Applications
CALL FOR PAPERS
27 May 2012, Istanbul (half-day)
Conference venue, the Lütfi Kirdar Istanbul Exhibition and Congress Centre
Deadline: March 5, 2012

Workshop description
Public security in Europe and in the World is facing several
threats. These include threats connected with intentional human activities such as terrorism, spontaneous risks related to the uncontrolled behavior of individuals involved in mass events, natural disasters, etc. Combating these dangers creates challenges for information and communication technologies, which in many cases directly involve various forms of natural language processing. Gathering, maintaining and processing language resources specific to security applications is of primary importance for the language technologies concerned. In some cases it appears useful to investigate and use sensitive linguistic data, which raises technological and legal problems connected with privacy, ownership, civic rights protection, etc.
The workshop is intended to serve as a thematic discussion forum open to:
- language resources suppliers,
- researchers and language engineers interested in the development of systems for security applications involving language technologies,
- potential/actual users of such systems,
- people concerned with legal aspects of gathering, maintenance and applications of language resources for public security purposes.
The creation of long-term cooperation projects involving the workshop participants would be a desired side effect of the workshop.

Areas of Interest
The workshop will focus on knowledge processing applications serving public security. Particular emphasis will be given to the crucial role of language resources and related technologies.
Contributions are invited on, but not limited to, the following topics:
- security specific corpora,
- security specific terminology,
- language models for specific sub-languages and language registers important for security research,
- language technology based tools to enhance public security,
- linguistic tools for risk assessment,
- controlled languages for public security applications,
- AI and NLP decision supporting systems,
- sharing and processing sensitive linguistic data,
- legal aspects of security-oriented natural language processing and engineering,
- access to sensitive data,
- IPR issues,
- protection and use of sensitive source data,
- international collaboration issues,
- issues related to national and international funding.

Submissions
We invite contributions on one or more of the topics listed above, as well as on any other topic related to the workshop's area of interest. We expect papers presenting novel ideas or completed or advanced projects, or discussing hot problems of the area. The paper selection procedure will be based on peer review of a 2000 word abstract and will be basically the same as for the general LREC. The form of presentation will be decided later by the Program Committee, depending on the number of submissions.
Abstracts and full papers should be submitted electronically from the conference web site through the START Conference Manager. Templates (LaTeX and Word) for the preparation of full papers will be accessible from the conference web site. The only accepted format for submitted papers is Adobe PDF.
When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e., also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.
The full versions of the accepted papers will be published in the Workshop Proceedings.

Important Dates
Abstract submission deadline: March 5, 2012
Acceptance notification: March 18, 2012
Final version deadline: March 26, 2012
LREC (Conference, Tutorials and Workshops): May 21-27, 2012
LRPS Workshop: May 27, afternoon

(Initial) Program Committee:
Edouard Geoffrois (co-chair) (Direction Générale de l'Armement, Paris, France)
Laura Chaubard (Direction Générale de l'Armement, Paris, France)
Jakub Gorczyński (Polish National Police, Poznań, Poland)
Fryni Kakoyianni-Doa (University of Cyprus, Nicosia, Cyprus)
Nasrullah Memon (University of Southern Denmark, Odense, Denmark)
Mario Montoleone (Salerno University, Italy)
Karel Pala (Masaryk University, Brno, Czech Republic)
Frederique Segond (Viseo, Grenoble, France)
Tadeusz Tomaszewski (University of Warsaw, Poland)
Zygmunt Vetulani (co-chair) (Adam Mickiewicz University in Poznań, Poland)

Contact: vetulani at amu.edu.pl

Frédérique SEGOND
34, avenue Félix Viallet - 38000 GRENOBLE
Tél. +33 (0)4 76 53 72 72
Mob.
+33 (0)6 89 44 60 88
fsegond at viseo.net
Head of Innovation, Research & Development Manager
http://wiki.objetdirect.com/expertise/Fr%C3%A9d%C3%A9rique_Segond
Objet Direct is partner of the GALATEAS and LEILAS EU Projects http://www.galateas.eu/index.html http://www.leilas.eu/

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:18:38 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:18:38 +0100
Subject: Appel: Workshop on Open Data in a Semantic Web Perspective
Message-ID:

Date: Wed, 21 Dec 2011 15:25:01 -0500
From: eric charton
Message-ID:
X-url: http://wodswp2012.wikimeta.org/

First Call for Papers
------------------------------------------------------------------------
Workshop on Open Data in a Semantic Web Perspective
http://wodswp2012.wikimeta.org/
May 27th, 2012
York University, Toronto, Ontario, Canada
(co-located with Canadian AI 2012)
------------------------------------------------------------------------
WORKSHOP DESCRIPTION
Open Data is the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other mechanisms of control. Open Data has recently been gaining popularity with the rise of the W3C's prospective vision of the Semantic Web. The LinkedData network, promoted by the W3C, is the most visible part of what an open Web of data, as a complement to a Web of documents, could be.
Vast repositories of normalized data like those from DBPedia, PubMed, GeneID, DrugBank, OECD or UNESCO are now stimulating an intensive new field of research on the possible uses of those data in AI and knowledge management applications. Major companies involved in data exploration and information retrieval, such as Google with its Public Data Explorer, are investing in this new field and studying new interfaces and algorithms to bridge the gap between data and users.
Governments are increasingly recognizing the benefits of making their data open and reusable. The launch of open-data government initiatives such as Data.gov in the USA, or various initiatives in Europe like the Open Data Challenge or data.gov.uk, is a strong encouragement to the development of the open data movement. The wide availability of Public Sector Information can be a key driver for developing content markets in North America, which can generate new businesses and jobs and provide more information to consumers and citizens. In Canada, the Open Data Pilot is part of the Government's commitment to open government, which is being pursued along three streams: open data, open information and open dialogue, and aims to drive innovation and economic opportunities for all Canadians.
We would like to invite researchers and actors of the open data movement to present their work at the workshop. The workshop aims to bring together data providers, initiative groups, technology providers, and researchers to facilitate the dissemination of ideas and methods and to promote new research perspectives related to Open Data and Semantic Web applications.

TOPICS
Topics of interest include, but are not limited to:
1. Open Data sets and catalogs (including Semantic Web, LinkedData, public data sets)
2. Open Data pilot initiatives (including governmental, non-profit, academic)
3. Relation between Open Data and Natural Language Processing
4. Architectures and systems using Open Data
5.
Tools and resources related to Open Data
6. Open Data construction tools
7. Format of Open Data (including RDF, OWL)
8. Application of Open Data in maps, encyclopedic knowledge, genomics, chemical compounds, mathematical and scientific formulae, medical data and practice, bioscience and biodiversity
9. Evaluation and metrics related to Open Data set quality, processing and software performance
10. Open Data and translation
11. Open Data visualization
Demos and papers related to working applications or systems under development are encouraged. We hope to cover three main perspectives: government and administrations (e.g., open ), industry (e.g., marketing, software applications and services), and academia (e.g., theoretical research related to Open Data production and uses).

SUBMISSION INFORMATION:
Two formats are proposed:
1. Full paper (up to 8 pages)
2. Demo paper (up to 4 pages)
Submissions of commercial applications and research related papers are strongly encouraged when they are strictly related to real open data (e.g., an innovative commercial software tool using public data sets).
Authors are invited to submit full papers electronically on the workshop website by February 22nd, 2012, in PDF, Postscript or MS-Word RTF. All papers must be written in English. For submission, please use our easychair site.
IMPORTANT DATES:
Paper submission: February 22, 2012
Notification of acceptance: March 30, 2012
Camera ready: April 27, 2012
Workshop: May 27, 2012

PROGRAM COMMITTEE
Christopher Baker, UNB, Saint John, Canada
Pierre-Antoine Champin, Université Claude Bernard Lyon 1, Lyon, France
Dragan Gasevic, Athabasca University, Athabasca, Canada
Volker Haarslev, Concordia University, Montréal, Canada
Anja Jentzsch, Freie Universität, Berlin, Germany
Birgitta König-Ries, University of Jena, Jena, Germany
Guy Lapalme, Université de Montréal, Montréal, Canada
Daniel Lemire, UQAM, Montréal, Canada
Roger Nkambou, UQAM, Montréal, Canada
Raphael Troncy, Eurécom, Sophia Antipolis, France
Petko Valtchev, UQAM, Montréal, Canada
René Witte, Semantic Software Lab - Concordia University, Montréal, Canada

WORKSHOP ORGANIZERS AND CONTACT
Eric Charton (École Polytechnique de Montréal)
Michel Gagnon (École Polytechnique de Montréal)
Marie-Jean Meurs (Concordia University)
Contact eric.charton at polymtl.ca

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:24:28 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:24:28 +0100
Subject: Stage: 3 offres, Ingenieur Developpement, GEOLSemantics
Message-ID:

Date: Thu, 22 Dec 2011 14:02:46 +0100
From: "Emmanuel DUPONT"
Message-ID: <00a101ccc0aa$0098d0d0$01ca7270$@geolsemantics.com>

--------------------------------
End-of-studies internship with a view to hiring: Pre-sales Development Engineer: Authentication
made strong by voice

The company:
GEOLSemantics is a French company based in Paris with about fifteen employees. Its mission is to offer software solutions that help detect and track hostile activities against states, companies and individuals. These solutions apply to information contained in various media: text, audio and video. They use sophisticated multilingual semantic analysis algorithms to give operational staff a synthetic view of the knowledge extracted from the various information sources. They also use cutting-edge speaker identification technologies for audio or video media.

Internship topic
GEOLSemantics is looking for a Master's-level (bac +5) intern in computer science to build a 'strong speaker authentication' prototype using our voice biometrics tools. As part of a small team, you will have the opportunity to thrive in the world of enhanced security and the fight against identity theft.

Internship content:
The aim of this internship is to build a prototype to support the sale of our voice biometrics solutions. The objective is to improve application access security through voice identification of the speaker. This strong authentication complements conventional methods based on a login/password.

Internship stages:
- Choice and set-up of the architecture,
- Design of scenarios,
- Development of the prototype infrastructure,
- Development of an Android application to make the prototype available on mobile,
- Testing and presentation for client pre-sales projects.
Your autonomy and your speaking and writing skills will quickly allow you to take responsibility for carrying out and fully following up your work, under the guidance of an experienced engineer.
Intern profile:
Required technical skills:
- Master 2 or engineering degree in computer science
- good knowledge of Java Web (basic HTML/CSS)
- knowledge of Android (or the ability to acquire it quickly)
- ability to grasp a new domain (voice biometrics)
- Proficiency in XML and RDF
Required qualities: Good writing skills. Strong functional curiosity, strong autonomy.
The internship is supervised by the pre-sales manager
- Duration: 6 months minimum
- Start date: as soon as possible
- Location: Paris 15ème
- Compensation: negotiable
Applications (CV, cover letter, photo) to be sent to: Emmanuel Dupont, Director of Operations, Société GEOLSemantics 32, rue Brancion 75015 Paris emmanuel.dupont at geolsemantics.com Website: under construction
----------------------------------------
End-of-studies internship with a view to hiring: Development Engineer, Linguistic rule authoring module

The company:
Based in Paris, GEOLSemantics is a young innovative company that develops and markets natural language processing software to meet strategic intelligence needs. Applications are numerous and mainly focused on the automatic detection of threats, rumors or opinions through the systematic analysis of multilingual documents published on the Web, on sites, blogs, forums or social networks.

Internship topic
GEOLSemantics is looking for a Master's-level (bac +5) intern in computer science to build its module for authoring high-level linguistic rules designed to extract structured knowledge expressed in an ontology. As part of a small team, you will have the opportunity to thrive in the world of Web 3.0 using J2EE technologies (Portlet under Liferay, Java).

Internship content:
Currently, writing linguistic rules requires in-depth expertise in our processing engine.
GEOLSemantics wants to provide its engineers and partners with an SDK that assists in writing linguistic rules. The goal of this tool is to increase in-house productivity and to make it easier for external contributors to get started with our software package. Starting from the requirements document, you will need to understand the functional need in order to write the functional specifications and quickly set up a first mock-up. After validation with end users, you will be responsible for developing and delivering the SDK. The breakdown of activities is as follows: 60% specification (understanding the need, the languages handled by the linguists, and the Web 3.0 domain); 40% development in Java (executable integrated into our semantic engine, and GUI). Your autonomy and your speaking and writing skills will quickly allow you to take responsibility for carrying out and fully following up your work, under the guidance of an experienced engineer.

Intern profile:
Required technical skills:
- Ability to grasp a new domain (the Semantic Web)
- Algorithmic foundations in automata
- Good Java skills
- Proficiency in XML and RDF
Required qualities: Good writing skills.
Strong functional curiosity. Language: English required.
The internship is supervised by the Director of Operations
- Duration: 6 months minimum
- Start date: as soon as possible
- Location: Paris 15ème
- Compensation: negotiable
Applications (CV, cover letter, photo) to be sent to: Emmanuel Dupont, Director of Operations, Société Cadège/GEOLSemantics 32, rue Brancion 75015 Paris emmanuel.dupont at geolsemantics.com Website: under construction
---------------------------------------------
End-of-studies internship with a view to hiring: Development Engineer, Clustering algorithm

The company:
Based in Paris, GEOLSemantics is a young innovative company that develops and markets natural language processing software to meet strategic intelligence needs. Applications are numerous and mainly focused on the automatic detection of threats, rumors or opinions through the systematic analysis of multilingual documents published on the Web, on sites, blogs, forums or social networks.

Internship topic
GEOLSemantics is looking for a Master's-level (bac +5) intern in computer science to build a clustering algorithm. As part of a small team, you will have the opportunity to thrive in the world of Web 3.0.

Internship content:
In order to evaluate the quality of tools that group information about people (such as 123people), the scientific community has organized several international competitions (the WEBS campaigns).
Given a set of texts about people who share a name (different people with the same name), a first phase groups together the documents that refer to the same person using clustering techniques. In a second phase, for each cluster, which should concern a single person (the namesakes being assumed to have been separated by the clustering), a number of facts about the person are extracted (date and place of birth, address, telephone number, degrees, ...). GEOLSemantics has developed automatic linguistic processing that identifies the significant elements in texts and extracts information about people. Building on the company's existing tools, the intern is asked to develop a clustering algorithm that relies on the results of the linguistic analysis of the texts describing the people, and then to extend the knowledge extraction rules so as to extract from each cluster the information required by the WEBS evaluation campaign. The results can be compared with those obtained by participants in previous campaigns. Since the language of the campaign is English, the intern is expected to have a good command of that language.

Your autonomy and your speaking and writing skills will quickly allow you to take responsibility for carrying out and fully following up your work, under the guidance of an experienced engineer.

Intern profile:
Required technical skills:
Requested profile:
- Master 2 (second-year Master's) or engineering degree in computer science, with knowledge of natural language processing.
- A taste for competition (the aim is to achieve better results than the others). The results may lead to a scientific publication.
- Ability to grasp a new field (the Semantic Web)
- Good Java skills
- Proficiency in XML and RDF
Required qualities: Good writing skills. Strong functional curiosity
Language: English required

The internship is supervised by the Scientific Director.
- Duration: 6 months minimum
- Start date: as soon as possible
- Location: Paris, 15th arrondissement
- Compensation: negotiable

Applications (CV, cover letter, photo) should be sent to:
Emmanuel Dupont, Director of Operations
Société Cadège/GEOLSemantics
32, rue Brancion
75015 Paris
emmanuel.dupont at geolsemantics.com
Website: under construction

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:26:33 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:26:33 +0100
Subject: Appel: ECAI 2012
Message-ID:

Date: Thu, 22 Dec 2011 15:44:47 +0100
From: ECAI Publicity
Message-ID: <4EF3425F.5090402 at lirmm.fr>
X-url: http://www2.lirmm.fr/ecai2012/

######################################################################
## ECAI-2012 ## Call for Papers ## ECAI-2012 ##
######################################################################

The Twentieth European Conference on Artificial Intelligence
27-31 August 2012
Montpellier, France
http://www2.lirmm.fr/ecai2012/

Deadline for submission of papers: 6 March 2012

######################################################################
## INTRODUCTION ##
###################################################################### ECAI is the biennial European Conference on Artificial Intelligence organized by the European Coordinating Committee for Artificial Intelligence (ECCAI); it is the premier forum for presenting Artificial Intelligence results in Europe. The twentieth conference in this series, ECAI 2012, will take place in Montpellier (France) August 27-31. ECAI 2012 will gather researchers and practitioners of Artificial Intelligence from all over the world; they will present the latest results and identify new trends and challenges in all subfields of Artificial Intelligence as well as demonstrate innovative applications and uses of advanced Artificial Intelligence technology. In addition to the main technical track, ECAI 2012 will feature keynote and invited talks, invited tutorials, an elaborate workshop program, a system demonstration track, a Turing centenary celebration, in which we will celebrate the 100th anniversary of the birth of Alan Turing and his influence on Artificial Intelligence, and a special session on Highlights of European AI, to celebrate the 20th ECAI conference and 25 years of AI Communications. ECAI 2012 will also feature the 6th STAIRS (STarting AI Researcher Symposium) and the 6th PAIS (Conference on Prestigious Applications of Intelligent Systems) as sub-conferences. ECAI-2012 will be held in the beautiful city of Montpellier, France. With excellent opportunities for sightseeing and gastronomy, Montpellier promises to be a wonderful venue for a memorable conference. ###################################################################### ## SUBMISSIONS ## ###################################################################### ECAI 2012 welcomes submissions of technical papers to the main track. Submitted papers should present original, high-quality and previously unpublished results on Artificial Intelligence. 
Submissions are welcome from all areas of AI; the following list of topics is indicative only.

* Agents & Multiagent Systems
* Case-Based Reasoning
* Cognitive Modeling & Interaction
* Constraints & Search
* Knowledge Representation & Reasoning
* Machine Learning & Data Mining
* Model-Based Reasoning
* Natural Language Processing
* Perception & Sensing
* Planning & Scheduling
* Robotics
* Uncertainty in AI
* Applications of AI

ECAI 2012 especially encourages submissions that cross discipline boundaries within Artificial Intelligence, and between Artificial Intelligence and other disciplines.

Both long (6-page) and short (2-page) papers can be submitted. Whereas long papers should report on substantial research results, short papers are intended for highly promising but possibly more preliminary work. Short papers will be presented in poster form. Rejected long papers will not be considered for the short paper track.

ECAI 2012 also welcomes submissions of proposals for workshops, descriptions of system demonstrations, papers on prestigious applications of AI for PAIS, and contributions of starting researchers in AI for STAIRS; cf. the separate calls for these tracks.

######################################################################
## IMPORTANT DATES ECAI Technical Program ##
######################################################################

Paper submission deadline: Tuesday, 6 March 2012
Author response period (indicative): 18-23 April 2012
Notification of acceptance/rejection: Monday, 21 May 2012
Camera-ready paper due: Sunday, 3 June 2012
Conference: 27-31 August 2012

######################################################################
## PAPER FORMATTING & SUBMISSION INFORMATION ##
######################################################################

Long papers must not exceed SIX (6) pages and short papers must not exceed TWO (2) pages in camera-ready format for ECAI. Over-length submissions will be rejected without review.
Papers for ECAI should be submitted using the ECAI formatting style; details of the style are available at:
http://people.cs.kuleuven.be/~luc.deraedt/ecai2012-style.zip
This file unpacks to a directory "ecai2012", and contains LaTeX and other style files. The file "ecai2012.tex" is an example paper in LaTeX format using the appropriate styles, and can be used as a template for ECAI submissions. A Microsoft Word template is also available.

Reviewing for ECAI 2012 will be blind: reviewers will not be presented with the identity of paper authors. To allow for blind review, author names in a submitted paper or poster should be replaced by the unique tracking number assigned by the conference website at the submission of an electronic abstract. Authors should avoid writing anything that makes their identity obvious in the text.

Submissions should be original, and in particular should not previously have been formally published. (Any publication venue with an ISBN or ISSN counts as a formal publication; this includes LNCS/LNAI volumes, for example.) Submissions should not be submitted elsewhere during the ECAI 2012 review phase.

The proceedings of the ECAI conference, together with those of the PAIS symposium, will be published and distributed by IOS Press as an online open access book. (The proceedings of some past ECAI conferences are freely available on the IOS website.) The authors will be responsible for producing camera-ready copies of papers, conforming to the ECAI formatting guidelines, for inclusion in the proceedings. At least one author of each accepted long or short paper is required to attend the conference to present the contribution.

Submission and review of papers for ECAI-2012 will be managed via the ConfMaster system. ConfMaster may be live for ECAI submissions before 31 January 2012. Prior to this date, the site will be used for configuration and testing.
All data uploaded or registered before this date will be deleted and will not be considered for the conference. Do NOT register or upload papers before 31 January 2012!

All submissions will be subject to peer review by the ECAI 2012 Programme Committee, and evaluated on the basis of: relevance; significance of contribution; technical quality; scholarship; and quality of presentation. The authors of the best papers of ECAI 2012 will be invited to submit an extended version of their paper to the Artificial Intelligence Journal.

The primary authors of submitted papers will be offered the opportunity to respond to the reviews for their papers before the final decision on acceptance or otherwise is made. The author feedback phase is currently planned for 18-23 April 2012.

######################################################################
## ORGANISATION ##
######################################################################

Conference Chair: Didier Dubois
ECAI Program Chair: Luc De Raedt
Organizing Committee Chair: Christian Bessiere
ECCAI Chair: Gerhard Brewka
Workshop Chairs: Jérôme Lang and Michèle Sebag
System-Demonstration Chairs: Patrick Doherty and Fredrik Heintz
STAIRS Program Chairs: Kristian Kersting and Marc Toussaint
PAIS Program Chairs: Paolo Frasconi and Peter Lucas
Turing Centenary Celebration Chair: Michael Wooldridge

######################################################################
## LOCAL ORGANIZING COMMITTEE ##
######################################################################

LIRMM supervisor: Christophe Dhenaut
Logistics: Elisabeth Greverie, Justine Landais, Eric Bourreau
Treasurer: Michel Leclere, Laetitia Megual
Technical support: Remi Coletta
Web site: Michel Liquiere
Sponsoring: Remi Coletta
Publicity: Marie-Laure Mugnier, Madalina Croitoru
Other members: Stefano Cerri, Abdel Gouaich, Joel Quinqueton
AFIA contacts: Patrick Albert, Yves Demazeau, Thomas Guyet, Eunika Mercier-Laurent, Jean-Denis Muller
###################################################################### ## AREA CHAIRS ## ###################################################################### Area chairs currently confirmed include: Michael Beetz [Germany] Ronen Brafman [Israel] Fabio Cozman [Brazil] James Cussens [UK] Thomas Eiter [Austria] Johannes Fuernkranz [Germany] Hector Geffner [Spain] Marco Gori [Italy] Udo Hahn [Germany] Malte Helmert [Switzerland] Manfred Jaeger [Denmark] Sarit Kraus [Israel] Andreas Krause [Switzerland] Jerome Lang [France] Pierre Marquis [France] John-Jules Meyer [The Netherlands] Bernhard Nebel [Germany] Ann Nowé [Belgium] Barry O'Sullivan [Ireland] Francesca Rossi [Italy] Michele Sebag [France] Carles Sierra [Spain] Steffen Staab [Germany] Peter Struss [Germany] Michael Thielscher [Australia] Hannu Toivonen [Finland] Antal van den Bosch [The Netherlands] Pascal Van Hentenryck [US] Michael Wooldridge [UK] Frank Wolter [UK] ###################################################################### ## KEYNOTE SPEAKERS ## ###################################################################### The ECAI-2012 programme will include plenary talks from internationally leading researchers in AI and cognate disciplines. 
Keynote speakers so far confirmed include:

* Wolfram Burgard [Freiburg, Germany]
* Adnan Darwiche [UCLA, US]
* Tom Mitchell [CMU, US]

######################################################################
ECAI 2012 http://www2.lirmm.fr/ecai2012/

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:29:30 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:29:30 +0100
Subject: Appel: IEEE 2nd DICTAP2011- Thailand
Message-ID:

Date: Thu, 22 Dec 2011 16:00:07 -0600
From: "The Second International Conference on Digital Information and Communication Technology and its Applications (DICTAP2012)"
Message-ID: <0b7b8911cb11c40c6e79c4404598207d at localhost.localdomain>
X-url: http://www.sdiwc.net/thi/

The Second International Conference on Digital Information and Communication Technology and its Applications (DICTAP2012)
University of the Thai Chamber of Commerce (UTCC), Bangkok, Thailand
May 16-18, 2012
Technically co-sponsored by IEEE Thailand Section
All papers will be submitted to IEEE for potential inclusion in IEEE Xplore
http://www.sdiwc.net/thi/
=======================================================================

The conference will be held at the University of the Thai Chamber of Commerce (UTCC), Bangkok, Thailand, on May 16-18, 2012. DICTAP2012 aims to enable researchers to build connections between different digital applications.
The conference welcomes papers on the following (but not limited to) research topics:

1) Communication Basic and Infrastructure
- Telecommunications
- Communication protocols
- Algorithms, Architecture, and Infrastructures
- Transmission Techniques
- Communication Systems

2) Networks Basic and Management
- Network Systems and Devices
- Wireless Networks, Ad-Hoc and Sensor Networks
- Network Modeling and Simulation
- Network Management Techniques
- Networks Security, Encryption and Cryptography

3) Multimedia Application
- Management and Diffusion of Multimedia Applications
- Multimedia Data Base
- Information Retrieval
- Digital Rights Management

4) Image, Video, Signal and Information Processing
- Compression and Coding
- Analysis and Processing
- Computer Graphics
- Data mining
- Soft Computing
- Web based applications

Researchers are encouraged to submit their work electronically. All papers will be fully refereed by a minimum of two specialized referees. Before final acceptance, all referees' comments must be considered.

Important Dates
==============
Submission Date : Mar. 10, 2012
Notification of acceptance : Apr. 1, 2012
Camera Ready submission : Apr. 14, 2012
Registration : Apr. 14, 2012
Conference dates : May 16-18, 2012

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:32:14 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:32:14 +0100
Subject: Appel: TKE 2012 - New frontiers in the constructive symbiosis of terminology and knowledge engineering
Message-ID:

Date: Fri, 23 Dec 2011 00:45:46 +0100
From: Nathalie Aussenac-Gilles
Message-ID: <4EF3C12A.2010303 at irit.fr>
X-url: http://www.oeg-upm.net/tke2012

[Apologies for cross-posting]
------------------------------------------------------------------------
SECOND CALL FOR PAPERS
TKE 2012
New frontiers in the constructive symbiosis of terminology and knowledge engineering
TKE (Terminology and Knowledge Engineering) Conference
http://www.oeg-upm.net/tke2012
------------------------------------------------------------------------
When - Conference: 20-21 June 2012
When - Workshops: 19 & 22 June 2012
Where: Madrid, Spain
Submission Deadline: 21-01-2012
Notification Due: 18-02-2012
Final Version Due: 07-04-2012

Conference Aim and Topics

TKE 2012 will be the 10th edition of this conference that started out under the auspices of TermNet, and has developed towards wider fields in knowledge transfer for special language communication in a multilingual global society. In this context, interdisciplinary research emphasizing the intersection of different disciplines dealing with terminological aspects and knowledge engineering resources online is welcome.
This conference will mainly focus on those theoretical, methodological and practical aspects that show the symbiosis of terminology and knowledge engineering by highlighting the recent advances in these related fields. We invite papers on all aspects of:

I. Terminology and knowledge engineering
* Ontologies as semantic resources
* Lexicons and ontologies as means for knowledge transfer
* Classification systems and thesauri
* Multimodal information management in terminology
* Metadata in terminological and knowledge resources
* Ontology localization
* Multilingual terminology extraction and alignment for lexical resources
* Representation of terms and conceptual relations in knowledge-based applications
* Comparative studies of terminological resources and/or ontological resources from different languages, domains and approaches
* Interoperability and reusability in knowledge-based tools and applications

II. Terminology policies and terminological tools
* Terminology in Languages for Specialized Purposes (LSPs)
* Terminology and lexicography teaching and learning
* Terminology for language planning
* Terminological resources in the 21st century
* Multilinguality and multimodality in terminological resources
* Reusing, standardizing and merging terminological or ontological resources

III. Terminology applications
* Applications of terminological resources
* Terminology in industrial and business settings
* Terminological tools for machine translation
* Terminology in natural language processing tools
* Terminology in e-government and sector standardization

Submission of extended abstracts for oral presentations and posters

Extended abstracts of papers for oral presentations should consist of no more than four pages, excluding references and illustrations, and should be submitted via the Conference Submission System. To submit an abstract, go to the submission site (soon available) and select "new submission".
The abstract submitted will be used to assign reviewers, so be sure it identifies the subject area and expertise required to evaluate your work. The program committee may decide to change the category of a paper before acceptance.

Complete papers will be submitted electronically. Papers must be no longer than 16 pages (5000-6000 words). Papers that exceed this limit will be rejected without review. Submissions must be in PDF, formatted in the style of the Springer Publications format for Lecture Notes in Computer Science (LNCS). For details on the LNCS style, see Springer's Author Instructions.

The 2nd Call for Papers will include further details on accommodation and how to get there.

Organizing Committee
Guadalupe Aguado de Cea, UPM, Spain (Conference chair)
Gerhard Budin, Vienna University, Austria
Key Sun Choi, KAIST, Korea
Raúl García Castro, UPM, Spain
Elena Montiel-Ponsoda, UPM, Spain
Bodil Nistrup Madsen, CBS and GTW, Denmark
Ana Roldán Riejos, UPM, Spain
Mari Carmen Suárez-Figueroa, UPM, Spain
Klaus-Dirk Schmitz, Fachhochschule Köln and GTW, Germany
Hanne Erdman Thomsen, CBS and GTW, Denmark

Programme Committee
Bassey Antia, Nigeria
Nathalie Aussenac-Gilles, France
Lynne Bowker, Canada
Paul Buitelaar, Ireland
Maria Teresa Cabré, Spain
Rute Costa, Portugal
Anne Condamines, France
Beatrice Daille, France
Thierry Declerck, Germany
Pamela Faber, Spain
Gil Francopoulo, France
Asunción Gómez Pérez, Spain
Fidelia Ibewke-Sanjuan, France
Kyo Kageura, Japan
Hendrick Kockaert, Belgium
Marita Kristiansen, Norway
Marie-Claude L'Homme, Canada
Adeline Nazarenko, France
Henrik Nilsson, Sweden
Anita Nuopponen, Finland
Antonio Pareja-Lora, Spain
Maria Pozzi, Mexico
Christophe Roche, France
Laurent Romary, France
Sylvie Szluman, France
Frieda Steurs, Belgium
Birthe Toft, Denmark
Kara Warburton, Hong-Kong
Sue Ellen Wright, USA
Pierre Zweigenbaum, France

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:34:16 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:34:16 +0100
Subject: Appel: TKE 2012 - Call for Workshops
Message-ID:

Date: Fri, 23 Dec 2011 23:46:50 +0100
From: Nathalie Aussenac-Gilles
Message-ID: <4EF504DA.6070904 at irit.fr>

Apologies for multiple posting

****************************************************************
Call for Workshop Proposals
10th Terminology and Knowledge Engineering Conference (TKE 2012)
19th - 22nd June 2012
Madrid, Spain
****************************************************************

TKE 2012 will be the 10th edition of this conference that started out under the auspices of TermNet, and has developed towards wider fields in knowledge transfer for special language communication in a multilingual global society. In this context, interdisciplinary research emphasizing the intersection of different disciplines dealing with terminological aspects and knowledge engineering resources online is welcome.

Co-located workshops at TKE conferences are distinguished meeting points for discussing ongoing work and latest ideas related to terminology and knowledge engineering.

General Information and criteria
--------------------------------
The decision on acceptance/rejection of workshop proposals will be made on the basis of the overall quality of the proposal as well as of how it fits in with the conference.
The organizers of successful workshops will be responsible for their own reviewing process, publicity (e.g., website and call for papers), and proceedings production. They will be required to closely cooperate with the conference organizers to finalize all organizational details. Workshop attendees must pay the TKE 2012 workshop registration fee, as well as the conference registration fee.

Criteria for workshop proposals:
* Co-located workshops cover topics falling in the general scope of the TKE conference.
* Workshops are intended to be genuine interactive events and not mini-conferences.
* We welcome workshops with creative structures and organizations that attract various types of contributions and ensure rich interactions.
* They are clearly focused.
* There is potentially a significant community interested in the workshop's topic.
* Workshop duration can be half a day or a full day.
* We advise having more than one organizer and no more than four, preferably from different institutions, bringing different perspectives to the workshop topic.

Submission Guidelines
---------------------
Workshop proposals must be no more than 3 pages in length, in txt or PDF format, must be sent via email to Raúl García Castro (rgarcia at fi.upm.es), and should contain the following information:
* The title and brief technical description of the workshop, specifying its goals and motivation.
* A brief discussion of why the topic is of particular interest at this time.
* A brief description of why and to whom the workshop is of interest, the workshop audience, as well as the expected number of participants.
* A brief description (draft outline) of the proposed workshop format, discussing the mix of events such as paper presentations, invited talks, panels, and general discussion.
* A list of (potential) members of the program committee (at least 50% have to be known and confirmed at the time of the proposal).
* An indication of whether the workshop should be considered for a half-day or full-day event.
* If there have been previous editions of the workshop, a short description of them, including URLs as well as some statistics about submissions and acceptance rate.
* The names and contact information (web page, email address) of the proposed organizing committee and a short description of their relevant experience.

The organizing committee should also state whether they plan to prepare a workshop web site or whether the workshop information will be included in the main conference website.

Important Dates (11:59 pm Hawaii time)
Workshop Proposals Due: January 25, 2012
Notification of Acceptance: February 10, 2012
Workshop Website Due: February 25, 2012
Workshop Camera-ready Proceedings Due: June 1, 2012
Workshop Day: June 19 or 22, 2012

Workshop Chair
--------------
Raúl García Castro (UPM, Spain)
(tke2012workshops at delicias.dia.fi.upm.es)

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:01:56 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:01:56 +0100
Subject: Appel: Ireland International Conference on Education (IICE-2012), Reminder
Message-ID:

Date: Sun, 25 Dec 2011 16:21:17 +0000 (GMT)
From: Linda Woods
Message-ID: <322433556.44731.1324830077963.JavaMail.open-xchange at email.1and1.co.uk>
X-url: http://www.iicedu.org/

Apologies for cross-postings.
Kindly email this call for papers to your colleagues, faculty members and postgraduate students.

CALL FOR PAPERS
**********************************************************
Ireland International Conference on Education (IICE-2012),
April 16-18, 2012, Dublin, Ireland
(www.iicedu.org)
**********************************************************

The IICE is an international refereed conference dedicated to the advancement of theory and practice in education. The IICE promotes collaborative excellence between academicians and professionals from Education.

The aim of IICE is to provide an opportunity for academicians and professionals from various educational fields with cross-disciplinary interests to bridge the knowledge gap, and to promote research esteem and the evolution of pedagogy. The IICE 2012 invites research papers that encompass conceptual analysis, design implementation and performance evaluation. All accepted papers will appear in the proceedings, and modified versions of selected papers will be published in special issues of peer-reviewed journals.
The topics in IICE-2012 include but are not confined to the following areas:

* Academic Advising and Counselling
* Art Education
* Adult Education
* APD/Listening and Acoustics in Education Environment
* Business Education
* Counsellor Education
* Curriculum, Research and Development
* Competitive Skills
* Continuing Education
* Distance Education
* Early Childhood Education
* Educational Administration
* Educational Foundations
* Educational Psychology
* Educational Technology
* Education Policy and Leadership
* Elementary Education
* E-Learning
* E-Manufacturing
* ESL/TESL
* E-Society
* Geographical Education
* Geographic information systems
* Health Education
* Higher Education
* History
* Home Education
* Human Computer Interaction
* Human Resource Development
* Indigenous Education
* ICT Education
* Internet technologies
* Imaginative Education
* Kinesiology & Leisure Science
* K12
* Language Education
* Mathematics Education
* Mobile Applications
* Multi-Virtual Environment
* Music Education
* Pedagogy
* Physical Education (PE)
* Reading Education
* Writing Education
* Religion and Education Studies
* Research Assessment Exercise (RAE)
* Rural Education
* Science Education
* Secondary Education
* Second life Educators
* Social Studies Education
* Special Education
* Student Affairs
* Teacher Education
* Cross-disciplinary areas of Education
* Ubiquitous Computing
* Virtual Reality
* Wireless applications
* Other Areas of Education

Important Dates:
* Extended Abstract (Work in Progress) Submission Date: January 15, 2012
* Research Paper, Student Paper, Case Study, Report Submission Date: January 20, 2012
* Proposal for Workshops: January 15, 2012
* Notification of Workshop Acceptance/Rejection: January 20, 2012
* Poster Proposal Submission Date: January 10, 2012
* Notification of Poster Acceptance/Rejection: January 15, 2012
* Notification of Extended Abstract (Work in Progress) Acceptance/Rejection: January 25, 2012
* Notification of Research Paper, Student Paper, Case Study, Report Acceptance/Rejection: January 30, 2012
* Camera Ready Paper Due: February 10, 2012
* Early Bird Registration Deadline (Authors and Participants): February 15, 2012
* Late Bird Registration Deadline (Authors and Participants): March 10, 2012
* Conference Dates: April 16-18, 2012

For further information please visit IICE-2012 at www.iicedu.org

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:04:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:04:49 +0100
Subject: Appel: EACL 2012 Workshop on Innovative hybrid approaches to the processing of textual data
Message-ID:

Date: Mon, 26 Dec 2011 18:51:42 +0100
From: Thierry Hamon
Message-ID: <87d3bbz0ep.fsf at smtp.univ-paris13.fr>
X-url: http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/

Call for papers

Innovative hybrid approaches to the processing of textual data
April 22, 2012, Avignon, France
Website: http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/
Submission Deadline: Jan 27, 2012

The term "hybrid approach" covers a large set of situations in which different approaches are combined in order to better process textual data and to better accomplish the target task. The possible combinations are unlimited. The most frequent combination, as stressed during The Balancing Act in 1994, addressed machine learning and rule-based systems.
Beyond this, hybridization can be extended with distributional approaches, syntactic and morphological analyses, semantic distances and similarities, graph theory models, co-occurrences of linguistic units (e.g., words and their dependencies, word senses and POS tags, NEs and semantic roles, ...), knowledge-based approaches (terminologies and ontologies), etc. Hybridization therefore requires defining a strategy to combine several approaches efficiently: cooperation between approaches, filtering, voting or ranking of the multiple system outputs, etc.

Indeed, the combination of these different methods and approaches appears to provide more complete and better-performing results. The reason is that each method is sensitive to, and efficient with, particular data and particular contexts. Hence, their combination may improve both precision and recall: coverage improves, while using the different methods in filtering, voting, etc. modes can also improve precision.

In this workshop, we favour the extended meaning of the hybridization of methods, applied to various application areas, such as (but do not feel constrained by these):
- automatic creation of linguistic resources
- POS tagging
- building and structuring of terminologies
- information retrieval and filtering
- information extraction
- linguistic annotation
- semantic labeling
- sign language recognition and transcription
- oral data transcription
- filtering and validation of lexical resources
- text summarization
- question/answering system
- natural language generation
- etc.

We invite authors to submit novel methods and novel conceptions of hybridization in the various areas related to textual data processing.
Important dates

Nov 25, 2011: 1st workshop CFP
Jan 04, 2012: Abstract deadline
Jan 27, 2012: Paper due date
Feb 24, 2012: Notification of acceptance
Mar 09, 2012: Camera-ready deadline
Apr 22, 2012: Workshop

Submission instructions:

Authors are invited to submit full papers on original, unpublished work in the topic area of this workshop. Submissions should be formatted using the EACL 2012 style files for LaTeX or MS Word, prepared for blind review, and should not exceed 8 pages plus an extra page for references. The PDF files are to be submitted electronically at https://www.softconf.com/eacl2012/Hybrid2012/

Program Committee

Philipp Cimiano, CITEC, University of Bielefeld, Germany
Vincent Claveau, IRISA, CNRS, France
Kevin Cohen, University of Colorado Health Sciences Center, USA
Marie-Claude l'Homme, OLST, Université de Montreal, Canada
Béatrice Daille, LINA, Université de Nantes, France
Sampo Pyysalo, NaCTeM, University of Manchester, United Kingdom
Mathieu Roche, LIRMM, Université de Montpellier 2, France
Patrick Ruch, Haute école de gestion de Genève, Switzerland
Paul Thompson, NaCTeM, University of Manchester, United Kingdom
Özlem Uzuner, University at Albany, State University of New York, USA

Organization Committee

Natalia Grabar, CNRS UMR 8163 STL, Université Lille 1&3, France
Marie Dupuch, MOSTRARE/LIFL & CNRS UMR 8163 STL, Université Lille 1&3, France
Amandine Périnet, LIM&BIO, Université Paris 13, France
Thierry Hamon, LIM&BIO, Université Paris 13, France

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:06:44 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:06:44 +0100 Subject: Appel: 1st CfP Workshop on Language Resources and Technologies for Turkic Languages, LREC 2012 Workshop Message-ID: Date: Tue, 27 Dec 2011 18:14:27 +0100 From: ELRA ELDA Information Message-ID: <4EF9FCF3.3030805 at elda.org> [Apologies for cross-postings] First Workshop on Language Resources and Technologies for Turkic Languages May 21, 2012 (afternoon session) First Call for Papers Turkic languages (Turkish, Azerbaijani, Turkmen, Kazakh, Tatar, Uzbek and many others) are spoken as native languages by more than 150 million people all around the world. Turkic languages have complex agglutinative morphology with very productive inflectional and derivational processes leading to a very large vocabulary size. They also have a very free constituent order with almost no formal constraints. Furthermore, due to various historical and social reasons these languages have employed a wide-variety of writing systems and still do so. These aspects bring numerous challenges to computational processing of these languages in tasks such as language modeling, parsing, statistical machine translation, speech-to-speech translation, etc. Time is now ripe to focus on the development of language resources and computational processing techniques for these languages to bring their status up to par with more studied languages in the context of speech and language processing. We are pleased to invite you to the "First Workshop on Language Resources and Technologies for Turkic Languages" to be held during LREC 2012 in Istanbul. This half-day workshop will be the first attempt to bring together the researchers, commercial interests and other stakeholders actively involved in developing and using speech and language technologies for Turkic languages. 
The workshop invites submission of papers on original and unpublished research on all aspects of speech and language technologies for Turkic languages. Selected contributions will be presented as short-oral presentations or posters. Topics include but are not limited to: - Morphological and Syntactic Analysis for Turkic languages - Novel language modeling techniques for Turkic languages - Development and adaptation of language and speech resources for Turkic languages - Machine translation into/from/between Turkic languages - Challenges for speech recognition and generation for Turkic languages Submission Information Submissions must describe original and unpublished work. Each submission will be judged on its originality, technical content, and relevance to the workshop. Contributions accompanied by language resources and datasets that are described in the paper are particularly encouraged. Paper submissions up to 4 pages (excluding references) must conform to the official LREC 2012 style guidelines (will be distributed soon on the LREC website). Submissions should be in PDF format and submitted through the START online submission system https://www.softconf.com/lrec2012/TurkicLanguage2012/. Since the reviewing will not be blind, submissions must include the authors' names and affiliations. Double submissions to other conferences are not allowed. More information on the submission procedure will be available in the next call of the papers. When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012. 
Important Dates Paper submission deadline: February 27, 2012 Notification of acceptance: March 19, 2012 Camera-ready submission deadline: March 30, 2012 Workshop: May 21, 2012 (afternoon session) Organizing Committee Kemal Oflazer, Carnegie Mellon University - Qatar Mehmed Özkan, Boğaziçi University Mehmet Uğur Doğan, TÜBİTAK-BİLGEM Dilek Hakkani-Tür, Microsoft Alper Kanak, TÜBİTAK-BİLGEM İlknur Durgar El-Kahlout, TÜBİTAK-BİLGEM Şeniz Demir, TÜBİTAK-BİLGEM Yücel Bicil, TÜBİTAK-BİLGEM Program Committee Adil Alpkoçak, Dokuz Eylül University Ahmet Cüneyd Tantuğ, İstanbul Technical University Arzucan Özgür, Boğaziçi University Atakan Kurt, Fatih University Banu Diri, Yıldız Technical University Barış Bozkurt, Bahçeşehir University Bilge Say, Middle East Technical University Cem Bozşahin, Middle East Technical University Cemil Demir, TÜBİTAK-BİLGEM Cenk Demiroğlu, Özyeğin University Coşkun Mermer, TÜBİTAK-BİLGEM Deniz Yüret, Koç University Deniz Zeyrek, Middle East Technical University Ebru Arısoy, IBM T.J. 
Watson Research Center Engin Erzin, Koç University Erdem Ünal, TÜBİTAK-BİLGEM Fatma Canan Pembe, TÜBİTAK-BİLGEM Gülşen Cebiroğlu Eryiğit, İstanbul Technical University Levent Arslan, Boğaziçi University Mehmet Fatih Amasyalı, Yıldız Technical University Murat Can Ganiz, Doğuş University Murat Saraçlar, Google - Boğaziçi University Oğuzhan Külekçi, TÜBİTAK-BİLGEM Özlem Çetinoğlu, University of Stuttgart Ruken Cakici, Middle East Technical University Selçuk Köprü, Teknoloji Yazılımevi Şükriye Ruhi, Middle East Technical University Tunga Güngör, Boğaziçi University Ümit Güz, Işık University Yeşim Aksan, Mersin University Yusuf Ziya Işık, TÜBİTAK-BİLGEM Sponsor The workshop is being supported by the European Commission FP7-REGPOT-2008-1 project (#229861) MULTISAUND (MULTilingualism Integrated to Speech and Audio UNDerstanding) - "Improvement of interactive and secure language and speech processing potential of TUBITAK-UEKAE for an increased multilingual capability in ERA". Contact Information Please contact Mehmet Uğur Doğan (mugur at uekae.tubitak.gov.tr) or İlknur Durgar El-Kahlout (idurgar at uekae.tubitak.gov.tr) for any further inquiries. 
From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:10:07 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:10:07 +0100
Subject: Appel: 1st CfP Workshop on Indian Language Data: Resources and Evaluation, LREC 2012 Workshops
Message-ID:

Date: Tue, 27 Dec 2011 18:18:48 +0100
From: ELRA ELDA Information
Message-ID: <4EF9FDF8.1010607 at elda.org>
X-url: http://sanskrit.jnu.ac.in/conf/wildre

[Apologies for cross-postings]

WILDRE- Workshop on Indian Language Data: Resources and Evaluation

FIRST WORKSHOP ON INDIAN LANGUAGE DATA: RESOURCES AND EVALUATION (WILDRE)

Date: Monday, 21st May 2012 (morning session)
Venue: Lütfi Kirdar Istanbul Exhibition and Congress Centre, Turkey
(Organized under the platform of LREC2012 (21-27 May 2012))
Website: http://sanskrit.jnu.ac.in/conf/wildre

WILDRE – the first workshop on Indian Language Data: Resources and Evaluation is being organized in Istanbul, Turkey on 21st May, 2012 under the LREC platform. India has huge linguistic diversity and has seen concerted efforts from the Indian government and industry towards developing language resources. The European Language Resources Association (ELRA) and its associate organizations have been very active and successful in addressing the challenges and opportunities related to language resource creation and evaluation.
It is therefore a great opportunity for resource creators of Indian languages to showcase their work on this platform and also to interact with and learn from those involved in similar initiatives all over the world.

The broader objectives of WILDRE will be
- To map the status of Indian Language Resources
- To investigate challenges related to creating and sharing various levels of language resources
- To promote a dialogue between language resource developers and users
- To provide opportunity for researchers from India to collaborate with researchers from other parts of the world

DATES

February 12, 2012 Paper submissions due
March 18, 2012 Paper notification of acceptance
March 30, 2012 Camera-ready papers due
May 21, 2012 Workshop (morning session)

SUBMISSIONS

Papers must describe original, completed or in progress, and unpublished work. Each submission will be reviewed by two program committee members. Accepted papers will be given up to 10 pages (for full papers) or 5 pages (for short papers and posters) in the workshop proceedings, and will be presented as an oral presentation or a poster. Papers should be formatted according to the style-sheet, which will be provided on the LREC 2012 website (http://www.lrec-conf.org/lrec2012/).
Please submit papers in PDF/doc format to: https://www.softconf.com/lrec2012/WILDRE2012/

We are seeking submissions under the following categories
- Full papers (10 pages)
- Short papers (work in progress – 5 pages)
- Posters (innovative ideas/proposals, research proposal of students)
- Demo (of working online/standalone systems)

Though our area of interest covers all NLP/language technology related activity for Indian languages, we would like to focus on resource creation in the following areas
- Text corpora
- Speech corpora
- Lexicons and Machine-readable dictionaries
- Ontologies
- Grammars
- Annotation of corpora
- Language resources for basic NLP, IR and Speech Technology tasks, tools and
- Infrastructure for constructing and sharing language resources
- Standards or specifications for language resources applications
- Licensing and copyright issues

Both submission and review processes will be handled electronically using the START interface of the LREC website. The workshop website will provide the submission guidelines and the link for the electronic submission.

When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 . Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012.

Conference Chairs
- Girish Nath Jha, Jawaharlal Nehru University, New Delhi
- Kalika Bali, Microsoft Research India Lab, Bangalore
- Sobha L, AU-KBC Research Centre, Anna University, Chennai

Program Committee
1. A. Kumaran, MSRI, Bangalore
2. A G Ramakrishnan, I.I.Sc Bangalore
3. Amba Kulkarni, University of Hyderabad
4. Chris Cieri, LDC, University of Pennsylvania
5. Dafydd Gibbon, Universität Bielefeld, Germany
6. Dipti Mishra Sharma, IIIT, Hyderabad
7. Girish Nath Jha, Jawaharlal Nehru University, New Delhi
8. Hema Murthy, IIT, Chennai
9. Joseph Mariani, LIMSI-CNRS, France
10. Kalika Bali, MSRI, Bangalore
11. Khalid Choukri, ELRA, France
12. L Ramamoorthy, LDC-IL, CIIL, Mysore
13. Monojit Choudhary, MSRI Bangalore
14. Nicoletta Calzolari, ILC-CNR, Pisa, Italy
15. Niladri Shekhar Dash, ISI Kolkata
16. Shivaji Bandhopadhyay, Jadavpur University, Kolkata
17. Shyamal Das Mondal, IIT Kharagpur
18. Sobha L, AU-KBC Research Centre, Anna University
19. Soma Paul, IIIT, Hyderabad
20. Umamaheshwar Rao, University of Hyderabad

Workshop contact: diwakar.mishra at gmail.com
Diwakar Mishra, Special Center for Sanskrit Studies, Jawaharlal Nehru University, New Delhi

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:12:19 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:12:19 +0100
Subject: Appel: 1CfP LRE-Rel: Language Resources and Evaluation for Religious Texts, LREC 2012 Workshops
Message-ID:

Date: Wed, 28 Dec 2011 11:49:40 +0100
From: ELRA ELDA Information
Message-ID: <4EFAF444.7080701 at elda.org>
X-url: http://www.comp.leeds.ac.uk/arabic/lre-rel.html

[apologies for multiple postings]

LRE-Rel: Language Resources and Evaluation for Religious Texts
LREC 2012 pre-conference workshop
22 May 2012 (morning)
Lütfi Kirdar Istanbul Exhibition and Congress Centre
http://www.comp.leeds.ac.uk/arabic/lre-rel.html

FIRST CALL FOR PAPERS

We invite submissions for the first workshop on:
Language Resources and Evaluation for Religious Texts.

The focus of this workshop is the application of computer-supported and Text Analytics techniques to religious texts ranging from: the faith-defining religious canon; authoritative interpretations and commentary; sermons; liturgy; prayers; poetry; and lyrics. We see this as an inclusive and cross-disciplinary topic, and the workshop aims to bring together researchers with a generic interest in religious texts to raise awareness of different perspectives and practices, and to identify some common themes.

We therefore welcome submissions on a range of topics, including but not limited to:
- analysis of ceremonial, liturgical, and ritual speech; recitation styles; speech decorum; discourse analysis for religious texts;
- formulaic language and multi-word expressions in religious texts;
- suitability of modal and other logic types for knowledge representation and inference in religious texts;
- issues in, and evaluation of, machine translation in religious texts;
- text-mining, stylometry, and authorship attribution for religious texts;
- corpus query languages and tools for exploring religious corpora;
- dictionaries, thesauri, Wordnet, and ontologies for religious texts;
- measuring semantic relatedness between multiple religious texts;
- (new) corpora and rich and novel annotation schemes for religious texts;
- annotation and analysis of religious metaphor;
- genre analysis for religious texts;
- application in other disciplines (e.g. theology, classics, philosophy, literature) of computer-supported methods for analysing religious texts.
Important Dates 13.02.2012 Deadline for paper submissions 27.02.2012 Notification of acceptance 09.03.2012 Camera-ready copies due 22.05.2012 Workshop Submissions Papers should follow the LREC main conference formatting details (to be announced on the conference website: http://www.lrec-conf.org/lrec2012/) and should be submitted either as a Word document or a .pdf file (preferably both) via the START conference manager for our workshop: https://www.softconf.com/lrec2012/LRE-Rel2012/ Papers will be reviewed by three members of the Programme Committee. Accepted papers will be published in the workshop proceedings, and possibly via special issue of an appropriate journal. This year, when submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. For further information on this new initiative, please refer to: http://www.lrec-conf.org/lrec2012/LRE-Map-2012 Workshop Organisers and Co-chairs, with contact persons identified via email address Abdul Malik Al-Salman: King Saud University, Saudi Arabia Eric Atwell: University of Leeds, UK Claire Brierley: University of Leeds, UK scscb at leeds.ac.uk Azzeddine Mazroui: Mohammed First University, Morocco Majdi Sawalha: University of Jordan sawalha.majdi at gmail.com Abdul-Baquee M. 
Sharaf: University of Leeds, UK Bayan Abu Shawar: Arab Open University, Jordan Programme Committee Nawal Alhelwal: Arabic Department, Princess Nora bint Abdulrahman University, Saudi Arabia Qasem Al-Radaideh: Computer Information Systems, Yarmouk University, Jordan Abdul Malik Al-Salman: Computer and Information Sciences, King Saud University, Saudi Arabia Eric Atwell: School of Computing, University of Leeds, UK Amna Basharat: Foundation for Advancement of Science and Technology, FAST-NU, Pakistan James Dickins: Arabic and Middle Eastern Studies, University of Leeds, UK Kais Dukes: School of Computing, University of Leeds, UK Mahmoud El-Haj: Computer Science and Electronic Engineering, University of Essex, UK Nizar Habash: Center for Computational Learning Systems, Columbia University, US Salwa Hamada: Electronics Research Institute, Egypt Bassam Hasan Hammo: Information Systems, King Saud University, Saudi Arabia Dag Haug: Philosophy, Classics, History of Art and Ideas, University of Oslo, Norway Moshe Koppel: Department of Computer Science, Bar-Ilan University, Israel Rohana Mahmud: Computer Science and Information Technology, University of Malaya, Malaysia Azzeddine Mazroui: Mathematics and Computer Science, Mohammed 1st University, Morocco Tony McEnery: English Language and Linguistics, University of Lancaster, UK Aida Mustapha: Computer Science and Information Technology, University of Putra, Malaysia Mohamadou Nassourou: Computer Philology and Modern German Literature, University of Würzburg, Germany Nils Reiter: Department of Computational Linguistics, Heidelberg University, Germany Abdul-Baquee M. 
Sharaf: School of Computing, University of Leeds, UK
Bayan Abu Shawar: Information Technology and Computing, Arab Open University, Jordan
Andrew Wilson: Linguistics and English Language, University of Lancaster, UK
Nagwa Younis: English Department, Ain Shams University, Egypt
Wajdi Zaghouani: Linguistic Data Consortium, University of Pennsylvania, US

Further Information

If you have questions, please consult the workshop website at: http://www.comp.leeds.ac.uk/arabic/lre-rel.html
Alternatively, contact: Claire Brierley scscb at leeds.ac.uk or Majdi Sawalha sawalha.majdi at gmail.com

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:14:16 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:14:16 +0100
Subject: Appel: 1st CfP Workshop on Building and Using Comparable Corpora, LREC 2012 Workshops
Message-ID:

Date: Wed, 28 Dec 2011 12:17:57 +0100
From: ELRA ELDA Information
Message-ID: <4EFAFAE5.6020903 at elda.org>
X-url: http://hnk.ffzg.hr/5bucc2012

[Apologies for multiple postings]

============================================================
5th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Language Resources for Machine Translation in Less-Resourced Languages and Domains
Co-located with LREC 2012
Lütfi Kirdar Istanbul Exhibition and Congress Centre
Saturday, 26 May 2012
DEADLINE FOR PAPERS: 15 February 2012
http://hnk.ffzg.hr/5bucc2012
Endorsed by
* ACL SIGWAC (Special Interest Group on Web as Corpus)
* FLaReNet
(Fostering Language Resources Network) ============================================================ MOTIVATION In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons. In language engineering, it is chiefly motivated by the need to use comparable corpora as training data for statistical NLP applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora. The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages. 
Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. However, beyond a few language pairs such as English- French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web. To exemplify such issues in a practical setting, this year's special focus will be on Language Resources for Machine Translation in Less-Resourced Languages and Domains with the aim of overcoming the shortage of parallel resources when building MT systems for less-resourced languages and domains, particularly by usage of comparable corpora for finding parallel data within and by reaching out for "hidden" parallel data. Lack of sufficient language resources for many language pairs and domains is currently one of the major obstacles in further advancement of machine translation. TOPICS We solicit contributions including but not limited to the following topics: Topics related to the special theme: * comparable corpora use in MT * comparable corpora processing tools/kits for MT * parallel corpora usage * parallel corpora processing tools/platforms * MT for less-resourced languages * MT for less-resourced domains * open source SMT systems (Moses, etc.) 
* publicly available SMT Building Comparable Corpora: * Human translations * Automatic and semi-automatic methods * Methods to mine parallel and non-parallel corpora from the Web * Tools and criteria to evaluate the comparability of corpora * Parallel vs non-parallel corpora, monolingual corpora * Rare and minority languages * Across language families * Multi-media/multi-modal comparable corpora Applications of comparable corpora: * Human translations * Language learning * Cross-language information retrieval & document categorization * Bilingual projections * Machine translation * Writing assistance Mining from Comparable Corpora: * Extraction of parallel segments or paraphrases from comparable corpora * Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc. IMPORTANT DATES (TENTATIVE) 15 February 2012 Deadline for submission of full papers 10 March 2012 Notification of acceptance 20 March 2012 Camera-ready papers due 26 May 2012 Workshop date SUBMISSION INFORMATION Papers should follow the LREC main conference formatting details (to be announced on the conference website http://www.lrec-conf.org/lrec2012/) and should be submitted as a PDF-file of no more than ten pages via the START workshop manager: https://www.softconf.com/lrec2012/BUCC2012/ Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings. Double submission policy: Parallel submission to other meetings or publications are possible but must be immediately notified to the workshop organizers. When submitting a paper through the START page, authors will be asked to provide information about the resources that have been used for the work described in their paper or are an outcome of their research. For details on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. 
Authors will also be asked to contribute to the Language Library, the new initiative of LREC 2012.

For further information, please contact
Reinhard Rapp reinhardrapp (at) gmx (dot) de
or Marko Tadic marko.tadic (at) ffzg (dot) hr

ORGANISERS

Reinhard Rapp, Universities of Mainz (Germany) and Leeds (UK)
Marko Tadic, University of Zagreb (Croatia)
Serge Sharoff, University of Leeds (UK)
Andrejs Vasiljevs, Tilde SIA, Riga, Latvia
Pierre Zweigenbaum, LIMSI, CNRS, Orsay, and ERTIM, INALCO, Paris (France)

SCIENTIFIC COMMITTEE

* Srinivas Bangalore (AT&T Labs, USA)
* Caroline Barrière (National Research Council Canada)
* Chris Biemann (Microsoft / Powerset, San Francisco, USA)
* Lynne Bowker (University of Ottawa, Canada)
* Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
* Andreas Eisele (DFKI, Saarbrücken, Germany)
* Rob Gaizauskas (University of Sheffield, UK)
* Éric Gaussier (Université Joseph Fourier, Grenoble, France)
* Nikos Glaros (ILSP, Athens, Greece)
* Gregory Grefenstette (Exalead/Dassault Systemes, Paris, France)
* Silvia Hansen-Schirra (University of Mainz, Germany)
* Kyo Kageura (University of Tokyo, Japan)
* Adam Kilgarriff (Lexical Computing Ltd, UK)
* Natalie Kübler (Université Paris Diderot, France)
* Philippe Langlais (Université de Montréal, Canada)
* Tony McEnery (Lancaster University, UK)
* Emmanuel Morin (Université de Nantes, France)
* Dragos Stefan Munteanu (Language Weaver Inc., USA)
* Lene Offersgaard (University of Copenhagen, Denmark)
* Reinhard Rapp (Universities of Mainz, Germany, and Leeds, UK)
* Sujith Ravi (Yahoo! Research, Santa Clara, CA, USA)
* Serge Sharoff (University of Leeds, UK)
* Michel Simard (National Research Council Canada)
* Inguna Skadina (Tilde, Riga, Latvia)
* Monique Slodzian (INALCO, Paris, France)
* Benjamin Tsou (The Hong Kong Institute of Education, China)
* Dan Tufis (Romanian Academy, Bucharest, Romania)
* Justin Washtell (University of Leeds, UK)
* Oliver Wilson (University of Edinburgh, UK)
* Michael Zock (LIF, CNRS Marseille, France)
* Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:17:05 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:17:05 +0100
Subject: Appel: 1CfP Joint ISA-7, SRSL-3, and I2MRT, LREC 2012 Workshops
Message-ID:

Date: Wed, 28 Dec 2011 12:35:59 +0100
From: ELRA ELDA Information
Message-ID: <4EFAFF1F.7090202 at elda.org>

[Apologies for multiple postings]

CALL FOR PAPERS

Joint ISA-7, SRSL-3, and I2MRT LREC 2012 Workshop on Semantic Annotation and the Integration and Interoperability of Multimodal Resources and Tools

Istanbul, 26-27 May 2012.

BACKGROUND

Three initiatives join forces in this workshop, which is concerned with issues in semantic annotation for language resources, especially in relation to spoken and multimodal language data, and with the interoperability and integration of multimodal resources and tools.
ISA-7 is the Seventh Workshop on Interoperable Semantic Annotation, and forms part of a series of workshops of ISO TC 37/SC 4 (Language Resources) jointly with ACL SIGSEM (Computational Semantics). These workshops bring together experts in the annotation of semantic information as expressed in text, speech, gestures, graphics, video, images, and in multiple modalities combined. Examples of semantic annotation include the markup of events, time, space, dialogue acts, discourse relations, and semantic roles, for which ISO pursues the establishment of annotation standards in order to support the creation of interoperable semantic resources. SRSL-3 is the Third Workshop on Semantic Representation of Spoken Language in Speech and Multimodal Corpora. These workshops bring together researchers working on speech and multimodal resources for the semantic annotation of related corpora. They take their inspiration from the observation that the semantic gap between the content conveyed by speech and other modalities and their formal representation is a burning issue in a range of tasks such as content mining, information extraction, dialogue processing, interactive story-telling, assisted health care, and human-robot interaction. I2MRT (Integration and Interoperability for Multimodal Resources and Tools) is an initiative to address infrastructure aspects of the creation and use of interoperable multimodal resources. The main objectives of I2MRT are to create awareness of the need to make multimodal data visible via standardized methods and accessible via registered data centers; to discuss possibilities of harmonization and standardization with respect to multimodal annotation schemes and possible mappings between encoding schemes; to discuss ways to make cutting-edge technologies available to multimodality researchers that can currently only be used in specialized labs; and to build a community that is committed to working further on these issues. 
PAPER SUBMISSION Papers are to be submitted electronically using the START conference management page of LREC 2012 for this workshop: https://www.softconf.com/lrec2012/SAIIMRT2012/ When submitting a paper to the START page, authors will be asked to provide essential information about resources (in a broad sense, including also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or that are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 This 1.5-day workshop has two tracks, one related to interoperable semantic annotation, and one related to multimodal resources and tools. For each track, research papers are invited describing original, unpublished research. Research papers can be either long (6-8 pages) or short (3-4 pages). Short papers are also invited describing ongoing or proposed projects or infrastructure-related activities. Topics for the tracks include, but are not limited to, the following: Track "Interoperable Semantic Annotation": * methodological aspects of semantic annotation * semantic annotation and semantic interpretation * the semantics of semantic annotations * interoperability of spoken and written corpora * content identification and segmentation in spontaneous speech * semantic annotation in dialogue, with a focus on multimodality * segmentation of multimodal interaction data into semantic units * issues in semantic annotation in specific domains, such as time and space; relations in discourse and dialogue; semantic roles and predicate-argument structures * developments in recent, current, and planned ISO projects on developing semantic annotation standards. 
Track "Integration and Interoperability for Multimodal Resources and Tools": * encoding systems in use in multimodal applications * metadata concepts to describe multimedia/multimodal resources * attempts to harmonize encoding schemes and to map between them * cutting-edge multimedia processing technology that may help multimodality researchers to speed up annotation work * relevance and good examples of usability solutions to efficiently deal with the inherently imperfect results of any recognition technology * possibilities to create strong data hubs with attention for legal and ethical aspects of transferring multimedia/multimodal data * approaches to the creation of a service-oriented architecture in this field. IMPORTANT DATES: deadline for submitting papers: February 27 notification of acceptance: March 15 camera-ready papers: March 29 workshop dates: May 26 (full day) and May 27 (morning session) ORGANIZING COMMITTEE: Harry Bunt (co-chair) Manuel Alcántara-Pla (co-chair) Peter Wittenburg (co-chair) Thierry Declerck Dafydd Gibbon Nancy Ide Steven Krauwer Kiyong Lee Lorenza Mondada James Pustejovsky Laurent Romary Oliver Schreer PROGRAMME COMMITTEE: Jan Alexandersson Stefan Baumann Jonas Beskow Paul Buitelaar Harry Bunt (co-chair) Thierry Declerck Alex Fang Raquel Fernandez Rovira Annette Frank Dafydd Gibbon Koiti Hasida Nancy Ide Michael Kipp Kiyong Lee Inderjeet Mani Jean-Claude Martin Lorenza Mondada Martha Palmer Volha Petukhova Andrei Popescu-Belis Rashmi Prasad James Pustejovsky Laurent Romary Oliver Schreer Mark Steedman Mariet Theune Isabel Trancoso MORE INFORMATION: For more information please contact any one of the three workshop co-chairs: Harry Bunt, harry.bunt at uvt.nl Manuel Alcántara Pla, manuel.alcantara at uam.es Peter Wittenburg, peter.wittenburg at mpi.nl ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : 
http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:19:47 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:19:47 +0100 Subject: Revue: Langages, n183 (3/2011) Message-ID: Date: Wed, 28 Dec 2011 15:23:45 +0100 From: "Catherine SCHNEDECKER" Message-ID: X-url: http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&ida Langages, nº 183 (3/2011) Néologie Septembre 2011 MEJRI Salah, SABLAYROLLES Jean-François (coord.) MEJRI Salah, SABLAYROLLES Jean-François : Présentation : Néologie, nouveaux modèles théoriques et NTIC http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8720 MORTUREUX Marie-Françoise La néologie lexicale : de l'impasse à l'ouverture http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8719 MEJRI Salah : Néologie et unité lexicale : renouvellement théorique, polylexicalité et emploi http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8718 SABLAYROLLES Jean-François : De la « néologie syntaxique » à la néologie combinatoire http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8717 HUMBLEY John : Vers une méthode de terminologie rétrospective http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8716 DUCOS Joëlle, SALVADOR Xavier-Laurent :Pour un dictionnaire de français scientifique médiéval : le projet Crealscience http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8715 LÉTURGIE Arnaud : À propos de l'amalgamation lexicale en français 
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8714 ISSAC Fabrice : Cybernéologie : quelques outils informatiques pour l'identification et le traitement des néologismes sur le web http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8713 CARTIER Emmanuel : Néologie et description linguistique pour le TAL http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8712 ANASTASSIADIS-SYMÉONIDIS Anna, NIKOLAOU Georgia : L'adaptation morphologique des emprunts néologiques : en quoi est-elle précieuse ? http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8711 ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- 
From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:51:20 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 21:51:20 +0100 Subject: Appel: EACL 2012 call for demos, Dernier rappel Message-ID: Date: Wed, 30 Nov 2011 17:21:38 +0000 From: Frédérique Segond Message-ID: X-url: http://eacl2012.org/home/index.html" The Thirteenth Conference of the European Chapter of the Association for Computational Linguistics http://www.eacl2012.org Avignon, France April 23-27, 2012 Call for demos EACL 2012 features a special session for demonstrations. Topics of interest cover all aspects of computational linguistics, as outlined in the main conference call for papers. Demos will be presented at specific sessions from the main program of the conference. The demo track allows researchers and practitioners to demonstrate their new and innovative systems while interacting with their audience in an informal setting. Demos' submissions should be concerned with mature systems or prototypes in which computational linguistics or NLP technologies are used to solve practically important problems. 
Demo papers should make clear which aspects of the system will be demonstrated, and how the system is used to solve practical problems. They should include a discussion of the implementation and case studies of how the system is applied. Requirements At least one author of each accepted demo paper must register for and attend the conference, and demonstrate the system during the demo sessions (please see the Conference Policies). Authors of accepted demos may also be required to demonstrate their systems at additional events during the conference. Each accepted demo paper will be allocated four pages in the conference proceedings; no extra pages can be purchased. Demo authors are not required to transfer copyright. Submission Procedure Authors of demos should submit an original paper describing their proposed presentation. The length of the paper is limited to 4 pages, excluding references, which are unlimited. The submission should be in line with the EACL 2012 main conference formatting styles and instructions for authors. Proposals for demos should provide: 1. an overview of what the demonstration aims to achieve 2. how the demo illustrates novel ideas 3. any URLs that link to screen-shots, live demos, or related information 4. equipment or facilities required for the system demonstration Submission site: https://www.softconf.com/eacl2012/EACL-Demos-2012/ Reviewing Demos will be blind peer-reviewed by two members of the Demos Program Committee, who will judge the originality, significance, quality, and clarity of each submission. 
Demos Chair : Frédérique Segond (Objet Direct, France) Contact: fsegond at objetdirect.com PC Members Sophia Ananiadou (University of Manchester, UK) Jordi Atserias (Yahoo, Spain) Galia Angelova (Bulgarian Academy of Sciences, Bulgaria) Raffaella Bernardi (University of Trento, Italy) Luca Dini (CELI, Italy) Gregory Grefenstette (Exalead, France) Michael Matthews (Yahoo, Spain) Piek Vossen (Vrije Universiteit Amsterdam, The Netherlands) Stelios Piperidis (ILSP, Greece) Sebastian Padó (University of Heidelberg, Germany) Ralf Steinberger (European Commission - Joint Research Centre (JRC), Italy) Evelyne Viegas (Microsoft, USA) Claire Waast (EDF France) Wlodek Zadrozny (IBM, USA) Important Dates * Paper submission deadline: December 4, 2011 (11:59pm Samoa Time (UTC/GMT -11 hours)) * Notification of acceptance: February 6, 2012 * Camera ready copies due: March 9, 2012 * EACL 2012 Conference: April 23-27, 2012 Submission DEMOS site: https://www.softconf.com/eacl2012/EACL-Demos-2012/ ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- 
From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:53:07 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 21:53:07 +0100 Subject: These: Loic Kervajan, Traduction automatique Fr/LSF Message-ID: Date: Wed, 30 Nov 2011 20:39:50 +0100 From: Loïc Kervajan Message-ID: <006001ccaf97$d42615d0$7c724170$@fr> X-url: http://www.kervajan.fr ------------------------------------------------------------------------ Dear friends, I am pleased to announce that I will defend my doctoral thesis on Friday, 9 December 2011 at 2 pm. It is a thesis in linguistics applied to natural language processing. The subject is: Contribution à la traduction automatique français/Langue des Signes Française au moyen de personnages virtuels (Contribution to French/French Sign Language machine translation by means of virtual characters). The defense, which is public and will be interpreted in LSF, will take place at the Université de Sciences Humaines d'Aix-en-Provence (29, av Robert Schuman, 13621 Aix-en-Provence), in room C.212 *. To help with the organization, please confirm your attendance by return email. Best regards, Loïc Kervajan Email: loic at kervajan.fr Personal site: http://www.kervajan.fr * To get to C212: Enter through the main hall. Go up the main staircase (on the right as you enter). Continue straight ahead to the end of the corridor. Go up one more floor. Take the corridor on the left; C212 is on the left... ... it will be signposted! ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:54:45 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 21:54:45 +0100 Subject: Appel: TextGraphs-7 Workshop associated to ACL 2012 Message-ID: Date: Thu, 01 Dec 2011 19:39:55 +0100 From: Gaël Dias Message-ID: <4ED7C9FB.3020206 at unicaen.fr> X-url: http://www.textgraphs.org/ws12 ====================================================================== CALL FOR PAPERS TextGraphs-7: Graph-based Methods for Natural Language Processing (http://www.textgraphs.org/ws12) 
Special Theme: Understanding and Mining Social Media Using Graphs: Information Propagation in Blogs and Tweets Workshop of ACL 2012 July 12, 2012, Jeju, Republic of Korea Submission deadline: March 18, 2012 ======================================================================== (1) Workshop Description (2) Workshop Submissions (3) Important Dates (4) Author Instructions (5) Invited Speaker (6) Poster Session (7) Workshop Organisers (8) Program Committee (9) Contacts ================================================== (1) Workshop Description ================================================== Recent years have shown an increased interest in integrating various aspects of the field of Graph Theory (GT) into Natural Language Processing (NLP). Many language phenomena can be naturally put into graph-based representations, and in the last 5 years a significant number of NLP applications have adopted efficient and elegant solutions from graph-theoretical frameworks. These applications range from part-of-speech tagging, word sense disambiguation, ontology learning and parsing to information extraction, semantic role assignment, summarization and sentiment analysis, to name but a few. The emergence of new fields of research focusing on social media such as Twitter and Facebook has brought graph-based methods even more into focus. In particular, graph-based algorithms are used to explore social network connections and the propagation of information in those networks, in addition to exploring the connections between language entities. As a consequence, many new applications have been emerging, such as rumor proliferation, e-reputation, multiple identity detection, language dynamics learning and future events prediction, to name but a few. TextGraphs-7 aims to bring together researchers interested in GT applied to NLP and provide an environment for further integration of graph-based solutions into NLP tasks. 
The workshop also encourages discussions about theoretical justifications from GT that explain empirical results obtained in the NLP community. As a consequence, a deeper understanding of new theories of graph-based algorithms is likely to help to create new approaches and widen the usage of graphs for NLP applications. ================================================= (2) Workshop Submissions ================================================= TextGraphs-7 will invite innovative submissions on the following, but not limited to, three main topics: Topic I: Graph-based methods for Information Retrieval, Information Extraction and Text Mining - Graph-based methods for word sense disambiguation, - Graph-based representations for ontology learning, - Graph-based strategies for semantic relations identification, - Encoding semantic distances in graphs, - Graph-based techniques for document navigation and visualization, - and more subtopics; Topic II: New graph-based methods for NLP applications - Random walk methods in graphs, - Spectral graph clustering, - Semi-supervised graph-based methods, - Methods and analyses for statistical networks, - Small world graphs, - Dynamic graph representations , - Topological and pretopological analysis of graphs, - and more subtopics; Topic III: Graph-based methods for applications on social networks - Rumor proliferation, - E-reputation, - Multiple identity detection, - Language dynamics studies, - Surveillance systems, - and more subtopics. 
====================================================== (3) Important Dates ====================================================== Submission deadline: March 18, 2012 Notification of Acceptance: April 15, 2012 Camera-ready deadline: April 30, 2012 Workshop: July 12, 2012 ==================================================== (4) Author Instructions ==================================================== Submissions will consist of (1) regular full papers of maximum 8 pages (for well established research), (2) regular short papers of maximum 4 pages (for on-going research), (3) position papers of maximum 4 pages (for prospective ideas) and (4) demo papers of maximum 4 pages (for software and architectures). One extra page will be accepted for the Reference section for all submission types. TextGraphs-7 will have a blind review policy. In order to facilitate blind reviewing, the authors should omit their names and affiliations from the paper. Furthermore, all self-references that may reveal the author identity must be avoided. Papers that do not conform to these requirements will be rejected without review. All submissions must be in PDF format and must be formatted using the ACL 2012 Style Files available at http://www.acl2012.org/call/sub01.asp. Papers should be submitted to the START Conference Manager available at the following URL: https://www.softconf.com/acl2012/textgraphs-7/. ======================================================= (5) Invited Speaker ======================================================= TextGraphs-7 will have the pleasure of inviting Lada Adamic to present her work on information propagation in blogs and tweets. Lada Adamic is an associate professor in the School of Information at the University of Michigan Ann Arbor. She is also affiliated with the Center for the Study of Complex Systems and the Electrical Engineering and Computer Science Department. 
Her research interests center on information dynamics in networks: how information diffuses, how it can be found, and how it influences the evolution of a network's structure. She worked previously in Hewlett-Packard's Information Dynamics Lab. Her projects have included identifying expertise in online question and answer forums, studying the dynamics of viral marketing, and characterizing the structure in blogs and other online communities. She recently received the 10 years best paper award from the ACM/IEEE/WIC Conference 2011. ================================================== (6) Poster Session ================================================== Authors of accepted papers and demos will be able to present their work on a poster in addition to the oral presentation. The posters will be displayed in the workshop room all day and there will be a short poster session in the workshop schedule. ================================================== (7) Workshop Organisers ================================================== Ahmed Hassan Microsoft Research, Redmond, WA, United States Phone: +14257222849, Email: hassanam at microsoft.com Gaël Dias University of Caen Basse-Normandie, Caen, France Phone: +33231567487 Email: gael.dias at unicaen.fr Irina Matveeva Dieselpoint Inc., Chicago, United States Phone: +17737047852 Email: imatveeva at dieselpoint.com ================================================== (8) Program Committee ================================================== Aitor Soroa, University of the Basque Country, Spain Alessandro Moschitti, University of Trento, Italy Animesh Mukherjee, ISI Foundation, Italy Antoine Widlöcher, University of Caen Basse-Normandie, France Arzucan Özgür, Istanbul University, Turkey Bruno Crémilleux, University of Caen Basse-Normandie, France Chris Biemann, Darmstadt University of Technology, Germany Dragomir Radev, University of Michigan, USA Fabio Massimo Zanzotto, University of Rome, Italy Gina-Anne Levow, University of Washington, 
USA Guillaume Cleuziou, University of Orléans, France Michael Gamon, Microsoft Research, USA Patrick Pantel, Microsoft Research, USA Paul Tarau, University of North Texas, USA Philippe Muller, Paul Sabatier University, France Rada Mihalcea, University of North Texas, USA Swapna Somasundaran, Siemens Corporate Research, USA Torsten Zesch, Darmstadt University of Technology, Germany Uwe Quasthoff, University of Leipzig, Germany Zornitsa Kozareva, ISI, University of Southern California, USA ==================================================== (9) Workshop Contacts ==================================================== For any questions, please email us at textgraphs2012 at gmail.com or look at the web site http://www.textgraphs.org/ws12 ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:58:15 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 21:58:15 +0100 Subject: Job: Post-doc textometrie, SDL Message-ID: Date: Thu, 1 Dec 2011 12:08:21 -0800 (PST) From: Georgeta Cislaru Message-ID: <1322770101.8412.YahooMailNeo at web39424.mail.mud.yahoo.com> X-url: http://www.univ-paris3.fr/anr-ecritures/ Call for applications Post-doctoral contract Position title: "Longitudinal textometric approach to aligned drafts" (« Approche textométrique longitudinale de brouillons alignés ») Location: Université Sorbonne Nouvelle Paris 3, France, 19 rue des Bernardins and 46 rue Saint-Jacques, 75005 Paris. 
Contract start: 1 January 2012 or 1 February 2012 Duration: 12 months Salary: approximately 1900 euros net per month (the salary may vary slightly depending on the candidate's experience and profile). Project: ANR ECRITURES « Brouillons d'écrits sociaux : approche génétique, discursive et textométrique de l'écriture professionnelle » (Drafts of Social Workers' Writings: a Genetic, Discursive and Textometric Study of Professional Writing) Web site: www.univ-paris3.fr/anr-ecritures/ Affiliation: EA 2290 SYLED (CLESTHIA federation), Université Sorbonne Nouvelle Paris 3, France. Candidate profile: holding a doctorate in genetic linguistics, corpus linguistics or natural language processing, the candidate will have mastered the basics of natural language processing (preferably the textometric approach) and will be interested in developing textometric methods and tools. An interest in qualitative analysis methods is also expected (discourse analysis, semantics, lexicology, syntax, etc.). Post-doc duties: automatic processing of the corpus of drafts from a genetic perspective and with a view to developing a longitudinal textometric method for aligned drafts (genetic linguistics and natural language processing); participation in collecting and tagging the corpus; the post-doctoral researcher will contribute to developing the tools and the method and will be responsible for part of the longitudinal analyses. The post-doctoral researcher will disseminate this work through an international publication. Participation in the overall management of the project. Summary of the ECRITURES project Work programme funded by the ANR (Agence nationale de la Recherche), from 1 April 2011 to 31 March 2014. The ECRITURES project proposes a linguistic approach to language performance. It aims to 
account for the impact that linguistic, discursive, social and cognitive constraints have on the written production of social workers in the field of child protection. Fields involved: discourse analysis, corpus linguistics, textual genetics, social cognition, psycholinguistics. Objectives: - To identify, from discursive and genetic analyses of social workers' writings, the linguistic aspects of a writing dynamic that is socially and institutionally framed; - To contribute analytical and methodological reflection with a view to developing writing-tracking software; to develop a longitudinal textometric method for aligned drafts; - To propose a grid and criteria for didactic developments aimed at training in professional writing. Corpus: - a sub-corpus of drafts of social workers' writings (SAFE de Caen) produced with a standard word processor and recording the successive versions of the educational reports; - a sub-corpus produced with writing-tracking software (Inputlog); - a "control" corpus (academic reports, possibly other genres). Impact beyond the discipline: In addition to other disciplines such as psychology or sociology, the results of this project will also be of interest to social workers, in collaboration with whom it is being carried out. To apply, please send the following to Serge Fleury (serge.fleury at univ-paris3.fr) and Georgeta Cislaru (georgeta.cislaru at univ-paris3.fr): - a cover letter; - a detailed CV; - the thesis defense report and a summary of the thesis. 
Deadline: 15 December 2011 ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 20:59:45 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 21:59:45 +0100 Subject: Appel: NAACL 2012 Workshop on Computational Linguistics for Literature Message-ID: Date: Fri, 02 Dec 2011 15:26:52 -0500 From: Stan Szpakowicz Message-ID: <4ED9348C.4060409 at eecs.uottawa.ca> X-url: http://www.naaclhlt2012.org/conference/conference.php Workshop on Computational Linguistics for Literature Co-located with The 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies June 7 or 8, 2012 Montréal, Québec, Canada All information, including announcements and updates, can be found on the workshop's Web site: https://sites.google.com/site/clfl2012/ MOTIVATION AND SCOPE The amount of literary material available on-line keeps growing rapidly. Not only are there machine-readable texts in libraries, collections and e-book stores, but there is also more and more "live" literature: e-zines, blogs, self-published e-books and so on. There is a need for tools to help users navigate, visualize and appreciate the high volume of available literature. Literary texts are quite different from technical and formal documents, which have been the focus of NLP research thus far. Most forms of statistical language processing rely on lexical information in one way or another. 
In literature, the primary mode is narrative rather than exposition. Stories may be cognitively easier to read than certain expository genres, such as scientific documents, but narrative is a challenging form of discourse for NLP tools and methods. For instance, literary prose lacks the overt lexical clues and structural markers typically leveraged in the processing of more structured genres. Also, even conventional literary texts exhibit far less unity of time, space and topic than most formal discourse. Learning to handle these challenges in literary data may help move past heavy reliance on surface clues in general. Literature also differs from other genres because of the needs of its typical audience. For instance, reading, searching or browsing literature online is a different task from searching for the latest news on a particular topic. Search criteria would be rather abstract: not a keyword, but a literary style, similarity to another work, point of view and so on. When looking for a summary or a digest, a reader may prefer to know or visualize a text's broad characteristics rather than facts that summarize the plot. We invite papers that touch upon these areas, but also welcome other ideas which promote the processing of literary narrative or related forms of discourse. 
TOPICS OF INTEREST Note: Papers on other closely related topics will also be considered * the needs of the readers and how those needs translate into meaningful NLP tasks; * searching for literature; * recommendation systems for literature; * computational modelling of narratives, computational narratology; * summarization of literature; * differences between literature and other genres as relevant to computational linguistics; * discourse structure in literature; * emotion analysis for literature; * profiling and authorship attribution; * identification and analysis of literature genres; * building and analysing social networks of characters; * generation of literary narrative, dialogue or poetry; * modelling dialogue literary style for generation. SUBMISSION We invite submission of long and short papers, describing completed or ongoing research on systems, studies, theories and models which can inform the area of computational linguistics for literature. Long papers should be at most 8 pages, plus unlimited space for references. Short papers should be at most 4 pages plus references, and can be appropriate for either oral or poster presentation. Accepted long papers, and perhaps selected short papers, will be presented as talks. In addition, we encourage submission of position papers -- mapping out research ideas and programs -- of up to 6 pages plus references. There will be double-blind review: submissions must be anonymized. Style files and sample PDFs are available on this page: http://www.naaclhlt2012.org/conference/conference.php Submission page: please visit later IMPORTANT DATES (all deadlines 11:59 pm Hawaii Time) Submission deadline: February 20, 2012 Notification of acceptance: March 23, 2012 Camera-ready version due: April 10, 2012 Workshop: June 7 or June 8, 2012 PROGRAM COMMITTEE * Cecilia Ovesdotter Alm (Rochester Institute of Technology) * Nicholas Dames (Columbia University) * Hal Daumé 
III (University of Maryland) * Anna Feldman (Montclair State University) * Mark Finlayson (MIT) * Pablo Gervás (Universidad Complutense de Madrid) * Roxana Girju (University of Illinois at Urbana-Champaign) * Amit Goyal (University of Maryland) * Katherine Havasi (MIT Media Lab) * Matthew Jockers (Stanford University) * James Lester (North Carolina State University) * Inderjeet Mani (Children's Organization of Southeast Asia) * Kathy McKeown (Columbia University) * Saif Mohammad (National Research Council, Canada) * Vivi Nastase (HITS gGmbH) * Rebecca Passonneau (Columbia University) * Livia Polanyi (LDM Associates) * Owen Rambow (Columbia University) * Michaela Regneri (Saarland University) * Reid Swanson (University of California, Santa Cruz) * Marilyn Walker (University of California, Santa Cruz) * Janice Wiebe (University of Pittsburgh) CO-ORGANIZERS * David Elson (Google) * Anna Kazantseva (University of Ottawa) * Rada Mihalcea (University of North Texas) * Stan Szpakowicz (University of Ottawa) CONTACT INFORMATION Send general inquiries to clfl.workshop at gmail.com ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:03:05 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:03:05 +0100 Subject: Ecole: WSLST 2012 Message-ID: Date: Sun, 4 Dec 2011 11:03:06 +0100 From: "GRLMC" Message-ID: <8426A6D3003445D1A0850B9DFC44555F at Carlos1> X-url: http://grammars.grlmc.com/wslst2012/ 
********************************************************************* 2012 INTERNATIONAL WINTER SCHOOL IN LANGUAGE AND SPEECH TECHNOLOGIES WSLST 2012 (formerly International PhD School in Language and Speech Technologies) Tarragona, Spain January 23-27, 2012 Organized by: Research Group on Mathematical Linguistics (GRLMC) Rovira i Virgili University http://grammars.grlmc.com/wslst2012/ ********************************************************************* AIM: WSLST 2012 offers a broad and intensive series of lectures on language and speech technologies at different levels. The students choose their preferred courses according to their interests and background. Instructors are top names in their respective fields. The School intends to help students initiate and foster their research career. The previous event in this series was SSLST 2011 (http://grammars.grlmc.com/sslst2011/). ADDRESSED TO: Undergraduate and graduate students from around the world. Most appropriate degrees include: Computer Science and Linguistics. Other students (for instance, from Mathematics, Electrical Engineering, Philosophy, or Cognitive Science) are welcome too. The School is appropriate also for people more advanced in their career who want to keep themselves updated on developments in the field. There is no overlap in the class schedule. COURSES AND PROFESSORS: - Eneko Agirre (U Basque Country), Semantic Processing of Text: Word Sense Disambiguation, Entity Linking and Semantic Similarity [introductory/intermediate, 8 hours] - William J. Byrne (Cambridge), Weighted Finite State Transducers in Statistical Machine Translation [introductory/advanced, 6 hours] - Marcello Federico (Fondazione Bruno Kessler, Trento), Statistical Language Modeling [introductory/advanced, 8 hours] - Ralph Grishman (New York), Information Extraction [intermediate, 8 hours] - John Nerbonne (Groningen), Using Edit-Distance to Understand Linguistic Variation [introductory/advanced, 4 hours] - Geoffrey K. 
Pullum (Edinburgh), The Formal Properties of Human Languages: Description with a View to Implementation [introductory/intermediate, 8 hours] - Jian Su (Institute for Infocomm Research, Singapore), Coreference Resolution and Discourse Relation Recognition [advanced, 4 hours] - Christoph Tillmann (IBM T.J. Watson Research Center), Simple and Effective Algorithms and Models for Non-hierarchical Statistical Machine Translation [intermediate, 6 hours] - David R. Traum (U Southern California), Approaches to Dialogue Systems and Dialogue Management [introductory, 8 hours] - Dekai Wu (Hong Kong U of Science and Technology), Syntactic and Semantic Statistical Machine Translation [introductory/advanced, 8 hours] REGISTRATION: Registration must be completed online at http://grammars.grlmc.com/wslst2012/Registration.php FEES: Fees vary with the number of courses each student takes. The rule is: 1 hour = - 10 euros (for payments until November 13, 2011), - 15 euros (for payments after November 13, 2011). PAYMENT PROCEDURE: The fees must be paid to the School's bank account: Uno-e Bank (Julian Camarillo 4 C, 28037 Madrid, Spain): IBAN: ES3902270001820201823142 - Swift code: UNOEESM1 (account holder: Carlos Martin-Vide GRLMC; account holder's address: Av. Catalunya, 35, 43002 Tarragona, Spain) Please mention WSLST 2012 and your full name in the subject. A receipt will be provided on site. Remarks: - Bank transfers should not involve any expense for the School. - People claiming early registration will be requested to prove that they gave the bank transfer order by the deadline. - Students may be refunded only if a course is cancelled due to the unavailability of the instructor. People registering on site at the beginning of the School must pay in cash. For the sake of local organization, however, registering earlier is strongly recommended. ACCOMMODATION: Information about accommodation is available on the website of the School. 
CERTIFICATES: Students will receive a certificate stating the courses attended, their contents, and their duration. IMPORTANT DATES: Announcement of the programme: September 26, 2011 Starting of the registration: September 26, 2011 Early registration deadline: November 13, 2011 Starting of the School: January 23, 2012 End of the School: January 27, 2012 QUESTIONS AND FURTHER INFORMATION: Florentina-Lilica Voicu: florentinalilica.voicu at urv.cat WEBSITE: http://grammars.grlmc.com/wslst2012/ POSTAL ADDRESS: WSLST 2012 Research Group on Mathematical Linguistics (GRLMC) Rovira i Virgili University Av. Catalunya, 35 43002 Tarragona, Spain Phone: +34-977-559543 Fax: +34-977-558386 ACKNOWLEDGEMENTS: Diputació de Tarragona Universitat Rovira i Virgili ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:05:02 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:05:02 +0100 Subject: These: Juliette Kahn, Performance et confiance en identification biometrique vocale Message-ID: Date: Mon, 05 Dec 2011 09:27:19 +0100 From: Juliette KAHN Message-ID: <20111205092719.18459ficlldm6vs7 at webmail.univ-avignon.fr> Hello everyone, I am pleased to invite you to my thesis defense, which will take place on December 19, 2011 at 2:30 pm in the Blaise Pascal lecture hall of the Centre d'Enseignement et de Recherche en Informatique (CERI) of the Université 
d'Avignon et des Pays de Vaucluse (339, chemin des Meinajaries-Agroparc BP 91228-84911 AVIGNON Cedex 9-FRANCE). You are also welcome at the reception that will follow the defense. Thesis title Parole de locuteur : performance et confiance en identification biométrique vocale (Speaker's speech: performance and confidence in voice biometric identification) Abstract This thesis explores the biometric use of speech, which has many applications (security, smart environments, forensics, territorial surveillance, and authentication of electronic transactions). Speech is subject to many constraints that depend on the speaker's origins (geographical, social and cultural) as well as on the speaker's performative goals. The speaker can be regarded as one factor of speech variation among others. In this work, we offer partial answers to the following two questions: - Are all speech excerpts from the same speaker equivalent for recognizing that speaker? - How are the different sources of variation that directly or indirectly convey speaker specificity structured? We first build a protocol to assess the human ability to discriminate a speaker from a speech excerpt, using data from the NIST-HASR 2010 campaign. The task as posed is difficult for our listeners, whether naive or more experienced. In this setting, we show that neither the (near-)unanimity of listeners nor their self-assessment of their judgments guarantees that the submitted answer is correct. We then quantify the influence of the choice of speech excerpt on the performance of automatic systems. We used two databases, NIST and BREF, and two automatic speaker recognition (RAL, reconnaissance automatique du locuteur) systems, ALIZE/SpkDet (LIA) and Idento (SRI). 
The speaker recognition systems, whether based on a UBM-GMM approach or on an i-vector approach, show large performance gaps, measured by a rate of variation around the mean EER, Vr (for NIST, VrIdento = 1.41 and VrALIZE/SpkDet = 1.47; for BREF, Vr = 3.11), depending on the choice of training file used for each speaker. These very large performance variations show how sensitive automatic systems are to the choice of speech excerpts, a sensitivity that must be measured and reduced to make speaker recognition systems more reliable. To explain why the choice of speech excerpts matters, we look for the cues most relevant for distinguishing the speakers in our corpora by measuring the effect of the Speaker factor on the variance of the cues. F0 depends strongly on the Speaker factor, regardless of the vowel. Some phonemes are more speaker-discriminative: nasal consonants, fricatives, nasal vowels, and close-mid to open oral vowels. This work is a first step toward a more precise study of what the speaker is, both for human perception and for automatic systems. Although we have shown that there is indeed a cepstral difference that leads to more or less effective models, it remains to be understood how to link the speaker to speech production. Finally, following this work, we would like to explore in more detail the influence of language on speaker recognition. Although our results indicate that the same phoneme categories carry the most speaker information in both American English and French, this point remains to be confirmed and assessed for other languages. Committee Ms. Régine ANDRE-OBRECHT, Professor, IRIT, Toulouse, Reviewer; Mr. Anders ERIKSSON, Professor, University 
of Gothenburg, Gothenburg (Sweden), Reviewer; Ms. Martine ADDA-DECKER, Professor, LPP, Paris, Committee chair; Mr. Edouard GEOFFROIS, Engineer, DGA, Paris, Examiner; Ms. Solange ROSSATO, Associate Professor (Maître de Conférences), LIG, Grenoble, Co-supervisor; Mr. Jean-François BONASTRE, Professor, LIA, Avignon, Thesis supervisor Juliette Kahn ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:07:07 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:07:07 +0100 Subject: Conf: Concept Types and Frames 2012, Duesseldorf Message-ID: Date: Mon, 05 Dec 2011 15:43:25 +0100 From: Timm Lichte Message-ID: <4EDCD88D.1000309 at phil.uni-duesseldorf.de> X-url: http://www.sfb991.uni-duesseldorf.de/concept-types-and-frames-in-language-cognition-and-science/ X-url: http://www.sfb991.uni-duesseldorf.de/en/sfb991/ First Announcement. A call for papers will be issued in January. 
======================================================================== International Interdisciplinary Conference "Concept Types and Frames in Language, Cognition, and Science" (CTF'12) ======================================================================== Düsseldorf (GER), August 22-24, 2012 http://www.sfb991.uni-duesseldorf.de/concept-types-and-frames-in-language-cognition-and-science/ Organized and financed by DFG Coordinated Research Centre CRC 991/SFB 991 "The Structure of Representations in Language, Cognition, and Science" 
http://www.sfb991.uni-duesseldorf.de/en/sfb991/ Aims and scope --------------- A sequel to CTF'07 and CTF'09, CTF'12 is the third interdisciplinary conference to contribute to the development of a general frame theory of human concepts. The conference explores the application of frames in linguistics and other sciences. It discusses foundational issues for the development of a theory of frames from the perspectives of general and computational linguistics, mathematics, cognitive psychology, cognitive neuroscience, ontology, philosophy of mind, and philosophy of science. Invited speakers ----------------- t.b.a. ======================================================================== Frames ------- CTF'12 explores fundamental aspects of a formal theory of frames understood as recursive attribute-value structures with functional attributes. The aspects include representation and model-theoretic interpretation, attributes and value types, and appropriate frame structures for different types of concepts. A special focus will be on dynamic aspects of frame theory: modeling dynamic frame components for event and process frames; exploring operations on frames such as type shifts and frame composition. Cognition ---------- CTF'12 explores empirical evidence for and consequences of the frame approach. It addresses the distinction of types of concepts, in particular relational and functional nominal concepts. Major topics include the grounding of concepts in the sensory-motor system and dynamic aspects of concept formation. Language --------- CTF'12 explores the application of a formalized theory of frames at various levels of linguistic description: lexical semantics of the different conceptual types of nouns and verbs, deep lexical decomposition, semantics of word formation, the syntax-semantics interface, syntactic and semantic composition, and anaphora and coherence in discourse. 
Science -------- CTF'12 explores the applications of the frame approach in various fields of institutional categorization: e.g., the evolution of the frame notion in the history of philosophy; scientific frames and paradigm shifts; frames for legal concepts; frames in applied ontology; a frame model of the mind; and frame description of psychiatric disorders. ======================================================================== Important dates ---------------- Submission of abstracts (1000 words): March 15, Notification of acceptance: May 30 Programme committee -------------------- Sebastian Löbner (general chair), Rainer Osswald, Gottfried Vosgerau Organizational committee ------------------------- Dorothea Brenner, David Hommen, Timm Lichte, Daniel Schulzek ========================================================================= Topics for the CTF'12 Language --------- noun types and determination - definiteness and indefiniteness - possession - usage - processing - grammaticalization conceptual shifts of noun meanings - shifts and coercion - diachronic shifts decompositional frames for verbs and the syntax-semantics interface - verb classes in terms of frame types - verbal concepts and linking - construction-based decomposition of verb meanings - aspectual composition decompositional frames for nouns - frame analysis of legal terms - affordance attributes in noun frames and qualia syntactic and semantic composition - nominal concept types in the theory of semantic composition - semantic composition in terms of frames - syntax-driven frame composition frames in the semantics of word-formation - nominal compounding as frame combination - deverbal nouns as the result of frame transformation frames in discourse - frames and associative anaphora - the role of frames in discourse coherence Science -------- concepts - emergence of the modern frame notion of concepts - architecture of the mind - the role of function in object categorization frame analysis in 
institutional categorization - frames and paradigm shifts - frame analysis of legal concepts - frame analysis of psychiatric disorders - frame analysis in other scientific fields concept types and frames in ontology - attributes in ontology - relations and functions in ontology - formal approaches to ontology using functional relations Cognition ---------- embodiment of concepts - emergence of the modern frame notion of concepts - grounding of verb meanings in the sensory-motor system - affordances and the grounding of noun meanings in the sensory-motor system - common coding and simulation - frames and conceptual processing types of concepts - empirical evidence for different kinds of concepts - cognitive representation of dynamic concepts, in particular verb meanings neurocognition - frames and conceptual processing - neural correlates of attributes and frames - computational modelling of concept formation Frames ------- elements of frame theory - modelling the dynamics of verb meanings - modelling general operations on frames - modelling frame composition frame-related spaces - mathematical properties of value spaces - mathematical properties of attribute spaces - mathematical properties of frame spaces large frame systems - design and implementation ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:11:08 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:11:08 +0100 Subject: Appel: EACL 2012 Joint Workshop on 
Information Retrieval and Hybrid Machine Translation Message-ID: Date: Tue, 6 Dec 2011 11:55:17 +0100 From: "Reinhard Rapp" Message-ID: X-url: http://www-lium.univ-lemans.fr/esirmt-hytra Apologies for multiple postings Please distribute to colleagues ======================================================================== Joint Workshop on EXPLOITING SYNERGIES BETWEEN INFORMATION RETRIEVAL AND MACHINE TRANSLATION (ESIRMT) and HYBRID APPROACHES TO MACHINE TRANSLATION (HyTra) Co-located with EACL 2012 (http://eacl2012.org/) Avignon, France April 23 and 24, 2012 Deadline for paper submissions: February 2, 2012 http://www-lium.univ-lemans.fr/esirmt-hytra ======================================================================== This two-day workshop addresses two specific but related research problems in computational linguistics. The ESIRMT event (1st day) aims at reducing the gap, both theoretical and practical, between information retrieval and machine translation research and applications. Although both fields have been already contributing to each other instrumentally, there is still too much work to be done in relation to solidly framing these two fields into a common ground of knowledge from both the procedural and paradigmatic perspectives. The HyTra event (2nd day) aims at sharing ideas among researchers developing and applying statistical, example-based, or rule-based machine translation systems and who wish to enhance their systems with elements from the other approaches. The joint workshop will provide participants with the opportunity of discussing research related to technology integration and system combination strategies at both the general level of cross-language information access and the specific level of machine translation technologies. Contributions are to be organized into two tracks, corresponding to ESIRMT and HyTra, respectively. 
Topics of interest include, but are not limited to: ESIRMT track: * machine translation and information retrieval hybridization * applications of MT and IR hybrid systems * IR techniques integrated in MT systems * MT techniques integrated in IR systems * any kind of innovative approach exploiting synergies between MT and IR * machine learning techniques for ranking HyTra track: * ways and techniques of MT hybridization * architectures for the rapid development of hybrid MT systems * hybrid systems dealing with underresourced languages and/or with morphologically rich languages * using linguistic information (morphology, syntax, semantics) to enhance statistical MT * bootstrapping rule-based systems from corpora * hybrid methods in spoken language translation * extraction of dictionaries from parallel and comparable corpora * machine learning techniques for hybrid MT * heuristics for limiting the search space in hybrid MT * alternative methods for the fair evaluation of the output of different types of MT systems * system combination approaches * open source tools and free language resources for hybrid MT SUBMISSION INSTRUCTIONS Authors are invited to submit papers on original and previously unpublished work. Formatting should be according to EACL 2012 specifications using LaTeX or MS-Word style files, see http://eacl2012.org/information-for-authors/index.html (paper length: up to 9 pages plus an extra page for references). Reviewing of papers will be double-blind, so the submissions should not reveal the authors' identity. Submission is electronic in PDF format using the START submission system at https://www.softconf.com/eacl2012/ESIRMT-HyTra/ Double submission policy: Parallel submission to other meetings or publications are possible but must be immediately notified to the workshop contact person (see below). Authors of accepted papers will be invited to present their research at the workshop. The papers will be published in the electronic workshop proceedings. 
These will be part of the EACL proceedings, published by ACL. They will also be made available online via the ACL Anthology. IMPORTANT DATES February 2, 2012: Paper submissions due February 24, 2012: Notification of acceptance March 9, 2012: Camera ready papers due April 23-24, 2012: Workshop in Avignon ORGANIZERS Contact person: Marta R. Costa-jussà (e-mail: marta.ruiz at barcelonamedia.org ) ESIRMT: Marta R. Costa-jussà (Barcelona Media Innovation Center), Patrik Lambert (University of Le Mans), Rafael E. Banchs (Institute for Infocomm Research) HyTra: Reinhard Rapp (Universities of Mainz and Leeds), Bogdan Babych (University of Leeds), Kurt Eberle (Lingenio GmbH), Tony Hartley (Toyohashi University of Technology and University of Leeds), Serge Sharoff (University of Leeds), Martin Thomas (University of Leeds) INVITED SPEAKERS Philipp Koehn (University of Edinburgh) TBA PROGRAMME COMMITTEE Jordi Atserias, Yahoo! Research, Barcelona, Spain Bogdan Babych, University of Leeds, UK Sivaji Bandyopadhyay, Jadavpur University, Kolkata, India Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland Chris Callison-Burch, Johns Hopkins University, Baltimore, USA Michael Carl, Copenhagen Business School, Denmark Oliver Culo, ICSI, University of California, Berkeley, USA Kurt Eberle, Lingenio GmbH, Heidelberg, Germany Andreas Eisele, Directorate-General for Translation, European Commission, Luxembourg Marcello Federico, Fondazione Bruno Kessler, Trento, Italy José A. R. 
Fonollosa, Universitat Politècnica de Catalunya, Barcelona, Spain Mikel Forcada, University of Alicante, Spain Alexander Fraser, Institute for Natural Language Processing (IMS), Stuttgart, Germany Johanna Geiß, Lingenio GmbH, Heidelberg, Germany Mireia Ginesti-Rosell, Lingenio GmbH, Heidelberg, Germany Silvia Hansen-Schirra, FTSK, University of Mainz, Germany Tony Hartley, Toyohashi University of Technology, Japan, and University of Leeds, England Gareth Jones, Dublin City University, Ireland Min-Yen Kan, National University of Singapore Udo Kruschwitz, University of Essex, UK Yanjun Ma, Baidu Inc., Beijing, China Maite Melero, Barcelona Media Innovation Center, Barcelona, Spain Haizhou Li, Institute for Infocomm Research, Singapore Reinhard Rapp, University of Mainz, Germany, and University of Leeds, UK Paul Schmidt, Institute for Applied Information Science, Saarbrücken, Germany Uta Seewald-Heeg, Anhalt University of Applied Sciences, Köthen, Germany Serge Sharoff, University of Leeds, UK Wade Shen, Massachusetts Institute of Technology, Cambridge, USA Fabrizio Silvestri, Istituto di Scienza e Tecnologie dell'Informazione, Pisa, Italy Harold Somers, CNGL, Dublin City University, Ireland Anders Søgaard, University of Copenhagen, Denmark Martin Thomas, University of Leeds, UK Jörg Tiedemann, University of Uppsala, Sweden Zygmunt Vetulani, University of Poznan, Poland ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Tue Dec 6 21:14:30 2011 From: thierry.hamon at 
UNIV-PARIS13.FR (Thierry Hamon) Date: Tue, 6 Dec 2011 22:14:30 +0100 Subject: Job: 2 opportunities at the Health On the Net (HON) Foundation in Geneva Message-ID: Date: Tue, 6 Dec 2011 21:11:48 +0100 (CET) From: "Natalia GRABAR" Message-ID: X-url: http://www.hon.ch/Global/job_application_NLP.html X-url: http://www.hon.ch/Global/job_application_developer.html Job opportunities at the Health On the Net (HON) Foundation in Geneva: - Research NLP, text processing, information retrieval - Developer Please send your CV, letter of motivation, and salary expectations with the subject header 'JOB NLP' to HONsecretariat at healthonnet.org. =================================== Research NLP, text processing, information retrieval http://www.hon.ch/Global/job_application_NLP.html =================================== Location: Geneva, Switzerland Type: fulltime Job description: The Health On the Net (HON) Foundation, a Geneva-based non-governmental international organization dedicated to improving the quality of on-line health information for patients as well as health professionals is looking for an NLP expert. We are looking for somebody who combines an interest in NLP research with good programming skills. 
Key responsibilities: * Mapping out the research agenda, participating in (EU) research projects, writing deliverables and scientific articles, attending project meetings, giving presentations * Design and development of innovative techniques for improving search quality on text and multimedia formats - document classification, (multi-lingual) IR, automatic readability assessment and detection of ethical criteria in non-structured health web documents * Building fully scalable prototypes to test new algorithms and migrating from the research prototype to delivering the solution into production Desired experience and skills: * Post-graduate degree in Computer Science, Computational Linguistics, or 3+ years experience in R&D * Background in information retrieval, document classification, or question answering * Experience in scripting languages (Perl/Python) * Experience in multi-threaded programming and web-service applications is a great plus * Good command of English, other languages are a plus We offer: * An international research environment * A multi-disciplinary team of NLP researchers, web developers, health professionals etc. * A group active in the international NLP research scene, currently participating in the EU project Khresmoi * The opportunity to develop programming as well as research skills More information on www.healthonnet.org. Please send your CV, letter of motivation, and salary expectations with the subject header 'JOB NLP' to HONsecretariat at healthonnet.org. =================================== Developer http://www.hon.ch/Global/job_application_developer.html =================================== Location: Geneva Switzerland Type: fulltime HON offers a wide range of services to citizens, medical professionals and health care providers. The HON Web development team makes HON services available to millions of users in order to access health websites respecting a set of quality criteria: the HONcode. 
The team combines software development and Unix system administration expertise to run multilingual services available in 102 countries. We are looking for a software developer to contribute to major development work in a multidisciplinary team. Key responsibilities: * manage a large scale crawling robot and search engine dedicated to health sites * support NLP team on software development aspects, from research to production * act as software support on many projects (CMS, log analysis etc.) Desired experience and skills * engineering or bachelor's degree in Computer Science with at least 2 years of experience, or provable very strong programming achievements; * applied Agile methodology experience & continuous testing/integration; * programming language: Perl in large structured project and J2EE (web application); * JavaScript (jQuery) & CSS; * search engine and web crawling experience is a plus; * knowledge of Solr/Lucene; * any open source contribution is a plus. You are responsible and organized; creative and proactive. You are fluent in French with a very good knowledge of written and spoken English; German is a plus. Post to be filled immediately. 
Please submit your CV, letter of motivation, and salary expectations to HONsecretariat at healthonnet.org

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 7 10:57:13 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 7 Dec 2011 11:57:13 +0100
Subject: Call: EACL 2012 Workshops - Multiple Call For Papers (10 workshops)
Message-ID:
Date: Mon, 05 Dec 2011 14:45:29 +0100
From: Frederic Bechet
Message-ID: <4EDCCAF9.9000606 at lif.univ-mrs.fr>
X-url: http://eacl2012.org
X-url: http://stp.lingfil.uu.se/atanlp/
X-url: http://www.lingured.info/clw2012/
X-url: http://sites.google.com/site/eaclcogws/
X-url: http://www.chss.montclair.edu/linguistics/DeceptionDetection.html
X-url: http://www-lium.univ-lemans.fr/esirmt-hytra
X-url: http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/
X-url: http://ilk.uvt.nl/LaTeCH2012/
X-url: http://www.eecs.uottawa.ca/~diana/eacl2012_social_media_workshop.html

Call for papers - EACL 2012 workshops - April 23-24, Avignon, France
http://eacl2012.org

Ten workshops will be held in conjunction with the EACL 2012 conference in Avignon. They are listed below, each with a link to its individual Call for Papers.
----
- EACL 2012 Workshop on Applications of Tree Automata Techniques in Natural Language Processing (ATANLP 2012)
  * April 24, 2012, Avignon, France
  * http://stp.lingfil.uu.se/atanlp/
  * Submission deadline for papers: January 27, 2012
- Second Workshop on Computational Linguistics and Writing (CL&W 2012): Linguistic and cognitive aspects of document creation and document engineering
  * April 23 or 24, 2012, France
  * http://www.lingured.info/clw2012/
  * Submission deadline: January 27, 2012
- EACL 2012 Workshop on Computational Models of Language Acquisition and Loss
  * April 23 or 24, 2012, France
  * http://sites.google.com/site/eaclcogws/
  * Submission deadline: January 20, 2012
- EACL 2012 Workshop on Computational Approaches to Deception Detection
  * April 23, 2012, France
  * http://www.chss.montclair.edu/linguistics/DeceptionDetection.html
  * Submission deadline: January 27, 2012
- Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra)
  * April 23 and 24, 2012, Avignon, France
  * http://www-lium.univ-lemans.fr/esirmt-hytra
  * Submission deadline: February 2, 2012
- EACL 2012 Workshop on Innovative hybrid approaches to the processing of textual data
  * April 23 and 24, 2012, Avignon, France
  * http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/
  * Submission deadline: January 4, 2012 (abstract deadline)
- EACL 2012 Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2012)
  * April 24, 2012, Avignon, France
  * http://ilk.uvt.nl/LaTeCH2012/
  * Submission deadline: January 20, 2012
- EACL 2012 Joint Workshop of LINGVIS & UNCLH: Visualization of Linguistic Patterns and Uncovering Language History from Multilingual Resources
  * April 23-24, 2012, Avignon, France
  * https://sites.google.com/site/lingvisunclh/
  * Submission deadline: January 30, 2012
- ROBUS-UNSUP: Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
  * April
23-24, 2012, Avignon, France
  * https://sites.google.com/site/robusunsupeacl2012/home
  * Submission deadline: January 27, 2012
- EACL 2012 Workshop on Semantic Analysis in Social Networks
  * April 23, 2012, Avignon, France
  * http://www.eecs.uottawa.ca/~diana/eacl2012_social_media_workshop.html
  * Submission deadline: January 30, 2012

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 7 10:59:06 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 7 Dec 2011 11:59:06 +0100
Subject: Call: AI 2012
Message-ID:
Date: Tue, 06 Dec 2011 20:59:04 -0500
From: Leila Kosseim
Message-id: <4EDEC868.2080600 at cs.concordia.ca>

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Apologies for cross-posting. Please distribute widely.

=========================== Call for Papers =========================
AI 2012: The 25th Canadian Conference on Artificial Intelligence
www.canadianai.ca
May 28th to May 30th, 2012
York University, Toronto, Ontario, Canada
========================================================================

IMPORTANT DATES
---------------
Full paper submission deadline: January 16th, 2012
Notification of acceptance: February 20th, 2012
Final papers due: March 5th, 2012
Conference: May 28th to May 30th, 2012

CALL FOR PAPERS
---------------
AI'2012, the twenty-fifth Canadian Conference on Artificial Intelligence, invites papers that present original work in all areas of Artificial Intelligence, either theoretical or applied.
Topics of interest include, but are not limited to:

  Agent Systems
  AI Applications
  Automated Reasoning
  Bioinformatics and BioNLP
  Case-based Reasoning
  Cognitive Models
  Constraint Satisfaction
  Data Mining
  E-Commerce
  Evolutionary Computation
  Games
  Information Retrieval
  Knowledge Representation
  Machine Learning
  Multi-media Processing
  Natural Language Processing
  Neural Nets
  Planning
  Robotics
  Search
  Smart Graphics
  Uncertainty
  User Modeling
  Web Applications

Papers will be reviewed by the Program Committee members according to their originality, technical merit and clarity of presentation. Each accepted paper will be allocated a maximum of 12 pages in the proceedings. All accepted papers for which one of the authors has registered and is present at the conference will be published in the conference proceedings as Lecture Notes in Artificial Intelligence by Springer (www.springer.com/lncs). Papers submitted to AI'2012 must not have been accepted for publication elsewhere or be under review for another conference.

BEST PAPER AWARD
----------------
The Best Paper Award will be presented at the conference to the authors of the best paper.

SUBMISSIONS
-----------
Authors are invited to submit electronically by January 16th, 2012. Submissions must be uploaded on Easychair:
https://www.easychair.org/account/signin.cgi?conf=canadianai2012

Paper format:
- Papers must be written in English.
- Papers must be up to 12 pages in length.
- Papers must be in PDF format.
- Papers must be formatted according to Springer LNCS style. The use of the LaTeX2e style file available from Springer is strongly encouraged.

ORGANIZERS
----------
AI'2012 is collocated with two cognate conferences: the Canadian Graphical Interface Conference and the Canadian Conference on Computer and Robot Vision.
General Co-chairs AI/GI/CRV:
  Chrysanne Dimarco, University of Waterloo, Waterloo
  John Barron, University of Western Ontario, London
Program Co-chairs:
  Leila Kosseim, Concordia University, Montreal
  Diana Inkpen, University of Ottawa, Ottawa
Industry Chair:
  Atefeh Farzindar, NLP Technologies Inc. & Université de Montréal
Local Arrangement Chair:
  Yves Lespérance, York University, Toronto
Graduate student symposium co-chairs:
  Ebrahim Bagheri, Athabasca University, Athabasca
  Jocelyne Faddoul, Concordia University, Montreal

Please mark the dates in your calendar. We hope to receive your submissions and see you at the conference.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:40:07 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 10:40:07 +0100
Subject: Thesis: Ludovic Jean-Louis, Information extraction based on deep linguistic analysis
Message-ID:
Date: Wed, 7 Dec 2011 16:39:02 +0100
From: "Ludovic Jean-Louis"
Message-ID: <4edf88a6.59030e0a.2507.ffffb884 at mx.google.com>

I am pleased to invite you to my thesis defense, entitled:

"Extraction d'information à partir de textes fondée sur une analyse linguistique profonde" (Information extraction from texts based on deep linguistic analysis)

The defense will take place on Thursday, December 15, 2011 at 2:30 pm in the council room (salle du conseil) at LIMSI, building 508 of Université Paris XI (LIMSI - CNRS B.P. 133 91403 ORSAY).

The jury will consist of:
- Mr.
Patrice Bellot, Reviewer (Université Aix-Marseille, LSIS)
- Ms. Adeline Nazarenko, Reviewer (Université Paris Nord, LIPN)
- Mr. Claude de Loupy, Examiner (Syllabs)
- Mr. Pierre Zweigenbaum, Examiner (Université Paris Sud, LIMSI)
- Mr. Olivier Ferret, Thesis director (CEA LIST, LVIC)
- Mr. Romaric Besançon, Supervisor (CEA LIST, LVIC)

You are cordially invited to the reception that will follow.

Summary:
Most of the information freely available on the Web is in textual, i.e. unstructured, form. In a context such as technology watch, it is very useful to be able to present the information contained in texts in a structured form, focusing on the information judged relevant to the domain of interest under consideration. However, when this information has to be processed systematically, manual methods are not feasible because of the large volume of data to consider. Information extraction aims to automate this type of task by identifying in texts the information concerning facts (or events) in order to store it in predefined data structures. These structures, called templates (or forms), aggregate the information characteristic of an event or of a domain of interest, represented in the form of named entities (place names, etc.).

In this context, the thesis work we have carried out addresses two main issues:
- identifying the information related to an event when this information is scattered across a text that contains several occurrences of events of the same type;
- reducing the dependence on annotated corpora when building an information extraction system.

Concerning the first issue, we have proposed an original two-step approach.
The first step is an event-based segmentation that identifies, within a document, the text zones referring to the same type of event, relying on temporal information. This segmentation thus determines the zones on which the extraction process must focus. The second step selects, within the segments identified as relevant, the entities associated with the events. To do so, it combines an extraction of relations between entities at a local level with a global merging process that produces an entity graph. A disambiguation process is finally applied to this graph to identify the entity filling a given role with respect to an event when several candidates are possible.

The second issue is addressed in the context of knowledge base population from large document collections (several million documents), considering a large number (about forty) of types of binary relations between named entities. Given the effort required to annotate a corpus for a given relation type, and the number of relation types considered, the goal here is to avoid such annotation as far as possible while retaining a learning-based approach. This goal is achieved through a so-called distant supervision approach, which takes as its starting point examples of relations drawn from a knowledge base and performs an unsupervised annotation of a corpus according to these relations, in order to build a set of annotated relations used to train a model. This approach was evaluated at large scale on the data of the TAC-KBP 2010 campaign.

Keywords: Information extraction, relation extraction, event extraction

Abstract:
The major part of the information available on the web is provided in textual form, i.e. in unstructured form.
In a context such as technology watch, it is useful to present the information extracted from a text in a structured form, reporting only the pieces of information that are relevant to the considered field of interest. Such processing cannot be performed manually at large scale, given the large amount of data available. The automated processing of this task falls within the Information Extraction (IE) domain. The purpose of IE is to identify, within documents, pieces of information related to facts (or events) in order to store this information in predefined data structures. These structures, called templates, aggregate fact properties (often represented by named entities) concerning an event or an area of interest.

In this context, the research performed in this thesis addresses two problems:
- identifying information related to a specific event, when the information is scattered across a text and several events of the same type are mentioned in the text;
- reducing the dependency on annotated corpora for the implementation of an Information Extraction system.

Concerning the first problem, we propose an original approach that relies on two steps. The first step performs an event-based text segmentation, which identifies within a document the text segments on which the IE process should focus to look for the entities associated with a given event. The second step focuses on template filling and aims at selecting, within the segments identified as relevant by the event-based segmentation, the entities that should be used as fillers, using a graph-based method. This method is based on a local extraction of relations between entities, which are merged into a relation graph. A disambiguation step is then performed on the graph to identify the best candidates to fill the information template.

The second problem is treated in the context of knowledge base (KB) population, using a large collection of texts (several million) from which the information is extracted.
This extraction also concerns a large number of relation types (more than 40), which makes the manual annotation of the collection too expensive. We propose, in this context, a distant supervision approach in order to use learning techniques for this extraction without the need for a fully annotated corpus. This distant supervision approach uses a set of relations from an existing KB to perform an unsupervised annotation of a collection, from which we learn a model for relation extraction. This approach has been evaluated at large scale on the data from the TAC-KBP 2010 evaluation campaign.

Keywords: Information extraction, relation extraction, event extraction

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:44:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 10:44:49 +0100
Subject: Seminar: INFOLINGU, Myriam RAKHO, December 12, 2011, Université Marne-la-Vallée
Message-ID:
Date: Wed, 7 Dec 2011 17:12:37 +0100
From: Myriam RAKHO
Message-ID:

************************************************************************
INFOLINGU
The seminar of the Computational Linguistics team (équipe Informatique Linguistique) of the Laboratoire d'Informatique Gaspard Monge (LIGM), Université Paris-Est Marne-la-Vallée
************************************************************************
Date: *Monday, December 12, 2011 at 10:30 am*
Venue: Université
Paris-Est Marne-la-Vallée, Bâtiment Copernic, 4th floor, seminar room 4B08R
All interested persons are welcome.
************************************************************************
------------------------------------------------------------------------
Speaker:
------------------------------------------------------------------------
*Myriam RAKHO* (LIGM, Université Marne-la-Vallée)
------------------------------------------------------------------------
Title of the talk:
------------------------------------------------------------------------
*A corpus of polysemous verbs semantically annotated with the Lexique-Grammaire (Lexicon-Grammar)*
------------------------------------------------------------------------
Abstract:
------------------------------------------------------------------------
Word sense disambiguation of polysemous words (words that have several possible senses) consists in assigning to such a word, in a given context (sentence, paragraph), the one of its senses that it instantiates. For the French verb *comprendre*, for example, the task is to automatically recognize the sentence *'Le principe de précaution ne comprend pas la politisation de la science'* ("The precautionary principle does not include the politicization of science") as an instance of the sense "1 Contenir, renfermer en soi" (to contain, to include) and the sentence *'Vous devez comprendre que nous ne vous laisserons pas faire cela'* ("You must understand that we will not let you do that") as an instance of the sense "2 Se rendre compte de, que" (to realize). In the multilingual setting, the different possible translations of words in a given target language are used to represent their sense inventory. If the target language is English, the verb *comprendre* thus has as sense labels the verbs *understand*, *realise*, *see*, *gather*, *grasp*, *sympathise*, *appreciate*, *include*, *involve*, *imply*, etc. One drawback of this approach is that the correspondence between the senses of a polysemous word and their target-language equivalents is rarely one-to-one.
A single sense can be translated by several target-language equivalents and, conversely, a single target-language equivalent can translate several senses of a word. Until now, the solution to this problem has been to partition the set of possible translations of a word into subsets, each corresponding to one of its senses. For example, sense (1) of the verb *comprendre* would be associated with the class {*involve, imply, include*}, while sense (2) would be associated with the class {*understand, realise, see, gather, grasp*}.

We propose a new form of sense labels: the entries of words in the Lexique-Grammaire tables. The Lexique-Grammaire (LG) (Gross, 1975; Boons et al., 1976; Guillet and Leclère, 1992; Leclère, 2004) is a lexicon of French in which the different possible senses of words are described in terms of lexical, syntactic and semantic properties: a polysemous word has as many entries in the Lexique-Grammaire tables as it has possible senses.

Our study covers 20 polysemous French verbs selected during the ARCADE campaign for the evaluation of multilingual text alignment systems and used in the Senseval/Romanseval campaigns for the evaluation of word sense disambiguation systems. For each of these verbs, we extracted from the multilingual EuroParl corpus all the French contexts in which it appears, together with their English translations. We then associated four sense labels with each of these contexts: the translation of the polysemous verb in the English version of the context; the entry of the verb in the Lexique-Grammaire that it represents; a "lexie" of the form entrée_dans_le_LG#traduction, i.e. LG entry#translation (the concatenation of the first two sense labels); and a class of lexies (the set of lexies associated with the same entry of the word in the Lexique-Grammaire).
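For readers who want a concrete picture of the four sense labels, here is a minimal Python sketch. It is only an illustration under stated assumptions: the entry identifiers `comprendre_1` and `comprendre_2`, the class name `AnnotatedContext` and the function `lexie_classes` are hypothetical and do not come from the Lexique-Grammaire tables or from the corpus described in this abstract.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AnnotatedContext:
    """One occurrence of a polysemous verb with its first two sense labels."""
    verb: str         # French polysemous verb, e.g. "comprendre"
    translation: str  # label 1: English translation found in the aligned context
    lg_entry: str     # label 2: Lexique-Grammaire entry (hypothetical identifier)

    @property
    def lexie(self) -> str:
        # Label 3: concatenation of the LG entry and the translation.
        return f"{self.lg_entry}#{self.translation}"


def lexie_classes(contexts):
    """Label 4: group the lexies that share the same LG entry."""
    classes = defaultdict(set)
    for ctx in contexts:
        classes[ctx.lg_entry].add(ctx.lexie)
    return dict(classes)


# Toy data: entry identifiers are invented for the example.
contexts = [
    AnnotatedContext("comprendre", "include", "comprendre_1"),
    AnnotatedContext("comprendre", "involve", "comprendre_1"),
    AnnotatedContext("comprendre", "understand", "comprendre_2"),
]
classes = lexie_classes(contexts)
for entry in sorted(classes):
    print(entry, sorted(classes[entry]))
```

In this sketch the lexie and the lexie class are derived from the first two labels rather than stored, which mirrors the description of the lexie as a concatenation; an actual annotation format may well store all four labels explicitly.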
We will present this corpus together with the first results of our word sense disambiguation experiments.

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:55:37 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 10:55:37 +0100
Subject: Call: IC 2012
Message-ID:
Date: Thu, 08 Dec 2011 09:33:58 +0100
From: Nathalie Aussenac-Gilles
Message-ID: <4EE07676.4070309 at irit.fr>
X-url: http://ic2012.crc.jussieu.fr/

************************************************************************
*IC 2012 - 23es Journées francophones d'Ingénierie des Connaissances* (23rd French-speaking Conference on Knowledge Engineering)
Full details at http://ic2012.crc.jussieu.fr/
Paris, Centre des Cordeliers, June 25 to June 29, 2012
************************************************************************
************************************************************************
*Important dates*
Submission deadline: *February 17, 2012*
Notification to authors: March 31, 2012
Final version: May 2, 2012
************************************************************************

About the conference:

Knowledge engineering is a discipline within Artificial Intelligence, founded on knowledge: its acquisition, its models and its processing in computing environments. It draws on other disciplines such as philosophy, linguistics, mathematics, cognitive science and ergonomics to provide human activities with methods and tools that are "intelligent", or at least capable of assisting people in their activities. The IC conference brings together the French-speaking community and is a forum for exchange and reflection on this discipline, which must take into account the rise of Information and Communication Sciences and Technologies, in particular Web-related technologies and their impact on individual and collective practices. The knowledge engineering conference is where the French-speaking community presents and compares its theories, practices, methods and tools. One recent challenge is the establishment of evaluation standards for its methods and tools.

Conference topics:

Submissions on the theme "Models and evaluation tools in knowledge engineering" are particularly welcome, but contributions may more broadly address any of the following topics, provided they are original and have theoretical, methodological or practical significance (non-exhaustive list):
* Models and evaluation tools in knowledge engineering
* Knowledge models: design, reuse, evolution, evaluation, exploitation
* Ontologies: construction, types of ontologies, integration and alignment, role of reference ontologies
* Managing the dynamics of ontologies (construction, association between ontologies, management of time and space, ...)
* Processing of and reasoning over knowledge
* Knowledge acquisition from texts, from images, from unstructured data
* Interactive knowledge discovery in data, in histories, in interactions, on the web, ...
* Semantic web, social web
* Knowledge and social networks on the web
* Inference and business rules: acquisition, design, reuse, evolution, evaluation, exploitation
* Design of interactions and interfaces with knowledge-based systems
* Knowledge visualization
* Applications in health, industry, education, law, etc.
* Cooperative activities, collective and distributed practices
* Knowledge engineering and management for the enterprise
* Theoretical and interdisciplinary developments in knowledge engineering

Submissions:

Authors are invited to submit long papers, short papers, posters and demonstrations. For each type of submission (long paper, short paper), a best paper award will be given by the program committee. The best poster award will be decided by the conference participants at the end of the poster session.

Long papers (16 pages; 20-minute oral presentation, 10-minute discussion): for mature work presenting validated results or in-depth surveys. Particular attention will be paid to the evaluation of the results presented.

Short papers (8 pages; 10-minute oral presentation, 5-minute discussion): for work in progress or work at the stage of partial results, whether research-oriented or applied. This type of submission is for innovative, high-quality work whose maturity does not yet allow full elaboration and definitive conclusions.

Posters (3 pages; presentation in a discussion area): for work that is just starting and the first results of young researchers. Posters are intended to stimulate discussion within the community so as to best support the start of the work presented.

Authors are invited to indicate the most appropriate topic for their submission.

The Knowledge Engineering conference is held under the aegis of GRACQ (knowledge acquisition and engineering group), whose website is http://www.irit.fr/GRACQ/

Program committee chair: Sylvie Szulman (LIPN - Université Paris 13)
Organizing committee chair: Jean Charlet (AP-HP-INSERM - Université
Pierre et Marie Curie)

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 09:57:58 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 10:57:58 +0100
Subject: Seminar: New modes of scientific publication, December 14, 2011
Message-ID:
Date: Thu, 8 Dec 2011 09:38:41 +0100
From: Centre d'Alembert
Message-Id:
X-url: http://www.centre-dalembert.u-psud.fr
X-url: http://h0.web.u-psud.fr/plan_sciences/satellite.php?bat=338

2011-2012 interdisciplinary seminar of the Centre d'Alembert
http://www.centre-dalembert.u-psud.fr

New digital practices and cultures in science and technology

This year the Centre d'Alembert is examining the emergence of new practices and cultures in the production, validation and circulation of scientific knowledge, under the influence of recent developments in digital technology, in particular those related to the Internet.

Session of Wednesday, December 14, 2011:
New modes of scientific publication, validation and access: what progress for the open archives movement and for the free dissemination of scientific knowledge on the web?

Open access to scientific publications: elements of a cautious outlook
Jean-Claude GUEDON, Professor of comparative literature at the Université de Montréal. Excellence award of the Society for Digital Humanities. Former vice-president for research dissemination of the Fédération canadienne des sciences humaines (Canadian Federation for the Humanities and Social Sciences).
For about ten years, the open access movement has been spreading around the world. Its progress is undeniable, and there no longer seems to be any doubt that open access is now anchored in the reality of scientific communication: even the major publishing houses openly come to terms with this model, which was seen as unrealistic and radical only a few years ago. Based on institutional and subject repositories on the one hand and open access journals on the other, open access also acts as an indicator of the growing digitization of our cultures. It reveals in particular that digitization creates new functions for digital documents, and the next great challenge for open access will be to respond to these demands of the digital world: the role of data in relation to publications, the convergence of media of expression, and algorithms "embedded" in documents add to the more traditional questions of suitable formats for open access electronic publishing.

In parallel, the creation of symbolic value in science deserves to be revisited through the prism offered by open access. Currently completely dominated, even perverted, by the obsession with rankings and all-out competition, today's scientific publications reveal an unfortunate confusion between quality and excellence. Relocating the role of excellence within a quest for quality seems urgent, and there is no better way to achieve this goal than open access.

Practical information:
Sessions take place at the Faculté des Sciences d'Orsay, conference building (bâtiment des colloques, 338), from 1:30 pm to 3:30 pm (free admission).
Access: http://h0.web.u-psud.fr/plan_sciences/satellite.php?bat=338

Detailed program and contact:
Web: http://www.centre-dalembert.u-psud.fr
Email
: centre.dalembert at u-psud.fr
Social media: http://twitter.com/centredalembert

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 10:04:06 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 10 Dec 2011 11:04:06 +0100
Subject: Info: MULTI-2, Erasmus Mundus Action 2 Mobility Grants Europe - South East Asia
Message-ID:
Date: Thu, 8 Dec 2011 16:30:15 +0100
From: Laurent Prévot
Message-ID:
X-url: http://eacea.ec.europa.eu/erasmus_mundus/index_en.php
X-url: http://www.em-multi.eu/

****************************************
****Apologies for cross-postings****
****************************************

The MULTI Erasmus Mundus Project offers attractive mobility grants to Asian/European students and researchers (master's students, PhD students, young PhD holders) for studying or doing research in one of the consortium institutions (see below). The grants cover airfare and living expenses, for a duration of one academic semester to one academic year.

Multilingualism and Multiculturalism (MULTI-2) is an Action 2 Erasmus Mundus project funded by the European Commission to promote scientific exchanges between Europe and the industrialized countries of South-East Asia (Hong Kong, Macao, Taiwan, Singapore, Brunei). It concerns both general disciplines such as Linguistics, Area Studies or Social Sciences and more applied fields such as Natural Language Processing or Translation.
To be eligible for a mobility grant in the direction Europe -> South East Asia, the applicant must be a national of one of the 27 European Union countries and must register at or be affiliated with one of the European partner universities. To be eligible for a mobility grant in the direction South East Asia -> Europe, the applicant must be a national of one of the South East Asian countries participating in MULTI and must register at or be affiliated with one of the South East Asian partner universities. Partner institutions in Europe: Université de Provence (Coordinator) Freie Universität Berlin Universität des Saarlandes Università di Pisa City University London Universidad de Sevilla Charles University in Prague Partner institutions in South East Asia: The Hong Kong Polytechnic University (SEA Coordinator) University of Macao Nanyang Technological University National Taiwan University Universiti Brunei Darussalam The application deadline is *Jan. 20, 2012*, and the results will be posted around mid-March.
For more information, please check MULTI website: http://www.em-multi.eu/ Email Contact (remove + and spaces): erasmusmundus ++ at ++ lpl-aix.fr ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 10:08:37 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sat, 10 Dec 2011 11:08:37 +0100 Subject: Job: Stage Postdoctoral, Traduction automatique, LIMSI, Orsay Message-ID: Date: Thu, 08 Dec 2011 23:28:18 +0100 From: Francois Yvon Message-ID: <4EE13A02.5050504 at limsi.fr> X-url: http://www.limsi.fr/Scientifique/tlp/mt [apologies for cross-posting] [and for sending this message only in English] **************************************************************** POST-DOC OPENING IN STATISTICAL MACHINE TRANSLATION AT LIMSI-CNRS, FRANCE **************************************************************** The "Spoken Language Processing" team at LIMSI-CNRS, Orsay (25 km south of Paris) is seeking qualified postdoctoral researchers in the field of Statistical Machine Translation (SMT). The main task of the candidate will be to design, implement and evaluate innovative models and associated training and inference procedures for components of statistical machine translation systems. These models will be used to improve LIMSI's machine translation toolbox. Other areas in which we have a strong interest are model adaptation, confidence measures and automatic evaluation metrics.
The successful candidate will join a dynamic research team working on various aspects of SMT. See http://www.limsi.fr/Scientifique/tlp/mt for information regarding our activities. Candidates are expected to hold an Engineering BSc/MSc degree and a PhD in Computer Science. Knowledge of statistical or example-based approaches for Speech or Natural Language Processing is required; the candidate is also expected to have strong programming skills, to be familiar with statistical machine learning techniques, and to have a good publication record in the field. Experience with statistical machine translation systems and/or participation in large-scale open evaluations (NIST, IWSLT etc.) will be a strong advantage. The minimum remuneration will be EUR 2000 (net income) per month. A one-year contract will be awarded. Interested candidates should send a short cover letter stating their motivation and interests, along with their CV (in .pdf format only), to the email address given below as soon as possible and by December 31, 2011 at the latest. Questions or requests for clarification should be directed to the same address. Application Deadline: 31-dec-2011 [the position(s) will remain open until filled] Email Address for Applications: yvon at limsi dot fr Contact Information: F.
Yvon Phone: +331-69858066 ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sat Dec 10 10:12:45 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sat, 10 Dec 2011 11:12:45 +0100 Subject: Journee: Journee d'etude, Qu'est-ce qu'apprendre une langue? Universite Paris Ouest, 14 decembre 2012 Message-ID: Date: Fri, 9 Dec 2011 09:26:45 +0100 From: "Didier Bottineau" Message-ID: <975E4DBFA15D4FA8BADFC75C71940DD3 at R12B0> X-url: http://www.modyco.fr/ ANNOUNCEMENT OF A STUDY DAY What does it mean to learn a language? Wednesday 14 December 2011, Université Paris Ouest, Nanterre, building G, room G 614, 9:00-18:00 ACCESS: RER A, Nanterre Université stop Study day organized on Wednesday 14 December 2011 by Gilles Louÿs (EA 1586 Centre des Sciences de la Littérature française, F.ET.E: Français pour ETudiants Etrangers, Paris Ouest Nanterre La Défense) with the support of the Union des Professeurs de Langues des Grandes Ecoles, the Centre des Sciences de la Littérature française (Paris Ouest) and the doctoral schools Lettres Langues Spectacles and Connaissance, Langage, Modélisation (Paris Ouest). Venue: building G, room G 614 Scientific committee Alain Cazade (EA 4405 CICLaS, Paris Dauphine) Jörg Eschenauer (Union des Professeurs de Langues des Grandes Ecoles / Ecole des Ponts, ParisTech) Danièle Flament (UMR 7114 MoDyCo / Université Paris Ouest Nanterre la Défense) Jacques Prévot (EA 1586 Centre des Sciences de la Littérature française / Université Paris Ouest Nanterre La Défense) Jean-Louis Chiss (EA 2288 ERADLEC, Université Paris 3)
This study day aims to explore and define the experience of a person who expresses themselves and communicates in a language other than their own and, correlatively, to examine how this skill is acquired through a range of diverse teaching practices drawn from many fields of application. The viewpoint adopted here is that speaking another language means moving from one "speaking body" to another, for others as well as for oneself, and momentarily becoming a different individual, with the cognitive, semantic, emotional, relational and cultural effects this implies. The hypothesis defended is that observing this process goes beyond simply taking the "para-verbal" (gestures, facial expressions, postures) into account. For example, articulating sounds, tones or melodies in Japanese and in Spanish requires the speaker to position differently the various muscles mobilized in expressing what they say, which literally gives them a distinct face and voice depending on the language they speak and produces, consciously or not, a subjective and relational realization of themselves that differs from their way of being in their mother tongue. This fundamental aspect of human language activity is relatively little described, and its implementation is hardly systematized or made explicit in language teaching. In this context, the aim of the day is to give form and substance to this viewpoint and to explore its pedagogical consequences, which could become the site of an epistemological break: a) from the theoretical point of view, because research on language (or discourse) generally focuses precisely on so-called "linguistic" units without concern for what lies upstream (the reality of the conditions) of which they are the downstream product (the result and its often disembodied formalization); b) from the practical point of view, because, in general, teaching methods depend on the domains usually called upon in language learning, such as grammar, lexicon, phonetics, semantics, pragmatics and culture (representations, stereotypes, etc.), and lack the means to refer to a different (and more global) understanding of what the experience of speech is. This colloquium sets out to highlight current theories that give full weight to the learner's bodily experience in linguistic comprehension or production (taking into account their links with relevant currents in linguistics, psychology and phenomenology), and to connect them with teaching practices that call on the "speaking body" (theatre, gesture, dance, music, singing, speech therapy, etc.), which illustrate their didactic relevance in the field.
Programme 9:00 Opening and presentation of the day: Christophe Martin (Paris Ouest, director of EA 1586, co-director of UFR PHILLIA) The experience of speech: chair Robert Damien (Paris Ouest, director of ED 139) 9:15-10:00 Invited speaker: Didier Bottineau (CNRS, UMR 7114 Modyco & Paris Ouest): "How the experience of speech constitutes the subject psychologically, intersubjectively, cognitively and culturally, through interaction, by way of motor and sensory activity" 10:00-10:15 Discussion with the audience 10:15-10:45 Break The speaking body: chair Alain Cazade (EA 4405 CICLaS, Paris Dauphine) 10:45-11:05 Philippe Turchet (researcher in synergology, Canada): "What if learning a language also involved the unconscious integration of an associated body language?" 11:05-11:25 Catherine Garitte (EA 4431 Psychologie des processus et conduites complexes, Paris Ouest) and Larisa Olteanu: "Learning conventional gestures in Romanian and French among bilingual children aged 7 to 11" 11:25-11:45 Jean-Rémi Lapaire (Université Michel de Montaigne Bordeaux 3): "Thinking body, speaking body, moving body" 11:45-12:00 Discussion with the audience 12:00-13:30 Lunch Didactic perspectives 1: chair Thomas Gomez (Paris Ouest, director of ED 138) 13:30-13:50 Vera Dickman (Télécom ParisTech): "Mediation of the body in language and intercultural learning" 13:50-14:10 Jörg Eschenauer (Ecole des Ponts ParisTech): "Learning a language in tandem" 14:10-14:30 Amina Bensallah-Poulain (UMR 7114 MoDyCo, CNRS & Paris Ouest): "The bodily anchoring of language: what speech therapy practices contribute to the teaching of a 'foreign' oral language" 14:30-15:00 Discussion with the audience 15:00-15:30 Break Didactic perspectives 2: chair Jörg Eschenauer (ParisTech & UPLEGESS) 15:30-15:50 Joëlle Aden (Université du Maine, INEDUM/CREN, EA 2661): "Empathy and the theatrical approach in language teaching" 15:50-16:10 Graça Dos Santos (EA 369 Etudes romanes, Paris Ouest): "'Je est un autre'. The experience of bilingualism and theatre for teaching and learning a language." 16:10-16:30 Stephen Scott Brewer (Centre de recherches Education Formation: EA 1589 Université Paris Ouest): "Learning a language: the stakes of the 'inner game'" 16:30-17:00 Discussion with the audience 17:00-18:00 Round table chaired by Danièle Flament-Boistrancourt (UMR 7114 MoDyCo, Paris Ouest), Boris Sautereau and Emmanuelle Sauvage (EA 1586, Paris Ouest, FETE), Sylvaine Hughes (EA 370 Centre de recherches anglophones, Paris Ouest), Idelette Muzart (EA 369 Etudes romanes, Paris Ouest), Jean-Marc Chamot (6th dan Aïkido, EA 370, Paris Ouest). 18:00 Closing cocktail reception
------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:41:57 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 14 Dec 2011 09:41:57 +0100 Subject: Appel: ECAI 2012, Dernier appel a workshops Message-ID: Date: Sat, 10 Dec 2011 15:11:39 +0100 From: ECAI Publicity Message-ID: <4EE3689B.8030609 at lirmm.fr> X-url: http://www2.lirmm.fr/ecai2012/ Last call for ECAI 2012 Workshops www.lirmm.fr/ecai2012 Deadline: 6 Jan 2012 ECAI, the biennial European Conference on Artificial Intelligence organized by the European Coordinating Committee for Artificial Intelligence (ECCAI), is the premier forum for presenting Artificial Intelligence results in Europe. The twentieth conference in this series, ECAI 2012, will take place in the beautiful city of Montpellier (France) August 27-31. ECAI 2012 will gather researchers and practitioners of Artificial Intelligence from all over the world; they will present the latest results and identify new trends and challenges in all subfields of Artificial Intelligence as well as demonstrate innovative applications and uses of advanced Artificial Intelligence technology. In addition to the main technical track, the ECAI 2012 program will feature keynote and invited talks, invited tutorials, an elaborate workshop program, and a system-demonstration track.
Workshops ----------- All workshop proposals are welcome on any topic related to AI, ranging from robotics to games, from knowledge modelling to reasoning to constraints, from machine learning to planning and scheduling, from uncertainty to perception to language, from multi-agents to interactions and user modelling (non-exhaustive list). The goal of the workshops is to provide an informal forum for researchers to discuss focused and/or emerging research questions and challenges. Workshops can come in several flavors: * presentations of peer-reviewed papers (standard workshop format), including discussions and possibly panels; * AI competitions or challenges; * position papers and discussion on open or controversial problems; * any other format aimed at investigating new AI trends. What a workshop should *not* be: a mini-conference packed with short talks and with no time for discussion and interaction. We therefore encourage workshop organizers to include invited talks, short tutorials, panel discussions and representation of alternative viewpoints in their workshop program in addition to the regular submissions. The typical duration for a workshop is one day; two-day workshops can also be accommodated. For any question, or to signal an intention to submit, please contact Jérôme Lang and Michèle Sebag (lang at lamsade.dauphine.fr, sebag at lri.fr). The deadline for workshop proposals is January 6, 2012. Dates: 6 Jan 2012: workshop proposals due (early submissions are welcome!)
1 Feb 2012: workshop proposal notifications sent 19 Feb 2012: 1st call for papers sent; workshop websites ready; announcement of all ECAI-2012 workshops at the main conference site 28 May 2012 (suggested): workshop paper submission deadline 28 Jun 2012 (suggested, and subject to modification): notification on workshop paper submissions 22 Jul 2012: workshop proceedings to be sent to workshop chair Note that the workshop paper submission deadline should preferably fall after the ECAI notification (21 May 2012), and workshop paper notifications *must* be sent before the ECAI early registration deadline (which will be announced later). Workshop proposal submission: Please submit your proposal at http://www.easychair.org/conferences/?conf=ecai2012ws Each workshop proposal (pdf file) must include: (1) Title and acronym. (2) Names, affiliations, and contact details of all organizers (please indicate a primary contact person to whom correspondence should be directed). (3) Short description (workshop format and duration, tentative call for papers, tentative invited speakers and members of the program committee). (4) Audience: research groups working in the field; (if applicable) details about previous editions of the workshop; related workshops in other venues; expected number of submissions. (5) One paragraph about each organizer (scientific profile, previous events you have organized). (6) Any other information you deem relevant. Policy: * All workshop participants, including the organizer, are required to register for both the workshop and the main conference. For up to three invited speakers per workshop the workshop registration will be waived, though they must still register for the main conference. Unfortunately, ECAI cannot provide any other support for these speakers. * For budgetary reasons, a workshop might be cancelled if there are not enough submissions or registrations. * Proceedings: The organizers will provide a single pdf file to be posted on the website.
------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:46:33 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 14 Dec 2011 09:46:33 +0100 Subject: Ecole: SSFLA 2012 Message-ID: Date: Sun, 11 Dec 2011 11:24:50 +0100 From: "GRLMC" Message-ID: X-url: http://grammars.grlmc.com/ssfla2012/ ********************************************************************* 2012 INTERNATIONAL SPRING SCHOOL IN FORMAL LANGUAGES AND APPLICATIONS SSFLA 2012 (formerly International PhD School in Formal Languages and Applications) Tarragona, Spain April 2-6, 2012 Organized by: Research Group on Mathematical Linguistics (GRLMC) Rovira i Virgili University http://grammars.grlmc.com/ssfla2012/ ********************************************************************* AIM: SSFLA 2012 offers a broad and intensive series of lectures at different levels on selected topics in language and automata theory and their applications. The students choose their preferred courses according to their interests and background. Instructors are top names in their respective fields. The School intends to help students initiate and foster their research career. The previous event in this series was FSFLA 2011 (http://grammars.grlmc.com/fsfla2011/). ADDRESSED TO: Undergraduate and graduate students from around the world. Most appropriate degrees include: Computer Science and Mathematics.
Other students (for instance, from Linguistics, Electrical Engineering, Molecular Biology or Logic) are welcome too, provided they have a good background in discrete mathematics. The School is also appropriate for people more advanced in their career who want to keep up to date with developments in the field. There will be no overlap in the class schedule. COURSES AND PROFESSORS: - Eric Allender (Rutgers), Circuit Complexity: Recent Progress in Lower Bounds [introductory/advanced, 8 hours] - Amihood Amir (Bar-Ilan), Periodicity and Approximate Periodicity in Pattern Matching [introductory, 6 hours] - Ahmed Bouajjani (Paris 7), Automated Verification of Concurrent Boolean Programs [introductory/advanced, 10 hours] - Andrea Corradini (Pisa), Computational Models Based on Rewriting [introductory, 12 hours] - Bruno Courcelle (Bordeaux), Automata for Monadic Second-order Model Checking [intermediate, 8 hours] - Manfred Droste (Leipzig), Weighted Automata and Weighted Logic [introductory/advanced, 8 hours] - Jörg Flum (Freiburg), The Halting Problem for Turing Machines [introductory/advanced, 6 hours] - Aart Middeldorp (Innsbruck), Termination of Rewrite Systems [introductory/intermediate, 8 hours] REGISTRATION: Registration must be completed online at http://grammars.grlmc.com/ssfla2012/Registration.php FEES: Fees vary with the number of course hours each student takes. The rule is: 1 hour = - 10 euros (for payments until January 22, 2012), - 15 euros (for payments after January 22, 2012). PAYMENT PROCEDURE: The fees must be paid to the School's bank account: Uno-e Bank (Julian Camarillo 4 C, 28037 Madrid, Spain): IBAN: ES3902270001820201823142, Swift/BIC code: UNOEESM1 (account holder: Carlos Martin-Vide GRLMC; account holder's address: Av. Catalunya, 35, 43002 Tarragona, Spain) Please mention SSFLA 2012 and your name in the subject. A receipt will be provided on site. Remarks: - Bank transfers should not involve any expense for the School.
- People claiming early registration will be requested to prove that the bank transfer order was carried out by the deadline. - Students may be refunded only if a course is cancelled because the instructor is unavailable. People registering on site at the beginning of the School must pay in cash. For the sake of local organization, however, it is strongly recommended to register earlier. ACCOMMODATION: Information about accommodation is available on the website of the School. CERTIFICATE: Students will receive a certificate stating the courses attended, their contents, and their duration. IMPORTANT DATES: Announcement of the programme: December 10, 2011 Start of registration: December 10, 2011 Early registration deadline: January 22, 2012 Start of the School: April 2, 2012 End of the School: April 6, 2012 QUESTIONS AND FURTHER INFORMATION: Lilica Voicu: florentinalilica.voicu at urv.cat WEBSITE: http://grammars.grlmc.com/ssfla2012/ POSTAL ADDRESS: SSFLA 2012 Research Group on Mathematical Linguistics (GRLMC) Rovira i Virgili University Av. Catalunya, 35 43002 Tarragona, Spain Phone: +34-977-559543 Fax: +34-977-558386 ACKNOWLEDGEMENTS: Diputació
de Tarragona Universitat Rovira i Virgili ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:49:28 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 14 Dec 2011 09:49:28 +0100 Subject: Appel: FOIS 2012 Message-ID: Date: Sun, 11 Dec 2011 23:56:35 +0100 From: Nathalie Aussenac-Gilles Message-ID: <4EE53523.3020508 at irit.fr> X-url: http://purl.org/icbofois2012 (apologies for cross-posting) ... (please distribute) ----------------------------------------------------------------- ------------- CALL FOR PAPERS | FOIS 2012 | Jul 24-27, Graz, Austria ------------- -> DEADLINE: Conference paper submission: Jan 31, 2012 (in ca. 8 weeks) [1] http://purl.org/icbofois2012 ------------------------------------------------ TITLE AND CO-LOCATION ------------------------------------------------ Seventh International Conference on Formal Ontologies in Information Systems (FOIS 2012) July, 24-27, 2012 in Graz, Austria held together with the Third International Conference on Biomedical Ontology (ICBO 2012) ------------------------------------------------ DEFINITION AND SCOPE ------------------------------------------------ Ontology, originally a fundamental part of philosophical enquiry, is concerned with the analysis and categorization of what exists.
In recent years, however, a complementary focus of ontological inquiry gained significant momentum fueled by the advent of complex information systems which rely on robust and coherent representations of their subject matter. The systematic study of such representations, their reasoning techniques and their relations to reality, are at the center of the modern discipline of formal ontology. Formal ontology in this modern sense is now a research focus in such diverse domains as conceptual modeling, database design, software engineering, organizational modeling, artificial intelligence, computational linguistics, the life sciences, bioinformatics, geographic information science, knowledge engineering, information retrieval, and the semantic web. Researchers in all these areas increasingly recognize the need for serious engagement with ontology, understood as a general theory of the types of entities and relations making up their respective domains of enquiry, in providing a solid foundation for their work. The FOIS conference is designed to provide a meeting point for researchers from all disciplines with an interest in formal ontology. The conference encourages submission of high quality articles on both theoretical issues and concrete applications. As in previous years, FOIS 2012 is intended as a nexus of interdisciplinary research and communication [1]. The FOIS conference series [2] began with the first meeting in Trento, Italy in June 1998 followed by meetings in 2001, 2004, 2006, 2008, and 2010. The seventh FOIS conference will be held in Graz, Austria July 24-27, 2012, in conjunction with the third International Conference on Biomedical Ontology (ICBO 2012) [1]. 
FOIS is the flagship conference of the International Association for Ontology and its Applications (IAOA) [3], which is a non-profit organization the purpose of which is to promote interdisciplinary research and international collaboration at the intersection of philosophical ontology, linguistics, logic, cognitive science, and computer science, as well as in the applications of ontological analysis to conceptual modeling, knowledge engineering, knowledge management, information-systems development, library and information science, scientific research, and semantic technologies in general. ------------------------------------------------ TOPICS OF INTEREST ------------------------------------------------ We seek high-quality papers on a wide range of topics. While authors may focus on fairly narrow and specific issues, all papers should emphasize the relevance of the work described to formal ontology and to information systems. Papers that completely ignore one or the other of these aspects will be considered as lying outside the scope of the meeting. Topic areas of particular interest to the conference are: Foundational Issues * Kinds of entity: particulars/universals, continuants/occurrents, abstracta/concreta, dependent/independent entities, natural objects/artifacts * Formal relations: parthood, identity, connection, dependence, constitution, subsumption, instantiation * Vagueness and granularity * Identity and change * Formal comparison among ontologies * Ontology of physical reality (matter, space, time, motion, ...) * Ontology of biological reality (genes, proteins, cells, organisms, ...) * Ontology of artifacts, functions and roles * Ontology of mental reality and agency (beliefs, intentions, emotions, ...) * Ontology of social reality (institutions, organizations, norms, social relationships, artistic expressions, ...) * Ontology of the information society (information, communication, meaning negotiation, ...) 
* Ontology and Natural Language Semantics, Ontology and Cognition Methodologies and Applications * Top-level vs application ontologies * Ontology integration and alignment; role of reference ontologies * Ontology-driven information systems design * Ontological foundations for conceptual modeling * Ontology-based application systems * Requirements engineering * Knowledge engineering * Knowledge management and organization * Knowledge representation; Qualitative modeling * Computational lexicons; Terminology * Information retrieval; Question-answering * Semantic Web; Web services; Grid computing * Domain-specific ontologies, especially for: Biomedical science, E-business, Enterprise integration, Engineering, Geography, Law, Library science, Linguistics, ... ------------------------------------------------ KEYNOTE SPEAKERS ------------------------------------------------ Steven R. Ray (Carnegie Mellon Silicon Valley, California, USA) Laure Vieu (CNRS-IRIT, Toulouse, France) Edward N. Zalta (Stanford University, California, USA) ------------------------------------------------ IMPORTANT DATES ------------------------------------------------ We are calling for papers to be considered for inclusion in FOIS 2012. Conference paper submission: Jan 31, 2012<-- Conference paper notification: Mar 08, 2012 Camera-ready conference papers: Mar 24, 2012 Workshop paper submission: Mar 15, 2012 Workshop paper notification: Apr 15, 2012 Camera-ready workshop papers: Jun 30, 2012 Conference dates: Jul 24-27, 2012 ------------------------------------------------ AUTHOR INSTRUCTIONS FOR CONFERENCE PAPERS ------------------------------------------------ Submitted papers must not exceed 14 pages, including the bibliography and an abstract of no more than 300 words. Papers should be submitted electronically as PDF files prepared in accordance with the IOS formatting guidelines [4]. 
Submissions should be made via Easychair at the following address: [5] https://www.easychair.org/account/signin.cgi?conf=fois2012 Proceedings will be published by IOS Press and will be available at the time of the conference. Please note that at least one author must register for the conference in order for an accepted paper to be published in the proceedings. ------------------------------------------------ CONFERENCE ORGANISATION ------------------------------------------------ Conference chair: Michael Gruninger (University of Toronto, Canada) Program chairs: Maureen Donnelly (University at Buffalo, USA) Giancarlo Guizzardi (Fed. Univ. of Espírito Santo, Brazil) Local organization: Stefan Schulz (Graz University, Austria) ------------------------------------------------ PROGRAM COMMITTEE ------------------------------------------------ Achille Varzi (Columbia University, New York, USA) Alan Rector (University of Manchester, UK) Albert Goldfain (University at Buffalo, USA) Aldo Gangemi (ISTC-CNR, Rome, Italy) Alessandro Lenci (University of Pisa, Italy) Antony Galton (University of Exeter, UK) Barry Smith (University at Buffalo, USA) Bill Andersen (Highleet, Inc., USA) Brandon Bennett (University of Leeds, UK) Carola Eschenbach (University of Hamburg, Germany) Chris Menzel (Texas A&M University, USA) Chris Welty (IBM Research, Hawthorne, USA) Christiane Fellbaum (Princeton University, USA) Claudio Masolo (ISTC-CNR, Trento, Italy) Colin Batchelor (Royal Society of Chemistry, UK) Daniel Oberle (SAP Research, CEC Karlsruhe, Germany) David Mark (University at Buffalo, USA) Decio Krause (Fed. Univ. of Santa Catarina (UFSC), Brazil) Fabian Neuhaus (NIST, USA) Florian Probst (SAP, Germany) Frank Loebe (University of Leipzig, Germany) Fred Freitas (Fed. Univ.
of Pernambuco, Recife, Brazil) Frederico Fonseca (Penn State University, USA) Gerd Wagner (Brandenburg University of Technology, Germany) Heinrich Herre (University of Leipzig, Germany) Jan Dietz (Delft University of Technology, The Netherlands) Janna Hastings (University of Geneva, Switzerland) Jeffrey Parsons (Memorial University of Newfoundland, Canada) Jerome Euzenat (INRIA, Grenoble, France) Jerry Hobbs (University of Southern California, USA) Johanna Seibt (University of Aarhus, Denmark) John Bateman (University of Bremen, Germany) John Mylopoulos (University of Toronto, Canada) John Sowa (Vivomind Intelligence Inc., USA) Joost Breuker (University of Amsterdam, The Netherlands) Ken Kaneiwa (National Institute of Informatics, Tokyo, Japan) Krzysztof Janowicz, (UCLA, USA) Laure Vieu (CNRS-IRIT, Toulouse, France) Leo Obrst (The Mitre Corporation, USA) Leonardo Lesmo (University of Torino, Italy) Maria Luiza Machado Campos (Fed. Univ. of Rio de Janeiro, Brazil) Mark Musen (Stanford University, USA) Martin Doerr (Foundation for Research and Technology, Greece) Mathias Brochhausen (Univ. of Arkansas for Medical Sciences, USA) Mathias Samwald (Medical University of Vienna, Austria) Matteo Cristani (University of Verona, Italy) Mauricio Barcelos de Almeida (Fed. Univ. of Minas Gerais, Brazil) Michael F. 
Uschold (Semantic Arts, Canada) Michael Gruninger (University of Toronto, Canada) Michael Sintek (DFKI GmbH, Kaiserslautern, Germany) Michel Dumontier (Carleton University, Canada) Natasha Noy (Stanford University, USA) Nicola Guarino (ISTC-CNR, Trento, Italy) Olivier Bodenreider (National Library of Medicine, USA) Oscar Pastor (Polytechnic University of Valencia, Spain) Paul Buitelaar (National University of Ireland, Galway) Peter Rittgen (University College Boras, Sweden) Pierdaniele Giaretta (University of Verona, Italy) Pierre Grenon (Open University, UK) Ricardo Falbo (Federal University of Espírito Santo, Brazil) Riichiro Mizoguchi (University of Osaka, Japan) Robert Hoehndorf (University of Cambridge, UK) Roberta Ferrario (ISTC-CNR, Trento, Italy) Roberto Casati (CNRS-EHSS, Paris, France) Stefano Borgo (ISTC-CNR, Trento, Italy) Terry Halpin (LogicBlox, Australia) Terry Langendoen (University of Arizona, USA) Thomas Bittner (University at Buffalo, USA) Tony Cohn (School of Computing, University of Leeds, UK) Ulrike Sattler (University of Manchester, UK) Veda Storey (Georgia State University, USA) Werner Ceusters (University at Buffalo, USA) Werner Kuhn (University of Munster, Germany) William McCarthy (Michigan State University, USA) Yair Wand (University of British Columbia, Canada) ------------------------------------------------ LINKS ------------------------------------------------ [1] http://purl.org/icbofois2012 == http://www.kr-med.org/icbofois2012 FOIS and ICBO 2012 website [2] http://www.formalontology.org/ FOIS conference series [3] http://www.iaoa.org International Association for Ontology and Its Applications [4] http://www.iospress.nl/authco/instruction_crc.html IOS Press formatting guidelines [5] https://www.easychair.org/account/signin.cgi?conf=fois2012 Submission at EasyChair ----------------------------------------------------------------- ------------------------------------------------------------------------- Message diffuse par la liste 
Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:51:09 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 14 Dec 2011 09:51:09 +0100 Subject: Revue: CORELA, vol. 9, num. 2 Message-ID: Date: Mon, 12 Dec 2011 16:35:07 +0100 From: Gilles Col Message-Id: X-url: http://corela.edel.univ-poitiers.fr/ X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2127 X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2314 X-url: http://corela.edel.univ-poitiers.fr/index.php?id=2298 Dear colleagues, The editorial team of the journal CORELA - Cognition / Représentation / Langages (http://corela.edel.univ-poitiers.fr/) is pleased to announce the publication of its latest issue (vol. 9, no. 2). Submissions for the next issue of the journal (vol. 10, no. 1) should be sent to Gilles Col (gilles.col at univ-poitiers.fr) before 13 April 2012. Contents of the issue Jean-Baptiste Guignard (2011). "Linguistique Cognitive et modèles catégoriaux : Quelques considérations épistémologiques". http://corela.edel.univ-poitiers.fr/index.php?id=2127 Isabelle Haïk (2011). "Kissing goodbye". http://corela.edel.univ-poitiers.fr/index.php?id=2314 Bénédicte GUILLAUME (2011). "Will dans les subordonnées en if." 
http://corela.edel.univ-poitiers.fr/index.php?id=2298 From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:52:55 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 14 Dec 2011 09:52:55 +0100 Subject: These: Tantely Harinjaka Ravelonjatovo, Analyse systematique des termes malgaches Message-ID: Date: Tue, 13 Dec 2011 12:10:05 +0000 From: Tantely Harinjaka RAVELONJATOVO Message-ID: Dear researchers, You are cordially invited to attend the doctoral thesis defense of RAVELONJATOVO Tantely, entitled "CONTRIBUTION A LA METHODOLOGIE D'ANALYSE SYSTEMATIQUE DES TERMES MALGACHES. CAS DU DOMAINE DE L'ENVIRONNEMENT" (Contribution to the methodology for the systematic analysis of Malagasy terms: the case of the environmental domain), which will take place on 22 December 2011 at 8:30 a.m. in Amphi 24 of the Faculté des Lettres et Sciences Humaines, Université d'Antananarivo. After the defense, refreshments will be served in the Siméon Rajaona room. 
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:56:25 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 14 Dec 2011 09:56:25 +0100 Subject: Appel: CAp 2012 Message-ID: Date: Tue, 13 Dec 2011 15:52:27 +0100 From: Armelle Brun Message-Id: X-url: http://cap2012.loria.fr/ ------------------------------------------------------------------------ First call for papers CAp 2012 Conférence Francophone sur l'Apprentissage Automatique (French-speaking conference on machine learning) LORIA, Nancy, 23-25 May 2012 http://cap2012.loria.fr/ CAP'12 is organised with the support of INRIA and AFIA. ------------------------------------------------------------------------ The machine learning conference (CAp) is now an unmissable annual event for researchers in the French-speaking machine learning community. As the boundary between symbolic and numerical learning becomes ever less clear-cut, CAp offers anyone working in machine learning an opportunity to present their work and exchange ideas with other members of the community. It is also an occasion to start or continue multidisciplinary scientific exchanges among PhD students, researchers and industry practitioners. The conference also lets PhD students gain a broad overview of their research field and benefit from a first contact, both rigorous and supportive, with all the activities involved in scientific communication: writing a paper, gathering and taking into account outside opinions, giving an oral presentation, etc.
Important Dates ---------------------- Full paper submission: 1 February 2012, midnight Notification to authors: 15 March 2012 Camera-ready papers due: 1 April 2012 Conference: 23-25 May 2012 The CAP'12 program committee encourages submissions of original work on the various topics related to machine learning, including (but not limited to): * Symbolic and algebraic learning: decision trees, inductive logic programming, multi-instance learning, Galois lattices, frequent pattern mining... * Numerical or statistical learning: neural networks, Bayesian networks, support vector machines, semi-supervised learning, co-training... * Hybrid approaches: evolutionary algorithms, case-based reasoning, analogy, cognitive approaches, kernel methods... * Reformulation: feature selection, instance selection, propositionalization, abstraction... * Formal study of learning: learnability, evaluation of learning systems, links with the expert and human-computer interface aspects (bias, visualization, expert modeling, etc.), quality measures for data mining... * Learning from structured and semi-structured data, from text (text mining, web mining...), from genomes, from images... * Graphical models: Markov models, Bayesian networks, dimensionality reduction, variable selection, clustering... * Applications: data mining, text mining, bioinformatics, medicine, chemistry... * ... In addition to the paper presentation sessions (oral and poster), several renowned researchers in machine learning or in its application to various domains will be invited to present their work.
We therefore invite you to submit papers on your recent machine learning research. This research may be of various kinds: methodological, theoretical or experimental work, or the development of algorithms, tools, systems or applications. Papers may be submitted in French or English. Templates and Styles ----------------------- To ensure overall consistency, only papers submitted in PDF format will be accepted (maximum 16 pages, including bibliography, figures and tables). The LaTeX, Word and Pages templates to use are available on the conference website (http://cap2012.loria.fr). Please do not alter the layout or fonts defined in the styles. Committees ---------- Organising committee: Frédéric Alexandre, Laurent Bougrain (chair), Armelle Brun, Olivier Buffet, Anne-Lise Charbonnier, Yannick Toussaint Program committee: [being finalised] Contact: bougrain at loria.fr Announcements ------------ Prizes for the best papers presented by a student will be awarded by the program committee. Note that this year CAp is organised jointly with JFPDA (Journées Francophones Planification, Décision, Apprentissage pour la conduite de systèmes) (http://jfpda2012.loria.fr) and will therefore be an opportunity to broaden multidisciplinary exchanges by opening up to the planning, decision and reinforcement learning community. Mark your calendars; we look forward to seeing you in Nancy in 2012! 
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 14 08:58:03 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 14 Dec 2011 09:58:03 +0100 Subject: Job: Post-Doc LIF Marseille Message-ID: Date: Tue, 13 Dec 2011 15:54:28 +0100 From: "alexis.nasr" Message-ID: <4EE76724.8070004 at lif.univ-mrs.fr> ------------------------------------------------------------------------ [english version below] Le Laboratoire d'Informatique Fondamentale de Marseille propose une année de post-doctorat portant sur le traitement linguistique de la parole spontanée anglaise. Il s'agit d'appliquer à la sortie d'un système de transcription automatique de la parole différents traitements, en particulier de la reconnaissance d'entités nommées, de l'étiquetage morpho-syntaxique, de l'analyse syntaxique, de la désambiguïsation lexicale et de l'identification de rôles sémantiques. Le candidat retenu sera accueilli par l'équipe Traitement Automatique du Langage Ecrit et Parlé. Profil recherché : - bonne maîtrise de l'anglais - formation en TAL ou en apprentissage automatique - maîtrise des langages de programmation C ou C++. 
Contact: Frédéric Béchet (frederic.bechet at lif.univ-mrs.fr) Alexis Nasr (alexis.nasr at lif.univ-mrs.fr) ------------------------------------------------------------------------ The Laboratoire d'Informatique Fondamentale de Marseille has an opening for a one-year post-doctoral position in the field of Natural Language Processing, more precisely on the linguistic processing of the output of English Automatic Speech Recognition. Several processing modules will be developed or adapted: Named Entity Recognition, Part of Speech Tagging, syntactic parsing, Word Sense Disambiguation and Semantic Role Labeling. The candidate will join the Natural Language Processing Team (Traitement Automatique du Langage Ecrit et Parlé). Applicants are expected to have finished, or be about to finish, their Ph.D. degrees. They must have a background in Natural Language Processing and be proficient in C or C++ programming, as well as in English. Contact: Frédéric Béchet (frederic.bechet at lif.univ-mrs.fr) Alexis Nasr (alexis.nasr at lif.univ-mrs.fr) From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:13:39 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:13:39 +0100 Subject: These: Mohamed Belgacem, Reconnaissance automatique de la parole et ALAO : Vers un systeme d'apprentissage de l'arabe oral Message-ID: Date: Tue, 13 Dec 2011 17:26:58 +0100 (CET) From: mohamed belgacem Message-ID: 
<2079155664.1004631323793618371.JavaMail.www at wwinf7143> Hello everyone, I am pleased to invite you to my thesis defense, which will take place on 16 December 2011 at 2:30 p.m. in the Jacques Cartier room, Maison des langues et des cultures, Université Stendhal, Grenoble, France. You are also welcome at the reception that will follow the defense. Thesis title: Reconnaissance automatique de la parole et ALAO : Vers un système d'apprentissage de l'arabe oral (Automatic speech recognition and computer-assisted language learning: towards a system for learning spoken Arabic) Abstract The phonetic component is not sufficiently addressed in foreign language learning, even though it is one of the sources of the diversity of accents observed among non-native speakers. The speech signal conveys important information about the speaker. This information tells us about the speaker's language level and any pronunciation problems they may have. Analysing the characteristics of learners' speech can help improve the foreign language learning process. Such analysis can support the development of technological applications such as automatic speech recognition (ASR), speaker verification and identification, and computer-assisted language learning (CALL; ALAO in French). These applications have advanced considerably in recent years thanks to progress in computing technology. Based on resources from natural language processing (NLP), the 3AO prototype developed in this thesis aims to open new avenues for computer-assisted Arabic language learning. One of its main objectives is to give learners of Arabic tools suited to their learning needs while keeping the computing aspects in the background. In this thesis, we present tools for computer-assisted Arabic language learning. 
These tools include automatic recognition of Arabic speech, based on statistical acoustic modelling and on hidden Markov models (HMMs), which are used today in a very large number of automatic speech recognition systems. Other tools include the speech corpus, the Arabic pronunciation dictionary, the phonetizer for Arabic, the forced alignment technique, automatic identification of Arabic dialects, etc. Finally, we present our prototype for computer-assisted Arabic learning, Apprentissage de l'Arabe Assisté par Ordinateur (3AO), the results of our experiments, and the correction procedure that lets students identify and correct their pronunciation errors. Jury - Mr. CHANIER Thierry, Professor at Université Blaise Pascal, Clermont-Ferrand 2, France, reviewer. - Mr. ZRIGUI Mounir, Maître de conférences HDR at Université de Monastir, Tunisia, reviewer. - Mr. BESACIER Laurent, Professor at Université Joseph Fourier, Grenoble, France, examiner. - Mr. JERRAYA Ahmed, Research Director, CEA-LETI, MINATEC, Grenoble, France, examiner. - Mr. ANTONIADIS Georges, Maître de conférences HDR at Université Stendhal, Grenoble, France, thesis supervisor. - Ms. FRANJIE Lynne, Maître de conférences at Université Stendhal, Grenoble, France, examiner. 
Mohamed Belgacem From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:16:02 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:16:02 +0100 Subject: Job: Research Assistant in Natural Language Processing, UPF, Barcelona Message-ID: Date: Thu, 15 Dec 2011 09:43:02 +0100 From: Horacio Saggion Message-ID: X-url: http://www.taln.upf.edu/content/news/362 http://www.taln.upf.edu/content/news/362 The Natural Language Processing Research group (TALN) at the Department of Communication and Information Technologies, Universitat Pompeu Fabra, has a 12-month open position in the area of automatic text simplification in Spanish, in the context of the research project Simplext. We are looking for candidates with some experience in natural language processing (NLP). Candidates with interest and/or experience in automatic text summarization, text simplification, text-to-text generation, and development of NLP applications are particularly welcome. We are looking for a candidate willing to stay in our group after the 12-month period to pursue a PhD in Natural Language Processing. We require: * A degree in Computer Science or related field. 
* Experience in NLP, Machine Learning, or Artificial Intelligence * Experience in programming and willingness to learn Java We offer: * A 12-month research contract * Dynamic and collaborative working environment * A maximum gross salary of 19.400 EUROS depending on experience * The position is available immediately. Universitat Pompeu Fabra (UPF) is recognised internationally for the quality of its research; it is also one of the top 5 Spanish universities in terms of scientific production and academic efficiency. The NLP group is situated in Campus de la Comunicación in the innovation district of Barcelona, 22@. If you are interested in working with us, please send the following documentation: * Cover letter including your contact details * A CV and list of publications * The names and contact details of 2 people who can act as your references Send the documentation by e-mail to Dr. Horacio Saggion: horacio dot saggion at upf dot edu. The subject of your e-mail should be "Simplext-TALN-RA". If you have any informal query, please contact Dr. Horacio Saggion: horacio dot saggion at upf dot edu. 
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:19:58 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:19:58 +0100 Subject: Appel: LREC 2012, Workshop NLP4ITA, Natural Language Processing for Improving Textual Accessibility Message-ID: Date: Sat, 17 Dec 2011 09:35:39 +0100 From: Horacio Saggion Message-ID: X-url: http://www.taln.upf.edu/nlp4ita ******************************************************************** First Call for Papers LREC 2012 workshop on: Natural Language Processing for Improving Textual Accessibility (NLP4ITA) May 27, 2012 (Morning Session) Istanbul, Turkey http://www.taln.upf.edu/nlp4ita ******************************************************************** === Introduction === This workshop aims to bring together researchers focused on tools and resources for making textual information more accessible to people with special needs, including people with diverse ranges of hearing and sight disabilities, people with cognitive disabilities, elderly people, low-literacy readers and adults learning to read and write, among others. === Motivation and Topics of Interest === In recent years there has been an increasing interest in accessibility and usability issues. This interest is mainly due to the greater importance of the Web and the need to provide equal access and equal opportunity to people with diverse disabilities. 
The role of assistive technologies based on language processing has gained importance as it can be observed from the growing number of efforts (United Nations declarations on universal access to information or WAI guidelines related to content) and research in conferences and workshops (W4A, ICCHP, ASSETS, SPLAT, etc.). However, language resources and tools to develop assistive technologies are still scarce. This workshop will give an opportunity for individuals from different communities to present research findings, discover future challenges, and discuss potential collaboration. We welcome papers describing tools, resources, models, techniques and evaluation from all areas of natural language processing tailored to accessibility and assistive technologies. These include, but are not limited to, the following: - Resources: corpora of inaccessible documents, aligned corpora, gold-standards and annotation schemes. - Evaluation, complexity and readability methodologies and metrics. - Novel modeling and machine learning techniques for improving accessibility. - Natural language generation for improving accessibility. - NLP for Web accessibility. - Text adaptation, elaboration, simplification and summarization. - User studies on systems and components. - NLP tools for accessibility === Submission Information === We encourage contributions in the form of full papers (6 to 8 pages long). Papers should follow the LREC format which is available at the LREC 2012 Web Site. The submission process will be online using the START conference system at https://www.softconf.com/lrec2012/NLP4ITA All accepted papers will be presented orally and published in the workshop proceedings. 
=== Important dates === Feb 24, 2012 Paper submission deadline March 23, 2012 Notification of acceptance March 30, 2012 Camera-ready version May 27, 2012 LREC 2012 Workshops === LRE Map 2012 === The LRE Map 2012 is an initiative promoted for monitoring the production, use and evaluation of language resources and tools. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. === Organizers === Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!) Paloma Moreda (Universidad de Alicante) Luz Rello (Universidad Pompeu Fabra) Horacio Saggion (Universidad Pompeu Fabra) Lucia Specia (University of Wolverhampton) === Program Committee === Sandra Aluisio (University of Sao Paulo) Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!) Delphine Bernhard (University of Strasbourg) Nadjet Bouayad-Agha (Universidad Pompeu Fabra) Richard Evans (University of Wolverhampton) Caroline Gasperin (TouchType Ltd) Pablo Gervás (Universidad Complutense de Madrid) Simon Harper (University of Manchester) David Kauchak (Middlebury College) Guy Lapalme (University of Montreal) Paloma Martínez (Universidad Carlos III de Madrid) Aurelien Max (Paris 11) Kathleen F. 
McCoy (University of Delaware) Ornella Mich (Fondazione Bruno Kessler) Ruslan Mitkov (University of Wolverhampton) Paloma Moreda (Universidad de Alicante) Constantin Orasan (University of Wolverhampton) Luz Rello (Universidad Pompeu Fabra) Horacio Saggion (Universidad Pompeu Fabra) Advaith Siddharthan (University of Aberdeen) Lucia Specia (University of Wolverhampton) Juan Manuel Torres Moreno (University of Avignon) Markel Vigo (University of Manchester) Leo Wanner (Universidad Pompeu Fabra) Yeliz Yesilada (Middle East Technical University Northern Cyprus Campus) === Contact Address === For further information please contact us at: luz.rello at upf.edu or horacio.saggion at upf.edu From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:21:36 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:21:36 +0100 Subject: Info: ELDA Distribution Campaign 2011 Message-ID: Date: Thu, 15 Dec 2011 11:00:25 +0100 From: ELRA ELDA Information Message-ID: <4EE9C539.7080605 at elda.org> X-url: http://catalog.elra.info X-url: http://universal.elra.info X-url: http://www.elra.info/LRs-Announcements.html [Our apologies if you have received multiple copies of this announcement.] 
***************************************************************** ELDA Distribution Campaign 2011 ***************************************************************** ELDA is launching a special distribution campaign offering very favorable conditions for the acquisition of language resources, including discounts on public prices, from the ELRA Catalogue of Language Resources (see http://catalog.elra.info). This offer will be open until the end of December 2011. For more information on this offer, please contact Valérie Mapelli (mapelli at elda.org) Visit our On-line Catalogue: http://catalog.elra.info Visit the Universal Catalogue: http://universal.elra.info Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:24:20 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:24:20 +0100 Subject: Job: Assistant Chef de projet (H/F), ELDA Message-ID: Date: Thu, 15 Dec 2011 11:25:55 +0100 From: ELRA ELDA Information Message-ID: <4EE9CB33.4060708 at elda.org> X-url: http://www.elda.org/ The core activities of ELDA (Evaluation and Language resources Distribution Agency, www.elda.org) are the distribution and production of language resources and the evaluation of language technologies. As part of its production activities, ELDA is offering one position of assistant project manager (M/F). Context ELDA is taking part in a project to evaluate systems for the automatic analysis of written documents.
The competing systems must automatically answer the following questions: * How is the document structured (text zones, images...)? * What writing is present, with its type (handwritten/typed) and language (French, English, Arabic, other)? * What is the key information in the document (author, recipient, subject, date...)? To obtain data for evaluating the automatic analysis systems, original documents are being collected. Written by (paid) volunteers under a fictitious identity assigned to them, these documents are based on fictitious scenarios and document templates created by ELDA (forms, purchase orders, catalogue pages, political or commercial leaflets, greeting cards, letterheads...). Once collected, the documents are manually described so that the systems' automatic analysis can be compared with human performance. Mission Working with the project manager, the assistant project manager will supervise both the collection of the fictitious documents produced by the 1000 remote volunteers and their description by a team of annotators: * Managing volunteers and validating documents via the dedicated website * Training annotators: getting started with the software, the production guide... * Monitoring production: schedule, tasks, corrections, validation... Profile * University or engineering degree * Versatility, rigour, good interpersonal skills, autonomy * Proficiency with computer tools (office tools, web; experience with php/MySQL would be a plus) * A good level of English and experience in transcribing written documents would be appreciated Duration: 12-month full-time fixed-term contract (CDD), extension possible. This position, based in Paris 13e (RER Cité universitaire), is to be filled quickly.
The salary is negotiable depending on the job profile and the candidate's profile. Applications (CV, cover letter, salary expectations) should be sent to Matthieu Carré (carre at elda.org). They will be reviewed starting in January 2012. From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:27:57 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:27:57 +0100 Subject: Stage: Conception de documents textuels illustres (H/F), ELDA Message-ID: Date: Thu, 15 Dec 2011 11:27:44 +0100 From: ELRA ELDA Information Message-ID: <4EE9CBA0.6000905 at elda.org> X-url: http://www.elda.org/ The core activities of ELDA (Evaluation and Language resources Distribution Agency, www.elda.org) are the distribution and production of language resources and the evaluation of language technologies. As part of its production activities, ELDA is offering one internship as a designer of illustrated text documents (M/F). Context ELDA is taking part in a project to evaluate systems for the automatic analysis of written documents. The competing systems must automatically answer the following questions: * How is the document structured (text zones, images...)? * What writing is present, with its type (handwritten/typed) and language (French, English, Arabic, other)?
* What is the key information in the document (author, recipient, subject, date...)? To obtain data for evaluating the automatic analysis systems, original documents are being collected. Written by (paid) volunteers under a fictitious identity assigned to them, these documents are based on fictitious scenarios and document templates created by ELDA (forms, purchase orders, catalogue pages, political or commercial leaflets, greeting cards, letterheads...). Once collected, the documents are manually described so that the systems' automatic analysis can be compared with human performance. Mission Under the responsibility of the project manager, the candidate will produce the document templates needed for writing the documents to be collected. Templates may be: * image files that volunteers will print and then fill in by hand; * .doc or .ppt files to be filled in electronically (typed) before printing; * template letters in image format to guide the writing of handwritten or typed letters. Depending on their skills, the intern may also work on other aspects of the project, such as maintaining the collection website (php/mySQL), validating collected documents, or annotating them (i.e. describing the documents with dedicated software). Profile * University or engineering degree * Proficiency in editing with Word and PowerPoint * Image processing skills (Gimp/Photoshop, format handling) * Writing skills: spelling, grammar, inspiration (inventing fictitious scenarios) * A good level of English and/or Arabic is a plus (for producing or adapting the templates in the project's two other languages, French being the main language). Duration: long internship (4 months minimum). This internship, based in Paris 13e (RER Cité universitaire), is to be filled before spring 2012.
Applications (CV, cover letter) should be sent to Matthieu Carré (carre at elda.org). They will be reviewed starting in January 2012. From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:30:34 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:30:34 +0100 Subject: Appel: Third Workshop on Building and Evaluating Resources for Biomedical Text Mining, LREC 2012 Workshops Message-ID: Date: Fri, 16 Dec 2011 18:14:06 +0100 From: ELRA ELDA Information Message-ID: <4EEB7C5E.2070106 at elda.org> X-url: http://www.nactem.ac.uk/biotxtm2012/ [Apologies for cross-postings] First Call for Papers ======================================================================== THIRD WORKSHOP ON BUILDING AND EVALUATING RESOURCES FOR BIOMEDICAL TEXT MINING Saturday 26th May 2012 organised in conjunction with LREC2012 (21-27 May 2012, Lütfi Kirdar Istanbul Exhibition and Congress Centre, Turkey) http://www.nactem.ac.uk/biotxtm2012/ ======================================================================== Over the past decade, biomedical text mining has received a large amount of interest. Faced with the rapidly increasing volume of biomedical literature, domain experts have an ever-increasing need for tools that can help them locate and isolate relevant nuggets of information from this deluge in a timely and efficient manner. 
The response to such issues by the natural language processing community can be clearly evidenced in the biomedical natural language processing workshops that have been held over the past 10 years, in conjunction with ACL or NAACL meetings, to report progress in the field, as well as in the founding of an ACL special interest group.

Biomedical text mining applications are reliant on high quality resources. These include databases and ontologies (e.g., Biothesaurus, UMLS Metathesaurus, MeSH and the Gene Ontology) and dictionaries/computational lexicons (e.g., the BioLexicon and the UMLS SPECIALIST lexicon). Recent years have also seen a large increase in the number of freely-available corpora (e.g., GENIA, GREC, AIMED, BioInfer, CRAFT, BioDRB) annotated with an expanding range of information types. These now include not only named entities and simple relations that hold between them, but also more complex event structures and coreference, as well as higher level information about how events are to be interpreted (e.g., facts, analyses, speculations, etc.) and discourse structure. Community shared tasks and challenges (e.g., JNLPBA, LL05, Biocreative I/II/III, BioNLP'09, BioNLP 2011, i2b2, etc.) also normally involve the production of annotated corpora (on which the participating systems are trained and evaluated), and help to steer research efforts towards open research problems.

Following on from the success of two previous workshops, the workshop aims to bring together researchers who make use of biomedical text mining resources such as the above in their applications, or who are working on the development of new resources. The workshop will allow an assessment of the current state of the art of resources, and will provide a forum for the discussion of current problems, questions and open issues, which will be useful in guiding further research in this area.
Such topics are very much relevant to META-NET (a Network of Excellence consisting of 54 research centres from 33 countries), which is dedicated to building the technological foundations of a multilingual European information society. META-NET aims to push forward research to allow a rapid expansion of language technologies; such efforts can only be achieved if appropriate resources are available. Since META-NET is concerned with enhancing information access for all European citizens, submissions concerning biomedical resources for languages other than English are particularly welcome.

A further vital consideration to allow rapid building of new applications is that of interoperability and reuse. As a step towards this, several annotated corpora have been made UIMA-compliant, and are available in the U-Compare system, which allows easy construction of NLP workflows and evaluation against gold standard corpora.

Some specific questions that the workshop will aim to answer include the following:

* Among the available resources, which are the most used? What makes a good resource? How easily can resources be employed for different purposes? What efforts have been made to make resources reusable or interoperable? To what extent have these efforts been successful?
* Which resources are underused and why? What could be done to improve or extend them to increase their utility?
* Which types of resources are still lacking and what is needed urgently? Are any resources planned or in development to address such gaps? Are any resources available that cover languages other than English?
* To what extent do the existing resources support processing of text in different biomedical subdomains? How easily can they be adapted to deal with different domains?

We invite papers reporting on resources that facilitate biomedical text mining, and the process of designing, building, updating, delivering, using and evaluating them.
The workshop will focus both on the lexical and knowledge repositories themselves (e.g., terminologies, ontologies, controlled vocabularies, factual databases, annotated corpora, etc.) and on issues relating to their usability (e.g., design guidelines, standards for building resources, storage and exchange formats, interoperability issues, etc.).

Topics of interest include but are not limited to:

* Building biomedical resources: controlled vocabularies, terminologies, ontologies, corpora
* Guidelines and annotation schemas, tools, challenges, interoperability
* Reengineering existing biomedical or general language resources
* Update, evolution, extension or enrichment of resources
* Adapting resources to new sub-domains
* Interoperability of resources and standards
* Lightly annotated and noisy resources
* Tools for exploration of resources
* Data exchange formats
* Evaluation, comparison and critical assessment of resources / evaluation metrics
* Test suites

ORGANISATION

* Sophia Ananiadou, National Centre for Text Mining, University of Manchester, UK
* Kevin Bretonnel Cohen, Computational Bioscience Program, University of Colorado School of Medicine, USA
* Dina Demner-Fushman, National Library of Medicine, USA
* Paul Thompson, National Centre for Text Mining, University of Manchester, UK

DATES

February 10, 2012: Paper submissions due
March 14, 2012: Paper notification of acceptance
March 28, 2012: Camera-ready papers due
May 26, 2012: Workshop

SUBMISSIONS

Papers must describe original, completed or in progress, and unpublished work. Each submission will be reviewed by two program committee members. Accepted papers will be given up to 8 pages in the workshop proceedings, and will be presented either as an oral presentation or poster. Papers should be formatted according to the stylesheet, which will be provided on the LREC 2012 website (http://www.lrec-conf.org/lrec2012/).
Please submit papers in PDF format to: https://www.softconf.com/lrec2012/BiomedTextMining2012/

Paper review will be blind, so papers should not include authors' names and affiliations. Accepted papers will be published in the workshop proceedings.

When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012.

PC MEMBERS

Jari Björne, University of Turku, Finland
Olivier Bodenreider, National Library of Medicine, USA
Wendy Chapman, University of Pittsburgh, USA
Hongfang Liu, Mayo Clinic, USA
Naoaki Okazaki, Tohoku University, Japan
Sampo Pyysalo, University of Manchester, UK
Andrey Rzhetsky, University of Chicago, USA
Stefan Schulz, Medical University Graz, Austria
Lucy Vanderwende, Microsoft, USA
Karin Verspoor, NICTA, Australia
John Wilbur, NCBI, NLM, NIH, USA
Stephen Wu, Mayo Clinic, USA
Pierre Zweigenbaum, LIMSI, France

Workshop contact person: Paul.Thompson at manchester.ac.uk
National Centre for Text Mining, School of Computer Science, University of Manchester, UK

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:33:36 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:33:36 +0100
Subject: Call: Workshop on Language Technology for Patent Data: Language Resources and Evaluation, LREC 2012 Workshops
Message-ID:
Date: Fri, 16 Dec 2011 18:25:42 +0100
From: ELRA ELDA Information
Message-ID: <4EEB7F16.9070301 at elda.org>
X-url: http://workshops.elda.org/ltpd2012/

[Apologies for cross-postings]

*CALL FOR PAPERS*

Workshop on Language Technology for Patent Data: Language Resources and Evaluation

To be held in conjunction with the 8th International Language Resources and Evaluation Conference (LREC 2012)
27 May 2012 (afternoon)
Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey
http://workshops.elda.org/ltpd2012/

*Workshop Description*

In the last few years, the use of patents in automatic processing has attracted growing interest in the NLP community, particularly in the context of Machine Translation (MT) and Cross-Lingual Information Retrieval (CLIR). This has now become a major topic and, besides the development of the technology itself, some key questions remain regarding the resources available and the way the quality of the technology is evaluated. A large number of language resources are already available to the community, but the development of systems, in particular statistical ones, always requires more data. Given the growing interest in patents and their processing, a workshop that gathers all those involved in the different aspects of the topic is a good opportunity to move forward.

The patent domain itself is growing, and the amount of potential material keeps increasing. It is this potential material that gives the community hope of improving its systems. For instance, in China, the number of patents has tripled in 5 years and now exceeds 1 million published documents per year. EPO (the European Patent Office) uses more than 150 translation pairs per day.
Every patent office receives more and more patents every day, relies daily on automatic tools to translate documents, searches for existing patents and their translations, manages complex content, etc. As we can see, this is a domain in considerable demand, and since the content of patents is technical and requires a high level of skill in a specific domain, providing documents that are sufficiently understandable to end users is very complex. This is a real challenge for all NLP developers.

Above all, this challenge is about corpora and their management. The main topic concerns their acquisition and how to collect useful data. For most researchers, this consists in harvesting web pages, cleaning them, extracting the content useful for a specific task, aligning the sentences, etc. The acquisition task may also be done using OCR tools on PDF files. Monolingual corpora are easier to retrieve (e.g. from databases) than parallel corpora. However, parallel translations exist, as do aligned corpora or corpora that could easily be aligned. Following the question of the acquisition of such documents, there is that of database management. One could say that these questions are not specific to patent data; however, this workshop would like to focus on this particular domain and make some effort to improve things.

Currently, the corpora are mainly used for MT. For a technical end-user in a patent office, the end goal is to understand the content of a document. This may not require a very high quality translation, since this person only needs to grasp the relevance of the document. However, in MT, we still need to measure the performance of the systems quantitatively. This is basically done using automatic and/or human measures, and most system developers use typical automatic metrics such as BLEU to obtain their results.
Even if the drawbacks of such metrics are well known, they can still be relevant, for instance, to compare different versions of a system. However, even when using BLEU, the content of patent documents is very particular, which implies that different kinds of linguistic specificity need to be tackled: these include the expected terminological level, but also a syntactic level and a semantic one, and even the structure of the documents may differ from that of other documents (for instance, patents typically consist of a title, an abstract, a technical description of the invention, and a list of novel claims). Human measures may also be difficult to apply, as patent documents are written in a way that makes them difficult for the layman to read. Furthermore, both automatic and human evaluations should allow a deep analysis of the results, which is not trivial when working with patents. However, given the often formulaic nature of the text found in patents -- which is enforced on the author by legal constraints -- there may be opportunities to exploit this for evaluation. For instance, claims are constructed as a single sentence with an introductory phrase and a body linked by frequently occurring terms such as "in a certain embodiment", "consisting essentially of", and clauses and lists introduced using colons, e.g. "comprising: ..."

The use of patents in CLIR suffers from the same kind of issues, both for the evaluation of systems and for the collection of corpora. Sentence alignment may also have specific issues related to the content of the documents, and many other types of tools may face their own issues when applied to patents. Across all these technologies, one can see that their use raises several challenges, such as the integration of tools into patent information applications. The different tools should help end-users to search, examine or classify patent documents, most often working from translations of documents not available in English.
Web services should also extend the tools and be connected through workflows, helping end-users in their daily work.

Across all the topics mentioned above, we would like to contribute to improving the challenging patent field by sharing knowledge from the whole community. The topics addressed during the workshop will include (but are not limited to):

- Corpora aspects: collecting data, cleaning, alignment, parallel corpora, etc.;
- Evaluation of technologies: definition of metrics, patent specificity;
- Integration of patent applications: web services, end-user applications;
- IPR issues and licensing.

*Organising committee*

Heidi Depraetere (Crosslang, Belgium)
Olivier Hamon (ELDA -- Evaluations and Language resources Distribution Agency, France)
John Tinsley (PLUTO -- Patent Language Translations Online, Ireland)

*Programme committee*

Victoria Arranz (ELDA -- Evaluations and Language resources Distribution Agency, France)
Alexandru Ceasusu (PLUTO -- Patent Language Translations Online, Ireland)
Khalid Choukri (ELDA, France)
Terumasa Ehara (Yamanashi Eiwa College, Japan)
Cristina España-Bonet (UPC, Spain)
Mihai Lupu (IRF and ESTeam, Austria)
Bertrand Le Chapelain (EPO, Netherlands)
Bente Maegaard (University of Copenhagen, Denmark)
Bruno Pouliquen (World Intellectual Property Organization, Switzerland)
Lucia Specia (University of Sheffield, United Kingdom)
Gregor Thurmair (Linguatec, Germany)
Dan Wang (China Patent Information Center, China)
Shoichi Yokoyama (Yamagata University, Japan)
More to follow...

*Important dates*

Deadline for submission: Friday 24 February 2012
Notification of acceptance: Friday 23 March 2012
Final version due: Friday 30 March 2012
Workshop: 27 May 2012 (afternoon)

*Submission Format*

Full papers up to 8 pages should be formatted according to LREC 2012 guidelines and be submitted through the online submission form (https://www.softconf.com/lrec2012/PATENT2012/) on START.
For further queries, please contact Olivier Hamon at hamon_at_elda_dot_org.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LREMap-2012.

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:36:10 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 14:36:10 +0100
Subject: Call: Describing Language Resources with Metadata: Towards Flexibility and Interoperability in the Documentation of Language Resources, LREC 2012 Workshops
Message-ID:
Date: Fri, 16 Dec 2011 18:37:36 +0100
From: ELRA ELDA Information
Message-ID: <4EEB81E0.1030306 at elda.org>
X-url: http://workshops.elda.org/metadata2012/

*CALL FOR PAPERS*

Workshop on
*Describing Language Resources with Metadata:*
*Towards Flexibility and Interoperability in the Documentation of Language Resources*

To be held in conjunction with the 8th International Language Resources and Evaluation Conference (LREC 2012)
*22 May 2012, Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey*
http://workshops.elda.org/metadata2012/

/Deadline for submission: 19 February 2012/

*Description*

The description of Language Resources (LRs)
continues to be a crucial point in the life cycle of LRs and, more particularly, in their sustainable exchange. This has been so for a number of repositories or LR distribution centres in place (ELRA, GSK, LDC, OLAC, TST-Centrale, BAS, among others), which house LR catalogues following some proprietary metadata schema. A number of projects and initiatives have also focused in recent years on the sharing of LRs (ENABLER, CLARIN, FLaReNet, PANACEA, META-SHARE), for example for Language Technology (LT). From these initiatives, a consensus is emerging on a number of requirements for standardized metadata:

1. There should be a common publication channel for the LR descriptions in the world.
2. This channel allows users to carry out easy and efficient LR data discovery and possible subsequent retrieval of LRs.
3. Expert knowledge is required to create the data model for the metadata description.
4. Subject matter experts (both researchers and LR/LT providers and developers) are required to provide the content for the data model.
5. The data model needs to be clear, expressive, flexible, customizable and interoperable.
6. Metadata have to cater for different user groups, ranging from providers to consumers (both individuals and organisations).

This applies both to the information contained in the metadata and to the supporting tool infrastructure for creating, maintaining, distributing, harvesting and searching the metadata.

Currently several initiatives focus on metadata. The Component MetaData Infrastructure (CMDI, ISO TC 37 SC 4 work item for ISO 24622) grew out of work done within initiatives like ENABLER and CLARIN; it allows standard data categories (for example from ISO 12620, isocat.org) to be combined into components, which are in turn combined into metadata profiles. Early versions of this model have been operational in repositories such as ELRA's, which complied with the work done within INTERA.
FLaReNet, as the result of a permanent and cyclical consultation, has issued a set of main recommendations in which a global infrastructure of uniform and interoperable metadata sets appears among the Top Priorities for the field of LRs. For use within HLT, META-SHARE provides a fully-fledged schema for the description of LRs, in the framework of the component model, covering all the current resource types and media types in use, at all stages of a resource's life-cycle.

Our aim is to learn from one another's experiences and plans in this area. Making resources available to others and reusing them in other projects has never been more widely accepted as a sensible and efficient way to avoid wasting effort and resources. However, when it comes to the details, there is still a vast number of problems. This workshop will be a forum to address issues and challenges in the concrete work with metadata for LRs, not restricted to a single initiative for archiving LRs.

The current state of the art for metadata provision allows for a very flexible approach, catering for the needs of different archives and communities, and referring to common data category registries that describe the meaning of a data category at least to authors of metadata. Component models for metadata provision are used, for example, by CLARIN and META-SHARE, but there is also increased flexibility in other metadata schemas such as Dublin Core, which is usually not seen as appropriate for a meaningful description of language resources.

Topics of interest are:

1. Infrastructures for creating components and profiles for metadata
2. Editing and creating metadata
3. Porting legacy metadata
4. Metadata as a resource
5. Maintenance of metadata
6. Classification of language resources
7. Providing metadata concepts
8. Creating components and profiles
9. Services harvesting and interpreting metadata
10.
Experience from the large LR data center catalogues: LDC, ELRA, BAS, and how to interoperate with them
11. Controlled vocabularies, terminology and metadata description
12. Formal models for metadata representation and standardized models of serialisation
13. Customization and reuse of metadata schemas
14. Plans or experiences with emerging metadata infrastructures, for example from CLARIN & META-SHARE
15. Experiences with component-based metadata infrastructures
16. Integration and conversion of multiple repositories: experiences from META-SHARE, CESAR, METANET4U and META-NORD, etc.
17. Standardization issues for metadata

We invite submissions for full papers and system demonstrations that address these questions and other related issues relevant to the workshop.

*Workshop Programme and Audience Addressed*

This full-day workshop aims to bring together technology-oriented working groups on metadata modeling or schema creation and both researchers and producers who create metadata in the course of their work. Those interested in using metadata in their projects should gain insights and come away with a clear idea of how to either describe their LRs or convert their schema. Those who have recently developed a model can share their experience, and those who have specific concerns about the interoperability of metadata schemas developed by the various initiatives can open the discussion in search of joint solutions. Tools and tool infrastructures should also be part of the discussion, given that the initiatives also provide editors, mappings, search interfaces, and component and profile registries.
*Organising Committee*

Victoria Arranz (ELDA/ELRA, Paris, France, arranz at elda.org)
Daan Broeder (MPI, Nijmegen, The Netherlands, daan.broeder at mpi.nl)
Bertrand Gaiffe (ATILF, Nancy, France, Bertrand.Gaiffe at atilf.fr)
Maria Gavrilidou (Athena Research and Innovation Center, Athens, Greece, maria at ilsp.athena-innovation.gr)
Monica Monachini (CNR-ILC, Pisa, Italy, monica.monachini at ilc.cnr.it)
Thorsten Trippel (University of Tübingen, Tübingen, Germany, thorsten.trippel at uni-tuebingen.de)

*Programme Committee*

Helen Aristar-Dry (Michigan State University, USA)
Núria Bel (UPF, Barcelona, Spain)
Antonio Branco (University of Lisbon, Portugal)
Lars Borin (Språkbanken, Sweden)
Khalid Choukri (ELDA/ELRA, Paris, France)
Thierry Declerck (DFKI, Germany)
Matej Durco (Austrian Academy of Sciences, Austria)
Gil Francopoulo (CNRS-LIMSI-IMMI + TAGMATICA, Paris, France)
Francesca Frontini (CNR-ILC, Pisa, Italy)
Erhard Hinrichs (Universität Tübingen, Germany)
Penny Labropoulou (ILSP-Athena, Athens, Greece)
Valérie Mapelli (ELDA/ELRA, Paris, France)
Jan Odijk (Universiteit Utrecht, The Netherlands)
Elena Pierazzo (Kings College, London, UK)
Laurent Romary (INRIA, France)
Mike Rosner (University of Malta, Malta)
Andreas Witt (IDS, Germany)
Peter Wittenburg (MPI, The Netherlands)
Tamás Varadi (Hungarian Academy of Sciences, Hungary)
Marta Villegas (UPF, Barcelona, Spain)
Sue Ellen Wright (Kent State University, USA)

*Important dates*

Submission of full papers: Sunday 19 February 2012
Notification of acceptance of papers and demonstrations: Thursday 22 March 2012
Submission of final version: Saturday 31 March 2012
Final programme available: Friday 13 April 2012
Workshop: Tuesday 22 May 2012

*Submission*

Authors should use the START system accessible from https://www.softconf.com/lrec2012/Metadata2012 and the LREC author's kit for submitting a two-column article of 4 to 8 pages.
For further queries, please contact Victoria Arranz at arranz at elda.org or Thorsten Trippel at thorsten.trippel at uni-tuebingen.de. /When submitting a paper through START, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012//. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012./ ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 13:42:01 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Sun, 18 Dec 2011 14:42:01 +0100 Subject: Appel: Rencontres Jeunes Chercheurs 2012 ED 268 Paris 3 Message-ID: Date: Thu, 15 Dec 2011 14:44:29 +0100 From: RJC 2012 Message-ID: X-url: http://www.univ-paris3.fr/rjc2012 Bonjour, Veuillez trouver ci-joint l'appel ? communication pour les RJC 2012. Veuillez diffuser l'appel au sein de votre laboratoire au plus grand nombre. Nous vous prions de nous excuser pour les envois multiples. Cordialement, Jiayin Gao Comit? d'organisation -------------------------------------------------------------------- Ch?r(e)s doctorant(e)s, L'?cole Doctorale 268 *? Langage et langues : description, th?orisation, transmission ? *de l'Universit? 
Sorbonne Nouvelle - Paris 3 is organizing the 15th Rencontres Jeunes Chercheurs (RJC 2012, the Young Researchers' Meeting) on 15 and 16 June 2012. The call for papers follows. For more information, visit the RJC website: http://www.univ-paris3.fr/rjc2012

*École Doctorale 268 "Langage et langues : description, théorisation, transmission"*
*XVèmes Rencontres Jeunes Chercheurs*

Founded in 1998, the Rencontres Jeunes Chercheurs of the doctoral school "Langage et langues" (ED 268, Université Paris III) give young researchers enrolled in a doctoral or research Master's program the opportunity to present their work as a talk or a poster.

At the crossroads of disciplines: inter- and transdisciplinarity in the language sciences

Since the second half of the 19th century, knowledge has undergone a process of fragmentation, both scientific and political, that has helped shape the current research landscape. New disciplines, such as psychology and sociology, have become institutionalized. At the same time, the various fields of knowledge have grown increasingly specialized, in the humanities as in the other sciences. In recent years, a sense of "identity crisis" (Bouvier, 2004) among the disciplines has shown itself in the growing number of research programs that aim to promote collaboration or confrontation between different specialties in order to study phenomena as a whole. The study of linguistic facts requires multiple points of view (the study of sound, structure, meaning, etc.) and borrowings from logic, the cognitive sciences, anthropology, history, etc. The language sciences thus prove emblematic of these exchanges between fields of knowledge.

Taking an interest in the relations between disciplines means questioning the notion of "discipline" itself, and analyzing the relations of inclusion ("subdisciplines", "components") and the internal boundaries of a given discipline. A discipline cannot be delimited without appealing to criteria that are at once epistemological (definition of the object of study, methodology, formulation of a coherent set of explanations, etc.), social and historical (institutionalization).

The possible modes of interaction between disciplines must also be defined. Following Frédéric Darbellay (2005) and Patrick Charaudeau (2010), we distinguish three: "pluridisciplinarity", "interdisciplinarity" and "transdisciplinarity". A research approach is "pluridisciplinary" when each discipline retains its autonomy, methods and tools; it is "a juxtaposition of points of view, each of which brings particular knowledge about the phenomenon under study" (Charaudeau, 2010). An approach is "interdisciplinary" when it strives to bring several disciplines into dialogue on a single theme, thereby seeking "to establish genuine connections between concepts, analytical tools and modes of interpretation" (*ibid.*). Finally, an approach is "transdisciplinary" when it transcends disciplines, achieving an "integration of knowledge [...] such that a *sui generis* discourse emerges, constructing its own place of thought" (*ibid.*).

Inter- and transdisciplinarity cannot be reduced to a mere "intellectual fashion". They can be seen as a "precautionary principle" that makes it possible to put at a distance "the visions and divisions of scientific work that the disciplines impose". As Nicolas Freymond *et al.* (2003) point out: "forgetting that it is the point of view that creates the object leads to confining scientific practice within limits that have no theoretical relevance". Critical recourse to several (sub)disciplines or disciplinary currents therefore seems necessary.
In this perspective, RJC 2012 invites participants to reflect on the possibilities of exchange (of concepts, methods, results, etc.), confrontation, or even fusion between different specialties. We will particularly welcome papers that address the following questions:

- What is a "discipline"? What are the relations between "disciplines" and "knowledge"? How did the current disciplines and subdisciplines take shape historically?
- How have the theories and practices of inter- and transdisciplinarity evolved socio-historically?
- Is monodisciplinarity still possible in the language sciences?
- What prospects does inter- and transdisciplinary research offer?
- What are the risks of breaking down the barriers between disciplines? What are its limits and constraints?
- What bridges can really be built between disciplines?
- How do the different specialties position themselves with respect to inter- and transdisciplinary exchanges?
- How is interdisciplinarity built within research teams?

*Selected bibliography*

- BOUTET, Josiane & MAINGUENEAU, Dominique (2005) « Approches interdisciplinaires des pratiques langagières et discursives », *Langage et société*, 114, Paris, MSH.
- BOUVIER, Pierre (2004) « Interdisciplinarité, monodisciplinarité, transdisciplinarité » [online], *Socio-anthropologie*, 14, URL: http://socio-anthropologie.revues.org/index372.html
- CHARAUDEAU, Patrick (2010) « Pour une interdisciplinarité "focalisée" dans les sciences humaines et sociales » [online], *Questions de Communication*, 17, URL: http://www.patrick-charaudeau.com/Pour-une-interdisciplinarite.html
- CHISS, Jean-Louis & PUECH, Christian (1999) *Le Langage et ses disciplines*, Paris, Duculot.
- DARBELLAY, Frédéric (2005) *Interdisciplinarité et trans-disciplinarité en analyse des discours. Complexité des textes, intertextualité et transtextualité*, Genève, Slatkine.
- FREYMOND, Nicolas *et al.* (2003) « Ce qui donne sens à l'interdisciplinarité » [online], *A contrario*, 1, URL: www.cairn.info/revue-a-contrario-2003-1-page-3.htm
- KOURILSKY, François (dir.) (1990) *Carrefour des sciences*, Actes du colloque du CNRS « Inter-disciplinarité », Paris, Éditions du CNRS.
- ORIGGI, Gloria & DARBELLAY, Frédéric (dir.) (2010) *Repenser l'interdisciplinarité*, Genève, Slatkine.
- RAMADIER, Thierry (2011) « L'interdisciplinarité au service de la complexité » [online], *Articulo - Journal of Urban Research*, Book Reviews, URL: http://articulo.revues.org/1731

*Invited speakers:*
Dan SAVATOVSKY, Professor, Université de Bourgogne

*Scientific Committee:*
Violaine BIGOT, Maria CANDEA, Francine CICUREL, Jeanne-Marie DEBAISIEUX, Cécile FOUGERON, Jean-Marie FOURNIER, Emmanuel FRAISSE, Florentina FREDET, Cédric GENDROT, Kim GERDES, Daniel GILE, Luca GRECO, Jean-Patrick GUILLAUME, Pierre HALLÉ, Kkka KABORE, Dominique KLINGLER, Marie-Christine LALA, Florence LEFEUVRE, Jean-Léo LÉONARD, Marc-Antoine MAHIEU, Aliyah MORGENSTERN, Jean-Paul NARCY-COMBES, Gilles PHILIPPE, Claire PILLOT-LOISEAU, Konstantin POZDNIAKOV, Christian PUECH, Nicolas QUINT, Sandrine REBOUL-TOURÉ, Annie RIALLAND, Rachid RIDOUANE, Anne SALAZAR ORVIG, Pollet SAMVELIAN, Dan SAVATOVSKY, Jacqueline VAISSIÈRE, Daniel VÉRONIQUE, Martine VERTALIER

*Organizing Committee:*
Marion CARNIS, Alejandro DIAZ VILLALBA, Aurélia ELALOUF, Raphaëlle FOUILLET, Jiayin GAO, Maximilien GUÉRIN, Marine LE MENÉ, Laura NICOLAS, Marie REETZ, Marie RIVIÈRE, Camille SIMON, Marie VIAIN

*The conference is open to all: Master's students, doctoral students, researchers...*
*Free admission, subject to available seating.*
*Participants will receive a certificate of attendance.*

*Submission*

Proposals for talks must be set in Times New Roman 12 pt, single-spaced, as an abstract of at most 1000 words (references included); poster proposals take the form of an abstract of at most 500 words (references included). For phonetic transcriptions, please use the downloadable SILDoulos font.

Proposals should be sent by e-mail to the Organizing Committee (rjc.ed268.2012 at gmail.com) as two files in .rtf format. The first, named "anon_nom-de-l-auteur_rjc2012.rtf", where "nom-de-l-auteur" is the author's surname (for example "anon_DUPONT_rjc2012.rtf"), must contain:

- *the title*
- *5 keywords*
- *the discipline(s)*
- *the abstract*
- the type of presentation (talk or poster)

The second, named "nom-de-l-auteur_rjc2012.rtf" (for example "DUPONT_rjc2012.rtf"), must contain the following information in addition to the above:

- *contact details (surname, first name, e-mail and postal address)*
- *affiliation (name of the university, name of the laboratory)*
- *level of study (Master's / doctorate / post-doc; for a doctorate, specify the number of years)*
- *research supervisor*

*Only one submission per participant will be considered.*

Talks and posters may be given in English or in French. Talks last 20 minutes, followed by 10 minutes of discussion.

*Contact and submission:* rjc.ed268.2012 at gmail.com

*Calendar*
Submission deadline: 15 February 2012
Notification and start of registration: 30 March 2012
Deadline for registration and for depositing the abstract: 15 April 2012
Conference: *15 & 16 June 2012*

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:09:26 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:09:26 +0100
Subject: Appel: Ireland International Conference on Education (IICE-2012), Reminder
Message-ID:

Date: Thu, 15 Dec 2011 23:16:35 +0000 (GMT)
From: Linda Woods
Message-ID: <37988060.493992.1323990995212.JavaMail.open-xchange at email.1and1.co.uk>
X-url: http://www.iicedu.org/

Apologies for cross-postings. Kindly email this call for papers to your colleagues, faculty members and postgraduate students.

CALL FOR PAPERS

**********************************************************
Ireland International Conference on Education (IICE-2012),
April 16-18, 2012, Dublin, Ireland (www.iicedu.org)
**********************************************************

The IICE is an international refereed conference dedicated to the advancement of the theory and practice of education. The IICE promotes collaborative excellence between academics and professionals in education. The aim of the IICE is to provide an opportunity for academics and professionals from various educational fields with cross-disciplinary interests to bridge the knowledge gap and to promote research esteem and the evolution of pedagogy. IICE 2012 invites research papers that encompass conceptual analysis, design implementation and performance evaluation.
All accepted papers will appear in the proceedings, and modified versions of selected papers will be published in special issues of peer-reviewed journals.

The topics in IICE-2012 include but are not confined to the following areas:

* Academic Advising and Counselling
* Art Education
* Adult Education
* APD/Listening and Acoustics in Education Environment
* Business Education
* Counsellor Education
* Curriculum, Research and Development
* Competitive Skills
* Continuing Education
* Distance Education
* Early Childhood Education
* Educational Administration
* Educational Foundations
* Educational Psychology
* Educational Technology
* Education Policy and Leadership
* Elementary Education
* E-Learning
* E-Manufacturing
* ESL/TESL
* E-Society
* Geographical Education
* Geographic information systems
* Health Education
* Higher Education
* History
* Home Education
* Human Computer Interaction
* Human Resource Development
* Indigenous Education
* ICT Education
* Internet technologies
* Imaginative Education
* Kinesiology & Leisure Science
* K12
* Language Education
* Mathematics Education
* Mobile Applications
* Multi-Virtual Environment
* Music Education
* Pedagogy
* Physical Education (PE)
* Reading Education
* Writing Education
* Religion and Education Studies
* Research Assessment Exercise (RAE)
* Rural Education
* Science Education
* Secondary Education
* Second life Educators
* Social Studies Education
* Special Education
* Student Affairs
* Teacher Education
* Cross-disciplinary areas of Education
* Ubiquitous Computing
* Virtual Reality
* Wireless applications
* Other Areas of Education

Important Dates:

* Extended Abstract (Work in Progress) Submission Date: January 15, 2012
* Research Paper, Student Paper, Case Study, Report Submission Date: January 20, 2012
* Proposal for Workshops: December 15, 2011
* Notification of Workshop Acceptance/Rejection: December 20, 2011
* Poster Proposal Submission Date: January 10, 2012
* Notification of Poster Acceptance/Rejection: January 15, 2012
* Notification of Extended Abstract (Work in Progress) Acceptance/Rejection: January 25, 2012
* Notification of Research Paper, Student Paper, Case Study, Report Acceptance/Rejection: January 30, 2012
* Camera Ready Paper Due: February 10, 2012
* Early Bird Registration Deadline (Authors and Participants): February 15, 2012
* Late Bird Registration Deadline (Authors and Participants): March 10, 2012
* Conference Dates: April 16-18, 2012

For further information please visit IICE-2012 at www.iicedu.org

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:10:56 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:10:56 +0100
Subject: Appel: ESSLLI 2012 Student Session
Message-ID:

Date: Fri, 16 Dec 2011 09:29:47 +0100
From: ESSLLI 2012 Student Session
Message-ID: <4EEB017B.6020905 at loriweb.org>
X-url: http://loriweb.org/ESSLLI2012StuS/
X-url: http://esslli2012.pl/

** APOLOGIES FOR MULTIPLE POSTINGS **
*** PLEASE DISTRIBUTE TO STUDENTS ***

First Call for Papers

*ESSLLI 2012 STUDENT SESSION*

Held during the 24th European Summer School in Logic, Language and Information
Opole, Poland, August 6-17, 2012

Deadline for submissions: March 20, 2012
http://loriweb.org/ESSLLI2012StuS/

*ABOUT:*

The Student Session of the 24th European Summer School in Logic, Language, and Information (ESSLLI) will take place in Opole, Poland on August 6-17, 2012.
We invite submissions of original, unpublished work from students in any area at the intersection of Logic & Language, Language & Computation, or Logic & Computation. Submissions will be reviewed by several experts in the field, and accepted papers will be presented orally or as posters and will appear in the student session proceedings. This is an excellent opportunity to receive valuable feedback from expert readers and to present your work to a diverse audience.

ESSLLI 2012 will feature a wide range of foundational and advanced courses and workshops in all areas of Logic, Language, and Computation. Consult the main ESSLLI website (link below) for further information.

*INSTRUCTIONS FOR AUTHORS:*

Authors must be students, i.e., may not have received the Ph.D. degree before August 2012. All submissions must be in PDF format and be submitted to the conference EasyChair website. Submissions may be singly or jointly authored. No one may submit more than one singly and one jointly authored paper.

There are two types of papers. Long papers of up to 8 pages will be considered for both oral presentation and the poster session. Short papers of up to 4 pages will be considered as submissions for the poster session.

Submissions must be anonymous, without any identifying information, and must be received by *March 20, 2012*. More detailed guidelines regarding submission can be found on the Student Session website: http://loriweb.org/ESSLLI2012StuS/
Links to previous years' proceedings are also available there.

Please direct inquiries about submission procedures or other matters relating to the Student Session to esslli2012stus at loriweb.org.

For general inquiries about ESSLLI 2012, please consult the main ESSLLI 2012 page, http://esslli2012.pl/.

Kind regards,

The ESSLLI 2012 Student Session Organization Committee,

Rasmus K. Rendsvig (Roskilde University) (chair)
Anders Johannsen (University of Copenhagen)
Dominik Klein (Tilburg University)
Margot Colinet (Université de Paris 7)
Matthijs Westera (Universiteit van Amsterdam)
Maxim Haddad (University of Osnabrück)

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:12:00 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:12:00 +0100
Subject: Appel: LACL 2012
Message-ID:

Date: Fri, 16 Dec 2011 11:34:51 +0100
From: Denis Bechet
Message-Id: <201112161134.51185.bechet at info.univ-nantes.fr>
X-url: http://lacl.gforge.inria.fr/lacl-2012

LACL 2012
Logical Aspects of Computational Linguistics
July 2-4, 2012
LINA, Nantes, France
http://lacl.gforge.inria.fr/lacl-2012

PRESENTATION

LACL'2012 is the 7th edition of a series of international conferences on logical and formal methods in computational linguistics. This conference addresses the use of type theoretic, proof theoretic and model theoretic methods for describing natural language syntax and semantics, as well as the implementation of natural language processing software relying on such models. It will be held at the LINA, Nantes, France.

Scope: Computer scientists, linguists, mathematicians and philosophers are invited to present their work on the use of logical methods in computational linguistics and natural language processing, in natural language analysis, generation or acquisition.
Topics of either theoretical or applied interest include, but are not limited to:

* logical foundation of syntactic formalisms
    o categorial grammars
    o minimalist grammars
    o dependency grammars
    o tree adjoining grammars
    o model theoretic syntax
    o formal language theory for natural language processing
    o data-driven approaches
* logics for semantics of lexical items, sentences, discourse and dialog
    o discourse theories
    o Montague semantics
    o compositionality
    o dynamic logics
    o game semantics
    o situation semantics
    o generative lexicon
    o categorical semantics
* applications of these models to natural language processing
    o software for natural language analysis
    o software for acquiring linguistic resources
    o software for natural language generation
    o software for information extraction
    o inference tasks
    o evaluation
    o scalability

SUBMISSIONS and PROCEEDINGS

Articles should be written in the LaTeX format of LNCS by Springer (see authors instructions at http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0) and should not exceed 16 pages (including figures, bibliography, possible appendices). It is expected that each accepted paper be presented at the meeting by one of its authors.

Papers must be submitted electronically in PDF format at http://www.easychair.org/conferences/?conf=lacl2012

PROCEEDINGS

Accepted papers will be published in advance of the meeting as a volume of the FoLLI subline of Lecture Notes in Computer Science (LNCS) by Springer (http://www.springer.com/lncs).

INVITED SPEAKERS

To be announced.

PREVIOUS EDITIONS

A selection of the 1995 articles appeared in a special issue of the Journal of Logic, Language and Information (7:4, 1998). The proceedings of the international conferences LACL'96, LACL'97, LACL'98, LACL'2001, LACL'2005 and LACL'2011 appeared in the series Lecture Notes in Artificial Intelligence (volumes 1328, 1582, 2014, 2099, 3492, 6736) published by Springer.
IMPORTANT DATES

Paper submission deadline: February 5, 2012
Notification of acceptance: March 25, 2012
Camera ready copies due: April 8, 2012
Conference dates: July 2-4, 2012

CONTACTS

Alexandre.Dikovsky at univ-nantes.fr and Denis.Bechet at univ-nantes.fr

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:15:54 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:15:54 +0100
Subject: Appel: TALN'2012 (Pendant JEP-TALN-RECITAL), 4 au 8 juin 2012, Grenoble
Message-ID:

Date: Fri, 16 Dec 2011 16:27:25 +0100
From: Hervé Blanchon
Message-Id: <8B926340-50AC-4A58-B0F1-41F7B2014AC6 at imag.fr>
X-url: http://www.jeptaln2012.org/

Hello,

The TALN submission site is now open. Instructions for authors are at http://www.jeptaln2012.org/?page_ide and the submission links are at http://www.jeptaln2012.org/?page_idi

Please note that this year the style files differ significantly from those of previous conferences, because they have been optimized for on-screen reading. We hope in this way to reduce the environmental impact of the use of our articles by avoiding printouts caused by an unsuitable format.

In addition, the following invited speakers have confirmed that they will attend JEP-TALN-RECITAL 2012 (titles to come):

Ian MADDIESON, University of California, Berkeley
Jacqueline LEON, CNRS, Paris
Yoshinori SAGISAKA, University of Waseda, Tokyo
Hans USZKOREIT, DFKI, Saarbrücken

We hope to receive a great many papers from you!

Happy holidays to all!

------------------------------------------------------------------------
Third CALL FOR PAPERS

TALN'2012
http://www.jeptaln2012.org/
19th conference on Traitement Automatique des Langues Naturelles (natural language processing)
Grenoble, 4 to 8 June 2012

CALENDAR

- LONG papers:
    - Submission deadline: 24 January 2012
    - Notification to authors: 1 March 2012
    - Deadline for final versions: 15 April 2012
- SHORT papers and Demonstrations:
    - Submission deadline: 15 March 2012
    - Notification to authors: 16 April 2012
    - Deadline for final versions: 20 April 2012

PRESENTATION

Organized by the GETALP team of the LIG (Laboratoire Informatique de Grenoble), the LIDILEM (Laboratoire de linguistique et didactique des langues étrangères et maternelles) and the DPC department of Gipsa-lab, the TALN'2012 conference will be held from 4 to 8 June 2012 in Grenoble as part of the joint conference JEP-TALN-RECITAL'2012.

JEP-TALN-RECITAL'2012 brings together the 29th edition of the Journées d'Étude sur la Parole (JEP'2012), the 19th edition of the conference on Traitement Automatique des Langues Naturelles (TALN'2012) and the 14th edition of the Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL'2012).

For the fourth time, after Nancy in 2002, Fès in 2004 and Avignon in 2008, the AFCP (Association Francophone pour la Communication Parlée) and the ATALA (Association pour le Traitement Automatique des Langues) are jointly organizing their main conferences in order to bring together in one place the two communities working on spoken and written language processing.
The TALN'2012 conference will include oral and poster presentations, invited talks and workshops. The official language of the conference is French. Papers in English are accepted from participants who are not French speakers.

TYPES OF PAPERS

Two paper formats are planned: long papers (11 to 14 pages) and short papers (6 to 8 pages). Authors are invited to submit two types of papers:

- articles presenting original research,
- position papers presenting a point of view on the state of research in natural language processing (TAL, traitement automatique de la langue), grounded in solid experience of the field.

Long papers will be presented as talks, short papers as posters.

THEMES

Papers may address any of the usual topics of natural language processing, including but not limited to:

- Analysis and generation in the following areas:
    - Phonetics
    - Phonology
    - Morphology
    - Syntax
    - Semantics
    - Discourse
- Development of linguistic resources for natural language processing:
    - Databases containing morphological, syntactic, semantic and/or phonological information
    - Grammars
    - Lexicons
    - Ontologies
- Corpus linguistics
- Applications of natural language processing:
    - Sentiment or opinion analysis
    - Automatic categorization or classification
    - Word sense disambiguation
    - Human-machine dialogue in natural language
    - Computer-assisted teaching
    - Automatic indexing
    - Information retrieval and extraction
    - Automatic summarization
    - Anaphora resolution
    - Question-answering systems
    - Machine translation
    - Semantic Web
- Approaches:
    - Formal linguistic approaches intended to support automatic processing
    - Symbolic
    - Logical
    - Statistical
    - Based on machine learning
- Position papers presenting a point of view on natural language processing

SELECTION CRITERIA

Submissions will be reviewed by at least two specialists in the field. For research papers, the following will be considered in particular:

- relevance to the conference themes,
- the importance and originality of the contribution,
- the correctness of the scientific and technical content,
- critical discussion of the results, in particular with respect to other work in the field,
- the positioning of the work in the context of international research,
- the organization and clarity of the presentation.

For position papers, preference will be given to:

- breadth of view and consideration of the state of the art,
- the originality and impact of the point of view presented.

Selected papers will be published in the conference proceedings. From among the accepted papers, the program committee will select one (the TALN prize) to be recommended for publication, in an extended version, in the journal Traitement Automatique des Langues (Revue TAL).

SUBMISSION PROCEDURE

Papers must be written in French by French speakers, and in English by those who do not have a command of French. A LaTeX style file and a Word template are available on the conference website (http://www.jeptaln2012.org/). Submission is online (see the instructions on the conference website).
Contact: herve.blanchon at imag.fr and Georges.Antoniadis at u-grenoble3.fr

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:22:41 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:22:41 +0100
Subject: Job: Stage M2, Info-linguiste, Scan research
Message-ID:

Date: Fri, 16 Dec 2011 17:54:54 +0100
From: Gilles Achache
Message-ID:
X-url: http://www.scan-research.net

SCAN-research

Internship offer for a computational linguist (Master 2 level)

Internship in an innovative company that studies and analyzes the conversations of Internet users

1. About the company and its sector

Scan-research is a young start-up founded in August 2010, specializing in observing and analyzing the opinions of Internet users. We are currently hosted by Agoranov, an incubator for innovative companies (created by UPMC, Paris IX-Dauphine, ENS and ParisTech); we have received support from Oséo and the City of Paris.

- Our clients are large companies in the luxury sector (LVMH...) and the energy sector (EDF, ERDF...), as well as government bodies.
- We have entered into a partnership with major newspapers to track how the conversation among Internet users evolves during the presidential elections.

We are looking for motivated, curious and independent students to help us carry out our studies and our monitoring and analysis assignments on the conversations of Internet users, using our proprietary platform.

The tasks entrusted to you will include, among others:

- Building the ontologies used to categorize the conversations observed
- Writing the linguistic descriptors for these ontologies, following the methodologies we have developed
- Interacting with the developers of our platform and with the designers of the natural language processing engine we work with

2. Profile

You are a computational linguist with:

- a good knowledge of the issues in natural language processing
- knowledge of the problems involved in building ontologies applied to social reality
- good general knowledge, particularly in sociology, which would be a plus; an interest in questions of opinion formation and political semantics will be appreciated
- fluent English, which is required

We are looking for candidates who are at once rigorous, imaginative and interested in helping to develop innovative methodologies for applying linguistic techniques to the social sciences and to marketing.

3. How to apply

Send your application by e-mail to gilles.achache at scan-research.net

4. Internship details

Salary: negotiable
Location: Paris (boulevard Raspail)
Duration: 6 months / 12 months (the internship may be turned into a first job if it goes well)
Start date: 15 January 2012

-------------------------------------------------------------------------
From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:25:09 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:25:09 +0100
Subject: Appel: EACL 2012 Workshop on Computational Models of Language Acquisition and Loss
Message-ID:

Date: Fri, 16 Dec 2011 19:14:32 +0100
From: Thierry Poibeau
Message-Id:
X-url: http://sites.google.com/site/eaclcogws/

==================================================
First Call for Papers
==================================================

EACL 2012 Workshop on Computational Models of Language Acquisition and Loss
http://sites.google.com/site/eaclcogws/

Deadline for Submissions: January 20, 2012

------------------------------------------------------------------------

The past decades have seen a massive expansion in the application of statistical and machine learning methods to speech and natural language processing. This work has yielded impressive results which have generally been viewed as engineering achievements. Recently researchers have begun to investigate the relevance of computational learning methods for research on human language acquisition and loss.
The human ability to acquire and process language has long attracted interest and generated much debate due to the apparent ease with which such a complex and dynamic system is learnt and used on the face of ambiguity, noise and uncertainty. On the other hand, changes in language abilities during aging and eventual losses related to conditions such as Alzheimer's disease and dementia have also attracted considerable investigative efforts. Parallels between the acquisition and loss have been raised, and a better understanding of the mechanisms involved in both, and of how the algorithms used to access concepts are affected in pathological cases can lead to earlier diagnosis and more targeted treatments. The use of computational modeling is a relatively recent trend boosted by advances in machine learning techniques, and the availability of resources like corpora of child and child-directed sentences, and data from psycholinguistic tasks by normal and pathological groups. Many of the existing computational models attempt to study language tasks under cognitively plausible criteria (such as memory and processing limitations that humans face), and to explain the developmental stages observed in the acquisition and evolution of the language abilities. The workshop is targeted at anyone interested in the relevance of computational techniques for understanding first, second and bilingual language acquisition and change or loss in normal and pathological conditions. Long and short papers are invited on, but not limited to, the following topics: * Computational learning theory and analysis of language learning * Computational models of first, second and bilingual language acquisition * Computational models of language changes in e.g. 
  dementia and Alzheimer's Disease
* Computational models and analysis of factors that influence language acquisition and loss in different age groups and cultures
* Computational models of various aspects of language and their interaction in acquisition and change
* Computational models of the evolution of language
* Data resources and tools for investigating computational models of human language processes
* Empirical and theoretical comparisons of the environment and its impact on acquisition/loss
* Cognitively oriented Bayesian models of language processes
* Computational methods for acquiring various linguistic information (related to e.g. speech, lexicon, syntax, and semantics) and their relevance to research on human language acquisition
* Investigations and comparisons of supervised, unsupervised and weakly-supervised methods for learning (e.g. machine learning, statistical, symbolic, biologically-inspired, active learning, various hybrid models)

SUBMISSIONS

We invite three different submission modalities:

* Regular long papers (8 content pages + 1 page for references): Long papers should report on solid and finished research including new experimental results, resources and/or techniques.
* Regular short papers (4 content pages + 1 page for references): Short papers should report on small experiments, focused contributions, ongoing research, negative results and/or philosophical discussion.
* System demonstration (2 pages): System demonstration papers should describe and document the demonstrated system or resources. We encourage the demonstration of both early research prototypes and mature systems, which will be presented in a separate demo session.

All submissions must be in PDF format and must follow the EACL 2012 formatting requirements (available at http://eacl2012.org/information-for-authors/index.html). We strongly advise the use of the provided Word or LaTeX template files.
For long and short papers, the reported research should be substantially original. The papers will be presented orally or as posters. The decision as to which papers will be presented orally and which as posters will be made by the program committee based on the nature rather than on the quality of the work.

Reviewing will be double-blind, and thus no author information should be included in the papers; self-reference should be avoided as well. Papers that do not conform to these requirements will be rejected without review. Accepted papers will appear in the workshop proceedings, where no distinction will be made between papers presented orally or as posters.

Submission and reviewing will be electronic, managed by the START system:
https://www.softconf.com/eacl2012/Cognitive2012/

Submissions must be uploaded onto the START system by the submission deadline: January 20, 2012 (11:59pm Samoa Time; UTC/GMT -11 hours)

Please choose the appropriate submission type from the starting submission page, according to the category of your paper.

IMPORTANT DATES

Jan 20, 2012 Paper submission deadline
Feb 20, 2012 Notification of acceptance
Mar 09, 2012 Camera-ready deadline
Apr 23 or 24, 2012 Workshop

PROGRAM COMMITTEE

Afra Alishahi, Tilburg University (Netherlands)
Colin J Bannard, University of Texas at Austin (USA)
Marco Baroni, University of Trento (Italy)
Jim Blevins, University of Cambridge (UK)
Rens Bod, University of Amsterdam (Netherlands)
Antal van den Bosch, Tilburg University (Netherlands)
Alexander Clark, Royal Holloway, University of London (UK)
Robin Clark, University of Pennsylvania (USA)
Matthew W.
Crocker, Saarland University (Germany)
James Cussens, University of York (UK)
Walter Daelemans, University of Antwerp (Belgium) and Tilburg University (Netherlands)
Barry Devereux, University of Cambridge (UK)
Sonja Eisenbeiss, University of Essex (UK)
Afsaneh Fazly, University of Toronto (Canada)
Cynthia Fisher, University of Illinois (USA)
Jeroen Geertzen, University of Cambridge (UK)
Henriette Hendriks, University of Cambridge (UK)
Marco Idiart, Federal University of Rio Grande do Sul (Brazil)
Aravind Joshi, University of Pennsylvania (USA)
Shalom Lappin, King's College London (UK)
Alessandro Lenci, University of Pisa (Italy)
Igor Malioutov, Massachusetts Institute of Technology (USA)
Marie-Catherine de Marneffe, Stanford University (USA)
Fanny Meunier, Lumière Lyon 2 University (France)
Brian Murphy, Carnegie Mellon University (USA)
Maria Alice Parente, Federal University of Rio Grande do Sul (Brazil)
Massimo Poesio, University of Essex (UK)
Brechtje Post, University of Cambridge (UK)
Ari Rappoport, The Hebrew University of Jerusalem (Israel)
Dan Roth, University of Illinois at Urbana-Champaign (USA)
Kenji Sagae, University of Southern California (USA)
Sabine Schulte im Walde, University of Stuttgart (Germany)
Ekaterina Shutova, University of Cambridge (UK)
Maity Siqueira, Federal University of Rio Grande do Sul (Brazil)
Mark Steedman, University of Edinburgh (UK)
Shuly Wintner, University of Haifa (Israel)
Charles Yang, University of Pennsylvania (USA)
Beracah Yankama, Massachusetts Institute of Technology (USA)
Menno van Zaanen, Macquarie University (Australia)
Michael Zock, LIF, CNRS, Marseille (France)

WORKSHOP ORGANIZERS AND CONTACT

Robert Berwick, Massachusetts Institute of Technology (USA)
Anna Korhonen, University of Cambridge (UK)
Thierry Poibeau, LaTTiCe-CNRS (France) and University of Cambridge (UK)
Aline Villavicencio, Federal University of Rio Grande do Sul (Brazil) and Massachusetts Institute of Technology (USA)

For any inquiries regarding the
workshop, please send an email to cognitivemodels2012 at gmail.com

From thierry.hamon at UNIV-PARIS13.FR Sun Dec 18 14:27:44 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sun, 18 Dec 2011 15:27:44 +0100
Subject: Appel: MUE 2012
Message-ID:

Date: Sat, 17 Dec 2011 19:25:04 +0100
From: MUE2012
Message-ID: <20111217182504.GA16915 at piojito.arcos.inf.uc3m.es>
X-url: http://www.arcos.inf.uc3m.es/mue2012

Dear Sir or Madam,

(We apologize if you receive multiple copies of this message)

==================================
The 6th FTRA International Conference on Multimedia and Ubiquitous Engineering (MUE 2012)
http://www.arcos.inf.uc3m.es/mue2012
Madrid, Spain, 10-13 July 2012

** Important Dates **
==================================
Papers due: January 15, 2012
Notification: March 15, 2012
Camera ready: April 15, 2012
Conference dates: July 10-13, 2012

Context
=================================
The new multimedia standards (for example, MPEG-21) facilitate the seamless integration of multiple modalities into interoperable multimedia frameworks, transforming the way people work and interact with multimedia data. These key technologies and multimedia solutions interact and collaborate with each other in increasingly effective ways, contributing to the multimedia revolution and having a significant impact across a wide spectrum of consumer, business, healthcare, education, and governmental domains.
Moreover, the emerging mobile computing and ubiquitous networking technologies enable users to access fully broadband mobile applications and new services anytime and everywhere. Continuous efforts have been dedicated to research and development in this wide area, including wireless mobile networks, ad-hoc and sensor networks, smart user devices and advanced sensor devices, mobile and ubiquitous computing platforms, and new applications and services including location-based, context-aware, or social networking services.

This conference provides an opportunity for academic and industry professionals to discuss recent progress in the area of multimedia and ubiquitous environments, including models and systems, new directions, and novel applications associated with the utilization and acceptance of ubiquitous computing devices and systems.

MUE 2012 is the next event in the highly successful series of International Conferences on Multimedia and Ubiquitous Engineering: MUE-11 (Crete, Greece, June 2011), MUE-10 (Cebu, Philippines, August 2010), MUE-09 (Qingdao, China, June 2009), MUE-08 (Busan, Korea, April 2008), and MUE-07 (Seoul, Korea, April 2007).

Topics of interest
=================================
* Ubiquitous Computing and Technology
* Context-Aware Ubiquitous Computing
* Parallel/Distributed/Grid Computing
* Novel Machine Architectures
* Semantic Web and Knowledge Grid
* Smart Home and Generic Interfaces
* AI and Soft Computing in Multimedia
* Computer Graphics and Simulation
* Multimedia Information Retrieval (images, videos, hypertexts, etc.)
* Internet Multimedia Mining
* Medical Image and Signal Processing
* Multimedia Indexing and Compression
* Virtual Reality and Game Technology
* Current Challenges in Multimedia
* Protocols for Ubiquitous Services
* Ubiquitous Database Methodologies
* Ubiquitous Application Interfaces
* IPv6 Foundations and Applications
* Smart Home Network Middleware
* Ubiquitous Sensor Networks / RFID
* U-Commerce and Other Applications
* Databases and Data Mining
* Multimedia RDBMS Platforms
* Multimedia in Telemedicine
* Multimedia Embedded Systems
* Multimedia Network Transmission/Streaming
* Entertainment Industry
* E-Commerce and E-Learning
* Novel Multimedia Applications
* Computer Graphics
* Multimedia network transmission/streaming
* Security in Commerce and Industry
* Security in Ubiquitous Databases
* Key Management and Authentication
* Privacy in Ubiquitous Environment
* Sensor Networks and RFID Security
* Multimedia Information Security
* Forensics and Image Watermarking
* Cyber Security
* Intrusion detection
* Biometric Security
* New developments in handheld and mobile information appliances
* New paradigms: mobile cloud, personal networks, social and crowd computing, etc.
* Operating systems aspects for personal mobile devices
* New technological advances for personal mobile devices
* End-user interface issues in the design and use of personal technologies
* Enabling technologies for personal multimedia and ubiquitous computing
* Multimedia applications and techniques for personal computing devices
* Usage of personal devices for on-line learning

Submissions
=================================
Submissions should not exceed 8 pages in IEEE CS proceedings paper format, including tables and figures. All paper submissions must represent original and unpublished work. Submission of a paper should be regarded as an undertaking that, should the paper be accepted, at least one of the authors will register for the conference and present the work.
Submissions will be made electronically on the conference website.

Journal publications
==================================
Distinguished selected papers accepted and presented at MUE 2012 will, after further revisions, be published in issues of several indexed international journals.

Best paper awards
=================================
The Program Committee of MUE 2012 will give two best paper awards: one NVIDIA Best Student Paper Award and one NVIDIA Best Paper Award. Best Paper Awards will be given to the author(s) of a full paper presented at the conference, selected by the Organizing Committee. The Best Paper Award is a Tesla C2075 computing processor, sponsored by NVIDIA.

Steering Chair
=================================
James J. (Jong Hyuk) Park (SeoulTech, Korea)

General Chairs
=================================
Jesus Carretero (University Carlos III of Madrid, Spain)
Sangsoo Yeo (Mokwon University, Korea)

General Vice-Chair
=================================
Seungmin Rho (Korea University, Korea)

Program Chairs
=================================
Jianping Fan (University of North Carolina, USA)
J.
Daniel Garcia (University Carlos III of Madrid, Spain)

Program Vice-Chairs
=================================
Claudia Cannali (University of Modena e Reggio Emilia, Italy)
Benoit Huet (Eurecom, France)
Jiankun Hu (University of New South Wales at the Australian Defence Force Academy, Australia)
Eleni Mangina (University College Dublin, Ireland)
Feng Wu (Microsoft Research Asia, China)
Zhiwen Yu (Northwestern Polytechnical University, China)

Workshop Chairs
=================================
Juan Manuel Dodero (University of Cadiz, Spain)
Abelardo Pardo (University Carlos III of Madrid, Spain)

International Program Committee
=================================
**Please visit the conference web site (http://www.arcos.inf.uc3m.es/mue2012)

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:15:44 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:15:44 +0100
Subject: Job: cherche charge-e-s de cours, Universite Paris III (urgent)
Message-ID:

Date: Mon, 19 Dec 2011 09:21:10 +0100
From: Alice Krieg-Planque
Message-ID: <4EEEF3F6.7050602 at club-internet.fr>

The Department of French and Latin Literature and Linguistics at Université Paris III is looking for a part-time lecturer (chargé-e de cours) in linguistics for the second semester (which begins on 30 January 2012). The course is an introduction to linguistic analysis for first-year students in Lettres Modernes.
The course covers syntax and lexical semantics. The timetable for the tutorial (TD) in question is already set: Tuesday, 10:00-13:00.

Contact: yana.grinshpun at numericable.fr and gillesphilippe at online.fr

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:19:10 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:19:10 +0100
Subject: Appel: TOTh 2012
Message-ID:

Date: Mon, 19 Dec 2011 11:23:15 +0100
From: Luc Damas
Message-ID:
X-url: http://www.porphyre.org/toth/

TOTh 2012: CALL FOR PAPERS
Terminology & Ontology: Theories and Applications

Conference: 7 and 8 June 2012
Training: 5 and 6 June 2012

http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2012/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2012-appel-a-communications.pdf

Important dates:
-------------------------
30 January 2012: Submission deadline
5 March 2012: Notification of acceptance
9 April 2012: Final version

Submission via EasyChair:
-----------------------------------
Paper submissions (full papers) are made exclusively via EasyChair:
http://www.easychair.org/conferences/?conf=toth2012

Presentation:
------------------
By bringing terminology and ontology together, the TOTh conferences emphasize the mutual contributions of these two disciplines and new prospects for development, both theoretical and practical.
Terminology deals with the designations of concepts and objects in a specific domain, but also with units such as collocations, lexical combinations or phraseologies in use, which are part of specialized vocabularies, discourses and languages. From a methodological point of view, terminologists develop practices aimed at designing and creating terminological resources useful for any language-related purpose: translation, language planning, lexicography, teaching languages for specific purposes, etc. Since the quality and rigor of specialized discourse depend directly on the relationship between lexical organization and knowledge organization, joint reflection among terminologists, translators, lexicographers, knowledge engineers and computer scientists is needed.

Terminology and ontology share similar objectives in their understanding and description of reality. Ontologies from knowledge engineering are today one of the most promising avenues for modeling the conceptual system of terminologies. At the heart of many applications in the digital world, ontologies make it possible to operationalize terminologies and to use them for various purposes: translation support, information management and processing, management of knowledge and multilingual content, the Semantic Web, etc.

Transdisciplinary by nature, terminology and ontology draw on practices and theories from many disciplines, chiefly linguistics, languages for special purposes, epistemology, knowledge engineering and information science.

Audiences & Disciplines:
----------------------------
The TOTh conferences aim to bring together researchers, teachers, trainers, users and industry practitioners whose concerns relate
both to terminology and to ontology, and more generally to language and knowledge engineering. They are intended as a place for exchange and sharing, where problems, solutions and lessons learned are presented at both the theoretical and applied levels, along with new trends and prospects in the associated disciplines: terminology, languages for special purposes, linguistics, artificial intelligence, knowledge engineering, information systems, collaborative engineering, information science, documentation, etc.

"Young Researcher" Prize:
--------------------------------
Since 2011, a "Young Researcher" prize has been awarded by the program committee during the conference.

Conference topics (non-exhaustive list):
--------------------------------------------------------------
Linguistic approaches:
- languages for special purposes, translation, language and terminology planning, multilingual specialized communication,
- usage, variation, terminological collocations, lexical combinations,
- terminological products, lexicons, dictionaries, thesauri,
- corpus processing: term extraction, knowledge extraction, corpus building and alignment,
- building and maintaining terminological resources, harmonization of terms, harmonization of concepts, standardization, theories of meaning, the importance and limits of corpora,
- validation of data quality, the role of experts.
Knowledge modeling:
- terminological principles for building notional and conceptual systems,
- knowledge representation and management: acquisition, concept modeling,
- ontologies: construction, evaluation (quality), maintenance, alignment, merging,
- monolingual and multilingual thesauri, document management,
- the contribution of artificial intelligence, linguistics, epistemology, and formal systems (description logics, for example).

Applications:
- software environments: building and maintaining terminologies, thesauri, dictionaries, ontologies,
- computer applications based on knowledge and terminological resources: translation support, management of monolingual and multilingual content, the Semantic Web, information retrieval, classification, knowledge management, information systems, etc.
- evaluation of acquisition and manipulation tools, the evolution of norms and standards in the field of linguistic and ontological resources, terminometrics.
http://www.porphyre.org/toth/
http://www.porphyre.org/toth/toth-2012/appel-a-communications
http://www.porphyre.org/toth/files/TOTh-2012-appel-a-communications.pdf

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:20:41 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:20:41 +0100
Subject: Job: Offres de stage, LIMSI-CNRS, Orsay
Message-ID:

Date: Mon, 19 Dec 2011 11:44:07 +0100
From: Xavier Tannier
Message-ID: <4EEF1577.1070708 at limsi.fr>
X-url: http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M2_detection_de_reactions.html
X-url: http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M1_phrase_saillante.html
X-url: http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_L_interface_graphique_pour_chronolines.html

Hello,

As part of the Chronolines project, the ILES team at LIMSI is offering the following three internships:

* *Master 2*: Detection of reactions and promises in news articles
  http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M2_detection_de_reactions.html
* *Master 1*: Automatic selection of a passage representative of an event
  http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_M1_phrase_saillante.html
* *DUT, first-cycle engineering, L3 computer science*: Development of a graphical interface for a tool for semi-automatic composition of chronologies
  http://www.limsi.fr/%7Extannier/fr/Stages/sujet_2012_L_interface_graphique_pour_chronolines.html

Regards,
Xavier Tannier.
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:24:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:24:49 +0100
Subject: Job: Stage conventionne en Text Mining, EDF, La Defense
Message-ID:

Date: Mon, 19 Dec 2011 12:37:13 +0100
From: Anne-Laure GUENET
Message-ID:

Commerce - Information Systems Division
Customer Knowledge Analysis Domain
Customer Analysis Department

Operational internship offer in Text Mining

Since 1 July 2007, the electricity market has been fully open to competition, allowing consumers to choose their energy supplier freely. In this context, it has become strategic for EDF to understand its customers' needs, and also to explain and predict their behavior.

The Customer Knowledge Analysis Domain, within the Information Systems Department of the Commerce Branch, is responsible in part for analyzing EDF's various information systems, in particular the textual data held in its databases.

We currently use Text Mining techniques to automatically analyze comments, in particular customer complaints, coming from our various information systems.

The internship we are offering is operational and will cover several areas:

The creation of knowledge extraction cartridges (Skill Cartridge technology developed by TEMIS) that make it possible
both to improve and to identify business concepts, and also to help improve our classification models.

A study comparing the results obtained by Text Mining on the analysis of "complaint" comments with the information entered manually by the customer adviser.

The setting up of a lighter industrialization process, making it possible to analyze not all of the comment fields but a representative sample of our customers.

The internship will last 6 months and will take place in the EDF tower at La Défense during 2012. The intern will be paid.

The profile sought is a student at BAC+5 level specializing in natural language processing. Good knowledge of linguistics and computing is essential. Knowledge of statistics or of the tools developed by TEMIS would be a welcome plus.

Applications (CV + cover letter) should be sent to anne-laure.guenet at edf.fr.

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:30:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:30:49 +0100
Subject: Appel: Colloque international de linguistique francaise, Le Caire
Message-ID:

Date: Mon, 19 Dec 2011 13:12:59 +0000 (GMT)
From: EL KHAMISSY RIHAM
Message-ID: <1324300379.1649.YahooMailClassic at web29506.mail.ird.yahoo.com>

DEADLINE EXTENDED TO 23 DECEMBER 2011

Please circulate this call

Linguistics conference
of the Faculty of Languages (AL ALSUN), Ain Shams University (Cairo, Egypt)

Femme, je n'écris pas ton nom. Elle au carrefour des sciences du langage
(Woman, I do not write your name. "Elle" at the crossroads of the language sciences)

The French Department of the Faculty of Languages (AL Alsun) is organizing an international conference on French linguistics, to be held at Ain Shams University (Cairo, Egypt) on 29-30 April 2012.

Deadline for receipt of proposal abstracts (200 to 400 words) + keywords: 19 December 2011

Elle as personal pronoun, elle as non-person, elle as the feminine of il, Elle as symbol of woman. Under the heading of gender, Elle permeates the body of language with its presence. The treatment of the category of sex in language affects the third person in grammar while forcing each speaker, if the speaker is a woman, to proclaim her physical (sociological) sex, that is, to appear in language represented in a concrete form by the various feminine markers, and not in the abstract form that generalization requires, the one that any male speaker has the unquestionable right to use. The presence of elle, by manifesting gender in its subjective form, comes to realize it through an excess of feminine presence (and of feminization) covering a large territory of the linguistic domain. Elle thus invites an examination of the problems linked to the French grammatical tradition, a syntactic analysis, and a study of the role of these constructions in the dynamics of the text and in the structuring of the referential field and the communicational field.

Moreover, Elle is not limited to expressing the third person in the feminine. In linguistics, Elle is not only the feminine, femininity, and other essentializing fetishes. To call a woman "elle" and refrain from mentioning her first name, when one knows it, is not without meaning.

In addition, Elle can carry the heritage of philosophical discourse. Taking up in turn the "nominalist postulate" described by Michel de Certeau with
regard to Montaigne, elle, in the apparatus of writing, signifies "the thing" (as opposed to the name), the one "who is the stranger, [the one who] is never where the word says she is."

Finally, N. Sarraute writes: "against elle, nothing can be done" (Elle est là), signalling the stubbornness and the strength, so often underestimated, of ELLE.

Whatever the linguistic approach, elle has a wide variety of uses and remains a rich territory for exploration. Contributions should relate to the following areas of linguistics:

- Grammar
- Text grammar
- Enunciation
- Pragmatics
- Argumentation
- Syntax
- Semantics
- Stylistics
- Contrastive and comparative approaches (authorized languages: French-Arabic, French-English).

NB. Any approach that is interdisciplinary or combines more than one discipline is welcome, but this must be stated when you send your abstract.

Conference language: French.
Language of contributions: French (NB. for contrastive approaches, examples may be written in English or Arabic).

Organizing committee:
Dr. Racha EL KHAMISSY
Dr. Riham EL KHAMISSY
Dr. Yomna SAFWAT

Scientific committee:
Prof. Dr. Mona ABDEL AZIZ
Prof. Dr. Lorine Zikri
Prof. Dr. Elweya AL HAKIM
Prof. Dr. Camélia SOBHI
Prof. Dr. Dalia EL TOUKHY

Schedule:
- Deadline for receipt of abstracts (200 to 400 words) + 4 keywords: 19 December 2011. Extended to 23 December 2011
- Deadline for notification of acceptance or rejection of paper proposals: 1 January 2012
- Deadline for receipt of the full text of the paper: 19 March 2012
- Guidelines for the final version: early April 2012
- Conference dates: 29-30 April 2012

How to apply:
Paper proposals must be anonymous, which means that at no point should reviewers be able to identify the author or authors. Abstracts in Times New Roman 12 (200 to
400 words) + 5 keywords + indicative bibliography.

Proposals must be sent to the following email address: conf_alsun2012 at yahoo.com

In the body of the email, please specify:

1. Your personal information
   Surname and first name
   Nationality
   Status (researcher, teacher, teacher-researcher)
   Position and affiliation
   Title of your contribution
   Short bio-bibliographical note on the author (5 lines max.)
   NB. In the case of co-authorship: for convenience, only one author will be designated as "corresponding author". The other(s) will only be "presenter(s)". The "corresponding" author is the one who will receive the various emails relating to the paper proposal (evaluation report). Only that author's email address is therefore required; that of any co-author is optional.
2. The topic area under which you wish to submit. The anonymous abstract and the title of your contribution are to be attached to the email in Word (.doc) format.
3. The type of your contribution:
   - Presentation + publication
   - Publication only

For more information, see the conference page on the website of the Faculty of Languages (AL ALSUN).

After acceptance

Registration fees by type of participation (Present + Publish / Publish only / Attend only):
Egyptian teacher-researchers: 550 LE / 350 LE / Free
Foreign teacher-researchers: 330 euros, Free
Egyptian students: Free

The full article is 15 pages maximum. At the conference, however, you will present only a paper of 2500 characters (not counting spaces or references). A fuller text will appear in the conference proceedings.

Formatting of your full article

1. Your article must be sent in .doc format
2. 3 cm margin on each side
3. Main title of your contribution in Times Roman 16
4. Subheadings in Times Roman 14
5. Body text: Times Roman 12
6.
Espace entre paragraphe (Auto) 7. Interligne simple 8. Les paragraphes sont justifi?s 9. Les titres et les sous-titres sont centr?s 10. Les citations courtes sont ins?r?es entre guillemets dans le corps de texte : les citations longues ?plus de trois lignes) peuvent ?tre distingu?es par un retour ? la ligne et un retrait d'1 cm. Elles sont sans guillemets et r?dig?es en New Times roman 11. 11. Les r?f?rences Les r?f?rences figurent dans le corps du texte (syst?me (auteur : date)) Pour la bibliographie, Times ou Times New Roman, taille de police 10, justifi?. La premi?re ligne de chaque r?f?rence (donc la premi?re ligne de chaque paragraphe) doit ?tre affect?e d'un retrait n?gatif de 0,5 cm . Vous trouverez ci-dessous le format des r?f?rences pour les articles et les ouvrages. Nom de l'auteur, Pr?nom. (Ann?e de publication). Titre de l'article. Nom de la revue, Num?ro de la revue, num?ros des pages. Nom de l'auteur, Pr?nom. (Ann?e de publication). Titre de l'ouvrage. Ville : ?diteur. Les renvois aux sites doivent pr?ciser la date consultation et ?ventuellement la date de publication. Les notes infrapaginales en New Times roman 10 ne sont utilis?es que pour des remarques ou des explicitations. Elles doivent ?tre limit?es autant que possible. Vous ?tes pri?s de suivre les normes de pr?sentations ci-dessus. 
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:33:38 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:33:38 +0100
Subject: Info: Elections au CoNRS, listes electorales
Message-ID:

Date: Mon, 19 Dec 2011 14:19:25 +0100
From: Anne Vilnat
Message-ID: <4EEF39DD.20609 at limsi.fr>
X-url: http://www.dgdr.cnrs.fr/elections/scn/actualites/default.htm
X-url: http://www.dgdr.cnrs.fr/elections/scn/textes/Liste%20cnSL%5B1%5D.pdf
X-url: http://www.dgdr.cnrs.fr/elections/scn/dispositif/electorat.htm
X-url: http://liste-electorale.dsi.cnrs.fr/inscriptions/index.php?election=section

Dear colleagues,

In 2012 the elections to the Comité national de la recherche scientifique (CoNRS) will take place for the 2012-2016 term. They elect the future members of the committees of the 41 (new) sections of the Comité national. With these elections in view, this message explains how to check your registration or to register on the electoral rolls.

*Reminder*: The CoNRS sections perform several key functions within the CNRS. In particular, they take part in:
- the recruitment of CRs and DRs
- the evaluation of researchers
- changes of section, grade, etc.
They are consulted on the creation, renewal and closure of research units, and on any question within their field (for more details, see: http://www.dgdr.cnrs.fr/elections/scn/actualites/default.htm)

It is essential that researchers in NLP (TAL) be represented on the committee they fall under: the "new" section 7 for "l'informatique linguistique" and section 34 for "la linguistique informatique" (for more information, see: http://www.dgdr.cnrs.fr/elections/scn/textes/Liste%20cnSL%5B1%5D.pdf). This is why ATALA wishes to stress the importance of these elections and to encourage all potential voters to take part.

For information, CNRS staff meeting one of the following criteria are eligible to vote:
- civil-servant staff as of 31 January 2012, or staff governed by decrees no. 80-31 and no. 59-1405
- contract staff under public or private law as of 31 January 2012, under contract for at least one year on the date of the ballot (post-docs among others).

For civil-servant staff, registration in a college and a section is automatic. You can check your status at: http://www.dgdr.cnrs.fr/elections/scn/dispositif/electorat.htm

For contract staff: you must register in a college and a section before 16 January 2012 at: http://liste-electorale.dsi.cnrs.fr/inscriptions/index.php?election=section

To register, please note that:
- you will need to attach, in the section provided, a document including your CV and a list of your latest publications, to help place you in the appropriate section and college
- to be formally validated, your request must be returned to the CNRS by e-mail (elections at cnrs-dir.fr), by fax (0144965391) or by post to CNRS - Organisation des élections - 3 rue Michel Ange - 75016 Paris

In addition, you will need to indicate a college and the section requested. A description of the three colleges A, B and C is available on the site. As a researcher in NLP, the section you most likely fall under is section 7 or section 34 (http://www.dgdr.cnrs.fr/elections/scn/textes/Liste%20cnSL%5B1%5D.pdf).

Further information is available at: http://www.dgdr.cnrs.fr/elections/scn/actualites/default.htm

Anne Vilnat
Groupe ILES du LIMSI
Professeur IUT Orsay
BP 133 91403 Orsay Cedex
Tel : 01 69 85 80 02
courriel : Anne.Vilnat(au)limsi.fr
telecopie : 01 69 85 80 88
http://www.limsi.fr/Individu/anne

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:38:50 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:38:50 +0100
Subject: Appel: Multimodal Corpora: How should multimodal corpora, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 15:33:52 +0100
From: ELRA ELDA Information
Message-ID: <4EEF4B50.10901 at elda.org>
X-url: http://www.multimodal-corpora.org/

[Apologies for cross-postings]

*LREC 2012 Workshop
Multimodal Corpora: How should multimodal corpora deal with the situation?*

1st Call for Papers
22 May 2012, Istanbul, Turkey
http://www.multimodal-corpora.org/

Currently, the creation of a multimodal corpus involves the recording, annotation and analysis of a selection of many possible communication modalities such as speech, hand gesture, facial expression, and body posture.
Simultaneously, an increasing number of research areas are transgressing from focused single modality research to full-fledged multimodality research. Multimodal corpora are becoming a core research asset and they provide an opportunity for interdisciplinary exchange of ideas, concepts and data. The increasing interest in multimodal communication and multimodal corpora evidenced by European Networks of Excellence and integrated projects such as HUMAINE, SIMILAR, CHIL, AMI, CALLAS and SSPNet; the success of recent conferences and workshops dedicated to multimodal communication (ICMI-MLMI, IVA, Gesture, PIT, Nordic Symposium on Multimodal Communication, Embodied Language Processing); and the creation of the Journal of Multimodal User Interfaces also testifies to the growing interest in this area, and the general need for data on multimodal behaviours. In 2012, the 8th Workshop on Multimodal Corpora will again be collocated with LREC. This year, LREC has selected Speech and Multimodal Resources as its special topic. This points to the significance of the workshop's general scope, and the fact that the main conference special topic largely covers the broad scope of the workshop provides us with a unique opportunity to step outside the boundaries and look further into the future. The workshop follows similar events held at LREC 00, 02, 04, 06, 08, 10, and ICMI 11. All workshops are documented under www.multimodal-corpora.org and complemented by a special issue of the Journal of Language Resources and Evaluation which came out in 2008 and a state-of-the-art book published by Springer in 2009. *Aims* As always, we aim for a wide cross-section of the field, with contributions ranging from collection efforts, coding, validation and analysis methods, to tools and applications of multimodal corpora. 
This year, however, we also want to look ahead and emphasize the fact that a growing segment of research takes a view of spoken language as situated action, where linguistic and non-linguistic actions are intertwined with the dynamic conditions given by the situation and the place in which the actions occur. In spite of this, most corpora capture little more than the linguistic and meta-linguistic actions per se, and contain little or no information about the situation in which they take place. For this reason, we encourage contributions that raise the question of what the additions to future multimodal corpora will be -- with possibilities ranging from simple dynamic information such as background noise, room temperature, light conditions and room dimensions to more complex models of room contents, external events, scents, or cognitive load modelling including physiological data such as breathing or pulse. We hope that with your help, the workshop will serve to examine the way language is conceived in corpus creation and to spark a discussion of its boundaries and how these should be accounted for in annotations and in interpretation. *Time schedule* The workshop will consist of a morning session and an afternoon session. There will be time for collective discussions. *Topics* The LREC'2012 workshop on multimodal corpora will feature a special session on the collection, annotation and analysis of corpora of situated interaction. Other topics to be addressed include, but are not limited to: * Multimodal corpus collection activities (e.g. direction-giving dialogues, emotional behaviour, human-avatar interaction, human-robot interaction, etc.) and descriptions of existing multimodal resources * Relations between modalities in natural (human) interaction and in human-computer interaction * Multimodal interaction in specific scenarios, e.g. 
group interaction in meetings
* Coding schemes for the annotation of multimodal corpora
* Evaluation and validation of multimodal annotations
* Methods, tools, and best practices for the acquisition, creation, management, access, distribution, and use of multimedia and multimodal corpora
* Interoperability between multimodal annotation tools (exchange formats, conversion tools, standardization)
* Collaborative coding
* Metadata descriptions of multimodal corpora
* Automatic annotation, based e.g. on motion capture or image processing, and the integration with manual annotations
* Corpus-based design of multimodal and multimedia systems, in particular systems that involve human-like modalities either in input (Virtual Reality, motion capture, etc.) or in output (virtual characters)
* Automated multimodal fusion and/or generation (e.g., coordinated speech, gaze, gesture, facial expressions)
* Machine learning applied to multimodal data
* Multimodal dialogue modelling

*Important dates*
* Deadline for paper submission (complete paper): 12 February 2012
* Notification of acceptance: 10 March
* Final version of accepted paper: 26 March
* Final program and proceedings: 20 April
* Workshop: 22 May

*Submissions*
The workshop will consist primarily of paper presentations and discussion/working sessions. Submissions should be 4 pages long, must be in English, and must follow the submission guidelines at http://www.lrec-conf.org/lrec2012/

Submissions should be made at: https://www.softconf.com/lrec2012/MMCorpora2012/

Demonstrations of multimodal corpora and related tools are encouraged as well (a demonstration outline of 2 pages can be submitted).

*LREC Map of Language Resources, Technologies and Evaluation*
When submitting a paper, from the START page authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.)
that either have been used for the work described in the paper or are a new result of their research (contribution to building the LREC2012 Map).

*Organizing committee*
Jens Edlund, KTH Royal Institute of Technology, Sweden
Dirk Heylen, University of Twente, The Netherlands
Patrizia Paggio, University of Copenhagen, Denmark/University of Malta, Malta

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:37:00 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:37:00 +0100
Subject: Appel: ColabTKR 2012 - Terminology and Knowledge Representation, LREC 2012 workshop
Message-ID:

Date: Mon, 19 Dec 2011 15:16:48 +0100
From: ELRA ELDA Information
Message-ID: <4EEF4750.9020004 at elda.org>

[Apologies for cross-postings]

*ColabTKR 2012 - Terminology and Knowledge Representation*

Linguistics and ontology studies have a long record of fruitful cooperation. Cross-research in areas such as computational linguistics, natural language processing, information retrieval and ontology development, maintenance and integration has produced a wealth of multidisciplinary theories, methods, models and tools (Roche, 2008) (Staab, 2008) (Pereira et al. 2009) (Costa, 2006). More specifically, the relationship between the lexicon (lexical approaches and resources) and ontology development methods and tools has recently been well explored in research (Huang et al, 2010).
By contrast, the relationship between terminology and ontology studies, in particular as concerns the initial phases of ontology development, has not received as much attention from the scientific communities involved.

Meanwhile, in diverse professional areas, new challenges are appearing related to information and knowledge management in highly specialised technical domains, under tightly constrained time requirements, unfolding in collaborative networking contexts. Short-term collaborative networking between individuals, groups and organisations is recognised by researchers and practitioners as a possible solution to cope with an increasingly complex social and economic business environment. Moreover, the current demand for continuous innovation leads to a higher heterogeneity in the technical and scientific domains simultaneously involved in collaborative projects and activities (e.g. involving SMEs and research centres) (Camarinha-Matos, 2006).

Managing information and knowledge in this context poses new and interesting challenges for terminology and knowledge representation, particularly when these challenges are seen from an integrated terminology/knowledge representation perspective. Terminological or ontological approaches alone are not likely to be enough to meet the needs for precision and detail of the specialised technical domains, just as the research efforts of articulated terminology/ontology approaches are likely to be inadequate in terms of the required resources (time and persons). Thus, these challenges call for more than the setup and configuration of common terminological or ontological resources, particularly when considering the usually accepted time-frames for developing semantic and terminological artifacts. Effective ways to collaboratively construct shared conceptualisations by means of negotiation and representational artifacts, such as semi-formal ontologies, are therefore required.
*Topics of interest*
This workshop intends to bring together, under a multi-disciplinary tent, specialists in terminology, information/knowledge management, ontology development, and collaboration processes, to debate the interplay between terminology and knowledge representation methods and techniques in contexts of collaborative work. Papers are invited on, but not limited to, the following topics:
- The interplay between terminology and conceptualization processes
- Interfaces between terminology work and ontology development/maintenance
- Collaborative processes in terminology work
- Collaborative conceptualization processes and representations of knowledge
- Conceptualization processes and semi-formal ontologies
- Cognitive semantics and semi-formal ontologies
- Knowledge organisation systems and collaboration
- Modelling networks of actors and semantic networks (socio-semantic networks)
- Theory, methods and tools for conceptual negotiation
- Using multimodal corpora for semi-formal ontology development
- Design and management of semi-formal ontology libraries
- Using terminological resources for semi-formal ontology development
- Term extraction and validation in domain-dependent, time-constrained applications
- Terminological approaches to support the identification of conceptual relations
- The role of conceptual relations in the development of semi-formal ontologies

*Type of submission*
Extended abstract (1500-2000 words)
Extended abstracts must be submitted at https://www.softconf.com/lrec2012/TermKnowledge2012/

*LRE map*
When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012.
*Important dates*
- Submission deadline: 1 February 2012
- Acceptance notification: 1 March 2012
- Camera-ready full paper: 30 March 2012
- Workshop date: 22 May 2012, afternoon session

*Organizing Committee*
António Lucas Soares (als at fe.up.pt) - University of Porto and INESC Porto, Portugal
Rute Costa (rute.costa at fcsh.unl.pt) - New University of Lisbon, Portugal
Carla Pereira (cpereira at estgf.ipp.pt) - IPP/ESTGF and INESC Porto, Portugal
Alessandro Oltramari (oltramale at gmail.com) - Carnegie-Mellon University, USA
Christophe Roche (christophe.roche at univ-savoie.fr) - University of Savoie, France
Anita Nuopponen (atn at UWasa.fi) - University of Vaasa, Finland

*Programme Committee*
Gerhard Budin - University of Vienna
Chiara Ghidini - Bruno Kessler Foundation (FBK) - Trento, Italy
Guadalupe Aguado de Cea - Universidad Politécnica de Madrid
Hanne Erdman Thomsen - Copenhagen Business School
Mustafa Jarrar - University of Birzeit, Palestine
António Lucas Soares - University of Porto and INESC Porto, Portugal
Rute Costa - Universidade Nova de Lisboa, Portugal
Carla Sofia Pereira - Polytechnic Institute of Porto and INESC Porto, Portugal
Alessandro Oltramari - Carnegie-Mellon University, USA
Christophe Roche - University of Savoie, France
Anita Nuopponen - University of Vaasa, Finland
Piek Vossen - VU University Amsterdam, Netherlands

*References*
Chu-Ren Huang, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lenci, Alessandro Oltramari, and Laurent Prévot (eds.). 2010. Ontology and the Lexicon: A Natural Language Processing Perspective. Cambridge University Press.
Roche, C. 2008. Terminologie & Ontologie: Théories et Applications. Actes de la deuxième conférence TOTh -- Annecy, 5 et 6 juin.
Staab, S. 2008. On understanding the collaborative construction of conceptualisations. International and Interdisciplinary Conference "Processing Text-Technological Resources" at the Center for Interdisciplinary Research, Bielefeld University, 13-15 March.
Pereira, C.; Sousa, C.; Soares, A. 2009. A socio-semantic approach to collaborative domain conceptualization. On the Move to Meaningful Internet Systems: OTM 2009 Workshops, Lecture Notes in Computer Science, Springer Berlin / Heidelberg.
Costa, R. 2006. Terminology, Corpus Linguistics and Ontology, Contrastive Studies and Valency. Studies in Honor of Hans Ulrich Boas. Petra C. Steiner, Hans C. Boas, Stefan Scheirholz [eds.]. Berlin -- Bern: Peter Lang Verlag.
Camarinha-Matos, L. 2006. Collaborative networks in industry -- Trends and foundations. In Proc. of DET'06 - 3rd International CIRP Conference in Digital Enterprise Technology.
Nuopponen, A. 2011. Methods of concept analysis - tools for systematic concept analysis (part 3 of 3). In: LSP, professional communication, knowledge management and cognition. http://rauli.cbs.dk/index.php/lspcog/index

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:40:53 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:40:53 +0100
Subject: Appel: META-RESEARCH Workshop on Advanced Treebanking, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 15:55:57 +0100
From: ELRA ELDA Information
Message-ID: <4EEF507D.9000502 at elda.org>
X-url: http://www.meta-net.eu/meta-research/events/lrec2012-treebanking-workshop

[Apologies for multiple postings]

META-RESEARCH Workshop on Advanced Treebanking (LREC 2012) - Call for papers

*The Workshop*
Many EU and world-wide projects and research groups are creating, standardizing,
converting and/or using treebanks, thereby often tackling the same issues and reinventing methods and tools. The workshop has been conceived by four projects, namely T4ME, META-NORD, CESAR and META4U, which under the META-NET umbrella project strive to make many treebanks and other LRT available for R&D. However, the workshop is not limited to participants of META-NET, but it is widely open: papers of authors of any origin will be welcome to share their experiences among those working in this area. Also, papers on using treebanks in innovative ways and/or analyzing issues with current treebanks (both in content and format) will be welcome, to provide feedback to treebank developers. The emphasis is not on linguistic theory or grammar, but on treebanks as language resources.

*The Call*
As part of the LREC'2012 Language Resources Conference, to be held in Istanbul, Turkey, the META-RESEARCH Workshop on Advanced Treebanking (http://www.meta-net.eu/meta-research/events/lrec2012-treebanking-workshop) invites papers on all aspects of treebanking, especially but not limited to
- Treebank annotation (manual, semi-automatic, automatic) and related issues such as quality control, searching large collections of treebanks, parallel treebanks
- Treebank conversion between different linguistic representations (e.g. dependency vs. phrase-based) and between different formats
- Treebank standardization and format unification across languages and styles
- Combinations of treebanks and other annotations -- including but not limited to semantic role labeling, named entity annotation, discourse annotation, information structure
- Tools and online web services for the management of treebanks, visualization, search, processing, upload and download, etc.
- Innovative use of treebanks in all areas of natural language research and applications
- Reuse of treebank annotation for non-traditional purposes and in related disciplines (cognitive science, psycholinguistics, etc.)
- Improving the use of current treebanks in linguistic and computational R&D.

Oral presentations will be allotted around 20 minutes each. Demonstrations and poster presentations are encouraged (but optional) in addition to the oral presentations, and will run throughout the workshop in the meeting room to stimulate discussion. Submissions cannot be more than 8 pages long and will be reviewed using the START system in the usual double-blind fashion, with at least three reviews for each submission by the members of the Programme Committee. Authors will be asked to provide details about the resources used when preparing material for the paper (see http://www.lrec-conf.org/lrec2012/?LRE-Map-2012) and to contribute to the Language Library, the new initiative of LREC 2012.

*Organizers*
Jan Hajic, hajic at ufal.mff.cuni.cz, Charles Univ., Prague, Czech Rep.
Koenraad De Smedt, desmedt at uib.no, Univ. of Bergen, Norway
Marko Tadic, marko.tadic at ffzg.hr, Univ. of Zagreb, Croatia
Antonio Branco, Antonio.branco at di.fc.ul.pt, University of Lisbon, Portugal

*Deadlines*
Submission: Monday, February 20, 2012
Decision: Monday, March 19, 2012
Camera Ready in START: Wednesday, March 28, 2012
Workshop date: Tuesday, May 22, 2012

*Programme Committee (to be extended)*
Antonio Branco, Portugal
Silvie Cinkova, Czech Rep.
Dan Cristea, Romania
Koenraad De Smedt, Norway
Rebecca Dridan, Australia
Jan Hajic, Czech Rep.
Silvia Hansen-Schirra, Germany
Nancy Ide, USA
Daisuke Kawahara, Japan
Valia Kordoni, Germany
Sandra Kuebler, USA
Krister Linden, Finland
Adam Meyers, USA
Joakim Nivre, Sweden
Stephan Oepen, Norway
Marco Passarotti, Italy
Adam Przepiorkowski, Poland
Eirikur Rognvaldsson, Iceland
Victoria Rosen, Norway
Marko Tadic, Croatia
Barbora Vidova Hladka, Czech Rep.
Juergen Wedekind, Denmark
Fei Xia, USA
Daniel Zeman, Czech Rep.
From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:43:50 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:43:50 +0100
Subject: Appel: 1st CfP Challenges in the management of large corpora, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 16:00:05 +0100
From: ELRA ELDA Information
Message-ID: <4EEF5175.80803 at elda.org>
X-url: http://corpora.ids-mannheim.de/cmlc.html

[Apologies for multiple postings]

Call for Papers: LREC 2012 Workshop
Challenges in the management of large corpora

We live in an age where the well-known maxim that "the only thing better than data is more data" no longer sets unattainable goals. Creating extremely large corpora is no longer a challenge, given the proven methods that lie behind e.g. applying the Web-as-Corpus approach or utilizing Google's n-gram collection. Indeed, the challenge has now shifted towards dealing with the large amounts of primary data and much larger amounts of annotation data. On the one hand, this challenge concerns finding new (corpus-)linguistic methodologies that can make use of such extremely large corpora, e.g. in order to investigate rare phenomena involving multiple lexical items or to find and represent fine-grained sub-regularities; on the other hand, some fundamental technical methods and strategies are being called into question. These include e.g.
successful curation of the data, management of collections that span multiple volumes or that are distributed across several centres, methods to clean the data of non-linguistic intrusions or duplicates, as well as automatic annotation methods or innovative corpus architectures that maximise the usefulness of data or allow it to be searched and analyzed efficiently. Among the new tasks are also collaborative manual annotation and methods to manage it, as well as new challenges to the statistical analysis of such data and metadata.

The half-day workshop on "Challenges in the management of large corpora" aims at gathering the leading researchers in the field of Language Resource creation and Corpus Linguistics, in order to provide for an intensive exchange of expertise, results and ideas.

We invite submissions dealing with:
1. building tools for all aspects of management of very large corpora,
2. dealing with large data sets (file system architecture, database architecture),
3. dealing with heavily annotated corpora,
4. managing multiple and concurrent annotation layers,
5. use of annotation standards for large data sets,
6. issues of interoperability and tool-chaining,
7. crowd sourcing for large data sets,
8. quality control of annotations in large data sets,
9. analytic tools used in research infrastructure initiatives, such as, e.g., the Common Language Resource and Technology Infrastructure (CLARIN),
10. dealing with corpora physically distributed over different locations,
11. managing metadata for extremely large corpus collections,
12. efficient user interfaces,
13. effective querying of large corpora with multiple annotation layers,
14. "bringing the code to the data" as the strategy for dealing with IPR restrictions,
15. open-source software and open-data corpora strategies,
16. other issues that arise in the context of management of large datasets.
Current information is available at: http://corpora.ids-mannheim.de/cmlc.html

Abstract submission
We invite extended abstracts (1500 to 2000 words) for 20+10 minute presentations, as well as posters and demos. All abstracts have to be submitted via the START Conference Manager, available from https://www.softconf.com/lrec2012/LargeCorpora2012/.

Please note: when submitting a contribution through START, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the contribution or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012

Important dates
Workshop: 22 May 2012, afternoon session.
Deadline for submission of extended abstracts: February 15.
Notification of acceptance: February 29.
Submission of full, camera-ready papers: March 23.

Venue
The workshop will take place at the Conference venue, the Lütfi Kirdar Istanbul Exhibition and Congress Centre. Further details will be available in due time from the conference homepage.
Organizing Committee The workshop is co-organized by the following three institutions: Institut f?r Deutsche Sprache, Mannheim Piotr Ban'ski, Marc Kupietz, Andreas Witt Institute for Language Information and Technology, Eastern Michigan University Helen Aristar-Dry, Anthony Aristar, Damir C'avar ICAR Laboratory, Lyon University Serge Heiden Programme committee: N?ria Bel (Universitat Pompeu Fabra) Mark Davies (Brigham Young University) Stefanie Dipper (Ruhr-Universit?t Bochum) Tomaz( Erjavec (Joz(ef Stefan Institute) Stefan Evert (Technische Universit?t Darmstadt) Alexander Geyken (Berlin-Brandenburgische Akademie der Wissenschaften) Andrew Hardie (University of Lancaster) Nancy Ide (Vassar College) Sandra K?bler (Indiana University) Martin Mueller (Northwestern University) Mark Olsen (University of Chicago) Adam Przepi?rkowski (Polish Academy of Sciences, University of Warsaw) Reinhard Rapp (Johannes Gutenberg-Universit?t Mainz, University of Leeds) Laurent Romary (INRIA, Humboldt-Universit?t zu Berlin) Serge Sharoff (University of Leeds) Pavel Stran(?k (Charles University in Prague) Amir Zeldes (Humboldt-Universit?t zu Berlin) Workshop homepage: http://corpora.ids-mannheim.de/cmlc.html ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:45:51 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:45:51 +0100 Subject: Appel: 1st CfP Language Resource Merging, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:04:59 
+0100 From: ELRA ELDA Information Message-ID: <4EEF529B.2080304 at elda.org> X-url: http://www.meta-net.eu/meta-share X-url: http://www.panacea-lr.eu/ [Apologies for multiple postings] **1st Call for Papers** LREC 2012 Workshop on: Language Resource Merging 22 May 2012 -- Afternoon Session *CONTEXT* The availability of adequate language resources has been a well-known bottleneck for most high-level language technology applications, e.g. Machine Translation, parsing, and Information Extraction, for at least 15 years, and the impact of the bottleneck is becoming all the more apparent with the availability of higher computational power and massive storage, since modern language technologies are capable of using far more resources than the community produces. The present landscape is characterized by the existence of numerous scattered resources, many of which have differing levels of coverage, types of information and granularity. Taken individually, existing resources do not have sufficient coverage, quality or richness for robust large-scale applications, and yet they contain valuable information (Monachini et al. 2004 and 2006; Soria et al. 2006; Molinero, Sagot and Nicolas 2009; Necsulescu et al. 2011). Differing technology or application requirements, ignorance of the existence of certain resources, and difficulties in accessing and using them have led to the proliferation of multiple, unconnected resources that, if merged, could constitute a much richer repository of information augmenting either coverage or granularity, or both, and consequently multiplying the number of potential language technology applications. Merging, combining and/or compiling larger resources from existing ones thus appears to be a promising direction to take. The re-use and merging of existing resources is not altogether unknown. For example, WordNet (Fellbaum, 1998) has been successfully reused in a variety of applications. 
But this is the exception rather than the rule; in fact, merging, and enhancing existing resources is uncommon, probably because it is by no means a trivial task given the profound differences in formats, formalisms, metadata, and linguistic assumptions. The language resource landscape is on the brink of a large change, however. With the proliferation of accessible metadata catalogues, and resource repositories (such as the new META-SHARE (http://www.meta-net.eu/meta-share) infrastructure), a potentially large number of existing resources will be more easily located, accessed and downloaded. Also, with the advent of distributed platforms for the automatic production of language resources, such as PANACEA (http://www.panacea-lr.eu/), new language resources and linguistic information capable of being integrated into those resources will be produced more easily and at a lower cost. Thus, it is likely that researchers and application developers will seek out resources already available before developing new, costly ones, and will require methods for merging/combining various resources and adapting them to their specific needs. Up to the present day, most resource merging has been done manually, with only a small number of attempts reported in the literature towards (semi-)automatic merging of resources (Crouch & King 2005; Pustejovsky et al. 2005; Molinero, Sagot and Nicolas 2009; Necsulescu et al. 2011). In order to take a further step towards the scenario depicted above, in which resource merging and enhancing is a reliable and accessible first step for researchers and application developers, experience and best practices must be shared and discussed, as this will help the whole community avoid any waste of time and resources. 
*AIMS OF THE WORKSHOP* This half-day workshop is meant to be part of a series of meetings constituting an ongoing forum for sharing and evaluating the results of different methods and systems for the automatic production of language resources (the first one was the LREC 2010 Workshop on Methods for the Automatic Production of Language Resources and their Evaluation Methods). The main focus of this workshop is on (semi-)automatic means of merging language resources, such as lexicons, corpora and grammars. Merging makes it possible to re-use, adapt, and enhance existing resources, alongside new, automatically created ones, with the goal of reducing the manual intervention required in language resource production, and thus ultimately production costs. *WORKSHOP TOPICS* The topics of the workshop are related to best practices, methods, techniques and experimental results regarding the merging of various types of language resources, such as lexicons and corpora, especially in support of language technology applications. In particular, new methods for automatic merging with a view towards reducing human intervention will be most welcome. Topics for submission include, but are not limited to: - Experiments on (semi-)automatic merging of automatically produced resources - Experiments on the merging of two or more existing resources containing the same or different levels of linguistic information - Studies or experiments on merging resources at different levels of granularity (corpora, lexicons, grammars) - Studies or experiments on unifying, mapping or converting encoding formats - Comparison between different resources and mapping algorithms to provide desired merging - Use of linguistic information from different sources in high-level language applications - Use of new, merged language resources in language technology applications *SUBMISSIONS* Interested participants must submit a preliminary paper of about 4-6 pages including references (2000 to 2500 words). 
For the submission please use the online form on START LREC Conference Manager at: https://www.softconf.com/lrec2012/MergingLR2012/ When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 Papers will be peer-reviewed by the workshop Program Committee. *IMPORTANT DATES* - Deadline for paper submission: 15 February 2012 - Notification of acceptance: 15 March 2012 - Submission of camera-ready version of papers: 31 March 2012 - Workshop date: 22 May 2012 -- Afternoon Session *ORGANIZING COMMITTEE* Núria Bel, UPF, Barcelona, Spain Maria Gavrilidou, ILSP-"Athena", Athens, Greece Monica Monachini, CNR-ILC, Pisa, Italy Valeria Quochi, CNR-ILC, Pisa, Italy Laura Rimell, University of Cambridge, UK Contacts lrec12_workshop_merging at ilc.cnr.it *PROGRAMME COMMITTEE:* Victoria Arranz, ELDA, Paris, France Paul Buitelaar, National University of Ireland, Galway, Ireland Nicoletta Calzolari, CNR-ILC, Pisa, Italy Olivier Hamon, ELDA, Paris, France Aleš Horák, Masaryk University, Brno, Czech Republic Nancy Ide, Vassar College, Mass. 
USA Bernardo Magnini, FBK, Trento, Italy Paola Monachesi, Utrecht University, Utrecht, The Netherlands Jan Odijk, Utrecht University, Utrecht, The Netherlands Muntsa Padró, IULA, Barcelona, Spain Karel Pala, Masaryk University, Brno, Czech Republic Thierry Poibeau, University of Cambridge, UK and CNRS, Paris, France Benoît Sagot, INRIA, Paris, France Kiril Simov, Bulgarian Academy of Sciences, Sofia, Bulgaria Claudia Soria, CNR-ILC, Pisa, Italy Maurizio Tesconi, CNR-IIT, Pisa ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:48:24 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:48:24 +0100 Subject: Appel: 1st CfP Workshop on Language technology for normalisation of less-resourced languages, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:33:04 +0100 From: ELRA ELDA Information Message-ID: <4EEF5930.9080206 at elda.org> X-url: http://ixa2.si.ehu.es/saltmil/ X-url: http://AfLaT.org/ [Apologies for multiple postings] Call for Papers: Workshop on "Language technology for normalisation of less-resourced languages" 8th SALTMIL Workshop on Minority Languages and the 4th workshop on African Language Technology (AfLaT2012). A full-day workshop at LREC 2012 Tuesday, 22 May 2012. 
Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey SALTMIL: http://ixa2.si.ehu.es/saltmil/ AfLaT: http://AfLaT.org/ LREC 2012: http://www.lrec-conf.org/lrec2012/ Paper submission: https://www.softconf.com/lrec2012/Less-RessourcedLang2012/ Papers are invited for the above full-day workshop, in the format outlined below. Most submitted papers will be presented in poster form, though some authors may be invited to present in lecture format. Context and focus The 8th International Workshop of the ISCA Special Interest Group on Speech and Language Technology for Minority Languages (SALTMIL, http://ixa2.si.ehu.es/saltmil) and the 4th Workshop on African Language Technology (AfLaT2012) will be held as a joint effort in Istanbul, in May 2012, as part of the 2012 International Language Resources and Evaluation Conference (LREC 2012). Entitled "Language technology for normalisation of less-resourced languages", the workshop is intended to continue the series of SALTMIL/LREC workshops on computational language resources for minority languages, held in Granada (1998), Athens (2000), Las Palmas de Gran Canaria (2002), Lisbon (2004), Genoa (2006), Marrakech (2008) and Malta (2010), and the series of AfLaT workshops, held in Athens (EACL2009), Malta (LREC2010) and Addis Ababa (AGIS11). The Istanbul 2012 workshop aims to share information on tools and best practices, so that isolated researchers will not need to start from scratch. An important aspect will be the forming of personal contacts, which can minimize duplication of effort. There will be a balance between presentations of existing language resources, and more general presentations designed to give background information needed by all researchers. 
While less-resourced languages and minority languages often struggle to find their place in a digital world dominated by only a handful of commercially interesting languages, a growing number of researchers are working on alleviating this linguistic digital divide, through localisation efforts, the development of BLARKs (basic language resource kits) and practical applications of human language technologies. The joint SALTMIL/AfLaT workshop on "Language technology for normalisation of less-resourced languages" provides a unique opportunity to connect these researchers and set up a common forum to meet and share the latest developments in the field. *Topics* The workshop takes an inclusive approach to the word "normalisation", considering it to include both technologies that help make languages more "normal" in society and everyday life, as well as technologies that normalise languages, i.e. help create or maintain a written standard or support diversity in standards. We particularly focus on the challenges less-resourced and minority languages face in the digital world. 
Papers are invited that describe research and development on technologies for language normalisation, including (but not limited to) topics such as: * Keyboard layouts and entry methods * Standardisation in machine readable lexicons/dictionaries * Computer-aided language learning (CALL) * Dealing with language variants in NLP * Automatic identification of varieties, dialects * Corpus construction and annotation * Terminology development and management * MT between varieties of the same language * Spelling correction/normalisation * Machine translation (MT) * Morphological analysers * Part-of-speech taggers and parsers * Speech recognition and synthesis * Information extraction/retrieval * Localisation efforts * Mobile phones as a platform for HLT *Schedule* 09:00 Registration/Opening 09:30 Invited talk 1 11:00 Oral Papers (3x 25+5mins) 12:30 Lunch 14:00 Oral Papers (3x 25+5mins) 16:00 Poster Session 17:00 Panel Discussion 17:30 SALTMIL Assembly 18:30 Closing *Organisers (SALTMIL and AfLaT)* * Mikel L. Forcada (SALTMIL): Machine Translation Group, School of Computing, Dublin City University, Dublin, Ireland * Guy De Pauw (AfLaT): CLiPS - Computational Linguistics Group, University of Antwerp, Antwerp, Belgium * Gilles-Maurice de Schryver (AfLaT): African Languages and Cultures, TshwaneDJe HLT, South Africa & Ghent University, Belgium * Kepa Sarasola (SALTMIL): Dept. of Computer Languages, University of the Basque Country * Francis M. Tyers (SALTMIL): Departament de Llenguatges i Sistemes Informàtics, Universitat d'Alacant, Spain * Peter Waiganjo Wagacha (AfLaT): School of Computing & Informatics, University of Nairobi, Nairobi, Kenya *Programme committee* * Iñaki Alegria, University of the Basque Country * Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain * Lars Borin, Göteborgs universitet, Sweden * Sonja Bosch, University of South Africa, South Africa * Mikel L. 
Forcada, Universitat d'Alacant * Dafydd Gibbon, University of Bielefeld, Germany * Girish Nath Jha, Jawaharlal Nehru University, India * Hrafn Loftsson, University of Reykjavik * Guy De Pauw, CLiPS, Universiteit Antwerpen * Laurette Pretorius, University of South Africa, South Africa * Lori Levin, Carnegie Mellon University, USA * Odetunji Odejobi, Obafemi Awolowo University, Nigeria * Felipe Sánchez-Martínez, Universitat d'Alacant * Kepa Sarasola, University of the Basque Country * Kevin Scannell, Saint Louis University, USA * Gilles-Maurice de Schryver, Universiteit Gent * Francis M. Tyers, Universitat d'Alacant * Peter Waiganjo Wagacha, University of Nairobi *Submissions* We expect short papers of max 6,000 words (up to 6 pages) describing research addressing one of the above topics, to be submitted as PDF documents by using the LREC2012 START conference management system (URL: https://www.softconf.com/lrec2012/Less-RessourcedLang2012/). Submissions should be anonymized. When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012. Submissions of papers should follow the same style as the papers for the main LREC conference (an Author's Kit made of specific guidelines and downloadable templates will be published on the conference web site in due time). All contributions (including invited papers) will be included in the workshop proceedings (CD). They will also be published on the SALTMIL website. 
*Important dates* * 27 February 2012: Deadline for submission * 14 March 2012: Notification * 28 March 2012: Final version * 22 May 2012: Workshop *Registration* Registration details will be announced in due course ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:53:34 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:53:34 +0100 Subject: Appel: 1st CfP @NLP can u tag #user_generated_content ?! via lrec-conf.org, LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:47:25 +0100 From: ELRA ELDA Information Message-ID: <4EEF5C8D.5090804 at elda.org> [Apologies for multiple postings] Background and Motivation The Web 2.0 has transferred the authorship of contents from institutions to the people; the web has become a channel where users exchange, explain or write about their lives and interests, give opinions and rate others' opinions. The so-called User Generated Content (UGC) in text form is a valuable resource that can be exploited for many purposes, such as cross-lingual information retrieval, opinion mining, enhanced web search, social science analysis, intelligent advertising, and so on. In order to mine the data from the Web 2.0 we first need to understand its contents. Analysis of UG content is challenging because of its casual language, with plenty of abbreviations, slang, domain specific terms and, compared to published edited text, with a higher rate of spelling and grammar errors. 
Standard NLP techniques, which are used to analyze text and provide formal representations of surface data, have been typically developed to deal with standard language and may not yield the expected results on UGC. For example, shortened or misspelled words, which are very frequent in the Web 2.0 informal style, increase the variability in the forms for expressing a single concept. This workshop aims at providing a meeting point for researchers working in the processing of UGC in textual form in one way or another, as well as developers of UGC-based applications and technologies, both from industry and academia. *Topics of Interest* We are mainly interested in, but not restricted to, the following research questions: - What characterises UGC? Linguistic and textual phenomena that distinguish UGC from standard written text, and may pose a challenge for NLP. - Definition of norm, concept of error, deviation and variation in UGC. - Criteria and standards for the annotation of evaluation corpora in UGC at various levels of linguistic analysis (form, part of speech, constituents, dependencies, speech acts, deviation types, etc.). - How quality of text affects processing tasks (tokenization, POS tagging, chunking, parsing, named-entity detection, etc.) - Architecture and software design for flexible adaptation of NLP processing pipelines to new domains (topic domains and text-genre domains) - Text normalisation vs adaptation of processing tools: - Pros and cons - Task dependent? - Costs and benefits - Hybrid solutions - Approaches to normalisation (text checking, ASR, MT techniques, etc.) - Evaluation issues related to processing and normalising UGC *Intended Audience* The workshop aims at bringing together researchers and developers from academia and industry. 
In particular, perspectives from the following user groups are welcome: - UGC-based application developers, from both research and industry - Researchers from the NLP, IR and IE communities - PhD students interested in or working on the processing of UGC *Submissions* - Oral papers and posters should follow the main conference formatting requirements (http://www.lrec-conf.org/lrec2012/). - To submit contributions, please follow the instructions at https://www.softconf.com/lrec2012/UGC2012/ - The contributions will undergo a double review by members of the programme committee. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to: http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 *Important Dates* February 15: Paper submission deadline March 15: Acceptance notifications March 30: Camera-ready papers May 26: Afternoon Workshop at LREC *Organising Committee* Laura Alonso i Alemany, Universidad Nacional de Córdoba (Argentina) Jordi Atserias, Yahoo! Research (Spain) Toni Badia, Universitat Pompeu Fabra (Spain) Maite Melero, Barcelona Media Innovation Center (Spain) Martí Quixal, Barcelona Media Innovation Center (Spain) *Programme Committee* Rafael Banchs, Institute for Infocomm Research - A*Star (Singapore) Steven Bedrick, Oregon Health & Science University Joan Codina, Universitat Pompeu Fabra (Spain) Louise-Amélie Cougnon, Université 
Catholique de Louvain, ILC, Cental, (Belgium) Jennifer Foster, Dublin City University (Ireland) Michael Gamon, Microsoft Research (USA) Fei Liu, Bosch Research (USA) Ulrike Pado, VICO Research&Consulting GmbH Lluís Padró, Universitat Politècnica de Catalunya (Spain) Alan Ritter, CSE, University of Washington (USA) Roser Saurí, Barcelona Media Innovation Center (Spain) Paul Schmidt, Institut der Gesellschaft zur Förderung der Angewandten Informationsforschung (Germany) L Venkata Subramaniam, IBM Research (India) ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:55:13 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:55:13 +0100 Subject: Appel: 1st Cfp Natural Language Processing for Improving Textual Accessibility (NLP4ITA), LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 16:53:26 +0100 From: ELRA ELDA Information Message-ID: <4EEF5DF6.4000801 at elda.org> X-url: http://www.taln.upf.edu/nlp4ita [Apologies for multiple postings] ******************************************************************** First Call for Papers LREC 2012 workshop on: Natural Language Processing for Improving Textual Accessibility (NLP4ITA) May 27, 2012 (Morning Session) Istanbul, Turkey http://www.taln.upf.edu/nlp4ita ******************************************************************** === Introduction === This workshop aims to bring together researchers focused on tools and resources for making textual information more accessible to people 
with special needs including diverse ranges of hearing and sight disabilities, cognitive disabilities, elderly people, low-literacy readers and adults learning to read and write, among others. === Motivation and Topics of Interest === In recent years there has been an increasing interest in accessibility and usability issues. This interest is mainly due to the greater importance of the Web and the need to provide equal access and equal opportunity to people with diverse disabilities. The role of assistive technologies based on language processing has gained importance, as can be observed from the growing number of efforts (United Nations declarations on universal access to information or WAI guidelines related to content) and research in conferences and workshops (W4A, ICCHP, ASSETS, SPLAT, etc.). However, language resources and tools to develop assistive technologies are still scarce. This workshop will give an opportunity for individuals from different communities to present research findings, discover future challenges, and discuss potential collaboration. We welcome papers describing tools, resources, models, techniques and evaluation from all areas of natural language processing tailored to accessibility and assistive technologies. These include, but are not limited to, the following: - Resources: corpora of inaccessible documents, aligned corpora, gold-standards and annotation schemes. - Evaluation, complexity and readability methodologies and metrics. - Novel modeling and machine learning techniques for improving accessibility. - Natural language generation for improving accessibility. - NLP for Web accessibility. - Text adaptation, elaboration, simplification and summarization. - User studies on systems and components. - NLP tools for accessibility === Submission Information === We encourage contributions in the form of full papers (6 to 8 pages long). Papers should follow the LREC format which is available at the LREC 2012 Web Site. 
The submission process will be online using the START conference system at https://www.softconf.com/lrec2012/NLP4ITA All accepted papers will be presented orally and published in the workshop proceedings. === Important dates === Feb 24, 2012 Paper submission deadline March 23, 2012 Notification of acceptance March 30, 2012 Camera-ready version May 27, 2012 LREC 2012 Workshops === LRE Map 2012 === The LRE Map 2012 is an initiative promoted for monitoring the production, use and evaluation of language resources and tools. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. === Organizers === Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!) Paloma Moreda (Universidad de Alicante) Luz Rello (Universidad Pompeu Fabra) Horacio Saggion (Universidad Pompeu Fabra) Lucia Specia (University of Wolverhampton) === Program Committee === Sandra Aluisio (University of Sao Paulo) Ricardo Baeza-Yates (Universidad Pompeu Fabra, Yahoo!) Delphine Bernhard (University of Strasbourg) Nadjet Bouayad-Agha (Universidad Pompeu Fabra) Richard Evans (University of Wolverhampton) Caroline Gasperin (TouchType Ltd) Pablo Gervás (Universidad Complutense de Madrid) Simon Harper (University of Manchester) David Kauchak (Middlebury College) Guy Lapalme (University of Montreal) Paloma Martínez (Universidad Carlos III de Madrid) Aurelien Max (Paris 11) Kathleen F. 
McCoy (University of Delaware) Ornella Mich (Fondazione Bruno Kessler) Ruslan Mitkov (University of Wolverhampton) Paloma Moreda (Universidad de Alicante) Constantin Orasan (University of Wolverhampton) Luz Rello (Universidad Pompeu Fabra) Horacio Saggion (Universidad Pompeu Fabra) Advaith Siddharthan (University of Aberdeen) Lucia Specia (University of Wolverhampton) Juan Manuel Torres Moreno (University of Avignon) Markel Vigo (University of Manchester) Leo Wanner (Universidad Pompeu Fabra) Yeliz Yesilada (Middle East Technical University Northern Cyprus Campus) === Contact Address === For further information please contact us at: luz.rello at upf.edu or horacio.saggion at upf.edu ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:57:18 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Wed, 21 Dec 2011 22:57:18 +0100 Subject: Appel: 1st CfP Semantic Processing of Legal Texts (SPLeT-2012), LREC 2012 Workshop Message-ID: Date: Mon, 19 Dec 2011 17:03:42 +0100 From: ELRA ELDA Information Message-ID: <4EEF605E.1020005 at elda.org> [Apologies for multiple postings] LREC 2012 Workshop on Semantic Processing of Legal Texts (SPLeT-2012) CALL FOR PAPERS 27 May 2012, Istanbul *Workshop description* The legal domain represents a primary candidate for web-based information distribution, exchange and management, as testified by the numerous e-government, e-justice and e-democracy initiatives worldwide. 
The last few years have seen a growing body of research and practice in the field of Artificial Intelligence and Law which addresses a range of topics: automated legal reasoning and argumentation, semantic and cross-language legal information retrieval, document classification, legal drafting, legal knowledge discovery and extraction, as well as the construction of legal ontologies and their application to the law domain. In this context, it is of paramount importance to use Natural Language Processing techniques and tools that automate and facilitate the process of knowledge extraction from legal texts. Since 2008, the SPLeT workshops have been a venue where researchers from the Computational Linguistics and Artificial Intelligence and Law communities meet, exchange information, compare perspectives, and share experiences and concerns on the topic of legal knowledge extraction and management, with particular emphasis on the semantic processing of legal texts. Within the Artificial Intelligence and Law community, there have also been a number of dedicated workshops and tutorials specifically focussing on different aspects of semantic processing of legal texts at conferences such as JURIX-2008, ICAIL-2009, ICAIL-2011, as well as in the International Summer School "Managing Legal Resources in the Semantic Web" (2007, 2008, 2009, 2010, 2011). To continue this momentum and to advance research, a 4th Workshop on "Semantic Processing of Legal Texts" is being organized at the LREC-2012 conference to bring to the attention of the broader LR/HLT (Language Resources/Human Language Technology) community the specific technical challenges posed by the semantic processing of legal texts and also to share with the community the motivations and objectives which make it of interest to researchers in legal informatics. The outcome of these interactions is expected to advance research and applications and foster interdisciplinary collaboration within the legal domain. 
New to this edition of the workshop are two sub-events to provide common and consistent task definitions, datasets, and evaluation for legal-IE systems along with a forum for the presentation of varying but focused efforts on their development. The first sub-event will be a shared task specifically focusing on dependency parsing of legal texts: although this is not a domain-specific task, it is a task which creates the prerequisites for advanced IE applications operating on legal texts, which can benefit from reliable preprocessing tools. For this year our aim is to create the prerequisites for more advanced domain-specific tasks (e.g. event extraction) to be organized in future SPLeT editions. We strongly believe that this could be a way to attract the attention of the LR/HLT community to the specific challenges posed by the analysis of this type of texts and to have a clearer idea of the current state of the art. The languages dealt with will be Italian and English. A specific Call for Participation for the shared task is available in a dedicated page. The second sub-event will be an online, manual, collaborative, semantic annotation exercise, the results of which will be presented and discussed at the workshop. The goals of the exercise are: (1) to gain insight on and work towards the creation of a gold standard corpus of legal documents in a cohesive domain; and (2) to test the feasibility of the exercise and to get feedback on its annotation structure and workflow. The corpus to be annotated will be a selection of documents drawn from EU and US legislation, regulation, and case law in a particular domain (e.g. consumer or environmental protection). For this exercise, the language will be English. A specific Call for Participation for this annotation exercise is available in a dedicated page. 
The main goals of the workshop and associated events are to provide an overview of the state-of-the-art in legal knowledge extraction and management, to explore new research and development directions and emerging trends, and to exchange information regarding legal language resources and human language technologies and their applications.

*Areas of Interest*

The workshop will focus on the topics of the automatic extraction of information from legal texts and the structural organisation of the extracted knowledge. Particular emphasis will be given to the crucial role of language resources and human language technologies. Papers are invited on, but not limited to, the following topics:

- Construction, extension, merging, customization of legal language resources: terminologies, ontologies
- Information retrieval and extraction from legal texts
- Semantic annotation of legal textual corpora
- Legal text processing
- Multilingual aspects of legal text semantic processing
- Legal thesauri mapping
- Automatic Classification of legal documents
- Logical analysis of legal language
- Automated parsing and translation of natural language arguments into a logical formalism
- Linguistically-oriented XML mark up of legal arguments
- Dialogue protocols for argumentation
- Legal argument ontology
- Computational theories of argumentation that are suitable to natural language
- Controlled language systems for law.

*Submissions*

Submissions are solicited from researchers working on all aspects of semantic processing of legal texts. Authors are invited to submit papers describing original completed work, work in progress, interesting problems, case studies or research trends related to one or more of the topics of interest listed above. The final version of the accepted papers will be published in the Workshop Proceedings. Short or full papers can be submitted.
Short papers are expected to present new ideas or new visions that may influence the direction of future research, yet they may be less mature than full papers. While an exhaustive evaluation of the proposed ideas is not necessary, insight and in-depth understanding of the issues are expected. Full papers should be better developed and evaluated. Short papers will be reviewed the same way as full papers by the Program Committee and will be published in the Workshop Proceedings. Full paper submissions should not exceed 10 pages, short papers 6 pages; both should be typeset using a font size of 11 points. Style files will be made available by LREC for the camera-ready versions of accepted papers. Papers should be submitted electronically, no later than February 10, 2012. The only accepted format for submitted papers is Adobe PDF. Submission will be electronic using START paper submission software available at https://www.softconf.com/lrec2012/SPLeT2012/.

Note that when submitting a paper through the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.

*Selected contributions to a Special Issue of AI&Law Journal*

After the Workshop a number of selected, revised, peer-reviewed articles will be published in a Special Issue on Semantic Processing of Legal Texts of the AI and Law Journal (Springer).
*Important Dates*

Paper submission deadline: 10 February 2012
Acceptance notification sent: 5 March 2012
Final version deadline: 23 March 2012
Workshop date: 27 May 2012

*Workshop Chairs*

- Enrico Francesconi (Istituto di Teoria e Tecniche dell'Informazione Giuridica of CNR, Florence, Italy)
- Simonetta Montemagni (Istituto di Linguistica Computazionale of CNR, Pisa, Italy)
- Wim Peters (Natural Language Processing Research Group, University of Sheffield, UK)
- Adam Wyner (Department of Computer Science, University of Liverpool, UK)

Address any queries regarding the workshop to: lrec_legalWS at ilc.cnr.it

*Program Committee (tbc)*

Kevin Ashley (Univ of Pittsburgh)
Johan Bos (University of Rome, Italy)
Danièle Bourcier (Humboldt Universität, Berlin, Germany)
Thomas R. Bruce (Cornell Law School, Ithaca, NY, USA)
Pompeu Casanovas (Institut de Dret i Tecnologia, UAB, Barcelona, Spain)
Jack Conrad (Thomson-Reuters)
Matthias Grabmair (University of Pittsburgh, USA)
Carole Hafner (Northeastern Univ.)
Antonio Lazari (Scuola Superiore S.Anna, Pisa, Italy)
Leonardo Lesmo (Dipartimento di Informatica, Università di Torino, Torino, Italy)
Carl Malamud (Public.Resource.Org)
Marie-Francine Moens (Katholieke Universiteit Leuven, Belgium)
Thorne McCarty (Rutgers Univ.)
Raquel Mochales Palau (Catholic University of Leuven, Belgium)
Paulo Quaresma (Universidade de Évora, Portugal)
Robert Richards (Legal Informatics blog)
Tony Russell-Rose (Endeca)
Erich Schweighofer (Universität Wien, Rechtswissenschaftliche Fakultät, Wien, Austria)
Rolf Schwitter (Macquarie Univ)
Manfred Stede (University of Potsdam, Germany)
Daniela Tiscornia (Istituto di Teoria e Tecniche dell'Informazione Giuridica of CNR, Florence, Italy)
Tom van Engers (Leibniz Center for Law, University of Amsterdam, Netherlands)
Giulia Venturi (Scuola Superiore S.Anna, Pisa, Italy)
Vern R.
Walker (Hofstra University School of Law, Hofstra University, USA)
Radboud Winkels (Leibniz Center for Law, University of Amsterdam, Netherlands)

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 21:58:49 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 22:58:49 +0100
Subject: Appel: 1st CfP Creating Cross-language Resources for Disconnected Languages and Styles (CREDISLAS), LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 17:06:32 +0100
From: ELRA ELDA Information
Message-ID: <4EEF6108.4030909 at elda.org>
X-url: http://www-lium.univ-lemans.fr/credislas2012

[Apologies for multiple postings]

========================================================================
Workshop on CREATING CROSS-LANGUAGE RESOURCES FOR DISCONNECTED LANGUAGES AND STYLES
Co-located with LREC 2012 (http://www.lrec-conf.org/lrec2012/)
Istanbul, Turkey
May 27, 2012 (afternoon session)
Deadline for paper submissions: February 26, 2012
http://www-lium.univ-lemans.fr/credislas2012
========================================================================

This half-day workshop aims at developing strategies and sharing experiences on creating resources for reducing the linguistic gap between those language pairs for which cross-language resources are scarce.
Although this situation has most commonly been addressed for minority languages that have scarce resources themselves, it is also an important issue in other situations, such as: majority languages that, because of their cultural, historical and/or geographical disconnection, do not have a significant amount of cross-language resources between them (Chinese and Spanish being an excellent example in this category); or single languages in which new communication trends and styles lack cross-language resources linking them to the main formal language (such as chat-speak communications and the corresponding formal language).

Current computational and data storage capabilities have favoured the proliferation of data-driven and statistical approaches in natural language processing and computational linguistics. Empirical evidence has demonstrated in a large number of cases and applications how the availability of appropriate datasets can boost the performance of processing methods and analysis techniques. In this scenario, the availability of data has come to play a fundamental role. On the other hand, both the diversity of languages and the emergence of new communication media and stylistic trends are responsible for the scarcity of resources in the case of some specific tasks and applications.

In this sense, this workshop focuses on those specific applications or cases for which data scarcity poses a restrictive problem for data-driven approaches. This includes the following three specific situations:

Minority Languages, for which scarcity of resources is a consequence of the minority nature of the language itself. In this case, attention is focused on the development of both monolingual and cross-lingual resources. Some examples in this category include Basque, Pashto and Haitian Creole, just to mention a few.
Disconnected Languages, for which a large amount of monolingual resources is available, but due to cultural, historical and/or geographical reasons cross-language resources are actually scarce. Some examples in this category include language pairs such as Chinese and Spanish, Russian and Portuguese, and Arabic and Japanese, just to mention a few.

New Language Styles, which represent different communication forms or emerging stylistic trends in languages for which the available resources are practically useless. This case includes the typical examples of tweets and chat speak communications, as well as other informal forms of communication, in many languages.

The main topics of interest for this workshop include, but are not limited to, the following ones:

* Construction and collection of monolingual resources
* Construction and collection of cross-language resources
* Annotation guidelines and evaluation
* Automatic extraction of linguistic resources
* Automatic annotation of linguistic resources
* Use of crowdsourcing for generating and annotating resources
* Use of pivot languages for bridging unconnected languages
* Methods to adapt existing resources to new domains and styles
* Generation of resources for informal communication styles
* Evaluation of monolingual resources: tasks and protocols
* Evaluation of cross-language resources: tasks and protocols

SUBMISSION INSTRUCTIONS

Authors are invited to submit papers on original and previously unpublished work. Formatting should be according to LREC 2012 specifications using LaTeX or MS-Word style files (available soon at the conference website, see http://www.lrec-conf.org/lrec2012/). Submission is electronic in PDF format using the START submission system at https://www.softconf.com/lrec2012/CREDISLAS2012/

Double submission policy: Parallel submission to other meetings or publications is possible, but the workshop contact person (see below) must be notified immediately.
Authors of accepted papers will be invited to present their research at the workshop. The workshop papers will be part of the LREC proceedings and published on the web site of LREC 2012 before the conference.

IMPORTANT DATES

February 26, 2012: Paper submissions due
March 16, 2012: Notification of acceptance
March 30, 2012: Camera ready papers due
May 27, 2012: Workshop in Istanbul (afternoon session)

ORGANIZERS

Contact person: Patrik Lambert (e-mail: patrik.lambert at lium.univ-lemans.fr)

Patrik Lambert (University of Le Mans), Marta R. Costa-jussà (Barcelona Media Innovation Center), Rafael E. Banchs (Institute for Infocomm Research)

PROGRAMME COMMITTEE

Marianna Apidianaki, LIMSI-CNRS, Orsay, France
Jordi Atserias, Yahoo! Research, Barcelona, Spain
Victoria Arranz, ELDA, Paris, France
Gareth Jones, Dublin City University, Ireland
Min-Yen Kan, National University of Singapore
Philipp Koehn, University of Edinburgh, UK
Udo Kruschwitz, University of Essex, UK
Yanjun Ma, Baidu Inc. Beijing, China
Sara Morrissey, Dublin City University, Ireland
Maja Popovic, DFKI, Berlin, Germany
Paolo Rosso, Universidad de Valencia, Spain
Marta Recasens, Stanford University, USA
Wade Shen, Massachusetts Institute of Technology, Cambridge, USA
Haifeng Wang, Baidu Inc.
Beijing, China

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:00:43 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:00:43 +0100
Subject: Appel: 1st CfP Collaborative Resource Development and Delivery, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 17:13:30 +0100
From: ELRA ELDA Information
Message-ID: <4EEF62AA.9080103 at elda.org>
X-url: http://www.anc.org/Collaborative_Resource_Development
X-url: http://www.anc.org/MASC

[Apologies for multiple postings]

LREC Workshop
Collaborative Resource Development and Delivery
Lütfi Kirdar Istanbul Exhibition and Congress Centre
Istanbul, Turkey
May 27, 2012
http://www.anc.org/Collaborative_Resource_Development

A confluence of needs and activities points to a new emphasis in computational linguistics to address lexical, propositional, and discourse semantics through corpora. A few examples are:

- the demand for high quality linguistic annotations of corpora representing a wide range of phenomena, especially at the semantic level, to support machine learning and computational linguistics research in general;
- the demand for high quality annotated corpora representing a broad range of genres that are flexible and extensible as need demands;
- the demand for high quality lexical and semantic resources to incorporate into the annotation process, and for the annotation process to produce;
- the need for easy-to-use, open access to all of these resources for everyone.
Such resources can be very costly to produce, due to the need for manual creation or validation to ensure quality. Therefore, to answer the growing need and lower the costs of resource creation and enhancement, there is a movement within the community toward collaborative resource development, including collaborative corpus annotation and collective creation/enhancement of lexical resources and knowledge bases. Collaborative development encompasses both engaging the community in annotation and development of common resources, and crowd-sourcing and similar solutions.

Technological advances now enable development of web-based environments for collaborative annotation and enhancement of language resources, including annotated corpora, lexicons, and others; and platforms to support web services that deliver data, annotations, and other resources as well as high-quality automated linguistic annotations of language data. At the same time, crowdsourcing is being explored as a viable means of producing high quality resources. Given the recent advancements in technology plus novel methods to collect manually annotated data, it is important to develop new methods of quality control, hopefully ones that permit rapid acquisition and sharing of resources.

This workshop seeks contributions in all dimensions of collaborative resource development and delivery, with a specific focus on case studies and lessons learned. We invite submissions that address but are not limited to the following topics:

- Web services and platforms for collaborative resource development and distribution;
- Crowd sourcing for resource development, including studies of efficacy;
- Strategies and issues for open resource distribution;
- Evaluation of collaboratively developed resources;
- Position papers outlining issues and proposing solutions for community-based collaborative resource development and/or delivery.
Special session
---------------

The workshop will include a special session devoted to means and considerations for community-based linguistic annotation, with a special emphasis on the Manually Annotated Sub-Corpus (MASC) (http://www.anc.org/MASC). We invite submissions to this session on the following topics:

- position papers concerning any aspect of collaborative resource development, including means to get the community fully invested in such efforts;
- case studies describing collaborative development efforts, including assessment of what works and what doesn't;
- results obtained using collaboratively developed resources;
- the role of standards and best practices in collaboratively developed resources and contributed annotations.

Special consideration will be given to contributions that have used MASC data in a way that highlights the benefits of community-based annotation.

Submission information
----------------------

Submissions may be long papers or short papers, following the formatting guidelines for submissions to the main conference given at http://www.lrec-conf.org/lrec2012/. All submissions should be made using the START system at https://www.softconf.com/lrec2012/CollaborativeDev2012/.
Important Dates
---------------

Submissions due: February 15, 2012
Acceptance notification to authors: March 15, 2012
Camera ready due: April 1, 2012
Workshop: May 27, 2012

Workshop organizers
-------------------

Nancy Ide, Vassar College, USA
Collin Baker, ICSI/UC Berkeley, USA
Christiane Fellbaum, Princeton University, USA
Rebecca Passonneau, Columbia University, USA

Contact: collaboration-workshop at anc.org

Program Committee Members (tentative)
-------------------------------------

Collin Baker, ICSI/UC Berkeley, USA
Jason Baldridge, University of Texas at Austin, USA
Jordan Boyd-Graber, University of Maryland, USA
Nicoletta Calzolari, ILC/CNR, Italy
Bob Carpenter, Alias I, Inc., USA
Chris Cieri, LDC, University of Pennsylvania, USA
Mona Diab, Columbia University, USA
Bill Dolan, Microsoft Corp., USA
Christiane Fellbaum, Princeton University, USA
Dan Flickinger, Stanford University, USA
Terry Langendoen, University of Arizona, USA
Eric Nyberg, Carnegie-Mellon University, USA
Rebecca Passonneau, Columbia University, USA
Massimo Poesio, University of Trento, Italy
Sameer Pradhan, BBN Technologies, USA
James Pustejovsky, Brandeis University, USA
Owen Rambow, Columbia University, USA
Manfred Stede, Universität Potsdam, Germany

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:02:20 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:02:20 +0100
Subject: Appel: 1CfP 5th Workshop on the Representation and Processing of
Sign Languages: Interactions between Corpus and Lexicon, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 17:39:57 +0100
From: ELRA ELDA Information
Message-ID: <4EEF68DD.20002 at elda.org>

[Apologies for multiple postings]

Call for Papers:
5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon
at LREC 2012, Istanbul, May 27th, 2012

Abstracts are invited for a full day workshop on the interplay between sign language corpora and lexicons, to take place preceding/following the 2012 LREC conference.

Recent technological developments allow sign language researchers to create relatively large video corpora of sign language use that were unimaginable ten years ago. Several national projects are currently underway, and more are planned. This workshop aims to share experiences from current and past efforts. What are the problems that were encountered and the solutions created, what are the linguistic decisions taken?

The special focus of this workshop is on the relation of corpus and lexicon work. For some sign languages, a corpus was built after a dictionary had been compiled, while in other cases lexicographic work and corpus annotation are done hand in hand. In some current projects, a corpus is collected without a dictionary being available, and the creation of such is only planned once the corpus is fully annotated. What are the implications of the different approaches, and what tool support is necessary for the different situations and phases?

We invite abstracts for 20-minute oral/signed presentations or posters (with or without demonstrations) on the following topics:

* Experiences in building sign language corpora and lexical resources
* Proposals for standards for linguistic annotation
* Experiences from linguistic research using corpora in combination with lexical resources
* Use of (parallel) corpora and lexicons in translation studies
* Tool development
* Lexicon construction from corpora
* Linking corpora and lexicons
* Integrated presentation of corpus and dictionary contents

Papers of both oral/signed presentations and posters (4-8 pages) of this workshop will be published as workshop proceedings that are included in the conference package.

Please submit your abstract (max. 1000 words) through the LREC START system at https://www.softconf.com/lrec2012/SignLanguage2012/ not later than Feb 6th.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.

*Organising Committee*

Onno Crasborn, Radboud U Nijmegen, NL
Eleni Efthimiou, ILSP Athens, GR
Evita Fotinea, ILSP Athens, GR
Thomas Hanke, Hamburg U, DE
Jette Kristoffersen, UCC Copenhagen, DK
Johanna Mesch, Stockholm U, SE

*Programme Committee (tbc)*

Richard Bowden, Guildford GB
Penny Boyes Braem, Zürich CH
Annelies Braffort, Orsay FR
Christophe Collet, Toulouse FR
Kearsy Cormier, London GB
John Glauert, Norwich GB
Alexis Heloir, Saarbrücken DE
Jens Heßmann, Magdeburg DE
Trevor Johnston, Sydney AU
Reiner Konrad, Hamburg DE
Lorraine Leeson, Dublin IE
Petros Maragos, Athens GR
Carol Neidle, Boston US
Christian Rathmann, Hamburg DE
Adam Schembri, Melbourne AU
Meike Vaupel, Zwickau DE
Rosalee Wolfe, Chicago US

*Interpreting*

With financial support from the LREC conference and other institutions, the oral/signed presentations will be interpreted into International Sign. The International Sign interpreters will also be around for the poster sessions to help out where necessary. Deaf participants wanting to bring interpreters for their national sign languages are kindly asked to contact us so that we can plan accordingly.
*Important Dates*

Deadline for abstracts: February 6th, 2012
Notification of acceptance: February 29th, 2012
Submission of paper (for both oral/signed presentations and posters): March 19th, 2012
Submission of slides for interpreters' preparation (oral/signed presentations only): May 22nd, 2012
Workshop: May 27th, 2012

LREC conference:
May 23rd-May 25th (main conference)
May 21st & May 22nd (pre-conf workshops)
May 26th & May 27th (post-conf workshops)

*More details*

In due time, we will update the workshop website at http://www.sign-lang.uni-hamburg.de/lrec2012/ with travel advice etc. For further information, please contact the organising committee by email to lrec2012 (at) sign-lang.uni-hamburg.de

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:03:56 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:03:56 +0100
Subject: Appel: 2nd CfP Workshop on Computational Models of Narrative, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 17:43:47 +0100
From: ELRA ELDA Information
Message-ID: <4EEF69C3.1030506 at elda.org>
X-url: http://narrative.csail.mit.edu/ws12

[Apologies for multiple postings]

2012 Workshop on Computational Models of Narrative
May 26-27, 2012
Lütfi Kirdar Istanbul Exhibition and Congress Centre
Istanbul, Turkey

to be co-located with the LREC'2012, the Language Resources and Evaluation Conference

Second CALL FOR PAPERS

Paper submission deadline: February 24, 2012

*Workshop Aims*

Narratives are
ubiquitous in human experience. We use them to communicate, convince, explain, and entertain. As far as we know, every society in the world has narratives, which suggests they are rooted in our psychology and serve an important cognitive function. It is becoming increasingly clear that, to truly understand and explain human intelligence, beliefs, and behaviors, we will have to understand why narrative is universal and explain (or explain away) the function it serves. The aim of this workshop (and its predecessors) is to address key questions about narrative that advance our fundamental understanding of narrative and our ability to model it computationally.

*Special Focus: Shared Resources*

In addition to fundamental questions, the field has yet to address key needs with regard to shared resources and corpora that could smooth and hasten the way forward. The vast majority of studies on narrative use fewer than four stories in their experiments, and rarely re-use narratives from previous studies. Because NLP technology cannot yet take us all the way to highly accurate formal representations of language semantics, this implies significant amounts of repeated work in annotation. The way forward could be catalyzed by a carefully constructed set of shared resources.

This meeting will be an appropriate venue for papers addressing fundamental topics and questions regarding narrative. Moreover, the meeting will have a special focus on the identification, collection, and construction of shared resources and corpora that facilitate the computational modeling of narrative. Papers should focus on issues fundamental to computational modeling and scientific understanding, or issues related to building shared resources to advance the field. Discussing technological applications or motivations is not discouraged, but is not required.

*Illustrative Topics and Questions*

* What kinds of shared resources are required for the computational study of narrative?
* What content and modalities should be put in a "Story Bank"? What formal representations should be used?
* What shared resources are available, or how can already-extant resources be adapted to common needs?
* What makes narrative different from a list of events or facts? What is special that makes something a narrative?
* What are the details of the relationship between narrative and common sense?
* How are narratives indexed and retrieved? Is there a "universal" scheme for encoding episodes?
* What impact do the purpose, function, and genre of a narrative have on its form and content?
* What comprises the set of possible narrative arcs? Is there such a set? How many possible story lines are there?
* Are there systematic differences in the formal properties of narratives from different cultures?
* What are appropriate representations for narrative? What representations underlie the extraction of narrative schemas?
* How should we evaluate computational models of narrative?

*Organizing Committee*

Mark A.
Finlayson, Massachusetts Institute of Technology, USA
Pablo Gervás, Universidad Complutense de Madrid, Spain
Deniz Yuret, Koc University, Turkey
Floris Bex, University of Dundee, Scotland

*Sponsors*

There will be a number of travel grants available to workshop authors via our sponsors:

* ONR Global
* Office of Naval Research
* Defense Advanced Research Projects Agency

*Contact*

narrative-ws12 at csail.mit.edu
http://narrative.csail.mit.edu/ws12

Note: Workshop dates have changed slightly since the first call

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:05:48 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:05:48 +0100
Subject: Appel: Workshop on Semantic Relations-II. Enhancing Resources and Applications, LREC 2012 Workshop
Message-ID:

Date: Mon, 19 Dec 2011 15:45:20 +0100
From: Helene Mazo
Message-ID: <4EEF4E00.5070609 at elda.org>
X-url: http://www.racai.ro/SemRel2012/

[Apologies for multiple postings]

Call for papers

Workshop on "Semantic Relations-II. Enhancing Resources and Applications"
http://www.racai.ro/SemRel2012/
22nd May, Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey
Held in conjunction with LREC2012

*Workshop description*

Building on the interest in semantic relations manifested by the participants in the first workshop Semantic Relations. Theory and Applications (held in conjunction with LREC2010), we organize a second edition, Semantic Relations-II.
Enhancing Resources and Applications, in line with current interests shown by various journals, conferences, international projects, the recent Multilingual European Technological Alliance and the synergic projects MetaNet, MetaNet4u (supporting this workshop), MetaNord and CESAR.

Lexical-semantic relations are organizing principles of the lexicon. Linguists have been more concerned with the definition, characterization and classification of semantic relations, the focus being on their paradigmatic aspects. Corpus linguistics offers the material and tools for grounding theories of semantic relations in the empirical analysis of usage and phraseology, and for investigating syntagmatic aspects of semantic relations, which are exploited for various tasks in computational linguistics, such as Information Retrieval, Information Extraction, Question Answering, summarization, text generation and others. Recently, semantic relations within compounds, between simplex words and compounds, and between stems and derived words have received in-depth analysis because of their importance for natural language understanding, rephrasing and generation.

In this workshop we will highlight the interrelation between theoretical and practical aspects in the study of semantic relations, a theme with important benefits for both theoretical and computational linguistics. The intended audience for our workshop consists of both linguists and computer scientists interested in theoretical and practical aspects of semantic relations. The aim is to highlight the various ways in which lexical-semantic relations can enhance language resources and the results of various applications using them.
*Topics*
Topics of the workshop will include, but are not limited to:
* Co-occurrence and semantic relations
* Semantic resources for discourse analysis
* Extraction of semantic relations from various sources
* Exploitation of semantic relations in NLP applications
* Knowledge representation and semantic relations
* Lexical semantics and word sense
* Semantic relations and word formation (compounding and derivation)
* Semantic relations and language learning and acquisition
* Semantic relations in Natural Language Generation applications
* Semantic relations and terminology
* WordNet and WordNet-like resources

*Invited speaker*
Our invited speaker, Professor Patrick Hanks, will deliver a presentation on the Theory of Norms and Exploitations, with the title "Mapping Semantic Relations onto Pattern of Word Use using Corpus Evidence".

*Submission information*
Papers will be submitted to the workshop via the START LREC Conference Manager, under https://www.softconf.com/lrec2012/SemRel2012/. Authors should submit a PDF file of no more than 10 pages, following the LREC conference formatting details. Papers will be blind reviewed by three members of the Program Committee, so the authors' names and affiliations should not appear in the paper. Accepted papers will be published in the workshop Proceedings.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LREMap-2012.
*Important dates*
Paper submission: 2nd March 2012
Notification of acceptance: 23rd March 2012
Camera-ready papers: 30th March 2012
Workshop: 22nd May 2012

*Program Committee*
Eduard Barbu, CIMEC (Italy)
Antonio Branco, FCL (Portugal)
Elena Cabrio, LIMSI (France)
Corina Forascu, UAIC (Romania)
Nuria Gala, LIF-CNRS (France)
Patrick Hanks, UWE (UK)
Amac Herdagdelen, New England Complex Systems Institute (UK)
Diana Inkpen, University of Ottawa (Canada)
Radu Ion, RACAI (Romania)
Elisabetta Jezek, PAVIA (Italy)
Svetla Koeva, Institute for Bulgarian Language (Bulgaria)
Claudia Kunze, Qualisys GmbH (Germany)
Gerhard Kremer, Institut fuer Computerlinguistik, Universitaet Heidelberg (Germany)
Hristina Kukova, Institute for Bulgarian Language (Bulgaria)
Svetlozara Leseva, Institute for Bulgarian Language (Bulgaria)
Bernardo Magnini, FBK-IRST (Italy)
Emanuela Pianta, FBK-IRST (Italy)
Reinhard Rapp, University of Leeds (UK)
Didier Schwab, Laboratoire d'Informatique de Grenoble (France)
Carlo Strapparava, FBK-IRST (Italy)
Sara Tonelli, FBK-IRST (Italy)
Dan Tufis, RACAI (Romania)
Michael Zock, CNRS&LIF (France)

*Contact person*
Please email vergi AT racai DOT ro if you have any questions regarding this workshop.
*Organizing Committee*
Verginica Barbu Mititelu
Romanian Academy Research Institute for Artificial Intelligence
E-mail: verginicabm at gmail.com

Octavian Popescu
Human Language Technology Group, Fondazione Bruno Kessler
E-mail: popescu at fbk.eu

Viktor Pekar
Dictionaries Dept, Academic Division, Oxford University Press
E-mail: v.pekar at gmail.com

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:07:40 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:07:40 +0100
Subject: Conf: SHESL / HTL 2012 Colloquium, 26-28 January 2012, Paris
Message-ID:

Date: Tue, 20 Dec 2011 11:36:54 +0100
From: Christian Puech
Message-Id:
X-url: http://www.shesl.org/

Dear colleagues,

The annual colloquium of the Société d'Histoire et d'Epistémologie des Sciences du Langage, organized in collaboration with UMR 7597 "Histoire des théories linguistiques", will take place on 26 (am), 27 and 28 January 2012:

"Linguistiques d'intervention - des usages socio-politiques des savoirs sur le langage et les langues"

Pôles des langues et civilisations (Auditorium BULAC INALCO) - 65 rue des Grands Moulins - 75013 - Paris

Programme:
Contact: shesl at neuf.fr
Website: http://www.shesl.org/

Christian PUECH
Université Sorbonne Nouvelle Paris III
President of the SHESL
Centre de Linguistique Française. UFR LLD. Centre Censier.
13, rue de Santeuil. 75005 - Paris.
christian.puech at univ-paris3.fr
UMR CNRS 7597 Paris 7/Paris 3: "Histoire des théories linguistiques"

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:09:16 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:09:16 +0100
Subject: Book: Les enonces averbaux autonomes entre grammaire et discours
Message-ID:

Date: Tue, 20 Dec 2011 14:41:18 +0100
From: "Annick Deniel"
Message-ID: <31A343C6D09247D1B86593C2A51F1573 at DELLAnnick>

PRESS RELEASE

We are pleased to announce the publication of the following book:

Les énoncés averbaux autonomes entre grammaire et discours

by Florence Lefeuvre and Irmtraud Behr, respectively Maître de Conférences accredited to supervise research and Professor at the Université de la Sorbonne Nouvelle. They are members of two research teams at the Sorbonne Nouvelle, EA 2290 in French linguistics and EA 4223 in German linguistics. Both have written numerous books and articles on verbless utterances.

This collection asks grammars of written and spoken language how they conceive of autonomous syntactic units, whether these are called sentences, utterances or clauses. Besides syntactic units based on a finite verb, do they accept other types of units, based this time on a verbless segment such as "Incroyable"? The aim of the book is to examine the link between the grammatical framework adopted and the analysis in discourse of these autonomous verbless units. Various types of discourse are examined: literary discourse, press discourse, and also spoken discourse, which is the subject of the most recent research.

The book is intended for researchers, teachers, students and anyone who wishes to deepen their knowledge of the verbless sentence considered in discourse.

Contents:
Jacqueline Guillemin-Flescher : L'énoncé averbal : repérage et subjectivité
Zsuzsanna Gécseg : Etre ou ne pas être : prédication averbale et localisation spatio-temporelle en hongrois
Frédéric Laurens : Analyse des énoncés averbaux prédicatifs autonomes
Henri-José Deulofeu : Existe-t-il un "impératif averbal" en français ?
Gilles Corminboeuf : Diptyques introduits par des énonciations nominales
Claire Blanche-Benveniste : Enoncés sans verbe en français parlé : un modèle à deux composantes
Mustapha Krazem : Zidane, qui passe à Sganarelle, qui lance à Platini, qui n'a pas vu Géronte... Ou comment des noms propres deviennent phrases averbales
Catherine Chauvin : Existe-t-il des constructions averbales ? Le cas des énoncés évaluatifs avec adjectif de type "Great !" en anglais
Françoise Mignon : Le statut prédicatif du marqueur de négation non en discours
Florence Lefeuvre : Bon à l'oral : une unité averbale autonome ?
Yana Grinshpun : Phrase averbale et presse écrite : le cas des constructions en [UN + N + expansion]
Sophie-Anne Brault-Scappini, Sandra Teston-Bonnard : Les fonctionnements du GP à partir de là : entre grammaticalisation et pragmaticalisation
Noalig Tanguy : Les segments averbaux comme unités syntaxiques à l'oral
Irmtraud Behr : Etude d'énoncé averbal à un terme : entre grammaire et discours
Angelika Redder : Enoncés averbaux en série - des segments narratifs spécifiques

Details: 276 pages, 21 € - ISBN 978-2-7080-1320-9

We are confident that this book will interest your readers, and we would be grateful if you could announce it in the bibliography pages of your magazine. If you would like to publish a more in-depth review, we will gladly send you a press copy of the book (adeniel at editionstechnip.com). We simply ask that you remember to send us proof of publication, which is essential for our press file.

Thank you for the welcome you will give this title.

Best regards,

Annick DENIEL
Communications Manager ~ adeniel at editionstechnip.com
Direct line: 01 45 78 33 87
Editions OPHRYS
25, rue Ginoux - 75015 Paris
Tel: 33 (0)1 45 78 33 80
Fax: 33 (0)1 45 75 37 11
Website: www.ophrys.fr

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:11:03 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:11:03 +0100
Subject: Call: Natural language processing, analyses and applications, ACFAS annual congress
Message-ID:

Date: Tue, 20 Dec 2011 09:28:57 -0500
From: Adel Jebali
Message-ID:
X-url: http://www.acfas.net/membre/compte.php

CALL

Colloquium: natural language processing, analyses and applications
ACFAS annual congress
Montréal, Canada
Friday 11 May 2012

IMPORTANT DATES
Abstract submission deadline: 5 February 2012
Notification to authors: 10 February 2012
Final programme sent to ACFAS: 15 February 2012
Colloquium: Friday 11 May 2012
Submission of the article for publication in the proceedings: to be determined.

DURATION
Talks will last 20 minutes, followed by 10 minutes for questions.

TALKS
Talks may address any topic in natural language processing and may concern one or more languages. Topics fall within the following areas, without being limited to them:

Foundations
- Phonetics and phonology
- Morphology
- Syntax
- Semantics
- Lexicon
- lexicometry
- classification
etc.

Applications
- Computer-assisted teaching
- Information extraction from the Web
- Machine translation
- Text analysis and generation
- Automatic summarization
- Human-machine dialogue
etc.

SUBMISSION GUIDELINES
- Abstracts must not exceed 1500 characters (including spaces). Any additional characters cannot appear in the programme.
- Proposals must be sent as an attachment (doc, rtf or pdf file), with no mention of the author's name or affiliation, to the following address: ajebali at alcor.concordia.ca
- In the subject line of your message, please write "Proposition ACFAS 2012".
- Please include the following information in the body of your message: Author's name. Title of the talk. Affiliation.
- Authors whose proposal is accepted are required to open an account and register for the ACFAS annual congress at the following Web address: http://www.acfas.net/membre/compte.php.
- Authors of accepted talks will be invited, in a second call, to write a longer article to be published in a collective volume.

ORGANIZERS
Adel Jebali, Université Concordia : ajebali at alcor.concordia.ca
Ismaïl Biskri, Université du Québec à Trois-Rivières : ismail.biskri at uqtr.ca

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:12:37 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:12:37 +0100
Subject: Job: Professor position (section 27) in Human Language Technology, DLU/GREYC - Universite de Caen Basse-Normandie
Message-ID:

Date: Tue, 20 Dec 2011 20:09:24 +0100
From: Gaël Dias
Message-ID: <4EF0DD64.3080905 at unicaen.fr>
X-url: http://www.greyc.fr/dlu

====================================================================
Professor position (section 27) in Human Language Technology
DLU/GREYC - Université de Caen Basse-Normandie
====================================================================

The Université de Caen Basse-Normandie is recruiting a full professor (Professeur des Universités, section 27) in the field of Human Language Technology (Challenge 4 of the European Union's "Programme Information and Communication Technologies") for the start of the 2012/2013 academic year.

The candidate must demonstrate strong, high-level expertise in one or more of the following areas, among others: Information Retrieval, Information Extraction, Natural Language Processing, Machine Translation, Text Mining, Knowledge Extraction, Text Summarization, Human-Machine Interfaces.

The candidate will join the Document, Langues, Usages team (DLU - http://www.greyc.fr/dlu) within the laboratory Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen (GREYC - UMR 6072 CNRS UCBN/ENSICAEN - http://www.greyc.fr), rated level A. The candidate will be expected to energize, and contribute to, the internationalization and excellence of the DLU team, which comprises 13 permanent associate professors (Maîtres de Conférences) or research scientists (Chargés de Recherche) and one recently recruited full professor. DLU builds on nationally and internationally recognized expertise in digital documents, document engineering, natural language processing, textual knowledge extraction and human-machine interfaces. At DLU, the candidate will find a young and dynamic environment whose research centres on promising themes such as the personalization, collaboration, accessibility, adaptability and ubiquity of human language applications in heterogeneous multilingual digital environments such as the Web or social networks.

The position also aims to strengthen Human Language Technologies in the teaching programme of the Computer Science Department of the UFR Sciences at the Université de Caen Basse-Normandie. In this context, the candidate will be expected to contribute actively to the success of the new Master Image et Texte (whose first edition will start at the beginning of the 2012/2013 academic year), in which many modules will need to be taught. The candidate's experience will also be important across the whole Computer Science Department, in particular for the upcoming leadership of the Master Document.

Candidates interested in a position with senior responsibilities in Human Language Technology are invited to contact the heads of the DLU team, Gaël Dias (gael.dias at unicaen.fr) and Nadine Lucas (nadine.lucas at unicaen.fr). For any further information on the teaching side, candidates should contact the Head of the Computer Science Department, Bruno Crémilleux (bruno.cremilleux at unicaen.fr). Regarding the research side, candidates may also contact the Directors of GREYC, Mohamed M'Saad (mohammed.msaad at ensicaen.fr) and Frédéric Jurie (frederic.jurie at unicaen.fr).

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:14:44 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:14:44 +0100
Subject: Call: 2nd CFP, EACL 2012, Joint Workshop on Information Retrieval and Hybrid Machine Translation
Message-ID:

Date: Wed, 21 Dec 2011 08:48:03 +0100
From: Marta Ruiz
Message-ID:
X-url: http://www-lium.univ-lemans.fr/esirmt-hytra

Joint Workshop on
EXPLOITING SYNERGIES BETWEEN INFORMATION RETRIEVAL AND MACHINE TRANSLATION (ESIRMT)
and
HYBRID APPROACHES TO MACHINE TRANSLATION (HyTra)

Co-located with EACL 2012 (http://eacl2012.org/)
Avignon, France
April 23 and 24, 2012

Deadline for paper submissions: February 2, 2012

http://www-lium.univ-lemans.fr/esirmt-hytra
========================================================================

This two-day workshop addresses two specific but related research problems in computational linguistics.

The ESIRMT event (1st day) aims at reducing the gap, both theoretical and practical, between information retrieval and machine translation research and applications.
Although the two fields have already been contributing to each other instrumentally, much work remains to be done to frame them solidly within a common ground of knowledge, from both the procedural and the paradigmatic perspectives.

The HyTra event (2nd day) aims at sharing ideas among researchers who develop and apply statistical, example-based, or rule-based machine translation systems and who wish to enhance their systems with elements from the other approaches.

The joint workshop will give participants the opportunity to discuss research related to technology integration and system combination strategies, at both the general level of cross-language information access and the specific level of machine translation technologies.

Contributions are to be organized into two tracks, corresponding to ESIRMT and HyTra, respectively. Topics of interest include, but are not limited to:

ESIRMT track:
* machine translation and information retrieval hybridization
* applications of MT and IR hybrid systems
* IR techniques integrated in MT systems
* MT techniques integrated in IR systems
* any kind of innovative approach exploiting synergies between MT and IR
* machine learning techniques for ranking

HyTra track:
* ways and techniques of MT hybridization
* architectures for the rapid development of hybrid MT systems
* hybrid systems dealing with underresourced languages and/or with morphologically rich languages
* using linguistic information (morphology, syntax, semantics) to enhance statistical MT
* bootstrapping rule-based systems from corpora
* hybrid methods in spoken language translation
* extraction of dictionaries from parallel and comparable corpora
* machine learning techniques for hybrid MT
* heuristics for limiting the search space in hybrid MT
* alternative methods for the fair evaluation of the output of different types of MT systems
* system combination approaches
* open source tools and free language resources for hybrid MT

SUBMISSION INSTRUCTIONS

Authors are invited to submit papers on original and previously unpublished work. Formatting should follow the EACL 2012 specifications, using the LaTeX or MS-Word style files; see http://eacl2012.org/information-for-authors/index.html (paper length: up to 9 pages plus an extra page for references). Reviewing of papers will be double-blind, so submissions should not reveal the authors' identity. Submission is electronic, in PDF format, using the START submission system at https://www.softconf.com/eacl2012/ESIRMT-HyTra/

Double submission policy: Parallel submissions to other meetings or publications are possible, but the workshop contact person (see below) must be notified immediately.

Authors of accepted papers will be invited to present their research at the workshop. The papers will be published in the electronic workshop proceedings. These will be part of the EACL proceedings, published by ACL. They will also be made available online via the ACL Anthology.

IMPORTANT DATES
February 2, 2012: Paper submissions due
February 24, 2012: Notification of acceptance
March 9, 2012: Camera ready papers due
April 23-24, 2012: Workshop in Avignon

ORGANIZERS
Contact person: Marta R. Costa-jussà (e-mail: marta.ruiz at barcelonamedia.org)
ESIRMT: Marta R. Costa-jussà (Barcelona Media Innovation Center), Patrik Lambert (University of Le Mans), Rafael E. Banchs (Institute for Infocomm Research)
HyTra: Reinhard Rapp (Universities of Mainz and Leeds), Bogdan Babych (University of Leeds), Kurt Eberle (Lingenio GmbH), Tony Hartley (Toyohashi University of Technology and University of Leeds), Serge Sharoff (University of Leeds), Martin Thomas (University of Leeds)

INVITED SPEAKERS
Philipp Koehn (University of Edinburgh)
Christof Monz (University of Amsterdam)

PROGRAMME COMMITTEE
Jordi Atserias, Yahoo! Research, Barcelona, Spain
Bogdan Babych, University of Leeds, UK
Sivaji Bandyopadhyay, Jadavpur University, Kolkata, India
Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain
Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland
Chris Callison-Burch, Johns Hopkins University, Baltimore, USA
Michael Carl, Copenhagen Business School, Denmark
Oliver Culo, ICSI, University of California, Berkeley, USA
Kurt Eberle, Lingenio GmbH, Heidelberg, Germany
Andreas Eisele, Directorate-General for Translation, European Commission, Luxembourg
Marcello Federico, Fondazione Bruno Kessler, Trento, Italy
José A. R. Fonollosa, Universitat Politècnica de Catalunya, Barcelona, Spain
Mikel Forcada, University of Alicante, Spain
Alexander Fraser, Institute for Natural Language Processing (IMS), Stuttgart, Germany
Johanna Geiß, Lingenio GmbH, Heidelberg, Germany
Mireia Ginesti-Rosell, Lingenio GmbH, Heidelberg, Germany
Silvia Hansen-Schirra, FTSK, University of Mainz, Germany
Tony Hartley, Toyohashi University of Technology, Japan, and University of Leeds, England
Gareth Jones, Dublin City University, Ireland
Min-Yen Kan, National University of Singapore
Udo Kruschwitz, University of Essex, UK
Yanjun Ma, Baidu Inc., Beijing, China
José B. Mariño, Universitat Politècnica de Catalunya, Spain
Maite Melero, Barcelona Media Innovation Center, Barcelona, Spain
Haizhou Li, Institute for Infocomm Research, Singapore
Reinhard Rapp, University of Mainz, Germany, and University of Leeds, UK
Paul Schmidt, Institute for Applied Information Science, Saarbrücken, Germany
Uta Seewald-Heeg, Anhalt University of Applied Sciences, Köthen, Germany
Nasredine Semmar, CEA LIST, France
Serge Sharoff, University of Leeds, UK
Wade Shen, Massachusetts Institute of Technology, Cambridge, USA
Fabrizio Silvestri, Istituto di Scienza e Tecnologie dell'Informazione, Pisa, Italy
Harold Somers, CNGL, Dublin City University, Ireland
Anders Søgaard, University of Copenhagen, Denmark
Martin Thomas, University of Leeds, UK
Jörg Tiedemann, University of Uppsala, Sweden
Zygmunt Vetulani, University of Poznan, Poland

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:16:55 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:16:55 +0100
Subject: Call: LREC 2012 Workshop on Language Resources for Public Security
Message-ID:

Date: Wed, 21 Dec 2011 10:32:56 +0000
From: Frédérique Segond
Message-ID:

LREC 2012 Workshop on Language Resources for Public Security Applications

CALL FOR PAPERS

27 May 2012, Istanbul (half-day)
Conference venue: the Lütfi Kirdar Istanbul Exhibition and Congress Centre
Deadline: March 5, 2012

Workshop description

Public security in Europe and worldwide is facing several threats. These include threats arising from intentional human activities such as terrorism, spontaneous risks related to the uncontrolled behavior of individuals involved in mass events, natural disasters, etc. Combating these dangers creates challenges for information and communication technologies, which in many cases directly involve various forms of natural language processing. Gathering, maintaining and processing language resources specific to security applications is of primary importance for the language technologies concerned. In some cases it is useful to investigate and use sensitive linguistic data, which raises technological and legal problems connected with privacy, ownership, civic rights protection, etc.

The workshop is intended to serve as a thematic discussion forum open to:
- language resources suppliers,
- researchers and language engineers interested in the development of systems for security applications involving language technologies,
- potential/actual users of such systems,
- people concerned with legal aspects of gathering, maintenance and applications of language resources for public security purposes.

The emergence of long-term cooperation projects involving the workshop participants would be a welcome side effect of the workshop.

Areas of Interest

The workshop will focus on knowledge processing applications serving public security. Particular emphasis will be given to the crucial role of language resources and related technologies.
Contributions are invited on - but not limited to - the following topics:
- security specific corpora,
- security specific terminology,
- language models for specific sub-languages and language registers important for security research,
- language technology based tools to enhance public security,
- linguistic tools for risk assessment,
- controlled languages for public security applications,
- AI and NLP decision supporting systems,
- sharing and processing sensitive linguistic data,
- legal aspects of security-oriented natural language processing and engineering,
- access to sensitive data,
- IPR issues,
- protection and use of sensitive source data,
- international collaboration issues,
- issues related to national and international funding.

Submissions

We invite contributions on one or more of the topics listed above, as well as on any other topic related to the workshop's area of interest. We expect papers presenting novel ideas or completed or advanced projects, or discussing pressing problems of the area. The paper selection procedure will be based on peer review of a 2000-word abstract and will be essentially the same as for the general LREC. The form of presentation will be decided later by the Program Committee, depending on the number of submissions. Abstracts and full papers should be submitted electronically from the conference web site through the START Conference Manager. Templates (LaTeX and Word) for the preparation of full papers will be accessible from the conference web site. The only accepted format for submitted papers is Adobe PDF.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e., also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.
The full versions of the accepted papers will be published in the Workshop Proceedings.

Important Dates
Abstract submission deadline: March 5, 2012
Acceptance notification: March 18, 2012
Final version deadline: March 26, 2012
LREC (Conference, Tutorials and Workshops): May 21-27, 2012
LRPS Workshop: May 27, afternoon

(Initial) Program Committee:
Edouard Geoffrois (co-chair) (Direction Générale de l'Armement, Paris, France)
Laura Chaubard (Direction Générale de l'Armement, Paris, France)
Jakub Gorczyński (Polish National Police, Poznań, Poland)
Fryni Kakoyianni-Doa (University of Cyprus, Nicosia, Cyprus)
Nasrullah Memon (University of Southern Denmark, Odense, Denmark)
Mario Montoleone (Salerno University, Italy)
Karel Pala (Masaryk University, Brno, Czech Republic)
Frederique Segond (Viseo, Grenoble, France)
Tadeusz Tomaszewski (University of Warsaw, Poland)
Zygmunt Vetulani (co-chair) (Adam Mickiewicz University in Poznań, Poland)

Contact: vetulani at amu.edu.pl

Frédérique SEGOND
34, avenue Félix Viallet - 38000 GRENOBLE
Tel. +33 (0)4 76 53 72 72
Mob. +33 (0)6 89 44 60 88
fsegond at viseo.net
Head of the Innovation Division
Research & Development Manager
http://wiki.objetdirect.com/expertise/Fr%C3%A9d%C3%A9rique_Segond
Objet Direct is partner of the GALATEAS and LEILAS EU Projects
http://www.galateas.eu/index.html
http://www.leilas.eu/

From thierry.hamon at UNIV-PARIS13.FR Wed Dec 21 22:18:38 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Wed, 21 Dec 2011 23:18:38 +0100
Subject: Call: Workshop on Open Data in a Semantic Web Perspective
Message-ID:

Date: Wed, 21 Dec 2011 15:25:01 -0500
From: eric charton
Message-ID:
X-url: http://wodswp2012.wikimeta.org/

First Call for Papers

------------------------------------------------------------------------
Workshop on Open Data in a Semantic Web Perspective
http://wodswp2012.wikimeta.org/

May 27th, 2012
York University, Toronto, Ontario, Canada
(co-located with Canadian AI 2012)
------------------------------------------------------------------------

WORKSHOP DESCRIPTION

Open Data is the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other mechanisms of control. Open Data has recently been gaining popularity with the rise of the W3C Semantic Web prospective vision. The LinkedData network, promoted by the W3C, is the most visible part of what an open Web of data, as a complement to a Web of documents, could be.
Vast repositories of normalized data like those from DBPedia, PubMed, GeneID, DrugBank, OECD or UNESCO are now stimulating an intensive new field of research on the possible uses of those data in AI and knowledge management applications. Major companies involved in data exploration and information retrieval, such as Google with its Public Data Explorer, are investing in this new field and studying new interfaces and algorithms to bridge the gap between data and users.

Governments are increasingly recognizing the benefits of making their data open and reusable. The launch of open-data government initiatives such as Data.gov in the USA, or various initiatives in Europe like the Open Data Challenge or data.gov.uk, is a strong encouragement to the development of the open data movement. The wide availability of Public Sector Information can be a key driver to develop content markets in North America, which can generate new businesses and jobs and provide more information to consumers and citizens. In Canada, the Open Data Pilot is part of the Government's commitment to open government, which is being pursued along three streams: open data, open information and open dialogue, and aims to drive innovation and economic opportunities for all Canadians.

We would like to invite researchers and actors of the open data movement to present their work at the workshop. This workshop aims to bring together data providers, initiative groups, technology providers, and researchers to facilitate the dissemination of ideas and methods and to promote new research perspectives related to Open Data and Semantic Web applications.

TOPICS

Topics of interest include, but are not limited to:

1. Open Data sets and catalogs (including Semantic Web, LinkedData, public data sets)
2. Open Data pilot initiatives (including governmental, non-profit, academic)
3. Relation between Open Data and Natural Language Processing
4. Architectures and systems using Open Data
5. Tools and resources related to Open Data
6. Open Data construction tools
7. Format of Open Data (including RDF, OWL)
8. Application of Open Data in maps, encyclopedic knowledge, genomics, chemical compounds, mathematical and scientific formulae, medical data and practice, bioscience and biodiversity
9. Evaluation and metrics related to Open Data sets quality, processing and software performance
10. Open Data and translation
11. Open Data visualization

Demos and papers related to working applications or systems under development are encouraged. We hope to cover three main perspectives: government and administrations (e.g., open ), industry (e.g., marketing, software applications and services), and academia (e.g., theoretical research related to Open Data production and uses).

SUBMISSION INFORMATION

Two formats are proposed:

1. Full paper (up to 8 pages)
2. Demo paper (up to 4 pages)

Submissions of commercial applications and research-related papers are strongly encouraged when they are strictly related to real open data (e.g., an innovative commercial software tool using public data sets).

Authors are invited to submit full papers electronically on the workshop website by February 22nd, 2012, in PDF, PostScript or MS-Word RTF. All papers must be written in English. For submission please use our easychair site.

IMPORTANT DATES

Paper submission: February 22, 2012
Notification of acceptance: March 30, 2012
Camera ready: April 27, 2012
Workshop: May 27, 2012

PROGRAM COMMITTEE

Christopher Baker, UNB, Saint John, Canada
Pierre-Antoine Champin, Université Claude Bernard Lyon 1, Lyon, France
Dragan Gasevic, Athabasca University, Athabasca, Canada
Volker Haarslev, Concordia University, Montréal, Canada
Anja Jentzsch, Freie Universität, Berlin, Germany
Birgitta König-Ries, University of Jena, Jena, Germany
Guy Lapalme, Université
de Montréal, Montréal, Canada
Daniel Lemire, UQAM, Montréal, Canada
Roger Nkambou, UQAM, Montréal, Canada
Raphael Troncy, Eurécom, Sophia Antipolis, France
Petko Valtchev, UQAM, Montréal, Canada
René Witte, Semantic Software Lab - Concordia University, Montréal, Canada

WORKSHOP ORGANIZERS AND CONTACT

Eric Charton (École Polytechnique de Montréal)
Michel Gagnon (École Polytechnique de Montréal)
Marie-Jean Meurs (Concordia University)

Contact: eric.charton at polymtl.ca

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:24:28 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:24:28 +0100
Subject: Stage: 3 offres, Ingenieur Developpement, GEOLSemantics
Message-ID:

Date: Thu, 22 Dec 2011 14:02:46 +0100
From: "Emmanuel DUPONT"
Message-ID: <00a101ccc0aa$0098d0d0$01ca7270$@geolsemantics.com>

--------------------------------
End-of-studies internship with a view to hiring: Development Engineer
Pre-sales: strong voice-based authentication

The company:
GEOLSemantics is a French company based in Paris with about fifteen employees. Its mission is to offer software solutions that help detect and track hostile activities directed at states, companies and individuals. These solutions apply to information contained in various media: text, audio and video. They use sophisticated multilingual semantic analysis algorithms to give operational staff a synthesized view of the knowledge extracted from the various information sources. They also use cutting-edge speaker identification technologies for audio and video media.

Internship topic:
GEOLSemantics is looking for an intern at bac+5 level (Master's or equivalent) in computer science to build a "strong speaker authentication" prototype using our voice biometrics tools. As part of a small team, you will have the opportunity to develop in the field of enhanced security and the fight against identity theft.

Internship content:
The aim of this internship is to build a prototype to support the sale of our voice biometrics solutions. The objective is to improve application access security through voice identification of the speaker. This strong authentication complements conventional methods based on a login and password.

Internship schedule:
- Selection and setup of the architecture
- Design of scenarios
- Development of the prototype infrastructure
- Development of an Android application to make the prototype available on mobile devices
- Testing and presentation in client pre-project phases

Your autonomy and your oral and written communication skills will quickly allow you to take responsibility for carrying out and fully tracking your work, under the guidance of an experienced engineer.

Intern profile:
Required technical skills:
- Master 2 or engineering degree in computer science
- Good knowledge of Java Web (basic HTML/CSS)
- Knowledge of Android (or the ability to acquire it quickly)
- Ability to get to grips with a new field (voice biometrics)
- Proficiency in XML and RDF
Required qualities: good writing skills, strong functional curiosity, strong autonomy.

The internship is supervised by the pre-sales manager.
- Duration: 6 months minimum
- Start date: as soon as possible
- Location: Paris, 15th arrondissement
- Compensation: negotiable

Applications (CV, cover letter, photo) should be sent to:
Emmanuel Dupont, Directeur des Opérations
Société GEOLSemantics
32, rue Brancion
75015 Paris
emmanuel.dupont at geolsemantics.com
Website: under construction

----------------------------------------
End-of-studies internship with a view to hiring: Development Engineer
Module for authoring linguistic rules

The company:
Based in Paris, GEOLSemantics is a young, innovative company that develops and markets natural language processing software for strategic intelligence needs. The applications are numerous and mainly centred on the automatic detection of threats, rumours or opinions from the systematic analysis of multilingual documents published on the Web, on websites, blogs, forums or social networks.

Internship topic:
GEOLSemantics is looking for an intern at bac+5 level (Master's or equivalent) in computer science to build its module for authoring high-level linguistic rules designed to extract structured knowledge expressed in an ontology. As part of a small team, you will have the opportunity to develop in the world of Web 3.0 using J2EE technologies (Portlets under Liferay, Java).

Internship content:
At present, writing linguistic rules requires in-depth expertise in our processing engine. GEOLSemantics wishes to provide its engineers and partners with an SDK that assists in writing linguistic rules. The purpose of this tool is to increase in-house productivity and to make it easier for external contributors to get started with our software package.

Starting from the requirements document, you will need to understand the functional need in order to write the functional specifications and quickly set up a first mock-up. After validation with the end users, you will be responsible for developing and delivering the SDK.

The breakdown of activities is as follows:
- 60% specification: understanding the need, the languages handled by the linguists, and the Web 3.0 business domain
- 40% development in Java (executable integrated with our semantic engine, and user interface)

Your autonomy and your oral and written communication skills will quickly allow you to take responsibility for carrying out and fully tracking your work, under the guidance of an experienced engineer.

Intern profile:
Required technical skills:
- Ability to get to grips with a new field (the Semantic Web)
- Algorithmic background in automata
- Good Java skills
- Proficiency in XML and RDF
Required qualities: good writing skills, strong functional curiosity.
Language: English required.

The internship is supervised by the Director of Operations.
- Duration: 6 months minimum
- Start date: as soon as possible
- Location: Paris, 15th arrondissement
- Compensation: negotiable

Applications (CV, cover letter, photo) should be sent to:
Emmanuel Dupont, Directeur des Opérations
Société Cad?ge/GEOLSemantics
32, rue Brancion
75015 Paris
emmanuel.dupont at geolsemantics.com
Website: under construction

---------------------------------------------
End-of-studies internship with a view to hiring: Development Engineer
Clustering algorithm

The company:
Based in Paris, GEOLSemantics is a young, innovative company that develops and markets natural language processing software for strategic intelligence needs. The applications are numerous and mainly centred on the automatic detection of threats, rumours or opinions from the systematic analysis of multilingual documents published on the Web, on websites, blogs, forums or social networks.

Internship topic:
GEOLSemantics is looking for an intern at bac+5 level (Master's or equivalent) in computer science to build a clustering algorithm. As part of a small team, you will have the opportunity to develop in the world of Web 3.0.

Internship content:
In order to evaluate the quality of tools that aggregate information about people (such as 123people), the scientific community has organized several international competitions (the WEBS campaigns). Starting from a set of texts about people who have namesakes (different people with the same name), a first phase consists of grouping the documents that talk about the same person using clustering technologies. In a second phase, for each cluster concerning a single person (the namesakes being assumed to have been separated by the clustering), a number of pieces of information about the person are extracted (date and place of birth, address, telephone number, degrees, ...).

GEOLSemantics has developed automatic linguistic processing that identifies the significant elements in texts and extracts information about people. Building on the tools that exist in the company, the intern is asked to develop a clustering algorithm that relies on the results of the linguistic analysis of the texts describing the people, and then to complete the knowledge extraction rules in order to extract from each cluster the information required by the WEBS evaluation campaign. The results can be compared with those obtained by participants in previous campaigns. Since the language of the campaign is English, the intern is required to have a good command of that language.

Your autonomy and your oral and written communication skills will quickly allow you to take responsibility for carrying out and fully tracking your work, under the guidance of an experienced engineer.

Intern profile:
Required technical skills:
- Master 2 or engineering degree in computer science, with knowledge of natural language processing
- A taste for competition (the aim is to produce better results than the others); the results may lead to a scientific publication
- Ability to get to grips with a new field (the Semantic Web)
- Good Java skills
- Proficiency in XML and RDF
Required qualities: good writing skills, strong functional curiosity.
Language: English required.

The internship is supervised by the Scientific Director.
- Duration: 6 months minimum
- Start date: as soon as possible
- Location: Paris, 15th arrondissement
- Compensation: negotiable

Applications (CV, cover letter, photo) should be sent to:
Emmanuel Dupont, Directeur des Opérations
Société Cad?ge/GEOLSemantics
32, rue Brancion
75015 Paris
emmanuel.dupont at geolsemantics.com
Website: under construction

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:26:33 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:26:33 +0100
Subject: Appel: ECAI 2012
Message-ID:

Date: Thu, 22 Dec 2011 15:44:47 +0100
From: ECAI Publicity
Message-ID: <4EF3425F.5090402 at lirmm.fr>
X-url: http://www2.lirmm.fr/ecai2012/
######################################################################
## ECAI-2012 ## Call for Papers ## ECAI-2012 ##
######################################################################

The Twentieth European Conference on Artificial Intelligence
27-31 August 2012
Montpellier, France
http://www2.lirmm.fr/ecai2012/

Deadline for submission of papers: 6 March 2012

######################################################################
## INTRODUCTION ##
######################################################################

ECAI is the biennial European Conference on Artificial Intelligence organized by the European Coordinating Committee for Artificial Intelligence (ECCAI); it is the premier forum for presenting Artificial Intelligence results in Europe. The twentieth conference in this series, ECAI 2012, will take place in Montpellier (France) August 27-31.

ECAI 2012 will gather researchers and practitioners of Artificial Intelligence from all over the world; they will present the latest results and identify new trends and challenges in all subfields of Artificial Intelligence as well as demonstrate innovative applications and uses of advanced Artificial Intelligence technology.

In addition to the main technical track, ECAI 2012 will feature keynote and invited talks, invited tutorials, an elaborate workshop program, a system demonstration track, a Turing centenary celebration, in which we will celebrate the 100th anniversary of the birth of Alan Turing and his influence on Artificial Intelligence, and a special session on Highlights of European AI, to celebrate the 20th ECAI conference and 25 years of AI Communications. ECAI 2012 will also feature the 6th STAIRS (STarting AI Researcher Symposium) and the 6th PAIS (Conference on Prestigious Applications of Intelligent Systems) as sub-conferences.

ECAI-2012 will be held in the beautiful city of Montpellier, France.
With excellent opportunities for sightseeing and gastronomy, Montpellier promises to be a wonderful venue for a memorable conference.

######################################################################
## SUBMISSIONS ##
######################################################################

ECAI 2012 welcomes submissions of technical papers to the main track. Submitted papers should present original, high-quality and previously unpublished results on Artificial Intelligence. Submissions are welcome from all areas of AI; the following list of topics is indicative only.

* Agents & Multiagent Systems
* Case-Based Reasoning
* Cognitive Modeling & Interaction
* Constraints & Search
* Knowledge Representation & Reasoning
* Machine Learning & Data Mining
* Model-Based Reasoning
* Natural Language Processing
* Perception & Sensing
* Planning & Scheduling
* Robotics
* Uncertainty in AI
* Applications of AI

ECAI 2012 especially encourages submissions that cross discipline boundaries within Artificial Intelligence, and between Artificial Intelligence and other disciplines.

Both long (6-page) and short (2-page) papers can be submitted. Whereas long papers should report on substantial research results, short papers are intended for highly promising but possibly more preliminary work. Short papers will be presented in poster form. Rejected long papers will not be considered for the short paper track.

ECAI 2012 also welcomes submissions of proposals for workshops, descriptions of system-demonstrations, papers on prestigious applications of AI for PAIS, and contributions of starting researchers in AI for STAIRS; cf. the separate calls for these tracks.
######################################################################
## IMPORTANT DATES ECAI Technical Program ##
######################################################################

Paper submission deadline: Tuesday, 6 March 2012
Author response period (indicative): 18-23 April 2012
Notification of acceptance/rejection: Monday, 21 May 2012
Camera-ready paper due: Sunday, 3 June 2012
Conference: 27-31 August 2012

######################################################################
## PAPER FORMATTING & SUBMISSION INFORMATION ##
######################################################################

Long papers must not exceed SIX (6) pages and short papers must not exceed TWO (2) pages in camera-ready format for ECAI. Over-length submissions will be rejected without review.

Papers for ECAI should be submitted using the ECAI formatting style; details of the style are available at:

http://people.cs.kuleuven.be/~luc.deraedt/ecai2012-style.zip

This file unpacks to a directory "ecai2012", and contains LaTeX and other style files. The file "ecai2012.tex" is an example paper in LaTeX format using the appropriate styles, and can be used as a template for ECAI submissions. A Microsoft Word template is also available.

Reviewing for ECAI 2012 will be blind: reviewers will not be presented with the identity of paper authors. To allow for blind review, author names in a submitted paper or poster should be replaced by the unique tracking number assigned by the conference website at the submission of an electronic abstract. Authors should avoid writing anything that makes their identity obvious in the text.

Submissions should be original, and in particular should not previously have been formally published. (Any publication venue with an ISBN or ISSN number counts as a formal publication; this includes LNCS/LNAI volumes, for example.) Submissions should not be submitted elsewhere during the ECAI 2012 review phase.
The proceedings of the ECAI conference, together with those of the PAIS symposium, will be published and distributed by IOS Press as an online open access book. (The proceedings of some past ECAI conferences are freely available on the IOS website.) The authors will be responsible for producing camera-ready copies of papers, conforming to the ECAI formatting guidelines, for inclusion in the proceedings. At least one author of each accepted long or short paper is required to attend the conference to present the contribution.

Submission and review of papers for ECAI-2012 will be managed via the ConfMaster system. ConfMaster may be live for ECAI submissions before 31 January. Prior to this date, the site will be used for configuration and testing. All data uploaded or registered before this date will be deleted and will not be considered for the conference. Do NOT register or upload papers before 31 January 2012!

All submissions will be subject to peer review by the ECAI 2012 Programme Committee, and evaluated on the basis of: relevance; significance of contribution; technical quality; scholarship; and quality of presentation. The authors of the best papers of ECAI 2012 will be invited to submit an extended version of their paper to the Artificial Intelligence Journal.

The primary authors of submitted papers will be offered the opportunity to respond to the reviews for their papers before the final decision on acceptance or otherwise is made. The author feedback phase is currently planned for 18-23 April 2012.
######################################################################
## ORGANISATION ##
######################################################################

Conference Chair: Didier Dubois
ECAI Program Chair: Luc De Raedt
Organizing Committee Chair: Christian Bessiere
ECCAI Chair: Gerhard Brewka
Workshop Chairs: Jérôme Lang and Michèle Sebag
System-Demonstration Chairs: Patrick Doherty and Fredrik Heintz
STAIRS Program Chairs: Kristian Kersting and Marc Toussaint
PAIS Program Chairs: Paolo Frasconi and Peter Lucas
Turing Centenary Celebration Chair: Michael Wooldridge

######################################################################
## LOCAL ORGANIZING COMMITTEE ##
######################################################################

LIRMM supervisor: Christophe Dhenaut
Logistics: Elisabeth Greverie, Justine Landais, Eric Bourreau
Treasurer: Michel Leclere, Laetitia Megual
Technical support: Remi Coletta
Web site: Michel Liquiere
Sponsoring: Remi Coletta
Publicity: Marie-Laure Mugnier, Madalina Croitoru
Other members: Stefano Cerri, Abdel Gouaich, Joel Quinqueton
AFIA contacts: Patrick Albert, Yves Demazeau, Thomas Guyet, Eunika Mercier-Laurent, Jean-Denis Muller

######################################################################
## AREA CHAIRS ##
######################################################################

Area chairs currently confirmed include:

Michael Beetz [Germany]
Ronen Brafman [Israel]
Fabio Cozman [Brazil]
James Cussens [UK]
Thomas Eiter [Austria]
Johannes Fuernkranz [Germany]
Hector Geffner [Spain]
Marco Gori [Italy]
Udo Hahn [Germany]
Malte Helmert [Switzerland]
Manfred Jaeger [Denmark]
Sarit Kraus [Israel]
Andreas Krause [Switzerland]
Jerome Lang [France]
Pierre Marquis [France]
John-Jules Meyer [The Netherlands]
Bernhard Nebel [Germany]
Ann Nowé [Belgium]
Barry O'Sullivan [Ireland]
Francesca Rossi [Italy]
Michele Sebag [France]
Carles Sierra [Spain]
Steffen Staab [Germany]
Peter Struss [Germany]
Michael Thielscher [Australia]
Hannu Toivonen [Finland]
Antal van den Bosch [The Netherlands]
Pascal Van Hentenryck [US]
Michael Wooldridge [UK]
Frank Wolter [UK]

######################################################################
## KEYNOTE SPEAKERS ##
######################################################################

The ECAI-2012 programme will include plenary talks from internationally leading researchers in AI and cognate disciplines. Keynote speakers so far confirmed include:

* Wolfram Burgard [Freiburg, Germany]
* Adnan Darwiche [UCLA, US]
* Tom Mitchell [CMU, US]

######################################################################
ECAI 2012
http://www2.lirmm.fr/ecai2012/

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:29:30 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:29:30 +0100
Subject: Appel: IEEE 2nd DICTAP2011- Thailand
Message-ID:

Date: Thu, 22 Dec 2011 16:00:07 -0600
From: "The Second International Conference on Digital Information and Communication Technology and its Applications (DICTAP2012)"
Message-ID: <0b7b8911cb11c40c6e79c4404598207d at localhost.localdomain>
X-url: http://www.sdiwc.net/thi/

The Second International Conference on Digital Information and Communication Technology and its Applications (DICTAP2012)
University of the
Thai Chamber of Commerce (UTCC), Bangkok, Thailand
May 16-18, 2012

Technically co-sponsored by IEEE Thailand Section
All papers will be submitted to IEEE for potential inclusion in IEEE Xplore

http://www.sdiwc.net/thi/
=======================================================================

The conference will be held at the University of the Thai Chamber of Commerce (UTCC), Bangkok, Thailand, on May 16-18, 2012. DICTAP2012 aims to enable researchers to build connections between different digital applications. The conference welcomes papers on the following (but not limited to) research topics:

1) Communication Basic and Infrastructure
- Telecommunications
- Communication protocols
- Algorithms, Architecture, and Infrastructures
- Transmission Techniques
- Communication Systems

2) Networks Basic and Management
- Network Systems and Devices
- Wireless Networks, Ad-Hoc and Sensor Networks
- Network Modeling and Simulation
- Network Management Techniques
- Networks Security, Encryption and Cryptography

3) Multimedia Application
- Management and Diffusion of Multimedia Applications
- Multimedia Data Base
- Information Retrieval
- Digital Rights Management

4) Image, Video, Signal and Information Processing
- Compression and Coding
- Analysis and Processing
- Computer Graphics
- Data mining
- Soft Computing
- Web based applications

Researchers are encouraged to submit their work electronically. All papers will be fully refereed by a minimum of two specialized referees. Before final acceptance, all referees' comments must be considered.

Important Dates
==============
Submission Date            : Mar. 10, 2012
Notification of acceptance : Apr. 1, 2012
Camera Ready submission    : Apr. 14, 2012
Registration               : Apr. 14, 2012
Conference dates           : May 16-18, 2012

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:32:14 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:32:14 +0100
Subject: Appel: TKE 2012 - New frontiers in the constructive symbiosis of terminology and knowledge engineering
Message-ID:

Date: Fri, 23 Dec 2011 00:45:46 +0100
From: Nathalie Aussenac-Gilles
Message-ID: <4EF3C12A.2010303 at irit.fr>
X-url: http://www.oeg-upm.net/tke2012

[Apologies for cross-posting]
------------------------------------------------------------------------
SECOND CALL FOR PAPERS
TKE 2012
New frontiers in the constructive symbiosis of terminology and knowledge engineering
TKE (Terminology and Knowledge Engineering) Conference
http://www.oeg-upm.net/tke2012
------------------------------------------------------------------------

When - Conference    20-21 June 2012
When - Workshops     19 & 22 June 2012
Where                Madrid, Spain
Submission Deadline  21-01-2012
Notification Due     18-02-2012
Final Version Due    07-04-2012

Conference Aim and Topics

TKE 2012 will be the 10th edition of this conference that started out under the auspices of TermNet, and has developed towards wider fields in knowledge transfer for special language communication in a multilingual global society. In this context, interdisciplinary research emphasizing the intersection of different disciplines dealing with terminological aspects and knowledge engineering resources online is welcome.
This conference will mainly focus on those theoretical, methodological and practical aspects that show the symbiosis of terminology and knowledge engineering by highlighting the recent advances in these related fields. We invite papers on all aspects of:

I. Terminology and knowledge engineering
* Ontologies as semantic resources
* Lexicons and ontologies as means for knowledge transfer
* Classification systems and thesauri
* Multimodal information management in terminology
* Metadata in terminological and knowledge resources
* Ontology localization
* Multilingual terminology extraction and alignment for lexical resources
* Representation of terms and conceptual relations in knowledge-based applications
* Comparative studies of terminological resources and/or ontological resources from different languages, domains and approaches
* Interoperability and reusability in knowledge-based tools and applications

II. Terminology policies and terminological tools
* Terminology in Languages for Specialized Purposes (LSPs)
* Terminology and lexicography teaching and learning
* Terminology for language planning
* Terminological resources in the 21st century
* Multilinguality and multimodality in terminological resources
* Reusing, standardizing and merging terminological or ontological resources

III. Terminology applications
* Applications of terminological resources
* Terminology in industrial and business settings
* Terminological tools for machine translation
* Terminology in natural language processing tools
* Terminology in e-government and sector standardization

Submission of extended abstracts for oral presentations and posters

Extended abstracts of papers for oral presentations should consist of not more than four pages, excluding references and illustrations, and should be submitted via the Conference Submission System. To submit an abstract, go to the submission site (soon available) and select "new submission".
The abstract submitted will be used to assign reviewers, so be sure it identifies the subject area and expertise required to evaluate your work. The program committee may decide to change the category of a paper before acceptance.

Complete papers will be submitted electronically. Papers must be no longer than 16 pages (5000-6000 words). Papers that exceed this limit will be rejected without review. Submissions must be in PDF, formatted in the style of the Springer Publications format for Lecture Notes in Computer Science (LNCS). For details on the LNCS style, see Springer's Author Instructions.

The 2nd Call for Papers will include further details on accommodation and how to arrive.

Organizing Committee

Guadalupe Aguado de Cea, UPM, Spain (Conference chair)
Gerhard Budin, Vienna University, Austria
Key Sun Choi, KAIST, Korea
Raúl García Castro, UPM, Spain
Elena Montiel-Ponsoda, UPM, Spain
Bodil Nistrup Madsen, CBS and GTW, Denmark
Ana Roldán Riejos, UPM, Spain
Mari Carmen Suárez-Figueroa, UPM, Spain
Klaus-Dirk Schmitz, Fachhochschule Köln and GTW, Germany
Hanne Erdman Thomsen, CBS and GTW, Denmark

Programme Committee

Bassey Antia, Nigeria
Nathalie Aussenac-Gilles, France
Lynne Bowker, Canada
Paul Buitelaar, Ireland
Maria Teresa Cabré, Spain
Rute Costa, Portugal
Anne Condamines, France
Beatrice Daille, France
Thierry Declerck, Germany
Pamela Faber, Spain
Gil Francopoulo, France
Asunción Gómez Pérez, Spain
Fidelia Ibewke-Sanjuan, France
Kyo Kageura, Japan
Hendrick Kockaert, Belgium
Marita Kristiansen, Norway
Marie-Claude L'Homme, Canada
Adeline Nazarenko, France
Henrik Nilsson, Sweden
Anita Nuopponen, Finland
Antonio Pareja-Lora, Spain
Maria Pozzi, Mexico
Christophe Roche, France
Laurent Romary, France
Sylvie Szluman, France
Frieda Steurs, Belgium
Birthe Toft, Denmark
Kara Warburton, Hong-Kong
Sue Ellen Wright, USA
Pierre Zweigenbaum, France

From thierry.hamon at UNIV-PARIS13.FR Sat Dec 24 09:34:16 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Sat, 24 Dec 2011 10:34:16 +0100
Subject: Appel: TKE 2012 - Call for Workshops
Message-ID:

Date: Fri, 23 Dec 2011 23:46:50 +0100
From: Nathalie Aussenac-Gilles
Message-ID: <4EF504DA.6070904 at irit.fr>

Apologies for multiple posting

****************************************************************
Call for Workshop Proposals
10th Terminology and Knowledge Engineering Conference (TKE 2012)
19th - 22nd June 2012
Madrid, Spain
****************************************************************

TKE 2012 will be the 10th edition of this conference that started out under the auspices of TermNet, and has developed towards wider fields in knowledge transfer for special language communication in a multilingual global society. In this context, interdisciplinary research emphasizing the intersection of different disciplines dealing with terminological aspects and knowledge engineering resources online is welcome.

Co-located workshops at TKE conferences are distinguished meeting points for discussing ongoing work and latest ideas related to terminology and knowledge engineering.

General Information and criteria
--------------------------------

The decision on acceptance/rejection of workshop proposals will be made on the basis of the overall quality of the proposal as well as of how it fits in with the conference.
The organizers of successful workshops will be responsible for their own reviewing process, publicity (e.g., website and call for papers), and proceedings production. They will be required to closely cooperate with the conference organizers to finalize all organizational details. Workshop attendees must pay the TKE 2012 workshop registration fee, as well as the conference registration fee. Criteria for workshop proposals: * Co-located workshops cover topics falling in the general scope of the TKE conference. * Workshops are intended to be genuine interactive events and not mini-conferences. * We welcome workshops with creative structures and organizations that attract various types of contributions and ensure rich interactions. * They are clearly focused. * There is potentially a significant community interested in the workshop's topic. * Workshop duration can be half a day or a full day. * We advise having more than one organizer and no more than four, preferably from different institutions, bringing different perspectives to the workshop topic. Submission Guidelines --------------------- Workshop proposals must be no more than 3 pages in length, in txt or PDF format, must be sent via email to Raúl García Castro (rgarcia at fi.upm.es), and should contain the following information: * The title and brief technical description of the workshop, specifying its goals and motivation. * A brief discussion of why the topic is of particular interest at this time. * A brief description of why and to whom the workshop is of interest, the workshop audience, as well as the expected number of participants. * A brief description (draft outline) of the proposed workshop format, discussing the mix of events such as paper presentations, invited talks, panels, and general discussion. * A list of (potential) members of the program committee (at least 50% have to be known and confirmed at the time of the proposal).
* An indication of whether the workshop should be considered for a half-day or full-day event. * If there have been previous editions of the workshop, their short description, including URLs as well as some statistics about submissions and acceptance rate. * The names and contact information (web page, email address) of the proposed organizing committee and short description of their relevant experience. The organizing committee should also indicate whether they plan to prepare a workshop web site or whether the workshop information will be included in the main conference website. Important Dates (11:59 pm Hawaii time) Workshop Proposals Due: January 25, 2012 Notification of Acceptance: February 10, 2012 Workshop Website Due: February 25, 2012 Workshop Camera-ready Proceedings Due: June 1, 2012 Workshop Day: June 19 or 22, 2012 Workshop Chair -------------- Raúl García Castro (UPM, Spain) (tke2012workshops at delicias.dia.fi.upm.es) From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:01:56 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:01:56 +0100 Subject: Appel: Ireland International Conference on Education (IICE-2012), Reminder Message-ID: Date: Sun, 25 Dec 2011 16:21:17 +0000 (GMT) From: Linda Woods Message-ID: <322433556.44731.1324830077963.JavaMail.open-xchange at email.1and1.co.uk> X-url: http://www.iicedu.org/ Apologies for cross-postings.
Kindly email this call for papers to your colleagues, faculty members and postgraduate students. CALL FOR PAPERS ********************************************************** Ireland International Conference on Education (IICE-2012), April 16-18, 2012, Dublin, Ireland (www.iicedu.org) ********************************************************** The IICE is an international refereed conference dedicated to the advancement of theory and practice in education. The IICE promotes collaborative excellence between academicians and professionals from Education. The aim of IICE is to provide an opportunity for academicians and professionals from various educational fields with cross-disciplinary interests to bridge the knowledge gap, promote research esteem and the evolution of pedagogy. The IICE 2012 invites research papers that encompass conceptual analysis, design implementation and performance evaluation. All accepted papers will appear in the proceedings, and modified versions of selected papers will be published in special issues of peer-reviewed journals.
The topics in IICE-2012 include but are not confined to the following areas: * Academic Advising and Counselling * Art Education * Adult Education * APD/Listening and Acoustics in Education Environment * Business Education * Counsellor Education * Curriculum, Research and Development * Competitive Skills * Continuing Education * Distance Education * Early Childhood Education * Educational Administration * Educational Foundations * Educational Psychology * Educational Technology * Education Policy and Leadership * Elementary Education * E-Learning * E-Manufacturing * ESL/TESL * E-Society * Geographical Education * Geographic information systems * Health Education * Higher Education * History * Home Education * Human Computer Interaction * Human Resource Development * Indigenous Education * ICT Education * Internet technologies * Imaginative Education * Kinesiology & Leisure Science * K12 * Language Education * Mathematics Education * Mobile Applications * Multi-Virtual Environment * Music Education * Pedagogy * Physical Education (PE) * Reading Education * Writing Education * Religion and Education Studies * Research Assessment Exercise (RAE) * Rural Education * Science Education * Secondary Education * Second life Educators * Social Studies Education * Special Education * Student Affairs * Teacher Education * Cross-disciplinary areas of Education * Ubiquitous Computing * Virtual Reality * Wireless applications * Other Areas of Education Important Dates: * Extended Abstract (Work in Progress) Submission Date: January 15, 2012 * Research Paper, Student Paper, Case Study, Report Submission Date: January 20, 2012 * Proposal for Workshops: January 15, 2012 * Notification of Workshop Acceptance/Rejection: January 20, 2012 * Poster Proposal Submission Date: January 10, 2012 * Notification of Poster Acceptance/Rejection: January 15, 2012 * Notification of Extended Abstract (Work in Progress) Acceptance/Rejection: January 25, 2012 * Notification of Research Paper, Student 
Paper, Case Study, Report Acceptance/Rejection: January 30, 2012 * Camera Ready Paper Due: February 10, 2012 * Early Bird Registration Deadline (Authors and Participants): February 15, 2012 * Late Bird Registration Deadline (Authors and Participants): March 10, 2012 * Conference Dates: April 16-18, 2012 For further information please visit IICE-2012 at www.iicedu.org From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:04:49 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:04:49 +0100 Subject: Appel: EACL 2012 Workshop on Innovative hybrid approaches to the processing of textual data Message-ID: Date: Mon, 26 Dec 2011 18:51:42 +0100 From: Thierry Hamon Message-ID: <87d3bbz0ep.fsf at smtp.univ-paris13.fr> X-url: http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/ Call for papers Innovative hybrid approaches to the processing of textual data April 22, 2012, Avignon, France Website: http://www-limbio.smbh.univ-paris13.fr/membres/hamon/hybrid/ Submission Deadline: Jan 27, 2012 The term hybrid approach covers a large set of situations in which different approaches are combined in order to better process textual data and better accomplish the target task. The possible combinations are unlimited. The most frequent combination, as stressed during The Balancing Act in 1994, pairs machine learning with rule-based systems.
Beyond this, the hybridization can be augmented with distributional approaches, syntactic and morphological analyses, semantic distances and similarities, graph theory models, cooccurrences of linguistic units (e.g., words and their dependencies, word senses and POS tags, NEs and semantic roles,...), knowledge-based approaches (terminologies and ontologies), etc. As a matter of fact, hybridization requires defining a strategy to combine several approaches efficiently: cooperation between approaches, filtering, voting or ranking of the multiple system outputs, etc. Indeed, the combination of these different methods and approaches appears to provide more complete and better-performing results. The reason is that each method is sensitive to, and effective on, particular data and contexts. Hence, their combination may improve both precision and recall. Coverage is improved, while exploiting different methods may also improve precision, since they can be used in filtering, voting and similar modes. In this workshop, we favour the extended meaning of the hybridization of methods, applied to various application areas, such as (but do not feel constrained by these): - automatic creation of linguistic resources - POS tagging - building and structuring of terminologies - information retrieval and filtering - information extraction - linguistic annotation - semantic labeling - sign language recognition and transcription - oral data transcription - filtering and validation of lexical resources - text summarization - question answering systems - natural language generation - etc. We invite authors to submit novel methods and novel conceptions of the hybridization performed in various areas related to textual data processing.
Important dates Nov 25, 2011: 1st workshop CFP Jan 04, 2012: Abstract deadline Jan 27, 2012: Paper due date Feb 24, 2012: Notification of acceptance Mar 09, 2012: Camera-ready deadline Apr 22, 2012: Workshop Submission instructions: Authors are invited to submit full papers on original, unpublished work in the topic area of this workshop. Submissions should be formatted using the EACL 2012 style files for LaTeX or MS Word, with blind review and not exceeding 8 pages plus an extra page for references. The PDF files will be submitted electronically at https://www.softconf.com/eacl2012/Hybrid2012/ Program Committee Philipp Cimiano, CITEC, University of Bielefeld, Germany Vincent Claveau, IRISA, CNRS, France Kevin Cohen, University of Colorado Health Sciences Center, USA Marie-Claude l'Homme, OLST, Université de Montréal, Canada Béatrice Daille, LINA, Université de Nantes, France Sampo Pyysalo, NaCTeM, University of Manchester, United Kingdom Mathieu Roche, LIRMM, Université de Montpellier 2, France Patrick Ruch, Haute école de gestion de Genève, Switzerland Paul Thompson, NaCTeM, University of Manchester, United Kingdom Özlem Uzuner, University at Albany, State University of New York, USA Organization Committee Natalia Grabar, CNRS UMR 8163 STL, Université Lille 1&3, France Marie Dupuch, MOSTRARE/LIFL & CNRS UMR 8163 STL, Université Lille 1&3, France Amandine Périnet, LIM&BIO, Université Paris 13, France Thierry Hamon, LIM&BIO, Université
Paris 13, France From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:06:44 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:06:44 +0100 Subject: Appel: 1st CfP Workshop on Language Resources and Technologies for Turkic Languages, LREC 2012 Workshop Message-ID: Date: Tue, 27 Dec 2011 18:14:27 +0100 From: ELRA ELDA Information Message-ID: <4EF9FCF3.3030805 at elda.org> [Apologies for cross-postings] First Workshop on Language Resources and Technologies for Turkic Languages May 21, 2012 (afternoon session) First Call for Papers Turkic languages (Turkish, Azerbaijani, Turkmen, Kazakh, Tatar, Uzbek and many others) are spoken as native languages by more than 150 million people all around the world. Turkic languages have complex agglutinative morphology with very productive inflectional and derivational processes leading to a very large vocabulary size. They also have a very free constituent order with almost no formal constraints. Furthermore, due to various historical and social reasons these languages have employed a wide-variety of writing systems and still do so. These aspects bring numerous challenges to computational processing of these languages in tasks such as language modeling, parsing, statistical machine translation, speech-to-speech translation, etc.
Time is now ripe to focus on the development of language resources and computational processing techniques for these languages to bring their status up to par with more studied languages in the context of speech and language processing. We are pleased to invite you to the "First Workshop on Language Resources and Technologies for Turkic Languages" to be held during LREC 2012 in Istanbul. This half-day workshop will be the first attempt to bring together the researchers, commercial interests and other stakeholders actively involved in developing and using speech and language technologies for Turkic languages. The workshop invites submission of papers on original and unpublished research on all aspects of speech and language technologies for Turkic languages. Selected contributions will be presented as short-oral presentations or posters. Topics include but are not limited to: - Morphological and Syntactic Analysis for Turkic languages - Novel language modeling techniques for Turkic languages - Development and adaptation of language and speech resources for Turkic languages - Machine translation into/from/between Turkic languages - Challenges for speech recognition and generation for Turkic languages Submission Information Submissions must describe original and unpublished work. Each submission will be judged on its originality, technical content, and relevance to the workshop. Contributions accompanied by language resources and datasets that are described in the paper are particularly encouraged. Paper submissions up to 4 pages (excluding references) must conform to the official LREC 2012 style guidelines (will be distributed soon on the LREC website). Submissions should be in PDF format and submitted through the START online submission system https://www.softconf.com/lrec2012/TurkicLanguage2012/. Since the reviewing will not be blind, submissions must include the authors' names and affiliations. Double submissions to other conferences are not allowed. 
More information on the submission procedure will be available in the next call for papers. When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012. Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012. Important Dates Paper submission deadline: February 27, 2012 Notification of acceptance: March 19, 2012 Camera-ready submission deadline: March 30, 2012 Workshop: May 21, 2012 (afternoon session) Organizing Committee Kemal Oflazer, Carnegie Mellon University - Qatar Mehmed Özkan, Boğaziçi University Mehmet Uğur Doğan, TÜBİTAK-BİLGEM Dilek Hakkani-Tür, Microsoft Alper Kanak, TÜBİTAK-BİLGEM İlknur Durgar El-Kahlout, TÜBİTAK-BİLGEM Şeniz Demir, TÜBİTAK-BİLGEM Yücel Bicil, TÜBİTAK-BİLGEM Program Committee Adil Alpkoçak, Dokuz Eylül University Ahmet Cüneyd Tantuğ, İstanbul Technical University Arzucan Özgür, Boğaziçi University Atakan Kurt, Fatih University Banu Diri, Yıldız Technical University Barış Bozkurt, Bahçeşehir University Bilge Say, Middle East Technical University Cem Bozşahin, Middle East Technical University Cemil Demir, TÜBİTAK-BİLGEM Cenk Demiroğlu, Özyeğin University Coşkun Mermer, TÜBİTAK-BİLGEM Deniz Yüret, Koç University Deniz Zeyrek, Middle East Technical University Ebru Arısoy, IBM T.J. Watson Research Center Engin Erzin, Koç University Erdem Ünal, TÜBİTAK-BİLGEM Fatma Canan Pembe, TÜBİTAK-BİLGEM Gülşen Cebiroğlu Eryiğit, İstanbul Technical University Levent Arslan, Boğaziçi University Mehmet Fatih Amasyalı, Yıldız Technical University Murat Can Ganiz, Doğuş
University Murat Saraçlar, Google - Boğaziçi University Oğuzhan Külekçi, TÜBİTAK-BİLGEM Özlem Çetinoğlu, University of Stuttgart Ruken Cakici, Middle East Technical University Selçuk Köprü, Teknoloji Yazılımevi Şükriye Ruhi, Middle East Technical University Tunga Güngör, Boğaziçi University Ümit Güz, Işık University Yeşim Aksan, Mersin University Yusuf Ziya Işık, TÜBİTAK-BİLGEM Sponsor The workshop is being supported by the European Commission FP7-REGPOT-2008-1 project (#229861) MULTISAUND (MULTilingualism Integrated to Speech and Audio UNDerstanding) - "Improvement of interactive and secure language and speech processing potential of TUBITAK-UEKAE for an increased multilingual capability in ERA". Contact Information Please contact Mehmet Uğur Doğan (mugur at uekae.tubitak.gov.tr) or İlknur Durgar El-Kahlout (idurgar at uekae.tubitak.gov.tr) for any further inquiries. From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:10:07 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:10:07 +0100 Subject: Appel: 1st CfP Workshop on Indian Language Data: Resources and Evaluation, LREC 2012 Workshops Message-ID: Date: Tue, 27 Dec 2011 18:18:48 +0100 From: ELRA ELDA Information Message-ID: <4EF9FDF8.1010607 at elda.org> X-url: http://sanskrit.jnu.ac.in/conf/wildre [Apologies for cross-postings] WILDRE- Workshop on Indian Language Data: Resources and Evaluation FIRST WORKSHOP ON INDIAN LANGUAGE DATA: RESOURCES AND EVALUATION (WILDRE)
Date: Monday, 21st May 2012 (morning session) Venue: Lütfi Kirdar Istanbul Exhibition and Congress Centre, Turkey (Organized under the platform of LREC2012 (21-27 May 2012)) Website: http://sanskrit.jnu.ac.in/conf/wildre WILDRE, the first workshop on Indian Language Data: Resources and Evaluation, is being organized in Istanbul, Turkey on 21st May, 2012 under the LREC platform. India has a huge linguistic diversity and has seen concerted efforts from the Indian government and industry towards developing language resources. The European Language Resources Association (ELRA) and its associate organizations have been very active and successful in addressing the challenges and opportunities related to language resource creation and evaluation. It is therefore a great opportunity for resource creators of Indian languages to showcase their work on this platform and also to interact and learn from those involved in similar initiatives all over the world. The broader objectives of the WILDRE will be - To map the status of Indian Language Resources - To investigate challenges related to creating and sharing various levels of language resources - To promote a dialogue between language resource developers and users - To provide opportunity for researchers from India to collaborate with researchers from other parts of the world DATES February 12, 2012 Paper submissions due March 18, 2012 Paper notification of acceptance March 30, 2012 Camera-ready papers due May 21, 2012 Workshop (morning session) SUBMISSIONS Papers must describe original, completed or in progress, and unpublished work. Each submission will be reviewed by two program committee members. Accepted papers will be given up to 10 pages (for full papers) or 5 pages (for short papers and posters) in the workshop proceedings, and will be presented as an oral presentation or poster. Papers should be formatted according to the style-sheet, which will be provided on the LREC 2012 website (http://www.lrec-conf.org/lrec2012/).
Please submit papers in PDF/doc format to: https://www.softconf.com/lrec2012/WILDRE2012/ We are seeking submissions under the following categories - Full papers (10 pages) - Short papers (work in progress, 5 pages) - Posters (innovative ideas/proposals, research proposal of students) - Demo (of working online/standalone systems) Though our area of interest covers all NLP/language technology related activity for Indian languages, we would like to focus on the resource creation in the following areas- - Text corpora - Speech corpora - Lexicons and Machine-readable dictionaries - Ontologies - Grammars - Annotation of corpora - Language resources for basic NLP, IR and Speech Technology tasks, tools and - Infrastructure for constructing and sharing language resources - Standards or specifications for language resources applications - Licensing and copyright issues Both submission and review processes will be handled electronically using the START interface of the LREC website. The workshop website will provide the submission guidelines and the link for the electronic submission. When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012 . Authors will also be asked to contribute to the Language Library, the new initiative of LREC2012 Conference Chairs - Girish Nath Jha, Jawaharlal Nehru University, New Delhi - Kalika Bali, Microsoft Research India Lab, Bangalore - Sobha L, AU-KBC Research Centre, Anna University, Chennai Program Committee 1. A. Kumaran, MSRI, Bangalore 2. A G Ramakrishnan, I.I.Sc Bangalore 3. Amba Kulkarni, University of Hyderabad 4. Chris Cieri, LDC, University of Pennsylvania 5. Dafydd Gibbon, Universität Bielefeld, Germany 6. Dipti Mishra Sharma, IIIT, Hyderabad 7.
Girish Nath Jha, Jawaharlal Nehru University, New Delhi 8. Hema Murthy, IIT, Chennai 9. Joseph Mariani, LIMSI-CNRS, France 10. Kalika Bali, MSRI, Bangalore 11. Khalid Choukri, ELRA, France 12. L Ramamoorthy, LDC-IL, CIIL, Mysore 13. Monojit Choudhary, MSRI Bangalore 14. Nicoletta Calzolari, ILC-CNR, Pisa, Italy 15. Niladri Shekhar Dash, ISI Kolkata 16. Shivaji Bandhopadhyay, Jadavpur University, Kolkata 17. Shyamal Das Mondal, IIT Kharagpur 18. Sobha L, AU-KBC Research Centre, Anna University 19. Soma Paul, IIIT, Hyderabad 20. Umamaheshwar Rao, University of Hyderabad Workshop contact: diwakar.mishra at gmail.com Diwakar Mishra, Special Center for Sanskrit Studies, Jawaharlal Nehru University, New Delhi From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:12:19 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:12:19 +0100 Subject: Appel: 1CfP LRE-Rel: Language Resources and Evaluation for Religious Texts, LREC 2012 Workshops Message-ID: Date: Wed, 28 Dec 2011 11:49:40 +0100 From: ELRA ELDA Information Message-ID: <4EFAF444.7080701 at elda.org> X-url: http://www.comp.leeds.ac.uk/arabic/lre-rel.html [apologies for multiple postings] LRE-Rel: Language Resources and Evaluation for Religious Texts LREC 2012 pre-conference workshop 22 May 2012 (morning) Lütfi Kirdar Istanbul Exhibition and Congress Centre http://www.comp.leeds.ac.uk/arabic/lre-rel.html FIRST CALL FOR PAPERS We invite submissions for the first workshop on:
Language Resources and Evaluation for Religious Texts. The focus of this workshop is the application of computer-supported and Text Analytics techniques to religious texts ranging from: the faith-defining religious canon; authoritative interpretations and commentary; sermons; liturgy; prayers; poetry; and lyrics. We see this as an inclusive and cross-disciplinary topic, and the workshop aims to bring together researchers with a generic interest in religious texts to raise awareness of different perspectives and practices, and to identify some common themes. We therefore welcome submissions on a range of topics, including but not limited to: - analysis of ceremonial, liturgical, and ritual speech; recitation styles; speech decorum; discourse analysis for religious texts; - formulaic language and multi-word expressions in religious texts; - suitability of modal and other logic types for knowledge representation and inference in religious texts; - issues in, and evaluation of, machine translation in religious texts; - text-mining, stylometry, and authorship attribution for religious texts; - corpus query languages and tools for exploring religious corpora; - dictionaries, thesauri, Wordnet, and ontologies for religious texts; - measuring semantic relatedness between multiple religious texts; - (new) corpora and rich and novel annotation schemes for religious texts; - annotation and analysis of religious metaphor; - genre analysis for religious texts; - application in other disciplines (e.g. theology, classics, philosophy, literature) of computer-supported methods for analysing religious texts.
Important Dates 13.02.2012 Deadline for paper submissions 27.02.2012 Notification of acceptance 09.03.2012 Camera-ready copies due 22.05.2012 Workshop Submissions Papers should follow the LREC main conference formatting details (to be announced on the conference website: http://www.lrec-conf.org/lrec2012/) and should be submitted either as a Word document or a .pdf file (preferably both) via the START conference manager for our workshop: https://www.softconf.com/lrec2012/LRE-Rel2012/ Papers will be reviewed by three members of the Programme Committee. Accepted papers will be published in the workshop proceedings, and possibly via special issue of an appropriate journal. This year, when submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. For further information on this new initiative, please refer to: http://www.lrec-conf.org/lrec2012/LRE-Map-2012 Workshop Organisers and Co-chairs, with contact persons identified via email address Abdul Malik Al-Salman: King Saud University, Saudi Arabia Eric Atwell: University of Leeds, UK Claire Brierley: University of Leeds, UK scscb at leeds.ac.uk Azzeddine Mazroui: Mohammed First University, Morocco Majdi Sawalha: University of Jordan sawalha.majdi at gmail.com Abdul-Baquee M. 
Sharaf: University of Leeds, UK Bayan Abu Shawar: Arab Open University, Jordan Programme Committee Nawal Alhelwal: Arabic Department, Princess Nora bint Abdulrahman University, Saudi Arabia Qasem Al-Radaideh: Computer Information Systems, Yarmouk University, Jordan Abdul Malik Al-Salman: Computer and Information Sciences, King Saud University, Saudi Arabia Eric Atwell: School of Computing, University of Leeds, UK Amna Basharat: Foundation for Advancement of Science and Technology, FAST-NU, Pakistan James Dickins: Arabic and Middle Eastern Studies, University of Leeds, UK Kais Dukes: School of Computing, University of Leeds, UK Mahmoud El-Haj: Computer Science and Electronic Engineering, University of Essex, UK Nizar Habash: Center for Computational Learning Systems, Columbia University, US Salwa Hamada: Electronics Research Institute, Egypt Bassam Hasan Hammo: Information Systems, King Saud University, Saudi Arabia Dag Haug: Philosophy, Classics, History of Art and Ideas, University of Oslo, Norway Moshe Koppel: Department of Computer Science, Bar-Ilan University, Israel Rohana Mahmud: Computer Science and Information Technology, University of Malaya, Malaysia Azzeddine Mazroui: Mathematics and Computer Science, Mohammed 1st University, Morocco Tony McEnery: English Language and Linguistics, University of Lancaster, UK Aida Mustapha: Computer Science and Information Technology, University of Putra, Malaysia Mohamadou Nassourou: Computer Philology and Modern German Literature, University of Würzburg, Germany Nils Reiter: Department of Computational Linguistics, Heidelberg University, Germany Abdul-Baquee M.
Sharaf: School of Computing, University of Leeds, UK Bayan Abu Shawar: Information Technology and Computing, Arab Open University, Jordan Andrew Wilson: Linguistics and English Language, University of Lancaster, UK Nagwa Younis: English Department, Ain Shams University, Egypt Wajdi Zaghouani: Linguistic Data Consortium, University of Pennsylvania, US Further Information If you have questions, please consult the workshop website at: http://www.comp.leeds.ac.uk/arabic/lre-rel.html Alternatively, contact: Claire Brierley scscb at leeds.ac.uk or Majdi Sawalha sawalha.majdi at gmail.com ------------------------------------------------------------------------- Message diffuse par la liste Langage Naturel Informations, abonnement : http://www.atala.org/article.php3?id_article=48 English version : Archives : http://listserv.linguistlist.org/archives/ln.html http://liste.cines.fr/info/ln La liste LN est parrainee par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhesion : http://www.atala.org/ ------------------------------------------------------------------------- From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:14:16 2011 From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon) Date: Thu, 29 Dec 2011 11:14:16 +0100 Subject: Appel: 1st CfP Workshop on Building and Using Comparable Corpora, LREC 2012 Workshops Message-ID: Date: Wed, 28 Dec 2011 12:17:57 +0100 From: ELRA ELDA Information Message-ID: <4EFAFAE5.6020903 at elda.org> X-url: http://hnk.ffzg.hr/5bucc2012 [Apologies for multiple postings] ============================================================ 5th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA Language Resources for Machine Translation in Less-Resourced Languages and Domains Co-located with LREC 2012 L?tfi Kirdar Istanbul Exhibition and Congress Centre Saturday, 26 May 2012 DEADLINE FOR PAPERS: 15 February 2012 http://hnk.ffzg.hr/5bucc2012 Endorsed by * ACL SIGWAC (Special Interest Group on Web as Corpus) * FLaReNet 
  (Fostering Language Resources Network)
============================================================

MOTIVATION

In the language engineering and the linguistics communities, research in comparable corpora has been motivated by two main reasons. In language engineering, it is chiefly motivated by the need to use comparable corpora as training data for statistical NLP applications such as statistical machine translation or cross-lingual retrieval. In linguistics, on the other hand, comparable corpora are of interest in themselves by making possible inter-linguistic discoveries and comparisons. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form in various degrees and dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP. As such, it is of great interest to bring together builders and users of such corpora.

The scarcity of parallel corpora has motivated research concerning the use of comparable corpora: pairs of monolingual corpora selected according to the same set of criteria, but in different languages or language varieties. Non-parallel yet comparable corpora overcome the two limitations of parallel corpora, since sources for original, monolingual texts are much more abundant than translated texts. However, because of their nature, mining translations in comparable corpora is much more challenging than in parallel corpora. What constitutes a good comparable corpus, for a given task or per se, also requires specific attention: while the definition of a parallel corpus is fairly straightforward, building a non-parallel corpus requires control over the selection of source texts in both languages.

Parallel corpora are a key resource as training data for statistical machine translation, and for building or extending bilingual lexicons and terminologies. However, beyond a few language pairs such as English-French or English-Chinese and a few contexts such as parliamentary debates or legal texts, they remain a scarce resource, despite the creation of automated methods to collect parallel corpora from the Web.

To exemplify such issues in a practical setting, this year's special focus will be on Language Resources for Machine Translation in Less-Resourced Languages and Domains with the aim of overcoming the shortage of parallel resources when building MT systems for less-resourced languages and domains, particularly by usage of comparable corpora for finding parallel data within and by reaching out for "hidden" parallel data. Lack of sufficient language resources for many language pairs and domains is currently one of the major obstacles in further advancement of machine translation.

TOPICS

We solicit contributions including but not limited to the following topics:

Topics related to the special theme:
* comparable corpora use in MT
* comparable corpora processing tools/kits for MT
* parallel corpora usage
* parallel corpora processing tools/platforms
* MT for less-resourced languages
* MT for less-resourced domains
* open source SMT systems (Moses, etc.)
* publicly available SMT

Building Comparable Corpora:
* Human translations
* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the Web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages
* Across language families
* Multi-media/multi-modal comparable corpora

Applications of comparable corpora:
* Human translations
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual projections
* Machine translation
* Writing assistance

Mining from Comparable Corpora:
* Extraction of parallel segments or paraphrases from comparable corpora
* Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc.

IMPORTANT DATES (TENTATIVE)

15 February 2012  Deadline for submission of full papers
10 March 2012     Notification of acceptance
20 March 2012     Camera-ready papers due
26 May 2012       Workshop date

SUBMISSION INFORMATION

Papers should follow the LREC main conference formatting details (to be announced on the conference website http://www.lrec-conf.org/lrec2012/) and should be submitted as a PDF file of no more than ten pages via the START workshop manager: https://www.softconf.com/lrec2012/BUCC2012/

Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings.

Double submission policy: Parallel submission to other meetings or publications is possible, but the workshop organizers must be notified immediately.

When submitting a paper through the START page, authors will be asked to provide information about the resources that have been used for the work described in their paper or are an outcome of their research. For details on this initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012.
Authors will also be asked to contribute to the Language Library, the new initiative of LREC 2012.

For further information, please contact
Reinhard Rapp reinhardrapp (at) gmx (dot) de
or Marko Tadic marko.tadic (at) ffzg (dot) hr

ORGANISERS

Reinhard Rapp, Universities of Mainz (Germany) and Leeds (UK)
Marko Tadic, University of Zagreb (Croatia)
Serge Sharoff, University of Leeds (UK)
Andrejs Vasiljevs, Tilde SIA, Riga, Latvia
Pierre Zweigenbaum, LIMSI, CNRS, Orsay, and ERTIM, INALCO, Paris (France)

SCIENTIFIC COMMITTEE

* Srinivas Bangalore (AT&T Labs, USA)
* Caroline Barrière (National Research Council Canada)
* Chris Biemann (Microsoft / Powerset, San Francisco, USA)
* Lynne Bowker (University of Ottawa, Canada)
* Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
* Andreas Eisele (DFKI, Saarbrücken, Germany)
* Rob Gaizauskas (University of Sheffield, UK)
* Éric Gaussier (Université Joseph Fourier, Grenoble, France)
* Nikos Glaros (ILSP, Athens, Greece)
* Gregory Grefenstette (Exalead/Dassault Systemes, Paris, France)
* Silvia Hansen-Schirra (University of Mainz, Germany)
* Kyo Kageura (University of Tokyo, Japan)
* Adam Kilgarriff (Lexical Computing Ltd, UK)
* Natalie Kübler (Université Paris Diderot, France)
* Philippe Langlais (Université de Montréal, Canada)
* Tony McEnery (Lancaster University, UK)
* Emmanuel Morin (Université de Nantes, France)
* Dragos Stefan Munteanu (Language Weaver Inc., USA)
* Lene Offersgaard (University of Copenhagen, Denmark)
* Reinhard Rapp (Universities of Mainz, Germany, and Leeds, UK)
* Sujith Ravi (Yahoo! Research, Santa Clara, CA, USA)
* Serge Sharoff (University of Leeds, UK)
* Michel Simard (National Research Council Canada)
* Inguna Skadina (Tilde, Riga, Latvia)
* Monique Slodzian (INALCO, Paris, France)
* Benjamin Tsou (The Hong Kong Institute of Education, China)
* Dan Tufis (Romanian Academy, Bucharest, Romania)
* Justin Washtell (University of Leeds, UK)
* Oliver Wilson (University of Edinburgh, UK)
* Michael Zock (LIF, CNRS Marseille, France)
* Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:17:05 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:17:05 +0100
Subject: Appel: 1CfP Joint ISA-7, SRSL-3, and I2MRT, LREC 2012 Workshops
Message-ID:

Date: Wed, 28 Dec 2011 12:35:59 +0100
From: ELRA ELDA Information
Message-ID: <4EFAFF1F.7090202 at elda.org>

[Apologies for multiple postings]

CALL FOR PAPERS

Joint ISA-7, SRSL-3, and I2MRT LREC 2012 Workshop on
Semantic Annotation and the Integration and Interoperability of Multimodal Resources and Tools

Istanbul, 26-27 May 2012.

BACKGROUND

Three initiatives join forces in this workshop, which is concerned with issues in semantic annotation for language resources, especially in relation to spoken and multimodal language data, and with the interoperability and integration of multimodal resources and tools.

ISA-7 is the Seventh Workshop on Interoperable Semantic Annotation, and forms part of a series of workshops of ISO TC 37/SC 4 (Language Resources) jointly with ACL SIGSEM (Computational Semantics). These workshops bring together experts in the annotation of semantic information as expressed in text, speech, gestures, graphics, video, images, and in multiple modalities combined. Examples of semantic annotation include the markup of events, time, space, dialogue acts, discourse relations, and semantic roles, for which the ISO organization pursues the establishment of annotation standards, in order to support the creation of interoperable semantic resources.

SRSL-3 is the Third Workshop on Semantic Representation of Spoken Language in Speech and Multimodal Corpora. These workshops convene researchers who work on speech and multimodal resources for the semantic annotation of related corpora. They take their inspiration from the observation that the semantic gap between the content conveyed by speech and other modalities and their formal representation is a burning issue in a range of tasks such as content mining, information extraction, dialogue processing, interactive story-telling, assisted health care, and human-robot interaction.

I2MRT (Integration and Interoperability for Multimodal Resources and Tools) is an initiative to address infrastructure aspects of the creation and use of interoperable multimodal resources. The main objectives of I2MRT are to create awareness of the need to make multimodal data visible via standardized methods and accessible via registered data centers; to discuss possibilities of harmonization and standardization with respect to multimodal annotation schemes and possible mappings between encoding schemes; to discuss ways to make cutting-edge technologies available to multimodality researchers that can currently only be used in specialized labs; and to build a community that is committed to working further on these issues.

PAPER SUBMISSION

Papers are to be submitted electronically using the START conference management page of LREC 2012 for this workshop: https://www.softconf.com/lrec2012/SAIIMRT2012/

When submitting a paper to the START page, authors will be asked to provide essential information about resources (in a broad sense, including also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or that are a new result of their research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2012/?LRE-Map-2012

This 1.5-day workshop has two tracks, one related to interoperable semantic annotation, and one related to multimodal resources and tools. For each track, research papers are invited describing original, unpublished research. Research papers can be either long (6-8 pages) or short (3-4 pages). Short papers are also invited describing ongoing or proposed projects or infrastructure-related activities.

Topics for the tracks include, but are not limited to, the following:

Track "Interoperable Semantic Annotation":
* methodological aspects of semantic annotation
* semantic annotation and semantic interpretation
* the semantics of semantic annotations
* interoperability of spoken and written corpora
* content identification and segmentation in spontaneous speech
* semantic annotation in dialogue, with a focus on multimodality
* segmentation of multimodal interaction data into semantic units
* issues in semantic annotation in specific domains, such as time and space; relations in discourse and dialogue; semantic roles and predicate-argument structures
* developments in recent, current, and planned ISO projects on developing semantic annotation standards.

Track "Integration and Interoperability for Multimodal Resources and Tools":
* encoding systems in use in multimodal applications
* metadata concepts to describe multimedia/multimodal resources
* attempts to harmonize encoding schemes and to map between them
* cutting-edge multimedia processing technology that may help multimodality researchers to speed up annotation work
* relevance and good examples of usability solutions to efficiently deal with the inherently imperfect results of any recognition technology
* possibilities to create strong data hubs with attention for legal and ethical aspects of transferring multimedia/multimodal data
* approaches to the creation of a service-oriented architecture in this field.

IMPORTANT DATES:

deadline for submitting papers: February 27
notification of acceptance: March 15
camera-ready papers: March 29
workshop dates: May 26 (full day) and May 27 (morning session)

ORGANIZING COMMITTEE:

Harry Bunt (co-chair)
Manuel Alcántara-Pla (co-chair)
Peter Wittenburg (co-chair)
Thierry Declerck
Dafydd Gibbon
Nancy Ide
Steven Krauwer
Kiyong Lee
Lorenza Mondada
James Pustejovsky
Laurent Romary
Oliver Schreer

PROGRAMME COMMITTEE:

Jan Alexandersson
Stefan Baumann
Jonas Beskow
Paul Buitelaar
Harry Bunt (co-chair)
Thierry Declerck
Alex Fang
Raquel Fernandez Rovira
Annette Frank
Dafydd Gibbon
Koiti Hasida
Nancy Ide
Michael Kipp
Kiyong Lee
Inderjeet Mani
Jean-Claude Martin
Lorenza Mondada
Martha Palmer
Volha Petukhova
Andrei Popescu-Belis
Rashmi Prasad
James Pustejovsky
Laurent Romary
Oliver Schreer
Mark Steedman
Mariet Theune
Isabel Trancoso

MORE INFORMATION:

For more information please contact any one of the three workshop co-chairs:
Harry Bunt, harry.bunt at uvt.nl
Manuel Alcántara Pla, manuel.alcantara at uam.es
Peter Wittenburg, peter.wittenburg at mpi.nl

From thierry.hamon at UNIV-PARIS13.FR Thu Dec 29 10:19:47 2011
From: thierry.hamon at UNIV-PARIS13.FR (Thierry Hamon)
Date: Thu, 29 Dec 2011 11:19:47 +0100
Subject: Revue: Langages, n183 (3/2011)
Message-ID:

Date: Wed, 28 Dec 2011 15:23:45 +0100
From: "Catherine SCHNEDECKER"
Message-ID:
X-url: http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&ida

Langages, n° 183 (3/2011)
Néologie
September 2011
MEJRI Salah, SABLAYROLLES Jean-François (eds.)

MEJRI Salah, SABLAYROLLES Jean-François : Présentation : Néologie, nouveaux modèles théoriques et NTIC
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8720

MORTUREUX Marie-Françoise : La néologie lexicale : de l'impasse à l'ouverture
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8719

MEJRI Salah : Néologie et unité lexicale : renouvellement théorique, polylexicalité et emploi
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8718

SABLAYROLLES Jean-François : De la « néologie syntaxique » à la néologie combinatoire
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8717

HUMBLEY John : Vers une méthode de terminologie rétrospective
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8716

DUCOS Joëlle, SALVADOR Xavier-Laurent : Pour un dictionnaire de français scientifique médiéval : le projet Crealscience
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8715

LÉTURGIE Arnaud : À propos de l'amalgamation lexicale en français
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8714

ISSAC Fabrice : Cybernéologie : quelques outils informatiques pour l'identification et le traitement des néologismes sur le web
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8713

CARTIER Emmanuel : Néologie et description linguistique pour le TAL
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8712

ANASTASSIADIS-SYMÉONIDIS Anna, NIKOLAOU Georgia : L'adaptation morphologique des emprunts néologiques : en quoi est-elle précieuse ?
http://www.armand-colin.com/revues_article_info.php?idr=20&idnum=397692&idart=8711