From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 6 09:23:34 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 6 Jan 2004 10:23:34 +0100
Subject: Appel: 7th INTEX/NooJ Workshop
Message-ID:

7th INTEX/NooJ Workshop
Tours, June 7-9 2004
Call for papers - Deadline: March 31, 2004

ORGANIZERS

* Laboratoire d'Informatique de l'Université de Tours (E.A. 2101)
* Langues et Représentation, Equipe de recherche en linguistique, Université de Tours
* LAboratoire de SEmioLinguistique, Didactique et Informatique (E.A. 2281)

CALL

We invite the submission of papers for the forthcoming seventh INTEX/NooJ workshop, to be held in Tours, June 7-9 2004.

INTEX is a linguistic development environment that includes large-coverage dictionaries and grammars, and parses texts of several million words in real time. INTEX includes tools to create and maintain large-coverage lexical resources, as well as morphological and syntactic grammars. Dictionaries and grammars are applied to texts in order to locate morphological, lexical and syntactic patterns, remove ambiguities, and tag simple and compound words. INTEX can build lemmatized concordances of large texts from Finite-State or Context-Free grammars, and can accordingly perform transformation operations on texts in cascade, in order to annotate the text, or to generate paraphrases; these features, when applied in cascade, give INTEX the power of a Turing Machine. INTEX is used as a linguistic platform, an information retrieval system, to teach second languages, as a terminological extractor, as well as to teach computational linguistics to students.

NooJ, which uses a new technology, a new linguistic engine and a new interface, is meant to replace INTEX. NooJ's architecture was presented in the 5th INTEX Workshop (Marseille, June 2002) and its first alpha version was demoed at the 6th INTEX Workshop (Sofia, May 27-29 2003).
As in the previous workshops (1996, 1999, 2000, 2001, 2002 and 2003), this meeting will be an opportunity for INTEX and NooJ users, as well as other researchers interested in NLP, to meet and exchange their experience in development, research or teaching. It will also be an occasion to present recent developments in NooJ.

Please send a one-page abstract to Denis Maurel by email before March 31, 2004. The abstract, in French or English, should contain the title of the article and the name, affiliation, postal address and email address of each author.

All papers will be reviewed by the program committee. Authors will be notified whether their papers are accepted or rejected by April 15, 2004. Each presentation is allotted 30 minutes, including 5 minutes for discussion.

After the conference, authors will be invited to send a definitive version of their papers for publication. We are planning to combine a subset of the proceedings of the 6th and the 7th INTEX Workshops in a published volume.
PROGRAM COMMITTEE

* Xavier Blanco (Universidad Autonoma de Barcelona, Spain)
* Gisèle Chevalier (Université de Moncton, Canada)
* Ibekwe-SanJuan Fidelia (Université de Lyon 3, France)
* Nathalie Friburger (LI, Université de Tours, France)
* Svetla Koeva (BACL, IBL - BAS, Sofia, Bulgaria)
* Stoyan Mihov (BACL, CLPP - BAS, Sofia, Bulgaria)
* Denis Maurel (LI, Université de Tours, France)
* Paul Sabatier (LIM, CNRS, Marseille, France)
* Agata Savary (LI, Université de Tours, France)
* Henrik Selsoe Sorensen (Copenhagen Business School, Denmark)
* Max Silberztein (LASELDI, Université de Franche-Comté, France)
* Tamas Varadi (Hungarian Academy of Sciences, Hungary)
* Dusko Vitas (MATF, University of Belgrade, Serbia)

DEADLINES

Submission due date: March 31, 2004
Notification date: April 15, 2004
Registration: May 1, 2004
Camera ready date: June 30, 2004

NooJ TUTORIALS by Max Silberztein

* Initiation Tutorial, 20 persons maximum
* Teaching Linguistics with NooJ, 20 persons maximum

REGISTRATION FEE

The registration fee for the workshop is 30 euros for researchers, 15 euros for students and 40 euros for other categories.

The conference will begin on Monday morning and last until Wednesday evening. During the conference there will be a reception on Monday evening and an optional excursion on Tuesday afternoon.
CONTACT

denis.maurel at univ-tours.f
max.silberztein at univ-fcomte.fr
Web site: http://tln.li.univ-tours.fr/JIntex2004/Index.html

-------------------------------------------------------------------------
Message distributed by the Langage Naturel list
Information, subscription: http://www.biomath.jussieu.fr/LN/LN-F/
English version: http://www.biomath.jussieu.fr/LN/LN/
Archives: http://listserv.linguistlist.org/archives/ln.html
The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues)
Information and membership: http://www.atala.org/
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 6 09:23:41 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 6 Jan 2004 10:23:41 +0100
Subject: Appel: TeL'04 - Technology Enhanced Learning
Message-ID:

------------CFP Tel04 --------------------------------------
"Raising the issues on School e-laboratories, utilizing novel pedagogical and evaluation theories"
22 August 2004
Toulouse, France
http://tel04.systema.gr/

TeL'04 - Technology Enhanced Learning
---------------------

Scope and organization:

Technology Enhanced Learning (TeL) has provided tools and infrastructure to education and training disciplines for over a decade. Related issues are as varied as pedagogical and evaluation theories, integrated learning environments, experiments, trials and results from R&TD deployment. Relying on recent experiences and promising results from R&TD projects, in particular EU endorsed initiatives (e.g. IST Projects: Lab at Future, Laboratory of Tomorrow, Mobilearn), the workshop will give educational institutions, experts, practitioners and technologists an opportunity to share their experience and possibly come up with a consensus on open issues.

TeL'04 is a one-day workshop co-located with the WCC'2004 conference.
It will take place alongside other WCC events in the Congress Center (downtown Toulouse). The program will include refereed papers and invited talks by distinguished researchers and practitioners. The proceedings will be published by Kluwer, the official publisher of IFIP conferences.

Topics of interest

This workshop will follow the trend of most international conferences relating to learning technologies today. Nevertheless, its distinguishing factor will be the identification of the enabling parameters to "leverage the promotion of key initiatives in putting the grassroots for TeL, especially for school e-laboratories utilizing novel pedagogical and evaluation theories".

Papers are solicited in the following areas:

· E-Learning
· Mobile learning
· Mixed and augmented reality in training
· Technologies in the school of tomorrow
· The learning citizen
· Collaborative learning
· Applying pedagogical theories
· The evaluation process of learning applications
· Shared virtual environments for learning
· Learning management systems
· Combining individualised with collaborative learning
· Applying and using eLearning standards
· Open learning environments
· Learning for All
· Technologies for science education
· Technologies for arts and humanities education

Accepted papers will appear in a book published by Kluwer. Please refer to the Call for Papers page for details on paper submission.
Important Dates:

February 20, 2004: Submission of short and full papers (firm deadline)
March 20, 2004: Notification of acceptance
April 20, 2004: Camera-ready papers

For more details and submission visit: http://tel04.systema.gr/

-----------------------------
-- Best wishes --
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 6 09:23:44 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 6 Jan 2004 10:23:44 +0100
Subject: Appel: TALN 2004
Message-ID:

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fez (Morocco)
April 19 - 22, 2004
**********************************************************************
(see English version below)

TALN'04

--- In conjunction with JEP'04 ---

CALL FOR PAPERS

IMPORTANT DATES

Submission deadline: January 15, 2004
Notification to authors: February 20, 2004
Final version (camera-ready): March 8, 2004
Conference: April 19-22, 2004

Jointly organized by the LPL (Laboratoire Parole et Langage, Aix-en-Provence, France), the University of Fez and the Ecole Normale Supérieure of Fez, the 11th edition of the conference on Natural Language Processing (Traitement Automatique des Langues Naturelles, TALN'04) will be held April 19-22, 2004, at the Palais des Congrès in Fez, Morocco. The conference will include oral and poster presentations, invited talks, workshops and tutorials.
The official languages of the conference are French and English.

TALN 2004 is organized under the aegis of ATALA (Association pour le Traitement Automatique des LAngues) and will be held jointly with the conference for young researchers RECITAL'04 (call for papers to be issued separately).

As in 2002, TALN will be organized jointly with the Journées d'Etude sur la Parole (JEP'04). Joint sessions will be organized, and participants will receive the proceedings of both conferences on CD-ROM.

TOPICS

Papers, for talks of thirty minutes including questions, may address any of the usual TALN topics, including but not limited to:

lexicon
morphology
syntax
semantics
pragmatics
discourse
parsing
generation
summarization
dialogue
machine translation
logical, symbolic and statistical approaches

Given the pairing of TALN with JEP and the location of the conference, TALN'04 encourages submissions in the following areas:

. techniques for processing speech and written language
. processing of Arabic

The program committee will select two of the accepted papers for publication (in an extended version) in the journal Traitement Automatique des Langues (t.a.l.). The journal will treat these articles as "accepted subject to modification", the modification being conversion to the journal's format.

SELECTION CRITERIA
---------------------

Authors are invited to submit original research work that has not been previously published. Submissions will be reviewed by at least two specialists in the field.
The following will be considered in particular:

- the importance and originality of the contribution,
- the correctness of the scientific and technical content,
- the critical discussion of the results, in particular in relation to other work in the field,
- the positioning of the work within international research,
- the organization and clarity of the presentation,
- the relevance to the conference topics.

Selected papers will be published in the conference proceedings.

SUBMISSION PROCEDURE
-----------------------

Submitted papers must not exceed 10 pages in Times 12, single-spaced, i.e. about 3000 words, including figures, examples and references. Proposals for demonstrations or posters must not exceed 6 pages.

A LaTeX style file and a Word template are available on the conference web site: http://www.lpl.univ-aix.fr/jep-taln04/

Papers must reach the organizing committee before January 15, 2004, via the online submission form at the following address: http://www.lpl.univ-aix.fr/jep-taln04/

One of the following formats MUST be used:

- PDF, RTF (Word)

Versions must be in A4 format.

If sending by email is impossible, a paper submission may be accepted.
Three paper copies of the contribution must be sent to the following address:

Philippe Blache - TALN 2004
LPL, Université de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr

PRACTICAL INFORMATION
----------------------

Practical information will be provided later, in particular on the conference web site: http://www.lpl.univ-aix.fr/jep-taln04/

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fez (Morocco)
April 19 - 22, 2004
**********************************************************************

--- In conjunction with JEP'04 ---

CALL FOR PAPERS

Important Dates
---------------

Submission deadline: 15 January 2004
Notification to authors: 20 February 2004
Camera-ready: 8 March 2004
Conference: 19-22 April 2004

Jointly organized by the LPL (Laboratoire Parole et Langage, Aix-en-Provence, France), the University of Fez and the Ecole Normale Supérieure of Fez, the 11th Conference on Natural Language Processing (TALN'04) will be held at the Palais des Congrès, Fez, Morocco, April 19-22, 2004. The conference will include oral and poster presentations, invited talks, workshops and tutorials.

Official languages are French and English.

TALN'04 is organized under the aegis of ATALA (Association pour le Traitement Automatique des Langues, Association for NLP) and will be held jointly with JEP'04 (Journées d'Etude sur la Parole) and the RECITAL'04 conference for young researchers (call for papers to be issued separately). Joint sessions will be organized. The participants will receive the proceedings of the conferences on CD-ROM.

TOPICS
------

Papers are invited for thirty-minute talks, including questions, in all areas of NLP, including (but not restricted to):

. lexicon, morphology, syntax, semantics
. pragmatics, discourse, parsing, text generation
.
abstraction/summarization, dialogue, machine translation
. logical, symbolic and statistical approaches

Moreover, TALN'04 encourages submissions in the following fields:

. techniques for speech and language processing
. applications for Arabic language

All selected papers will be published in the proceedings. In addition, the programme committee will select two papers, extended versions of which will be published in the journal "Traitement Automatique des Langues" (T.A.L.).

SELECTION
---------

Authors are invited to submit original, previously unpublished research work. Submissions will be reviewed by at least two specialists in the field. Decisions will be based on the following criteria:

. importance and originality of the paper
. soundness of the scientific and technical content
. comparison of the results obtained with other relevant work
. clarity of the exposition
. relevance to the topics of the conference

Accepted papers will be published in the proceedings.

SUBMISSION PROCEDURE
--------------------

Submitted papers must not exceed ten pages, in Times 12, single spaced (about 3000 words), including figures, examples and references. Posters or demo papers should not exceed 6 pages.

A LaTeX style file and a Word template are available on the web site of the conference: http://www.lpl.univ-aix.fr/jep-taln04/

Papers are to be submitted before January 15, 2004 through the online submission procedure available on the web site: http://www.lpl.univ-aix.fr/jep-taln04/

Papers MUST be sent in PDF. In particular cases, we may accept submissions in RTF (Word) format.

IMPORTANT: All the PostScript versions must be in A4 format, and not US Letter.

If electronic submission is impossible, we will accept a printed version of the submission.
In this case, three hard copies of the paper must be received by January 15, 2004 at:

Philippe Blache - TALN 2004
LPL, Université de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr

PRACTICAL INFORMATION
---------------------

Practical information will be detailed shortly on the conference web site (http://www.lpl.univ-aix.fr/jep-taln04/) and in a further call.

Please note that members of the ATALA association will benefit from reduced registration fees.

-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Thu Jan 8 17:04:13 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Thu, 8 Jan 2004 18:04:13 +0100
Subject: Appel: MEMURA-2004 : Workshop on Methodologies and Evaluation of Multiword Units ...
Message-ID:

********************* CALL FOR PAPERS *********************

MEMURA-2004
Workshop on Methodologies and Evaluation of Multiword Units in Real-world Applications (MEMURA Workshop)

INVITED SPEAKER: KENNETH W.
CHURCH

In association with the 4th International Conference On Language Resources and Evaluation - LREC 2004
Centro Cultural de Belém, Lisbon, Portugal
May 25, 2004
http://memura2004.di.ubi.pt

********************* CALL FOR PAPERS *********************

This announcement contains:

[1] Workshop Description
[2] Target Audience
[3] Areas of Interest
[4] Invited Speaker
[5] Important dates
[6] Abstract Submission
[7] Workshop Chairs
[8] Program Committee
[9] Contact

-------------------------------------------------------------------------
[1] Workshop Description:
-------------------------------------------------------------------------

Multiword units (MWUs) include a large range of linguistic phenomena, such as phrasal verbs (e.g. "look forward"), nominal compounds (e.g. "interior designer"), named entities (e.g. "United Nations"), set phrases (e.g. "con carne") or compound adverbs (e.g. "by the way"), and they can be syntactically and/or semantically idiosyncratic in nature. MWUs are used frequently in everyday language, usually to express precisely those ideas and concepts that cannot be compressed into a single word. A considerable amount of research has been devoted to this subject, both in terms of theory and practice, but despite increasing interest in idiomaticity within linguistic research, many questions still remain unanswered. The objective of this workshop is to deal with three important questions that are of great interest for real-world applications.

1) Comparison of MWU extraction methodologies

Many methodologies have been proposed in order to automatically extract or identify MWUs. However, little effort has been devoted to comparing their results. The core differences between the methodologies are certainly the main reason why such work is so rare. For instance, it is not easy to compare language-dependent methodologies, as the results depend on the efficiency of parameter tuning in the broad sense of the term (i.e.
semantic tagging, local specific grammars, lemmatization, part-of-speech tagging etc.). Another important problem is that there is no real agreement among researchers about the definition of MWUs, which would provide the basis for an objective evaluation. The objective of the workshop is to gather people who have recently been working in this area so that new trends in comparing MWU extraction methodologies and their evaluation can be identified.

2) Evaluation of the benefits of the integration of MWUs in real-world applications

It is not yet clear whether MWUs really improve NLP applications. It is commonly accepted that Machine Translation is one application that takes great advantage of MWU databanks. However, does the same apply to applications in Automatic Summarization, Information Retrieval (IR), Cross-language IR, Information Extraction, Text Clustering/Classification, Parallel Corpus Alignment? Indeed, could the identification of MWUs introduce new constraints that are not present in original texts? Should MWUs be considered as units that can be analysed in terms of the meaning of their components, or should they be treated as unanalysable? Should NLP methods work both on isolated words and on aggregated MWUs? The answers are anything but clear. Here, the objective of the workshop is to identify successes and failures of the integration of MWUs in real-world applications.

3) Comparison of scalable architectures for the extraction and identification of MWUs

Real-world applications are constrained by variables like processing time and memory space. However, identifying and extracting MWUs is usually a computationally heavy process. In recent years, new algorithms and new technologies have been proposed to introduce MWU treatment in large-scale applications, thus avoiding earlier intractable implementations. Previous workshops on MWUs have mainly focused on the unconstrained extraction process.
In this workshop, we would like to focus on the comparison of different factors that can influence the scalability of the treatment of MWUs in real-world applications, namely data structures, algorithms, parallel and distributed computing, grid computing etc. Indeed, as we said earlier, some extraction strategies may not scale to deal with huge volumes of data.

-------------------------------------------------------------------------
[2] Target Audience:
-------------------------------------------------------------------------

This workshop is intended to bring together NLP researchers working on all areas of MWUs. The objective is to summarise what has been achieved in the area of MWUs in real-world applications, to establish common themes between different approaches, and to discuss future trends.

-------------------------------------------------------------------------
[3] Areas of Interest:
-------------------------------------------------------------------------

Abstracts are invited on, but not limited to, the following topics:

* Automatic, semi-automatic and manual evaluations of MWU extractors
* Resources for evaluating MWU extractors
* Evaluation Standards
* Cross-language and Cross-domain evaluations of MWU extractors
* Comparative evaluation of MWU extractors
* Evaluation of the integration of MWUs in NLP applications: Summarization, (Cross-language) Information Retrieval, Information Extraction, Machine Translation, Text Classification etc.
* Scalable algorithms, new data structures, Parallel and Distributed processing and Grid computing for MWU extraction and/or identification
* Comparative evaluation of extraction software architectures
* Role of isolated words and MWUs for a sense-based definition of MWUs

Abstracts can cover one or more of these areas.

-------------------------------------------------------------------------
[4] Invited Speaker:
-------------------------------------------------------------------------

Kenneth W.
Church (AT&T Labs Research, USA)

-------------------------------------------------------------------------
[5] Important dates:
-------------------------------------------------------------------------

Abstract submission deadline: February 23, 2004
Notification: March 15, 2004
Camera ready papers: April 12, 2004
Workshop: May 25, 2004

-------------------------------------------------------------------------
[6] Abstract Submission:
-------------------------------------------------------------------------

Abstracts should consist of about 1000 words. Abstracts should be submitted electronically, in PDF format only, to Gaël Harry Dias [ddg at di.ubi.pt]. PostScript files can be converted to PDF at http://www.ps2pdf.com/. The subject line should be "LREC 2004 MEMURA WORKSHOP PAPER SUBMISSION".

Because reviewing is blind, no author information should be included as part of the abstract (i.e. the names of the authors and references that could identify the authors). An identification page must be sent in a separate email with the subject line "LREC 2004 MEMURA WORKSHOP ID PAGE" and must include the title, author(s), keywords, word count, and the name and email of the contact author.

Late submissions will not be accepted. Notification of receipt will be emailed to the contact author shortly after receipt.
-------------------------------------------------------------------------
[7] Workshop Chairs:
-------------------------------------------------------------------------

Gaël Harry Dias (Beira Interior University, Portugal)
José Gabriel Pereira Lopes (New University of Lisbon, Portugal)
Spela Vintar (University of Ljubljana, Slovenia)

-------------------------------------------------------------------------
[8] Program Committee:
-------------------------------------------------------------------------

Timothy Baldwin (Stanford University, United States of America)
Sophia Ananiadou (University of Salford, England)
Didier Bourigault (University of Toulouse, France)
Pascale Fung (University of Science and Technology, Hong Kong)
Mikio Yamamoto (University of Tsukuba, Japan)
Dekang Lin (University of Alberta, Canada)
Aline Villavicencio (University of Cambridge, England)
Heiki Kaalep (University of Tartu, Estonia)
Joaquim da Silva (New University of Lisbon)
Eric Gaussier (Xerox Research Centre Europe, France)
Adeline Nazarenko (University Paris XIII, France)
António Branco (Lisbon University, Portugal)

-------------------------------------------------------------------------
[9] Contact:
-------------------------------------------------------------------------

Contact: Gaël Harry Dias
Human Language Technology Interest Group
Departamento de Informática
Universidade da Beira Interior
Rua Marquês d'Ávila e Bolama
6201-001 Covilhã
Portugal
email: ddg at di.ubi.pt
Tel: +351 275 319 700
Fax: +351 275 319 732
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Thu Jan 8 17:04:19 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Thu, 8 Jan 2004 18:04:19 +0100
Subject: Appel: BULAG : La correction automatique : bilan et perspectives
Message-ID:

*****************************************************************
CALL FOR PUBLICATION (English version below)
BULAG : "La correction automatique : bilan et perspectives"
http://tesniere.univ-fcomte.fr/bulag/appel.htm
*****************************************************************

Scope
-------------

Automatic correction currently seems to be an application neglected by research in Natural Language Processing. Yet, given the performance of the checkers on the market and the possible applications offered by new forms of written communication (e-mail, discussion forums, SMS text messages...), good automatic correction tools remain useful.

This volume has two aims. On the one hand, we propose to take stock of advances in the field of automatic correction. On the other hand, we want to highlight current research and prospects in this area of Natural Language Processing.

Coordination
------------

Mounira BIOUD and Séverine VIENNEY

Schedule
----------

Submission deadline: May 20, 2004
Notification to authors: June 30, 2004

Topics
------

The topics to be addressed in this Bulag include, but are not limited to:

- spelling correction
- grammar correction
- evaluation of the automatic checkers currently on the market
- limits of the field
- prospects
- specific applications

Languages
-------

Papers must be written in French or English.

Paper format
-------------------

RTF format must be used.
Papers must not exceed 5000 words. Each paper must be laid out as follows:

Title of the paper
Author's first name and SURNAME
Name of the research centre
University
City
Country
Abstract of the paper in French
Keywords in French
Abstract
Key-words
Paper
References

You can download a template in RTF format from the following address: http://tesniere.univ-fcomte.fr/ressources/blgmodfr.rtf

Submission procedure
-----------------------

Submissions should preferably be sent by email to the following addresses:

mounira.bioud at edu.univ-fcomte.fr
severine.vienney at univ-fcomte.fr

If sending by email is impossible, a postal submission will be accepted. A floppy disk and one paper copy of the contribution must be sent to the following address:

Séverine VIENNEY
Faculté des Lettres et Sciences Humaines
Centre Tesnière
30, rue Mégevand
25030 Besançon cedex
FRANCE

*****************************************************************
CALL FOR PAPERS
BULAG : "La correction automatique : bilan et perspectives"
http://tesniere.univ-fcomte.fr/bulag/appelang.htm
*****************************************************************

Scope
-----

Spelling and grammar checking and correction seem to be neglected by current Natural Language Processing research. However, considering the performance of the current checkers and the possible applications offered by new forms of written communication (e-mail, discussion fora, SMS short messages...), good tools for automatic checking and correction remain useful.

This issue of the BULAG has two objectives. Firstly, we wish to have a state-of-the-art survey of the field, and secondly, we wish to emphasise current research and prospects in this Natural Language Processing application.
Coordination
------------

Mounira BIOUD and Séverine VIENNEY

Dates
-----

Paper submission deadline: May 20, 2004
Acceptance notification: June 30, 2004

Themes
------

The themes to be addressed are:

- spelling checker
- grammar checker
- evaluation of current grammar/spelling correcting systems
- limits of the domain
- prospects
- specific applications

Languages
---------

Papers can be written in either English or French.

Format
------

RTF format should be used. The paper should be a WORD document, 5000 words maximum. Each paper should have the following form:

Title
Author
Name of research centre
University
City
Country
Résumé en français
Mots clefs
Abstract in English
Key-words
Paper
References

You can download an RTF format model from the following address: http://tesniere.univ-fcomte.fr/ressources/blgmoden.rtf

Submission
----------

Papers should be sent by email to the following two addresses:

mounira.bioud at edu.univ-fcomte.fr
severine.vienney at univ-fcomte.fr

Those without e-mail access can send a floppy disk and one printed copy of the paper to:

Séverine VIENNEY
Faculté des Lettres et Sciences Humaines
Centre Tesnière
30, rue Mégevand
25030 Besançon cedex
FRANCE

-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Thu Jan 8 17:04:22 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Thu, 8 Jan 2004 18:04:22 +0100
Subject: Publications: revue CORPUS
Message-ID:

Issue 2 of the journal "CORPUS" has just been published: it is devoted
aux distances intertextuelles (méthodes de calcul, applications diverses) et a été coordonné par X. Luong, J.P. Barthélémy et S. Mellet. Au sommaire, des articles de : M. Bécue; É. Brunet; M. Kastberg; C. et D. Labbé; D. Longrée et X. Luong; X. Luong et S. Mellet; T. Merriam; D. Valentin, S. Chollet et H. Abdi; une présentation de la thèse de J.P. Anfosso et deux comptes rendus de lecture (234 pages). Cette revue peut être commandée aux Edizioni dell'Orso (via U. Rattazzi 47, I-15100 Alessandria). ------------------------------------------------------------------------- Message diffusé par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrainée par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhésion : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:23 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Fri, 9 Jan 2004 18:20:23 +0100 Subject: Soft: Morphix-NLP Message-ID: The Morphix-NLP project may be of interest for all those teaching in NLP-related fields. As the original mirrors are located in China and slow, or available through Bittorrent (which is generally firewalled), the LIMSI-CNRS set up a mirror of the CD's iso image which should have better transfer rates for europe. Following is a quick presentation of the project. More information and links to the iso image at http://www.nlplab.cn/zhangle/morphix-nlp/ Guillaume Pitel - LIMSI-CNRS - Morphix-NLP is a Live CD Linux distribution with a rich collection of Natural Language Processing (NLP) applications. 
Though the field of NLP has undergone decades of intensive research, software designed in the NLP community is often scattered around the net and is not known by the larger computer user community. Consequently, most NLP software cannot be found in mainstream distributions even years after the first public release. The purpose of this CD is twofold:

* In the first place, it tries to break the software acquisition and installation barrier facing many researchers and students in the NLP community by providing most NLP-related software on a single Live CD.
* In the second place, the CD can be used to promote Natural Language Processing among average computer users. By simply inserting the CD into a CD drive and watching some NLP applications in action, most users will gain some knowledge of Natural Language Processing and of what NLP can do.

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:16 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 9 Jan 2004 18:20:16 +0100
Subject: Appel: R E C I T A L 2004
Message-ID:

**************************************************************
R E C I T A L 2004
REMINDER - REMINDER - REMINDER - REMINDER - REMINDER - REMINDER - REMINDER -
SUBMISSION DEADLINE = 15 JANUARY 2004
**************************************************************

R E C I T A L 2 0 0 4
Appel à Communication - Call for Papers
Rencontre des Etudiants Chercheurs en Informatique pour le Traitement Automatique des Langues
2004
19-22 avril 2004
Fès, Maroc

Date limite de soumission : 15 janvier 2004
Informations: http://www.lpl.univ-aix.fr/jep-taln04/

(English version below)

La conférence RECITAL 2004 (Rencontre des Etudiants Chercheurs en Informatique pour le Traitement Automatique des Langues) est la conférence annuelle de l'ATALA des jeunes chercheurs. Elle est organisée en parallèle des conférences JEP et TALN 2004 qui auront lieu en conjonction à Fès, au Maroc, du 19 au 22 avril 2004. Toutes les informations relatives à RECITAL, ainsi qu'aux deux conférences, les appels à communication, et les renseignements pratiques, sont accessibles sur le site : http://www.lpl.univ-aix.fr/jep-taln04/

RECITAL 04 est organisée par :
. le Laboratoire Parole et Langage, Aix-en-Provence (France)
. l'Université de Fès (Maroc)
. l'Ecole Normale Supérieure de Fès (Maroc)

Calendrier :
- Soumission des articles : 15 janvier 2004
- Notification aux auteurs : 20 février 2004
- Version finale : 8 mars 2004
- Conférence : 19-22 avril 2004

---------------------------------------------------------------------

RECITAL 2004 (Rencontre des Etudiants Chercheurs en Informatique pour le Traitement Automatique des Langues) is the annual young researchers' conference of the ATALA association (Association pour le Traitement Automatique des Langues). It will be held April 19-22, 2004, in Fez, Morocco, jointly with JEP and TALN 2004. All details about these conferences, with the complete calls for papers and practical information, are available online at: http://www.lpl.univ-aix.fr/jep-taln04/

RECITAL 04 is organized by:
. the Laboratoire Parole et Langage, Aix-en-Provence (France)
. the University of Fez (Morocco)
. the Ecole Normale Supérieure of Fez (Morocco)

Calendar:
- Submission deadline: 15 January 2004
- Notification to authors: 20 February 2004
- Camera-ready: 8 March 2004
- Conference: 19-22 April 2004

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:31 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 9 Jan 2004 18:20:31 +0100
Subject: Appel: ACL 2004 WORKSHOP : TEXT MEANING and INTERPRETATION
Message-ID:

ACL 2004 WORKSHOP
2nd Workshop on TEXT MEANING and INTERPRETATION
25-26 July 2004, Barcelona
In conjunction with the 42nd annual meeting of the Association for Computational Linguistics (www.acl2004.org)
Workshop home page: www.cs.toronto.edu/~gh/TextMeaning.html

Overview

This 1.5-day workshop will continue the success of the 2003 Workshop on Text Meaning, which was held at HLT/NAACL-2003 in Edmonton. It aims to:
* Re-establish the research community of knowledge-based interpretation of text meaning.
* Explicate the implicit treatments of meaning in current knowledge-lean approaches and how they and knowledge-rich methods can work together.
* Emphasize the construction of systems that extract, represent, manipulate, and interpret the meaning of text (rather than theoretical and formal methods in semantics).

Most, if not all, high-end NLP applications -- such as machine translation, question answering and text summarization -- stand to benefit from being able to use text meaning in their processing.
But the bulk of work in the field in recent years has not pertained to the treatment of meaning. The main reason given is the complexity of the task of comprehensive meaning analysis and interpretation.

Computational linguistics has always been interested in meaning, of course. The tradition of formal semantics, logics, and common-sense reasoning systems has been continuously maintained for many years. But also, much work has been devoted to building practical, increasingly broad-coverage meaning-oriented analysis and synthesis systems. Lexical semantics has made significant progress in theories, description, and processing. Formal aspects of ontology work have also been studied. The Semantic Web has further popularized the need for automatic extraction, representation, and manipulation of text meaning: for the Semantic Web to really succeed, the capability of automatically marking text for content is essential, and this cannot be attained reliably using only knowledge-lean, semantics-poor methods.

While there has recently been a flurry of specialized meetings devoted to formal semantics, lexical semantics, the semantic web, formal ontology and others, the number of meetings devoted to knowledge-based text meaning processing -- content rather than formalism -- has been much smaller. The first Workshop on Text Meaning began to remedy this, and ten papers were presented on implemented systems and on related topics.

Suggested Topics (not necessarily limited to the following)
* Implemented systems that extract, represent, or manipulate text meaning.
* Broad-coverage semantic analysis and interpretation.
* Knowledge-based text synthesis.
* The nature of text meaning required for various practical broad-coverage applications.
* Manual annotation of text meaning, including interlingual annotations.
* Pragmatics and discourse issues as parts of meaning extraction and manipulation.
* Ontologies supporting automatic processing of text meaning.
* Semantic lexicons.
* Microtheories to support text meaning extraction and manipulation: aspect, modality, reference, etc.
* Text meaning representations in semantic analysis.
* Reasoning to support semantic analysis and synthesis.
* Multilingual aspects of meaning representation and manipulation.
* Integrating semantic analysis and non-semantic language processing.
* Semantic analysis and synthesis systems based on knowledge-lean stochastic corpus-oriented methods.

We encourage discussion of theoretical issues that are relevant to computational applications, including descriptions of processors and static knowledge resources. We specifically prefer discussions of content and meaning over discussions of formalisms for encoding meaning, and discussions of decision heuristics in processing over discussions of generic processing architectures and theorem-proving mechanisms.

Submission Procedure

Submit papers electronically (no more than 8 pages in the ACL two-column format available at www.acl2004.org), PDF strongly preferred, to gh at cs.toronto.edu

Deadlines
* Paper submission 1 April 2004
* Notification re acceptance 30 April 2004
* Camera-ready version due 16 May 2004
* Workshop dates 25-26 July 2004

Organizers
* Graeme Hirst, University of Toronto (gh at cs.toronto.edu)
* Sergei Nirenburg, University of Maryland, Baltimore County (sergei at umbc.edu)

Program Committee
* Jan Alexandersson (DFKI Saarbrücken)
* Collin Baker (ICSI Berkeley)
* Peter Clark (Boeing)
* Dick Crouch (PARC)
* Richard Kittredge (University of Montreal)
* Paul Kingsbury (Penn)
* Tanya Korelsky (CoGenTex, Inc.)
* Claudia Leacock (ETS Technologies)
* Dan Moldovan (University of Texas at Dallas)
* Antonio Moreno Ortiz (University of Málaga)
* Martha Palmer (University of Pennsylvania)
* Gerald Penn (University of Toronto)
* Victor Raskin (Purdue University)
* Ellen Riloff (University of Utah)
* Graeme Ritchie (University of Edinburgh)
* Manfred Stede (University of Potsdam)
* Karin Verspoor (Los Alamos National Labs)
* Yorick Wilks (University of Sheffield)

Additional information
Graeme Hirst
Department of Computer Science
University of Toronto
Toronto, Ontario, Canada M5S 3G4
gh at cs.toronto.edu

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:37 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 9 Jan 2004 18:20:37 +0100
Subject: Appel: COLDOC' 2004 : The Setting up of Observables in Linguistics
Message-ID:

(apologies for multiple postings)

COLDOC' 2004
2nd CALL FOR PAPERS
The Setting up of Observables in Linguistics
Young researchers' conference - Nanterre, France - April 29 & 30, 2004

The young researchers of the Modèle, Dynamique, Corpus research team (UMR 7114 CNRS Université Paris-X Nanterre) are organizing a young researchers' conference, scheduled for April 29 and 30, 2004, on the Université Paris X-Nanterre campus. The central topic of this conference is the setting up of observables in linguistics, i.e. defining and making use of both attested and constructed data.
Young researchers from all fields and domains of linguistics are, therefore, invited to submit a paper. Postgraduate, Ph.D. and postdoctoral researchers are invited to provide useful insights and experience from their respective research areas. We welcome communications (in English or French) addressing methodological and theoretical issues related to the process of setting up linguistic data, as well as to data collection and use. For example, communications may address one of the following issues:
- Relevance and selection of linguistic data;
- Corpora and emerging linguistic phenomena;
- Oral, written or signed data collection methodology and practice;
- Questions related to corpus tools, transcription and encoding;
- The use and place of quantitative methods, both generic and specific;
- Qualitative methods;
- Comparison of languages, text genres or discourses.

Each conference session will start with an invited speaker's talk. A roundtable will be held at the end of the conference. Communications should last 20 minutes, followed by 10 minutes for questions.

The deadline for proposals is January 26, 2004. Communication proposals will be evaluated anonymously by the scientific committee. Authors are invited to send two separate files, in Word format: first, a two-page summary (3000 characters) of their communication; second, a file stating the authors' names, e-mail addresses and affiliations, together with the title of their communication. Authors may also state their preference regarding the format of their communication: oral or poster.

Communications will be evaluated according to a range of selection criteria, favoring those papers which fully address the issue stated above, which show methodological relevance and scientific interest, and which state their point clearly.
Communication proposals, as well as other requests, should be addressed to: , or by postal mail, to the following address:
ColDoc' 2004
MoDyCo (UMR 7114)
Secrétariat sciences du langage
Université Paris-X Nanterre, Bât. L
200, avenue de la République
92001 Nanterre Cedex
France

We look forward to welcoming you at Nanterre Université for the conference.

The Organizing Committee: Antonio Balvet, Sophie Hamon, Sylvain Loiseau, Ali Tifrit, Cécile Vigouroux.

Scientific Committee:
---------------------
Driss Ablali
Karine Baschung
Gabriel Bergounioux
Simon Bouquet
Nick Clements
Marcel Cori
Sophie David
Annie Delaveau
Bernard Fradin
Françoise Gadet
Nathalie Gasiglia
Philippe Gréa
Françoise Kerleroux
Mark Klein
Anne Lacheret
Bernard Laks
Sarah Leroy
Colette Noyau
Thierry Poibeau
François Rastier
Tobias Scheer
Pascale Sébillot
Anna Sores
Nathalie Vallée
Florence Villoing
Geoffrey Williams

Important dates:
---------------------
Submission deadline: January 26, 2004
Authors' notification of acceptance: March 22, 2004
Conference: April 29 & 30, 2004

The Setting up of Observables in Linguistics
ColDoc'2004
Modyco (UMR 7114) young researchers' conference
Paris X Nanterre, Salle des colloques, Bâtiment B
200, avenue de la République
92001 Nanterre Cedex
France
Web site: http://infolang.u-paris10.fr/modyco/textes/actualites/Page.html

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 12 09:39:05 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 12 Jan 2004 10:39:05 +0100
Subject: Conf: TALN04 : deadline extension
Message-ID:

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fez (Morocco)
April 19 - 22, 2004
http://www.lpl.univ-aix.fr/jep-taln04/
**********************************************************************

->>>> NEW DEADLINE : 20 January

CALL FOR PAPERS

Important Dates
---------------
Submission deadline: 20 January 2004
Notification to authors: 20 February 2004
Camera-ready: 8 March 2004
Conference: 19-22 April 2004

Submitted papers must not exceed ten pages, in Times 12, single spaced (about 3000 words), including figures, examples and references. Posters or demo papers should not exceed 6 pages. A LaTeX style file and a Word template are available on the conference web site: http://www.lpl.univ-aix.fr/jep-taln04/

Papers are to be submitted before January 20, 2004 through the online submission procedure available on the web site: http://www.lpl.univ-aix.fr/jep-taln04/

Papers MUST be sent in PDF. In particular cases, we may accept submissions in RTF (Word) format.
IMPORTANT: All the PostScript versions must be in A4 format, and not US Letter.

If electronic submission is impossible, a printed version of the submission will be accepted.
In this case, three hard copies of the paper must be received by January 20, 2004 at:
Philippe Blache - TALN 2004
LPL, Université de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fès (Maroc)
du 19 au 22 avril 2004
http://www.lpl.univ-aix.fr/jep-taln04/
**********************************************************************

->>>> NOUVELLE DATE LIMITE : 20 Janvier

APPEL À COMMUNICATIONS

CALENDRIER
Date limite de soumission : 20 janvier 2004
Notification aux auteurs : 20 février 2004
Version finale (prêt-à-clicher): 8 mars 2004
Conférence : 19-22 avril 2004

Les articles soumis ne devront pas dépasser 10 pages en Times 12, espacement simple, soit environ 3000 mots, figures, exemples et références compris. Les propositions de démonstrations ou les posters ne devront pas dépasser 6 pages. Une feuille de style LaTeX et un modèle Word sont disponibles sur le site web de la conférence http://www.lpl.univ-aix.fr/jep-taln04/.

Les articles devront parvenir au comité d'organisation avant le 20 janvier 2004, en utilisant le formulaire de soumission en ligne à l'adresse suivante : http://www.lpl.univ-aix.fr/jep-taln04/

L'un des formats suivants devra IMPÉRATIVEMENT être employé:
- PDF, RTF (Word)
Les versions devront être au format A4.

En cas d'impossibilité d'envoi par courrier électronique, une soumission "papier" pourra être admise.
3 exemplaires papier de la contribution devront être envoyés à l'adresse suivante:
Philippe Blache - TALN 2004
LPL, Université de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 12 09:39:10 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 12 Jan 2004 10:39:10 +0100
Subject: Appel: SENSEVAL-3
Message-ID:

[Apologies for multiple postings]

=====================================================================
CALL FOR PARTICIPATION IN THE SENSEVAL-3 EVALUATIONS

SENSEVAL-3
Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text
An ACL-2004 Workshop
Barcelona, Spain
July 25-26, 2004
http://www.senseval.org/senseval3
======================================================================

The main purpose of this workshop is to analyze and discuss the results of systems participating in the Senseval-3 evaluations, to be held in March-April 2004. Fourteen different tasks are planned for Senseval-3, to conduct evaluations of systems that perform automatic semantic analysis of text, including: word sense disambiguation for various languages, identification of semantic roles, logic forms, multilingual annotations, subcategorization acquisition.

This is an advance notice of the evaluation exercise and workshop.
Registration for the evaluation will open in February (watch the website for updates). Papers will be accepted from participants only.

[BACKGROUND]

There are now many computer systems that do automatic semantic analysis of text. The purpose of Senseval is to evaluate the strengths and weaknesses of such systems with respect to different words, relations, types of texts, different varieties of language, and different languages.

This workshop is a follow-up to Senseval-1 and Senseval-2. Senseval-1 took place in the summer of 1998 for English, French, and Italian, culminating in a workshop held at Herstmonceux Castle, Sussex, England on September 2-4. Senseval-2 took place in the summer of 2001, and was followed by a workshop held in July 2001 in Toulouse, in conjunction with ACL-2001. Senseval-2 included tasks for Basque, Chinese, Czech, Danish, Dutch, English, Estonian, Italian, Japanese, Korean, Spanish, Swedish.

[TASKS]

The following tasks are planned for Senseval-3 (see webpage for a description of each task):
1. English all words
2. Italian all words
3. Basque lexical sample
4. Catalan lexical sample
5. Chinese lexical sample
6. English lexical sample
7. Italian lexical sample
8. Romanian lexical sample
9. Spanish lexical sample
10. Automatic subcategorization acquisition
11. Multilingual lexical sample
12. WSD of WordNet glosses
13. Semantic Roles
14. Logic Forms

This 2-day workshop will consist of several Senseval-3 task and system presentations, including analyses of results obtained during the evaluations, with comparisons across different systems, techniques, and languages. We also plan for two panels on (1) the interaction between systems for semantic analysis of text and other NLP applications, and (2) planning Senseval-4.
[SUBMISSION FORMAT]

Submissions will consist of refereed papers describing the Senseval-3 tasks and participating systems:
- one paper for each task, limited to four pages
- one paper for each participating team, limited to four pages for the first task, and one extra page for each additional task

Papers will have to follow the ACL 2004 formatting guidelines. Submissions will be entered via the Senseval-3 website.

[IMPORTANT DATES]

Registration: February
Evaluations: March - April
Deadline for paper submissions: April 20
Deadline for camera-ready papers: May 18
Workshop: July 25-26

[ORGANIZING COMMITTEE]

Phil Edmonds, Sharp Laboratories of Europe
Rada Mihalcea, University of North Texas

[PROGRAM COMMITTEE]

Eneko Agirre, University of the Basque Country
Rebecca Bruce, University of North Carolina at Asheville
Nicoletta Calzolari, ILC-CNR, Pisa
Tim Chklovski, Information Sciences Institute
Massimiliano Ciaramita, Brown University
Silviu Cucerzan, Microsoft Research
Walter Daelemans, University of Antwerp
Florentina Hristea, University of Bucharest
Nancy Ide, Vassar College
Diana Inkpen, University of Ottawa
Adam Kilgarriff, University of Brighton
Dimitrios Kokkinakis, Goteborg University
Anna Korhonen, University of Cambridge
Robert Krovetz, Teoma
Sadao Kurohashi, The University of Kyoto
Dekang Lin, University of Alberta
Ken Litkowski, CL Research
PengYuan Liu, Harbin Institute of Technology
Bernardo Magnini, ITC-IRST, Trento
Lluis Marquez, University of Catalunya
Diana McCarthy, University of Sussex
Vivi Nastase, University of Ottawa
Hwee Tou Ng, National University of Singapore
Martha Palmer, University of Pennsylvania
Patrick Pantel, Information Sciences Institute
Ted Pedersen, University of Minnesota, Duluth
Judita Preiss, University of Cambridge
Amruta Purandare, University of Minnesota, Duluth
German Rigau, University of the Basque Country
Vasile Rus, Indiana University South-Bend
Charles Schafer, Johns Hopkins University
Carlo Strapparava, ITC-IRST, Trento
Dan Tufis, Romanian Academy
Cynthia Thompson, University of Utah
Paola Velardi, La Sapienza, Rome
Janyce Wiebe, University of Pittsburgh
David Yarowsky, Johns Hopkins University
Deniz Yuret, Koc University

From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:19 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 13 Jan 2004 17:49:19 +0100
Subject: Jobs: Sinequa : moteur de recherche en portugais et en danois
Message-ID:

Sinequa is a company that develops, among other things, a strongly linguistics-based search engine. See http://www.sinequa.com for more information.

We are looking for two people to develop the engine for Portuguese and Danish. The required skills are as follows:
- Strong skills in linguistics or terminology (maîtrise or DESS level);
- Computer literacy (office tools, Internet) required;
- Script programming (Perl or other) highly valued;
- Perfect command of Portuguese and Danish.

The work will consist of developing morpho-syntactic lexicons, tagged corpora, analysis automata (named entity recognition, etc.), etc. The contract will last 3 to 6 months.

If you are interested, please send your CV by email to loupy at sinequa.com

Regards
--
Claude de Loupy - Head of Research
Sinequa - http://www.sinequa.com
email: loupy at sinequa.com
tel.
: 33 1 49 87 06 00 - fax : 33 1 49 87 06

From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:22 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 13 Jan 2004 17:49:22 +0100
Subject: Conf: Workshop "Terminology, Ontology & Knowledge Representation" : Program
Message-ID:

Please note that the workshop will now be held on 22-23 January 2004.

Venue: Manufacture des Tabacs, University of Lyon 3
4 cours Albert Thomas, 69008 Lyon - France
http://www.univ-lyon3.fr/partagedessavoirs/termino2004

-------------------------------WORKSHOP PROGRAM----------------------

22nd January 2004

9h00 : Welcome remarks / Accueil des participants

§ Ontology & Knowledge representation : methodological approaches & applications

9h30 - 10h00 : Cartographier la connaissance.
Anthony Frémaux (Société Cognito)

10h10 - 10h40 : Partage des connaissances terminologiques en milieu industriel : approche théorique et implantation informatique
Jérémy Roy (ERSICOM, Université de Lyon 3)

10h50 - 11h00 : Break / Pause

11h00 - 11h30 : Extraction d'informations sémantiques pour l'aide à la construction d'ontologies différentielles.
V. Malaisé, P. Zweigenbaum, B. Bachimont (STIM AH/HP, CRIM-INALCO)

11h40 - 12h20 : Base de connaissances GENOMA : le rôle de l'ontologie
M. Teresa Cabré, Judit Feliu, Jorge Vivaldi (Université Pompeu Fabra, Barcelona)

12h20 - 14h00 : Lunch break / Pause déjeuner

§ Invited speaker / Conférencière invitée

14h00 - 15h00 : Du corpus à une représentation relationnelle du lexique : la question des marqueurs des relations conceptuelles.
Anne Condamines (Erss, Toulouse)

15h00 - 15h30 : Towards multilingual, termontological support in ontology engineering.
Koens Kerremans, Rita Temmerman (CVC, Brussels)

15h40 - 16h10 : Ontology Via Terminology ?
Lee Gillam, Mariam Tariq (University of Surrey, UK)

16h20 - 16h30 : Break / Pause

16h30 - 17h00 : ONTOLOGICO : vers un outil d'assistance au développement itératif des ontologies.
Yassine Gargouri, Bernard Lefebvre, Jean Guy Meunier (LANCI, Université de Québec à Montréal)

§ Terminology : methodological approaches I

17h10 - 17h40 : Variations et traitement automatique de la terminologie
Béatrice Daille (IRIN, Université de Nantes)

17h50 - 18h30 : A methodology for classifying documents using terminological taxonomies
Bellomi & Crestani (Université de Verona, Italy)

****** 23rd January 2004 ******

§ Terminological theories

9h00 - 9h30 : La philosophie comme multi-terminologie
B. Hufschmitt (Université de Franche-Comté)

§ Terminology : methodological approaches II

9h40 - 10h10 : Repérage humain ou automatique des relations lexico-sémantiques; bilan d'une tentative de formalisation
Jeanne Dancette, Sonia Halimi (Ecole de Traduction, Univ. de Genève & Université de Montréal)

10h20 - 10h50 : Adjectifs dérivés sémantiques (ADS) dans la structuration des terminologies
Marie-Claude L'Homme (Université de Montréal)

11h00 - 11h10 : Break / Pause

11h10 - 11h40 : A computer-aided terminology processing system prototype
Le An Ha (University of Wolverhampton, UK)

11h50 - 12h20 : Terminology expansion and relation identification between genes and pathways
James Dowdall, Fabio Rinaldi, Andreas Persidisy, et al.
(IFI, University of Zurich)

12h30 - 14h00 : Lunch break / Pause déjeuner

§ Terminology : applications

14h00 - 14h30 : Une terminologie du domaine médical : structure et exploitation
L. Soualmia, A. Névéol, M. Douyère et al. (CHU, Université de Rouen)

14h40 - 15h10 : Referencing text documents in multidimensional concept spaces for technology and scientific watch. A conceptual overview of text models in the context of a collaborative scientific watch system
Jean-Sébastien Brunner, Thibaud Latour (CRP Henri Tudor, Centre for IT Innovation, Luxembourg)

15h20 - 15h30 : Break / Pause

15h30 - 16h00 : De l'élaboration d'un dictionnaire de description de sens des termes médicaux vietnamien-français-anglais à la recherche d'information médicale par croisement de langues: une approche socioterminologique
Tuan Duc Tran, N. Garcelon, D. Delamare (Faculté de Médecine-Université de Rennes)

16h10 : Plenary discussion / Discussion plénière

17h00 : End of workshop / Fin de l'atelier.

--------------------------------------
Ibekwe-SanJuan Fidelia
Workshop "Terminology, Ontology & Knowledge representation"
22-23 January 2004, University of Lyon 3.
http://www.univ-lyon3.fr/partagedessavoirs/termino2004 ------------------------------------------------------------------------- Message distributed by the Langage Naturel list. Information, subscription : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues). Information and membership : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:23 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Tue, 13 Jan 2004 17:49:23 +0100 Subject: Appel: CIDE 7 : EXTENDED DEADLINE : FEBRUARY 8 Message-ID: *************************************************** CIDE.7 SECOND CALL - EXTENDED DEADLINE : FEBRUARY 8 International Colloquium for Electronic Document Colloque International sur le Document Electronique *************************************************** "Semantic Approaches of Electronic Document" La Rochelle (France), June 22-25, 2004 http://infodoc.unicaen.fr/cide/cide7/ To be held as part of "la Semaine du Document Numérique" ("Week for Electronic Document") Since 1998, CIDE has organized scientific meetings on topics of broad interest and importance for progress in electronic document studies. The objectives are to bring together complementary approaches from various disciplines, and to promote academic and industrial results in this area. The main purpose of the seventh conference is to focus on semantic approaches to electronic document processing. Semantic-oriented approaches have long been viewed with scepticism by practitioners and researchers, who favoured so-called "surface" processing, which considers "form" rather than "content" or "meaning". This view has already begun to change. 
Significant progress has been made in recent years, both for text documents (e.g. in the areas of information extraction, question answering, automatic summarisation...) and for other media (content-based indexing of audio-video documents, pictorial or sound information extraction, summarisation of musical or video works...). Moreover, the big challenge of the "semantic web" project is to elaborate formal descriptions of the content of documents and other resources, in order to make them easily accessible and interoperable. Another, more radical, viewpoint would be to consider that even "surface" or "numerical" processing is in fact, if closely observed, semantic in nature. Although "sense" does not reduce to "information", producing any information is producing sense. Lexical disambiguation, even if based on a statistical method not relying on any linguistic theory, does solve a lexical-semantic problem. A program extracting thematic descriptors computes this minimal meaning : "what this document is all about", etc. The aim of CIDE.7 is to shed light on these questions. Two aspects are to be considered : - Presentation and discussion of experiences and advances addressing the semantic analysis of electronic documents, across the various media (text, audio, video), and their networking (semantic web) ; - Methodological investigations in order to establish the basis of a truly semantic approach in document engineering. The conference will include : - Presentations of papers submitted in response to the present call ; - Invited talks providing overviews of the different kinds of semantic processing ; - A final panel, in collaboration with other conferences taking part in the "Week for Electronic Document". 
================= Conference topics ================= The topics addressed by CIDE.7 include (but are not limited to) the following : - Applications : content-based information retrieval, information extraction, inside-document browsing, hypertext structuring, analysis of technical, as well as artistic or literary documents... - Description of document content : indexing, tagging, enrichment... of the whole or segments of documents, constitution of terminologies or ontologies, formalisms for representation of descriptions (RDF, topic maps...), semantic trans-modality modelling... - Processing methods for analysis and use : semantic and semiotic methods specific to the different kinds of documents (text, image, audio, video), collaboration of symbolic and numerical methods, constitution and use of corpora, integration of document bases, web services... - Methodological investigations : sense and use, relation between forms and sense, similarities and differences between media, collaboration for certain tasks... ========================== Language of the conference ========================== The main language is French. However, papers and presentations in English are welcome. ========== Submission ========== Instructions for authors are accessible on the web site of CIDE.7. Declarations of intention to submit a paper should include keywords and a 200-word summary. They must be sent in PDF format. Full papers must not exceed 15 pages (according to the provided style sheets). Submitted papers should follow the same layout as the final ones. =============== Important dates =============== - Declarations of intention to submit (optional) : as soon as possible. - Paper submission : February 8, 2004. - Notification of acceptance : March 15, 2004. - Final papers due : April 15, 2004. - Conference : June 22-25, 2004. 
======================== Contact and informations ======================== Lydie Sauvé, Département d'informatique, Campus II, bd Maréchal Juin, Université de Caen, 14032 Caen Cedex Web Site : http://infodoc.unicaen.fr/cide/cide7/ Email (informations) : cide7 at infodoc.unicaen.fr Email (submission) : cide7-soumission at infodoc.unicaen.fr ========================== Program Committee of CIDE.7 ========================== Chair : P. Enjalbert (U. Caen), M. Gaio (U. Pau) M.H. Antoni (U. Poitiers), T. Baccino (U. Nice), B. Bachimont (INA et UTC), F. Cerbah (Dassault Aviation), J.P. Desclés (Lalicc, U. Paris 4), C. Faure (ENST), S. Ferrari (U. Caen), C. Fluhr (CEA), B. Grau (LIMSI), P. Laublet (Lalicc, U . Paris 4), G. Mourad (Lalicc, U. Paris 4), A. Napoli (LORIA), M-P. Pery Woodley (U. Toulouse 2), I. Saleh (U. Paris 8), K. Tombre (LORIA), B. Victorri (CNRS-ENS), G. Vignaux (CNRS-LCP), H. Vinet (IRCAM), J.Vivier (U. Caen). ==================== Organising Committee ==================== S. Ferrari (coordination), F. Bilhaut, E. Faurot, V. Perlerin, C. Turbout, A. Widlöcher ========================================= Permanent Committee of the CIDE Conference ========================================= M. Bellafkih (Morocco), J. Caelen (France), J. Ducloy (France), M. Gaio (France), J. Gardes (France), J-L. Hainaut (Belgium), P. King (Canada), J. Labiche (France), M. Leonard (Swiss), J-P. Raysz (France), J-M. Robert (Canada), Z. Sahnoun (Algeria), M. Szmurlo (France), L. Thomazo (France), E. Trupin (France), J. Virbel (France), J. Vivier (France), C. Vanoirbeek (Swiss), K. Zreik (France, coordination). 
From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:25 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Tue, 13 Jan 2004 17:49:25 +0100 Subject: Appel: TALN2004: Workshop on QUESTION-ANSWERING Message-ID: ********************************************************************** CALL FOR PAPERS Held in conjunction with T A L N 2 0 0 4 Workshop on QUESTION-ANSWERING --- Palais de Congrès Fès (Morocco) April 22, 2004 http://www.lpl.univ-aix.fr/jep-taln04/ ********************************************************************** Faced with a question such as «What is the most expensive car in the world?», classical search engines return the documents that are the most strongly linked to the words of the question, and sometimes extract the excerpts where these words are the most numerous, but leave it to the user to browse the texts to actually find an answer. This need has led to the development of systems that are able to extract the parts of documents that are the most relevant to a question, providing either an answer when the question is about a precise fact or a summary when it is a topical question. These functions can be implemented only if IR systems are able to analyze both queries and documents more deeply. 
As a consequence, question answering is at the crossing of several research fields: of course, it is grounded in Information Retrieval but it also concerns Natural Language Processing (NLP) in an important way and to some extent, fields such as Machine Learning. Most QA systems are based on a classical search engine that is enhanced by a question analysis module, a set of modules for extracting various linguistic features from documents, such as named entities, terms or syntactic relations, and a module that relies on all these data for extracting answers by mixing linguistic and numerical criteria. Moreover, the QA problem puts forward new functions, or functions that are still in an embryonic state in current IR systems: evaluating if an answer to a question exists in a document collection, achieving a synthesis from multiple or partial answers, using dialog for constructing a query, or text understanding capabilities for dealing with anaphora, inferences, or for determining if a set of several answers is coherent. More precisely, submissions will present a question answering system as a whole or will focus on one of its processes provided that it is put in the question answering context. These processes include but are not limited to: - question analysis: question typology, extraction of the question focus, of the question context or more generally, of semantic constraints - named entity recognition: fine-grained named entities, unrestricted domains - passage extraction - full or partial similarity of syntactic structures - terminological tools: extraction and recognition of terms and their variants - extraction and justification of answers: answer patterns, inferences, paraphrase ... This workshop is particularly concerned by papers that focus on QA systems for large collections of documents or the Web but papers about QA systems for restricted domains or dedicated to knowledge bases or database will also be taken into account. 
Submissions can also tackle cross-domain topics in relation to Question Answering, such as: - QA and machine learning: use of machine learning for selecting and extracting answers to a question, but also for building, on a large scale, the resources that QA systems need; - multilingual and crosslingual QA: what are the difficulties of adapting an existing QA system (most of them only work for English) to another language; asking a question in one language and searching for an answer in a collection of documents in another language; - QA and the Web: using the Web as a source of knowledge or a source of answers; what are the specific aspects of searching for an answer on the Web; - multi-document QA: fusion and coherence of multiple answers. SUBMISSION: Submissions will be summaries of at least 4 pages or long papers of no more than 10 pages, written in French or English, according to the style of the main conference TALN 2004. The final version will be a long paper. The submission format is PDF, but .doc and .ps will also be accepted. Papers have to be sent to Brigitte.Grau at limsi.fr, with TALN-QA as subject. IMPORTANT DATES: Submission deadline: 15 January 2004 Notification to authors: 20 February 2004 Camera-ready: 8 March 2004 Question-Answering workshop: 22 April 2004 Groupe LIR - LIMSI BP 133, 91403 Orsay Cedex tel. 01 69 85 80 03, fax 01 69 85 80 88 et Institut d'Informatique d'Entreprise (IIE) 18 allée Jean Rostand, 91025 Evry Cedex tel. 
01 69 36 73 44, fax 01 69 36 73 09 ---------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:26 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Tue, 13 Jan 2004 17:49:26 +0100 Subject: Appel: OntoLex 2004:Ontologies and Lexical Resources in Distributed Environments Message-ID: ****APOLOGIES FOR MULTIPLE POSTINGS**** SECOND AND FINAL CALL FOR PAPERS Workshop OntoLex 2004: Ontologies and Lexical Resources in Distributed Environments http://www.loa-cnr.it/ontolex2004.html Centro Cultural de Belem LISBON, Portugal 29th May 2004 In Association with 4th INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION LREC2004 http://www.lrec-conf.org/lrec2004/index.php Main conference 26-27-28 May 2004 Motivations and aim The use of ontological knowledge in language technology applications goes a long way back. Recently, however, the project of turning the World Wide Web into a machine-understandable resource to access digital information (the so-called Semantic Web) has stimulated a renewed interest in ontologies. In several recent workshops and conferences, researchers have investigated their nature and application potential for knowledge management, information retrieval and extraction, information exchange in agent-based systems as well as dialogue systems. Attention is being drawn to new aspects of ontology research such as ontology coordination and mapping, 
aspects that are particularly relevant for distributed environments such as Knowledge Grid and Semantic Web. In fact the annotation of web resources in agreement with concepts and relations as defined in ontologies is useful for establishing a conceptual support for knowledge communication. From this perspective, lexicographers, lexical semanticists and ontologists are joining forces to build innovative systems for integrating ontological knowledge with lexical and semantic resources. Important examples of this interaction are the recent works on the conceptual analysis of WordNet (one of the first lexical knowledge bases), and the wide use of upper ontologies in innovative international projects like EuroWordNet, SIMPLE, Balkanet, DWDSnet. WordNet was designed and built entirely by psychologists, linguists, and lexicographers. Nevertheless, there are obvious parallels with ontologies, especially in the kinds of structuring relations used (taxonomical links, meronymy or part-of, etc.), and indeed WordNet has for years attracted the attention of philosophers and ontologists. In this context, the distinction between conceptual (possibly axiomatic) ontologies and lexical ontologies (which contain both linguistic and ontological information) has become more and more central in the field. In this workshop we want to discuss ontologies as resources per se, as well as the relation between ontological knowledge and language. This relation can be investigated from a number of different angles, for example what differences and similarities there are between ontologies and more traditional lexical resources such as dictionaries and wordnets; how ontologies can be extracted from language corpora; what role language plays in the definition and mapping of ontologies; and finally, how ontologies can be used to treat language in language technology applications, in particular applications for distributed environments. 
Topics to be addressed in the workshop include, but are not limited to: - Design principles and methodologies for upper-level ontologies and semantic lexical resources - Evaluation, comparison, mapping and integration of ontologies and lexical resources - Applications of ontologies and semantic lexical resources in LT applications (e.g. QA, Information Retrieval, Information Extraction, Machine Translation) - Role of semantic lexical resources in ontology learning - Methods to derive ontological knowledge from text - Methods to annotate text with reference to an ontology - Ontology-based query expansion techniques - Ontologies and multi-lingual lexical resources - Ontologies and ontology mapping in multi-lingual applications - Ontologies and lexical resources for meaning negotiation Two discussions will be organised around the following topics: - Filling the gap between axiomatic and linguistic ontologies - The role of lexical resources in the Semantic Web and the Knowledge Grid Reasons of interest A new scientific community is growing around this largely interdisciplinary area: following the spirit of the previous two OntoLex workshops, this workshop aims at being an important meeting point for researchers involved in the fields of lexical resources and ontologies, favouring the exchange of scientific experiences and proposing new directions of inquiry. This year, the workshop particularly welcomes contributions from researchers who are investigating the application of ontologies and lexical resources in distributed environments such as Knowledge Grid and Semantic Web. 
Important dates - 4th December 2003: Call for papers and demonstrations - 30 January 2004: Deadline for paper submission - 5 March 2004: Acceptance notifications and preliminary program - 29 March 2004: Deadline final version of accepted papers - 29 May 2004: Workshop Submissions Participants are invited to submit an extended abstract of max 3000 words related to one or more of the topics of interest. Papers can describe research results as well as work in progress. Each accepted paper will receive a slot of 30 minutes for presentation (20 minutes talk and 10 minutes for discussion). Demonstrations of ontology applications are encouraged as well (a demonstration outline of 2 pages can be submitted). Each submission should show: title; author(s); affiliation(s); and contact author's e-mail address, postal address, telephone and fax numbers. Submissions must be sent electronically in PDF to Alessandro Oltramari (oltramari at loa-cnr.it) As soon as possible, authors are encouraged to send a brief email indicating their intention to participate, including their contact information and the topic they intend to address in their submissions. Proceedings of the workshop will be printed by the LREC Local Organising Committee. Time schedule and registration fee The workshop will consist of a morning session and an afternoon session, and include scientific paper presentations from workshop participants as well as general discussions. For this full-day workshop, the registration fee is 100 EURO for LREC conference participants and 170 EURO for other participants. These fees will include a coffee break and the Proceedings of the Workshop. 
Organising Committee Alessandro Oltramari (Laboratory for Applied Ontology, ISTC-CNR; Department of Cognition and Education Sciences, Trento University) Patrizia Paggio (Center for Sprogteknologi, University of Copenhagen) Aldo Gangemi (Laboratory for Applied Ontology, ISTC-CNR Rome) Maria Teresa Pazienza (Roma Tor Vergata University) Nicoletta Calzolari (Istituto di Linguistica Computazionale del CNR) Bolette Sandford Pedersen (Center for Sprogteknologi, University of Copenhagen) Kiril Simov (Bulgarian Academy of Sciences) Programme Committee Roberto Basili (Roma Tor Vergata University) Werner Ceusters (Language & Computing) Nicoletta Calzolari (Istituto di Linguistica Computazionale del CNR) Aldo Gangemi (Laboratory for Applied Ontology, ISTC-CNR, Rome) Eric Gaussier (Xerox Research Centre Europe, Grenoble Laboratory) Maria Toporowska Gronostaj (Språkdata, University of Gothenburg) Nicola Guarino (Laboratory for Applied Ontology, ISTC-CNR, Trento) Arne Jönsson (Linköping Universitet) Dimitrios Kokkinakis (Språkdata, University of Gothenburg) Alessandro Lenci (Universitá di Pisa) Claude de Loupy (Sinequa and University of Paris 10) Bernardo Magnini (ITC-IRST, Trento) Jørgen Fischer Nilsson (Technical University of Denmark) Alessandro Oltramari, (Laboratory for Applied Ontology, ISTC-CNR, Trento) Patrizia Paggio (Center for Sprogteknologi) Maria Teresa Pazienza (Roma Tor Vergata University) Bolette Sandford Pedersen (Center for Sprogteknologi) Guus Schreiber (Vrije Universiteit Amsterdam) Kiril Simov (Bulgarian Academy of Sciences) Atanas Kiryakov (Ontotext Lab, Sirma AI) Paola Velardi (Università La Sapienza, Rome) ------------------------------------------------------------------------- Message diffusé par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrainée par l'ATALA (Association 
pour le Traitement Automatique des Langues) Information et adhésion : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:27 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Tue, 13 Jan 2004 17:49:27 +0100 Subject: Appel: ACL04: WORKSHOP ON QUESTION ANSWERING IN RESTRICTED DOMAINS Message-ID: FIRST CALL FOR PAPERS ACL04 WORKSHOP ON QUESTION ANSWERING IN RESTRICTED DOMAINS Barcelona, Spain, 25-26 July 2004 Submission deadline: 15 March 2004 http://www.clt.mq.edu.au/Events/Conferences/acl04qa/ Much of the current research in question answering systems is driven by programs such as AQUAINT and evaluation exercises such as TREC, NTCIR and CLEF, all of which focus on open-domain question answering. The availability of large volumes of data (e.g. documents extracted from the World Wide Web) has prompted the development of systems that focus on shallow text processing. But there are many document sets in restricted domains that are potentially valuable as a source for question answering systems. For example, the documentation pages of Unix and Linux systems would make an ideal corpus for QA systems targeted at users that want to know how to use these operating systems. There is a wealth of information in other technical documentation such as software manuals, car maintenance manuals, and encyclopediae of specific areas such as medicine. Users interested in these specific areas would benefit from QA systems targeted to their areas of interest. Restricted domains typically have limited data available and therefore conventional techniques based on data redundancy can simply not be applied in an effective way. The scarcity of data available seems to prompt for a more targeted, NLP-intensive approach to QA. The use of additional corpora such as the WWW raises a number of interesting questions. 
For instance, will these corpora help or obstruct the proper functioning of an NLP-intensive approach to QA? And, how do we find good pockets of information that are appropriate to the chosen domains? On the other hand, restricted domains (e.g. law, medicine) have specific stylistic conventions. Often these domains use terminology that is not stored in conventional lexica. Consequently NLP approaches devised for open-domain systems may under-perform on these specific domains, thus raising the question of how portable these systems can be. In this workshop we aim at answering some of the following questions: * Are open-domain question answering techniques appropriate for QA in restricted domains? * Can we use generic large corpora and/or the WWW? How can we identify specific pockets of information in these generic corpora? * How can we use specific sources such as the CIA factbook, acronym lists, e-commerce sites (e.g. e-bay), and specialized glossaries and encyclopedia? How can we discover new specific sources? * What types of question-answering techniques are best for what types of restricted domains? * Is it easy/possible/worthwhile to develop domain-independent QA systems for restricted domains? What would be the cost of porting a QA system to a specific domain? * Are restricted domains more suitable than open domains to drive research in NLP? * Is evaluation of restricted-domain QA systems different than that of open-domain QA systems? We welcome papers that address any of the above questions or that focus on any of the following topics: * Comparison between open-domain and restricted-domain QA * Characterisation of the types of restricted domains and the technology required for QA on those domains * Methodologies and/or tools for restricted-domain QA * Description of specific restricted-domain QA systems * Development of modules (e.g. 
document preselection, NE extraction, terminology extraction) for use in restricted-domain QA systems * Portability of QA systems between different restricted domains * Evaluation of restricted-domain QA systems SUBMISSION PROCEDURE Authors should submit full papers of maximum 8 pages, including references and figures, following the main conference ACL style format (http://www.acl2004.org/aclstyles/style.html). The review will not be blind. Submissions must be in PS or PDF format and they should be sent to diego at ics.mq.edu.au PROGRAM COMMITTEE Organizers: ----------- Diego Mollá Macquarie University, Australia José Luis Vicedo Alicante University, Spain Committee: ---------- In alphabetical order by first name: Anselmo Peñas UNED, Spain Antonio Ferrández Alicante University, Spain Bernardo Magnini ITC-Irst, Italy Bonnie Webber University of Edinburgh, UK Donna Harman NIST, USA Ellen Voorhees NIST, USA Fabio Rinaldi University of Zurich, Switzerland Felisa Verdejo UNED, Spain Graeme Hirst University of Toronto, Canada Horacio Rodríguez Universitat de Catalunya, Spain Ingrid Zukerman Monash University, Australia Jimmy Lin MIT, USA Johan Bos University of Edinburgh, UK Juergen Franke DaimlerChrysler AG, Germany Julio Gonzalo UNED, Spain Lynette Hirschman MITRE, USA Maarten de Rijke University of Amsterdam, The Netherlands Manuel Palomar Alicante University, Spain Mark Maybury MITRE, USA Michael Hess University of Zurich, Switzerland Pierre Zweigenbaum AP-HP, INSERM & INaLCO, France Richard Sutcliffe University of Limerick, Ireland Rolf Schwitter Macquarie University, Australia Sanda Harabagiu University of Texas, USA IMPORTANT DATES * 15 March 04 Paper submission * 15 April 04 Notification of acceptance * 15 May 04 Camera ready version * 25 or 26 July 04 Workshop (final date not yet determined) CONTACT DETAILS Diego Mollá Centre for Language Technology Division of Information and Communication Sciences Macquarie University New South Wales 2109 Australia Tel. 
+61 2 9850 9531 Fax +61 2 9850 9551 diego at ics.mq.edu.au From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:38 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Fri, 16 Jan 2004 18:12:38 +0100 Subject: Jobs: stages à ATLIF Message-ID: Hello, internship offers for 2004 are posted at http://www.atilf.fr/ananas/ for students in linguistics (SdL), NLP (TAL) or computer science (licence, maîtrise, DEA/DESS level). Please circulate them to anyone who may be interested. 
Best regards, Susanne Salmon -- Susanne Salmon-Alt Chargée de Recherche - CNRS ATILF : 03.83.96.86.98 LORIA : 03.83.59.20.35 From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:41 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Fri, 16 Jan 2004 18:12:41 +0100 Subject: Conf: TALN'04: SDRT workshop : DEADLINE EXTENSION Message-ID: SDRT workshop of TALN'04 The submission deadline has been postponed : it is now 22 January ---------------------------------------------------- Call for papers TALN (Traitement Automatique du Langage Naturel) is the francophone annual conference on NLP. TALN'04 (http://www.lpl.univ-aix.fr/jep-taln04/) will be held in conjunction with JEP 2004 (Journées d'Etude sur la Parole) in Fez, Morocco, from 19 to 22 April 2004, under the aegis of the Association Francophone de la Communication Parlée (AFCP) and of the Association pour le Traitement Automatique des Langues (ATALA, the French equivalent of ACL). The workshop on SDRT will be held on April 22 2004. Official languages are French and English. Topic Papers are invited for thirty-minute talks, including questions, on theoretical and implementational issues in SDRT. SDRT is an approach to discourse interpretation that has many advantages for computational NLP research and applications. 
Originally (Asher 1993) an extension of Hans Kamp's Discourse Representation Theory (DRT), it combines the insights of dynamic semantics on anaphora with a richer theory of discourse structure, in which each clause plays one or more rhetorical functions within the discourse. More than a decade of work has shown the theoretical fruitfulness of marrying a rich notion of discourse structure with dynamic semantics. For example, it has been shown that rhetorical functions have semantic effects in the following domains: - temporal and spatio-temporal structure of the text, - pronominal anaphora, - presupposition, - resolution of bridging expressions (like definite descriptions), - resolution of lexical and other ambiguities like VP ellipsis and quantifier scope, - analysis of plural quantification, - calculation of implicatures and conversational goals of agents in dialogue. In addition to these theoretical aspects, SDRT has been designed from the outset to aid with implementation in automated or semi-automated textual analysis and text generation. It is a modular theory which contains both a theory of information content and a theory of information packaging, i.e. how to construct the logical form of a discourse. The former is straightforwardly an extension of dynamic semantics, and any implementation of the dynamic semantic ideas (e.g. DRT, DPL, DMG) is compatible with SDRT conception of discourse content. The latter exploits diverse resources that are understood in modular form, and it exploits also the notion of an underspecified representation at several levels. An additional feature of SDRT is that it uses a nonmonotonic system of inference. The aim of this workshop is twofold : theoretical and implementational issues on SDRT. So we expect papers either presenting the treatment of a given linguistic phenomenon in SDRT with possibly a comparison with treatments in other discourse semantics framework. 
We also expect papers presenting implementational issues, for example:
- How can HPSG or LFG grammars provide suitable inputs to fragments of an SDRT implementation?
- The inference engine used to compute logical forms for discourse: should this system itself be implemented or should approximations, perhaps even monotonic ones, be used? How can we expect the nonmonotonic logic to scale up for large scale applications? How much logical inference do we really need for shallow applications of SDRT?
- How can statistical approaches be used to obtain lexical information and other information that would be useful in computing discourse structure? How might machine learning apply to learning rules for the computation of discourse structure in SDRT?

All selected papers will be published in the proceedings. Authors are invited to submit original, previously unpublished research work. Submissions will be reviewed by at least two members of the program committee.

Program committee:

Chairmen
Asher Nicholas (Austin), nasher at mail.la.utexas.edu
Danlos Laurence (Paris), laurence.danlos at linguist.jussieu.fr

Members
Amsili Pascal (Paris), pascal.amsili at linguist.jussieu.fr
Bras Myriam (Toulouse), bras at univ-tlse2.fr
Corblin Francis (Paris), corblin at paris7.jussieu.fr
Gaiffe Bertrand (Nancy), bertrand.gaiffe at loria.fr
Kamp Hans (Stuttgart), Hans.Kamp at ims.uni-stuttgart.de
Kruijff-Korbayova Ivana (Saarbrücken), korbay at coli.uni-sb.de
Le Draoulec Anne (Toulouse), draoulec at univ-tlse2.fr
Muller Philippe (Toulouse), muller at irit.fr
Pustejovsky James (Boston), jamesp at cs.brandeis.edu
Roussarie Laurent (Paris), laurent.roussarie at linguist.jussieu.fr
Vieu Laure (Toulouse), Laure.Vieu at irit.fr

Submission procedure

Submitted papers must not exceed ten pages, in Times 12, single spaced (about 3000 words), including figures, examples and references.
A LaTeX style file and a Word template are available on the web site of the conference:

Papers MUST be sent in PDF format
To: Laurence.Danlos at linguist.jussieu.fr
Subject: SDRT'04 paper

Important: all the PDF versions must be in A4 format, and not US Letter.

New submission deadline: 22 January 2004
Notification to authors: 20 February 2004
Camera-ready: 8 March 2004
SDRT workshop: 22 April 2004

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:43 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:43 +0100
Subject: RESSOURCES: constitution d'une base de données vocales
Message-ID:

***************************************
WE ARE INTERESTED IN YOUR VOICE
***************************************

In order to build a speech database for research and development in the field of speech processing, we are looking for:
- 1000 speakers,
- men and women,
- aged 18 and over,
- native speakers of French.

The voice recordings are paid and last about 10 minutes. If you are interested and would like further information, you can contact us:

Tel.: 01 43 13 33 47

*** Thank you.
---------------------------------------------------------------------------
55-57, rue Brillat-Savarin 75013 Paris FRANCE
Tel: (+33) 1 43 13 33 33 / Fax: (+33) 1 43 13 33 30
URL: http://www.elra.info or http://www.elda.fr
LREC conference: http://www.lrec-conf.org
LangTech forum: http://www.lang-tech.org
---------------------------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:44 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:44 +0100
Subject: APPEL: revue TAL : Automatic Text Summarization: Solutions and Perspectives
Message-ID:

Automatic Text Summarization: Solutions and Perspectives

Deadline: February 10th, 2004
Special Issue Editor: Jean-Luc Minel (LaLICC, CNRS)

Context: Automatic Text Summarization

From a scientific point of view, the problem of summarization extends beyond its immediate boundaries, largely due to certain rhetorical relations that need to be taken into consideration, such as discourse frames, or direct or indirect levels of discourse. In recent years, this research area has moved beyond the processing of phrases as the unit of analysis to include concept search and the design of suitable methods for detecting and representing textual structures.
Until the mid-1990s, scientific texts provided a field of experimentation for automatic summarization methods, but the digitization of texts and their increased availability on the Web and intranets has fundamentally changed user needs and uses. The abundance of texts brings with it an urgent requirement for the creation and production of summarization tools capable of finding, selecting and extracting textual information concisely. In terms of the need, it has become necessary to bring web-search tools and company networks together with tools for automatic summarization. In terms of use, the development of text-search tools requires careful consideration of text representation and the design of user interfaces, which in turn leads to studies being carried out in the domain of information science on new types of written text.

Topic

This special issue aims:
- on the one hand, to present new approaches or methods which may lead to promising prototypes for automated systems. It also concerns the development of an awareness of the importance of automatic summarization, alongside technology, for linguistic engineering. Whatever the automatic summarization project, there are only two underlying techniques for its realization: firstly, techniques which extract full phrases from source texts, and secondly, those which generate a new, condensed text. Only the first kind of technique has allowed for the implementation of systems that can be considered as providing reasonable results when evaluated for commercial potential. The second technique is of interest in a research context because of the various linguistic problems left unsolved at the level of interpretation, in view of the limitations in their computational implementation.
- on the other hand, to propose bridges between the various areas in which text constitutes the main object of study (i.e., in the domain of information science).
Papers are invited which contribute to the following themes:
- Numerical approaches versus linguistic approaches, with a particular focus on papers that explore the complementarity between the two.
- The automatic detection of textual structures, including:
  o the identification of topic
  o the identification of argumentative structures
  o the improvement of coherence and cohesion (dealing with anaphora, etc.)
- Different dimensions of summary
  o Types of summary
  o Translingual summaries
  o Multi-document summaries
- Linguistic resources necessary for summarization systems
  o the application of terminology and ontologies
  o Generic versus specific-purpose resources
- Summary and Normalization
  o Integrating summary systems into networks
  o Integrating summarization systems into linguistic (s?)
- Methods of assessment for summarization systems
- Extension of the summarization issue to the semantic filtering of texts:
  o Exploring new opportunities
  o The production of summaries in response to specific user needs
  o The use of navigation tools and interfaces which exploit textual structure
  o summarization and annotation of texts in a Semantic Web context

Editorial Committee

John Atkinson (Université de Concepcion, Chile)
Michel Charolles (LATTICE, Université Paris 3, France)
Jean-Pierre Desclés (LaLICC, Université Paris-Sorbonne, France)
Michael Elhadad (Computer Science Department, Ben Gurion University, Israel)
Noemie Elhadad (Computer Science Department, Columbia University, USA)
Guy Lapalme (Université de Montréal, Canada)
Inderjeet Mani (Société MITRE, USA)
Jean-Guy Meunier (UQUAM, Canada)
Dragomir Radev (University of Michigan, USA)
Antoinette Renouf (University of Liverpool, UK)
Horacio Saggion (Computer Science Department, University of Sheffield, UK)
Dina Wonsever (Université de la République, Uruguay)

Format

Contributions (25 pages maximum) should be submitted in Word, Postscript or Acrobat format.
The file styles are provided as part of the regulations on the journal homepage.

Language

Papers should be written either in French or English. However, English submissions will only be accepted from non-French speakers.

Dates

Submission deadline: February 10th, 2004
Final committee decisions: April 15, 2004
The camera-ready version of the accepted articles should reach the editors by June 1st, 2004, for publication in 2004.

Those who intend to submit an article are encouraged to contact Dr. Jean-Luc Minel (jean-luc.minel at paris4.sorbonne.fr).

Paper submission

The articles should be submitted by electronic mail to: jean-luc.minel at paris4.sorbonne.fr
or by normal mail to the following address:
Jean-Luc MINEL
Laboratoire LaLICC, UMR 8139 (CNRS - Université Paris-Sorbonne)
ISHA, 96 Boulevard Raspail
75 006 Paris

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:47 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:47 +0100
Subject: APPEL: Workshop on COMPILING AND PROCESSING SPOKEN LANGUAGE CORPORA
Message-ID:

This message was posted to several lists. We apologize for any cross-postings.
2ND CALL FOR PAPERS

Workshop on COMPILING AND PROCESSING SPOKEN LANGUAGE CORPORA
http://lands.let.kun.nl/CPSLC/
Centro Cultural de Belem, Lisbon, Portugal
24th May 2004

Workshop to be held in conjunction with the 4th International Conference on Language Resources and Evaluation (LREC 2004)
Main conference: 26-27-28 May 2004
http://www.lrec-conf.org/lrec2004/

Aim

The aim of the workshop is to bring together people working on the development (compilation and processing) of spoken language corpora.* The workshop will provide participants with the opportunity to exchange views and share experiences. Moreover, the workshop is instrumental in taking stock of and evaluating the present state-of-the-art. The workshop thus aims to contribute to the development of a future roadmap that will guide the development of standards, tools, etc. for use with spoken language corpora.

*The term "spoken language corpora" is used here to distinguish such corpora from speech corpora or speech databases: speech corpora are collections of spoken data that are typically recorded for specific purposes by specific users (speech corpora/databases such as SpeechDat Car that are used for developing consumer applications). Usually such databases lack the richness of linguistic annotations that is pursued for spoken language corpora.

Background and motivation

Despite the wide experience gained in the compilation of written language corpora, working with spoken language data is not immediately straightforward, as spoken language involves many novel aspects that need to be taken care of. The fact that spoken language is transient is sometimes offered as an explanation for why it is more difficult to collect spoken data than it is to compile a corpus of written data. However, it is not just the capturing of data that is anything but trivial. Once the (audio) data have been collected and stored, the next step is to produce some kind of transcript (whether orthographic or phonetic).
Further annotations such as POS tagging, lemmatisation, syntactic annotation, and prosodic annotation may then build upon this transcription. Among the problems encountered in the processing of spoken language data are the following:
* There is as yet little experience with the large scale transcription of spoken language data. Procedures and guidelines must be developed, and tools implemented.
* Well-established practices that have originated from working on written language corpora do not hold up when trying to cope with the idiosyncrasies of the spoken language. This is true for all levels of linguistic annotation. Annotation schemes need to be reconsidered and tools must be adapted.
* In so far as standards have emerged (e.g. CES), they need to be adapted in order to be able to cater for the needs of spoken language corpora.
* By their very nature, spoken language corpora bring together speech and language technologists and linguists from various backgrounds. Ideally, such corpora should address the needs of all these different user groups. Often, however, there is a conflict of interest. For example, the quality of recordings of spontaneous conversations in noisy environments, although highly interesting and worthwhile from a linguistic perspective, will prove too poor to be of any use to someone doing research into speech recognition.

Workshop topics

Topics of interest include orthographic transcription, phonetic transcription, prosodic annotation, segmentation, POS tagging and lemmatisation, parsing, and discourse analysis. Contributions on the development and implementation of standards or guidelines for spoken language corpora (annotation schemes, meta-data descriptions) are also invited, as are contributions describing software for the exploitation of spoken language corpora.

Format of the Workshop

The workshop will consist of oral presentations of previously submitted papers that went through a double peer review process.
The proceedings of the workshop will be published by the local organising committee.

Important dates

24th January 2004: Deadline for submission of (full) papers
1st March 2004: Notification of acceptance and preliminary programme
21st March 2004: Deadline for submission of final versions of accepted papers for the proceedings
3rd April 2004: Definitive programme
24th May 2004: Workshop

Submissions

Prospective authors are invited to submit papers for oral presentation. Only full papers in English will be accepted, and the length of the paper should not exceed 6000 words (or the equivalent in space for diagrams). Submissions in MS Word, Postscript, PDF or RTF should be submitted through the workshop website: http://lands.let.kun.nl/CPSLC/

Registration

Workshop participants need to register through the LREC website: http://www.lrec-conf.org/lrec2004/
The fee for this half-day workshop is 50 Euro for conference participants and 85 Euro for others, and includes a coffee break and the workshop proceedings.
Organising committee

Nelleke OOSTDIJK, University of Nijmegen
Gjert KRISTOFFERSEN, University of Bergen
Geoffrey SAMPSON, University of Sussex

Programme committee

Daan BROEDER, Max Planck Institute
Emanuela CRESTI, University of Florence
Gjert KRISTOFFERSEN, University of Bergen
Tony MCENERY, University of Lancaster
Nelleke OOSTDIJK, University of Nijmegen
Pavel IRCING, University of Western Bohemia
Geoffrey SAMPSON, University of Sussex
Antonio Moreno SANDOVAL, University of Madrid
Jean VÉRONIS, Université de Provence

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 19 19:27:23 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 19 Jan 2004 20:27:23 +0100
Subject: Appel: Workshop : Beyond Named Entity Recognition Semantic labelling for NLP tasks
Message-ID:

SECOND ANNOUNCEMENT AND CALL FOR PAPERS

Workshop
Beyond Named Entity Recognition
Semantic labelling for NLP tasks
URL: http://ai-nlp.info.uniroma2.it/ws_lrec04/
Centro Cultural de Belem, LISBON, Portugal
25th May 2004

In Association with
4th INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
LREC2004
Main conference 26-27-28 May 2004

Motivation and Aims

Although it is generally assumed that improvements in language processing will be made through the integration of linguistic information and statistical techniques, the reality is that language is very diverse and looking for specific patterns of words that repeat enough to be statistically significant tends not
to be a very fruitful task: sequences longer than three words are not generally repeated often enough to be statistically significant. At the same time, the identification of named entities (names, dates, places, organizations, etc.) has proved to be a very useful preliminary task in many natural language processing systems. We are interested in pursuing approaches which extend this notion by identifying and labeling other semantic information in a text, in such a way as to allow repeatable semantic patterns to emerge. Our interest is in attacking the data sparseness problem by exploring ways to collapse (semantically) related phrases which are expressed by different word sequences. Although this seems closely related to previously proposed class-based language models (see for example Brown et al. 90 in Computational Linguistics), it is distinguished by the fact that the empirical notion of classes used in the previous work (e.g. classes made up of collocationally similar words) is replaced by semantically justified sets. Notice how Named Entity (NE) tagging and Word Sense Disambiguation (WSD) represent, in terms of granularity and representational complexity, two extremes of a single general problem: semantic disambiguation. Semantic disambiguation thus serves the purpose of improving the generalization power of statistical models. One of the questions here is how to determine a suitable level of clustering (for NE identification and for WSD) that would lead to high accuracy and to improved performance of the resulting statistical models.

Reasons for Interest

It should be noted that a body of independent recent research on the statistical treatment of semantic phenomena (e.g. WordNet navigation as a stochastic process, as studied in Light and Abney or in Ciaramita & Johnson) correlates highly with the research program proposed above.
The workshop will represent a forum where experience from lexical semantics and statistical learning will be presented and fruitful discussion among researchers in both fields will be promoted. The workshop is expected to attract researchers and practitioners from a range of areas as well as developers of large scale semantic resources who are interested in effective methods of semantic labeling.

Topics (to be addressed in the workshop include, but are not limited to)

* Methods for lexical-semantic annotation of corpora
* Methods and Standards for lexical semantic representation of dictionary information
* Lexico-semantic taxonomies
* Existing sources of classification: dictionaries, thesauri and computerized ontologies
* Corpus-driven methods for semantic disambiguation
* Feature selection for semantic disambiguation
* Lexico-semantic tagging of very large corpora
* Algorithms and methods for disambiguation of semantic phenomena
* Statistical learning models and their applications to semantic labeling
* Computational learning frameworks for Natural Language Learning
* Semi-supervised and unsupervised statistical semantic disambiguation
* Evaluation of semantic disambiguation

Workshop format

The workshop will be a half-day event with position statements from invited speakers (half an hour each) and two hours for 4-6 presentations of scientific papers. Submissions are intended to present works in progress and more completed works which fall within the scope defined by the topics listed above. A final 1 hour open discussion among all the workshop participants will be moderated by the organizers. In order to stimulate an interesting general discussion, each member of the program committee will be invited to submit a position statement of max. 1000 words.

Submission

Participants are invited to submit an extended abstract of max. 3500 words concerning one or more of the topics of interest.
Each accepted paper receives a slot of 25 minutes for presentation (15 minutes talk and 10 minutes for discussion). Each submission should show: title; author(s); affiliation(s); and contact author's e-mail address, postal address, telephone and fax numbers. Submissions must be sent electronically in PDF to the following address:

Roberto Basili
Dept. of Computer Science, Systems and Management
University of Roma Tor Vergata
e-mail: basili at info.uniroma2.it

Proceedings and Publications

Proceedings of the workshop will be printed by the LREC Local Organising Committee. The Computer, Speech and Language journal will dedicate a Special Issue on Semantic tagging/labelling for NLP tasks to the workshop topics. Relevant papers submitted to the workshop will be selected to appear in that special issue.

Important dates

Extended abstract submission (max. 3500 words): 2nd of February 2004
Notification of acceptance: 5th of March 2004
Preliminary Program: 10th of March 2004
Submission of the final version of paper: 20th of April 2004
Workshop: 25th May 2004

Organising Committee

Louise Guthrie - University of Sheffield, UK
Roberto Basili - University of Rome, Tor Vergata, Italy
Eva Hajicova - Charles University, Czech Republic
Frederick Jelinek - Johns Hopkins University, Maryland, USA

Further Information

For any information related to the organization, please contact:
Roberto Basili
e-mail: basili at info.uniroma2.it
Dept.
of Computer Science, Systems and Management
University of Roma Tor Vergata
Via di Tor Vergata
00133 Roma (ITALY)
tel: +39 06 72597391
fax: +39 06 72597460

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 19 19:27:21 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 19 Jan 2004 20:27:21 +0100
Subject: Appel: JEP/TALN2004 : TRAITEMENT AUTOMATIQUE DE LA L'ARABE : DEADLINE EXTENSION
Message-ID:

!!!!!! Deadline extended to 20/01/2004 !!!!!!

****************************************************
JEP 2004 - TALN 2004
- Special Session -
AUTOMATIC PROCESSING OF WRITTEN AND SPOKEN ARABIC
2nd call for papers
Palais des Congrès, Fez (Morocco), 19 to 22 April 2004
http://www.lpl.univ-aix.fr/jep-taln04/
http://www.fsdmfes.ac.ma/jep-taln04/
****************************************************

Because of its morphological, syntactic, phonetic and phonological properties, Arabic is considered one of the languages that are difficult to handle in the automatic processing of written and spoken language. In the automatic processing of written Arabic, research began around the 1970s, even before the problems of editing Arabic texts had been fully mastered. The first work dealt in particular with lexicons and morphology.
Over the past ten years or so, the internationalisation of the Web and the proliferation of means of communication in Arabic have revealed a large number of applications for Arabic NLP. Research has thus begun to address more varied problems such as syntax, machine translation, automatic document indexing, information retrieval, etc.

In the automatic processing of spoken Arabic, considerable progress has been made thanks to improved signal processing technologies and to richer knowledge of prosodic and segmental characteristics and of the various acoustic models relating to Arabic morphological patterns (schèmes). These results should make it possible to better address varied and innovative areas such as speech recognition and synthesis, spoken language translation, or automatic recognition of the speaker and of the speaker's geographical origins, etc.

The aim of this session is to bring together researchers working on the automatic processing of Arabic, from both the written language and the spoken language communities. The meeting will be an opportunity to take stock of advances in these areas, at the scientific and application levels, in monolingual or multilingual contexts. Strengthening collaboration between the written and spoken Arabic communities is also one of the aims of this session.
THEMES

The themes to be addressed in this session on the automatic processing of written and spoken Arabic include, but are not limited to:
- Speech recognition and understanding,
- Speech synthesis,
- Automatic generation of prosody,
- Recognition of the language, of the speaker and of the speaker's geographical origins,
- Arabic corpora and language resources,
- Speech acquisition in synthesis and automatic speech recognition systems,
- Morphology,
- Syntax,
- Semantics,
- Analysis and generation,
- Discourse analysis,
- Automatic summarization,
- Dialogue,
- Machine translation.

CALENDAR

Submission deadline: 20 January 2004
Notification to authors: 20 February 2004
Final version (camera-ready): 8 March 2004
Conference: 19-22 April 2004

SELECTION CRITERIA

Authors are invited to submit original, previously unpublished research work. Submissions will be reviewed by at least two specialists in the field. The following will be considered in particular:
- the importance and originality of the contribution,
- the correctness of the scientific and technical content,
- the critical discussion of the results, in particular with respect to other work in the field,
- the positioning of the work in the context of international research,
- the organisation and clarity of the presentation,
- the relevance to the themes of the conference.

LANGUAGES

Papers must be written in French or in English.

SUBMISSION FORMAT

The PDF format MUST be used. In certain special cases, we will accept contributions in RTF (Word) format. Submitted papers must not exceed 6 to 10 pages in Times 12, single spaced (about 3000 words), including figures, examples and references. Papers must be in A4 format.
- Download the LaTeX style file:
- Download the Word template (French version):
- Instructions for creating PDF files:

SUBMISSION PROCEDURE

Authors must send their submission as a document attached to an e-mail containing the title of the paper and the name, affiliation, postal address, e-mail address, telephone number and fax number of the main author. Submissions by e-mail must be sent to the following address:

The subject line of the message must contain the mention: JEP-TALN-2004-Arabic

If sending by e-mail is impossible, a submission by post will be accepted. A diskette and 3 paper copies of the contribution must be sent to one of the following two addresses:

Malek Boualem
France Telecom R&D - DMI/GRI
2, avenue Pierre Marzin
22307 Lannion - France

or

Noureddine Chenfour
Département de Math. et Informatique
Faculté des Sciences Dhar El Mahraz, Fès
BP : 1796 Atlas, Fès - Maroc

SCIENTIFIC COMMITTEE

- Abderrahim Benabbou, FST de Fès, Morocco.
- Mohammed Benkhalifa, Faculté des Sciences, Rabat, Morocco.
- Thami Benkirane, Université Sidi Mohammed, Morocco.
- Malek Boualem, France Telecom R&D, France.
- Achraf Chalabi, Sakhr, Egypt.
- Noureddine Chenfour, Université Sidi Mohammed, Fès, Morocco.
- Khalid Choukri, ELRA/ELDA, France.
- Fethi Debili, CNRS, Paris, France.
- Emilie De Neef, France Telecom R&D, France.
- Joseph Dichy, Université Lumière-Lyon 2, France.
- Everhard Ditters, University of Nijmegen, Netherlands.
- Mohamed Embarki, Laboratoire de Phonétique Montpellier, France.
- Mohammed Hassoun, ENSSIB, Lyon, France.
- Med Tayeb Laskri, Université Badji Mokhtar, Algeria.
- Fabrice Lefevre, LIMSI, Université Paris-Sud Orsay, France.
- Chafic Mokbel, Université Balimand, Lebanon.
- Abdelhak Mouradi, ENSIAS Rabat, Morocco.
- Omar Nouali, CERIST, Algeria.
- Abdenbi Rajouani, ENS de Fès, Morocco.
- Mustafa Yaseen, ATS Online, Jordan.
- Mohamed Yeou, Université Chouaib Doukkali El-Jadida, Morocco.
- Chakir Zeroual, Université Sidi Mohamed, Fès, Morocco.
- Adnane Zribi, ISG, Université de Tunis, Tunisia.

****************************************************

------------------------------------------------------
Malek Boualem
France Telecom R&D - DMI/GRI
2, avenue Pierre Marzin - 22307 Lannion - France
Tel: (33)(0)2.96.05.29.83
Fax: (33)(0)2.96.05.32.86
Email: malek.boualem at rd.francetelecom.com
------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 19 19:27:24 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 19 Jan 2004 20:27:24 +0100
Subject: Appel: MODELLING AND DESCRIBING DISCOURSE ORGANISATION IN THE AGE OF THE DIGITAL DOCUMENT
Message-ID:

===============================================
2nd CALL FOR PAPERS:
===============================================
MODELLING AND DESCRIBING DISCOURSE ORGANISATION IN THE AGE OF THE DIGITAL DOCUMENT
===============================================

A Workshop proposed by ATALA as part of the Digital Document Week (http://www.univ-lr.fr/sdn2004/)
La Rochelle, 22 June 2004
organised by Marie-Paule Péry-Woodley, ERSS/Université de Toulouse-Le Mirail (pery at univ-tlse2.fr)

The Digital Document Week aims to gather research communities dealing with digital documents from a variety of angles: media, technical and social modes of
mediation, relation with human activity. This ATALA workshop wishes to broach these questions from a linguistic point of view, focussing on digital documents as discourse, characterised by an internal organisation which needs to be understood and may be exploited in computer-based systems. The workshop aims to bring together three research areas concerned with the development of digital documents: the study of discourse organisation, corpus linguistics, computer-based applications for the exploitation of digital documents. For text and discourse linguistics, the proliferation of digital documents leads to new opportunities and new research questions, such as: - the application of corpus analysis methods to discourse: what kind of data can be regarded as relevant at this level of linguistic investigation? - the development of novel ways of accessing documents, which leads to a new emphasis on text structure and the potential exploitation of surface markers; - the impact of new document types on basic concepts in the field: cohesion, coherence, metadiscursive signalling. This workshop on written discourse organisation aims to bring together research from three domains which must seek points of convergence in the light of these new prospects: 1. Discourse organisation In order to apprehend a sequence of utterances as discourse, it is necessary to understand its organisation (to identify its segments and perceive their hierarchy and their relations). An old and fertile tradition approaches discourse organisation via the notion of discourse relations: semantico-pragmatic links between segments (propositions or sets of propositions) (cf. Péry-Woodley (ed) 2001). Other modes of organisation may be envisaged, via the notion of theme or topic for instance, or more recently through the discourse framing hypothesis (Charolles 1997). Research in this field can be placed in a continuum from pure 'conceptual' modelling to empirical methods (automatic segmenting, cf. 
Hearst 1997; shallow analyses, human or automatic, cf. Teufel & Moens 1999). The challenge is to hold both ends of the continuum in order to draw connections between the way 'things are put' in texts and the processes underlying discourse organisation at different levels of granularity (local vs. global organisation). The relationship between modelling approaches and empirical research has often seemed problematic, with empirical studies running the risk of losing track of structure as they focus on surface markers, while conceptual models tend to be difficult to test empirically. Corpus-based approaches, greatly facilitated by the move into the digital age, are bringing considerable changes to the discourse field, as they have done elsewhere in linguistics (Conrad 2002). 2. Corpus-based studies of linguistic correlates of discourse organisation As noted by several authors (Biber et al 1998 inter alia), though research on discourse organisation tends to make regular use of authentic data, the corpus is often seen as a source of examples rather than the object of the analysis as such. The implementation of a fully-fledged 'corpus approach' in the field of discourse organisation carries with it many difficulties: corpus construction (common sampling-based techniques make it impossible?), the role of quantitative analysis, and, most of all, the definition of relevant data that make it possible to draw the connection between surface markers (which may be just epiphenomena) and the multiple principles underlying complex hierarchic organisation. A gap can also be observed between linguistic approaches (low coverage and high reliability) and numerical approaches (high coverage and low reliability). Articulating these approaches may open new prospects, leading to fresh insights into discourse organisation principles as well as more operational methods for applications. 3. 
Computer-based systems for the exploitation of digital documents Applications for which the relevant unit is the whole document are little concerned with questions of discourse organisation, but those concerned with intra-document browsing, selective synthesis or multi-level visualisation must work their way inside the documents and therefore cannot consider them as simple 'bags of words': they have to take into account the organisation into thematic or rhetorical chunks and text architecture (cf. Luc & Virbel 2001). These objectives bring about new research questions, in particular around the articulation of different organisational levels in long documents (where browsing aids acquire particular relevance). This call for papers concerns researchers who are already working on these interactions, as well as those whose work is in one of the domains referred to but who are interested in a dialogue with other discourse approaches. Descriptive studies which pay specific attention to methodology will be particularly welcome. Some relevant themes (non-exhaustive list): - identification of objects or text zones corresponding to text or discourse acts (conclusions, explanations, evaluations, ...) - discourse organisation markers (from markers to relations: inductive approach): connection, indexing (discourse frames), textual metadiscourse - linguistic characterisation of discourse functions (from functions to markers: deductive approach) - segmentation (automatic or manual): 'topic shifts', clues to segment boundaries (lexico-syntactic, typographical, dispositional) - articulation between local and global organisation - impact of discourse genre on discourse organisation and its linguistic markers - analysis and exploitation of document architecture - topological approaches - discourse annotation SUBMISSION (MODALITIES) A summary (2-4 pages, Word, pdf or ps) should be e-mailed by January 30th 2004 to Marie-Paule Péry-Woodley. 
Notification of acceptance will be given by March 15th 2004. *************************************************************************** References Biber, D., Conrad, S., & Reppen, R. (1998). Corpus linguistics: Investigating language structure and use. Cambridge: Cambridge University Press. Conrad, S. (2002). Corpus linguistics approaches for discourse analysis. Annual Review of Applied Linguistics, 22, 75-95. Charolles, M. (1997). L'encadrement du discours : Univers, champs, domaines et espaces (Cahier de Recherche Linguistique 6): Université de Nancy2. Hearst, M. (1997). TextTiling: segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23(1), 33-64. Luc, C., & Virbel, J. (2001). Le modèle d'architecture textuelle : fondements et expérimentation. Verbum, 23(1), 103-123. Péry-Woodley, M.-P. (ed.) (2001). Cohérence et relations de discours à l'écrit. Présentation. Verbum, 23(1). Teufel S. & Moens, M. (1999). Discourse-level argumentation in scientific articles: human and automatic annotation. In: Towards Standards and Tools for Discourse Tagging. ACL 1999 Workshop. 
___ Marie-Paule PERY-WOODLEY ___________________________________________________________________ ERSS / Sciences du Langage Universite de Toulouse Le Mirail Tel.: 33(0)5 61 50 46 76/-36 09 5 allees Antonio-Machado Fax: 33(0)5 61 50 42 12 F-31058 TOULOUSE CEDEX Email: pery at univ-tlse2.fr ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:30 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 21 Jan 2004 18:09:30 +0100 Subject: Conf: ATALA : Characterisation of Internet content: beyond keywords. Semantic approaches Message-ID: ====================================================== Workshops of the Association pour le Traitement Automatique des LAngues (ATALA) Subject: Characterisation of Internet content: beyond keywords. Semantic approach. Workshop organized by : François Rastier (CNRS - UMR 7114, Paris X - MoDyCo), Natalia Grabar (CRIM/INaLCO, STIM / DSI / AP-HP, Paris 6) and Thomas Beauvisage (France Télécom R&D - DIH/UCE, Paris X - MoDyCo) Date: Jan. 
31 2004 Location: ENST, 49, rue Vergniaud, 75013 Paris Amphithéâtre Emeraude Metro: Corvisart Free entry ATALA membership recommended (http://www.atala.org) Contact for this workshop : indices.internet at ml.free.fr Program: 9h15 Presentation of the Workshop 9h30 Thomas Beauvisage (France Télécom R&D) Utiliser les annuaires du Web pour décrire les parcours sur la Toile (Using Web directories to describe users' paths) 10h00 Kamel Smaïli and Armelle Brun (LORIA) Routage automatique de courriers électroniques (Automatic routing of emails) 10h30 Break 11h00 Antoine Marzin, Lionel Martin, Christel Vrain and Guillaume Cleuziou (LIFO, U. Orléans) Classification de pages Web en Genre (Genre-based Web page classification) 11h30 Martine Hurault-Plantet (LIMSI-CNRS) Sélection de traits et détection de thèmes pour l'analyse d'un corpus de pages personnelles Web (Feature selection and topic detection for the analysis of a corpus of personal Web pages) 12h00 Lunch 14h00 Aurélie Névéol, Lina Soualmia, Alexandrina Rogozan, Magaly Douyère, Benoît Thirion, Stéfan Darmoni (CISMeF, Rouen / PSI-CNRS / U. 
Rouen) Caractérisation des contenus de l'Internet en santé : l'exemple CISMeF (Characterisation of Health-related Internet content: the CISMeF example) 14h30 Mathieu Valette (CRIM, Inalco) Projet Princip : application de règles sémantiques à la détection de documents racistes sur Internet (The Princip project: application of semantic rules to the detection of racist documents on the Internet) 15h00 Break 15h30 Monika Nicinski (CRIM, Inalco) Typologie et description sémantique des images utilisées dans les sites Internet racistes (Typology and semantic description of images used in racist Web sites) 16h00 François Rastier (CNRS - UMR 7114, Paris X - MoDyCo) La sémiotique du document numérique et son incidence sur les traitements sémantiques (The semiotics of the digital document and its impact on semantic processing) 16h30 Round table 17h00 End of the Workshop Important: On Saturdays, access to ENST is via rue Vergniaud (on the other side of the block from rue Barrault). Remember to bring the day's program with you; you will be asked for it at the security desk. 
=================================================== ______________________________ Thomas Beauvisage France Télécom R&D/DIH/UCE 38-40, rue du Général Leclerc 92794 Issy Moulineaux Cedex 9 - France Tel : + 33 (0)1 45 29 58 11 Fax : + 33 (0)1 45 29 01 06 http://www.francetelecom.com/rd/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:35 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 21 Jan 2004 18:09:35 +0100 Subject: Appel: WORKSHOP ON THE REPRESENTATION AND PROCESSING OF SIGN LANGUAGES Message-ID: CALL FOR PAPERS ============================== WORKSHOP ON THE REPRESENTATION AND PROCESSING OF SIGN LANGUAGES Workshop URL: http://dev.eurac.edu:8080/sign/index.html From SignWriting to Image Processing. Information techniques and their implications for teaching, documentation and communication. Workshop on the occasion of the 4th International Conference on Language Resources and Evaluation (LREC 2004) Date: 30-May-2004 (at the closing of the main conference LREC 2004) Location: Lisbon, Portugal Linguistic Sub-field: SignWriting, Computational Linguistics, Corpus Linguistics, Image Processing, Lexicography Meeting Background ============================== Sign Languages are the languages used by deaf communities for non-written communication. This kind of linguistic code relies on the visual-gestural modality of communication. 
Sign languages all share properties which do not exist in spoken languages, especially through co-occurring sign elements. For storing and retrieving information, sign languages may be encoded and processed electronically. Different techniques are possible and have been proposed. The workshop will focus on the problem of representing sign languages electronically in order to facilitate communication among the deaf as well as between deaf and hearing people. Moreover, it will promote the documentation and teaching of sign languages to both communities and stimulate linguistic research on sign languages. The task of transcribing these languages onto electronic media as a main technique for storing, retrieving or communicating (email, telephone, snail mail, children's e-book) is technically and linguistically challenging. Recent advances in corpus linguistics and image processing, together with the development of XML standards, promise to pave the way for a broader application of these techniques. The workshop sets out to provide an introduction to the different approaches and techniques currently employed and to discuss their applications and respective advantages. Preliminary Meeting program ============================== 30-May-2004 9:00-10:30: Presentations of invited talks - Richard Gleaves (Deaf Action Committee For SignWriting) - Thomas Hanke (Institute of German Sign Language and Communication of the Deaf, University of Hamburg) - Carol Neidle & Robert G. 
Lee (Department of Modern Foreign Languages and Literatures, Boston University, Boston MA) 11:00-18:00: Oral presentations, poster presentations and demos Call for papers ============================== Papers are invited on substantial, original and unpublished research on all aspects of sign language representation and processing, including, but not limited to: *sign writing *corpus construction for sign languages *sign language dictionaries *sign language technologies *e-learning of sign languages *any topic related to sign language treatment and processing Submissions of papers for oral and poster presentations should follow the same style as regular LREC papers and not be longer than 6000 words. The final details will be published as soon as they become available. Demonstrations and related tools will be reviewed as well; for these, please send an outline of about 400 words. If a demo is connected to a paper, please attach the outline to the paper. The papers and demonstration outlines, written in English, should be attached to an email message sent to the following address: ostreiter at eurac dot edu. Please include the name and the affiliation of the author(s) in the body of the email message. The deadline for paper submission is February 11th, 2004. Notice of acceptance or rejection will be sent on February 24, 2004. We allow simultaneous paper submission to the workshop and the LREC main conference. If a paper is accepted by both the conference and the workshop, the paper will be presented at the conference, rather than at the workshop. The author(s) should notify the workshop chair. Papers will be published in the proceedings of this workshop (each workshop and the main conference have separate proceedings) and may, depending on conference policy, be included in the main conference CD-ROM. 
Organizing Committee ============================== - Antônio Carlos da Rocha Costa (Escola de Informática, Universidade Católica de Pelotas, Brazil): rocha at atlas.ucpel.tche.br - Carol Neidle (Department of Modern Foreign Languages and Literatures, Boston University, Boston MA): carol at bu.edu - Chiara Vettori (Language and Law, European Academy Bolzano, Italy): cvettori at eurac.edu - Christian Retoré (Laboratoire Bordelais de Recherche en Informatique, France): retore at labri.fr - Eva Safar (School of Computing Sciences, University of East Anglia, Norwich, England): esafar at yahoo.com - Ian Marshall (School of Computing Sciences, University of East Anglia, Norwich, England): im at cmp.uea.ac.uk - Marco Consolati (Cooperativa Alba , Torino, Italy): bigmark at mclink.it - Oliver Streiter (Language and Law, European Academy Bolzano, Italy): ostreiter at eurac.edu - Patrice Dalle (Équipe "Traitement et Compréhension d'Images", IRIT - Université Paul Sabatier France): dalle at irit.fr Important Dates ============================== 11 February 2004 : Deadline for paper submissions 25 February 2004 : Notification of acceptance to authors ** : Deadline for Camera-ready papers 30 May 2004 : Workshop Contact ============================== Contact person: Oliver Streiter Contact Email: ostreiter at eurac.edu Workshop URL: http://dev.eurac.edu:8080/sign/index.html ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:42 2004 From: alexis.nasr at 
LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 21 Jan 2004 18:09:42 +0100 Subject: Conf: COLDOC' 2004 : NEW DEADLINE FOR SUBMISSION : 15 FEBRUARY 2004 Message-ID: (apologies for multiple postings) COLDOC' 2004 CALL FOR PAPERS The Setting up of Observables in Linguistics ************************** NEW DEADLINE FOR SUBMISSION : 15 FEBRUARY 2004 ************************** Young researchers' conference - Nanterre, France - April 29 & 30, 2004 The young researchers of the Modèle, Dynamique, Corpus research team (UMR 7114 CNRS - Université Paris-X Nanterre) are organizing a young researchers' conference, scheduled for April 29 and 30, 2004, on the Université Paris X-Nanterre campus. The setting up of observables in linguistics is the central topic of this conference, i.e. defining and making use of both attested and constructed data. Young researchers from all fields and domains of linguistics are, therefore, invited to submit a paper. Postgraduate students, Ph.D. students and postdoctoral researchers are invited to share insights and experience from their respective research areas. We expect papers addressing methodological and theoretical issues related to the process of setting up linguistic data, as well as data collection and use. For example, papers may address one of the following issues: - Relevance and selection of linguistic data; - Corpora and emerging linguistic phenomena; - Oral, written or signed data collection methodology and practice; - Questions related to corpus tools, transcription and encoding; - The use and place of quantitative methods, both generic and specific; - Qualitative methods; - Language, text genres or discourse comparison. Each conference session will start with an invited speaker's talk. A roundtable will be held at the end of the conference. Presentations should last 20 minutes, followed by 10 minutes for questions. 
The deadline for proposals is set on January 26, 2004. Proposals will be evaluated anonymously by the scientific committee. Authors are invited to send two separate files, in Word format: first, a two-page summary (3000 characters) of their paper; second, a file stating the authors' names, e-mail addresses and affiliations, together with the title of the paper. Authors may also state their preference regarding the format of their presentation: oral or poster. Proposals will be evaluated against a range of selection criteria, favoring papers that fully address the issue stated above, show methodological relevance and scientific interest, and state their point clearly. Proposals, as well as other requests, should be addressed to: , or by postal mail to the following address: ColDoc' 2004 MoDyCo (UMR 7114) Secrétariat sciences du langage Université Paris-X Nanterre, Bât. L 200, avenue de la République 92001 Nanterre Cedex France We look forward to welcoming you at Nanterre Université for the conference. The Organizing Committee: Antonio Balvet, Sophie Hamon, Sylvain Loiseau, Ali Tifrit, Cécile Vigouroux. Scientific Committee: --------------------- Driss Ablali Karine Baschung Gabriel Bergounioux Simon Bouquet Nick Clements Marcel Cori Sophie David Annie Delaveau Bernard Fradin Françoise Gadet Nathalie Gasiglia Philippe Gréa Françoise Kerleroux Mark Klein Anne Lacheret Bernard Laks Sarah Leroy Colette Noyau Thierry Poibeau François Rastier Tobias Scheer Pascale Sébillot Anna Sores Nathalie Vallée Florence Villoing Geoffrey Williams. 
Important dates: --------------------- New submission deadline: February 15, 2004 Authors' notification of acceptance: March 22, 2004 Conference: April 29 & 30, 2004 The Setting up of Observables in Linguistics ColDoc'2004 - Modyco (UMR 7114) young researchers' conference Paris X Nanterre, Salle des colloques, Bâtiment B 200, avenue de la République 92001 Nanterre Cedex France Web site: http://infolang.u-paris10.fr/modyco/textes/actualites/Page.html ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:45 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 21 Jan 2004 18:09:45 +0100 Subject: Appel: ACL2004 Message-ID: ACL2004 NEWSLETTER NO. 1 (January 20, 2004) The Association for Computational Linguistics invites the submission of papers for its 42nd Annual Meeting hosted jointly with the European Chapter of the ACL. Papers are invited on substantial, original, and unpublished research on all aspects of computational linguistics. ACL 2004 will be held at the new Barcelona Forum Convention Centre, which is scheduled to be completed in December 2003, officially opening in April 2004. The ACL meeting will be part of the programme of the Forum of Cultures that will take place in Barcelona from April to September 2004. 
:: Main Dates Conference Tutorials: July 21, 2004 Main Conference: July 22-24, 2004 Post-conference Workshops: July 25-26, 2004 Paper submission due: February 25, 2004 :: Contents This newsletter includes: 1. Area chairs of the main conference 2. Mentoring service 3. Call for sponsorship 4. List of accepted workshops 5. Programme Committee of poster and demos 6. Call for papers of student research workshop 7. Call for tutorials 8. Others More details can be found at the website of the ACL2004 Conference. :: Area chairs of the main conference Elisabeth Andre (University of Augsburg, Germany) Jill Burstein (Educational Testing Service, USA) Claire Cardie (Cornell University, USA) Pascale Fung (University of Science and Technology, Hong Kong) Hitoshi Isahara (Communications Research Laboratory, Japan) Michael Johnston (AT&T, USA) Rada Mihalcea (University of North Texas, USA) Jon Oberlander (University of Edinburgh, UK) Kemal Oflazer (Sabanci University, Turkey) Kees van Deemter (University of Brighton, UK) Antal van den Bosch (University of Tilburg, The Netherlands) :: Mentoring service ACL is providing a mentoring (coaching) service for authors from regions of the world where English is not the language of scientific exchange. Many authors from these regions, although able to read the scientific literature in English, have little or no experience in writing papers in English for conferences such as the ACL meetings. The service will be arranged as follows. A set of potential mentors will be identified by Richard Power, who has agreed to organize this service for ACL'04. 
If you would like to take advantage of the service, send a draft of your paper to: Richard Power Information Technology Research Institute University of Brighton Watts Building Lewes Road Brighton BN2 4GJ UK +44 1273 642904 (office) +44 1273 642908 (fax) Email: Richard.Power at itri.brighton.ac.uk To take advantage of this service, send the paper electronically to the above email address, using pdf, ps or doc format. Alternatively, hard copy can be sent to the postal address. The paper should arrive before 1st February. An appropriate mentor will be assigned to your paper and the mentor will get back to you by 15th February, at least ten days before the deadline for the submission to ACL'04 program committee. Please note that this service is for the benefit of the authors as described above. It is not a general mentoring service for authors to improve their papers. If you have any questions about this service please feel free to send a message to Richard Power. :: Call for sponsorship Chair: Deborah Dahl (Conversational Technologies, USA) Local Chair: Antònia Martí (University of Barcelona, Spain) On behalf of the Association for Computational Linguistics (ACL), we invite commercial, government, and academic organizations who value and wish to promote the field of natural language processing technology to become sponsors of ACL2004, the 42nd Annual Meeting of the ACL. If you are interested in becoming a sponsor, please see the sponsors page for more details. Please also consider exhibiting your products at the conference. We are happy to announce that the following organizations have agreed to give their support at ACL04. 
Ajuntament de Barcelona Generalitat de Catalunya Spanish Government Universitat de Barcelona Universitat Autónoma de Barcelona Universitat Politècnica de Catalunya Universitat Pompeu Fabra Universitat Ramon Llull Deadlines: Sponsorship registration deadline: by 1st April 2004 top :: List of accepted workshops Workshop Committee: Srinivas Bangalore (AT&T Labs-Research, USA) ACL-2004 Workshop C Christopher Manning (Stanford University, USA) ACL-2004 Workshop C Helen Meng (CUHK, Hong Kong) ACL-2004 Workshop C Marcello Federico (IRST, Italy) :: Current Themes in Computational Phonology and Morphology Organizing Committee: Richard Wicentowski, Swarthmore College John Goldsmith, University of Chicago Important dates: Paper submission deadline: April 16, 2004 Notification of acceptance: May 7, 2004 Camera ready papers due: May 24, 2004 Workshop date: July 26, 2004 :: Discourse Annotation Organizing Committee: Bonnie Webber, University of Edinburgh Donna Byron, Ohio State University Important dates: Paper submission deadline: March 22, 2004 Notification of acceptance: April 30, 2004 Camera ready papers due: May 24, 2004 Workshop date: July 25, 2004 ::Incremental Parsing: Bringing Engineering and Cognition Together Organizing Committee: Stephen Clark, University of Edinburgh Matthew Crocker, Saarland University Frank Keller, University of Edinburgh Mark Steedman, University of Edinburgh Important dates: Paper submission deadline: March 22, 2004 Notification of acceptance: May 3, 2004 Camera ready papers due: May 24, 2004 Workshop date: July 25, 2004 :: Multiword Expressions: Integrating Processing Organizing Committee: Takaaki Tanaka, NTT Communication Science Laboratories, Japan Aline Villavicencio, University of Cambridge, UK Francis Bond , NTT Communication Science Laboratories, Japan Anna Korhonen, University of Cambridge, UK Important dates: Paper submission deadline: April 1, 2004 Notification of acceptance: May 1, 2004 Camera ready papers due: May 15, 2004 Workshop 
date: July 26, 2004 :: Question Answering in Restricted Domains Organizing Committee: Diego Mollá, Macquarie University, Australia José Luis Vicedo, Alicante University, Spain Important dates: Paper submission deadline: March 15, 2004 Notification of acceptance: April 15, 2004 Camera ready papers due: May 15, 2004 Workshop date: July 25 or 26, 2004 :: RDF/RDFS and OWL in Language Technology: 4th Workshop on NLP and XML (NLPXML-2004) Organizing Committee: Nancy Ide, Vassar College, USA Laurent Romary, Loria/CNRS, France Graham Wilcock, University of Helsinki, Finland Important dates: Paper submission deadline: April 1, 2004 Notification of acceptance: May 1, 2004 Camera ready papers due: May 15, 2004 Workshop date: July 25, 2004 :: Reference Resolution and Its Applications Organizing Committee: Sanda Harabagiu, University of Texas at Dallas David Farwell, New Mexico State University Important dates: Paper submission deadline: April 5, 2004 Notification of acceptance: April 25, 2004 Camera ready papers due: May 15, 2004 Workshop date: July 25-26, 2004 :: SENSEVAL-3 Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text Organizing Committee: Phil Edmonds, Sharp Laboratories of Europe Rada Mihalcea, University of North Texas Important dates: Registration: February 2004 Evaluations: March - April 2004 Paper submission deadline: April 20, 2004 Camera ready papers due: May 18, 2004 Workshop date: July 25-26, 2004 :: Tackling the challenges of terascale human language problems Organizing Committee: Miles Osborne, Univ. 
of Edinburgh Robert Malouf, San Diego State University Srinivas Bangalore, AT&T Labs-Research Important dates: Paper submission deadline: April 18, 2004 Notification of acceptance: April 30, 2004 Camera ready papers due: May 15, 2004 Workshop date: July 26, 2004 :: 2nd Workshop on Text Meaning and Interpretation Organizing Committee: Graeme Hirst, University of Toronto Sergei Nirenburg, University of Maryland, Baltimore County Important dates: Paper submission deadline: April 1, 2004 Notification of acceptance: April 30, 2004 Camera ready papers due: May 16, 2004 Workshop date: July 25-26, 2004 :: Text Summarization Branches Out Organizing Committee: Eduard Hovy, Information Sciences Institute, University of Southern California, USA Marie-Francine Moens (co-chair), Interdisciplinary Centre for Law & Information Technology, Katholieke Universiteit Leuven, Belgium Dragomir Radev, School of Information and Department of Electrical Engineering and Computer Science, University of Michigan, USA Stan Szpakowicz (co-chair), School of Information Technology and Engineering, University of Ottawa, Canada Important dates: Paper submission deadline: March 25, 2004 Notification of acceptance: April 25, 2004 Camera ready papers due: May 15, 2004 Workshop date: July 25-26, 2004 :: Third SIGHAN Workshop on Chinese Language Processing Organizing Committee: Oliver Streiter, Eurac, Italy Qin Lu, The Hong Kong Polytechnic University Important dates: Paper submission deadline: April 1, 2004 Notification of acceptance: May 1, 2004 Camera ready papers due: May 15, 2004 Workshop date: July 25-26, 2004 top :: Programme committee of poster and demos Philippe Blache, Université de Provence, France Rens Bod, University of Amsterdam, The Netherlands Christian Boitet, Université Joseph Fourier, France Antonio Branco, University of Lisbon, Portugal Francisco Casacuberta, Universitat Politècnica de València, Spain Ken Church, ATT Labs, USA Tomaz Erjavec, Jozef Stefan Institute in Ljubljana, 
Slovenia Roger Evans, University of Brighton, UK Marcello Federico, IRST, Italy Julio Gonzalo, UNED, Spain Nancy Ide, Vassar College, USA Ruslan Mitkov, Wolverhampton, UK Diego Mollá, Macquarie University, Australia Stefan Muller, Universität Bremen, Germany Kemal Oflazer, Sabanci University Istanbul, Turkey Patrick Paroubek, LIMSI, France German Rigau, EHU, Spain Horacio Rodríguez, Universitat Politècnica de Catalunya, Spain Laurent Romary, INRIA, France Graham Russell, RALI, Canada Eric Wehrli, LATL, Switzerland Shuly Wintner, University of Haifa, Israel Pierre Zweigenbaum, DIAM, France top :: Call for papers of student research workshop Faculty Advisor: Justine Cassell (Northwestern University, USA) Student Co-Chairs: Daniel Midgley (University of Western Australia, Australia) Dmitriy Genzel (Brown University, USA) Leonoor van der Beek (University of Groningen, Netherlands) The Student Research Workshop is an established tradition at ACL conferences. The workshop provides a venue for student researchers investigating topics in Computational Linguistics and Natural Language Processing to present their work and receive feedback. Participants will have the opportunity to receive feedback both from the general audience and from selected panelists -- experienced researchers who prepare in-depth comments and questions in advance of the presentation. One paper will be selected for the ACL-04 Student Research Workshop Best Paper Award. We invite all student researchers to submit their work to the workshop. As the main goal of the workshop is to provide feedback, the emphasis is on work in progress. Original and unpublished research is therefore invited on all aspects of computational linguistics. Papers should describe original work, still in progress. 
Submission will therefore normally be open only to students who have settled on their thesis direction but who still have significant research left to do; those students in the final stages of their thesis should consider submitting instead to the main conference. Submissions should follow the two-column format of ACL proceedings and should not exceed six (6) pages, including references. We strongly recommend the use of ACL LaTeX style files or Microsoft Word Style files tailored for this year's conference. Submission must be electronic. Electronic submissions should be sent as an attachment to the following e-mail address: acl04-student at list.cs.brown.edu. Note that reviewing of papers will be blind; therefore, please make sure your paper shows the title, but no author information. You should likewise not have any self-identifying references anywhere in the paper submitted for review. For example, rather than "We showed previously (Smith, 2001), ...", use citations such as "Smith (2001) previously showed ..." Deadlines: Paper submissions deadline: 8th March 2004 Notification of acceptance: 26th April 2004 Camera ready papers due: 25th May 2004 :: Call for tutorials Tutorials chair: Inderjeet Mani (Georgetown University, USA) The Program Committee of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL'04) invites proposals for the Tutorial Program for ACL'04. Proposals for tutorials should contain a title, the instructors' names, the length (3 hours or 6 hours), the expected audience size, followed by a brief (< 500 word) description of the content of the tutorial. The description should explain clearly the relevance of the tutorial to the ACL community. The description should also include a brief outline of the structure of the tutorial broken down by time (what topics will be covered in the different sections of the tutorial, in what order, and how much time is allotted to each).
Also, please include a brief statement of what the tutorial attendees can expect to learn from the tutorial, and what background (e.g., information extraction, statistical NLP, etc.) is expected of the attendees. Finally, if you have given a prior tutorial on this subject, or are aware of one, please let us know. Each proposal should also provide the names, postal addresses, phone numbers, and email addresses of the tutorial speakers, with a one-paragraph statement of their research interests and areas of expertise, along with any links to further on-line information. The proposal should also include any special requirements for technical needs (e.g., internet access). Proposals should be submitted by electronic mail, in plain ASCII text (iso8859-1). The subject line should be: "ACL'04 TUTORIAL PROPOSAL". Please submit your proposals and address any inquiries to tutorials at acl2004.org. Deadlines: Submission Deadline for Tutorial Proposals: 1 February 2004 Notification of acceptance of Tutorial Proposals: 25 February 2004 Tutorial Announcements due: 19 March 2004 Tutorial Course material due: 1 June 2004 :: Others (i) ACL LaTeX style files or Microsoft Word Style files for this year's conference.
http://www.acl2004.org/aclstyles/style.html (ii) Submissions for the main conference will be entered via a website http://pcger33.uia.ac.be:8080/acl04 top ------------------------------------------------------------------------- Message diffusé par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrainée par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhésion : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:49 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 21 Jan 2004 18:09:49 +0100 Subject: Conf: CROSS-LANGUAGE EVALUATION FORUM Message-ID: (Apologies for multiple postings) ********************************************************************* CROSS-LANGUAGE EVALUATION FORUM ELRA is happy to announce that registration for CLEF 2004 evaluation campaign is now open. CLEF 2004 - CALL FOR PARTICIPATION ********************************************************************* The CLEF series of system evaluation campaigns aims at promoting research and development in Cross-Language Information Retrieval. Registration is now open for CLEF 2004. The objective of CLEF 2004 will be to test different aspects of mono- and cross-language information retrieval system performance. 
There will be eight tracks this year:
a/ Multilingual Information Retrieval
b/ Bilingual Information Retrieval
c/ Monolingual (non-English) Information Retrieval
d/ Mono- and Cross-Language IR for Scientific Collections (GIRT)
e/ Interactive Cross-Language Information Retrieval (iCLEF)
f/ Multiple Language Question Answering (QAatCLEF)
g/ Cross-language Retrieval in Image Collections (ImageCLEF)
h/ Cross-Language Spoken Document Retrieval (CL-SDR)

IMPORTANT DATES:
- Data Release - from 15 February 2004
- Topic Release - from 15 March 2004
- Submission of Runs by Participants - 15 May 2004 (may vary slightly for some tracks)
- Release of relevance assessments and individual results - from 15 July 2004
- Submission of paper for Working Notes - 15 August 2004
- Workshop - 16-17 September (in conjunction with ECDL 2004)

For full details on the CLEF Agenda and Task Description for 2004 and instructions on How to Participate, see http://www.clef-campaign.org

For further information, contact: Carol Peters - ISTI-CNR Tel: +39 050 315 2987 Fax: +39 050 315 2810 E-mail: carol.peters at isti.cnr.it
---------------------------------------------------------------------------
ELRA / ELDA 55-57, rue Brillat-Savarin 75013 Paris FRANCE Tel: (+33) 1 43 13 33 33 / Fax: (+33) 1 43 13 33 30 URL: http://www.elra.info or http://www.elda.fr LREC conference: http://www.lrec-conf.org LangTech forum: http://www.lang-tech.org
---------------------------------------------------------------------------
-------------------------------------------------------------------------
Message diffusé par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrainée par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhésion : http://www.atala.org/
------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:52 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 21 Jan 2004 18:09:52 +0100 Subject: Appel: International Conference on Formal Ontology in Information Systems Message-ID: please distribute please distribute please distribute please distribute Apologies for multiple copies **** FOIS-2004 CALL FOR PAPERS **** International Conference on Formal Ontology in Information Systems http://www.fois.org November 4-6, 2004, Torino (Italy) Conference Description ---------------------- Just as ontology developed over the centuries as part of philosophy, so in recent years ontology has become intertwined with the development of the information sciences. Researchers in such areas as artificial intelligence, formal and computational linguistics, biomedical informatics, conceptual modeling, knowledge engineering and information retrieval have come to realize that a solid foundation for their research calls for serious work in ontology, understood as a general theory of the types of entities and relations that make up their respective domains of inquiry. In all these areas, attention has started to focus on the content of information rather than on just the formats and languages in terms of which information is represented. The clearest example of this development is provided by the many initiatives growing up around the project of the Semantic Web. And as the need for integrating research in these different fields arises, so does the realization that strong principles for building well-founded ontologies might provide significant advantages over ad hoc, case-based solutions. The tools of Formal Ontology address precisely these needs, but a real effort is required in order to apply such philosophical tools to the domain of Information Systems. 
Reciprocally, research in the information sciences raises specific ontological questions which call for further philosophical investigation. The purpose of FOIS is to provide a forum for genuine interdisciplinary exchange in the spirit of a unified ontological analysis effort. Although the primary focus of the conference is on theoretical issues, methodological proposals as well as papers dealing with concrete applications from a well-founded theoretical perspective are welcome. Invited Speakers ----------------- Peter Gärdenfors, Lund University Cognitive Science, Sweden Amie Thomasson, Department of Philosophy, University of Miami, USA Deadlines and Further Information --------------------------------- Abstracts: May 3, 2004 Final submissions: May 7, 2004 Acceptance Notification: June 25, 2004 Submission of camera-ready paper: July 30, 2004 Proceedings will be published by IOS Press and available at the conference. Submission is a two-step procedure: first abstracts, then full papers. Submitted papers must not exceed 5000 words (including bibliography). Abstracts should be less than 300 words. Electronic submission via the website is strongly preferred; if unavailable, submission via email or postal mail is possible. For details see: http://www.fois.org or contact one of the program chairs. Chairs ------ Conference Chair: Nicola Guarino (ISTC-CNR, Trento, Italy) nicola.guarino at loa-cnr.it Program Chairs: Achille Varzi (Columbia University, New York, USA) achille.varzi at columbia.edu Laure Vieu (IRIT-CNRS, Toulouse, France) laure.vieu at irit.fr Local Chairs: Maurizio Ferraris (University of Torino, Italy) ferraris at cisi.unito.it Leonardo Lesmo (University of Torino, Italy) lesmo at di.unito.it Topics ------ We seek high-quality papers on a wide range of topics. While authors may focus on fairly narrow and specific issues, all papers should emphasize the relevance of the work described to formal ontology and to information systems.
Papers that completely ignore one or the other of these aspects will be considered as lying outside the scope of the meeting. Topic areas of particular interest to the conference are: Foundational Issues - Kinds of entity: particulars vs. universals, continuants vs. occurrents, abstracta vs. concreta, dependent vs. independent, natural vs. artificial - Formal relations: parthood, identity, connection, dependence, constitution, subsumption, instantiation - Vagueness and granularity - Identity and change - Formal comparison among ontologies - Ontology of physical reality (matter, space, time, motion, ...) - Ontology of biological reality (genes, proteins, cells, organisms, ...) - Ontology of mental reality and agency (beliefs, intentions and other mental attitudes; emotions, ...) - Ontology of social reality (institutions, organizations, norms, social relationships, artistic expressions, ...) - Ontology of the information society (information, communication, meaning negotiation, ...) - Ontology and Natural Language Semantics, Ontology and Cognition Methodologies and Applications - Top-level vs. application ontologies - Ontology integration and alignment; role of reference ontologies - Ontology-driven information systems design - Requirements engineering - Knowledge engineering - Knowledge management and organization - Knowledge representation; Qualitative modeling - Computational lexica; Terminology - Information retrieval; Question-answering - Semantic web; Web services; Grid computing - Domain-specific ontologies, especially for: Linguistics, Geography, Law, Library science, Biomedical science, E-business, Enterprise integration, ... 
Programme Committee (to be confirmed) -------------------- Bill Andersen, OntologyWorks, USA Nicholas Asher, Dept of Philosophy, University of Texas at Austin, USA Nathalie Aussenac-Gilles, Research Institute for Computer Science, CNRS, Toulouse, France John Bateman, Dept of Applied English Linguistics, University of Bremen, Germany Brandon Bennett, Division of Artificial Intelligence, University of Leeds, UK Andrea Bottani, Dept of Philosophy, University of Bergamo, Italy Joost Breuker, Dept of Computer Science & Law, University of Amsterdam, The Netherlands Roberto Casati, Jean Nicod Institute, CNRS, Paris, France Werner Ceusters, Language & Computing, Belgium Tony Cohn, Division of Artificial Intelligence, University of Leeds, UK Robert Colomb, School of Computer Science and Electrical Engineering, University of Queensland, Australia Ernest Davis, Dept of Computer Science, New York University, USA Randall Dipert, Dept of Philosophy, State University of New York, Buffalo, USA Martin Dörr, Institute of Computer Science, FORTH, Heraklion, Greece Carola Eschenbach, Dept for Informatics, University of Hamburg, Germany Jérôme Euzenat, INRIA Rhône-Alpes, Grenoble, France Christiane Fellbaum, Cognitive Science Laboratory, Princeton University, USA & Berlin Brandenburg Academy of Sciences and Humanities, Berlin, Germany Maurizio Ferraris, Dept of Philosophy, University of Torino, Italy Antony Galton, School of Engineering and Computer Science, University of Exeter, UK Aldo Gangemi, Institute of Cognitive Sciences and Technologies, CNR, Rome, Italy Peter Gärdenfors, Lund University Cognitive Science, Sweden Pierdaniele Giaretta, Dept of Philosophy, University of Padova, Italy Michael Gruninger, Institute for Systems Research, University of Maryland College Park, USA & National Institute for Standards and Technology, USA Nicola Guarino, Institute of Cognitive Sciences and Technologies, CNR, Trento, Italy Patrick J. 
Hayes, Institute for Human and Machine Cognition, University of West Florida, USA Heinrich Herre, Institute of Informatics, University of Leipzig , Germany Jacques Jayez, ENS-Humanities, Lyon, France Ingvar Johansson, Institute for Formal Ontology and Medical Information Science, University of Leipzig, Germany Hannu Kangassalo, Dept of Computer and Information Sciences, University of Tampere, Finland Fritz Lehmann, USA Leonardo Lesmo, Dept of Computer Science, University of Torino, Italy Bernardo Magnini, Centre for Scientific and Technological Research, ITC, Trento, Italy David Mark, Dept of Geography, State University of New York, Buffalo, USA William E. McCarthy, Department of Accounting, Michigan State University, USA Robert Meersman, Dept of Computer Science, Free University of Brussels, Belgium Chris Menzel, Dept of Philosophy, Texas A&M University, USA Friederike Moltmann, Dept of Philosophy, Stirling University, UK Philippe Muller, Research Institute for Computer Science, University of Toulouse III, France John Mylopoulos, Dept of Computer Science, University of Toronto, Canada Sergei Nirenburg, Dept of Computer Science & Electrical Engineering, University of Maryland Baltimore County, USA Leo Obrst, MITRE, USA Massimo Poesio, Dept of Computer Science, University of Essex, UK Ian Pratt-Hartmann, Dept of Computer Science, University of Manchester, UK James Pustejovsky, Dept of Computer Science, Brandeis University, USA Steffen Schulze-Kremer, German Resource Center for Genome Research, Berlin, Germany Peter Simons, School of Philosophy, University of Leeds, UK Barry Smith, Dept of Philosophy, State University of New York, Buffalo, USA & Institute for Formal Ontology and Medical Information Science, University of Leipzig, Germany John Sowa, USA Veda Storey, Dept of Computer Information Systems, Georgia State University, USA Mike Uschold, The Boeing Company, USA Achille Varzi, Dept of Philosophy, Columbia University, USA Laure Vieu, Research Institute for 
Computer Science, CNRS, Toulouse, France Yair Wand, Management Information Systems Division, University of British Columbia, Vancouver, Canada Chris Welty, IBM Watson Research Center, USA Roel Wieringa, Computer Science Department, University of Twente, The Netherlands ------------------------------------------------------------------------- Message diffusé par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrainée par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adhésion : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:56 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 21 Jan 2004 18:09:56 +0100 Subject: Publications: BULAG no28: Modelling, Systemics, Translatability Message-ID: Just published Bulag (Bulletin de linguistique appliquee et generale) n°28, revue annuelle "Modelisation, systemique, traductibilite" Modelling, Systemics, Translatability Coordinated by Sylviane Cardey Published by PUFC (Presses Universitaires de Franche-Comté) Information and orders: http://tesniere.univ-fcomte.fr/bulag/bulag28.htm http://tesniere.univ-fcomte.fr/bulag/numero28.pdf Presses Universitaires de Franche-Comté (PUFC) UFR des Sciences Médicales et Pharmaceutiques Place St. 
Jacques 25030 BESANCON cedex France
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 28 10:20:07 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 28 Jan 2004 11:20:07 +0100 Subject: Publications: Alain Polguère (2003) Lexicologie et sémantique lexicale. Notions fondamentales Message-ID:
Just published: Alain Polguère (2003) Lexicologie et sémantique lexicale. Notions fondamentales. "Paramètres" series, Montréal, Les Presses de l'Université de Montréal, 264 pp. (ISBN: 2-7606-1860-9)
For more about the book, see the publisher's site: http://www.pum.umontreal.ca/livres/fiches/2-7606-1860-9.html
Distribution in Canada: Éditions Fides
Distribution in France, Belgium and Switzerland: Sofédis
Distribution in other countries: Exportlivre
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 28 10:20:18 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 28 Jan 2004 11:20:18 +0100 Subject: Jobs: Sinequa: development of a search engine for Greek Message-ID:
Sinequa is a company that develops, among other things, a search engine with a strong linguistic component. See http://www.sinequa.com for more information. We are looking for someone to develop the engine for Greek. The required skills are:
- Strong skills in linguistics or terminology (maîtrise or DESS level);
- Command of computer tools (office software, Internet) required;
- Script programming (Perl or other) highly valued;
- Perfect command of Greek.
The work will consist of developing morpho-syntactic lexicons, tagged corpora, analysis automata (named-entity recognition, etc.), and similar resources. The contract will last 3 to 6 months.
If you are interested, please send your CV by email to loupy at sinequa.com
Best regards,
-- Claude de Loupy - Head of Research, Sinequa - http://www.sinequa.com email: loupy at sinequa.com tel.: 33 1 49 87 06 00 - fax: 33 1 49 87 06 01
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 28 10:20:19 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Wed, 28 Jan 2004 11:20:19 +0100 Subject: Conf: JADT 2004 Message-ID:
JADT 2004 - Call for participation 7th International Conference on the Statistical Analysis of Textual Data March 10-12 2004 - Louvain-la-Neuve, Belgium www.jadt.org
***************************************************************************
Following Barcelona (1990), Montpellier (1993), Rome (1995), Nice (1998), Lausanne (2000) and Saint-Malo (2002), the 7th International Conference on the Statistical Analysis of Textual Data will be held in Louvain-la-Neuve (Belgium), on March 10-12, 2004. This biennial conference, which has constantly been gaining in importance since its first occurrence, is open to all scholars working in the vast field of textual data analysis, ranging from lexicography to the analysis of political discourse, from documentary research to marketing research, from computational linguistics to sociolinguistics, from the processing of data to content analysis.
After the success of the previous meetings, the three-day conference in Belgium will continue to provide a workshop-style forum through technical paper sessions, invited talks, and panel discussions. 1/ PROGRAM The JADT 2004 program features two keynote speakers: - Douglas BIBER, University of Northern Arizona, "A corpus analysis of vocabulary-based discourse unit types in conversation" - Claudia LEACOCK, Educational Testing Service (ETS, Princeton), "Statistical Analysis of Text for Educational Measurement" A first version of the program is available online: http://www.jadt.org/program.html 2/ REGISTRATION For the registration forms and more information, see the conference Web site (www.jadt.org). Please, note that we offer reduced registration fees until January 31st. 3/ IBIS FELLOWSHIPS We are pleased to announce the names of the IBIS-JADT 2004 grantees 6 IBIS Fellowships to take part in JADT 2004 have been awarded. Barbu Ana-Maria (Roumanie) Research Institute for Artificial Intelligence of the Romanian Academy (RACAI) "Simple Linguistic Methods for Improving a Word Alignment Algorithm" Edel Greevy (Irlande) Dublin City University "Text Categorisation of Racist Texts Using a Support Vector Machine" Forest Dominic (Quebec) UQAM, Laboratoire LANCI "Classification et categorisation automatiques: application a l'analyse thematique des donnees textuelles" Jalam Radwan (France) Universite: Lumiere Lyon 2 "Cadre pour la categorisation de textes multilingues" Misuraca Michelangelo (Italie) "Relazioni non Simmetriche tra Corpora" (Grassia, Misuraca, Scepi) University: Federico II of Naples M. Bagavandas et G. 
Manimannam (Inde) Madras Christian College, Department of statistics, Tambaram "Quantification Of Stylistic Traits: A Statistical Approach" -- Cedrick Fairon Directeur du CENTAL Centre de traitement automatique du langage Universite de Louvain Place Blaise Pascal, 1 1348 Louvain-la-Neuve Belgique ======================================= **** JADT 2004 in Louvain-la-Neuve **** 10-12 March 2004 7th International Conference on the statistical analysis of textual data 7th Journees internationales d'analyse statistique des donnees textuelles http://www.jadt.org Visit our web sites: http://cental.fltr.ucl.ac.be http://glossa.fltr.ucl.ac.be ======================================= -------------------------------------------------------------------------
-------------- next part -------------- An HTML attachment was scrubbed... URL: From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 6 09:23:34 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Tue, 6 Jan 2004 10:23:34 +0100 Subject: Appel: 7th INTEX/NooJ Workshop Message-ID: 7th INTEX/NooJ Workshop Tours, June 7-9 2004 Call for papers - Deadline: March 31, 2004 ORGANIZERS * Laboratoire d'Informatique de l'Universit? de Tours (E.A. 2101) * Langues et Repr?sentation, Equipe de recherche en linguistique, Universit? de Tours * LAboratoire de SEmioLinguistique, Didactique et Informatique (E.A. 2281) CALL We invite the submission of papers for the forthcoming seventh INTEX/NooJ workshop, to be held in Tours, June 7-9 2004. INTEX is a linguistic development environment that includes large-coverage dictionaries and grammars, and parses texts of several million words in real time. INTEX includes tools to create and maintain large-coverage lexical resources, as well as morphological and syntactic grammars. Dictionaries and grammars are applied to texts in order to locate morphological, lexical and syntactic patterns, remove ambiguities, and tag simple and compound words. INTEX can build lemmatized concordances of large texts from Finite-State or Context-Free grammars, and can accordingly perform transformation operations on texts in cascade, in order to annotate the text, or to generate paraphrases; these features, when applied in cascade, give INTEX the power of a Turing Machine. INTEX is used as a linguistic platform, an information retrieval system, to teach second languages, as a terminological extractor, as well as to teach computational linguistics to students. NooJ, which uses a new technology, a new linguistic engine and a new interface, is meant to replace INTEX. 
NooJ's architecture was presented in the 5th INTEX Workshop (Marseille, June 2002) and its first alpha version was demoed at the 6th INTEX Workshop (Sofia, May 27-29 2003). As in the previous workshops (1996, 1999, 2000, 2001, 2002 and 2003), this meeting will be the opportunity for INTEX and NooJ users, as well as other researchers interested in NLP, to meet and to exchange their experience of development, research or teaching. It will also be the occasion to present the recent developments of NooJ. Please, send before March 31st 2004 a one-page abstract to Denis Maurel by email. The abstract, in French or English, should contain the title of the article, name, author affiliations, surface mail and electronic address of each author. All papers will be reviewed by the program committee. Authors will be notified whether their papers are accepted or rejected by April 15th 2004. The timeslot is 30 minutes for presentations (including 5 minutes for discussions). After the conference, authors will be invited to send a definitive version of their papers for publishing. We are planning to combine a subset of the proceedings of the 6th and the 7th INTEX Workshops in a published volume. PROGRAM COMMITTEE * Xavier Blanco (Universidad Autonoma de Barcelona, Spain) * Gis?le Chevalier (Universit? de Moncton, Canada) * Ibekwe-SanJuan Fidelia (Universit? de Lyon 3, France) * Nathalie Friburger (LI, Universit? de Tours, France) * Svetla Koeva (BACL, IBL - BAS, Sofia, Bulgaria) * Stoyan Mihov (BACL, CLPP - BAS, Sofia, Bulgaria) * Denis Maurel (LI, Universit? de Tours, France) * Paul Sabatier (LIM, CNRS, Marseille, France) * Agata Savary (LI, Universit? de Tours, France) * Henrik Selsoe Sorensen (Copenhagen Business School, Danemark) * Max Silberztein (LASELDI, Universit? 
de Franche-Comté, France)
* Tamas Varadi (Hungarian Academy of Sciences, Hungary)
* Dusko Vitas (MATF, University of Belgrade, Serbia)

DEADLINES

Submission due date: March 31, 2004
Notification date: April 15, 2004
Registration: May 1, 2004
Camera ready date: June 30, 2004

NooJ TUTORIALS by Max Silberztein

* Initiation Tutorial, 20 persons maximum
* Teaching Linguistics with NooJ, 20 persons maximum

REGISTRATION FEE

The registration fee for the workshop is 30 euros for researchers, 15 euros for students and 40 euros for other categories. The conference will begin on Monday morning and last until Wednesday evening. There will be a reception on Monday evening and an optional excursion on Tuesday afternoon.

CONTACT

denis.maurel at univ-tours.fr
max.silberztein at univ-fcomte.fr
Web site: http://tln.li.univ-tours.fr/JIntex2004/Index.html

-------------------------------------------------------------------------
Message distributed by the Langage Naturel list
Information, subscription: http://www.biomath.jussieu.fr/LN/LN-F/
English version: http://www.biomath.jussieu.fr/LN/LN/
Archives: http://listserv.linguistlist.org/archives/ln.html
The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues)
Information and membership: http://www.atala.org/
-------------------------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 6 09:23:41 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 6 Jan 2004 10:23:41 +0100
Subject: Appel: TeL'04 - Technology Enhanced Learning
Message-ID:

------------CFP Tel04 --------------------------------------
"Raising the issues on School e-laboratories, utilizing novel pedagogical and evaluation theories"
22 August 2004
Toulouse, France
http://tel04.systema.gr/

TeL'04 - Technology Enhanced Learning
---------------------
Scope and organization:

Technology Enhanced Learning (TeL) has provided tools and
infrastructure to education and training disciplines for over a decade. Related issues are as varied as pedagogical and evaluation theories, integrated learning environments, experiments, trials and results from R&TD deployment.

Relying on recent experiences and promising results from R&TD projects, in particular EU endorsed initiatives (e.g. IST Projects: Lab@Future, Laboratory of Tomorrow, Mobilearn), the workshop will give educational institutions, experts, practitioners and technologists an opportunity to share their experience and possibly reach a consensus on open issues.

TeL'04 is a one-day workshop co-located with the WCC'2004 conference. It will take place among other WCC events in the Congress Center (downtown Toulouse). The program will include refereed papers and invited talks by distinguished researchers and practitioners. The proceedings will be published by Kluwer, the official publisher of IFIP conferences.

Topics of interest

This workshop will follow the trend of most international conferences on learning technologies today. Its distinguishing focus, however, will be to identify the enabling parameters to "leverage the promotion of key initiatives in putting the grassroots for TeL, especially for school e-laboratories utilizing novel pedagogical and evaluation theories".

Papers are solicited in the following areas:

* E-Learning
* Mobile learning
* Mixed and augmented reality in training
* Technologies in the school of tomorrow
* The learning citizen
* Collaborative learning
* Applying pedagogical theories
* The evaluation process of learning applications
* Shared virtual environments for learning
* Learning management systems
* Combining individualised with collaborative learning
* Applying and using eLearning standards
* Open learning environments
* Learning for All
* Technologies for science education
* Technologies for arts and humanities education

Accepted papers will appear in a book published by Kluwer.
Please refer to the Call for Papers page for details on paper submission.

Important Dates:

February 20, 2004: Submission of short and full papers (firm deadline)
March 20, 2004: Notification of acceptance
April 20, 2004: Camera ready papers

For more details and submission visit: http://tel04.systema.gr/

-----------------------------
-- Best wishes --

From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 6 09:23:44 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 6 Jan 2004 10:23:44 +0100
Subject: Appel: TALN 2004
Message-ID:

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fez (Morocco)
April 19 - 22, 2004
**********************************************************************
(see English version below)

TALN'04
--- In conjunction with JEP'04 ---
CALL FOR PAPERS

CALENDAR

Submission deadline: 15 January 2004
Notification to authors: 20 February 2004
Final (camera-ready) version: 8 March 2004
Conference: 19-22 April 2004

Jointly organized by the LPL (Laboratoire Parole et Langage, Aix-en-Provence, France), the University of Fez and the Ecole Normale Supérieure of Fez, the 11th edition of the conference on natural language processing (Traitement Automatique des Langues Naturelles, TALN'04) will be held from 19 to 22 April 2004 at the Palais des Congrès in Fez, Morocco.
The conference will include oral and poster presentations, invited talks, workshops and tutorials. The official languages of the conference are French and English.

TALN 2004 is organized under the aegis of ATALA (Association pour le Traitement Automatique des LAngues) and will be held jointly with the conference for young researchers RECITAL'04 (call for papers to be issued separately).

As in 2002, TALN will be organized jointly with the Journées d'Etude sur la Parole (JEP'04). Joint sessions will be organized, and participants will receive the proceedings of both conferences on CD-ROM.

TOPICS

Papers, presented as thirty-minute talks including questions, may address any of the usual topics of NLP, including but not limited to:

lexicon
morphology
syntax
semantics
pragmatics
discourse
parsing
generation
summarization
dialogue
machine translation
logical, symbolic and statistical approaches

Given the joint TALN/JEP organization and the location of the conference, TALN'04 encourages submissions in the following fields:

. techniques for speech and written language processing
. Arabic language processing

From the accepted papers, the programme committee will select two for publication (in an extended version) in the journal Traitement Automatique des Langues (t.a.l.). The journal will treat these papers as "accepted subject to modification", the modification being conversion to the journal's format.

SELECTION CRITERIA
---------------------

Authors are invited to submit original research work that has not been published previously. Submissions will be reviewed by at least two specialists in the field. The following will be considered in particular:

- the importance and originality of the contribution,
- the soundness of the scientific and technical content,
- the critical discussion of the results, in particular with respect to other work in the field,
- the positioning of the work in the context of international research,
- the organization and clarity of the presentation,
- the relevance to the topics of the conference.

Selected papers will be published in the conference proceedings.

SUBMISSION PROCEDURE
-----------------------

Submitted papers must not exceed 10 pages in Times 12, single spaced (about 3000 words), including figures, examples and references. Demonstration proposals and posters must not exceed 6 pages. A LaTeX style file and a Word template are available on the conference web site: http://www.lpl.univ-aix.fr/jep-taln04/.

Papers must reach the organizing committee before 15 January 2004, using the online submission form at the following address:

http://www.lpl.univ-aix.fr/jep-taln04/

One of the following formats MUST be used:

- PDF, RTF (Word)

All versions must be in A4 format. If electronic submission is impossible, a paper submission may be accepted. Three printed copies of the contribution must be sent to the following address:

Philippe Blache - TALN 2004
LPL, Université
de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr

PRACTICAL INFORMATION
----------------------

Practical information will be provided later, in particular on the conference web site: http://www.lpl.univ-aix.fr/jep-taln04/

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fez (Morocco)
April 19 - 22, 2004
**********************************************************************
--- In conjunction with JEP'04 ---
CALL FOR PAPERS

Important Dates
---------------

Submission deadline: 15 January 2004
Notification to authors: 20 February 2004
Camera-ready: 8 March 2004
Conference: 19-22 April 2004

Jointly organized by the LPL (Laboratoire Parole et Langage, Aix-en-Provence, France), the University of Fez and the Ecole Normale Supérieure of Fez, the 11th Conference on Natural Language Processing (TALN'04) will be held at the Palais des Congrès, Fez, Morocco, April 19-22, 2004. The conference will include oral and poster presentations, invited talks, workshops and tutorials. Official languages are French and English.

TALN'04 is organized under the aegis of ATALA (Association pour le Traitement Automatique des Langues, Association for NLP) and will be held jointly with JEP'04 (Journées d'Etude sur la Parole) and the conference for young researchers RECITAL'04 (call for papers to be issued separately). Joint sessions will be organized. The participants will receive the proceedings of the conferences on CD-ROM.

TOPICS
------

Papers are invited for thirty-minute talks, including questions, in all areas of NLP, including (but not restricted to):

. lexicon, morphology, syntax, semantics
. pragmatics, discourse, parsing, text generation
. abstraction/summarization, dialogue, machine translation
. logical, symbolic and statistical approaches

Moreover, TALN'04 encourages submissions in the following fields:

. techniques for speech and language processing
. applications for the Arabic language

All selected papers will be published in the proceedings. In addition, the programme committee will select two papers, extended versions of which will be published in the journal "Traitement Automatique des Langues" (T.A.L.).

SELECTION
---------

Authors are invited to submit original, previously unpublished research work. Submissions will be reviewed by at least two specialists in the field. Decisions will be based on the following criteria:

. importance and originality of the paper
. soundness of the scientific and technical content
. comparison of the results obtained with other relevant work
. clarity of the exposition
. relevance to the topics of the conference

Accepted papers will be published in the proceedings.

SUBMISSION PROCEDURE
--------------------

Submitted papers must not exceed ten pages, in Times 12, single spaced (about 3000 words), including figures, examples and references. Posters or demo papers should not exceed 6 pages. A LaTeX style file and a Word template are available on the web site of the conference: http://www.lpl.univ-aix.fr/jep-taln04/

Papers are to be submitted before January 15, 2004 through the online submission procedure available on the web site:

http://www.lpl.univ-aix.fr/jep-taln04/

Papers MUST be sent in PDF. In particular cases, we may accept submissions in RTF (Word) format.

IMPORTANT: All the PostScript versions must be in A4 format, and not US Letter.

If electronic submission is impossible, we will accept a printed version of the submission. In this case, three hard copies of the paper must be received by January 15, 2004 at:

Philippe Blache - TALN 2004
LPL, Université
de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr

PRACTICAL INFORMATION
---------------------

Practical information will be detailed shortly on the conference web site (http://www.lpl.univ-aix.fr/jep-taln04/) and in a further call. Please note that members of the ATALA association will benefit from reduced registration fees.

From alexis.nasr at LINGUIST.JUSSIEU.FR Thu Jan 8 17:04:13 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Thu, 8 Jan 2004 18:04:13 +0100
Subject: Appel: MEMURA-2004 : Workshop on Methodologies and Evaluation of Multiword Units ...
Message-ID:

********************* CALL FOR PAPERS *********************

MEMURA-2004
Workshop on Methodologies and Evaluation of Multiword Units in Real-world Applications (MEMURA Workshop)

INVITED SPEAKER: KENNETH W.
CHURCH

In association with the 4th International Conference On Language Resources and Evaluation - LREC 2004
Centro Cultural de Belém, Lisbon, Portugal
May 25, 2004
http://memura2004.di.ubi.pt

********************* CALL FOR PAPERS *********************

This announcement contains:

[1] Workshop Description
[2] Target Audience
[3] Areas of Interest
[4] Invited Speaker
[5] Important dates
[6] Abstract Submission
[7] Workshop Chairs
[8] Program Committee
[9] Contact

-------------------------------------------------------------------------
[1] Workshop Description:
-------------------------------------------------------------------------

Multiword units (MWUs) include a large range of linguistic phenomena, such as phrasal verbs (e.g. "look forward"), nominal compounds (e.g. "interior designer"), named entities (e.g. "United Nations"), set phrases (e.g. "con carne") or compound adverbs (e.g. "by the way"), and they can be syntactically and/or semantically idiosyncratic in nature. MWUs are used frequently in everyday language, usually to express precisely those ideas and concepts that cannot be compressed into a single word. A considerable amount of research has been devoted to this subject, both in terms of theory and practice, but despite increasing interest in idiomaticity within linguistic research, many questions still remain unanswered. The objective of this workshop is to deal with three important questions that are of great interest for real-world applications.

1) Comparison of MWU extraction methodologies

Many methodologies have been proposed in order to automatically extract or identify MWUs. However, little effort has been devoted to comparing their results. The core differences between the methodologies are certainly the main reason why such work is so rare. For instance, it is not easy to compare language-dependent methodologies, as the results depend on the efficiency of parameter tuning in the broad sense of the term (i.e.
semantic tagging, local specific grammars, lemmatization, part-of-speech tagging etc.). Another important problem is that researchers do not really agree on a definition of MWUs that would provide the basis for an objective evaluation. The objective of the workshop is to gather people who have recently been working in this area so that new trends in comparing MWU extraction methodologies and their evaluation can be identified.

2) Evaluation of the benefits of the integration of MWUs in real-world applications

It is not yet clear whether MWUs really improve NLP applications. It is commonly accepted that Machine Translation is one application that takes great advantage of MWU databanks. However, does the same apply to applications in Automatic Summarization, Information Retrieval (IR), Cross-language IR, Information Extraction, Text Clustering/Classification, Parallel Corpus Alignment? Indeed, could the identification of MWUs introduce new constraints that are not present in original texts? Should MWUs be considered as units that should not be analysable in terms of their components' meanings? Or should they be treated as unanalysable? Should NLP methods work both on isolated words and on aggregated MWUs? The answers are anything but clear. Here, the objective of the workshop is to point out successes and failures of the integration of MWUs in real-world applications.

3) Comparison of scalable architectures for the extraction and identification of MWUs

Real-world applications are constrained by variables like processing time and memory space. However, identifying and extracting MWUs is usually a computationally heavy process. In recent years, new algorithms and new technologies have been proposed to introduce MWU treatment in large-scale applications, thus avoiding previously intractable implementations. Previous workshops on MWUs have mainly focused on the unconstrained extraction process.
In this workshop, we would like to focus on the comparison of different factors that can influence the scalability of the treatment of MWUs in real-world applications, namely data structures, algorithms, parallel and distributed computing, grid computing etc. Indeed, as noted above, some extraction strategies may not scale to huge volumes of data.

-------------------------------------------------------------------------
[2] Target Audience:
-------------------------------------------------------------------------

This workshop is intended to bring together NLP researchers working on all areas of MWUs. The objective is to summarise what has been achieved in the area of MWUs in real-world applications, to establish common themes between different approaches, and to discuss future trends.

-------------------------------------------------------------------------
[3] Areas of Interest:
-------------------------------------------------------------------------

Abstracts are invited on, but not limited to, the following topics:

* Automatic, semi-automatic and manual evaluations of MWU extractors
* Resources for evaluating MWU extractors
* Evaluation Standards
* Cross-language and Cross-domain evaluations of MWU extractors
* Comparative evaluation of MWU extractors
* Evaluation of the integration of MWUs in NLP applications: Summarization, (Cross-language) Information Retrieval, Information Extraction, Machine Translation, Text Classification etc.
* Scalable algorithms, new data structures, Parallel and Distributed processing and Grid computing for MWU extraction and/or identification
* Comparative evaluation of extraction software architectures
* Role of isolated words and MWUs for a sense-based definition of MWUs

Abstracts can cover one or more of these areas.

-------------------------------------------------------------------------
[4] Invited Speaker:
-------------------------------------------------------------------------

Kenneth W.
Church (AT&T Labs Research, USA)

-------------------------------------------------------------------------
[5] Important dates:
-------------------------------------------------------------------------

Abstract submission deadline: February 23, 2004
Notification: March 15, 2004
Camera ready papers: April 12, 2004
Workshop: May 25, 2004

-------------------------------------------------------------------------
[6] Abstract Submission:
-------------------------------------------------------------------------

Abstracts should consist of about 1000 words. Abstracts should be submitted electronically, in PDF format only, to Gaël Harry Dias [ddg at di.ubi.pt]. The following URL converts PostScript files to PDF files (http://www.ps2pdf.com/). The subject line should be "LREC 2004 MEMURA WORKSHOP PAPER SUBMISSION".

Because reviewing is blind, no author information should be included as part of the abstract (i.e. the names of the authors and references that could identify the authors). An identification page must be sent in a separate email with the subject line "LREC 2004 MEMURA WORKSHOP ID PAGE" and must include title, author(s), keywords, word count and name and email of the contact author.

Late submissions will not be accepted. Notification of receipt will be emailed to the contact author shortly after receipt.

-------------------------------------------------------------------------
[7] Workshop Chairs:
-------------------------------------------------------------------------

Gaël Harry Dias (Beira Interior University, Portugal)
José
Gabriel Pereira Lopes (New University of Lisbon, Portugal)
Spela Vintar (University of Ljubljana, Slovenia)

-------------------------------------------------------------------------
[8] Program Committee:
-------------------------------------------------------------------------

Timothy Baldwin (Stanford University, United States of America)
Sophia Ananiadou (University of Salford, England)
Didier Bourigault (University of Toulouse, France)
Pascale Fung (University of Science and Technology, Hong Kong)
Mikio Yamamoto (University of Tsukuba, Japan)
Dekang Lin (University of Alberta, Canada)
Aline Villavicencio (University of Cambridge, England)
Heiki Kaalep (University of Tartu, Estonia)
Joaquim da Silva (New University of Lisbon)
Eric Gaussier (Xerox Research Centre Europe, France)
Adeline Nazarenko (University Paris XIII, France)
António Branco (Lisbon University, Portugal)

-------------------------------------------------------------------------
[9] Contact:
-------------------------------------------------------------------------

Contact: Gaël Harry Dias
Human Language Technology Interest Group
Departamento de Informática
Universidade da Beira Interior
Rua Marquês d'Ávila e Bolama
6201-001 Covilhã
Portugal
email: ddg at di.ubi.pt
Tel: +351 275 319 700
Fax: +351 275 319 732

From alexis.nasr at LINGUIST.JUSSIEU.FR Thu Jan 8 17:04:19 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Thu, 8 Jan 2004 18:04:19 +0100
Subject: Appel: BULAG : La correction automatique : bilan et perspectives
Message-ID:

*****************************************************************
CALL FOR PAPERS (English version below)
BULAG : "La correction automatique : bilan et perspectives"
http://tesniere.univ-fcomte.fr/bulag/appel.htm
*****************************************************************

Scope
-------------

Automatic correction currently seems to be an application neglected by research in Natural Language Processing. Yet, given the performance of the checkers on the market and the possible applications offered by new forms of written communication (e-mail, discussion forums, SMS short messages...), good automatic correction tools still appear useful.

This volume has two objectives. On the one hand, we propose to take stock of advances in the field of automatic correction. On the other hand, we want to highlight current research and the prospects of this area of Natural Language Processing.
Coordination
------------

Mounira BIOUD and Séverine VIENNEY

Calendar
----------

Submission deadline: 20 May 2004
Notification to authors: 30 June 2004

Themes
------

The themes addressed in this issue of BULAG include, but are not limited to:

- spelling correction
- grammar correction
- evaluation of the automatic checkers currently on the market
- limits of the domain
- prospects
- specific applications

Languages
-------

Papers must be written in French or English.

Paper format
-------------------

RTF format must be used. Papers must not exceed 5000 words. Each paper must be laid out as follows:

Title of the paper
First name and SURNAME of the author
Name of the research centre
University
City
Country
Abstract in French (Résumé)
Keywords in French (Mots clefs)
Abstract
Key-words
Paper
References

An RTF template can be downloaded from the following address: http://tesniere.univ-fcomte.fr/ressources/blgmodfr.rtf

Submission procedure
-----------------------

Submissions should preferably be sent by e-mail to the following addresses:

mounira.bioud at edu.univ-fcomte.fr
severine.vienney at univ-fcomte.fr

If e-mail submission is impossible, submission by post will be accepted. A floppy disk and one printed copy of the contribution should be sent to the following address:

Séverine VIENNEY
Faculté des Lettres et Sciences Humaines
Centre Tesnière
30, rue Mégevand
25030 Besançon cedex
FRANCE

*****************************************************************
CALL FOR PAPERS
BULAG : "La correction automatique : bilan et perspectives"
http://tesniere.univ-fcomte.fr/bulag/appelang.htm
*****************************************************************

Scope
-----

Spelling and grammar checking and correction seem to be neglected by current Natural Language Processing research.
However, considering the performance of current checkers and the possible applications offered by new forms of written communication (e-mail, discussion fora, SMS short messages...), it still seems useful to have good tools for automatic checking and correction.

This issue of BULAG has two objectives. Firstly, we wish to have a state-of-the-art survey of the field, and secondly, we wish to emphasise current research and prospects in this Natural Language Processing application.

Coordination
------------

Mounira BIOUD and Séverine VIENNEY

Dates
-----

Paper submission deadline: May 20, 2004
Acceptance notification: June 30, 2004

Themes
------

The themes to be addressed are:

- spelling checkers
- grammar checkers
- evaluation of current grammar/spelling correcting systems
- limits of the domain
- prospects
- specific applications

Languages
---------

Papers can be written in either English or French.

Format
------

RTF format should be used. The paper should be a WORD document, 5000 words maximum. Each paper should have the following form:

Title
Author
Name of research centre
University
City
Country
Résumé en français
Mots clefs
Abstract in English
Key-words
Paper
References

You can download an RTF template from the following address: http://tesniere.univ-fcomte.fr/ressources/blgmoden.rtf

Submission
----------

Papers should be sent by e-mail to the following two addresses:

mounira.bioud at edu.univ-fcomte.fr
severine.vienney at univ-fcomte.fr

Those without e-mail access can send a floppy disk and one printed copy of the paper to:

Séverine VIENNEY
Faculté des Lettres et Sciences Humaines
Centre Tesnière
30, rue Mégevand
25030 Besançon cedex
FRANCE

From alexis.nasr at LINGUIST.JUSSIEU.FR Thu Jan 8 17:04:22 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Thu, 8 Jan 2004 18:04:22 +0100
Subject: Publications: revue CORPUS
Message-ID:

Issue 2 of the journal "CORPUS" has just been published: it is devoted to intertextual distances (computation methods, various applications) and was coordinated by X. Luong, J.P. Barthélémy and S. Mellet.

The table of contents includes articles by: M. Bécue; É. Brunet; M. Kastberg; C. and D. Labbé; D. Longrée and X. Luong; X. Luong and S. Mellet; T. Merriam; D. Valentin, S. Chollet and H. Abdi; a presentation of the thesis of J.P. Anfosso; and two book reviews (234 pages).

The journal can be ordered from Edizioni dell'Orso (via U. Rattazzi 47, I-15100 Alessandria).

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:23 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 9 Jan 2004 18:20:23 +0100
Subject: Soft: Morphix-NLP
Message-ID:

The Morphix-NLP project may be of interest to all those teaching in NLP-related fields. As the original mirrors are either located in China and slow, or available only through BitTorrent (which is generally firewalled), LIMSI-CNRS has set up a mirror of the CD's ISO image, which should offer better transfer rates for Europe.

A quick presentation of the project follows. More information and links to the ISO image are at http://www.nlplab.cn/zhangle/morphix-nlp/

Guillaume Pitel - LIMSI-CNRS

-

Morphix-NLP is a Live CD Linux distribution with a rich collection of Natural Language Processing (NLP) applications. Though the field of NLP has undergone decades of intensive research, software designed in the NLP community is often scattered around the net and is not known by the larger computer user community. Consequently, most NLP software cannot be found in mainstream distributions even years after the first public release.

The purpose of this CD is twofold:

* In the first place, it tries to break the software acquisition and installation barrier facing many researchers and students in the NLP community by providing most NLP-related software on a single Live CD.
* In the second place, the CD can be used to promote Natural Language Processing among average computer users.
Simply plugging the CD into cd-drive and watching some NLP applications in action, most users will get some knowledge of Natural Language Processing and what NLP can do. ------------------------------------------------------------------------- Message diffus? par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrain?e par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adh?sion : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:16 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Fri, 9 Jan 2004 18:20:16 +0100 Subject: Appel: R E C I T A L 2004 Message-ID: ************************************************************** R E C I T A L 2004 RAPPEL - RAPPEL - RAPPEL - RAPPEL - RAPPEL - RAPPEL - RAPPEL - DATE LIMITE DE SOUMISSION = 15 JANVIER 2004 ************************************************************** R E C I T A L 2 0 0 4 Appel ? Communication - Call for Papers Rencontre des Etudiants Chercheurs en Informatique pour le Traitement Automatique des Langues 2004 19-22 avril 2004 F?s, Maroc Date limite de soumission : 15 janvier 2004 Informations: http://www.lpl.univ-aix.fr/jep-taln04/ (English version below) La conf?rence RECITAL 2004 (Rencontre des Etudiants Chercheurs en Informatique pour le Traitement Automatique des Langues) est la conf?rence annuelle de l'ATALA des jeunes chercheurs. Elle est organis?e en parall?le des conf?rences JEP et TALN 2004 qui auront lieu en conjonction ? F?s, au Maroc, du 19 au 22 avril 2004. Toutes les informations relatives ? RECITAL, ainsi qu'aux deux conf?rences, les appels ? 
communication, et les renseignements pratiques, sont accessibles sur le site : http://www.lpl.univ-aix.fr/jep-taln04/

RECITAL 04 est organisée par :
. le Laboratoire Parole et Langage, Aix-en-Provence (France)
. l'Université de Fès (Maroc)
. l'Ecole Normale Supérieure de Fès (Maroc)

Calendrier :
- Soumission des articles : 15 janvier 2004
- Notification aux auteurs : 20 février 2004
- Version finale : 8 mars 2004
- Conférence : 19-22 avril 2004

---------------------------------------------------------------------

RECITAL 2004 (Rencontre des Etudiants Chercheurs en Informatique pour le Traitement Automatique des Langues) is the annual young researchers' conference of the ATALA association (Association pour le Traitement Automatique des Langues). It will be held April 19-22, 2004, in Fez, Morocco, jointly with JEP and TALN 2004. All details about these conferences, including the complete calls for papers and practical information, are available online at: http://www.lpl.univ-aix.fr/jep-taln04/

RECITAL 04 is organized by:
. the Laboratoire Parole et Langage, Aix-en-Provence (France)
. the University of Fez (Morocco)
. the Ecole Normale Supérieure of Fez (Morocco)

Calendar:
- Submission deadline: 15 January 2004
- Notification to authors: 20 February 2004
- Camera-ready: 8 March 2004
- Conference: 19-22 April 2004
-------------------------------------------------------------------------
Message diffusé
par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrain?e par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adh?sion : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:31 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Fri, 9 Jan 2004 18:20:31 +0100 Subject: Appel: ACL 2004 WORKSHOP : TEXT MEANING and INTERPRETATION Message-ID: ACL 2004 WORKSHOP 2nd Workshop on TEXT MEANING and INTERPRETATION 25-26 July 2004, Barcelona In conjunction with the 42nd annual meeting of the Association for Computational Linguistics (www.acl2004.org) Workshop home page: www.cs.toronto.edu/~gh/TextMeaning.html Overview This 1.5-day workshop will continue the success of the 2003 Workshop on Text Meaning, which was held at HLT/NAACL-2003 in Edmonton. It aims to: * Re-establish the research community of knowledge-based interpretation of text meaning. * Explicate the implicit treatments of meaning in current knowledge-lean approaches and how they and knowledge-rich methods can work together. * Emphasize the construction of systems that extract, represent, manipulate, and interpret the meaning of text (rather than theoretical and formal methods in semantics). Most, if not all, high-end NLP applications -- such as machine translation, question answering and text summarization -- stand to benefit from being able to use text meaning in their processing. But the bulk of work in the field in recent years has not pertained to treatment of meaning. The main reason given is the complexity of the task of comprehensive meaning analysis and interpretation. Computational linguistics has always been interested in meaning, of course. 
The tradition of formal semantics, logics, and common-sense reasoning system has been continuously maintained for many years. But also, much work has been devoted to building practical, increasingly broad-coverage meaning-oriented analysis and synthesis systems. Lexical semantics has made significant progress in theories, description, and processing. Formal aspects of ontology work have also been studied. The Semantic Web has further popularized the need for automatic extraction, representation, and manipulation of text meaning: for the Semantic Web to really succeed, capability of automatically marking text for content is essential, and this cannot be attained reliably using only knowledge-lean, semantics-poor methods. While there has recently been a flurry of specialized meetings devoted to formal semantics, lexical semantics, semantic web, formal ontology and others, the number of meetings devoted to knowledge-based text meaning processing -- content rather than formalism -- has been much smaller. The first Workshop on Text Meaning began to remedy this, and ten papers were presented on implemented systems and on related topics. Suggested Topics (not necessarily limited to the following) * Implemented systems that extract, represent, or manipulate text meaning. * Broad-coverage semantic analysis and interpretation. * Knowledge-based text synthesis. * The nature of text meaning required for various practical broad-coverage applications. * Manual annotation of text meaning, including interlingual annotations. * Pragmatics and discourse issues as parts of meaning extraction and manipulation. * Ontologies supporting automatic processing of text meaning. * Semantic lexicons. * Microtheories to support text meaning extraction and manipulation: aspect, modality, reference, etc. * Text meaning representations in semantic analysis. * Reasoning to support semantic analysis and synthesis. * Multilingual aspects of meaning representation and manipulation. 
* Integrating semantic analysis and non-semantic language processing.
* Semantic analysis and synthesis systems based on knowledge-lean stochastic corpus-oriented methods.

We encourage discussion of theoretical issues that are relevant to computational applications, including descriptions of processors and static knowledge resources. We specifically prefer discussions of content and meaning over discussions of formalisms for encoding meaning, and discussions of decision heuristics in processing over discussions of generic processing architectures and theorem-proving mechanisms.

Submission Procedure

Submit papers electronically (no more than 8 pages in the ACL two-column format available at www.acl2004.org), PDF strongly preferred, to gh at cs.toronto.edu

Deadlines
* Paper submission 1 April 2004
* Notification re acceptance 30 April 2004
* Camera-ready version due 16 May 2004
* Workshop dates 25-26 July 2004

Organizers
* Graeme Hirst, University of Toronto (gh at cs.toronto.edu)
* Sergei Nirenburg, University of Maryland, Baltimore County (sergei at umbc.edu)

Program Committee
* Jan Alexandersson (DFKI Saarbrücken)
* Collin Baker (ICSI Berkeley)
* Peter Clark (Boeing)
* Dick Crouch (PARC)
* Richard Kittredge (University of Montreal)
* Paul Kingsbury (Penn)
* Tanya Korelsky (CoGenTex, Inc.)
* Claudia Leacock (ETS Technologies)
* Dan Moldovan (University of Texas at Dallas)
* Antonio Moreno Ortiz (University of Málaga)
* Martha Palmer (University of Pennsylvania)
* Gerald Penn (University of Toronto)
* Victor Raskin (Purdue University)
* Ellen Riloff (University of Utah)
* Graeme Ritchie (University of Edinburgh)
* Manfred Stede (University of Potsdam)
* Karin Verspoor (Los Alamos National Labs)
* Yorick Wilks (University of Sheffield)

Additional information
Graeme Hirst
Department of Computer Science
University of Toronto
Toronto, Ontario, Canada M5S 3G4
gh at cs.toronto.edu
-------------------------------------------------------------------------
Message diffusé
par la liste Langage Naturel
Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/
English version : http://www.biomath.jussieu.fr/LN/LN/
Archives : http://listserv.linguistlist.org/archives/ln.html
La liste LN est parrainée par l'ATALA (Association pour le Traitement Automatique des Langues)
Information et adhésion : http://www.atala.org/
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 9 17:20:37 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 9 Jan 2004 18:20:37 +0100
Subject: Appel: COLDOC' 2004 : The Setting up of Observables in Linguistics
Message-ID:

(apologies for multiple postings)

COLDOC' 2004
2nd CALL FOR PAPERS
The Setting up of Observables in Linguistics
Young researchers' conference - Nanterre, France - April 29 & 30, 2004

The young researchers of the Modèle, Dynamique, Corpus research team (UMR 7114 CNRS, Université Paris-X Nanterre) are organizing a young researchers' conference, scheduled for April 29 and 30, 2004, on the Université Paris X-Nanterre campus.

The central topic of this conference is the setting up of observables in linguistics, i.e. defining and making use of both attested and constructed data. Young researchers from all fields and domains of linguistics are therefore invited to submit a paper. Postgraduate students, Ph.D. students and postdocs are invited to share insights and experience from their respective research areas. We expect communications (in English or French) that address methodological and theoretical issues related to the process of setting up linguistic data, as well as to data collection and use.
For example, communications addressing one of the following issues are expected:
- Relevance and selection of linguistic data;
- Corpora and emerging linguistic phenomena;
- Oral, written or signed data collection methodology and practice;
- Questions related to corpus tools, transcription and encoding;
- The use and place of quantitative methods, both generic and specific;
- Qualitative methods;
- Comparison of languages, text genres or discourses.

Each conference session will start with an invited speaker's talk. A roundtable will be held at the end of the conference. Communications should last 20 minutes, followed by 10 minutes for questions.

The deadline for proposals is January 26, 2004. Communication proposals will be evaluated anonymously by the scientific committee. Authors are invited to send two separate files, in Word format: first, a two-page summary (3000 characters) of their communication; second, a file stating the authors' names, e-mail addresses and affiliations, together with the title of their communication. Authors may also state their preferred format for the communication: oral or poster. Communications will be evaluated against a range of selection criteria, favoring papers that fully address the issue stated above, that show methodological relevance and scientific interest, and that state their point clearly.

Communication proposals, as well as other requests should be addressed to: , or by postal mail, to the following address:

ColDoc' 2004
MoDyCo (UMR 7114)
Secrétariat sciences du langage
Université Paris-X Nanterre, Bât. L
200, avenue de la République
92001 Nanterre Cedex
France

We look forward to welcoming you to Nanterre University for the conference.

The Organizing Committee: Antonio Balvet, Sophie Hamon, Sylvain Loiseau, Ali Tifrit, Cécile Vigouroux.
Scientific Committee:
---------------------
Driss Ablali, Karine Baschung, Gabriel Bergounioux, Simon Bouquet, Nick Clements, Marcel Cori, Sophie David, Annie Delaveau, Bernard Fradin, Françoise Gadet, Nathalie Gasiglia, Philippe Gréa, Françoise Kerleroux, Mark Klein, Anne Lacheret, Bernard Laks, Sarah Leroy, Colette Noyau, Thierry Poibeau, François Rastier, Tobias Scheer, Pascale Sébillot, Anna Sores, Nathalie Vallée, Florence Villoing, Geoffrey Williams.

Important dates:
---------------------
Submission deadline: January 26, 2004
Authors' notification of acceptance: March 22, 2004
Conference: April 29 & 30, 2004

The Setting up of Observables in Linguistics
ColDoc'2004
Modyco (UMR 7114) young researchers' conference
Paris X Nanterre, Salle des colloques, Bâtiment B
200, avenue de la République
92001 Nanterre Cedex
France
Web site: http://infolang.u-paris10.fr/modyco/textes/actualites/Page.html
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 12 09:39:05 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 12 Jan 2004 10:39:05 +0100
Subject: Conf: TALN04 : deadline extension
Message-ID:

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fez (Morocco)
April 19 - 22, 2004
http://www.lpl.univ-aix.fr/jep-taln04/
**********************************************************************
->>>> NEW DEADLINE : 20 January

CALL FOR PAPERS

Important Dates
---------------
Submission deadline: 20 January 2004
Notification to authors: 20 February 2004
Camera-ready: 8 March 2004
Conference: 19-22 April 2004

Submitted papers must not exceed ten pages, in Times 12, single-spaced (about 3000 words), including figures, examples and references. Posters or demo papers should not exceed 6 pages. A LaTeX style file and a Word template are available on the conference web site: http://www.lpl.univ-aix.fr/jep-taln04/

Papers are to be submitted before January 20, 2004 through the online submission procedure available on the web site: http://www.lpl.univ-aix.fr/jep-taln04/

Papers MUST be sent in PDF. In particular cases, we may accept submissions in RTF (Word) format. IMPORTANT: All the PostScript versions must be in A4 format, and not US Letter.

If electronic submission is impossible, we will accept a printed version of the submission. In this case, three hard copies of the paper must be received by January 20, 2004 at:

Philippe Blache - TALN 2004
LPL, Université de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr

**********************************************************************
- T A L N ' 0 4 -
Traitement Automatique du Langage Naturel
Palais des Congrès
Fès (Maroc)
du 19 au 22 avril 2004
http://www.lpl.univ-aix.fr/jep-taln04/
**********************************************************************
->>>> NOUVELLE DATE LIMITE : 20 Janvier

APPEL À COMMUNICATIONS

CALENDRIER
Date limite de soumission : 20 janvier 2004
Notification aux auteurs : 20 février 2004
Version finale (prêt-à-clicher): 8 mars 2004
Conférence : 19-22 avril 2004

Les articles soumis ne devront pas dépasser 10 pages en Times 12, espacement simple, soit environ 3000 mots, figures, exemples et références compris. Les propositions de démonstrations ou les posters ne devront pas dépasser 6 pages.
Une feuille de style LaTeX et un modèle Word sont disponibles sur le site web de la conférence http://www.lpl.univ-aix.fr/jep-taln04/.

Les articles devront parvenir au comité d'organisation avant le 20 janvier 2004, en utilisant le formulaire de soumission en ligne à l'adresse suivante : http://www.lpl.univ-aix.fr/jep-taln04/

L'un des formats suivants devra IMPÉRATIVEMENT être employé:
- PDF, RTF (Word)
Les versions devront être au format A4.

En cas d'impossibilité d'envoi par courrier électronique, une soumission "papier" pourra être admise. 3 exemplaires papier de la contribution devront être envoyés à l'adresse suivante:

Philippe Blache - TALN 2004
LPL, Université de Provence
29, Avenue Robert Schuman
13621 Aix-en-Provence
France
e-mail: taln2004 at lpl.univ-aix.fr
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 12 09:39:10 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 12 Jan 2004 10:39:10 +0100
Subject: Appel: SENSEVAL-3
Message-ID:

[Apologies for multiple postings]

=====================================================================
CALL FOR PARTICIPATION IN THE SENSEVAL-3 EVALUATIONS
SENSEVAL-3
Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text
An ACL-2004 Workshop
Barcelona, Spain
July 25-26, 2004
http://www.senseval.org/senseval3
======================================================================

The main purpose of this workshop is to analyze and discuss
the results of systems participating in the Senseval-3 evaluations, to be held in March-April 2004. Fourteen different tasks are planned for Senseval-3, to conduct evaluations of systems that perform automatic semantic analysis of text, including: word sense disambiguation for various languages, identification of semantic roles, logic forms, multilingual annotations, subcategorization acquisition. This is an advance notice of the evaluation exercise and workshop. Registration for the evaluation will open in February (watch the website for updates). Papers will be accepted from participants only. [BACKGROUND] There are now many computer systems that do automatic semantic analysis of text. The purpose of Senseval is to evaluate the strengths and weaknesses of such systems with respect to different words, relations, types of texts, different varieties of language, and different languages. This workshop is a follow-up to Senseval-1 and Senseval-2. Senseval-1 took place in the summer of 1998 for English, French, and Italian, culminating in a workshop held at Herstmonceux Castle, Sussex, England on September 2-4. Senseval-2 took place in the summer of 2001, and was followed by a workshop held in July 2001 in Toulouse, in conjunction with ACL-2001. Senseval-2 included tasks for Basque, Chinese, Czech, Danish, Dutch, English, Estonian, Italian, Japanese, Korean, Spanish, Swedish. [TASKS] The following tasks are planned for Senseval-3 (see webpage for a description of each task): 1. English all words 2. Italian all words 3. Basque lexical sample 4. Catalan lexical sample 5. Chinese lexical sample 6. English lexical sample 7. Italian lexical sample 8. Romanian lexical sample 9. Spanish lexical sample 10. Automatic subcategorization acquisition 11. Multilingual lexical sample 12. WSD of WordNet glosses 13. Semantic Roles 14. 
Logic Forms This 2-day workshop will consist of several Senseval-3 task and system presentations, including analyses of results obtained during the evaluations, with comparisons across different systems, techniques, and languages. We also plan for two panels on (1) the interaction between systems for semantic analysis of text and other NLP applications, and (2) planning Senseval-4. [SUBMISSION FORMAT] Submissions will consist of refereed papers describing the Senseval-3 tasks and participating systems: - one paper for each task, limited to four pages - one paper for each participating team, limited to four pages for the first task, and one extra page for each additional task Papers will have to follow the ACL 2004 formatting guidelines. Submissions will be entered via the Senseval-3 website. [IMPORTANT DATES] Registration February Evaluations March - April Deadline for paper submissions April 20 Deadline for camera-ready papers May 18 Workshop July 25-26 [ORGANIZING COMMITTEE] Phil Edmonds, Sharp Laboratories of Europe Rada Mihalcea, University of North Texas [PROGRAM COMMITTEE] Eneko Agirre, University of the Basque Country Rebecca Bruce, University of North Carolina at Asheville Nicoletta Calzolari, ILC-CNR, Pisa Tim Chklovski, Information Sciences Institute Massimiliano Ciaramita, Brown University Silviu Cucerzan, Microsoft Research Walter Daelemans, University of Antwerp Florentina Hristea, University of Bucharest Nancy Ide, Vassar College Diana Inkpen, University of Ottawa Adam Kilgarriff, University of Brighton Dimitrios Kokkinakis, Goteborg University Anna Korhonen, University of Cambridge Robert Krovetz, Teoma Sadao Kurohashi, The University of Kyoto Dekang Lin, University of Alberta Ken Litkowski, CL Research PengYuan Liu, Harbin Institute of Technology Bernardo Magnini, ITC-IRST, Trento Lluis Marquez, University of Catalunya Diana McCarthy, University of Sussex Vivi Nastase, University of Ottawa Hwee Tou Ng, National University of Singapore Martha Palmer, 
University of Pennsylvania
Patrick Pantel, Information Sciences Institute
Ted Pedersen, University of Minnesota, Duluth
Judita Preiss, University of Cambridge
Amruta Purandare, University of Minnesota, Duluth
German Rigau, University of the Basque Country
Vasile Rus, Indiana University South-Bend
Charles Schafer, Johns Hopkins University
Carlo Strapparava, ITC-IRST, Trento
Dan Tufis, Romanian Academy
Cynthia Thompson, University of Utah
Paola Velardi, La Sapienza, Rome
Janyce Wiebe, University of Pittsburgh
David Yarowsky, Johns Hopkins University
Deniz Yuret, Koc University
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:19 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 13 Jan 2004 17:49:19 +0100
Subject: Jobs: Sinequa: search engine in Portuguese and Danish
Message-ID:

Sinequa is a company that develops, among other things, a search engine with a strong linguistic component. See http://www.sinequa.com for more information.

We are looking for two people to develop the engine for Portuguese and Danish. The required skills are as follows:
- Strong skills in linguistics or terminology (maîtrise or DESS level);
- Proficiency with computers (office software, Internet) required;
- Script programming (Perl or other) highly valued;
- Perfect command of Portuguese and Danish.

The work will consist of developing morpho-syntactic lexicons, tagged corpora, analysis automata (named entity recognition, etc.), etc. The contract will last 3 to 6 months.

If you are interested, please send your CV by e-mail to loupy at sinequa.com

Regards
--
Claude de Loupy - Head of Research
Sinequa - http://www.sinequa.com
e-mail: loupy at sinequa.com
tel.: 33 1 49 87 06 00 - fax: 33 1 49 87 06
-------------------------------------------------------------------------
From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:22 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 13 Jan 2004 17:49:22 +0100
Subject: Conf: Workshop "Terminology, Ontology & Knowledge Representation" : Program
Message-ID:

Please note that the workshop will now be held on 22-23 January 2004.

Venue : Manufacture des Tabacs, University of Lyon 3
4 cours Albert Thomas, 69008 Lyon - France
http://www.univ-lyon3.fr/partagedessavoirs/termino2004

-------------------------------WORKSHOP PROGRAM----------------------

22nd January 2004

9h00 : Welcome remarks / Accueil des participants

* Ontology & Knowledge representation : methodological approaches & applications

9h30-10h00 : Cartographier la connaissance. Anthony Frémaux (Société
Cognito)
10h10 - 10h40 : Partage des connaissances terminologiques en milieu industriel : approche théorique et implantation informatique. Jérémy Roy (ERSICOM, Université de Lyon 3)
10h50-11h00 : Pause
11h00 - 11h30 : Extraction d'informations sémantiques pour l'aide à la construction d'ontologies différentielles. V. Malaisé, P. Zweigenbaum, B. Bachimont (STIM AH/HP, CRIM-INALCO)
11h40-12h20 : Base de connaissances GENOMA : le rôle de l'ontologie. M. Teresa Cabré, Judit Feliu, Jorge Vivaldi (Université Pompeu Fabra, Barcelona)
12h20 - 14h00 : Lunch break / Pause déjeuner

* Invited speaker / Conférencière invitée

14h00 - 15h00 : Du corpus à une représentation relationnelle du lexique : la question des marqueurs des relations conceptuelles. Anne Condamines (Erss, Toulouse)
15h00-15h30 : Towards multilingual, termontological support in ontology engineering. Koens Kerremans, Rita Temmerman (CVC, Brussels)
15h40 - 16h10 : Ontology Via Terminology ? Lee Gillam, Mariam Tariq (University of Surrey, UK)
16h20 - 16h30 : Break / Pause
16h30 - 17h00 : ONTOLOGICO : vers un outil d'assistance au développement itératif des ontologies. Yassine Gargouri, Bernard Lefebvre, Jean Guy Meunier (LANCI, Université de Québec à Montréal)

* Terminology : methodological approaches I

17h10-17h40 : Variations et traitement automatique de la terminologie. Béatrice Daille (IRIN, Université de Nantes)
17h50-18h30 : A methodology for classifying documents using terminological taxonomies. Bellomi & Crestani (Université de Verona, Italy)

****** 23rd January 2004 ******

* Terminological theories

9h00 - 9h30 : La philosophie comme multi-terminologie. B. Hufschmitt (Université de Franche-Comté)

* Terminology : methodological approaches II

9h40 - 10h10 : Repérage humain ou automatique des relations lexico-sémantiques; bilan d'une tentative de formalisation. Jeanne Dancette, Sonia Halimi (Ecole de Traduction, Univ. de Genève & Université
de Montréal)
10h20 - 10h50 : Adjectifs dérivés sémantiques (ADS) dans la structuration des terminologies. Marie-Claude L'Homme (Université de Montréal)
11h00 - 11h10 : Break / Pause
11h10 - 11h40 : A computer-aided terminology processing system prototype. Le An Ha (University of Wolverhampton, UK)
11h50 - 12h20 : Terminology expansion and relation identification between genes and pathways. James Dowdall, Fabio Rinaldi, Andreas Persidisy, et al. (IFI, University of Zurich)
12h30 - 14h00 : Lunch break / Pause déjeuner

* Terminology : applications

14h00 - 14h30 : Une terminologie du domaine médical : structure et exploitation. L. Soaulmia, A. Névéol, M. Douyère et al. (CHU, Université de Rouen)
14h40 - 15h10 : Referencing text documents in multidimensional concept spaces for technology and scientific watch. A conceptual overview of text models in the context of a collaborative scientific watch system. Jean-Sébastien Brunner, Thibaud Latour (CRP Henri Tudor, Centre for IT Innovation, Luxembourg)
15h20 - 15h30 : Pause
15h30 - 16h00 : De l'élaboration d'un dictionnaire de description de sens des termes médicaux vietnamien-français-anglais à la recherche d'information médicale par croisement de langues: une approche socioterminologique. Tuan Duc Tran, N. Garcelon, D. Delamare (Faculté de Médecine-Université de Rennes)
16h10 : Plenary discussion / Discussion plénière
17h00 : End of workshop / Fin de l'atelier.

--------------------------------------
Ibekwe-SanJuan Fidelia
Workshop "Terminology, Ontology & Knowledge representation"
22-23 January 2004, University of Lyon 3.
http://www.univ-lyon3.fr/partagedessavoirs/termino2004
-------------------------------------------------------------------------
Message diffusé
par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/ English version : http://www.biomath.jussieu.fr/LN/LN/ Archives : http://listserv.linguistlist.org/archives/ln.html La liste LN est parrain?e par l'ATALA (Association pour le Traitement Automatique des Langues) Information et adh?sion : http://www.atala.org/ ------------------------------------------------------------------------- From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:23 2004 From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR) Date: Tue, 13 Jan 2004 17:49:23 +0100 Subject: Appel: CIDE 7 : EXTENDED DEADLINE : FEBRUARY 8 Message-ID: *************************************************** CIDE.7 SECOND CALL - EXTENDED DEADLINE : FEBRUARY 8 International Colloquium for Electronic Document Colloque International sur le Document Electronique *************************************************** "Semantic Approaches of Electronic Document" La Rochelle (France, June 22-25, 2004 http://infodoc.unicaen.fr/cide/cide7/ To be held as part of "la Semaine du Document Num?rique" ("Week for Electronic Document") Since 1998, CIDE organizes scientific meetings on topics of broad interest and importance for progress in electronic document studies. The objectives are to put together complementary approaches from various disciplines, and to promote academic and industrial results in this area. The seventh main conference purpose is to focus on semantic approaches of electronic document processing. Semantic-oriented approaches have long been considered sceptically by practitioners or researchers, to the benefit of so-called "surface" processing, considering "form" rather than "content" or "meaning". This view already began to change. Significant progress has been done in the last years, either relative to text document (e.g. 
in the area of information extraction, question answering, automatic summarisation...), or other medias (content-based indexing of audio-video documents, pictorial or sound information extraction, summary of musical or video works...). Moreover, the big challenge of the "semantic web" project is to elaborate formal descriptions of the content of documents and other resources, in order to make them easily accessible and interoperable. Another, radical, viewpoint would be to consider that even "surface" or "numerical" processing is in fact, if closely observed, of semantic nature. If "sense" does not reduce to "information", producing any information is producing sense. Lexical disambiguation, even if based on a statistical method not relying on any linguistic theory, does solve a lexical-semantic problem. A program extracting thematic descriptors computes this minimal meaning : "what this document is all about", etc. The aim of CIDE.7 is to bring the light on these questions. Two aspects are to be considered : - Presentation and discussion of experiences and advances addressing the semantic analysis of electronic documents, according to the various medias (text, audio, video), and their networking (semantic web) ; - Methodological investigations in order to establish the basis of a truly semantic approach in document engineering. The conference will include : - A presentation of communications in response to the present call ; - Invited conferences providing syntheses on the different kinds of semantic processing ; - A final panel, in collaboration with other conferences taking part in the "Week for Electronic Document". ================= Conference topics ================= The topics addressed by CIDE.7 include (but are not limited to) the following : - Applications : content-based information retrieval, information extraction, inside-document browsing, hypertext structuring, analysis of technical, as well as artistic or literary documents... 
- Description of document content : indexing, tagging, enrichment... of the whole or segments of documents, constitution of terminologies or ontologies, formalisms for representation of descriptions (rdf, topic maps...), semantic trans-modality modelling... - Processing methods for analysis and use : semantic and semiotic methods specific to the different kinds of documents (text, image, audio, video), collaboration of symbolic and numerical methods, constitution and use of corpora, document bases integration, web services... - Methodological investigations : sense and use, relation between forms and sense, similarities and differences between medias, collaboration for certain tasks... ========================== Language of the conference ========================== The main language is French. However, papers and presentations in English are welcome. ========== Submission ========== Instructions for authors are accessible on the web site of CIDE.7. Declarations of intention to submit a paper will include keywords and a 200 words summary. They have to be sent in pdf format. The full papers will not exceed 15 pages (according to the provided style sheets). The presentation of submitted papers should be the same as for the final ones. =============== Important dates =============== - Declarations of intention to submit (optional) : as soon as possible. - Paper submission : February 8, 2004. - Notification of acceptation : March 15, 2004. - Final papers due : April 15, 2004. - Conference : June 22-25, 2004. ======================== Contact and informations ======================== Lydie Sauv?, D?partement d'informatique, Campus II, bd Mar?chal Juin, Universit? de Caen, 14032 Caen Cedex Web Site : http://infodoc.unicaen.fr/cide/cide7/ Email (informations) : cide7 at infodoc.unicaen.fr Email (submission) : cide7-soumission at infodoc.unicaen.fr ========================== Program Committee of CIDE.7 ========================== Chair : P. Enjalbert (U. Caen), M. Gaio (U. 
Pau)
M.H. Antoni (U. Poitiers), T. Baccino (U. Nice), B. Bachimont (INA and UTC), F. Cerbah (Dassault Aviation), J.P. Desclés (Lalicc, U. Paris 4), C. Faure (ENST), S. Ferrari (U. Caen), C. Fluhr (CEA), B. Grau (LIMSI), P. Laublet (Lalicc, U. Paris 4), G. Mourad (Lalicc, U. Paris 4), A. Napoli (LORIA), M-P. Pery Woodley (U. Toulouse 2), I. Saleh (U. Paris 8), K. Tombre (LORIA), B. Victorri (CNRS-ENS), G. Vignaux (CNRS-LCP), H. Vinet (IRCAM), J. Vivier (U. Caen).

====================
Organising Committee
====================

S. Ferrari (coordination), F. Bilhaut, E. Faurot, V. Perlerin, C. Turbout, A. Widlöcher

==========================================
Permanent Committee of the CIDE Conference
==========================================

M. Bellafkih (Morocco), J. Caelen (France), J. Ducloy (France), M. Gaio (France), J. Gardes (France), J-L. Hainaut (Belgium), P. King (Canada), J. Labiche (France), M. Leonard (Switzerland), J-P. Raysz (France), J-M. Robert (Canada), Z. Sahnoun (Algeria), M. Szmurlo (France), L. Thomazo (France), E. Trupin (France), J. Virbel (France), J. Vivier (France), C. Vanoirbeek (Switzerland), K. Zreik (France, coordination).

-------------------------------------------------------------------------
Message distributed by the Langage Naturel Informations list, subscription: http://www.biomath.jussieu.fr/LN/LN-F/
English version: http://www.biomath.jussieu.fr/LN/LN/
Archives: http://listserv.linguistlist.org/archives/ln.html
The LN list is sponsored by ATALA (Association pour le Traitement Automatique des Langues)
Information and membership: http://www.atala.org/
-------------------------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:25 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 13 Jan 2004 17:49:25 +0100
Subject: Appel: TALN2004: Workshop on QUESTION-ANSWERING
Message-ID:

**********************************************************************
CALL FOR PAPERS
Held in conjunction with T A L N 2 0 0 4
Workshop on QUESTION-ANSWERING
---
Palais de Congrès, Fès (Morocco)
April 22, 2004
http://www.lpl.univ-aix.fr/jep-taln04/
**********************************************************************

Facing a question such as "What is the most expensive car in the world?", classical search engines return the documents that are the most strongly linked to the words of the question, sometimes extract the excerpts where these words are the most numerous, but let the user browse texts to actually find an answer. This need has led to the development of systems that are able to extract the parts of documents that are the most relevant in relation to a question, providing either an answer when the question is about a precise fact or a summary when it is a topical question. These functions can be implemented only if IR systems are able to analyze both queries and documents more deeply. As a consequence, question answering is at the crossroads of several research fields: of course, it is grounded in Information Retrieval, but it also concerns Natural Language Processing (NLP) in an important way and, to some extent, fields such as Machine Learning.
Most QA systems are based on a classical search engine that is enhanced by a question analysis module, a set of modules for extracting various linguistic features from documents, such as named entities, terms or syntactic relations, and a module that relies on all these data for extracting answers by mixing linguistic and numerical criteria. Moreover, the QA problem puts forward new functions, or functions that are still in an embryonic state in current IR systems: evaluating whether an answer to a question exists in a document collection, achieving a synthesis from multiple or partial answers, using dialog for constructing a query, or text understanding capabilities for dealing with anaphora, inferences, or for determining whether a set of several answers is coherent.

More precisely, submissions will present a question answering system as a whole or will focus on one of its processes, provided that it is put in the question answering context. These processes include but are not limited to:
- question analysis: question typology, extraction of the question focus, of the question context or more generally, of semantic constraints
- named entity recognition: fine-grained named entities, unrestricted domains
- passage extraction
- full or partial similarity of syntactic structures
- terminological tools: extraction and recognition of terms and their variants
- extraction and justification of answers: answer patterns, inferences, paraphrase...

This workshop is particularly interested in papers that focus on QA systems for large collections of documents or the Web, but papers about QA systems for restricted domains or dedicated to knowledge bases or databases will also be considered.
Submissions can also tackle cross-domain topics in relation to Question Answering, such as:
- QA and machine learning: use of machine learning for selecting and extracting answers to a question but also for building, on a large scale, the resources that are necessary for QA systems;
- multilingual and crosslingual QA: what are the difficulties in adapting an existing QA system (most of them only work for English) to another language; asking a question in one language and searching for an answer in a collection of documents in another language;
- QA and the Web: using the Web as a source of knowledge or a source of answers; what are the specific aspects of searching for an answer on the Web;
- multi-document QA: fusion and coherence of multiple answers.

SUBMISSION:
Submissions will be summaries of at least 4 pages or long papers of no more than 10 pages, written in French or English, following the style of the main conference TALN 2004. The final version will be a long paper. The submission format is PDF, but .doc and .ps will also be accepted. Papers must be sent to Brigitte.Grau at limsi.fr, with TALN-QA as subject.

IMPORTANT DATES:
Submission deadline: 15 January 2004
Notification to authors: 20 February 2004
Camera-ready: 8 March 2004
Question-Answering workshop: 22 April 2004

Groupe LIR - LIMSI
BP 133, 91403 Orsay Cedex
tel. 01 69 85 80 03, fax 01 69 85 80 88
and
Institut d'Informatique d'Entreprise (IIE)
18 allée Jean Rostand, 91025 Evry Cedex
tel. 01 69 36 73 44, fax 01 69 36 73 09
----------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:26 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 13 Jan 2004 17:49:26 +0100
Subject: Appel: OntoLex 2004:Ontologies and Lexical Resources in Distributed Environments
Message-ID:

****APOLOGIES FOR MULTIPLE POSTINGS****

SECOND AND FINAL CALL FOR PAPERS

Workshop OntoLex 2004: Ontologies and Lexical Resources in Distributed Environments
http://www.loa-cnr.it/ontolex2004.html
Centro Cultural de Belem
LISBON, Portugal
29th May 2004

In Association with
4th INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION LREC2004
http://www.lrec-conf.org/lrec2004/index.php
Main conference 26-27-28 May 2004

Motivations and aim

The use of ontological knowledge in language technology applications goes a long way back. Recently, however, the project of turning the World Wide Web into a machine understandable resource to access digital information (the so-called Semantic Web) has stimulated a renewed interest in ontologies. In several recent workshops and conferences, researchers have investigated their nature and application potential for knowledge management, information retrieval and extraction, information exchange in agent-based systems as well as dialogue systems. Attention is being drawn to new aspects of ontology research such as ontology coordination and mapping, aspects that are particularly relevant for distributed environments such as Knowledge Grid and Semantic Web.
In fact, the annotation of web resources in agreement with concepts and relations as defined in ontologies is useful for establishing a conceptual support for knowledge communication. From this perspective, lexicographers, lexical semanticists and ontologists are joining forces to build innovative systems for integrating ontological knowledge with lexical and semantic resources. Important examples of this interaction are the recent works on the conceptual analysis of WordNet (one of the first lexical knowledge bases), and the wide use of upper ontologies in innovative international projects like EuroWordNet, SIMPLE, Balkanet, DWDSnet.

WordNet was designed and built entirely by psychologists, linguists, and lexicographers. Nevertheless, there are obvious parallels with ontologies, especially in the kinds of structuring relations used (taxonomical links, meronymy or part-of, etc.), and indeed WordNet has for years attracted the attention of philosophers and ontologists. In this context, the distinction between conceptual (possibly axiomatic) ontologies and lexical ontologies (which contain both linguistic and ontological information) has become more and more central in the field.

In this workshop we want to discuss ontologies as resources per se, as well as the relation between ontological knowledge and language. This relation can be investigated from a number of different angles, for example what differences and similarities there are between ontologies and more traditional lexical resources such as dictionaries and wordnets; how ontologies can be extracted from language corpora; what role language plays in the definition and mapping of ontologies; and finally, how ontologies can be used to treat language in language technology applications, in particular applications for distributed environments.
Topics to be addressed in the workshop include, but are not limited to:
- Design principles and methodologies for upper-level ontologies and semantic lexical resources
- Evaluation, comparison, mapping and integration of ontologies and lexical resources
- Applications of ontologies and semantic lexical resources in LT applications (e.g. QA, Information Retrieval, Information Extraction, Machine Translation)
- Role of semantic lexical resources in ontology learning
- Methods to derive ontological knowledge from text
- Methods to annotate text with reference to an ontology
- Ontology-based query expansion techniques
- Ontologies and multi-lingual lexical resources
- Ontologies and ontology mapping in multi-lingual applications
- Ontologies and lexical resources for meaning negotiation

Two discussions will be organised around the following topics:
- Filling the gap between axiomatic and linguistic ontologies
- The role of lexical resources in the Semantic Web and the Knowledge Grid

Reasons of interest

A new scientific community is growing around this largely interdisciplinary area: following the spirit of the previous two OntoLex workshops, this workshop aims at being an important meeting point for researchers involved in the fields of lexical resources and ontologies, favouring the exchange of scientific experiences and proposing new directions of inquiry. This year, the workshop particularly welcomes contributions from researchers that are investigating the application of ontologies and lexical resources in distributed environments such as Knowledge Grid and Semantic Web.
Important dates

- 4th December 2003: Call for papers and demonstrations
- 30 January 2004: Deadline for paper submission
- 5 March 2004: Acceptance notifications and preliminary program
- 29 March 2004: Deadline for final version of accepted papers
- 29 May 2004: Workshop

Submissions

Participants are invited to submit an extended abstract of max 3000 words related to one or more of the topics of interest. Papers can describe research results as well as work in progress. Each accepted paper will receive a slot of 30 minutes for presentation (20 minutes talk and 10 minutes for discussion). Demonstrations of ontology applications are encouraged as well (a demonstration outline of 2 pages can be submitted). Each submission should show: title; author(s); affiliation(s); and contact author's e-mail address, postal address, telephone and fax numbers. Submissions must be sent electronically in PDF to Alessandro Oltramari (oltramari at loa-cnr.it).

As soon as possible, authors are encouraged to send a brief email indicating their intention to participate, including their contact information and the topic they intend to address in their submissions. Proceedings of the workshop will be printed by the LREC Local Organising Committee.

Time schedule and registration fee

The workshop will consist of a morning session and an afternoon session, and include scientific paper presentations from workshop participants as well as general discussions. For this full-day workshop, the registration fee is 100 EURO for LREC conference participants and 170 EURO for other participants. These fees will include a coffee break and the Proceedings of the Workshop.
Organising Committee

Alessandro Oltramari (Laboratory for Applied Ontology, ISTC-CNR; Department of Cognition and Education Sciences, Trento University)
Patrizia Paggio (Center for Sprogteknologi, University of Copenhagen)
Aldo Gangemi (Laboratory for Applied Ontology, ISTC-CNR Rome)
Maria Teresa Pazienza (Roma Tor Vergata University)
Nicoletta Calzolari (Istituto di Linguistica Computazionale del CNR)
Bolette Sandford Pedersen (Center for Sprogteknologi, University of Copenhagen)
Kiril Simov (Bulgarian Academy of Sciences)

Programme Committee

Roberto Basili (Roma Tor Vergata University)
Werner Ceusters (Language & Computing)
Nicoletta Calzolari (Istituto di Linguistica Computazionale del CNR)
Aldo Gangemi (Laboratory for Applied Ontology, ISTC-CNR, Rome)
Eric Gaussier (Xerox Research Centre Europe, Grenoble Laboratory)
Maria Toporowska Gronostaj (Språkdata, University of Gothenburg)
Nicola Guarino (Laboratory for Applied Ontology, ISTC-CNR, Trento)
Arne Jönsson (Linköping Universitet)
Dimitrios Kokkinakis (Språkdata, University of Gothenburg)
Alessandro Lenci (Università di Pisa)
Claude de Loupy (Sinequa and University of Paris 10)
Bernardo Magnini (ITC-IRST, Trento)
Jørgen Fischer Nilsson (Technical University of Denmark)
Alessandro Oltramari (Laboratory for Applied Ontology, ISTC-CNR, Trento)
Patrizia Paggio (Center for Sprogteknologi)
Maria Teresa Pazienza (Roma Tor Vergata University)
Bolette Sandford Pedersen (Center for Sprogteknologi)
Guus Schreiber (Vrije Universiteit Amsterdam)
Kiril Simov (Bulgarian Academy of Sciences)
Atanas Kiryakov (Ontotext Lab, Sirma AI)
Paola Velardi (Università La Sapienza, Rome)

From alexis.nasr at LINGUIST.JUSSIEU.FR Tue Jan 13 16:49:27 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Tue, 13 Jan 2004 17:49:27 +0100
Subject: Appel: ACL04: WORKSHOP ON QUESTION ANSWERING IN RESTRICTED DOMAINS
Message-ID:

FIRST CALL FOR PAPERS

ACL04 WORKSHOP ON QUESTION ANSWERING IN RESTRICTED DOMAINS
Barcelona, Spain, 25-26 July 2004
Submission deadline: 15 March 2004
http://www.clt.mq.edu.au/Events/Conferences/acl04qa/

Much of the current research in question answering systems is driven by programs such as AQUAINT and evaluation exercises such as TREC, NTCIR and CLEF, all of which focus on open-domain question answering. The availability of large volumes of data (e.g. documents extracted from the World Wide Web) has prompted the development of systems that focus on shallow text processing. But there are many document sets in restricted domains that are potentially valuable as a source for question answering systems. For example, the documentation pages of Unix and Linux systems would make an ideal corpus for QA systems targeted at users that want to know how to use these operating systems. There is a wealth of information in other technical documentation such as software manuals, car maintenance manuals, and encyclopediae of specific areas such as medicine. Users interested in these specific areas would benefit from QA systems targeted to their areas of interest.
Restricted domains typically have limited data available and therefore conventional techniques based on data redundancy simply cannot be applied in an effective way. The scarcity of data available seems to call for a more targeted, NLP-intensive approach to QA. The use of additional corpora such as the WWW raises a number of interesting questions. For instance, will these corpora help or obstruct the proper functioning of an NLP-intensive approach to QA? And, how do we find good pockets of information that are appropriate to the chosen domains?

On the other hand, restricted domains (e.g. law, medicine) have specific stylistic conventions. Often these domains use terminology that is not stored in conventional lexica. Consequently NLP approaches devised for open-domain systems may under-perform on these specific domains, thus raising the question of how portable these systems can be.

In this workshop we aim at answering some of the following questions:
* Are open-domain question answering techniques appropriate for QA in restricted domains?
* Can we use generic large corpora and/or the WWW? How can we identify specific pockets of information in these generic corpora?
* How can we use specific sources such as the CIA factbook, acronym lists, e-commerce sites (e.g. e-bay), and specialized glossaries and encyclopedias? How can we discover new specific sources?
* What types of question-answering techniques are best for what types of restricted domains?
* Is it easy/possible/worthwhile to develop domain-independent QA systems for restricted domains? What would be the cost of porting a QA system to a specific domain?
* Are restricted domains more suitable than open domains to drive research in NLP?
* Is evaluation of restricted-domain QA systems different than that of open-domain QA systems?
We welcome papers that address any of the above questions or that focus on any of the following topics:
* Comparison between open-domain and restricted-domain QA
* Characterisation of the types of restricted domains and the technology required for QA on those domains
* Methodologies and/or tools for restricted-domain QA
* Description of specific restricted-domain QA systems
* Development of modules (e.g. document preselection, NE extraction, terminology extraction) for use in restricted-domain QA systems
* Portability of QA systems between different restricted domains
* Evaluation of restricted-domain QA systems

SUBMISSION PROCEDURE

Authors should submit full papers of at most 8 pages, including references and figures, following the main conference ACL style format (http://www.acl2004.org/aclstyles/style.html). The review will not be blind. Submissions must be in PS or PDF format and should be sent to diego at ics.mq.edu.au.

PROGRAM COMMITTEE

Organizers:
-----------
Diego Mollá, Macquarie University, Australia
José
Luis Vicedo, Alicante University, Spain

Committee:
----------
In alphabetical order by first name:
Anselmo Peñas, UNED, Spain
Antonio Ferrández, Alicante University, Spain
Bernardo Magnini, ITC-Irst, Italy
Bonnie Webber, University of Edinburgh, UK
Donna Harman, NIST, USA
Ellen Voorhees, NIST, USA
Fabio Rinaldi, University of Zurich, Switzerland
Felisa Verdejo, UNED, Spain
Graeme Hirst, University of Toronto, Canada
Horacio Rodríguez, Universitat de Catalunya, Spain
Ingrid Zukerman, Monash University, Australia
Jimmy Lin, MIT, USA
Johan Bos, University of Edinburgh, UK
Juergen Franke, DaimlerChrysler AG, Germany
Julio Gonzalo, UNED, Spain
Lynette Hirschman, MITRE, USA
Maarten de Rijke, University of Amsterdam, The Netherlands
Manuel Palomar, Alicante University, Spain
Mark Maybury, MITRE, USA
Michael Hess, University of Zurich, Switzerland
Pierre Zweigenbaum, AP-HP, INSERM & INaLCO, France
Richard Sutcliffe, University of Limerick, Ireland
Rolf Schwitter, Macquarie University, Australia
Sanda Harabagiu, University of Texas, USA

IMPORTANT DATES

* 15 March 04: Paper submission
* 15 April 04: Notification of acceptance
* 15 May 04: Camera ready version
* 25 or 26 July 04: Workshop (final date not yet determined)

CONTACT DETAILS

Diego Mollá
Centre for Language Technology
Division of Information and Communication Sciences
Macquarie University
New South Wales 2109
Australia
Tel. +61 2 9850 9531
Fax +61 2 9850 9551
diego at ics.mq.edu.au

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:38 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:38 +0100
Subject: Jobs: internships at ATILF
Message-ID:

Hello,

2004 internship offers for students in linguistics (SdL), NLP (TAL) or computer science (licence, maîtrise, DEA/DESS level) are posted at http://www.atilf.fr/ananas/. Please circulate them.

Best regards,
Susanne Salmon

--
Susanne Salmon-Alt
Chargée de Recherche - CNRS
ATILF: 03.83.96.86.98
LORIA: 03.83.59.20.35

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:41 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:41 +0100
Subject: Conf: TALN'04: SDRT workshop : DEADLINE EXTENSION
Message-ID:

SDRT workshop of TALN'04

The submission deadline is postponed: it is now 22 January.
----------------------------------------------------

Call for papers

TALN (Traitement Automatique du Langage Naturel) is the francophone annual conference on NLP. TALN'04 (http://www.lpl.univ-aix.fr/jep-taln04/) will be held in conjunction with JEP 2004 (Journées d'Etude sur la Parole) in Fez, Morocco, from 19 to 22 April 2004, under the aegis of the Association Francophone de la Communication Parlée (AFCP) and of the Association pour le Traitement Automatique des Langues (ATALA, the French equivalent of ACL). The workshop on SDRT will be held on April 22 2004. Official languages are French and English.

Topic

Papers are invited for thirty-minute talks, including questions, on theoretical and implementational issues on SDRT. SDRT is an approach to discourse interpretation that has many advantages for computational NLP research and applications. Originally (Asher 1993) an extension of Hans Kamp's Discourse Representation Theory (DRT), it combines the insights of dynamic semantics on anaphora with a richer theory of discourse structure, in which each clause plays one or more rhetorical functions within the discourse.
More than a decade of work has shown the theoretical fruitfulness of marrying a rich notion of discourse structure with dynamic semantics. For example, it has been shown that rhetorical functions have semantic effects in the following domains:
- temporal and spatio-temporal structure of the text,
- pronominal anaphora,
- presupposition,
- resolution of bridging expressions (like definite descriptions),
- resolution of lexical and other ambiguities like VP ellipsis and quantifier scope,
- analysis of plural quantification,
- calculation of implicatures and conversational goals of agents in dialogue.

In addition to these theoretical aspects, SDRT has been designed from the outset to aid with implementation in automated or semi-automated textual analysis and text generation. It is a modular theory which contains both a theory of information content and a theory of information packaging, i.e. how to construct the logical form of a discourse. The former is straightforwardly an extension of dynamic semantics, and any implementation of the dynamic semantic ideas (e.g. DRT, DPL, DMG) is compatible with the SDRT conception of discourse content. The latter exploits diverse resources that are understood in modular form, and it also exploits the notion of an underspecified representation at several levels. An additional feature of SDRT is that it uses a nonmonotonic system of inference.

The aim of this workshop is twofold: theoretical and implementational issues on SDRT. We therefore expect either papers presenting the treatment of a given linguistic phenomenon in SDRT, possibly with a comparison with treatments in other discourse semantics frameworks, or papers presenting implementational issues, for example:
- How can HPSG or LFG grammars provide suitable inputs to fragments of SDRT implementation?
- The inference engine used to compute logical forms for discourse: should this system itself be implemented or should approximations, perhaps even monotonic ones, be used?
How can we expect the nonmonotonic logic to scale up for large scale applications? How much logical inference do we really need for shallow applications of SDRT?
- How can statistical approaches be used to obtain lexical information and other information that would be useful in computing discourse structure? How might machine learning apply to learning rules for the computation of discourse structure in SDRT?

All selected papers will be published in the proceedings. Authors are invited to submit original, previously unpublished research work. Submissions will be reviewed by at least two members of the program committee.

Program committee:

Chairmen
Asher Nicholas (Austin), nasher at mail.la.utexas.edu
Danlos Laurence (Paris), laurence.danlos at linguist.jussieu.fr

Members
Amsili Pascal (Paris), pascal.amsili at linguist.jussieu.fr
Bras Myriam (Toulouse), bras at univ-tlse2.fr
Corblin Francis (Paris), corblin at paris7.jussieu.fr
Gaiffe Bertrand (Nancy), bertrand.gaiffe at loria.fr
Kamp Hans (Stuttgart), Hans.Kamp at ims.uni-stuttgart.de
Kruijff-Korbayova Ivana (Saarbrücken), korbay at coli.uni-sb.de
Le Draoulec Anne (Toulouse), draoulec at univ-tlse2.fr
Muller Philippe (Toulouse), muller at irit.fr
Pustejovsky James (Boston), jamesp at cs.brandeis.edu
Roussarie Laurent (Paris), laurent.roussarie at linguist.jussieu.fr
Vieu Laure (Toulouse), Laure.Vieu at irit.fr

Submission procedure

Submitted papers must not exceed ten pages, in Times 12, single spaced (about 3000 words), including figures, examples and references. A LaTeX style file and a Word template are available on the web site of the conference.

Papers MUST be sent in PDF format
To: Laurence.Danlos at linguist.jussieu.fr
subject: SDRT'04 paper.

Important

All the PDF versions must be in A4 format, and not US Letter.
New Submission deadline: 22 January 2004
Notification to authors: 20 February 2004
Camera-ready: 8 March 2004
SDRT workshop: 22 April 2004

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:43 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:43 +0100
Subject: RESOURCES: building a speech database
Message-ID:

***************************************
WE ARE INTERESTED IN YOUR VOICE
***************************************

To build a speech database for research and development in the field of speech processing, we are looking for:
- 1000 speakers,
- men and women,
- aged 18 and over,
- native speakers of French.

The voice recordings are paid and last about 10 minutes.

If you are interested, or for further information, you can contact us:

Tel.: 01 43 13 33 47

*** Thank you.

---------------------------------------------------------------------------
55-57, rue Brillat-Savarin 75013 Paris FRANCE
Tel: (+33) 1 43 13 33 33 / Fax: (+33) 1 43 13 33 30
URL: http://www.elra.info or http://www.elda.fr
LREC conference: http://www.lrec-conf.org
LangTech forum: http://www.lang-tech.org
---------------------------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:44 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:44 +0100
Subject: APPEL: revue TAL : Automatic Text Summarization: Solutions and Perspectives
Message-ID:

Automatic Text Summarization: Solutions and Perspectives

Deadline: February 10th, 2004
Special Issue Editor: Jean-Luc Minel (LaLICC, CNRS)

Context: Automatic Text Summarization

From a scientific point of view, the problem of summarization extends beyond its immediate boundaries, largely due to certain rhetorical relations that need to be taken into consideration, such as discourse frames, or direct or indirect levels of discourse. In recent years, this research area has moved beyond the processing of phrases as the unit of analysis to include concept search and the design of suitable methods for detecting and representing textual structures.

Until the mid-1990s, scientific texts provided a field of experimentation for automatic summarization methods, but the digitization of texts and their increased availability on the Web and intranets has fundamentally changed user needs and uses. The abundance of texts brings with it an urgent requirement for the creation and production of summarization tools capable of finding, selecting and extracting textual information concisely. In terms of the need, it has become necessary to bring web-search tools and company networks together with tools for automatic summarization.
In terms of use, the development of text-search tools requires careful consideration of text representation and the design of user interfaces, which in turn leads to studies in the domain of information science on new types of written text.

Topic

This special issue aims:
- on the one hand, to present new approaches or methods which may lead to promising prototypes for automated systems. It also concerns the development of an awareness of the importance of automatic summarization, alongside technology, for linguistic engineering. Whatever the automatic summarization project, there are only two underlying techniques for its realization: firstly, techniques which extract full sentences from source texts, and secondly, those which generate a new, condensed text. Only the first kind of technique has allowed the implementation of systems that can be considered to provide reasonable results when evaluated for commercial potential. The second technique is of interest in a research context because of the various linguistic problems left unsolved at the level of interpretation, in view of the limitations in their computational implementation.
- on the other hand, to propose bridges between the various areas in which text constitutes the main object of study (i.e., in the domain of information science).

Papers are invited which contribute to the following themes:
- Numerical approaches versus linguistic approaches, with a particular focus on papers that explore the complementarity between the two.
- The automatic detection of textual structures, including:
  o the identification of topic
  o the identification of argumentative structures
  o the improvement of coherence and cohesion (dealing with anaphora, etc.)
- Different dimensions of summary
  o Types of summary
  o Translingual summaries
  o Multi-document summaries
- Linguistic resources necessary for summarization systems
  o the application of terminology and ontologies
  o Generic versus specific-purpose resources
- Summary and Normalization
  o Integrating summary systems into networks
  o Integrating summarization systems into linguistic (s?)
- Methods of assessment for summarization systems
- Extension of the summarization issue to the semantic filtering of texts:
  o Exploring new opportunities
  o The production of summaries in response to specific user needs
  o The use of navigation tools and interfaces which exploit textual structure
  o Summarization and annotation of texts in a Semantic Web context

Editorial Committee

John Atkinson (Université de Concepción, Chile)
Michel Charolles (LATTICE, Université Paris 3, France)
Jean-Pierre Desclés (LaLICC, Université Paris-Sorbonne, France)
Michael Elhadad (Computer Science Department, Ben Gurion University, Israel)
Noemie Elhadad (Computer Science Department, Columbia University, USA)
Guy Lapalme (Université de Montréal, Canada)
Inderjeet Mani (MITRE Corporation, USA)
Jean-Guy Meunier (UQAM, Canada)
Dragomir Radev (University of Michigan, USA)
Antoinette Renouf (University of Liverpool, UK)
Horacio Saggion (Computer Science Department, University of Sheffield, UK)
Dina Wonsever (Université de la République, Uruguay)

Format

Contributions (25 pages maximum) should be submitted in Word, Postscript or Acrobat format. The file styles are provided as part of the regulations on the journal homepage.

Language

Papers should be written in either French or English. However, English submissions will only be accepted from non-French speakers.
Dates

Submission deadline: February 10th, 2004
Final committee decisions: April 15th, 2004
The camera-ready version of the accepted articles should reach the editors by June 1st, 2004, for publication in 2004.

Those who intend to submit an article are encouraged to contact Dr. Jean-Luc Minel (jean-luc.minel at paris4.sorbonne.fr).

Paper submission

Articles should be submitted by electronic mail to jean-luc.minel at paris4.sorbonne.fr or by normal mail to the following address:

Jean-Luc MINEL
Laboratoire LaLICC, UMR 8139 (CNRS - Université Paris-Sorbonne)
ISHA, 96 Boulevard Raspail
75 006 Paris

From alexis.nasr at LINGUIST.JUSSIEU.FR Fri Jan 16 17:12:47 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Fri, 16 Jan 2004 18:12:47 +0100
Subject: APPEL: Workshop on COMPILING AND PROCESSING SPOKEN LANGUAGE CORPORA
Message-ID:

This message was posted to several lists. We apologize for any cross-postings.
2ND CALL FOR PAPERS

Workshop on COMPILING AND PROCESSING SPOKEN LANGUAGE CORPORA
http://lands.let.kun.nl/CPSLC/

Centro Cultural de Belem, Lisbon, Portugal
24th May 2004

Workshop to be held in conjunction with the 4th International Conference on Language Resources and Evaluation (LREC 2004)
Main conference: 26-27-28 May 2004
http://www.lrec-conf.org/lrec2004/

Aim

The aim of the workshop is to bring together people working on the development (compilation and processing) of spoken language corpora.* The workshop will provide participants with the opportunity to exchange views and share experiences. Moreover, the workshop is instrumental in taking stock of and evaluating the present state of the art. The workshop thus aims to contribute to the development of a future roadmap that will guide the development of standards, tools, etc. for use with spoken language corpora.

*The term "spoken language corpora" is used here to distinguish such corpora from speech corpora or speech databases: speech corpora are collections of spoken data that are typically recorded for specific purposes by specific users (speech corpora/databases such as SpeechDat Car that are used for developing consumer applications). Usually such databases lack the richness of linguistic annotations that is pursued for spoken language corpora.

Background and motivation

Despite the wide experience gained in the compilation of written language corpora, working with spoken language data is not immediately straightforward, as spoken language involves many novel aspects that need to be taken care of. The fact that spoken language is transient is sometimes offered as an explanation for why it is more difficult to collect spoken data than it is to compile a corpus of written data. However, it is not just the capturing of data that is anything but trivial. Once the (audio) data have been collected and stored, the next step is to produce some kind of transcript (whether orthographic or phonetic).
Further annotations such as POS tagging, lemmatisation, syntactic annotation, and prosodic annotation may then build upon this transcription.

Among the problems encountered in the processing of spoken language data are the following:
* There is as yet little experience with the large-scale transcription of spoken language data. Procedures and guidelines must be developed, and tools implemented.
* Well-established practices that have originated from working on written language corpora do not hold up when trying to cope with the idiosyncrasies of the spoken language. This is true for all levels of linguistic annotation. Annotation schemes need to be reconsidered and tools must be adapted.
* In so far as standards have emerged (e.g. CES), they need to be adapted in order to cater for the needs of spoken language corpora.
* By their very nature, spoken language corpora bring together speech and language technologists and linguists from various backgrounds. Ideally, such corpora should address the needs of all these different user groups. Often, however, there is a conflict of interest. For example, recordings of spontaneous conversations in noisy environments, although highly interesting and worthwhile from a linguistic perspective, will prove too poor in quality to be of any use to someone doing research into speech recognition.

Workshop topics

Topics of interest include orthographic transcription, phonetic transcription, prosodic annotation, segmentation, POS tagging and lemmatisation, parsing, and discourse analysis. Contributions on the development and implementation of standards or guidelines for spoken language corpora (annotation schemes, meta-data descriptions) are also invited, as are contributions describing software for the exploitation of spoken language corpora.

Format of the Workshop

The workshop will consist of oral presentations of previously submitted papers that went through a double peer review process.
The proceedings of the workshop will be published by the local organising committee.

Important dates

24th January 2004: Deadline for submission of (full) papers
1st March 2004: Notification of acceptance and preliminary programme
21st March 2004: Deadline for submission of final versions of accepted papers for the proceedings
3rd April 2004: Definitive programme
24th May 2004: Workshop

Submissions

Prospective authors are invited to submit papers for oral presentation. Only full papers in English will be accepted, and the length of the paper should not exceed 6000 words (or the equivalent in space for diagrams). Submissions in MS Word, Postscript, PDF or RTF should be made through the workshop website: http://lands.let.kun.nl/CPSLC/

Registration

Workshop participants need to register through the LREC website: http://www.lrec-conf.org/lrec2004/
The fee for this half-day workshop is 50 Euro for conference participants and 85 Euro for others, and includes a coffee break and the workshop proceedings.

Organising committee

Nelleke OOSTDIJK, University of Nijmegen
Gjert KRISTOFFERSEN, University of Bergen
Geoffrey SAMPSON, University of Sussex

Programme committee

Daan BROEDER, Max Planck Institute
Emanuela CRESTI, University of Florence
Gjert KRISTOFFERSEN, University of Bergen
Tony MCENERY, University of Lancaster
Nelleke OOSTDIJK, University of Nijmegen
Pavel IRCING, University of Western Bohemia
Geoffrey SAMPSON, University of Sussex
Antonio Moreno SANDOVAL, University of Madrid
Jean VÉRONIS, Université de Provence

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 19 19:27:23 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 19 Jan 2004 20:27:23 +0100
Subject: Appel: Workshop : Beyond Named Entity Recognition Semantic labelling for NLP tasks
Message-ID:

SECOND ANNOUNCEMENT AND CALL FOR PAPERS

Workshop
Beyond Named Entity Recognition
Semantic labelling for NLP tasks
URL: http://ai-nlp.info.uniroma2.it/ws_lrec04/

Centro Cultural de Belem, LISBON, Portugal
25th May 2004

In Association with
4th INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION LREC2004
Main conference 26-27-28 May 2004

Motivation and Aims

Although it is generally assumed that improvements in language processing will be made through the integration of linguistic information and statistical techniques, the reality is that language is very diverse, and looking for specific patterns of words that repeat often enough to be statistically significant tends not to be a very fruitful task: sequences longer than three words are not generally repeated often enough to be statistically significant. At the same time, the identification of named entities (names, dates, places, organizations, etc.) has proved to be a very useful preliminary task in many natural language processing systems. We are interested in pursuing approaches which extend this notion by identifying and labeling other semantic information in a text, in such a way as to allow repeatable semantic patterns to emerge.
Our interest is in attacking the data sparseness problem by exploring ways to collapse (semantically) related phrases which are expressed by different word sequences. Although this seems closely related to previously proposed class-based language models (see, for example, Brown et al. 90 in Computational Linguistics), it is distinguished by the fact that the empirical notion of classes used in the previous work (e.g. classes made up of collocationally similar words) is replaced by semantically justified sets. Note that Named Entity (NE) tagging and Word Sense Disambiguation (WSD) represent, in terms of granularity and representational complexity, two extremes of a single general problem: semantic disambiguation. Semantic disambiguation thus serves the purpose of improving the generalization power of statistical models. One of the questions here is how to determine a suitable level of clustering (for NE identification and for WSD) that would lead to high accuracy and to improved performance of the resulting statistical models.

Reason of Interest

It should be noted that a body of independent research that has recently focused on the statistical treatment of semantic phenomena (e.g. WordNet navigation as a stochastic process, as studied in Light and Abney or in Ciaramita & Johnson) correlates highly with the research program proposed above. The workshop will provide a forum where experience from lexical semantics and statistical learning will be presented, and where fruitful discussion among researchers in both fields will be promoted. The workshop is expected to attract researchers and practitioners from a range of areas, as well as developers of large-scale semantic resources who are interested in effective methods of semantic labeling.
Topics (to be addressed in the workshop include, but are not limited to)

* Methods for lexical-semantic annotation of corpora
* Methods and Standards for lexical semantic representation of dictionary information
* Lexico-semantic taxonomies
* Existing sources of classification: dictionaries, thesauri and computerized ontologies
* Corpus-driven methods for semantic disambiguation
* Feature selection for semantic disambiguation
* Lexico-semantic tagging of very large corpora
* Algorithms and methods for disambiguation of semantic phenomena
* Statistical learning models and their applications to semantic labeling
* Computational learning frameworks for Natural Language Learning
* Semi-supervised and unsupervised statistical semantic disambiguation
* Evaluation of semantic disambiguation

Workshop format

The workshop will be a half-day event with position statements from invited speakers (half an hour each) and two hours for 4-6 presentations of scientific papers. Submissions are intended to present works in progress and more completed works which fall within the scope defined by the topics listed above. A final 1-hour open discussion among all the workshop participants will be moderated by the organizers. In order to stimulate an interesting general discussion, each member of the program committee will be invited to submit a position statement of max. 1000 words.

Submission

Participants are invited to submit an extended abstract of max. 3500 words concerning one or more of the topics of interest. Each accepted paper receives a slot of 25 minutes for presentation (15 minutes talk and 10 minutes for discussion). Each submission should show: title; author(s); affiliation(s); and contact author's e-mail address, postal address, telephone and fax numbers. Submissions must be sent electronically in PDF to the following address:

Roberto Basili
Dept.
of Computer Science, Systems and Management
University of Roma Tor Vergata
e-mail: basili at info.uniroma2.it

Proceedings and Publications

Proceedings of the workshop will be printed by the LREC Local Organising Committee. The Computer, Speech and Language journal will dedicate a Special Issue on Semantic tagging/labelling for NLP tasks to the workshop topics. Relevant papers submitted to the workshop will be selected to appear in that special issue.

Important dates

Extended abstract submission (max. 3500 words): 2nd of February 2004
Notification of acceptance: 5th of March 2004
Preliminary Program: 10th of March 2004
Submission of the final version of paper: 20th of April 2004
Workshop: 25th May 2004

Organising Committee

Louise Guthrie - University of Sheffield, UK
Roberto Basili - University of Rome, Tor Vergata, Italy
Eva Hajicova - Charles University, Czech Republic
Frederick Jelinek - Johns Hopkins University, Maryland, USA

Further Information

For any information related to the organization, please contact:

Roberto Basili
e-mail: basili at info.uniroma2.it
Dept. of Computer Science, Systems and Management
University of Roma Tor Vergata
Via di Tor Vergata
00133 Roma (ITALY)
tel: +39 06 72597391
fax: +39 06 72597460

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 19 19:27:21 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 19 Jan 2004 20:27:21 +0100
Subject: Appel: JEP/TALN2004 : AUTOMATIC PROCESSING OF ARABIC : DEADLINE EXTENSION
Message-ID:

!!!!!! Deadline extended to 20/01/2004 !!!!!!

****************************************************
JEP 2004 - TALN 2004
- Special Session -
AUTOMATIC PROCESSING OF WRITTEN AND SPOKEN ARABIC
2nd call for papers
Palais des Congrès, Fès (Morocco)
April 19-22, 2004
http://www.lpl.univ-aix.fr/jep-taln04/
http://www.fsdmfes.ac.ma/jep-taln04/
****************************************************

By virtue of its morphological, syntactic, phonetic and phonological properties, Arabic is regarded as one of the languages that are difficult to handle in the automatic processing of written and spoken language.

In the field of automatic processing of written Arabic, research began around the 1970s, even before the problems of editing Arabic texts had been fully mastered. Early work focused in particular on lexicons and morphology. Over the past ten years or so, the internationalisation of the Web and the proliferation of means of communication in Arabic have revealed a large number of applications for Arabic NLP. Research has thus begun to address more varied issues such as syntax, machine translation, automatic document indexing, information retrieval, etc.

In the field of automatic processing of spoken Arabic, considerable progress has been made thanks to improved signal processing technologies and to richer knowledge of prosodic and segmental characteristics and of the various acoustic models relating to Arabic word patterns (schèmes). These results should make it possible to better tackle varied and innovative areas such as speech recognition and synthesis, spoken translation, or automatic recognition of the speaker and of the speaker's geographical origins, etc.

The aim of this session is to bring together researchers working on the automatic processing of Arabic, from both the written language and the spoken language communities. The meeting will be an opportunity to take stock of advances in these areas, at the scientific and application levels and in monolingual or multilingual contexts. Strengthening collaboration between the written and spoken Arabic communities is also one of the aims of this session.

THEMES

The themes to be addressed in this session on the automatic processing of written and spoken Arabic include, but are not limited to:
- Speech recognition and understanding,
- Speech synthesis,
- Automatic prosody generation,
- Recognition of the language, of the speaker and of the speaker's geographical origins,
- Arabic corpora and language resources,
- Speech acquisition in synthesis and automatic speech recognition systems,
- Morphology,
- Syntax,
- Semantics,
- Analysis and generation,
- Discourse analysis,
- Automatic summarization,
- Dialogue,
- Machine translation.
CALENDAR

Submission deadline: 20 January 2004
Notification to authors: 20 February 2004
Final version (camera-ready): 8 March 2004
Conference: 19-22 April 2004

SELECTION CRITERIA

Authors are invited to submit original research work that has not been previously published. Submissions will be reviewed by at least two specialists in the field. The following will be considered in particular:
- the importance and originality of the contribution,
- the correctness of the scientific and technical content,
- the critical discussion of the results, in particular with respect to other work in the field,
- the positioning of the work in the context of international research,
- the organisation and clarity of the presentation,
- the relevance to the conference themes.

LANGUAGES

Papers must be written in French or English.

SUBMISSION FORMAT

The PDF format MUST be used. In certain special cases, we will accept contributions in RTF (Word) format. Submitted papers must not exceed 6 to 10 pages in Times 12, single-spaced, i.e. about 3000 words, including figures, examples and references. Papers must be in A4 format.
- Download the LaTeX style sheet:
- Download the Word template (French version):
- Instructions for creating PDF files:

SUBMISSION PROCEDURE

Authors must send their submission as a document attached to an e-mail containing the title of the paper and the name, affiliation, postal address, e-mail address, telephone number and fax number of the main author. E-mail submissions must be sent to the following address:
The subject of the message must include the words: JEP-TALN-2004-Arabic
If submission by e-mail is not possible, a submission by post will be accepted.
A floppy disk and 3 paper copies of the contribution must be sent to one of the following two addresses:

Malek Boualem
France Telecom R&D - DMI/GRI
2, avenue Pierre Marzin
22307 Lannion - France

or

Noureddine Chenfour
Département de Math. et Informatique
Faculté des Sciences Dhar El Mahraz, Fès
BP : 1796 Atlas, Fès - Maroc

SCIENTIFIC COMMITTEE

- Abderrahim Benabbou, FST de Fès, Morocco.
- Mohammed Benkhalifa, Faculté des Sciences, Rabat, Morocco.
- Thami Benkirane, Université Sidi Mohammed, Morocco.
- Malek Boualem, France Telecom R&D, France.
- Achraf Chalabi, Sakhr, Egypt.
- Noureddine Chenfour, Université Sidi Mohammed, Fès, Morocco.
- Khalid Choukri, ELRA/ELDA, France.
- Fethi Debili, CNRS, Paris, France.
- Emilie De Neef, France Telecom R&D, France.
- Joseph Dichy, Université Lumière-Lyon 2, France.
- Everhard Ditters, University of Nijmegen, Netherlands.
- Mohamed Embarki, Laboratoire de Phonétique Montpellier, France.
- Mohammed Hassoun, ENSSIB, Lyon, France.
- Med Tayeb Laskri, Université Badji Mokhtar, Algeria.
- Fabrice Lefevre, LIMSI, Université Paris-Sud Orsay, France.
- Chafic Mokbel, Université Balamand, Lebanon.
- Abdelhak Mouradi, ENSIAS Rabat, Morocco.
- Omar Nouali, CERIST, Algeria.
- Abdenbi Rajouani, ENS de Fès, Morocco.
- Mustafa Yaseen, ATS Online, Jordan.
- Mohamed Yeou, Université Chouaib Doukkali El-Jadida, Morocco.
- Chakir Zeroual, Université Sidi Mohamed, Fès, Morocco.
- Adnane Zribi, ISG, Université de Tunis, Tunisia.

****************************************************

------------------------------------------------------
Malek Boualem
France Telecom R&D - DMI/GRI
2, avenue Pierre Marzin - 22307 Lannion - France
Tel: (33)(0)2.96.05.29.83
Fax: (33)(0)2.96.05.32.86
Email: malek.boualem at rd.francetelecom.com
------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Mon Jan 19 19:27:24 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Mon, 19 Jan 2004 20:27:24 +0100
Subject: Appel: MODELLING AND DESCRIBING DISCOURSE ORGANISATION IN THE AGE OF THE DIGITAL DOCUMENT
Message-ID:

===============================================
2nd CALL FOR PAPERS:
===============================================
MODELLING AND DESCRIBING DISCOURSE ORGANISATION IN THE AGE OF THE DIGITAL DOCUMENT
===============================================

A Workshop proposed by ATALA as part of the Digital Document Week (http://www.univ-lr.fr/sdn2004/)
La Rochelle, 22 June 2004
organised by Marie-Paule Péry-Woodley, ERSS/Université de Toulouse-Le Mirail (pery at univ-tlse2.fr)

The Digital Document Week aims to gather research communities dealing with digital documents from a variety of angles: media, technical and social modes of mediation, relation with human activity. This ATALA workshop wishes to broach these questions from a linguistic point of view, focussing on digital documents as discourse, characterised by an internal organisation which needs to be understood and may be exploited in computer-based systems. The workshop aims to bring together three research areas concerned with the development of digital documents: the study of discourse organisation, corpus linguistics, and computer-based applications for the exploitation of digital documents.
For text and discourse linguistics, the proliferation of digital documents leads to new opportunities and new research questions, such as:
- the application of corpus analysis methods to discourse: what kind of data can be regarded as relevant at this level of linguistic investigation?
- the development of novel ways of accessing documents, which leads to a new emphasis on text structure and the potential exploitation of surface markers;
- the impact of new document types on basic concepts in the field: cohesion, coherence, metadiscursive signalling.

This workshop on written discourse organisation aims to bring together research from three domains which must seek points of convergence in the light of these new prospects:

1. Discourse organisation

In order to apprehend a sequence of utterances as discourse, it is necessary to understand its organisation (to identify its segments and perceive their hierarchy and their relations). An old and fertile tradition approaches discourse organisation via the notion of discourse relations: semantico-pragmatic links between segments (propositions or sets of propositions) (cf. Péry-Woodley (ed.) 2001). Other modes of organisation may be envisaged, via the notion of theme or topic for instance, or more recently through the discourse framing hypothesis (Charolles 1997). Research in this field can be placed on a continuum from pure 'conceptual' modelling to empirical methods (automatic segmenting, cf. Hearst 1997; shallow analyses, human or automatic, cf. Teufel & Moens 1999). The challenge is to hold both ends of the continuum in order to draw connections between the way 'things are put' in texts and the processes underlying discourse organisation at different levels of granularity (local vs. global organisation).
The relationship between modelling approaches and empirical research has often seemed problematic, with empirical studies running the risk of losing track of structure as they focus on surface markers, while conceptual models tend to be difficult to test empirically. Corpus-based approaches, greatly facilitated by progression into the digital age, are in the process of bringing considerable changes to the discourse field, as they have done elsewhere in linguistics (Conrad 2002).

2. Corpus-based studies of linguistic correlates of discourse organisation

As noted by several authors (Biber et al. 1998 inter alia), though research on discourse organisation tends to make regular use of authentic data, the corpus is often seen as a source of examples rather than the object of the analysis as such. The implementation of a fully-fledged 'corpus approach' in the field of discourse organisation carries with it many difficulties: corpus construction (common sampling-based techniques make it impossible?), the role of quantitative analysis, and most of all the definition of relevant data making it possible to draw the connection between surface markers (which may be just epiphenomena) and the multiple principles underlying complex hierarchic organisation. A gap can also be observed between linguistic approaches (low coverage and high reliability) and numerical approaches (high coverage and low reliability). Articulating these approaches may open new prospects, leading to fresh insights into discourse organisation principles as well as more operational methods for applications.

3. Computer-based systems for the exploitation of digital documents

Applications for which the relevant unit is the whole document are little concerned with questions of discourse organisation, but those concerned with intra-document browsing, selective synthesis or multi-level visualisation must work their way inside the documents and therefore cannot consider them as simple 'bags of words': they have to take into account the organisation into thematic or rhetorical chunks and text architecture (cf. Luc & Virbel 2001). These objectives bring about new research questions, in particular around the articulation of different organisational levels in long documents (where browsing aids acquire particular relevance).

This call for papers concerns researchers who are already working on these interactions, as well as those whose work is in one of the domains referred to but who are interested in a dialogue with other discourse approaches. Descriptive studies which pay specific attention to methodology will be particularly welcome.

Some relevant themes (non-exhaustive list):
- identification of objects or text zones corresponding to text or discourse acts (conclusions, explanations, evaluations, ...)
- discourse organisation markers (from markers to relations: inductive approach): connection, indexing (discourse frames), textual metadiscourse
- linguistic characterisation of discourse functions (from functions to markers: deductive approach)
- segmentation (automatic or manual): 'topic shifts', clues to segment boundaries (lexico-syntactic, typographical, dispositional)
- articulation between local and global organisation
- impact of discourse genre on discourse organisation and its linguistic markers
- analysis and exploitation of document architecture
- topological approaches
- discourse annotation

SUBMISSION (MODALITIES)

A summary (2-4 pages, Word, pdf or ps) to be e-mailed by January 30th 2004 to Marie-Paule Péry-Woodley.
Notification of acceptance will be given by March 15th 2004.

***************************************************************************
References

Biber, D., Conrad, S., & Reppen, R. (1998). Corpus linguistics: Investigating language structure and use. Cambridge: Cambridge University Press.
Conrad, S. (2002). Corpus linguistics approaches for discourse analysis. Annual Review of Applied Linguistics, 22, 75-95.
Charolles, M. (1997). L'encadrement du discours : Univers, champs, domaines et espaces (Cahier de Recherche Linguistique 6): Université de Nancy2.
Hearst, M. (1997). TextTiling: segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23(1), 33-64.
Luc, C., & Virbel, J. (2001). Le modèle d'architecture textuelle : fondements et expérimentation. Verbum, 23(1), 103-123.
Péry-Woodley, M.-P. (ed.) (2001). Cohérence et relations de discours à l'écrit. Présentation. Verbum, 23(1).
Teufel S. & Moens, M. (1999). Discourse-level argumentation in scientific articles: human and automatic annotation. In: Towards Standards and Tools for Discourse Tagging. ACL 1999 Workshop.

___
Marie-Paule PERY-WOODLEY
___________________________________________________________________
ERSS / Sciences du Langage Universite de Toulouse Le Mirail Tel.: 33(0)5 61 50 46 76/-36 09
5 allees Antonio-Machado Fax: 33(0)5 61 50 42 12
F-31058 TOULOUSE CEDEX Email: pery at univ-tlse2.fr
-------------------------------------------------------------------------
Message diffusé
par la liste Langage Naturel Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/
English version : http://www.biomath.jussieu.fr/LN/LN/
Archives : http://listserv.linguistlist.org/archives/ln.html
La liste LN est parrainée par l'ATALA (Association pour le Traitement Automatique des Langues)
Information et adhésion : http://www.atala.org/
-------------------------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:30 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 21 Jan 2004 18:09:30 +0100
Subject: Conf: ATALA : Characterisation of Internet content: beyond keywords. Semantic approaches
Message-ID:

======================================================
Workshops of the Association pour le Traitement Automatique des LAngues (ATALA)

Subject: Characterisation of Internet content: beyond keywords. Semantic approach.

Workshop organized by : François Rastier (CNRS - UMR 7114, Paris X - MoDyCo), Natalia Grabar (CRIM/INaLCO, STIM / DSI / AP-HP, Paris 6) and Thomas Beauvisage (France Télécom R&D - DIH/UCE, Paris X - MoDyCo)

Date: Jan. 31 2004
Location: ENST, 49, rue Vergnault, 75013 Paris
Amphithéâtre Emeraude
Métro : Corvisart

Free entry
ATALA membership recommended (http://www.atala.org)
Contact for this workshop : indices.internet at ml.free.fr

Program:

9h15 Presentation of the Workshop
9h30 Thomas Beauvisage (France Télécom R&D) Utiliser les annuaires du Web pour décrire les parcours sur la Toile (Using Web directories to describe users' paths)
10h00 Kamel Smaïli and Armelle Brun (LORIA) Routage automatique de courriers électroniques (Automatic routing of emails)
10h30 Break
11h00 Antoine Marzin, Lionel Martin, Christel Vrain and Guillaume Cleuziou (LIFO, U.
Orléans) Classification de pages Web en Genre (Genre-based Web pages classification)
11h30 Martine Hurault-Plantet (LIMSI-CNRS) Sélection de traits et détection de thèmes pour l'analyse d'un corpus de pages personnelles Web (Selection of traits and topic detection for the analysis of a corpus of personal Web pages)
12h00 Lunch
14h00 Aurélie Névéol, Lina Soualmia, Alexandrina Rogozan, Magaly Douyère, Benoît Thirion, Stéfan Darmoni (CISMeF, Rouen / PSI-CNRS / U. Rouen) Caractérisation des contenus de l'Internet en santé : l'exemple CISMeF (Characterisation of Health-related Internet content: the CISMeF example)
14h30 Mathieu Valette (CRIM, Inalco) Projet Princip : application de règles sémantiques à la détection de documents racistes sur Internet (The Princip project: application of semantic rules to the detection of racist documents on the Internet)
15h00 Break
15h30 Monika Nicinski (CRIM, Inalco) Typologie et description sémantique des images utilisées dans les sites Internet racistes (Typology and semantic description of images used in racist Web sites)
16h00 François Rastier (CNRS - UMR 7114, Paris X - MoDyCo) La sémiotique du document numérique et son incidence sur les traitements sémantiques (The semiotics of the electronic document and its impact on semantic processing)
16h30 Round table
17h00 End of the Workshop

Important: On Saturdays, access to ENST is via rue Vergnaud (on the other side of the block from rue Barrault). Remember to bring the day's programme with you; you will be asked for it at the security desk.
===================================================
______________________________
Thomas Beauvisage
France Télécom R&D/DIH/UCE
38-40, rue du Général Leclerc
92794 Issy Moulineaux Cedex 9 - France
Tel : + 33 (0)1 45 29 58 11
Fax : + 33 (0)1 45 29 01 06
http://www.francetelecom.com/rd/

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:35 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 21 Jan 2004 18:09:35 +0100
Subject: Appel: WORKSHOP ON THE REPRESENTATION AND PROCESSING OF SIGN LANGUAGES
Message-ID:

CALL FOR PAPERS
==============================
WORKSHOP ON THE REPRESENTATION AND PROCESSING OF SIGN LANGUAGES
Workshop URL: http://dev.eurac.edu:8080/sign/index.html

From SignWriting to Image Processing. Information techniques and their implications for teaching, documentation and communication.

Workshop on the occasion of the 4th International Conference on Language Resources and Evaluation (LREC 2004)

Date: 30-May-2004 (at the closing of the main conference LREC 2004)
Location: Lisbon, Portugal
Linguistic Sub-field: SignWriting, Computational Linguistics, Corpus Linguistics, Image Processing, Lexicography

Meeting Background
==============================
Sign Languages are the languages used by deaf communities for non-written communication. This kind of linguistic code relies on the visual-gestural modality of communication. Sign languages all share properties which do not exist in spoken languages, especially through co-occurring sign elements. For storing and retrieving information, sign languages may be encoded and processed electronically. Different techniques are possible and have been proposed.
The workshop will focus on the problem of representing sign languages electronically in order to facilitate communication among the deaf as well as between deaf and hearing people. Moreover, it will promote the documentation and teaching of sign languages to both communities and stimulate linguistic research on sign languages. The task of transcribing these languages onto electronic media as a main technique for storing, retrieving or communicating (email, telephone, snail mail, children's e-book) is technically and linguistically challenging. Recent advances in corpus linguistics and image processing, together with the development of XML standards, promise to pave the way for a broader application of these techniques. The workshop sets out to provide an introduction to the different approaches and techniques currently employed, and to discuss their applications and respective advantages.

Preliminary Meeting program
==============================
30-May-2004
9:00-10:30: Presentations of invited talks
- Richard Gleaves (Deaf Action Committee For SignWriting)
- Thomas Hanke (Institute of German Sign language and Communication of the Deaf, University of Hamburg)
- Carol Neidle & Robert G. Lee (Department of Modern Foreign Languages and Literatures, Boston University, Boston MA)
11:00-18:00: Oral presentations, poster presentations and demos

Call for papers
==============================
Papers are invited on substantial, original and unpublished research on all aspects of sign language representation and processing, including, but not limited to:
*sign writing
*corpus construction for sign languages
*sign language dictionaries
*sign language technologies
*e-learning of sign languages
*any topic related to sign language treatment and processing

Submissions of papers for oral and poster presentations should follow the same style as regular LREC papers and not be longer than 6000 words. The final details will be published as soon as they become available.
Demonstrations and related tools will be reviewed as well. You should send an outline of about 400 words. If a demo is connected to a paper, please attach the outline to the paper.

The papers and demonstration outlines, written in English, should be attached to an email message sent to the following address (ostreiter at eurac dot edu). Please include the name and the affiliation of the author(s) in the body of the email message. The deadline for paper submission is February 11th, 2004. Notice of acceptance or rejection will be sent on February 24, 2004.

We allow simultaneous paper submission to the workshop and the LREC main conference. If a paper is accepted by both the conference and the workshop, the paper will be presented at the conference, rather than at the workshop. The author(s) should notify the workshop chair. Papers will be published in the proceedings of this workshop (each workshop and the main conference have separate proceedings) and may, depending on conference policy, be included in the main conference CD-ROM.

Organizing Committee
==============================
- António Carlos da Rocha Costa (Escola de Informática, Universidade Católica de Pelotas, Brazil): rocha at atlas.ucpel.tche.br
- Carol Neidle (Department of Modern Foreign Languages and Literatures, Boston University, Boston MA): carol at bu.edu
- Chiara Vettori (Language and Law, European Academy Bolzano, Italy): cvettori at eurac.edu
- Christian Retoré
(Laboratoire Bordelais de Recherche en Informatique, France): retore at labri.fr
- Eva Safar (School of Computing Sciences, University of East Anglia, Norwich, England): esafar at yahoo.com
- Ian Marshall (School of Computing Sciences, University of East Anglia, Norwich, England): im at cmp.uea.ac.uk
- Marco Consolati (Cooperativa Alba, Torino, Italy): bigmark at mclink.it
- Oliver Streiter (Language and Law, European Academy Bolzano, Italy): ostreiter at eurac.edu
- Patrice Dalle (Équipe "Traitement et Compréhension d'Images", IRIT - Université Paul Sabatier, France): dalle at irit.fr

Important Dates
==============================
11 February 2004 : Deadline for paper submissions
25 February 2004 : Notification of acceptance to authors
** : Deadline for Camera-ready papers
30 May 2004 : Workshop

Contact
==============================
Contact person: Oliver Streiter
Contact Email: ostreiter at eurac.edu
Workshop URL: http://dev.eurac.edu:8080/sign/index.html

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:42 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 21 Jan 2004 18:09:42 +0100
Subject: Conf: COLDOC' 2004 : NEW DEADLINE FOR SUBMISSION : 15 FEBRUARY 2004
Message-ID:

(apologies for multiple postings)

COLDOC' 2004
CALL FOR PAPERS
The Setting up of Observables in Linguistics

**************************
NEW DEADLINE FOR SUBMISSION : 15 FEBRUARY 2004
**************************

Young researchers' conference - Nanterre, France - April 29 & 30, 2004

The young researchers of the Modèle, Dynamique, Corpus research team (UMR 7114 CNRS - Université Paris-X Nanterre) are organizing a young researchers' conference, scheduled for April 29 and 30, 2004, on the Université Paris X-Nanterre campus.

The setting up of observables in linguistics is the central topic of this conference, i.e. defining and making use of both attested and constructed data. Young researchers from all fields and domains of linguistics are, therefore, invited to submit a paper. Postgraduate and Ph.D. students and postdocs are invited to share insights and experience from their respective research areas. We expect communications addressing methodological and theoretical issues related to the process of setting up linguistic data, as well as to data collection and use.
For example, communications addressing one of the following issues are expected:
- Relevance and selection of linguistic data;
- Corpora and emerging linguistic phenomena;
- Oral, written or signed data collection methodology and practice;
- Questions related to corpus-related tools, transcription and encoding;
- The use and place of quantitative methods, both generic and specific;
- Qualitative methods;
- Language, text genres or discourse comparison.

Each conference session will start with an invited speaker's talk. A roundtable will be held at the end of the conference. Communications should last 20 minutes, followed by 10 minutes for questions.

The deadline for proposals is set for January 26, 2004. Communication proposals will be evaluated anonymously by the scientific committee. Authors are invited to send two separate files, in Word format: first, a two-page summary (3000 characters) of their communication; second, a file stating the authors' names, e-mail addresses and affiliations, together with the title of their communication. Authors may also state their preference regarding the format of their communication: oral or poster.

Communications will be evaluated according to a range of selection criteria, favoring those papers which fully address the issue stated above, which show methodological relevance and scientific interest, and which state their point clearly.

Communication proposals, as well as other requests, should be addressed to: , or by postal mail, to the following address:

ColDoc' 2004
MoDyCo (UMR 7114)
Secrétariat sciences du langage
Université Paris-X Nanterre, Bât. L
200, avenue de la République
92001 Nanterre Cedex
France

We look forward to welcoming you at Nanterre Université on the occasion of the conference.

The Organizing Committee: Antonio Balvet, Sophie Hamon, Sylvain Loiseau, Ali Tifrit, Cécile Vigouroux.
Scientific Committee:
---------------------
Driss Ablali
Karine Baschung
Gabriel Bergounioux
Simon Bouquet
Nick Clements
Marcel Cori
Sophie David
Annie Delaveau
Bernard Fradin
Françoise Gadet
Nathalie Gasiglia
Philippe Gréa
Françoise Kerleroux
Mark Klein
Anne Lacheret
Bernard Laks
Sarah Leroy
Colette Noyau
Thierry Poibeau
François Rastier
Tobias Scheer
Pascale Sébillot
Anna Sores
Nathalie Vallée
Florence Villoing
Geoffrey Williams

Important dates:
---------------------
New submission deadline: February 15, 2004
Authors' notification of acceptance: March 22, 2004
Conference: April 29 & 30, 2004

The Setting up of Observables in Linguistics
ColDoc'2004 - Modyco (UMR 7114) young researchers' conference
Paris X Nanterre, Salle des colloques, Bâtiment B
200, avenue de la République
92001 Nanterre Cedex
France
Web site: http://infolang.u-paris10.fr/modyco/textes/actualites/Page.html

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:45 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 21 Jan 2004 18:09:45 +0100
Subject: Appel: ACL2004
Message-ID:

ACL2004 NEWSLETTER NO. 1 (January 20, 2004)

The Association for Computational Linguistics invites the submission of papers for its 42nd Annual Meeting hosted jointly with the European Chapter of the ACL.
Papers are invited on substantial, original, and unpublished research on all aspects of computational linguistics. ACL 2004 will be held at the new Barcelona Forum Convention Centre, which is scheduled to be completed in December 2003 and to open officially in April 2004. The ACL meeting will be part of the programme of the Forum of Cultures that will take place in Barcelona from April to September 2004.

:: Main Dates
Conference Tutorials: July 21, 2004
Main Conference: July 22-24, 2004
Post-conference Workshops: July 25-26, 2004
Paper submission due: February 25, 2004

:: Contents
This newsletter includes:
1. Area chairs of the main conference
2. Mentoring service
3. Call for sponsorship
4. List of accepted workshops
5. Programme Committee of poster and demos
6. Call for papers of student research workshop
7. Call for tutorials
8. Others
More details can be found at the website of the ACL2004 Conference.

:: Area chairs of the main conference
Elisabeth Andre (University of Augsburg, Germany)
Jill Burstein (Educational Testing Service, USA)
Claire Cardie (Cornell University, USA)
Pascale Fung (University of Science and Technology, Hong Kong)
Hitoshi Isahara (Communications Research Laboratory, Japan)
Michael Johnston (AT&T, USA)
Rada Mihalcea (University of North Texas, USA)
Jon Oberlander (University of Edinburgh, UK)
Kemal Oflazer (Sabanci University, Turkey)
Kees van Deemter (University of Brighton, UK)
Antal van den Bosch (University of Tilburg, The Netherlands)

:: Mentoring service
ACL is providing a mentoring (coaching) service for authors from regions of the world where English is not the language of scientific exchange. Many authors from these regions, although able to read the scientific literature in English, have little or no experience in writing papers in English for conferences such as the ACL meetings. The service will be arranged as follows.
A set of potential mentors will be identified by Richard Power, who has agreed to organize this service for ACL'04. If you would like to take advantage of the service, send a draft of your paper to:

Richard Power
Information Technology Research Institute
University of Brighton
Watts Building
Lewes Road
Brighton BN24GJ
UK
+44 1273 642904 (office)
+44 1273 642908 (fax)
Email: Richard.Power at itri.brighton.ac.uk

To take advantage of this service, send the paper electronically to the above email address, using pdf, ps or doc format. Alternatively, hard copy can be sent to the postal address. The paper should arrive before 1st February. An appropriate mentor will be assigned to your paper and the mentor will get back to you by 15th February, at least ten days before the deadline for the submission to ACL'04 program committee.

Please note that this service is for the benefit of the authors as described above. It is not a general mentoring service for authors to improve their papers. If you have any questions about this service please feel free to send a message to Richard Power.

:: Call for sponsorship
Chair: Deborah Dahl (Conversational Technologies, USA)
Local Chair: Antònia Martí (University of Barcelona, Spain)

On behalf of the Association for Computational Linguistics (ACL), we invite commercial, government, and academic organizations who value and wish to promote the field of natural language processing technology to become sponsors of ACL2004, the 42nd Annual Meeting of the ACL. If you are interested in becoming a sponsor, please see the sponsors page for more details. Please also consider exhibiting your products at the conference.

We are happy to announce that the following organizations have agreed to give their support at ACL04.
Ajuntament de Barcelona
Generalitat de Catalunya
Spanish Government
Universitat de Barcelona
Universitat Autònoma de Barcelona
Universitat Politècnica de Catalunya
Universitat Pompeu Fabra
Universitat Ramon Llull

Deadlines:
Sponsorship registration deadline: by 1st April 2004

:: List of accepted workshops
Workshop Committee:
Srinivas Bangalore (AT&T Labs-Research, USA) ACL-2004 Workshop C
Christopher Manning (Stanford University, USA) ACL-2004 Workshop C
Helen Meng (CUHK, Hong Kong) ACL-2004 Workshop C
Marcello Federico (IRST, Italy)

:: Current Themes in Computational Phonology and Morphology
Organizing Committee:
Richard Wicentowski, Swarthmore College
John Goldsmith, University of Chicago
Important dates:
Paper submission deadline: April 16, 2004
Notification of acceptance: May 7, 2004
Camera ready papers due: May 24, 2004
Workshop date: July 26, 2004

:: Discourse Annotation
Organizing Committee:
Bonnie Webber, University of Edinburgh
Donna Byron, Ohio State University
Important dates:
Paper submission deadline: March 22, 2004
Notification of acceptance: April 30, 2004
Camera ready papers due: May 24, 2004
Workshop date: July 25, 2004

:: Incremental Parsing: Bringing Engineering and Cognition Together
Organizing Committee:
Stephen Clark, University of Edinburgh
Matthew Crocker, Saarland University
Frank Keller, University of Edinburgh
Mark Steedman, University of Edinburgh
Important dates:
Paper submission deadline: March 22, 2004
Notification of acceptance: May 3, 2004
Camera ready papers due: May 24, 2004
Workshop date: July 25, 2004

:: Multiword Expressions: Integrating Processing
Organizing Committee:
Takaaki Tanaka, NTT Communication Science Laboratories, Japan
Aline Villavicencio, University of Cambridge, UK
Francis Bond, NTT Communication Science Laboratories, Japan
Anna Korhonen, University of Cambridge, UK
Important dates:
Paper submission deadline: April 1, 2004
Notification of acceptance: May 1, 2004
Camera ready papers due: May 15, 2004
Workshop
date: July 26, 2004

:: Question Answering in Restricted Domains
Organizing Committee:
Diego Mollá, Macquarie University, Australia
José Luis Vicedo, Alicante University, Spain
Important dates:
Paper submission deadline: March 15, 2004
Notification of acceptance: April 15, 2004
Camera ready papers due: May 15, 2004
Workshop date: July 25 or 26, 2004

:: RDF/RDFS and OWL in Language Technology: 4th Workshop on NLP and XML (NLPXML-2004)
Organizing Committee:
Nancy Ide, Vassar College, USA
Laurent Romary, Loria/CNRS, France
Graham Wilcock, University of Helsinki, Finland
Important dates:
Paper submission deadline: April 1, 2004
Notification of acceptance: May 1, 2004
Camera ready papers due: May 15, 2004
Workshop date: July 25, 2004

:: Reference Resolution and Its Applications
Organizing Committee:
Sanda Harabagiu, University of Texas at Dallas
David Farwell, New Mexico State University
Important dates:
Paper submission deadline: April 5, 2004
Notification of acceptance: April 25, 2004
Camera ready papers due: May 15, 2004
Workshop date: July 25-26, 2004

:: SENSEVAL-3 Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text
Organizing Committee:
Phil Edmonds, Sharp Laboratories of Europe
Rada Mihalcea, University of North Texas
Important dates:
Registration: February 2004
Evaluations: March - April 2004
Paper submission deadline: April 20, 2004
Camera ready papers due: May 18, 2004
Workshop date: July 25-26, 2004

:: Tackling the challenges of terascale human language problems
Organizing Committee:
Miles Osborne, Univ.
of Edinburgh
Robert Malouf, San Diego State University
Srinivas Bangalore, AT&T Labs-Research
Important dates:
Paper submission deadline: April 18, 2004
Notification of acceptance: April 30, 2004
Camera ready papers due: May 15, 2004
Workshop date: July 26, 2004

:: 2nd Workshop on Text Meaning and Interpretation
Organizing Committee:
Graeme Hirst, University of Toronto
Sergei Nirenburg, University of Maryland, Baltimore County
Important dates:
Paper submission deadline: April 1, 2004
Notification of acceptance: April 30, 2004
Camera ready papers due: May 16, 2004
Workshop date: July 25-26, 2004

:: Text Summarization Branches Out
Organizing Committee:
Eduard Hovy, Information Sciences Institute, University of Southern California, USA
Marie-Francine Moens (co-chair), Interdisciplinary Centre for Law & Information Technology, Katholieke Universiteit Leuven, Belgium
Dragomir Radev, School of Information and Department of Electrical Engineering and Computer Science, University of Michigan, USA
Stan Szpakowicz (co-chair), School of Information Technology and Engineering, University of Ottawa, Canada
Important dates:
Paper submission deadline: March 25, 2004
Notification of acceptance: April 25, 2004
Camera ready papers due: May 15, 2004
Workshop date: July 25-26, 2004

:: Third SIGHAN Workshop on Chinese Language Processing
Organizing Committee:
Oliver Streiter, Eurac, Italy
Qin Lu, The Hong Kong Polytechnic University
Important dates:
Paper submission deadline: April 1, 2004
Notification of acceptance: May 1, 2004
Camera ready papers due: May 15, 2004
Workshop date: July 25-26, 2004

:: Programme committee of poster and demos
Philippe Blache, Université de Provence, France
Rens Bod, University of Amsterdam, The Netherlands
Christian Boitet, Université
Joseph Fourier, France
Antonio Branco, University of Lisbon, Portugal
Francisco Casacuberta, Universitat Politècnica de València, Spain
Ken Church, ATT Labs, USA
Tomaz Erjavec, Jozef Stefan Institute in Ljubljana, Slovenia
Roger Evans, University of Brighton, UK
Marcello Federico, IRST, Italy
Julio Gonzalo, UNED, Spain
Nancy Ide, Vassar College, USA
Ruslan Mitkov, Wolverhampton, UK
Diego Mollá, Macquarie University, Australia
Stefan Muller, Universität Bremen, Germany
Kemal Oflazer, Sabanci University Istanbul, Turkey
Patrick Paroubek, LIMSI, France
German Rigau, EHU, Spain
Horacio Rodríguez, Universitat Politècnica de Catalunya, Spain
Laurent Romary, INRIA, France
Graham Russell, RALI, Canada
Eric Wehrli, LATL, Switzerland
Shuly Wintner, University of Haifa, Israel
Pierre Zweigenbaum, DIAM, France

:: Call for papers of student research workshop
Faculty Advisor: Justine Cassell (Northwestern University, USA)
Student Co-Chairs:
Daniel Midgley (University of Western Australia, Australia)
Dmitriy Genzel (Brown University, USA)
Leonoor van der Beek (University of Groningen, Netherlands)

The Student Research Workshop is an established tradition at ACL conferences. The workshop provides a venue for student researchers investigating topics in Computational Linguistics and Natural Language Processing to present their work and receive feedback. Participants will have the opportunity to receive feedback both from the general audience and from selected panelists -- experienced researchers who prepare in-depth comments and questions in advance of the presentation. One paper will be selected for the ACL-04 Student Research Workshop Best Paper Award.

We invite all student researchers to submit their work to the workshop. As the main goal of the workshop is to provide feedback, the emphasis is on work in progress. Original and unpublished research is therefore invited on all aspects of computational linguistics. Papers should describe original work, still in progress.
Submission will therefore normally be open only to students who have settled on their thesis direction but who still have significant research left to do; those students in the final stages of their thesis should consider submitting instead to the main conference.

Submissions should follow the two-column format of ACL proceedings and should not exceed six (6) pages, including references. We strongly recommend the use of ACL LaTeX style files or Microsoft Word Style files tailored for this year's conference. Submission must be electronic. The electronic submissions should be sent in an attachment to the following e-mail address: acl04-student at list.cs.brown.edu.

Note that reviewing of papers will be blind; therefore, please make sure your paper shows the title, but no author information. You should likewise not have any self-identifying references anywhere in the paper submitted for review. For example, rather than "We showed previously (Smith, 2001), ...", use citations such as "Smith (2001) previously showed ...".

Deadlines:
Paper submissions deadline: 8th March 2004
Notification of acceptance: 26th April 2004
Camera ready papers due: 25th May 2004

:: Call for tutorials
Tutorials chair: Inderjeet Mani (Georgetown University, USA)

The Program Committee of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL'04) invites proposals for the Tutorial Program for ACL'04.

Proposals for tutorials should contain a title, the instructors' names, the length (3 hours or 6 hours), the expected audience size, followed by a brief (< 500 word) description of the content of the tutorial. The description should explain clearly the relevance of the tutorial to the ACL community. The description should also include a brief outline of the structure of the tutorial broken down by time (what topics will be covered in the different sections of the tutorial, in what order, and how much time for each).
Also, please include a brief statement of what the tutorial attendees can expect to learn from the tutorial, and what background (e.g., information extraction, statistical NLP, etc.) is expected of the attendees. Finally, if you have given a prior tutorial on this subject, or are aware of one, please let us know.

Each proposal should also provide the names, postal addresses, phone numbers, and email addresses of the tutorial speakers, with a one-paragraph statement of their research interests and areas of expertise, along with any links to further on-line information. The proposal should also include any special requirements for technical needs (e.g., internet access).

Proposals should be submitted by electronic mail, in plain ASCII text (iso8859-1). The subject line should be: "ACL'04 TUTORIAL PROPOSAL". Please submit your proposals and address any inquiries to tutorials at acl2004.org.

Deadlines:
Submission Deadline for Tutorial Proposals: 1 February 2004
Notification of acceptance of Tutorial Proposals: 25 February 2004
Tutorial Announcements due: 19 March 2004
Tutorial Course material due: 1 June 2004

:: Others
(i) ACL LaTeX style files or Microsoft Word Style files for this year's conference. http://www.acl2004.org/aclstyles/style.html
(ii) Submissions for the main conference will be entered via a website http://pcger33.uia.ac.be:8080/acl04

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:49 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 21 Jan 2004 18:09:49 +0100
Subject: Conf: CROSS-LANGUAGE EVALUATION FORUM
Message-ID:

(Apologies for multiple postings)

*********************************************************************
CROSS-LANGUAGE EVALUATION FORUM

ELRA is happy to announce that registration for the CLEF 2004 evaluation campaign is now open.

CLEF 2004 - CALL FOR PARTICIPATION
*********************************************************************

The CLEF series of system evaluation campaigns aims at promoting research and development in Cross-Language Information Retrieval. Registration is now open for CLEF 2004. The objective of CLEF 2004 will be to test different aspects of mono- and cross-language information retrieval system performance.
There will be eight tracks this year:
a/ Multilingual Information Retrieval
b/ Bilingual Information Retrieval
c/ Monolingual (non-English) Information Retrieval
d/ Mono- and Cross-Language IR for Scientific Collections (GIRT)
e/ Interactive Cross-Language Information Retrieval (iCLEF)
f/ Multiple Language Question Answering (QAatCLEF)
g/ Cross-language Retrieval in Image Collections (ImageCLEF)
h/ Cross-Language Spoken Document Retrieval (CL-SDR)

IMPORTANT DATES:
- Data Release - from 15 February 2004
- Topic Release - from 15 March 2004
- Submission of Runs by Participants - 15 May 2004 (may vary slightly for some tracks)
- Release of relevance assessments and individual results - from 15 July 2004
- Submission of paper for Working Notes - 15 August 2004
- Workshop - 16-17 September (in conjunction with ECDL 2004)

For full details on the CLEF Agenda and Task Description for 2004 and instructions on How to Participate, see http://www.clef-campaign.org

For further information, contact:
Carol Peters - ISTI-CNR
Tel: +39 050 315 2987
Fax: +39 050 315 2810
E-mail: carol.peters at isti.cnr.it

---------------------------------------------------------------------------
ELRA / ELDA
55-57, rue Brillat-Savarin
75013 Paris FRANCE
Tel: (+33) 1 43 13 33 33 / Fax: (+33) 1 43 13 33 30
URL: http://www.elra.info or http://www.elda.fr
LREC conference: http://www.lrec-conf.org
LangTech forum: http://www.lang-tech.org
---------------------------------------------------------------------------

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:52 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 21 Jan 2004 18:09:52 +0100
Subject: Appel: International Conference on Formal Ontology in Information Systems
Message-ID:

Please distribute. Apologies for multiple copies.

**** FOIS-2004 CALL FOR PAPERS ****

International Conference on Formal Ontology in Information Systems
http://www.fois.org
November 4-6, 2004, Torino (Italy)

Conference Description
----------------------

Just as ontology developed over the centuries as part of philosophy, so in recent years ontology has become intertwined with the development of the information sciences. Researchers in such areas as artificial intelligence, formal and computational linguistics, biomedical informatics, conceptual modeling, knowledge engineering and information retrieval have come to realize that a solid foundation for their research calls for serious work in ontology, understood as a general theory of the types of entities and relations that make up their respective domains of inquiry. In all these areas, attention has started to focus on the content of information rather than on just the formats and languages in terms of which information is represented. The clearest example of this development is provided by the many initiatives growing up around the project of the Semantic Web.
And as the need for integrating research in these different fields arises, so does the realization that strong principles for building well-founded ontologies might provide significant advantages over ad hoc, case-based solutions. The tools of Formal Ontology address precisely these needs, but a real effort is required in order to apply such philosophical tools to the domain of Information Systems. Reciprocally, research in the information sciences raises specific ontological questions which call for further philosophical investigation.

The purpose of FOIS is to provide a forum for genuine interdisciplinary exchange in the spirit of a unified ontological analysis effort. Although the primary focus of the conference is on theoretical issues, methodological proposals as well as papers dealing with concrete applications from a well-founded theoretical perspective are welcome.

Invited Speakers
-----------------

Peter Gärdenfors, Lund University Cognitive Science, Sweden
Amie Thomasson, Department of Philosophy, University of Miami, USA

Deadlines and Further Information
---------------------------------

Abstracts: May 3, 2004
Final submissions: May 7, 2004
Acceptance Notification: June 25, 2004
Submission of camera-ready paper: July 30, 2004

Proceedings will be published by IOS Press and available at the conference.

Submission is a two-step procedure: first abstracts, then full papers. Submitted papers must not exceed 5000 words (including bibliography). Abstracts should be less than 300 words. Electronic submission via the website is strongly preferred; if unavailable, submission via email or postal mail is possible. For details see http://www.fois.org or contact one of the program chairs.

Chairs
------

Conference Chair:
Nicola Guarino (ISTC-CNR, Trento, Italy) nicola.guarino at loa-cnr.it

Program Chairs:
Achille Varzi (Columbia University, New York, USA) achille.varzi at columbia.edu
Laure Vieu (IRIT-CNRS, Toulouse, France) laure.vieu at irit.fr

Local Chairs:
Maurizio Ferraris (University of Torino, Italy) ferraris at cisi.unito.it
Leonardo Lesmo (University of Torino, Italy) lesmo at di.unito.it

Topics
------

We seek high-quality papers on a wide range of topics. While authors may focus on fairly narrow and specific issues, all papers should emphasize the relevance of the work described to formal ontology and to information systems. Papers that completely ignore one or the other of these aspects will be considered as lying outside the scope of the meeting.

Topic areas of particular interest to the conference are:

Foundational Issues
- Kinds of entity: particulars vs. universals, continuants vs. occurrents, abstracta vs. concreta, dependent vs. independent, natural vs. artificial
- Formal relations: parthood, identity, connection, dependence, constitution, subsumption, instantiation
- Vagueness and granularity
- Identity and change
- Formal comparison among ontologies
- Ontology of physical reality (matter, space, time, motion, ...)
- Ontology of biological reality (genes, proteins, cells, organisms, ...)
- Ontology of mental reality and agency (beliefs, intentions and other mental attitudes; emotions, ...)
- Ontology of social reality (institutions, organizations, norms, social relationships, artistic expressions, ...)
- Ontology of the information society (information, communication, meaning negotiation, ...)
- Ontology and Natural Language Semantics, Ontology and Cognition

Methodologies and Applications
- Top-level vs.
application ontologies
- Ontology integration and alignment; role of reference ontologies
- Ontology-driven information systems design
- Requirements engineering
- Knowledge engineering
- Knowledge management and organization
- Knowledge representation; Qualitative modeling
- Computational lexica; Terminology
- Information retrieval; Question-answering
- Semantic web; Web services; Grid computing
- Domain-specific ontologies, especially for: Linguistics, Geography, Law, Library science, Biomedical science, E-business, Enterprise integration, ...

Programme Committee (to be confirmed)
--------------------

Bill Andersen, OntologyWorks, USA
Nicholas Asher, Dept of Philosophy, University of Texas at Austin, USA
Nathalie Aussenac-Gilles, Research Institute for Computer Science, CNRS, Toulouse, France
John Bateman, Dept of Applied English Linguistics, University of Bremen, Germany
Brandon Bennett, Division of Artificial Intelligence, University of Leeds, UK
Andrea Bottani, Dept of Philosophy, University of Bergamo, Italy
Joost Breuker, Dept of Computer Science & Law, University of Amsterdam, The Netherlands
Roberto Casati, Jean Nicod Institute, CNRS, Paris, France
Werner Ceusters, Language & Computing, Belgium
Tony Cohn, Division of Artificial Intelligence, University of Leeds, UK
Robert Colomb, School of Computer Science and Electrical Engineering, University of Queensland, Australia
Ernest Davis, Dept of Computer Science, New York University, USA
Randall Dipert, Dept of Philosophy, State University of New York, Buffalo, USA
Martin Dörr, Institute of Computer Science, FORTH, Heraklion, Greece
Carola Eschenbach, Dept for Informatics, University of Hamburg, Germany
Jérôme Euzenat, INRIA Rhône-Alpes, Grenoble, France
Christiane Fellbaum, Cognitive Science Laboratory, Princeton University, USA & Berlin Brandenburg Academy of Sciences and Humanities, Berlin, Germany
Maurizio Ferraris, Dept of Philosophy, University of Torino, Italy
Antony Galton, School of Engineering and
Computer Science, University of Exeter, UK
Aldo Gangemi, Institute of Cognitive Sciences and Technologies, CNR, Rome, Italy
Peter Gärdenfors, Lund University Cognitive Science, Sweden
Pierdaniele Giaretta, Dept of Philosophy, University of Padova, Italy
Michael Gruninger, Institute for Systems Research, University of Maryland College Park, USA & National Institute for Standards and Technology, USA
Nicola Guarino, Institute of Cognitive Sciences and Technologies, CNR, Trento, Italy
Patrick J. Hayes, Institute for Human and Machine Cognition, University of West Florida, USA
Heinrich Herre, Institute of Informatics, University of Leipzig, Germany
Jacques Jayez, ENS-Humanities, Lyon, France
Ingvar Johansson, Institute for Formal Ontology and Medical Information Science, University of Leipzig, Germany
Hannu Kangassalo, Dept of Computer and Information Sciences, University of Tampere, Finland
Fritz Lehmann, USA
Leonardo Lesmo, Dept of Computer Science, University of Torino, Italy
Bernardo Magnini, Centre for Scientific and Technological Research, ITC, Trento, Italy
David Mark, Dept of Geography, State University of New York, Buffalo, USA
William E.
McCarthy, Department of Accounting, Michigan State University, USA
Robert Meersman, Dept of Computer Science, Free University of Brussels, Belgium
Chris Menzel, Dept of Philosophy, Texas A&M University, USA
Friederike Moltmann, Dept of Philosophy, Stirling University, UK
Philippe Muller, Research Institute for Computer Science, University of Toulouse III, France
John Mylopoulos, Dept of Computer Science, University of Toronto, Canada
Sergei Nirenburg, Dept of Computer Science & Electrical Engineering, University of Maryland Baltimore County, USA
Leo Obrst, MITRE, USA
Massimo Poesio, Dept of Computer Science, University of Essex, UK
Ian Pratt-Hartmann, Dept of Computer Science, University of Manchester, UK
James Pustejovsky, Dept of Computer Science, Brandeis University, USA
Steffen Schulze-Kremer, German Resource Center for Genome Research, Berlin, Germany
Peter Simons, School of Philosophy, University of Leeds, UK
Barry Smith, Dept of Philosophy, State University of New York, Buffalo, USA & Institute for Formal Ontology and Medical Information Science, University of Leipzig, Germany
John Sowa, USA
Veda Storey, Dept of Computer Information Systems, Georgia State University, USA
Mike Uschold, The Boeing Company, USA
Achille Varzi, Dept of Philosophy, Columbia University, USA
Laure Vieu, Research Institute for Computer Science, CNRS, Toulouse, France
Yair Wand, Management Information Systems Division, University of British Columbia, Vancouver, Canada
Chris Welty, IBM Watson Research Center, USA
Roel Wieringa, Computer Science Department, University of Twente, The Netherlands

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 21 17:09:56 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 21 Jan 2004 18:09:56 +0100
Subject: Publications: BULAG no28: Modelling, Systemics, Translatability
Message-ID:

Just published

Bulag (Bulletin de linguistique appliquée et générale) n°28, annual journal
"Modélisation, systémique, traductibilité"
Modelling, Systemics, Translatability

Coordinated by Sylviane Cardey
Published by PUFC (Presses Universitaires de Franche-Comté)

Information and orders:
http://tesniere.univ-fcomte.fr/bulag/bulag28.htm
http://tesniere.univ-fcomte.fr/bulag/numero28.pdf

Presses Universitaires de Franche-Comté (PUFC)
UFR des Sciences Médicales et Pharmaceutiques
Place St. Jacques
25030 BESANCON cedex
France

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 28 10:20:07 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 28 Jan 2004 11:20:07 +0100
Subject: Publications: Alain Polguère (2003) Lexicologie et sémantique lexicale. Notions fondamentales
Message-ID:

Just published:

Alain Polguère (2003) Lexicologie et sémantique lexicale. Notions fondamentales. "Paramètres" series, Montréal, Les Presses de l'Université de Montréal, 264 pp. (ISBN: 2-7606-1860-9)

For more about the book, see the publisher's site at the address below:
http://www.pum.umontreal.ca/livres/fiches/2-7606-1860-9.html

Distribution in Canada: Éditions Fides
Distribution in France, Belgium and Switzerland: Sofédis
Distribution in other countries: Exportlivre

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 28 10:20:18 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 28 Jan 2004 11:20:18 +0100
Subject: Jobs: Sinequa: development of a search engine for Greek
Message-ID:

Sinequa is a company that develops, among other things, a search engine that relies heavily on linguistic processing. See http://www.sinequa.com for more information.

We are looking for someone to develop the engine for Greek. The required skills are:
- Strong skills in linguistics or terminology (maîtrise or DESS level);
- Computer proficiency (office tools, Internet) required;
- Script programming (Perl or other) highly valued;
- Perfect command of Greek.

The work will consist of developing morpho-syntactic lexicons, tagged corpora, analysis automata (named-entity recognition, etc.), and similar resources.

The contract will run for 3 to 6 months.

If you are interested, please send your CV by email to loupy at sinequa.com

Regards
--
Claude de Loupy - Head of Research
Sinequa - http://www.sinequa.com
email: loupy at sinequa.com
tel.: 33 1 49 87 06 00 - fax: 33 1 49 87 06 01

From alexis.nasr at LINGUIST.JUSSIEU.FR Wed Jan 28 10:20:19 2004
From: alexis.nasr at LINGUIST.JUSSIEU.FR (alexis.nasr at LINGUIST.JUSSIEU.FR)
Date: Wed, 28 Jan 2004 11:20:19 +0100
Subject: Conf: JADT 2004
Message-ID:

JADT 2004 - Call for participation
7th International Conference on the Statistical Analysis of Textual Data
March 10-12 2004 - Louvain-la-Neuve, Belgium
www.jadt.org
***************************************************************************

Following Barcelona (1990), Montpellier (1993), Rome (1995), Nice (1998), Lausanne (2000) and Saint-Malo (2002), the 7th International Conference on the Statistical Analysis of Textual Data will be held in Louvain-la-Neuve (Belgium), on March 10-12, 2004.

This biennial conference, which has steadily gained in importance since its first occurrence, is open to all scholars working in the vast field of textual data analysis, ranging from lexicography to the analysis of political discourse, from documentary research to marketing research, from computational linguistics to sociolinguistics, from the processing of data to content analysis. After the success of the previous meetings, the three-day conference in Belgium will continue to provide a workshop-style forum through technical paper sessions, invited talks, and panel discussions.

1/ PROGRAM

The JADT 2004 program features two keynote speakers:
- Douglas BIBER, Northern Arizona University, "A corpus analysis of vocabulary-based discourse unit types in conversation"
- Claudia LEACOCK, Educational Testing Service (ETS, Princeton), "Statistical Analysis of Text for Educational Measurement"

A first version of the program is available online: http://www.jadt.org/program.html

2/ REGISTRATION

For the registration forms and more information, see the conference Web site (www.jadt.org). Please note that we offer reduced registration fees until January 31st.

3/ IBIS FELLOWSHIPS

We are pleased to announce the names of the IBIS-JADT 2004 grantees. Six IBIS Fellowships to take part in JADT 2004 have been awarded:

Barbu Ana-Maria (Romania)
Research Institute for Artificial Intelligence of the Romanian Academy (RACAI)
"Simple Linguistic Methods for Improving a Word Alignment Algorithm"

Edel Greevy (Ireland)
Dublin City University
"Text Categorisation of Racist Texts Using a Support Vector Machine"

Forest Dominic (Quebec)
UQAM, Laboratoire LANCI
"Classification et catégorisation automatiques : application à l'analyse thématique des données textuelles"

Jalam Radwan (France)
Université Lumière Lyon 2
"Cadre pour la catégorisation de textes multilingues"

Misuraca Michelangelo (Italy)
University Federico II of Naples
"Relazioni non Simmetriche tra Corpora" (Grassia, Misuraca, Scepi)

M. Bagavandas and G.
Manimannam (India)
Madras Christian College, Department of Statistics, Tambaram
"Quantification Of Stylistic Traits: A Statistical Approach"

--
Cedrick Fairon
Director of CENTAL
Centre de traitement automatique du langage
Université de Louvain
Place Blaise Pascal, 1
1348 Louvain-la-Neuve
Belgium
=======================================
**** JADT 2004 in Louvain-la-Neuve ****
10-12 March 2004
7th International Conference on the statistical analysis of textual data
7es Journées internationales d'analyse statistique des données textuelles
http://www.jadt.org
Visit our web sites:
http://cental.fltr.ucl.ac.be
http://glossa.fltr.ucl.ac.be
=======================================