[Corpora-List] Grenoble: PAPILLON 2004 Workshop on Multilingual Lexical Databases
Gilles Serasset
Gilles.Serasset at imag.fr
Mon Jul 12 10:50:20 UTC 2004
**********************************************************************
CALL FOR PARTICIPATION TO
PAPILLON-2004 Workshop
on Multilingual Lexical Databases
Grenoble, August 30th-September 1st, 2004
immediatly after COLING 2004
Venue: IMAG Institute, Grenoble, France
**********************************************************************
Overview
--------
Multilingual lexical databases are (i) databases for (ii) structured
lexical data which can be used either (iii) by humans e.g. to define
their own dictionaries or (iv) by natural language processing (NLP)
applications. Such databases are now felt indispensable in language
science with the advances of language engineering. Like databases in
genomics, multilingual lexical databases need rich annotations; they
are complex, and they evolve as time goes by.
The Papillon project is a Web collaborative project with the aim to
build an open source multilingual lexical database for several
languages (French, German, English, Japanese, Lao, Malay, Thai and
Vietnamese). The provided lexical information has to be rich enough
for a human to be able to query and generate his/her own tailored
dictionary (e.g. for language learning or for translation work) and
for NLP applications to be able to extract a whole range of data or
to directly exploit some particular data.
The 2004 Papillon workshop, the fifth in a series of workshops
organized every year by the Papillon members, will aim at identifying
problems relevant to the multilingual-lexical-database community. The
workshop aims to promote exchanges between practitioners from several
fields and is thus open to anybody working in a domain pertaining to
lexical databases such as: databases, man-machine interface for
dictionaries, data annotation, XML, standardization of dictionaries or
lexical data; lexicography, translation, computational linguistics,
etc.
Tentative Program
-----------------
The program will have a varied format, designed to maximize
cross-fertilization among the various specialties, and to allow
extended open discussion. Components of the program will include:
- Tutorials on relevant models from linguistics, databases or
annotation, e.g. the structure of lexical entries and semi-structured
query languages;
- Panel sessions on annotated text and lexicons (and possibly
others);
- Paper presentations reporting new research;
- Demonstrations of systems for creating and/or managing lexical
data.
The following papers will be presented during the conference:
1. Refining Algorithm of Extracted Pattern Rule Set from Penn
TreeBank Corpus. Akira Adachi, and Takenori Makino
2. LC-STAR: XML-coded Phonetic Lexica and Bilingual Corpora for
Speech to Speech Translation. Folkert de Vriend, Nuria Castell, Jesus
Giménez, and Giulio Maltese
3. Low Cost Automated Conceptual Vector Generation from Mono and
Bilingual Resources. Mathieu Lafourcade, Frédéric Rodrigo, and Didier
Schwab
4. ITOLDU: Accessing to Vocabulary learning in a technical English
resource pooling environment. Valérie Bellynck, and John Kenwright
5. Ressource pooling for technical English learning via lexical
access. Valérie Bellynck, Christian Boitet, and John Kenwright
6. Electronic Data for the Description of Japanese Kanji - The
Analyses of Brush Strokes, Stroke Groups and their Position and the
Building of Path Data to Display and Search Kanji. Ulrich Apel, and
Julien Quint
7. Why have them work for peanuts, when it is so easy to provide
reward? One of the many possibilities of a dictionary converted into a
drill tutor. Michael Zock, and Julien Quint
8. Multilingual Dictionary of Lexicographical Terms. Svetlana
Krestova, and Peter J. Nürnberg
9. Expanding the Lexicon: the Search for Abbreviations. James Breen
10. The Design of (Psycho)Linguistically-motivated Lexicons for
Natural Language Processing. Ariani Di Filippo, Bento Carlos
Dias-da-Silva
11. Building a Specialised Multilingual Dictionary from General
Monolingual Dictionaries. Choy-Kim Chuah
12. A semantic representation of emotions based on a dialogue
corpus analysis. Mutsuko Tomokiyo, and Solange Hollard
13. An XML-based Tool for Tracking English Inclusions in German
Text. Beatrice Alex, and Claire Grover
14. Historical-Comparative Reconstruction and Multilingual Lexica.
James Kilbury, and Katina Bontcheva
15. Building an Ontology-based Multilingual Lexicon for Word Sense
Disambiguation in Machine Translation. Lian-Tze Lim and Tang Enya Kong
REGISTRATION
------------
Registration fee for the Papillon 2004 workshop is fixed at 50€.
This Registration fee includes:
- Attendance at all sessions
- Coffee and refreshments at official breaks
- Official diner Tuesday 31st of August
Registration fee will be payable in cash at the registration desk.
Please, pre-register to the conference by sending a mail to
papillon2004 at imag.fr with your name.
Venue
-----
Papillon 2004 workshop will take place at the "Maison Jean Kuntzman"
amphitheater of the IMAG institute on Grenoble university's campus
(Site de Saint Martin d'Hères et Gières).Directions to reach the
"Maison Jean Kuntzman" are available at
http://www.imag.fr/public/Documents/InfosPratiques/StMartin.html.
Miscellaneous Information
-------------------------
- Papillon project Web site: http://www.papillon-dictionary.org/
- CLIPS: http://www-clips.imag.fr/
- IMAG Institute: http://www.imag.fr/
- Grenoble tourist information:
http://www.grenoble-isere-tourisme.com/
Contact
For any enquiry, please contact the Papillon 2004 organizers at
papillon2004 at imag.fr.
More information about the Corpora
mailing list