Appel: PAPILLON-2004 Workshop on Multilingual Lexical Databases

alexis.nasr at LINGUIST.JUSSIEU.FR alexis.nasr at LINGUIST.JUSSIEU.FR
Tue May 25 16:53:13 UTC 2004

*                 PAPILLON-2004 Workshop                  *
*           on Multilingual Lexical Databases             *
*                                                         *
*       Grenoble, August 30th-September  1st, 2004        *
*                                                         *
*             immediatly  after COLING 2004               *
*                                                         *
*        Venue: IMAG Institute, Grenoble, France          *
*                                                         *
*     Submission deadline extended to June, 1st 2004      *


Multilingual lexical databases are (i) databases for (ii) structured
lexical data which can be used either (iii) by humans e.g. to define
their own dictionaries or (iv) by natural language processing (NLP)
applications. Such databases are now felt indispensable in language
science with the advances of language engineering. Like databases in
genomics, multilingual lexical databases need rich annotations; they
are complex, and they evolve as time goes by.

The Papillon project is a Web collaborative project with the aim to
build an open source multilingual lexical database for several
languages (French, German, English, Japanese, Lao, Malay, Thai and
Vietnamese). The provided lexical information has to be rich enough
for a human to be able to query and generate his/her own tailored
dictionary (e.g. for language learning or for translation work) and
for NLP applications to be able to extract a whole range of data or to
directly exploit some particular data.

The 2004 Papillon workshop, the fifth in a series of workshops
organized every year by the Papillon members, will aim at identifying
problems relevant to the multilingual-lexical-database community. The
workshop aims to promote exchanges between practitioners from several
fields and is thus open to anybody working in a domain pertaining to
lexical databases such as: databases, man-machine interface for
dictionaries, data annotation, XML, standardization of dictionaries or
lexical data; lexicography, translation, computational linguistics,
etc.  This workshop is open and particularly encourages submissions by
researchers from outside the Papillon project.

The "Papillon 2004" workshop particularly welcomes general submissions
on the following topics:

(i) Aspect: databases
- Macrostructures for dictionaries and general architecture
- Databases and lexical databases: differences and similarities
- Query languages (XML, XQL, etc.)
- Fast indexes and archiving techniques
- Standards for lexical databases

(ii) Aspect: lexical data
- Microstructures of lexical entries and linguistic specifications
- Reuse or automatic production of dictionaries
- Automatic feeding of lexical databases
- Internationalization/localization of multilingual  lexical data
- Integration of multimodal information and meta-information
- Interaction between lexical data and linguistic data (corpora,
   examples, illustrations, etc.)

(iii) Aspect: human use
- New ways of dictionary lookup
- Interfaces for querying dictionaries, user-friendliness
- Legal aspects of dictionaries (copyright, etc.)
- Adequation/inadequation of existing distionnary to  certain users

(iv) Aspect: NLP use
- NLP systems using lexical databases
- Direct use of dictionaries in NLP systems
- Building lexical data sets for specific applications
- Interchangeability of lexical data
- Neutrality of NLP systems with regard to lexical data

The workshop will also accept submission on aspects more specific to
the Papillon project

Meaning-text theory and lexicography
- Linguistic problems
- Composing semantic formulae for lexies
- Practical problems in indexing specifing entries/lexies
- Examples of lexies or axies

Proposals for collaboration with the Papillon project
- Query interfaces
- Automatic Indexing
- Contribution to the paradise, the purgatory, or the limbo

Summary/state of collaboration to the Papillon project
- Contribution to the paradise, the purgatory, or the limbo
- Attractiveness of the Papillon Web site

Collaborative work
- Methods/services for lexical collaborative work
- Motivations for voluntary contributers
- Recognition and traceability of contributed work

Contributions should not exceed 10 pages.


The program will have a varied format, designed to maximize
cross-fertilization among the various specialties, and to allow
extended open discussion. Components of the program will include:

- Tutorials on relevant models from linguistics, databases or
   annotation, e.g. the structure of lexical entries and semi-structured
   query languages;
- Panel sessions on annotated text and lexicons (and  possibly others);
- Paper presentations reporting new research;
- Demonstrations of systems for creating and/or managing  lexical data.

- Submission Deadline: 1st june 2004
- Notificationof Acceptance: 25 june 2004
- Camera Ready Papers: 16 juillet 2004


Submissions will be sent after crompession with zip or gzip by
electronic mailto:papillon2004 at .

They will consist in a full paper, of max. 10 pages (inclusive of
references, tables, figures and equations) written in english in PDF
format only. Do not forget to include fonts in your PDF document if
you use non-English characters. You can check that your file is
correct by disabling the "Use Local Font" option when viewing you PDF
document with Adobe Reader.

For conversion from proprietary format into PDF, see e.g. the CRI74 or
the CERN conversion facilities.

We strongly recommend you to use the Springer Verlag template
available at the following address:


President: Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
- Aree Teeraparbseree, GETA-CLIPS-IMAG, Grenoble, France
- Jean-Philippe Guilbaud, GETA-CLIPS-IMAG, Grenoble, France
- ...


President: Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
- Mr. Christian Boitet, GETA-CLIPS-IMAG, Grenoble, France
- Mr. Francis Bond, NTT, Keihanna, Japan
- Mr. Jim Breen, Université Monash, Australia
- Mr. François Brown de Colstoun, INRIA, Rocquencourt, France
- Mr. Kyo Kageura, NII, Tokyo, Japan
- Ms. Asanee Kawtrakul, Kasetsart University, Bangkok, Thailand
- Ms. Kyoko Kuroda, Shimane University, Japan
- Mr. Mathieu Lafourcade, LIRMM, Montpellier, France
- Mr. Francois Lareau, OLST, Université de Montréal,  Montreal, Canada
- Mr. Yves Lepage, ATR, Keihanna, Japan
- Mr. Mathieu Mangeot, UTMK, Penang, Malaysia
- Mr. Emmanuel Planas, GETA-CLIPS-IMAG, Grenoble, France
- Mr. Alain Polguère, OLST, Université de Montréal,  Montreal, France
- Mr. Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
- Mr. Enyakong Tang, UTMK, Penang, Malaysia
- Mr. David Thevenin, NII, Tokyo, Japan
- Mr. Takenobu Tokunaga, TITech, Tokyo, Japan
- Ms. Mutsuko Tomokiyo, GETA-CLIPS-IMAG, Grenoble, France
- Mr. Michael Zock, LIMSI, Orsay, France

Miscellaneous Information
- Papillon project Web site:
- IMAG Institute:
- Grenoble tourist information:

Message diffusé par la liste Langage Naturel <LN at>
Informations, abonnement :
English version          :
Archives                 :

La liste LN est parrainée par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhésion  :

More information about the Ln mailing list