[Corpora-List] PAPILLON 2004: 5th Workshop on Multilingual Lexical Databases, Grenoble, France
Gilles Serasset
Gilles.Serasset at imag.fr
Mon Apr 19 10:15:02 UTC 2004
PAPILLON-2004 Workshop
on Multilingual Lexical Databases
Grenoble, August 30th-September 1st, 2004
immediatly after COLING 2004
Venue: IMAG Institute, Grenoble, France
Submission deadline May, 21st 2004
Overview
Multilingual lexical databases are (i) databases for (ii) structured
lexical data which can be used either (iii) by humans e.g. to define
their own dictionaries or (iv) by natural language processing (NLP)
applications. Such databases are now felt indispensable in language
science with the advances of language engineering. Like databases in
genomics, multilingual lexical databases need rich annotations; they
are complex, and they evolve as time goes by.
The Papillon project is a Web collaborative project with the aim to
build an open source multilingual lexical database for several
languages (French, German, English, Japanese, Lao, Malay, Thai and
Vietnamese). The provided lexical information has to be rich enough
for a human to be able to query and generate his/her own tailored
dictionary (e.g. for language learning or for translation work) and
for NLP applications to be able to extract a whole range of data or
to directly exploit some particular data.
The 2004 Papillon workshop, the fifth in a series of workshops
organized every year by the Papillon members, will aim at identifying
problems relevant to the multilingual-lexical-database community. The
workshop aims to promote exchanges between practitioners from several
fields and is thus open to anybody working in a domain pertaining to
lexical databases such as: databases, man-machine interface for
dictionaries, data annotation, XML, standardization of dictionaries or
lexical data; lexicography, translation, computational linguistics,
etc. This workshop is open and particularly encourages submissions by
researchers from outside the Papillon project.
The "Papillon 2004" workshop particularly welcomes general
submissions on the following topics:
(i) Aspect: databases
• Macrostructures for dictionaries and general architecture
• Databases and lexical databases: differences and similarities
• Query languages (XML, XQL, etc.)
• Fast indexes and archiving techniques
• Standards for lexical databases
(ii) Aspect: lexical data
• Microstructures of lexical entries and linguistic specifications
• Reuse or automatic production of dictionaries
• Automatic feeding of lexical databases
• Internationalization/localization of multilingual lexical data
• Integration of multimodal information and meta-information
• Interaction between lexical data and linguistic data (corpora,
examples, illustrations, etc.)
(iii) Aspect: human use
• New ways of dictionary lookup
• Interfaces for querying dictionaries, user-friendliness
• Legal aspects of dictionaries (copyright, etc.)
• Adequation/inadequation of existing distionnary to certain users
(iv) Aspect: NLP use
• NLP systems using lexical databases
• Direct use of dictionaries in NLP systems
• Building lexical data sets for specific applications
• Interchangeability of lexical data
• Neutrality of NLP systems with regard to lexical data
The workshop will also accept submission on aspects more specific to
the Papillon project
Meaning-text theory and lexicography
• Linguistic problems
• Composing semantic formulae for lexies
• Practical problems in indexing specifing entries/lexies
• Examples of lexies or axies
Proposals for collaboration with the Papillon project
• Query interfaces
• Automatic Indexing
• Contribution to the paradise, the purgatory, or the limbo
Summary/state of collaboration to the Papillon project
• Contribution to the paradise, the purgatory, or the limbo
• Attractiveness of the Papillon Web site
Collaborative work
• Methods/services for lexical collaborative work
• Motivations for voluntary contributers
• Recognition and traceability of contributed work
Contributions should not exceed 10 pages.
Program
The program will have a varied format, designed to maximize
cross-fertilization among the various specialties, and to allow
extended open discussion. Components of the program will include:
• Tutorials on relevant models from linguistics, databases or
annotation, e.g. the structure of lexical entries and semi-structured
query languages;
• Panel sessions on annotated text and lexicons (and possibly
others);
• Paper presentations reporting new research;
• Demonstrations of systems for creating and/or managing lexical
data.
IMPORTANT DATES
• Submission Deadline: 21 mai 2004
• Notificationof Acceptance: 23 juin 2004
• Camera Ready Papers: 16 juillet 2004
Call for Papers
Submissions will be sent after crompession with zip or gzip by
electronic mailto:papillon2004 at imag.fr .
They will consist in a full paper, of max. 10 pages (inclusive of
references, tables, figures and equations) written in english in PDF
format only. Do not forget to include fonts in your PDF document if
you use non-English characters. You can check that your file is
correct by disabling the "Use Local Font" option when viewing you PDF
document with Adobe Reader.
For conversion from proprietary format into PDF, see e.g. the CRI74
or the CERN conversion facilities.
We strongly recommend you to use the Springer Verlag template
available at the following address:
http://www.springer.de/comp/lncs/authors.html
Local Organizing Committee
President: Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
• Aree Teeraparbseree, GETA-CLIPS-IMAG, Grenoble, France
• Jean-Philippe Guilbaud, GETA-CLIPS-IMAG, Grenoble, France
• ...
Program Committee
President: Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
• Mr. Christian Boitet, GETA-CLIPS-IMAG, Grenoble, France
• Mr. Francis Bond, NTT, Keihanna, Japan
• Mr. Jim Breen, Université Monash, Australia
• Mr. François Brown de Colstoun, INRIA, Rocquencourt, France
• Mr. Kyo Kageura, NII, Tokyo, Japan
• Ms. Asanee Kawtrakul, Kasetsart University, Bangkok, Thailand
• Ms. Kyoko Kuroda, Shimane University, Japan
• Mr. Mathieu Lafourcade, LIRMM, Montpellier, France
• Mr. Francois Lareau, OLST, Université de Montréal, Montreal, Canada
• Mr. Yves Lepage, ATR, Keihanna, Japan
• Mr. Mathieu Mangeot, UTMK, Penang, Malaysia
• Mr. Emmanuel Planas, GETA-CLIPS-IMAG, Grenoble, France
• Mr. Alain Polguère, OLST, Université de Montréal, Montreal, France
• Mr. Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
• Mr. Enyakong Tang, UTMK, Penang, Malaysia
• Mr. David Thevenin, NII, Tokyo, Japan
• Mr. Takenobu Tokunaga, TITech, Tokyo, Japan
• Ms. Mutsuko Tomokiyo, GETA-CLIPS-IMAG, Grenoble, France
• Mr. Michael Zock, LIMSI, Orsay, France
Miscellaneous Information
• Papillon project Web site: http://www.papillon-dictionary.org/
• CLIPS: http://www-clips.imag.fr/
• IMAG Institute: http://www.imag.fr/
• Grenoble tourist information:
http://www.grenoble-isere-tourisme.com/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/enriched
Size: 7413 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20040419/060bf623/attachment-0001.bin>
More information about the Corpora
mailing list