[Corpora-List] PAPILLON 2004: 5th Workshop on Multilingual Lexical Databases, Grenoble, France

Gilles Serasset Gilles.Serasset at imag.fr
Mon Apr 19 10:15:02 UTC 2004


PAPILLON-2004 Workshop
on Multilingual Lexical Databases

Grenoble, August 30th-September  1st, 2004

immediatly  after COLING 2004

Venue: IMAG Institute, Grenoble, France

  Submission deadline May, 21st 2004

Overview

Multilingual lexical databases are (i) databases for (ii)  structured 
lexical data which can be used either (iii) by  humans e.g. to define 
their own dictionaries or (iv) by  natural language processing (NLP) 
applications. Such  databases are now felt indispensable in language 
science  with the advances of language engineering. Like databases in  
genomics, multilingual lexical databases need rich  annotations; they 
are complex, and they evolve as time goes by.

  The Papillon project is a Web collaborative project with the  aim to 
build an open source multilingual lexical database  for several 
languages (French, German, English, Japanese,  Lao, Malay, Thai and 
Vietnamese). The provided lexical  information has to be rich enough 
for a human to be able to  query and generate his/her own tailored 
dictionary (e.g. for  language learning or for translation work) and 
for NLP  applications to be able to extract a whole range of data or  
to directly exploit some particular data.

  The 2004 Papillon workshop, the fifth in a series of  workshops 
organized every year by the Papillon members, will  aim at identifying 
problems relevant to the  multilingual-lexical-database community. The 
workshop aims  to promote exchanges between practitioners from several  
fields and is thus open to anybody working in a domain  pertaining to 
lexical databases such as: databases,  man-machine interface for 
dictionaries, data annotation,  XML, standardization of dictionaries or 
lexical data;  lexicography, translation, computational linguistics, 
etc.  This workshop is open and particularly encourages  submissions by 
researchers from outside the Papillon project.

  The "Papillon 2004"  workshop particularly welcomes general 
submissions on the  following topics:

  (i) Aspect: databases
	• 	Macrostructures for dictionaries and general architecture
	• 	Databases and lexical databases: differences and similarities
	• 	Query languages (XML, XQL, etc.)
	• 	Fast indexes and archiving techniques
	• 	Standards for lexical databases

(ii) Aspect: lexical data
	• 	Microstructures of lexical entries and linguistic specifications
	• 	Reuse or automatic production of dictionaries
	• 	Automatic feeding of lexical databases
	• 	Internationalization/localization of multilingual  lexical data
	• 	Integration of multimodal information and meta-information
	• 	Interaction between lexical data and linguistic data  (corpora, 
examples, illustrations, etc.)

(iii) Aspect: human use
	• 	New ways of dictionary lookup
	• 	Interfaces for querying dictionaries, user-friendliness
	• 	Legal aspects of dictionaries (copyright, etc.)
	• 	Adequation/inadequation of existing distionnary to  certain users

(iv) Aspect: NLP use
	• 	NLP systems using lexical databases
	• 	Direct use of dictionaries in NLP systems
	• 	Building lexical data sets for specific applications
	• 	Interchangeability of lexical data
	• 	Neutrality of NLP systems with regard to lexical data

The workshop will also accept  submission on aspects more specific to 
the Papillon project

  Meaning-text theory and lexicography
	• 	Linguistic problems
	• 	Composing semantic formulae for lexies
	• 	Practical problems in indexing specifing entries/lexies
	• 	Examples of lexies or axies

Proposals for collaboration with the Papillon project
	• 	Query interfaces
	• 	Automatic Indexing
	• 	Contribution to the paradise, the purgatory, or the limbo

Summary/state of collaboration to the Papillon project
	• 	Contribution to the paradise, the purgatory, or the limbo
	• 	Attractiveness of the Papillon Web site

Collaborative work
	• 	Methods/services for lexical collaborative work
	• 	Motivations for voluntary contributers
	• 	Recognition and traceability of contributed work

Contributions should not exceed 10 pages.

  Program

The program will have a varied format, designed to maximize  
cross-fertilization among the various specialties, and to  allow 
extended open discussion. Components of the program  will include:
	• 	 Tutorials on relevant models from linguistics, databases  or 
annotation, e.g. the structure of lexical entries and  semi-structured 
query languages;
	• 	Panel sessions on annotated text and lexicons (and  possibly 
others);
	• 	Paper presentations reporting new research;
	• 	Demonstrations of systems for creating and/or managing  lexical 
data.

IMPORTANT DATES
	• 	Submission Deadline: 21 mai 2004
	• 	Notificationof Acceptance: 23 juin 2004
	• 	Camera Ready Papers: 16 juillet 2004

Call for Papers

Submissions will be sent after crompession with zip or  gzip by 
electronic mailto:papillon2004 at imag.fr  .

  They will consist in a full paper, of max. 10 pages  (inclusive of 
references, tables, figures and equations)  written in english in PDF 
format only. Do not forget  to include fonts in your PDF document if 
you use non-English  characters. You can check that your file is 
correct by disabling the "Use Local Font" option when viewing  you PDF 
document with Adobe Reader.

  For conversion from proprietary format into PDF, see e.g. the  CRI74  
or the CERN  conversion facilities.

  We strongly recommend you to use the Springer Verlag template  
available at the following address: 
http://www.springer.de/comp/lncs/authors.html

Local Organizing Committee

President: Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
	• 	 Aree Teeraparbseree, GETA-CLIPS-IMAG, Grenoble, France
	• 	Jean-Philippe Guilbaud, GETA-CLIPS-IMAG, Grenoble, France
	• 	...

Program Committee

President: Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
	• 	 Mr. Christian Boitet, GETA-CLIPS-IMAG, Grenoble, France
	• 	Mr. Francis Bond, NTT, Keihanna, Japan
	• 	Mr. Jim Breen, Université Monash, Australia
	• 	Mr. François Brown de Colstoun, INRIA, Rocquencourt, France
	• 	Mr. Kyo Kageura, NII, Tokyo, Japan
	• 	Ms. Asanee Kawtrakul, Kasetsart University, Bangkok, Thailand
	• 	Ms. Kyoko Kuroda, Shimane University, Japan
	• 	Mr. Mathieu Lafourcade, LIRMM, Montpellier, France
	• 	Mr. Francois Lareau, OLST, Université de Montréal,  Montreal, Canada
	• 	Mr. Yves Lepage, ATR, Keihanna, Japan
	• 	Mr. Mathieu Mangeot, UTMK, Penang, Malaysia
	• 	Mr. Emmanuel Planas, GETA-CLIPS-IMAG, Grenoble, France
	• 	Mr. Alain Polguère, OLST, Université de Montréal,  Montreal, France
	• 	Mr. Gilles Sérasset, GETA-CLIPS-IMAG, Grenoble, France
	• 	Mr. Enyakong Tang, UTMK, Penang, Malaysia
	• 	Mr. David Thevenin, NII, Tokyo, Japan
	• 	Mr. Takenobu Tokunaga, TITech, Tokyo, Japan
	• 	Ms. Mutsuko Tomokiyo, GETA-CLIPS-IMAG, Grenoble, France
	• 	Mr. Michael Zock, LIMSI, Orsay, France

Miscellaneous Information
	• 	Papillon project Web site: http://www.papillon-dictionary.org/
	• 	CLIPS: http://www-clips.imag.fr/
	• 	IMAG Institute: http://www.imag.fr/
	• 	Grenoble tourist information: 
http://www.grenoble-isere-tourisme.com/

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/enriched
Size: 7413 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20040419/060bf623/attachment-0001.bin>


More information about the Corpora mailing list