[Corpora-List] Unsupervised Morpheme Analysis -- Morpho Challenge 2007 Workshop: Call for participation
Mikko.Kurimo at tkk.fi
Thu Jul 26 13:46:13 UTC 2007
Call for Participation:
Unsupervised Morpheme Analysis -- Morpho Challenge 2007 Workshop
In conjunction with: CLEF 2007 (Cross-Language Evaluation Forum)
Budapest, Hungary, 19 September 2007
http://www.cis.hut.fi/morphochallenge2007/
Chair: Mikko Kurimo (Helsinki University of Technology)
Morpho Challenge 2007 is part of the EU Network of Excellence PASCAL
Challenge Program and is organized in collaboration with CLEF.
*** Topic of the Challenge and Workshop ***
The objective of the Challenge was to design a statistical machine
learning algorithm that discovers which morphemes (smallest
individually meaningful units of language) words consist of. Ideally,
these are basic vocabulary units suitable for different tasks, such as
text understanding, machine translation, information retrieval, and
statistical language modeling.
The scientific goals are:
* To learn of the phenomena underlying word construction in
natural languages
* To discover approaches suitable for a wide range of languages
* To advance machine learning methodology
Morpho Challenge 2007 is a follow-up to our previous Morpho Challenge
2005 (Unsupervised Segmentation of Words into Morphemes). The new task
is more general in that we are not necessarily looking for an explicit
segmentation of words this time, but a morpheme analysis of the word
forms in the data. (For instance, the English words "boot, boots,
foot, feet" might obtain the analyses "boot, boot + plural, foot, foot
+ plural", respectively.)
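The difference between a segmentation and a morpheme analysis can be illustrated with a small sketch. It is only an illustration of the example above, not the Challenge's data format: the dictionary layout, the label "plural" and the helper name shared_morphemes are assumptions made here.

```python
# Hypothetical analyses copied from the example in the text.
# Each word form maps to a list of morpheme labels, not to substrings.
ANALYSES = {
    "boot": ["boot"],
    "boots": ["boot", "plural"],
    "foot": ["foot"],
    "feet": ["foot", "plural"],
}


def shared_morphemes(word_a, word_b, analyses=ANALYSES):
    """Return the set of morpheme labels that two word forms have in common."""
    return set(analyses[word_a]) & set(analyses[word_b])


if __name__ == "__main__":
    for pair in [("foot", "feet"), ("boots", "feet"), ("boot", "foot")]:
        print(pair, sorted(shared_morphemes(*pair)))
```

Note that "feet" cannot be cut into the substrings "foot" and a plural suffix, yet under an analysis it still shares a morpheme with "foot" and another with "boots".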
*** Competitions ***
The submitted morpheme analyses were evaluated in two complementary ways:
* Competition 1: The proposed morpheme analyses were compared to
a linguistic "gold standard".
* Competition 2: Information retrieval (IR) experiments were
performed, where the words in the documents and queries were replaced
by their proposed morpheme representations. The search was then based
on morphemes instead of words.
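The idea behind Competition 2 can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the evaluation setup used in the Challenge: the function names, the whitespace tokenization and the plain set-overlap matching are all hypothetical, and a real IR experiment would use a proper ranking model.

```python
def to_morphemes(text, analyses):
    """Replace each word by its morpheme labels; unknown words are kept as-is."""
    morphemes = []
    for word in text.lower().split():
        morphemes.extend(analyses.get(word, [word]))
    return morphemes


def matching_morphemes(query, document, analyses):
    """Return the morphemes that occur in both the query and the document."""
    return set(to_morphemes(query, analyses)) & set(to_morphemes(document, analyses))


if __name__ == "__main__":
    # Hypothetical analyses, following the "foot, feet" example.
    analyses = {"feet": ["foot", "plural"], "boots": ["boot", "plural"]}
    print(to_morphemes("Muddy boots", analyses))
    print(matching_morphemes("feet", "my left foot", analyses))
```

With a word-based index the query "feet" would not match a document containing only "foot"; after the replacement both contain the morpheme "foot".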
*** Workshop Schedule (subject to change) ***
09:00 Opening
09:10 Mikko Kurimo: "Unsupervised Morpheme Analysis -- Morpho
Challenge 2007: Introduction and Overview"
09:30 Mikko Kurimo, Mathias Creutz and Matti Varjokallio: "Competition
1: Morpheme Analysis: Evaluation and Results. A simple reference
method using Morfessor"
09:50 Mikko Kurimo and Ville Turunen: "Competition 2: Information
Retrieval using the Morpheme Analysis results: Evaluation and Results"
10:20 Paul McNamee: "Applying ngrams and morpheme analysis in IR"
10:40 Discussion
11:00 Break
11:30 Delphine Bernhard: "Simple Morpheme Labelling in Unsupervised
Morpheme Analysis"
11:50 Stefan Bordag: "Unsupervised and Knowledge-free Morpheme
Segmentation and Analysis"
12:10 Christian Monson: "ParaMor: Finding Paradigms across Morphology"
12:30 Discussion
12:50 Conclusion
*** Program Committee ***
Levent Arslan, Boğaziçi University
Eric Atwell, University of Leeds
Samy Bengio, Google
Tolga Ciloglu, Middle East Technical University
Kadri Hacioglu, Colorado University
Colin de la Higuera, Jean Monnet University, Saint-Etienne
Chun Yu Kit, City University of Hong Kong
Dietrich Klakow, Saarland University
James Martin, University of Colorado at Boulder
Jan Nouza, Technical University of Liberec
Erkki Oja, Helsinki University of Technology
Murat Saraçlar, Boğaziçi University
Richard Sproat, University of Illinois, Urbana-Champaign
Richard Wicentowski, Swarthmore College
*** Further Information ***
http://www.cis.hut.fi/morphochallenge2007/