[Corpora-List] Re: learning affix rules from wordlist

xuri tang tangxuriyz at yahoo.com.cn
Thu May 11 03:10:42 UTC 2006


Hi, list members.
  Several weeks ago, I posted an inquiry about statistical learning of affix rules from a wordlist. Thanks to the kindness of Noah Smith of Johns Hopkins University, Eric Atwell of Leeds University, Peter Adolphs, Leonid Kontorovich of CMU and some others, I was able to obtain a list of articles and other relevant information in the field. My heartfelt gratitude goes to all of them.
Here is a summary:
  R. Wicentowski. "Multilingual Noise-Robust Supervised Morphological Analysis using the WordFrame Model." In Proceedings of Seventh Meeting of the ACL Special Interest Group on Computational Phonology (SIGPHON), pp. 70-77, 2004. 
  Sharon Goldwater and David McClosky. "Improving Statistical MT Through Morphological Analysis." In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Vancouver, 2005.
  Antal van den Bosch and Walter Daelemans. "Memory-based morphological analysis." In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL'99, University of Maryland, USA, June 20-26, 1999, pp. 285-292. ILK pub: ILK-9909
  Leonid Kontorovich et al. 2003. A Markov Model for the Acquisition of Morphological Structure. Available at http://reports-archive.adm.cs.cmu.edu/anon/2003/CMU-CS-03-147.pdf
  The PASCAL MorphoChallenge contest results at http://www.cis.hut.fi/morphochallenge2005/results.shtml
  The following is attributed to Peter Adolphs:
* Bosch & Daelemans (1999). A. van den Bosch & Walter Daelemans:
"Memory-Based Morphological Analysis". Proceedings of the 37th Annual
Meeting of the ACL. San Francisco/CA 1999: Morgan Kaufmann, 285-292.
* Cavar et al (2006). Ćavar, Damir; Rodriguez, Paul & Schrementi,
Giancarlo: "Unsupervised morphology induction for
part-of-speech-tagging". In: Penn Working Papers in Linguistics:
Proceedings of the 29th Annual Penn Linguistics Colloquium. Vol. 12.1.
2006. pp. 29-?2.
* Creutz (2003). Mathias Creutz: "Unsupervised Segmentation of Words
Using Prior Distributions of Morph Length and Frequency". Proceedings of
the 41st Annual Meeting of the Association for Computational
Linguistics, July 2003, pp. 280-287.
* Creutz & Lagus (2002). Mathias Creutz and Krista Lagus:
"Unsupervised Discovery of Morphemes". Morphological and Phonological
Learning: Proceedings of the 6th Workshop of the ACL Special Interest
Group in Computational Phonology (SIGPHON), Philadelphia, July 2002, pp.
21-30. Association for Computational Linguistics.
* Creutz & Lagus (2005a). Mathias Creutz and Krista Lagus:
"Unsupervised Morpheme Segmentation and Morphology Induction from Text
Corpora Using Morfessor 1.0". Publications in Computer and Information
Science, Report A81, Helsinki University of Technology, March 2005.
* Davies (2003). Mark Davies: "Annotation without lexicons: an
alternative to the standard bootstrapping approach". In: Dawn Archer,
Paul Rayson, Andrew Wilson and Tony McEnery (eds.): Proceedings of the
Corpus Linguistics 2003 conference. UCREL technical paper number 16.
UCREL, Lancaster University. pp. 583-590.
* Federici & Pirelli (1992). Stefano Federici & Vito Pirelli (1992):
"A Bootstrapping strategy for Lemmatisation: Learning Through Examples".
In: Kiefer et al (1992). pp. 123-135.
* Freitag (2005). Dayne Freitag: "Morphology Induction from Term
Clusters". Proceedings of the Ninth Conference on Computational Natural
Language Learning (CoNLL-2005), pp. 128-135. Ann Arbor, MI, June 2005.
* Goldsmith (2000). Goldsmith, John. "Linguistica: An Automatic
Morphological Analyzer". To appear in: John Boyle, Jung-Hyuck Lee, and
Arika Okrent: Papers from the 36th Meeting of the Chicago Linguistics
Society [CLS 36], Volume 1: The Main Session. 2000.
* Goldsmith (2001). Goldsmith, John: "Unsupervised learning of the
morphology of a natural language". In: Computational Linguistics Vol.
27, Nr. 2, 2001, pp. 153-198.
* Goldsmith et al (2005). Goldsmith, John; Hu, Yu; Matveeva, Irina &
Sprague, Colin. A heuristic for morpheme discovery based on string edit
distance. Technical report TR-2005-04, Department of Computer Science,
University of Chicago.
* Maxwell (2002). Mike Maxwell: Resources for Morphology Learning
and Evaluation. In: Gonzalez Rodriguez, Manuel; Suarez Araujo, Carmen
Paz (eds.): LREC 2002: Third International Conference on Language
Resources and Evaluation, Vol. III. Paris 2002: ELRA, 967-974.
* Novák et al (2003). Attila Novák, Viktor Nagy & Csaba Oravecz:
"Corpus assisted development of a Hungarian morphological analyser and
guesser". In: Dawn Archer, Paul Rayson, Andrew Wilson and Tony McEnery
(eds.): Proceedings of the Corpus Linguistics 2003 conference. UCREL
technical paper number 16. UCREL, Lancaster University. pp. 583-590.
* Novák et al (2004). Attila Novák, Viktor Nagy & Csaba Oravecz:
"Combining symbolic and statistical methods in morphological analysis
and unknown word guessing". In: Proceedings of LREC 2004, Lisbon, 2004.
* Oflazer et al (2001). Kemal Oflazer, Sergei Nirenburg, Marjorie
McShane: "Bootstrapping Morphological Analyzers by Combining Human
Elicitation and Machine Learning". Computational Linguistics 27.1, 2001,
59-86.
* Reichel & Weilhammer (2004). Uwe D. Reichel & Karl Weilhammer:
"Automated Morphological Segmentation and Evaluation". In: Proceedings
of LREC 2004, Lisbon, 2004.
* Stroppa & Yvon (2005). Nicolas Stroppa, François Yvon: "An
Analogical Learner for Morphological Analysis". In: Proceedings of the
Ninth Conference on Computational Natural Language Learning
(CoNLL-2005). Ann Arbor, Michigan, 2005: Association for Computational
Linguistics. pp. 120-127.
* Yarowsky & Wicentowski (2000). D. Yarowsky & R. Wicentowski:
"Minimally Supervised Morphological Analysis by Multimodal Alignment".
Proceedings of ACL-2000. San Francisco/CA 2000: Morgan Kaufmann, 207-216.

  Xuri Tang
   
  Wuhan University of Science and Engineering
  Wuhan, P.R. China

		