16.800, Support: Comp Ling: PhD Student, University of Edinburgh
LINGUIST List
linguist at linguistlist.org
Wed Mar 16 22:08:59 UTC 2005
LINGUIST List: Vol-16-800. Wed Mar 16 2005. ISSN: 1068 - 4875.
Subject: 16.800, Support: Comp Ling: PhD Student, University of Edinburgh
Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews (reviews at linguistlist.org)
Sheila Collberg, U of Arizona
Terry Langendoen, U of Arizona
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Jessica Boynton <jessica at linguistlist.org>
================================================================
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
===========================Directory==============================
1)
Date: 16-Mar-2005
From: Mirella Lapata < mlap at inf.ed.ac.uk >
Subject: Computational Linguistics: PhD Student, University of Edinburgh, United Kingdom
-------------------------Message 1 ----------------------------------
Date: Wed, 16 Mar 2005 17:02:50
From: Mirella Lapata < mlap at inf.ed.ac.uk >
Subject: Computational Linguistics: PhD Student, University of Edinburgh, United Kingdom
University or Organization: University of Edinburgh
Job Rank: PhD
School of Informatics, University of Edinburgh
The Institute of Communicating and Collaborative Systems (ICCS) within
the Division of Informatics and the Human Communication Research
Centre (HCRC) invites applications for a three-year EPSRC studentship
award to commence in September 2005. The successful applicant will work
on a project aiming to devise unsupervised models for word sense
disambiguation. A brief summary of the aims of this project is given
below.
-----------------------------------------------------------------------
Graphical Models for Word Sense Disambiguation
The most accurate techniques for word sense disambiguation (WSD) to
date are those which are trained on text in which each word has been
manually annotated with its intended sense. A major shortcoming of
these methods, though, is that accuracy is strongly correlated with
the quantity of training data available, and this is in short supply
because its production is very labour intensive. For many words the
distribution of their senses is highly skewed and WSD systems work
best when they take the most frequent sense into account. However, the
most frequent sense of a word is often not known, particularly in
domains (subject areas) in which no text has ever been manually
annotated.
This project is concerned with developing novel algorithms for
alleviating the data requirements for large scale WSD. More
specifically the project will involve:
o Exploring the use of probabilistic graphical models for word sense
disambiguation. Graphical models are a powerful modeling framework
that is well-suited for characterizing and studying the interactions
among varied information sources, thus allowing to represent
concurrently many aspects of the WSD problem.
o devising sense ranking models for structured (e.g., WordNet) and
unstructured (e.g., dictionary definitions) sense inventories.
o Demonstrate the benefit of unsupervised WSD in application to
Question Answering.
------------------------------------------------------------------------
The EPSRC baseline rate of maintenance is currently approx. £12.000
and the studentship will also pay the three years' tuition fees at
home/EU rates. Applicants should have a good honours degree or
equivalent in Computer Science or Computational
Linguistics. Programming skills, preferably in Perl, Java, C or C++,
are essential. Familiarity with statistical NLP, machine learning
methods and corpus processing is an advantage.
------------------------------------------------------------------------
The project will be conducted in collaboration with the Natural
Language and Computational Linguistics (NLCL) group at the University
of Sussex (see http://www.informatics.susx.ac.uk/research/nlp/). ICCS
and HCRC have close research links with a number of other academic
institutions (e.g., Saarland University, DFKI, Stanford University)
and companies from which the student will benefit.
------------------------------------------------------------------------
For further information about the project please e-mail Dr. Mirella
Lapata (mlap at inf.ed.ac.uk). Application forms and details of how to
apply are on-line at
http://www.informatics.ed.ac.uk/prospectus/graduate/research.html.
PLEASE MARK 'Graphical Models for Word Sense Disambiguation' ON THE
APPLICATION.
Address for Applications:
College of Science & Engineering, The University of Edinburgh
The Weir Building, The King's Buildings
Edinburgh EH9 3JY
United Kingdom
Contact Information:
Dr. Mirella Lapata
mlap at inf.ed.ac.uk
+44-131-650-4413
+44-131-650-6626
- - - - - -
United Kingdom
Website: http://www.informatics.ed.ac.uk/prospectus/graduate/research.html
-----------------------------------------------------------
LINGUIST List: Vol-16-800
More information about the LINGUIST
mailing list