[Corpora-List] PhD position in Web Information Extraction

Maarten de Rijke mdr at science.uva.nl
Tue Sep 26 04:21:31 UTC 2006


PhD position in Web Information Extraction
(fully funded, three years)

We are looking for a versatile, highly motivated PhD candidate to
work on information access in the domain of access to cultural
heritage.  The position is part of the MultiMATCH project
(Multilingual / Multimedia Access to Cultural Heritage), an
EU-funded project in which the University of Amsterdam (UvA) is a
partner.

* Tasks

The aim of the MultiMATCH project is to enable users to explore
and interact with online cultural heritage content, across media
types and language boundaries.  Part of the UvA's activities
within MultiMATCH will be to crawl information from cultural
heritage sites.  The PhD student will focus on text mining and
classification, using machine learning and language technology,
both to help the crawler remain focused and to discover relations
between crawled documents, across both languages and media.

* Requirements

The PhD candidate must hold a research-oriented Master's degree
(MSc/MA) or equivalent qualification, and have a strong
background or demonstrable interest in machine learning and/or
applied natural language processing.  Solid programming skills
are a requirement.  Industrial experience, or a track record of
project-based work, is a definite advantage.

* More information.

- For application details, see the official vacancy page at
http://www.uva.nl/vacatures/object.cfm/objectid=E5EE6010-B5A3-4C85-857228F8D6D3A1B9

- For a detailed description of the project, look at
http://staff.science.uva.nl/~mdr/Research/Projects/MultiMATCH/

- For informal inquiries, please contact Maarten de Rijke, email
mdr at science.uva.nl, telephone +31 20 525 5358/7561, Informatics
Institute, University of Amsterdam, or Jaap Kamps, telephone
+31 20 525 3011, Archive and Information Studies, University
of Amsterdam.


-- 
ISLA * University of Amsterdam * http://ilps.science.uva.nl
SIGIR 2007 * http://www.sigir2007.org
MoodViews * http://moodviews.com



More information about the Corpora mailing list