Arabic-L:LING:Post-doc in multilingual text processing in Italy
Dilworth Parkinson
dilworth_parkinson at BYU.EDU
Wed Apr 16 18:23:15 UTC 2008
------------------------------------------------------------------------
Arabic-L: Wed 16 Apr 2008
Moderator: Dilworth Parkinson <dilworth_parkinson at byu.edu>
[To post messages to the list, send them to arabic-l at byu.edu]
[To unsubscribe, send message from same address you subscribed from to
listserv at byu.edu with first line reading:
unsubscribe arabic-l ]
-------------------------Directory------------------------------------
1) Subject:Post-doc in multilingual text processing in Italy
-------------------------Messages-----------------------------------
1)
Date: 16 Apr 2008
From:Ralf Steinberger <ralf.steinberger at jrc.it>
Subject:Post-doc in multilingual text processing in Italy
Application deadline is 18 April midnight CET! Please excuse the late
posting!
The European Commission’s Joint Research Centre (JRC <http://ec.europa.eu/dgs/jrc/index.cfm
> ) in Ispra, at the Lago Maggiore in Northern Italy has an opening
for a post-doc position in multilingual text analysis (see below). The
JRC is running several public news aggregation and analysis web
portals (see http://emm.jrc.it/overview.html) and provides a number of
services to a wide range of international customers. A strong focus in
the JRC’s work is on multilinguality and on tools to provide cross-
lingual information access.
Applications (3-page <http://ipsc.jrc.ec.europa.eu/job/appl_form_grantholders.xls
> application form and an updated <http://ipsc.jrc.ec.europa.eu/job/EU_CV_template_EN.doc
> CV in English) should be submitted by e-mail to the following e-
mail address: JRC-IPSC-GRANTHOLDERS at ec.europa.eu .
According to the Vademecum for grantholders (see http://ipsc.jrc.ec.europa.eu/showdoc.php?doc=job/VademecumforGholders2008.pdf)
, the remuneration is about 54,000 Euro/year plus allowances.
------------------------------------------------------------------------------
Automatic Multilingual Text Analysis
CALL REFERENCE NO. : IPSC/G02/5
Category: Post-Doc researcher (category 30)
Duration: 36 months
Action: EMM
Remuneration: see Vademecum for grantholders <http://ipsc.jrc.ec.europa.eu/showdoc.php?doc=job/VademecumforGholders2008.pdf
>
URL generic call: http://ipsc.jrc.ec.europa.eu/jobs.php?id=8
URL specific post: <http://ipsc.jrc.ec.europa.eu/showgrant.php?id=7> http://ipsc.jrc.ec.europa.eu/showgrant.php?id=7
In the Web Mining and Intelligence (EMM) activity, the person will be
working on research activities on automatic multilingual text
analysis. Typical examples of subjects being studied currently are
automatic event extraction, automatic entity recognition and cross-
language clustering.
These techniques are already being deployed in several operational
applications and part of the work would be in support of these
applications. The on-going research has a strong focus on
applicability in a multilingual environment
A new area of research is the automatic generation of summaries from
multi-document texts, in particular from news article clusters. The
work is highly practical and goal oriented. Research results are
expected to be used operationally. The system within which the results
will be deployed is implemented in Java as a set of servlets in Tomcat.
University degree in computer science or computational linguistics.
Doctoral degree in similar discipline, or equivalent work experience
of 5 years. Good programming skills, preferably in Java are therefore
recommended. The working language of the action is English and strong
English language skills are required. Given the multilingual aspect of
the work, active knowledge of at least one other language and an
understanding of at least another one is also required.
Good knowledge of Arabic would be seen as an asset.
Ralf Steinberger ( <mailto:Ralf.Steinberger at jrc.it> Ralf.Steinberger at jrc.it
)
European Commission - Joint Research Centre (JRC)
IPSC - SeS - Language Technology
URL: Applications: http://emm.jrc.it/overview.html
URL: The science behind them: <http://langtech.jrc.it/> http://langtech.jrc.it
.
The JRC’s Language Technology group specialises in the development of
highly multilingual text analysis tools and in cross-lingual
applications. Many applications are accessible online, e.g.:
* <http://press.jrc.it/NewsExplorer/> NewsExplorer:
multilingual news aggregation and analysis (19 languages); allows to
navigate the news over time and across languages; trend analysis;
collects information about people from the news; social network
detection.
* <http://press.jrc.it/> NewsBrief: breaking news detection and
display of the very latest thematic news from around the world; email
alerting (22+ languages).
* <http://medusa.jrc.it/> MedISys Medical Information System:
latest health-related news from around the world according to themes
and diseases (22+ languages).
* EMM-Labs <http://emm-labs.jrc.it:8080/> : Latest developments;
social networks; live people-in-the-news; country and theme fact
sheets; maps showing violent events world-wide.
JRC-Acquis Multilingual Parallel Corpus (Version 3)
* Freely available for research purposes.
* 22 languages: Bulgarian, Czech, Danish, German, Greek,
English, Spanish, Estonian, Finnish, French, Hungarian, Italian,
Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian,
Slovak, Slovene and Swedish.
* Altogether over 1 Billion words.
* Sentence alignment for 231 language pairs.
* For more information and download, see <http://langtech.jrc.it/JRC-Acquis.html
> http://langtech.jrc.it/JRC-Acquis.html.
DGT-Translation Memory
* Freely available for research purposes.
* Aligned translation units for 231 language pairs.
* Alignment manually verified.
* For more information and download, see http://langtech.jrc.it/DGT-TM.html
.
--------------------------------------------------------------------------
End of Arabic-L: 16 Apr 2008
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/arabic-l/attachments/20080416/66e61c08/attachment.htm>
More information about the Arabic-l
mailing list