[Corpora-List] Open trainee positions at the EC's Joint Research Centre in Italy (2nd call)
Ralf Steinberger
ralf.steinberger at jrc.it
Fri Aug 31 09:57:52 UTC 2007
Apologies for multiple postings!
The European Commission's Joint Research Centre (JRC) is advertising
scientific internship positions in a large variety of fields, including
three profiles related to text analysis.
<http://ipsc.jrc.cec.eu.int/traineegrant.php?id=5> IPSC/G02-5/2007 Web
Mining and Information Extraction
<http://ipsc.jrc.cec.eu.int/traineegrant.php?id=6> IPSC/G02-6/2007
Multilingual text analysis tools
<http://ipsc.jrc.cec.eu.int/traineegrant.php?id=7> IPSC/G02-7/2007
Political scientist
For the full call, see http://ipsc.jrc.cec.eu.int/jobs.php?id=7. Below, you
find information on the profile 'Multilingual text analysis tools'.
Location: Ispra, at the Lago Maggiore in Italy, 60 km West of Milan;
Host: European Commission - Joint Research Centre (JRC)
Position: traineeship / internship / stage / Praktikum / tirocino;
Starting date: late 2007 or any time in 2008;
Duration: 3 to 12 months;
Remuneration: 963 Euro per month + travel allowance;
Nationality: Applicants must have the nationality of an EU Member
State, of an
Associated EU Candidate Country, an Associated State or a
Developing Country;
Working language: English;
Activity: Language Technology, Web Technology; many other subject
areas
URL: <http://ipsc.jrc.cec.eu.int/jobs.php?id=7>
http://ipsc.jrc.cec.eu.int/jobs.php?id=7, <http://langtech.jrc.it/>
http://langtech.jrc.it/, <http://www.jrc.it/> http://www.jrc.it/;
Deadline: Open call. First cut-off date: Tuesday 14 September 2007
Contact: JRC-IPSC-STAGE AT ec.europa.eu
The European Commission's Joint Research Centre in Italy is seeking students
or recent graduates to spend an internship with our motivated and successful
multinational team of scientists and developers producing concrete and
widely used applications. Successful applicants will want to produce
hands-on results and to work in a team. The trainees will learn about our
multilingual text analysis tools (covering between 19 and 32 languages) and
their integration into complex and highly used web portals: our news
analysis pages are visited with up to 1.2 Million hits per day. The trainees
will also get experience of working in the multilingual, multinational,
multi-disciplinary environment of an international organisation.
Depending on your profile, you can expect to work on one or more of the
following subject areas:
- Information Extraction: named entities, relations, event
scenarios, ...;
- Symbolic or statistical approaches;
- Writing English event and relation extraction rules;
- Document Clustering, Categorisation (Classification;
Categorization);
- Terminology extraction, multilingual lexicology;
- Social networks;
- Visualisation;
- Topic detection and tracking, Trend detection;
- Adapting the JRC's tool set to new languages;
- Web log analysis for our applications;
- Applying text analysis tools to the medical or political domains;
- Mining the NewsExplorer <http://press.jrc.it/NewsExplorer> name
database;
- JAVA re-implementation of PERL programs;
- ...
Applicants must have good programming skills in JAVA or PERL and must be
able to use English as a working language.
Experience with one or more of the following would be a plus: databases, web
technology, XML, knowledge of several natural languages (even passive),
knowledge of - or interest in - medicine or political science, experience of
working with thesauri, ontologies, dictionaries.
If you are interested in this opportunity and you feel that you can
contribute to any of the tasks mentioned above, please follow the
instructions given at http://ipsc.jrc.cec.eu.int/jobs.php?id=7. Please
carbon-copy your email application to Ralf.Steinberger AT jrc.it.
For information on the European Commission's Joint Research Centre and its
Web and Language Technology group, see http://langtech.jrc.it
<http://langtech.jrc.it/> . For more information on traineeships, cost of
living, etc., see http://langtech.jrc.it/WorkatJRC.html.
When applying, please follow the instructions given on the web page for the
call. If you could send a copy of your application to Ralf Steinberger, this
would be useful.
Ralf Steinberger (Ralf.Steinberger AT jrc.it)
European Commission - Joint Research Centre (JRC)
IPSC - SeS - Language Technology ( <http://langtech.jrc.it/>
http://langtech.jrc.it, <http://press.jrc.it/NewsExplorer/>
http://press.jrc.it/NewsExplorer)
JRC-Acquis Multilingual Parallel Corpus (Version 3)
. Freely available for research purposes.
. 22 languages: Bulgarian, Czech, Danish, German, Greek, English, Spanish,
Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese,
Dutch, Polish, Portuguese, Romanian, Slovak, Slovene and Swedish.
. Altogether over 1 Billion words.
. Sentence alignment for 231 language pairs.
. For more information and download, see
<http://langtech.jrc.it/JRC-Acquis.html>
http://langtech.jrc.it/JRC-Acquis.html.
The JRC's Language Technology group specialises in the development of highly
multilingual text analysis tools and in cross-lingual applications. Many
applications are accessible online, e.g.:
. <http://press.jrc.it/NewsExplorer/> NewsExplorer: multilingual news
aggregation and analysis (19 languages); allows to navigate the news over
time and across languages; trend analysis; collects information about people
from the news; social network detection.
. <http://press.jrc.it/> NewsBrief: breaking news detection and display of
the very latest thematic news from around the world; email alerting (22+
languages).
. <http://medusa.jrc.it/> MedISys Medical Information System: latest
health-related news from around the world according to themes and diseases
(22+ languages).
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20070831/9fc74626/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list