31.1062, Jobs: Computational Linguistics: Rank Open, Hieronymus AG

The LINGUIST List linguist at listserv.linguistlist.org
Thu Mar 19 00:59:38 UTC 2020


LINGUIST List: Vol-31-1062. Wed Mar 18 2020. ISSN: 1069 - 4875.

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Jeremy Coburn
Managing Editor: Becca Morris
Team: Helen Aristar-Dry, Everett Green, Sarah Robinson, Peace Han, Nils Hjortnaes, Yiwen Zhang, Julian Dietrich
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Wed, 18 Mar 2020 20:55:55
From: Tatjana Reimann [translations at hieronymus.com]
Subject: Computational Linguistics: Rank Open, Hieronymus AG, Zurich, Switzerland

 
University or Organization: Hieronymus AG 
Web Address: https://hieronymus.ch/
Job Title: Computer Linguist(s) (M/F) – 40-100%
Job Rank: Rank Open

Specialty Areas: Computational Linguistics 


Description:

Hieronymus Ltd provides high-quality legal translation services for law firms
and legal departments of banks and large Swiss corporations.
Since early 2020, Hieronymus Ltd has also offered the first neural machine
translation engine specialising in Swiss law: LEXMachina. Although the initial
results are already extremely promising, we intend to continue improving this
technology in order to produce increasingly accurate translations. For this
purpose, we are seeking to hire one or more Computer Linguist(s) interested in
working in the field of legal linguistics and helping to develop dedicated
neural translation engines.

Position description: 

As a Computer Linguist for LEXMachina, you will be responsible for the entire
data processing pipeline and for improvements to the LEXMachina engines.
More specifically, your duties will involve developing scraping scripts to
gather content from websites; performing optical character recognition (OCR)
and customising OCR tools for text extraction; converting certain content into
standardised formats (e.g. PDF to XML or DOCX) and automating that conversion;
cleaning the selected documents (post-processing of the converted texts)
before they are fed to the engine; and improving our NLP tool for categorising
those documents according to their legal field. You will also
develop and implement algorithms to pair texts available in multiple languages
(document pairing), segment them, and align them to create parallel corpora
that can be fed into LEXMachina. Finally, you will evaluate the corpora
created in this way to identify any problems.

With the continual aim of improving LEXMachina, you will analyse feedback from
users and find appropriate solutions for incorporating the requested changes
into the system, for example by defining pre-editing or post-editing rules, in
collaboration with our NMT partner.

Finally, in addition to constantly improving the "baseline" engine, you will
help to develop new engines in various areas of law, finance and insurance.

Requirements:

- Computer Linguist with a master's degree in computational linguistics,
computer science, engineering, etc., or currently studying for one
- Passion for languages and a strong interest in the legal and financial fields
- Good knowledge of English, German, French (Italian a plus)
- Excellent analytical skills 
- Good understanding of DTP and digital publishing processes (HTML markup,
CSS, CMS, boilerplate, tags, encodings, etc.)
- Familiarity with OCR tools and processes (ABBYY FineReader, Tesseract,
etc.); familiarity with publishing-related tools and data formats a plus
(Adobe InDesign, Acrobat, QuarkXPress, images)
- Experience with the command line and with Python, Perl and batch (BAT)
scripts (shell scripting a significant plus)
- Understanding of Unix/Linux operating systems, as well as Windows servers
- Familiarity with the translation industry and CAT tools a plus (XLIFF,
SDLTM, TMX, CSV, etc.)

What we can offer you:

- The opportunity to play an active role in the development of the first Swiss
legal translation engine and to contribute to technological advances in this
area
- Work in close collaboration with expert lawyer-linguists with a well-honed
eye for quality and accuracy
- Various trips (outside Switzerland) to our NMT partner, which is one of the
European market leaders in neural translation and in research on the topic
- A pleasant working atmosphere within a small, highly motivated team

The start date and salary are to be agreed upon with the candidate(s) based on
their level of training and their experience.

Please submit your application by email (max. 2 MB) to the application address
given below.
We look forward to receiving it.



Application Deadline:  (Open until filled)
	  
Email Address for Applications: stelle at hieronymus.com 
Contact Information:
	Tatjana Reimann 
	Email: translations at hieronymus.com 


