23.3938, Jobs: Comp Ling; Language Documentation; Lexicography: Lexical Data Specialist, The Long Now Foundation

linguist at linguistlist.org linguist at linguistlist.org
Sun Sep 23 17:39:36 UTC 2012


LINGUIST List: Vol-23-3938. Sun Sep 23 2012. ISSN: 1069 - 4875.

Subject: 23.3938, Jobs: Comp Ling; Language Documentation; Lexicography: Lexical Data Specialist, The Long Now Foundation

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews: Veronika Drake, U of Wisconsin Madison
Monica Macaulay, U of Wisconsin Madison
Rajiv Rao, U of Wisconsin Madison
Joseph Salmons, U of Wisconsin Madison
Anja Wanner, U of Wisconsin Madison
       <reviews at linguistlist.org>

Homepage: http://linguistlist.org

Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!

USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21

For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.

Editor for this issue: Sarah Fox <sarah at linguistlist.org>
================================================================  

Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/
					
					

Date: Sun, 23 Sep 2012 13:39:14
From: Jonathan Pool [jobs at panlex.org]
Subject: Computational Linguistics; Language Documentation; Lexicography: Lexical Data Specialist, The Long Now Foundation, California, USA

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=23-3938.html&submissionid=4553925&topicid=7&msgnumber=1
 
University or Organization: The Long Now Foundation 
Department: PanLex Project
Job Location: California, USA 
Web Address: http://panlex.org
Job Rank: Lexical Data Specialist  

Specialty Areas: Computational Linguistics; Language Documentation; Lexicography 


Description:

The PanLex Project at The Long Now Foundation seeks a Lexical 
Data Specialist to help develop its massively multilingual database, 
being built to document all known translations among all the words 
of all languages in the world. We envision this development taking 
place through 1) hands-on conversion of legacy lexical data into 
structured data, 2) the acquisition of new printed, digital, and 
crowdsourced data, and 3) the cultivation of partnerships that bring 
new data into the project, or enhance existing data in creative ways 
(for example, by linking to other datasets, or providing new tools and 
interfaces for the PanLex dataset).

Responsibilities:
Conversion of Legacy Data:
-  Become familiar with the PanLex stack (Apache, PostgreSQL, Perl) 
and upload procedures
-  Develop scripts to convert legacy lexical data into structured data 
for upload into the PanLex database

Recruitment and Training of Volunteers:
-  Work with project staff to develop a structured internship program 
that we can pilot during the summer of 02013. Participants will be 
introduced to the PanLex stack as well as an application of Unicode 
that provides a foundation for developing enterprise-level 
multilingual Web applications.
-  Recruit, train, and supervise interns and other volunteers
-  Supervise existing interns and volunteers

Acquisition of New Data:
-  Oversee acquisition and importation of PanLex data, addressing 
logistical, legal, and IT issues
-  Research and identify opportunities for acquiring new lexical data

Outreach:
-  Handle requests for use of PanLex data
-  Research and identify partnership opportunities, including potential 
donors and grantors
-  Represent the PanLex Project in communications with partner 
organizations.
-  Represent the PanLex Project at meetings, events, and 
conferences

Qualifications:
-  Relevant bachelor's or master's degree (linguistics, computer 
science, informatics, or nonprofit management)
-  Experience with open source database management and Web 
development
-  Knowledge of programming and scripting languages
-  Knowledge of linguistic concepts
-  Familiarity with methods of computational natural language 
processing
-  Familiarity with issues of endangered languages and language 
revitalization
-  Familiarity with writing systems, character encodings, and Unicode
-  Experience with project planning and management
-  Experience recruiting and supervising employees or volunteers
-  Ability to read and write English fluently
-  Knowledge of two or more languages other than English a plus
-  Experience with lexicography a plus

Employment Conditions:
The Lexical Data Specialist will work full- or part-time at Fort Mason 
Center in San Francisco or in Berkeley. The time commitment, office 
location, compensation, and start date are negotiable.

To Apply:
Please reply to the application email address below with a subject 
line that reads "PanLex Project - Lexical Data Specialist". Your initial 
communication should be an email message of no more than 200 
words, with no attachments, summarizing your main qualifications, 
motivations, and potential contributions. We'll circulate your 
message among our key personnel and contact you if we require 
additional information.


Application Deadline:  (Open until filled)
	  
Email Address for Applications: jobs at longnow.org 
Web Address for Applications: http://panlex.org/help/job-lds.shtml 
Contact Information:
	Project Director Jonathan Pool 
	Email: jobs at panlex.org 





----------------------------------------------------------
LINGUIST List: Vol-23-3938	
----------------------------------------------------------
Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/
					
					



More information about the LINGUIST mailing list