21.207, Internships: Comp Ling: Information Extraction, Intelius, California, USA
linguist at LINGUISTLIST.ORG
linguist at LINGUISTLIST.ORG
Wed Jan 13 21:30:33 UTC 2010
LINGUIST List: Vol-21-207. Wed Jan 13 2010. ISSN: 1068 - 4875.
Subject: 21.207, Internships: Comp Ling: Information Extraction, Intelius, California, USA
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Monica Macaulay, U of Wisconsin-Madison
Eric Raimy, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.
Editor for this issue: Matthew Lahrman <matt at linguistlist.org>
================================================================
The LINGUIST List strongly encourages employers to use
non-discriminatory standards in hiring policy. In particular we urge
that employers do not discriminate on the grounds of race, ethnicity,
nationality, age, religion, gender, or sexual orientation. However, we
have no means of enforcing these standards.
To submit an internship, use our convenient web form at
http://linguistlist.org/internship.
===========================Directory==============================
1)
Date: 11-Jan-2010
From: Andrew Borthwick < aborthwick at intelius.com >
Subject: Computational Linguistics: Information Extraction Graduate Summer Intern, Intelius, Redwood City, California, USA
-------------------------Message 1 ----------------------------------
Date: Wed, 13 Jan 2010 16:28:36
From: Andrew Borthwick [aborthwick at intelius.com]
Subject: Computational Linguistics: Information Extraction Graduate Summer Intern, Intelius, Redwood City, California, USA
E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=21-207.html&submissionid=2373434&topicid=15&msgnumber=1
University or Organization: Intelius, Inc.
Department: Data Research Team
Web Address: http://www.intelius.com
Type of Work: NLP
Linguistic Field(s): Computational Linguistics
Duration: 01-Jun-2010 to 31-Aug-2010
Compensation: Paid Competitive
Internship Location: Redwood City, California, USA
Minimum Education Level: BA
Special Qualifications: Current graduate student <br />
Description:
The successful candidate will work to enhance and extend Intelius' system for doing information extraction from biographical data. An example problem would be to extract education, job title, image, and descriptive snippets for each individual from a page such as http://www.google.com/intl/en/corporate/execs.html. This task is similar to the WePS-2 attribute extraction task (see http://nlp.uned.es/weps/weps2/papers/weps2-ae-task-description.pdf)
This is an exciting opportunity to work with a large crawl of the web, ample hardware, and a team of engineers focused on the problem. The internship will be at our Silicon Valley office in Redwood City, California and will offer a competitive salary.
Responsibilities:
* Primary goal is to enhance and extend Intelius' information extraction algorithms. This task encompasses work in high-precision named entity identification, attribute extraction, intra-document coreference resolution, tokenization, and sentence boundary detection. The candidate will then test these algorithms by running experiments on a massive scale.
Required Skills:
* Graduate student working on an M.S. or Ph.D. in computer science, computational linguistics, or related field
* Thesis focus on entity resolution or information extraction preferred.
* Strong hands-on skills in Java or Python
* Experience with complex regular expressions
Desired skills:
* Familiarity with entity resolution literature
* Experience with GATE or other NLP toolkits
* Experience with Hadoop
Application Deadline: Open until filled.
Web Address for Applications:http://tbe.taleo.net/NA6/ats/careers/requisition.jsp?org=INTELIUSCORP&cws=1&rid=31
Contact Information:
Andrew Borthwick, Ph.D.
Email: aborthwick at intelius.com
Phone: 425-974-8912
Fax: 650-365-3300
-----------------------------------------------------------
LINGUIST List: Vol-21-207
More information about the LINGUIST
mailing list