20.3325, Jobs: Comp Ling, Natural Lang Processing: Software Engineer, Amazon

linguist at LINGUISTLIST.ORG linguist at LINGUISTLIST.ORG
Fri Oct 2 15:26:05 UTC 2009


LINGUIST List: Vol-20-3325. Fri Oct 02 2009. ISSN: 1068 - 4875.

Subject: 20.3325, Jobs: Comp Ling, Natural Lang Processing: Software Engineer, Amazon

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Monica Macaulay, U of Wisconsin-Madison  
Eric Raimy, U of Wisconsin-Madison  
Joseph Salmons, U of Wisconsin-Madison  
Anja Wanner, U of Wisconsin-Madison  
       <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Erica Wicks <erica at linguistlist.org>
================================================================  

The LINGUIST List strongly encourages employers to use
non-discriminatory standards in hiring policy. In particular we urge
that employers do not discriminate on the grounds of race, ethnicity,
nationality, age, religion, gender, or sexual orientation. However, we
have no means of enforcing these standards.

Job seekers should pay special attention to language in ads regarding
employment requirements and are encouraged to consult our international
employment page http://linguistlist.org/jobs/jobnet.html. This page has been set 
up so that people can report on the employment standards of various countries.

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.

===========================Directory==============================  

1)
Date: 30-Sep-2009
From: C.C. Scott < ccscott at amazon.com >
Subject: English & Computational Linguistics, Natural Language Processing and Inforamtion Retrieval: Software Development Engineer, Amazon, Washington, USA
 

	
-------------------------Message 1 ---------------------------------- 
Date: Fri, 02 Oct 2009 11:24:10
From: C.C. Scott [ccscott at amazon.com]
Subject: English & Computational Linguistics, Natural Language Processing and Inforamtion Retrieval: Software Development Engineer, Amazon, Washington, USA

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=20-3325.html&submissionid=232254&topicid=7&msgnumber=1
  


University or Organization: Amazon 
Job Location: Washington, USA 
Web Address: http://www.amazon.com

Job Rank: Software Development Engineer  

Specialty Areas: Computational Linguistics; Natural Language Processing, Data Mining, Information Retrieval

Required Language(s): English (eng)

Description:

Amazon.com's Darwin team is looking for exceptional software engineers to
develop algorithms and build systems to automatically solve a variety of
Information Retrieval and Data Mining problems related to the Amazon
Product Catalog - one of the company's biggest assets. 

Our principal challenge is to improve the shopping experience by detecting
duplicate products for sale in the catalog and merging them. Merchants on
Amazon.com provide information about the products they want to sell. Amazon
attempts to match these product data submissions to items in its catalog so
that it can display offers for the same product on a single page. Poorly
structured or incomplete data makes this problem very challenging and often
results in duplicate products getting created in the catalog. These
duplicate products are shown in search results and end up confusing
customers, leading to a bad customer experience. The Darwin team detects
these duplicate products in the Amazon.com catalog using an innovative mix
of Information Retrieval, Data Mining and Natural Language Processing
algorithms and human intelligence harnessed via the Amazon Mechanical Turk.
We then automatically merge products detected as duplicates together,
improving customer experience and the quality of the catalog.

We are also responsible for a variety of other Catalog-related projects
such as placing Product Advertisements on pages, automatically extracting
important product features from the product description with a view to
improving the discovery (search and browse) experience on the website and
detecting egregious cases of poor quality data provided by sellers. 

We are a highly-motivated, co-operative and fun loving team who thrive on
solving challenging problems with innovation. As part of this team you will
be analyzing data, developing new algorithms, building large-scale
distributed software systems in Java using open source technologies such as
Apache Lucene and JBoss and other Amazon.com proprietary technologies.

Qualifications:

The ideal candidate will have the following qualifications:

    * Advanced degree in Computer Science, Math or related field with 5+
years of experience.
    * Past experience in at least one of the following areas - Information
Retrieval, Data Mining, Natural Language Processing or Machine Learning.
    * Desire to analyze data while developing solutions to problems.
    * Strong desire to build high-performance, highly-available and
scalable distributed systems.
    * Strong design and coding skills in Java/C++ on Unix Platforms.
    * Familiar with Perl and have a good understanding of SQL.
    * Be highly innovative, flexible and self-directed.
    * Excellent written and verbal communication skills.


Application Deadline: 31-Dec-2009 
	  
Email Address for Applications: ccscott at amazon.com 
Contact Information:
	C.C. Scott 
	Email: ccscott at amazon.com 



-----------------------------------------------------------
LINGUIST List: Vol-20-3325	

	



More information about the LINGUIST mailing list