[Corpora-List] Summer Intern/Large Scale Graph Mining

Ang Sun asun at cs.nyu.edu
Sat Feb 9 16:29:00 UTC 2013


Description http://ch.tbe.taleo.net/CH06/ats/careers/requisition.jsp?org=INTELIUSCORP&cws=39&rid=231

inome is gathering the world’s information and making it
people-centric. The inome graph connects billions of entities (people,
organizations and addresses) to encode the information-genome of each
individual. inome Research develops cutting-edge systems to extract,
standardize, link and create intelligence to power inome’s
industry-leading people search engine and platform development
environment. inome Research is a team of scientists with extensive
expertise in Record Linkage, Natural Language Processing, Entity
Resolution, Data Deduplication, Machine Learning and Information
Retrieval. The internship will be at our headquarters in Bellevue, WA,
and offers competitive compensation.

Responsibilities:

This position will explore large-scale graph algorithms for problems
at the interface of people search and data record linkage, using
billions of person records derived from sources ranging from public
social network profiles to phone books. You will design advanced
algorithms, implement them to run on a large Hadoop cluster, and
monitor the quality of inome’s person matching system. You will also
design and develop tools on top of the inome entity graph by exploring
the entities and their connections. Sample projects include
recommendation systems, community detection, and finding influential
people. We expect this to be innovative work, and we expect the summer
intern to improve or extend our products and publish a paper at a top
conference.

Required Skills:

- Pursuing a Ph.D. in Computer Science with a focus on large-scale
  graphs, graph mining, social media analytics, data mining, machine
  learning, natural language processing or related fields
- Experience with large-scale graph algorithms, clustering, PageRank,
  and community detection
- Strong hands-on skills in object-oriented design methodology and
  application development in Java
- Proficiency in at least one of Perl, PHP, Python
- Basic understanding of Hadoop and/or MapReduce
- Familiarity with the backend architecture and development of
  large-scale distributed systems
- Familiarity with graph-based machine learning toolkits such as
  GraphLab
- Familiarity with graph databases, preferably with hands-on
  experience in Neo4j or InfiniteGraph
- Familiarity with graph query languages

Desired Skills:

- Interest in solving problems with big data collected from various
  sources
- Excellent understanding of computer science fundamentals, data
  structures, and algorithms
- Excellent problem-solving skills
- Past publications on graph algorithms, social networks, or related
  fields
- Familiarity with Amazon Mechanical Turk or other human evaluation
  systems

Contact:
Please apply online at
http://ch.tbe.taleo.net/CH06/ats/careers/apply.jsp?org=INTELIUSCORP&cws=39
or send your CV to Ang Sun, asun at inome.com
