[Corpora-List] Summer Intern/Information Extraction and Integration

Ang Sun asun at cs.nyu.edu
Sat Feb 9 16:27:37 UTC 2013


Description http://ch.tbe.taleo.net/CH06/ats/careers/requisition.jsp?org=INTELIUSCORP&cws=39&rid=230

inome is gathering the world’s information and making it people
centric. The inome graph connects billions of entities (people,
organizations and addresses) to encode the information-genome of each
individual. inome Research is looking for graduate interns to work on
developing its next generation of Information Extraction and
Information Integration technologies. The interns for summer 2013 will
explore developing novel strategies for extracting intelligence from
both unstructured text and semi-structured text and integrating the
intelligence with the inome graph. Sample projects include extracting
events, trends, users' interests from unstructured text; extracting
attributes of people from publicly available sources; linking
extracted entities to the entity nodes in the inome graph. This is
likely to be innovative work and we expect the summer internship to
lead to both product impact and a research paper in a top conference.
The internship will be at our headquarters in Bellevue, WA, and offer
a competitive compensation.

inome Research develops cutting-edge systems to standardize, create,
and link intelligence to power inome's industry-leading people
information-genome platform. Team members have published papers in top
research conferences such as NIPS, ACL, VLDB, CIKM, and SIGIR, given
invited talks, organized workshops, and turned algorithms into
deployed systems.

Responsibilities:

    Build and/or extend systems to do high-precision IE from
structured, semi-structured, and unstructured information sources
    Build and/or extend systems to do high-precision linkage from a
variety of information sources
    Design and implement algorithms for evaluating the performance of
IE and linkage

Required Skills:

    Graduate student working on a Ph.D. in Natural Language
Processing, Data Mining, Computational Social Science or related field
    Experience in one or more of the following areas: named entity
extraction, relation extraction, within document and cross-document
coreference, graph-based information extraction and information
fusion/integration
    Self-motivated, creative, and independent researching skills

Desired Skills:

    Strong hands-on skills in Java
    Experience with large-scale machine learning
    Experience with Hadoop
    Familiarity with graph based machine learning toolkits such as GraphLab
    Experience with crowdsourcing/Mechanical Turk
    Experience with NLP toolkits
    Experience with supervised/semi-supervised/unsupervised
information extraction
    Experience with graph-based NLP

Contact:
Please apply online. For faster considerations, please send your CV to
Ang Sun, asun at inome.com

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list