[Corpora-List] Call for Participation - SIGIR 2010 Web N-gram Workshop - July 23, 2010

Evelyne Viegas evelynev at microsoft.com
Fri Jul 16 21:00:07 UTC 2010



Call for Participation



Web N-gram Workshop<http://research.microsoft.com/webngram>

at the 33rd Annual ACM SIGIR Conference (SIGIR 2010)

July 23, 2010

Geneva, Switzerland



Workshop registration is open to all SIGIR 2010 attendees. We hope to see you there.





We have an exciting workshop coming up on July 23. See below for the talks, keynote, panel, and tutorial. Be a part of it and come enrich the discussions.

The workshop overview and goals can be seen at: http://research.microsoft.com/en-us/events/webngram/

Agenda below:
9:00-9:05

Introduction-Workshop Goals

9:05-9:50

Beyond Googleology: Assessing the Composition of the Web as a Large Corpus
Serge Sharoff, Centre for Translation Studies, University of Leeds

9:50-10:15

Information Extraction from Web-Scale N-Gram Data
Niket Tandon, Gerard De Melo, Max Planck Institute

10:15-10:45

Break

10:45-11:10

A Comparative Study of Bing Web N-gram Language Models for Web Search and Natural Language Processing
Jianfeng Gao, Patrick Ngyuen, Microsoft Research; Xiaolong Li, Microsoft Bing; Chris Thrasher, Mu Li, Kuansan Wang, Microsoft Research

11:10-11:35

Verifying the Implicit Presence of Difficult Query Aspects using a Large External Corpus
Dmitri Roussinov, University of Strathclyde

11:35-12:05

Minimal Perfect Hash Rank: Efficient Storage of Large Language Models
David Guthrie, Mark Hepple, University of Sheffield

12:05-14:00

Lunch

14:00-15:10

Global Statistics in Proximity Weighting Models
Craig Macdonald, Iadh Ounis, University of Glasgow

Further Studies on Multi-Style Language Model for Web Information Retrieval
Xiaolong Li, Microsoft Bing; Jianfeng Gao, Kuansan Wang, Microsoft Research

Using Web N-Grams to Help Second-Language Speakers
Martin Potthastm, Martin Trenkmann, Benno Stein, Bauhaus-Universität Weimar

Comparing Web N-grams and Other Means of Identifying Named Entities in Corporate Blogs
Aditya Rachakonda, Srinath Srinivasa, IIITB; Sudarshan Murthy, Wipro Technologies; Avinashreddy Palleti, Ramya Krishna, IIITB

Language Differences and Metadata Features on Twitter
Emre Kiciman, Microsoft Research

15:10-15:40

Tutorial
An Introduction to Web N-gram Service
Kuansan Wang, Microsoft Research

15:40-16:10

Break

16:10-17:20

Panel
Web-based Data Services for Research - Challenges and Opportunities

  *   Kenneth Church, John Hopkins University
  *   Evgeniy Gabrilovich, Yahoo! Research
  *   Haym Hirsch, National Science Foundation
  *   Donald Metzler, Information Sciences Institute University of Southern California
  *   Kuansan Wang, Microsoft Research

17:20-17:30

Wrap Up



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20100716/61d4fea1/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list