25.3781, Calls: Hindi, Tamil, Malayalam, English, Computational Linguistics/India

Fri Sep 26 19:52:17 UTC 2014

LINGUIST List: Vol-25-3781. Fri Sep 26 2014. ISSN: 1069 - 4875.

Subject: 25.3781, Calls: Hindi, Tamil, Malayalam, English, Computational Linguistics/India

Moderators: Damir Cavar, Indiana U <damir at linguistlist.org>
            Malgorzata E. Cavar, Indiana U <gosia at linguistlist.org>

Reviews: reviews at linguistlist.org
Anthony Aristar <aristar at linguistlist.org>
Helen Aristar-Dry <hdry at linguistlist.org>
Sara Couture, Indiana U <sara at linguistlist.org>

Homepage: http://linguistlist.org

Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!

USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21

For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.

Editor for this issue: Anna White <awhite at linguistlist.org>
================================================================  

Date: Fri, 26 Sep 2014 15:52:04
From: Pattabhi RK Rao [pattabhi at au-kbc.org]
Subject: Named Entity Recognition for Indian Languages at Forum for Information Retrieval Evaluation

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=25-3781.html&submissionid=35961737&topicid=3&msgnumber=1

Full Title: Named Entity Recognition for Indian Languages at Forum for Information Retrieval Evaluation 
Short Title: NER-IL @ FIRE 2014 

Date: 05-Dec-2014 - 07-Dec-2014
Location: Bangalore, India 
Contact Person: Sobha Lalitha Devi
Meeting Email: sobha at au-kbc.org
Web Site: http://au-kbc.org/nlp/NER-FIRE2014 

Linguistic Field(s): Computational Linguistics 

Subject Language(s): English (eng)
                     Hindi (hin)
                     Malayalam (mal)
                     Tamil (tam)

Call Deadline: 10-Oct-2014 

Meeting Description:

NER-IL @FIRE 2014 - Named Entity Recognition for Indian Languages
http://www.au-kbc.org/nlp/NER-FIRE2014/

This is an evaluation lab workshop on Automatic Identification of Embedded/Nested Named Entities in the text documents using machine learning techniques. The corpus is available in 3 Major Indian languages Hindi, Tamil and Malayalam, also English. The corpus is more than 100K words and has been manually annotated. The NER-IL is held as part of Forum for Information Retrieval Evaluation (FIRE 2014) Workshop/Conference at Bangalore, India.

Organizers:

Pattabhi RK Rao, AU-KBC Research Centre, Chennai, India
Malarkodi CS, AU-KBC Research Centre, Chennai, India
Vijay Sundar Ram, AU-KBC Research Centre, Chennai, India
Sobha Lalitha Devi, (Chair) , AU-KBC Research Centre, Chennai, India

Call for Participation:

The Task Objective:

Automatic Identification of Embedded/Nested Named Entities in the text documents using machine learning techniques. The documents are available in 3 Major Indian languages Hindi, Tamil and Malayalam, also English. Training data has been released and available for download on the task
website. The corpus is more than 100K words and has been manually annotated.

Task Description:

Named Entity Recognition(NER) Refers to automatic identification of named entities in a text document. It is a known fact that some of the Named entities contain other named entities inside them. In the field of Named entity recognition, it is observed that the task of embedded named entity
identification has been ignored. Advantages of embedded named entity recognition is that this helps identifying entity relationships and also in higher NLP applications especially in the development of Information extraction systems. 

One of the biggest challenges in embedded named entity recognition is the availability of benchmark data with embedded tagging. And especially for Indian languages we have no such data. Here we have made efforts to provide benchmark data for Indian languages with embedded tagging. 

Important Dates:

September 23, 2014: Training Corpus released
October 4, 2014: Development data release
October 10, 2014: Registration Deadline
October 16, 2014: Test data release
October 23, 2014: Test runs submission
November 10, 201: Working Notes due
December 5 - 7, 2014: FIRE 2014 Workshop @ Bangalore

Contact: Organizing Chair - sobha at au-kbc.org

----------------------------------------------------------
LINGUIST List: Vol-25-3781	
----------------------------------------------------------