[Corpora-List] Second Call for EMNLP 2014 Workshop & Shared Task on Arabic Natural Language Processing

Nizar Habash habash at ccls.columbia.edu
Tue May 6 18:35:32 UTC 2014


=======================================================

Second Call for Papers and Participation
EMNLP Workshop on Arabic Natural Language Processing 
Including Shared Task on Automatic Arabic Error Correction

      Apologies for multiple postings
      Please distribute to colleagues

=======================================================

Second Call for Papers and Participation

Arabic Natural Language Processing Workshop
collocated with EMNLP 2014, Doha, Qatar 

Workshop date: Saturday October 25, 2014
Paper submission deadline: July 26, 2014
Shared task registration deadline: July 1, 2014

Workshop Website: http://www.emnlp2014.org/workshops/anlp/call.html   
Shared Task Website: http://www.emnlp2014.org/workshops/anlp/shared_task.html 

=======================================================

WORKSHOP DESCRIPTION

There has been a lot of progress in the last 15 years in the area of
Arabic Natural Language Processing (NLP).  Many Arabic NLP (or Arabic
NLP-related) workshops and conferences have taken place, both in the
Arab World and in association with international conferences. This
workshop follows in the footsteps of previous efforts to provide a
forum for researchers to share and discuss their ongoing work.

We invite submissions on topics that include, but are not limited to,
the following: 

* Basic core technologies: morphological analysis, disambiguation,
  tokenization, POS tagging, named entity detection, chunking,
  parsing, semantic role labeling, sentiment analysis, Arabic dialect
  modeling, etc.

* Applications: machine translation, speech recognition, speech
  synthesis, optical character recognition, pedagogy, assistive
  technologies, social media, etc.

* Resources: dictionaries, annotated data, specialized databases etc.
	
Submissions may include work in progress as well as finished work.
Submissions must have a clear focus on specific issues pertaining to
the Arabic language whether it is standard Arabic, dialectal, or
mixed. Descriptions of commercial systems are welcome, but authors
should be willing to discuss the details of their work.  Submissions
are expected to be 8 pages long plus 2 pages for references.
Associated with the workshop will be a shared task on Arabic text
error correction (details below).

SHARED TASK

As part of the Arabic Natural Language Processing Workshop at EMNLP
2014, we will conduct a shared task on Automatic Arabic Error
Correction. We designed this task in the traditions of high profile
shared tasks in natural language processing such as CONLL?s
grammar/error detection and correction shared tasks in 2011-2013 and
numerous machine translation campaigns by NIST/WMT/MEDAR, among
others.  The task relies on resources created under the Qatar Arabic
Language Bank (QALB) project (currently over 1M words of manually
corrected Arabic text).

A participating system in this shared task will be given Modern
Standard Arabic texts, which are to be automatically corrected. The
input will be provided in Arabic script, and will be annotated for
part-of-speech (in different granularities), inflectional features,
clitics (which appear in 20% of Arabic words), lemmas, and English
glosses.  All of the input text will be preprocessed in a common way
to make sure all participants have access to all of these features at
no additional overhead novelty cost.  We follow the file format and
evaluation framework used by the CONLL shared tasks on error
correction. The task is focused on correction as opposed to
identification. There will not be an error identification task per se.

Participants need to register.  Once registered, all participating
teams will be provided with a common training data set, which includes
common preprocessed input and corrected output. Registration link is
on the Shared Task Website (see above). A common development set will
also be provided. A blind test data set will be used to evaluate the
output of the participating teams. An evaluation script will be
provided to all the teams.  Each participating team can submit up to
three systems.  Participants are welcome to use additional resources
and tools that are not part of the released data set. However, all
such additions must be fully disclosed.  

Participants are expected to author a short paper (4 pages + 2 for
references) describing their approach, resources and experiments.  The
paper needs to follow the standard format of EMNLP conference.

IMPORTANT DATES

Shared task registration period: April 8, 2014 through July 1, 2014
Shared task test release:  July 7, 2014
Shared task system output collection: July 18, 2014
Submission deadline (Workshop and shared task papers): July 26, 2014
Author notification: August 26, 2014
Camera Ready: September 15, 2014
Workshop: October 25, 2014 

ORGANIZERS
Program Co-chairs
Nizar Habash, Columbia University
Stephan Vogel, Qatar Computing Research Institute

Publication Co-chairs
Nadi Tomeh, Paris 13 University
Houda Bouamor, Carnegie Mellon University Qatar

Website Committee
Kareem Darwish, Qatar Computing Research Institute
Noura Farra, Columbia University 

Shared Task Committee
Behrang Mohit (co-chair), Carnegie Mellon University Qatar
Alla Rozovskaya (co-chair), Columbia University
Wajdi Zaghouani, Carnegie Mellon University Qatar 
Ossama Obeid, Carnegie Mellon University Qatar
Nizar Habash (advisor), Columbia University 

Program Committee Members 

Abdelmajid Ben-Hamadou, University of Sfax, Tunisia
Abdelhadi Soudi, Ecole Nationale de l’Industrie Minérale, Morocco
Abdelsalam Nwesri, University of Tripoli, Libya
Achraf Chalabi , Microsoft Research, Egypt
Ahmed Ali, Qatar Computing Research Institute, Qatar
Ahmed Rafea, The American University in Cairo, Egypt
Alexis Nasr,  University of Marseille, France
Ali Farghaly, Monterey Peninsula College, USA
Almoataz B. Al-Said, Cairo University, Egypt
Alon Lavie, Carnegie Mellon University, USA
Aly Fahmy, Cairo University, Egypt
Azadeh Shakery, University of Tehran, Iran
Azzeddine Mazroui, University Mohamed I, Morocco	
Bassam Haddad, University of Petra, Jordan
Bayan Abu Shawar, Arab Open University, Jordan
Behrang Mohit, Carnegie Mellon University Qatar, Qatar
Eric Atwell, University of Leeds, UK
Farhad	Oroumchian, University of Wollongong, Australia
Ghassan Mourad, Université Libanaise, Lebanon
Hassan Sawaf, eBay Inc., USA
Hazem Hajj, American University of Beirut, Lebanon
Hend Alkhalifa, King Saud University, Saudi Arabia
Houda Bouamor, Carnegie Mellon University Qatar, Qatar
Imed Zitouni, Microsoft Research, USA
Joseph Dichy, Université Lyon 2, France
Karim Bouzoubaa , Mohammad V University, Morocco
Karine	Megerdoomian, The MITRE Corporation, USA
Katrin Kirchhoff, University of Washington, USA
Kemal Oflazer, Carnegie Mellon University Qatar, Qatar
Khaled Shaalan, The British University in Dubai, UAE
Khaled Shaban, Qatar University, Qatar
Khalil Sima’an, Universiteit van Amsterdam, Netherlands
Lamia Hadrich Belguith, University of Sfax, Tunisia
Michael Rosner, University of Malta, Malta
Mohamed Elmahdy, Qatar University, Qatar
Mohsen Rashwan, Cairo University, Egypt
Mona Diab, George Washington University, USA
Mustafa Jarrar, Bir Zeit University, Palestine
Nada Ghneim, Higher Institute for Applied Sciences and Technology, Syria
Nadi Tomeh, University Paris 13, France
Ossama Emam, IBM, USA
Otakar Smrž, Džám-e Džam Language Institute, Czech Republic
Owen Rambow, Columbia University, USA
Preslav Nakov, Qatar Computing Research Institute, Qatar
Ramzi Abbes, TECHLIMED, France
Salwa Hamada, Cairo University, Egypt
Shahram Khadivi, Tehran Polytechnic, Iran
Sherri Condon , The MITRE Corporation, USA
Taha Zerrouki, University of Bouira, Algeria
Violetta Cavalli-Sforza, Al Akhawayn University, Morocco



_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list