[Corpora-List] Final CFP: 2nd WAC Workshop, at EACL

Adam Kilgarriff adam at lexmasterclass.com
Fri Jan 6 09:33:27 UTC 2006


                          Final Call for Papers: 
                       2nd WEB AS CORPUS WORKSHOP 

In conjunction with the 11th Conference of the European Chapter of the 
Association for Computational Linguistics (EACL) 

                               Trento, Italy 
                               April 4, 2006 

Submissions by 6 Jan 2006 at
http://www.softconf.com/start/EACL06_WS01/ 

Workshop site: 
http://sslmit.unibo.it/~baroni/web_as_corpus_eacl06.html 

Previous WaC Workshop: 
http://sslmit.unibo.it/~baroni/web_as_corpus_cl05.html 

                   Co-chairs: Adam Kilgarriff and Marco Baroni 

                                     Topics 
                                     ------ 

Research on the Web as corpus is currently at a very exciting stage:
increasing evidence points to the enormous potential of the Internet as a
source of linguistic data, but we are still far from a working,
fully-fledged 
linguists' search engine. Many fundamental issues are just starting to be
tackled, ranging from Web frequency distributions and registers, to
efficient handling of massive data sets, to copyright.

We invite submissions which: 

- describe Web corpus collection projects, or modules for one part of 
the process (crawling, filtering, language-id, tokenizing, 
lemmatizing, POS-tagging, indexing, ...) 

- explore characteristics of Web data, from a linguistics/NLP 
perspective 

- use crawled Web data for NLP purposes. 

Preference will be given to projects where Web data are downloaded and 
processed directly, rather than via search engine interfaces. 

Submission Information 
---------------------- 

Authors are invited to submit full papers on original, unpublished 
work in the topic area of this workshop. Submissions should follow the 
two-column format of ACL proceedings and should not exceed eight (8) 
pages, including references. We strongly recommend the use of ACL 
LaTeX or Microsoft Word style files tailored for this year's 
conference available at 

http://eacl06.itc.it/submission/submission.htm 

Papers must conform to the official EACL-06 style guidelines, and we 
reserve the right to reject submissions that do not conform to these 
styles, including font size restrictions. Submissions should be in PDF 
format and must include all fonts, so that the paper will print (not 
just view) anywhere. 

Please submit your paper no later than January 6, 2006, at
http://www.softconf.com/start/EACL06_WS01/ 

Each submission will be reviewed at least by two members of the 
program committee. Accepted papers will be published in the workshop 
proceedings. 

Dual submissions to the main EACL 2006 conference and this workshop 
are allowed; if you submit to the main session, do indicate this when 
you submit to the workshop, and specify your EACL submission reference 
number, for administrative ease. If your paper is accepted for the 
main session, you should withdraw your paper from the workshop upon 
notification by the main session. 

Important Dates 
--------------- 

January 6, 2006 - Deadline for workshop papers 

January 27, 2006 - Notification of acceptance 

February 10, 2006 - Camera-ready papers due 

April 4, 2006 - Workshop 

Program Committee 
----------------- 

Marco Baroni (co-chair) 
Silvia Bernardini 
Massimiliano Ciaramita 
Stefan Evert 
William H. Fletcher 
Gregory Grefenstette 
Frank Keller 
Adam Kilgarriff (co-chair) 
Mirella Lapata 
Anke Lüdeling 
Philip Resnik 
Serge Sharoff 

Contacts 
--------

Adam Kilgarriff: adam_AT_lexmasterclass.com 
Marco Baroni: baroni_AT_sslmit.unibo.it 
 



More information about the Corpora mailing list