18.248, Calls: Computational Linguistics,Text/Corpus Linguistics/Belgium
LINGUIST Network
linguist at LINGUISTLIST.ORG
Wed Jan 24 17:20:20 UTC 2007
LINGUIST List: Vol-18-248. Wed Jan 24 2007. ISSN: 1068 - 4875.
Subject: 18.248, Calls: Computational Linguistics,Text/Corpus Linguistics/Belgium
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Laura Welcher, Rosetta Project / Long Now Foundation
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Ania Kubisz <ania at linguistlist.org>
================================================================
As a matter of policy, LINGUIST discourages the use of abbreviations
or acronyms in conference announcements unless they are explained in
the text.
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
===========================Directory==============================
1)
Date: 23-Jan-2007
From: Isabelle Lecroart < lecroart at tedm.ucl.ac.be >
Subject: Web as Corpus 2007
-------------------------Message 1 ----------------------------------
Date: Wed, 24 Jan 2007 12:11:56
From: Isabelle Lecroart < lecroart at tedm.ucl.ac.be >
Subject: Web as Corpus 2007
Full Title: Web as Corpus 2007
Short Title: WAC3
Date: 15-Sep-2007 - 16-Sep-2007
Location: Louvain-La-Neuve, Belgium
Contact Person: Bernadette Dehottay
Meeting Email: dehottay at tedm.ucl.ac.be
Web Site: http://cental.fltr.ucl.ac.be/wac3
Linguistic Field(s): Computational Linguistics; Text/Corpus Linguistics
Call Deadline: 01-May-2007
Meeting Description:
3rd Web as Corpus Workshop (WAC3)
Incorporating Cleaneval
An ACL-SIGWAC Event
We invite submissions which
- describe Web corpus collection projects, or modules for one part of the
process (crawling, filtering, language-id, tokenising, lemmatising, POS-tagging
or indexing)
- explore characteristics of Web data from a linguistics/NLP perspective,
including registers, domains and frequency distributions
- use crawled Web data for NLP purposes (with emphasis on the data rather than
the use)
Call for papers
3rd Web as Corpus Workshop (WAC3)
Incorporating Cleaneval
An ACL-SIGWAC Event
Sept. 15-16, 2007
University of Louvain, Louvain-la-Neuve, Belgium
More and more people are using Web data for linguistic and NLP research. The
workshop provides a venue for exploring how we can use it effectively and what
we will find if we do.
Cleaneval
Anyone using Web data needs to clean it to get rid of unwanted material such
as HTML markup, navigation bars and advertisements. To date there has been no
sharing of resources or expertise, and the cleaning has often been done
minimally. Cleaneval is an exercise to promote sharing and to improve our
understanding of the issues. It will take the now-familiar form of an open
competition and shared task. More info is available on the Cleaneval website.
Previous WAC workshops
WAC1, at the Corpus Linguistics conference, Birmingham, UK, July 2005.
WAC2, at EACL, Trento, Italy, April 2006.
Invited speaker: Kevin Scannell
Kevin Scannell, of Saint Louis University, Missouri, USA, has been working
with scholars of a range of smaller languages to develop Web corpora for those
languages; his website currently lists 135 corpora/languages.
Submission
For regular papers
Papers (6-10 pages), demos (max. 2 pages) and posters (max. 2 pages) must be
written in English and follow ACL formatting. Template files (.doc and LaTeX)
are available on the website.
For Cleaneval submissions, see the Cleaneval website.
Deadline: 1 May 2007
Venue
Université catholique de Louvain, in the elegant new city of Louvain-la-Neuve
(Belgium). Large computer rooms will be available for demo sessions.
Points of contact
Workshop Co-chairs
Cédrick Fairon, UCLouvain, Cental, fairon at tedm.ucl.ac.be
Prof. Gilles-Maurice de Schryver, Universiteit Gent
Cleaneval committee
Marco Baroni, U Trento; Secretary, SIGWAC
Tony Hartley, U Leeds
Adam Kilgarriff, Lexical Computing Ltd; Chair, SIGWAC
Serge Sharoff, U Leeds
Local organisation team
Bernadette Dehottay, UCLouvain, Cental, dehottay at tedm.ucl.ac.be
Julia Medori, CENTAL, UCLouvain
Laurent Kevers, CENTAL, UCLouvain
Hubert Naets, CENTAL, UCLouvain
Isabelle Lecroart, CENTAL, UCLouvain
Claude Devis, CENTAL, UCLouvain
Contact us
Bernadette Dehottay
Université catholique de Louvain
Centre for Natural Language Processing (CENTAL)
Place Blaise Pascal, 1
1348 Louvain-la-Neuve
Phone: +32 10 47 37 88
Fax: +32 10 47 26 06
dehottay at tedm.ucl.ac.be
-----------------------------------------------------------
LINGUIST List: Vol-18-248