[Corpora-List] To Saad Mahamood RE: Corpora Digest, Vol 23, Issue 26

Davis, Boyd bdavis at uncc.edu
Wed May 27 13:10:27 UTC 2009


Saad, you might want to contact Peyton Mason ktmason at email.msn.com about stance analysis, which is computer-supported coding of text for various stances: see Stance-shifting in language used by sex offenders: Five case studies, by: Lord, Vivian, Davis, Boyd, Mason, Peyton. Psychology, Crime and Law, Vol. 14, No. 4. (August 2008), pp. 357-379.
Boyd Davis

-----Original Message-----
From: corpora-bounces at uib.no on behalf of corpora-request at uib.no
Sent: Wed 5/27/2009 9:00 AM
To: corpora at uib.no
Subject: Corpora Digest, Vol 23, Issue 26
 
Today's Topics:

   1.  Extended Deadline Cfp: 11th International Conference	on
      Parsing Technologies (IWPT'09) (DJamé Seddah)
   2.  PASSAGE 2nd French Parsing Evaluation Campaign (Paroubek)
   3.  Analysing Parent-Medical Corpora? (Saad Mahamood)


----------------------------------------------------------------------

Message: 1
Date: Tue, 26 May 2009 17:02:44 +0200
From: DJamé Seddah <djame.seddah at free.fr>
Subject: [Corpora-List] Extended Deadline Cfp: 11th International
	Conference	on Parsing Technologies (IWPT'09)
To: elsnet-list at elsnet.org
Cc: corpora at uib.no

C a l l f o r P a p e r s

IWPT 2009

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
11th International Conference on Parsing Technologies
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Sponsored by ACL/SIGPARSE

October 7 - 9, 2009

Paris, France

~~~
http://alpage.inria.fr/iwpt09
~~~

ACL/SIGPARSE will host the 11th International Conference on Parsing
Technologies (IWPT'09) on October 7 - 9, 2009 in Paris, France, at
the University of Paris 7.

IWPT'09 continues the tradition of biennial workshops on parsing
technology organized by SIGPARSE, the Special Interest Group on
Parsing of the Association for Computational Linguistics (ACL). The
first workshop, in Pittsburgh and Hidden Valley, was followed by
workshops in Cancun (Mexico) in 1991; Tilburg (Netherlands) and Durbuy
(Belgium) in 1993; Prague and Karlovy Vary (Czech Republic) in 1995;
Boston/Cambridge (Massachusetts) in 1997; Trento (Italy) in 2000;
Beijing (China) in 2001; in Nancy (France) in 2003; in Vancouver
(Canada) (2005); and in Prague (Czech Republic) in 2007.


Topics of interest for IWPT'09
------------------------------
Topics of interest for IWPT'09 include, but are not limited to:

theoretical and practical studies of parsing and transduction
algorithms for natural language sentences, texts, fragments,
dialogues, ill-formed sentences, speech input, multi-dimensional
(pictorial) or signed languages, multimedia (web) documents; parsing
issues arising or viewed in a multimodal and multilingual context;
parsing in relation to semantic and pragmatic information and
communication; cognitive aspects of grammar and their impact on
parsing.
Knowledge-based and data-driven approaches are equally welcome.


Time schedule: (full paper deadline has been extended to the 29 May)
--------------
Deadline for full paper submission: 29 May, 2009
Full papers notification of acceptance: 1 July, 2009
Deadline for short paper submission: 8 July, 2009
Short papers notification of acceptance: 15 July, 2009
Final papers due: 15 August, 2009
Conference: 7 - 9 October, 2009


Submitting Papers
-----------------
Two types of submissions are invited:
- full papers, to be presented as such during the conference and to
be published in the conference proceedings (maximally 12 pages);
- short papers, to be presented at the conference in the form of a
poster plus a summary oral presentation in a short-paper session;
four-page summaries will be published in the conference proceedings.

Submissions of either type should describe original work. Full papers
should be concerned with completed work or intermediate results of
ongoing work, rather than with intended work, and should indicate
clearly the state of completion of the reported results. Short papers
may describe planned or starting research or untested ideas.

A paper accepted for presentation at IWPT 2009 cannot be presented or
have been presented at any other meeting with publicly available
published proceedings. Papers that are being submitted to other
conferences or workshops must indicate this on the title page, as must
papers that contain significant overlap with previously published
work.

The reviewing of the papers will be blind. The paper should not
include the authors' names and affiliations. Furthermore,
self-citations and other references (e.g. to projects, corpora, or
software) that could reveal the author's identity should be
avoided. For example, instead of "We previously showed (Smith, 1991)
...", write "Smith previously showed (Smith, 1991) ...".


Instructions for authors
------------------------
Prospective authors are invited to send their submissions
through the electronic submission page of the conference. The only
accepted format for submitted papers is PDF. Full papers should not
exceed 12 pages; short papers should not exceed 8 pages.


Detailed formatting and submission instructions for authors will be
available through the IWPT'09 home page at:

http://alpage.inria.fr/iwpt09


Inquiries:
----------
For all inquiries send mail to eric.de_la_clergerie at inria.fr or to
harry.bunt at uvt.nl


Programme Committee
---------------------------
All submitted papers will be reviewed by (or under the supervision of)
the international IWPT'09 Programme Committee, consisting of the
following members:

Philippe Blache (CNRS/Provence University, Aix-en-Provence, France)
Harry Bunt (TiCC, Tilburg University, Netherlands)
David Chiang(USC/ISI, Marina del Rey, USA)
John Carroll (University of Sussex, Brighton, UK)
Stephen Clarck (Oxford Unniversity, UK)
Eric Villemonte de la Clergerie (INRIA, Rocquencourt, France) (chair)
Jason Eisner (Johns Hopkins University, Baltimore, USA)
James Henderson (University of Edinburgh,UK)
Julia Hockenmaier (University of Pennsylvania, Philadelphia, USA)
Aravind Joshi (University of Pennsylvania, Philadelphia, USA)
Ronald Kaplan (Xerox Palo Alto Research Center, USA)
Martin Kay (Xerox Palo Alto Research Center, USA)
Dan Klein (UC Berkeley, USA)
Sadao Kurohashi (University of Kyoto, Japan)
Alon Lavie (Carnegie-Mellon University, Pittsburgh, USA)
Rob Malouf (San Diego State University, USA)
Yuji Matsumoto (Nara Institute of Science and Technology, Japan)
Bob Moore (Microsoft, Redmond, USA)
Mark-Jan Nederhof (University of St. Andrews, Scotland)
Joakim Nivre (University of Uppsala, Sweden)
Gertjan van Noord (University of Groningen, Netherlands)
Stephan Oepen (University of Oslo, Norway)
Stefan Riezler (Xerox Palo Alto Research Center, USA)
Giorgio Satta (University of Padua, Italy)
Kenji Sagae (Institute for Creative Technologies, Marina del Rey, USA)
Khalil Sima'an (University of Amsterdam, Netherlands)
Hozumi Tanaka (Chukyo University, Japan)
K. Vijay-Shanker (University of Delaware, USA)
Eric Wehrli (LATL, University of Geneva, Switzerland)
David Weir (University of Sussex, Brighton, UK)
Shuly Wintner (University of Haifa, Israel)
Dekai Wu (Hong Kong University of Science and Technology, China)


Organization
-----------------
General Chair: Harry Bunt (Tilburg University, Netherlands)

Program Chair: Eric Villemonte de la Clergerie (INRIA, Rocqencourt,  
France)

Logistic Arrangements Chair: Laurence Danlos (University of Paris 7,  
France)


Further information
---------------------------
Additional information about IWPT'07 is available at the URL:

http://alpage.inria.fr/iwpt09/

At the site http://www.cs.cmu.edu/afs/cs.cmu.edu/project/grasp-1/Grasp/SIGParse/ 
  there is also
information about previous IWPTs, proceedings, books based on IWPTs,
and SIGPARSE related activities.




------------------------------

Message: 2
Date: Tue, 26 May 2009 20:19:10 +0200 (CEST)
From: pap at limsi.fr (Paroubek)
Subject: [Corpora-List] PASSAGE 2nd French Parsing Evaluation Campaign
To: corpora at uib.no


--------------------------------------------------------------------------------
Call for Participation to the Second PASSAGE French Parsing Evaluation 
Campaign
--------------------------------------------------------------------------------

We invite the developers of parsers for French both from academia and
industry to participate to the second PASSAGE evaluation campaign. In
the course of achieving the two main objectives of the PASSAGE
project (http://atoll.inria.fr/passage):

1. building a large sized Treebank for French and making it available
to the community

2. and investigating lexical acquisition for Parser improvement by
parser outputs merging,

we offer to parser developers the opportunity to test and compare
their system on the PASSAGE corpus. Historically this evaluation
campaign comes after the first evaluation campaign of PASSAGE that
took place in 2007 and the EASY evaluation campaign (2006) of EVALDA
project in the TECHNOLANGUE program.

Participation is free (after registering, see below) on a voluntary
basis and gives access to all the resources that PASSAGE has built
(corpora, annotation editors, evaluation toolkits). To have a look at
the PASSAGE annotations, you can test EasyRef
(http://atoll.inria.fr/easyrefpub/), the WEB interactive annotation
editor now freely open to all potential participants.

--------
The data
--------
The corpus that we propose you to parse is a 100 Million words
collection of material freely available from the WEB completed with a
small amount of copyrighted newspaper material. Word and sentence
segmentations are not imposed. The participant data will be mapped
onto the reference data using a dynamic programming algorithm.

---------------------
Syntactic Annotations
---------------------
The syntactic annotations that we used in PASSAGE are derived from the
one used during the EASY campaign. The annotations are described in
various papers
(http://atoll.inria.fr/passage/articles.en.html). Documentation and
software support is available on the PASSAGE site along with EasyRef
(http://atoll.inria.fr/easyrefpub/), an open WEB annotation editor and
an evaluation server enabling an automatic comparison of one's parser
against the PASSAGE development data (app. 85,600 words) issued from
previous PASSAGE campaign and the EASY campaign.

--------
Schedule 
--------
Both the development corpus resulting from the EASY evaluation
campaign and the PASSAGE-2 test corpus of 100 million words are now
available. They will be communicated to the participants as soon as
they are registered. Participants to PASSAGE-2 are expected to return
the PASSAGE-2 corpus completely parsed with the PASSAGE annotations
between August 31 2009 and October 15 2009.

-----------------
Evaluation Tracks
-----------------
PASSAGE-2 will have 2 evaluation tracks:

1. the manual reference track with its gold standard of 400,000 words
that have been hand-annotated

2. the automatic reference track, where the gold standard will be the
results of combining the ouputs of the participating parsers.

For all registered participants, performance results of the first
track will be published with the participant identified, while
performance results of the second track will be published anonymously
because of the exploratory nature of the reference data.

-----------------------------------------
Link with EVALITA dependency parsing task
-----------------------------------------
For exploring possible links with Parsing evaluation for other
languages, PASSAGE-2 campaign has a tiny development and test corpus
shared with the EVALITA (http://evalita.fbk.eu/) campaign on
dependency parsing (http://evalita.fbk.eu/parsing.html) . Aligned data
both in French and Italian have been hand-annotated (200 sentences of
developement and 50 sentences for test) both with PASSAGE annotations
for the French part and the TUT annotations for the Italian one.

----------------------------
Conditions for participation
----------------------------
Registration is now open and necessitates signing a participation
agreement available at ELDA. Participants are required to return the
test corpus parsed according to the schedule above and agree to the
publication of their identified performance results by the PASSAGE-2
organizer. Please contact Olivier Hamon at ELDA for obtaining the
participation agreement.

--------
Contacts
--------
PASSAGE-2 is organized by:

* ILES (Information Langues Écrites et Signées) of LIMSI-CNRS (Patrick
 Paroubek pap at limsi.fr, or Anne Vilnat anne.vilnat at limsi.fr)

* ELDA (Olivier Hamon, hamon at elda.org)

* INRIA-ALPAGE (Eric de la Clergerie, Eric.De_La_Clergerie at inria.fr). 

with the help of all the PASSAGE participants.

PASSAGE (ANR-06-MDCA-013) is funded by ANR
(http://www.agence-nationale-recherche.fr).
--------------------------------------------------------------------------------

-----
Patrick Paroubek / LIR Group / Human-Machine Communication Dept.
LIMSI - CNRS, Batiment 508 Universite Paris XI, BP 133 - 91403 ORSAY Cedex - France
phone: (33) (0)1 69 85 80 04 fax: (33) (0)1 69 85 80 88 email:pap at limsi.fr



------------------------------

Message: 3
Date: Tue, 26 May 2009 19:34:03 +0100
From: Saad Mahamood <saad.mahamood at abdn.ac.uk>
Subject: [Corpora-List] Analysing Parent-Medical Corpora?
To: "corpora at uib.no" <corpora at uib.no>


Dear members,


I'm currently trying to undertake an analysis of medical texts written
for and about the interactions between parents of a premature child in
neonatal care and medical staff (Nurses and Consultants). In
particular, there are four sets of corpora I'm trying to analyse. The
first is text written by medical consultants that is usually written
for the parents about prior verbal consultations that the two parties
have had. The other three corpora sets are that written by nurse about
their interactions  with parents and their observations of the
parents. What I have noticed informally is that the nature of text
written by both consultants and nurses changes if the condition of the
premature in neonatal care deteriorates and/or if the parents are
emotionally distressed. In particular, the most noticeable result is
that the simplification of the language used and the increasing use of
phrases to build reassurance. However, I was wondering what tools are
available for me to observe aspects of this phenomena in a
quantitative manner? I have used readability metrics such as Fog,
Flesch, and Flesch-Kincaid scores. Is there any other metrics that I
should be exploring?

Many Thanks,

Saad Mahamood.

Research Student,
Department of Computing Science,
Room 244, Meston Building,
King's College, University of Aberdeen,
Aberdeen, Scotland, United Kingdom,
AB24 3UE

Tel: +44 (0) 1224 274173
E-mail: saad.mahamood at abdn.ac.uk
Webpage: http://www.csd.abdn.ac.uk/~smahamoo/




The University of Aberdeen is a charity registered in Scotland, No SC013683.



----------------------------------------------------------------------
Send Corpora mailing list submissions to
	corpora at uib.no

To subscribe or unsubscribe via the World Wide Web, visit
	http://mailman.uib.no/listinfo/corpora
or, via email, send a message with subject or body 'help' to
	corpora-request at uib.no

You can reach the person managing the list at
	corpora-owner at uib.no

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Corpora digest..."


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


End of Corpora Digest, Vol 23, Issue 26
***************************************

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20090527/95454d85/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list