[Corpora-List] The Rhapsodie Corpus

Paola Pietrandrea paolapietrandrea at gmail.com
Tue Oct 1 09:35:31 UTC 2013


*Rhapsodie: a Prosodic and Syntactic Treebank for Spoken French*****

We are pleased to announce that *Rhapsodie,* a syntactic and prosodic
treebank of spoken French created with the aim of modeling the interface
between prosody, syntax and discourse in spoken French is now available
at   http://www.projet-rhapsodie.fr/****

The Rhapsodie treebank is made up of 57 short samples of spoken French (5
minutes long on average, amounting to 3 hours of speech and a 33 000 word
corpus) endowed with an orthographical phoneme-aligned transcription . ****

The corpus is representative of different genres (private and public
speech; monologues and dialogues; face-to-face interviews and broadcasts;
more or less interactive discourse; descriptive, argumentative and
procedural samples, variations in planning type).****

The corpus samples have been mainly drawn from existing corpora of spoken
French and partially created within the frame of the*Rhapsodie* project. We
would especially like to thank the coordinators of the
CFPP2000,<http://cfpp2000.univ-paris3.fr/>
 PFC <http://www.projet-pfc.net/>, ESLO <http://www.univ-orleans.fr/eslo/>,
C-Prom <https://sites.google.com/site/corpusprom/> projects as well as Piet
Mertens, Mathieu Avanzi, Anne Lacheret and Nicolas Obin.****

The sound samples (waves, MP3, cleaned and stylized pitch), the
orthographic transcriptions (txt), the macrosyntactic annotations (txt),
the prosodic annotations  (xml, textgrid) as well as the metadata (xml and
html) can be freely downloaded under the terms of the Creative Commons
licence Attribution - Noncommercial - Share Alike 3.0 France.****

Microsyntactic annotation will be available soon.****

The metadata are  searchable on line through a browser.****

The prosodic annotation can be explored on line through the Rhapsodie Query
Language.****

The tutorials of transcription, annotations and Rhapsodie Query Language
are available on the site.****

 ****

*The Rhapsodie team (Modyco, Université Paris Ouest Nanterre** :*

Sylvain Kahane, Anne Lacheret, Paola Pietrandrea, Atanas Tchobanov, Arthur
Truong.****

*Partners*: IRCAM (Paris), LATTICE (Paris), LPL (Aix-en-Provence),CLLE-ERSS
(Toulouse).****

**
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20131001/c6966de6/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list