NECTE - DECTE corpus (Paris Diderot, 3-4 June 2013)

Nicolas Ballier ballier at UNIV-PARIS13.FR
Fri May 31 06:22:50 UTC 2013


*Corpus interoperability and spoken diachronic databases: the
NECTE-DECTE corpora*

http://www.clillac-arp.univ-paris-diderot.fr/evenements/necte_decte_interop_2013


This two-day workshop presents some 50 years of sociolinguistic surveys
of Geordie encapsulated in the NECTE
<http://research.ncl.ac.uk/necte> and DECTE
<http://research.ncl.ac.uk/decte> corpora. In the wake of Beal, Corrigan
and Moisl's 2007 selection of papers, the two corpora are discussed by
linguists investigating syntactic, prosodic and phonetic features and
questioning the connections between linguistic data, corpus annotation,
linguistic research questions and technological queries.

*Paris Diderot, Olympe de Gouges
<http://www.ums-riate.fr/plan_ODG.html>, 5 rue Thomas Mann, room 153,
3-4 June 2013*
Day 1 (afternoon) presents the corpora and how they have been used so
far. Day 2 (morning) discusses the formats, tools and suggestions for
synchronising the textual, syntactic, phonetic and sound data, and the
possibilities for querying both corpora at the same time.

The workshop is free but limited to 50 participants. Participants are
advised to register individually to have free access to the corpus
beforehand and to download sample files from the NECTE website
<http://research.ncl.ac.uk/decte/corpus.htm>.

*DAY 1: DATA COLLECTION AND INITIAL EXPLOITATION OF THE CORPORA*

  *
    *1400* N. Ballier (Paris Diderot): *Introduction*
  *
    *1405* K. CORRIGAN (Newcastle): *From NECTE to DECTE*

Details the data collection and objectives of NECTE/DECTE.

  *
    *1430* J. BEAL (Sheffield): *The Diachrony of the NECTE corpus*

First-hand experience of the data collection for the corpus; possible
insights from field data and data collection.

  *
    *1500* Hermann MOISL (Newcastle): *Doing sociophonetics by numbers*

Working with the numbers of the phonetic transcriptions (explains how
sociophonetic variation can be captured by numbers instead of IPA
symbols)

  *
    *1530-1535* Philippe MARTIN (Paris Diderot): *Associating the NECTE
    phonetic digits to IPA transcriptions using WinPitch*
  *
    question time
  *
    coffee break
  *
    *1600-1630* Christophe PARISSE (Paris 10, MODYCO): *Converting the
    NECTE files into CLAN readable format*

(this session details the algorithm used to convert the XML files into
Praat-like files to ensure interoperability of tools)

  *
    *1630-1645* questions
  *
    *1645-1700* Nicolas Ballier (Paris Diderot): *Using the NECTE corpus
    for the investigation of prosody and syntax*

(demo of sample solutions to queries involving Praat-like files)

  *
    *1700-1715* Philippe MARTIN (Paris Diderot): *Using WinPitch as a
    multifile concordancer for the NECTE corpus*

(demo of the latest version of the WinPitch software, which has a
dedicated feature for querying NECTE files)

  *
    questions
  *
    *1715-1745* Esther LE GREZAUSE (Paris Diderot/UW): *Analysing SO with
    a subset of the NECTE corpus*
  *
    *1745* questions
  *
    *1800* end


*TUESDAY 4TH (DAY 2): CORPUS INTEROPERABILITY*

  *
    XML session
  *
    *0900-0930* Hermann Moisl (Newcastle): *Corpus alignment and TEI
    conventions*

Hermann Moisl (Newcastle): how I aligned sound and text and used the
TEI to mark up the alignment

Discussants: Nicolas Ballier, Philippe Martin, Christophe Parisse
(things we found strange in the XML annotation, bug reports and
suggestions to improve the XML annotation of the corpus)

  *
    *1030* coffee break

  *
    *1100* ROUND TABLE: TEI, XML and time alignment of corpora

(round table: Philippe Martin, Christophe Parisse)

  *
    Exploring spoken corpora with text grammar in mind: experimenting
    with the DECTE corpus in Xaira
  *
    Nicolas Ballier: DECTE-NECTE FOR CORPUS PROSODY (a comparison with
    AIX-MARSEC)
  *
    Some XML recommendations for corpus prosody?
  *
    Conclusion: what is a spoken database?

  *
    *1145* K. CORRIGAN (Newcastle): Next steps and future plans

(plans for pedagogical uses of the DECTE corpus, applications)

  *
    *1200* Concluding remarks
