[Corpora-List] Audiovisual corpora: Summary of responses

Paul Thompson p.a.thompson at reading.ac.uk
Mon Dec 15 15:28:07 UTC 2008


A couple of weeks ago, I posted a message to the list asking for 
information on audiovisual corpus projects. I'd like to express my 
thanks to all those who responded to my request, their names appear 
below, and here is a summary of the projects (and some tools) that were 
mentioned:

1 The AMI project http://corpus.amiproject.org/ and the NITE toolkit for 
linking A-V materials to the transcripts, http://www.ltg.ed.ac.uk/NITE/

2 Spoken Chinese Corpus of Situated Discourse collected in Beijing by 
Yueguo Gu under the auspices of the Chinese Academy of Social Science 
(SCCSD BJ-500 for short): 
http://ling.cass.cn/dangdai/gu_papers/sampling%20situated%20discourse.pdf

3 The ScoRE corpus in Singapore:
http://score.crpp.nie.edu.sg/score/index.htm

4 BSL Corpus project:
http://www.bslcorpusproject.org/

5 Carter & Adolphs’s (2008) Headtalk project: 
http://www.nottingham.ac.uk/english/research/cral/doku.php?id=projects:headtalk
and http://www.ncess.ac.uk/research/sgp/headtalk/

6 The Sacodeyl project (European Youth Language) http://www.um.es/sacodeyl/

7 EXMARaLDA, a system for creating, managing and analysing spoken 
language corpora:
http://www.exmaralda.org/en_index.html
On the website, there's a small demo corpus: 
http://www.exmaralda.org/en_demokorpus.html

8 The Scottish Corpus of Texts & Speech (entirely web accessible):
http://www.scottishcorpus.ac.uk/corpus/search/document.php?documentid=804 
(example transcript with video linking)

9 The proceedings of the one-day Workshop on Multimodal Corpora held at 
LREC in Marrakech in May 2008: 
http://www.lrec-conf.org/proceedings/lrec2008/workshops/W8_Proceedings.pdf

10 COSMOROE: a cross-media relations framework for modelling multimedia 
dialectics http://www.citeulike.org/article/3643976

11 The Human Speechome Project at MIT:
http://www.media.mit.edu/cogmac/projects/hsp.html

Includes a video showing a child saying the word "ball" at different 
moments over several months:
http://www.media.mit.edu/cogmac/videos/blue_ball_low.mov

12 The CHIL project 'Computers in the Human Interaction Loop'. 
http://chil.server.de/servlet/is/101/
A journal article on this topic can be found at:
http://www.springerlink.com/content/70h381g7qv721547/

13 Tools for linking video and transcript (Hiroaki Sato)
http://sato.fm.senshu-u.ac.jp/_web/corpus/21DVDhtaKWIC/4satoF/4readMe.html
http://sato.fm.senshu-u.ac.jp/_web/corpus/21DVDhtaKWIC/xdemo.qt

14 The book 'Multimodal transcription and text analysis' by Baldry & 
Thilbault (2006).

Thanks to the following for the information:

J. L. De Lucca, Hong Huaqing, Djamel Mostefa, Stefanie Tellex, Stephen 
Lewis, Meladel Mistica, Dave Beavan, Katerina Pastra, Martin Tietze, 
Thomas Schmidt, Trevor Jenkins, Christopher Brewster, Hiroaki Sato, 
Geoff Leech, Chris Ruehlemann, John Niekrasz, Martin Thomas, John Corbett

Paul

-- 
********************************************************
Paul Thompson
School Director of Postgraduate Studies
Department of Applied Linguistics
School of Languages and European Studies
PO Box 241
University of Reading
Reading RG6 6AA, UK
Phone: +44 118 3786472
URL: www.reading.ac.uk/internal/appling/thompson.htm

Hon. Secretary of the British Association for Applied Linguistics (BAAL)
URL: www.baal.org.uk
Convenor, BAAL Corpus Linguistics SIG
URL: http://corpus-sig-baal.org.uk/
Editorial Board member, Journal of English for Academic Purposes
URL: www.elsevier.com/locate/jeap
********************************************************


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list