[Corpora-List] Parallel corpora of data and text

Anja Belz a.s.belz at itri.brighton.ac.uk
Tue Mar 28 09:33:23 UTC 2006


Dear list members,

Does anyone out there have, or know of, any data resources where some form
of nonverbal data is paired up with texts (or with pieces of transcribed
spoken language)?

I'm particularly interested in the kind of parallel input/output corpora
that would be useful for applied NLG, e.g.

content representations // realisations
weather data // weather forecasts
database entries // item descriptions
set of coordinates // route descriptions
abstract plans // cooking recipes
etc.

But I'd also like to hear about any other corpora of data/text pairs.

I plan to put together an annotated list of such resources for
NLG researchers.

Many thanks in advance,

Anja

++++++++++++++++++++++++++++++++++++++++++++++++++
+ INLG'06 Special Session on Data Sharing and
  Comparative Evaluation
  www.ict.csiro.au/inlg2006/special_session.htm
+ EPSRC Studentship at NLTG, Brighton University
  www.nltg.brighton.ac.uk/nltg/studentship2006.html
++++++++++++++++++++++++++++++++++++++++++++++++++

--------------------------------------------------
Dr Anja Belz
Senior Lecturer
Natural Language Technology Group
CMIS, University of Brighton
Lewes Road, Brighton BN2 4GJ, UK
Web: http://www.brighton.ac.uk/nltg/home/Anja.Belz
Tel: +44 (0)1273 642909
Fax: +44 (0)1273 642908
--------------------------------------------------


