[Corpora-List] Deadline extension 28 May: SIGIR 2007 workshop on Searching Spontaneous Conversational Speech

Djoerd Hiemstra sigir07 at cs.utwente.nl
Wed May 16 13:43:05 UTC 2007


*******************************************************
2nd Call for papers: Deadline extension to 28 May 2007

Searching Spontaneous Conversational Speech

ACM SIGIR 2007 Workshop - 27 July 2007

http://hmi.ewi.utwente.nl/sscs
*******************************************************

Background

Nearly a decade ago, we learned from the TREC Spoken Document Retrieval
(SDR) track that searching speech was a "solved problem." Three factors 
were key to this success: (1) broadcast news has a "story" structure that
resembled written documents, (2) the redundancy present in human language
meant that search effectiveness held up well over a reasonable range of
transcription accuracy, and (3) sufficiently accurate Large-Vocabulary
Continuous Speech Recognition (LVCSR) systems could be built for the 
planned speech of news announcers.

The long-term trend in speech recognition research has been toward
transcription of progressively more challenging sources. Over the last few
years, LVCSR for spontaneous conversational speech has improved to a degree
where transcription accuracy comparable to what was previously found to be
effective for broadcast news can now be achieved for a diverse range of
sources. This has inspired a renaissance in research on search and browse
technology for spoken word collections in communities focused on: (1)
archived cultural heritage materials (e.g., interviews and parliamentary
debates), (2) discussion venues (e.g., business meetings and classroom
instruction), and (3) broadcast conversations (e.g., in-studio talk shows
and call-in programs). Test collections are being developed in individual
projects around the world, and some comparative evaluation activity for
speech search technology has developed over this period. The time seems now
right to look more broadly across these research communities for potential
synergies that can help to shape the information retrieval research agenda
of each of these communities by sharing ideas and resources.

Context

This workshop is part of ACM SIGIR 2007, 23-27 July, Amsterdam, The
Netherlands (http://www.sigir2007.org/).

Workshop Organization

Franciska de Jong, University of Twente, The Netherlands
Douglas Oard, University of Maryland, USA
Roeland Ordelman, University of Twente, The Netherlands
Stephan Raaijmakers, TNO ICT, The Netherlands

Format

We plan to organize the workshop as a mix of oral presentations, panel
discussions and a poster session. Workshop Proceedings will be available at
the workshop. Possibilities for a special journal issue with a selection of
workshop contributions are under negotiation.

Workshop Topics

We welcome contributions on a range of cross-cutting issues, including:

* Segmentation (e.g., speaker turns, topic shifts)
* Content characterization (e.g., LVCSR, word lattice search, spoken term
  detection on phone lattice)
* Classification (e.g., speaker, topic, decision, non-speech acoustic event)
* Exploiting multimodality (integrating features from associated non-speech
  content)
* Search effectiveness (e.g., evidence combination, expansion)
* Interaction design (e.g., query formulation, result presentation, search
  strategies)
* Evaluation (content sources, measures, test collection design, user study
  design)
* Broader issues (applications, intellectual property, privacy)

Submission Types

Two types of submissions are invited: research papers for oral or poster
presentation, and position papers for the selection of discussants and
panelists.

Submission Guidelines

Information on how to submit can be found in the submission guidelines
(http://hmi.ewi.utwente.nl/sscs/submissions).

Important Dates

Call for papers:         April 18, 2007
Papers due:              May 28, 2007
Acceptance notification: June 13, 2007
Final versions due:      July 1, 2007

Program Committee

Samy Bengio (Google)
Laurence Devillers (LIMSI)
Sadaoki Furui (TITECH)
Marcello Federico (FBK-IRST)
Jon Fiscus (NIST)
John Garofolo (NIST)
Sam Gustman (USC)
Thomas Hain (Sheffield)
John Hansen (UT Dallas)
Alex Hauptmann (CMU)
Julia Hirschberg (Columbia)
Diana Inkpen (Ottawa)
Gareth Jones (DCU)
David van Leeuwen (TNO)
Lori Lamel (LIMSI)
Christian Mueller (ICSI)
Steve Renals (Edinburgh)
Salim Roukos (IBM Research)
Liz Shriberg (SRI and ICSI)



More information about the Corpora mailing list