[Corpora-List] New from LDC

Linguistic Data Consortium ldc at ldc.upenn.edu
Thu May 26 19:47:23 UTC 2011


/New Publications:/

LDC2011S01*
**- 2005 NIST Speaker Recognition Evaluation Training Data <#sre>**  -
****
*LDC2011V03*
- NIST/USF Evaluation Resources for the VACE Program - Meeting Data Test 
Set Part 3 <#vace>* *-***

------------------------------------------------------------------------

*New Publications*

(1) 2005 NIST Speaker Recognition Evaluation Training Data 
<http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2011S01>was 
developed at LDC and NIST (National Insitute of Standards and 
Technology). It consists of 392 hours of conversational telephone speech 
in English, Arabic, Mandarin Chinese, Russian and Spanish and associated 
English transcripts used as training data in the NIST-sponsored 2005 
Speaker Recognition Evaluation 
<http://www.itl.nist.gov/iad/mig/tests/spk/2005/index.html>(SRE). The 
ongoing series of SRE yearly evaluations conducted by NIST are intended 
to be of interest to researchers working on the general problem of text 
independent speaker recognition. To that end the evaluations are 
designed to be simple, to focus on core technology issues, to be fully 
supported and to be accessible to those wishing to participate.

The task of the 2005 SRE evaluation was speaker detection, that is, to 
determine whether a specified speaker is speaking during a given segment 
of conversational speech. The task was divided into 20 distinct and 
separate tests involving one of five training conditions and one of four 
test conditions.

The speech data consists of conversational telephone speech with 
"multi-channel" data collected simultaneously from a number of auxiliary 
microphones. The files are organized into two segments: 10 second 
two-channel excerpts (continuous segments from single conversations that 
are estimated to contain approximately 10 seconds of actual speech in 
the channel of interest) and 5 minute two-channel conversations.

The speech files are stored as 8-bit u-law speech signals in separate 
SPHERE files. In addition to the standard header fields, the SPHERE 
header for each file contains some auxiliary information that includes 
the language of the conversation and whether the data was recorded over 
a telephone line.

English language word transcripts in .cmt format were produced using an 
automatic speech recognition system (ASR) and contain error rates in the 
range of 15-30%.


***

(2) NIST/USF Evaluation Resources for the VACE Program - Meeting Data 
Test Set Part 3 
<http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2011V03>, Linguistic 
Data Consortium (LDC) catalog number LDC2011V03 and isbn 1-58563-579-0, 
was developed by researchers at the Department of Computer Science and 
Engineering <http://www.cse.usf.edu/>, University of South Florida 
(USF), Tampa, Florida and the Multimodal Information Group 
<http://nist.gov/itl/iad/mig/>at the National Institute of Standards and 
Technology (NIST). It contains approximately eleven hours of meeting 
room video data collected in 2001 and 2002 at NIST's Meeting Data 
Collection Laboratory and annotated for the VACE (Video Analysis and 
Content Extraction) 2005 face, person and hand detection and tracking tasks.

The VACE program was established to develop novel algorithms for 
automatic video content extraction, multi-modal fusion, and event 
understanding. During VACE Phases I and II, the program made significant 
progress in the automated detection and tracking of moving objects 
including faces, hands, people, vehicles and text in four primary video 
domains: broadcast news, meetings, street surveillance, and unmanned 
aerial vehicle motion imagery. Initial results were also obtained on 
automatic analysis of human activities and understanding of video 
sequences.

Three performance evaluations were conducted under the auspices of the 
VACE program between 2004 and 2007.  The 2005 evaluation was 
administered by USF in collaboration with NIST and guided by an advisory 
forum including the evaluation participants. A summary of results of the 
evaluation can be found in the 2005 VACE results and analysis paper 
<https://secure.ldc.upenn.edu/intranet/docs/VACE2005_report.pdf>included 
in this release.

NIST's Meeting Data Collection Laboratory is designed to collect corpora 
to support research, development and evaluation in meeting recognition 
technologies. It is equipped to look and sound like a conventional 
meeting space. The data collection facility includes five Sony EV1-D30 
video cameras, four of which have stationary views of a center 
conference table (one view from each surrounding wall) with a fixed 
focus and viewing angle, and an additional "floating" camera which is 
used to focus on particular participants, whiteboard or conference table 
depending on the meeting forum. The data is captured in a NIST-internal 
file format. The video data was extracted from the NIST format and 
encoded using the MPEG-2 standard in NTSC format. Further information 
concerning the video data parameters can found in the documentation 
included with this corpus.


------------------------------------------------------------------------

Ilya Ahtaridis
Membership Coordinator
--------------------------------------------------------------------
Linguistic Data Consortium                  Phone: 1 (215) 573-1275
University of Pennsylvania                    Fax: 1 (215) 573-2175
3600 Market St., Suite 810ldc at ldc.upenn.edu
Philadelphia, PA 19104 USAhttp://www.ldc.upenn.edu



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20110526/ab156ef1/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list