Corpora: Broadcast corpus
Christopher Cieri
ccieri at ldc.upenn.edu
Mon Jan 17 17:57:12 UTC 2000
Professor Chandrasekar,
Thanks for your e-mail. Someone from LDC did respond to Professor Maucec
directly. We will summarize for the list presently.
Thanks and best wishes,
Chris
Raman Chandrasekar wrote:
> LDC does have transcribed broadcast news. See
> http://morph.ldc.upenn.edu/Catalog/by_type.html under the heading
> Broadcast text . You'll see the following:
>
Broadcast text
[text]
LDC98T31 1996 CSR Hub-4 Language Model
LDC97T22 1996 English Broadcast News Transcripts (Hub-4)
LDC98T28 1997 English Broadcast News Transcripts (Hub-4)
LDC98T24 1997 Mandarin Broadcast News Transcripts (Hub-4NE)
LDC98T29 1997 Spanish Broadcast News Transcripts (Hub-4NE)
LDC99T36 USC Marketplace Broadcast News Transcripts
> However, access to these collections may require you to be a member.
> I'm cc'ing LDC on this, hopefully they'll get back to you
> directly.Regards, -- Raman Chandrasekar
>
> -----Original Message-----
> From: Mirjam Sepesy Maucec [mailto:mirjam.sepesy at uni-mb.si]
> Sent: Sunday, January 16, 2000 10:41 PM
> To: corpora at hd.uib.no
> Subject: Corpora: Broadcast corpus
>
> Hi,
>
> my research topic is domain based adaptation of language
> model. For my work I hardly need a text corpus
> with topic tags.
> Broadcast corpus seems to be appropriate. Where can I get
> it? I don't find it in LDC catalog. I also write 2
> e-mails to Primary Source Media to get some information and
> I got no answer.
> Please, help!
>
> Mirjam
>
> --
> _____________________________________________________________
>
> Mirjam Sepesy Maucec
> Faculty of Electrical Engineering and Computer Science
> University of Maribor
> Smetanova 17
> 2000 MARIBOR
> tel: ++386 (062) 220 7225
> e-mail: mirjam.sepesy at uni-mb.si
>
>
>
--
Christopher Cieri
Executive Director, Linguistic Data Consortium
3615 Market Street, Philadelphia, PA 19104-2608 USA
phone: 215-573-5489, fax: 215-573-2175
mailto:Christopher.Cieri at ldc.upenn.edu
http://www.ldc.upenn.edu
More information about the Corpora
mailing list