Corpora: Broadcast corpus
    Christopher Cieri 
    ccieri at ldc.upenn.edu
       
    Mon Jan 17 17:57:12 UTC 2000
    
    
  
Professor Chandrasekar,
Thanks for your e-mail. Someone from LDC did respond to Professor Maucec
directly. We will summarize for the list presently.
Thanks and best wishes,
Chris
Raman Chandrasekar wrote:
>  LDC does have transcribed broadcast news. See
> http://morph.ldc.upenn.edu/Catalog/by_type.html  under the heading
> Broadcast text . You'll see the following:
>
                              Broadcast text
                                   [text]
           LDC98T31 1996 CSR Hub-4 Language Model
           LDC97T22 1996 English Broadcast News Transcripts (Hub-4)
           LDC98T28 1997 English Broadcast News Transcripts (Hub-4)
           LDC98T24 1997 Mandarin Broadcast News Transcripts (Hub-4NE)
           LDC98T29 1997 Spanish Broadcast News Transcripts (Hub-4NE)
           LDC99T36 USC Marketplace Broadcast News Transcripts
>  However, access to these collections may require you to be a member.
> I'm cc'ing LDC on this, hopefully they'll get back to you
> directly.Regards,   -- Raman Chandrasekar
>
>      -----Original Message-----
>      From: Mirjam Sepesy Maucec [mailto:mirjam.sepesy at uni-mb.si]
>      Sent: Sunday, January 16, 2000 10:41 PM
>      To: corpora at hd.uib.no
>      Subject: Corpora: Broadcast corpus
>
>      Hi,
>
>      my research topic is domain based  adaptation of language
>      model. For my work I hardly need a text corpus
>      with topic tags.
>      Broadcast corpus seems to be appropriate. Where can I get
>      it?  I don't find it in LDC catalog. I also write 2
>      e-mails to Primary Source Media to get some information and
>      I got no answer.
>      Please, help!
>
>      Mirjam
>
>      --
>      _____________________________________________________________
>
>      Mirjam Sepesy Maucec
>      Faculty of Electrical Engineering and Computer Science
>      University of Maribor
>      Smetanova 17
>      2000 MARIBOR
>      tel: ++386 (062) 220 7225
>      e-mail: mirjam.sepesy at uni-mb.si
>
>
>
--
Christopher Cieri
Executive Director, Linguistic Data Consortium
3615 Market Street, Philadelphia, PA 19104-2608 USA
phone: 215-573-5489, fax: 215-573-2175
mailto:Christopher.Cieri at ldc.upenn.edu
http://www.ldc.upenn.edu
    
    
More information about the Corpora
mailing list