Corpora: Broadcast corpus

Christopher Cieri ccieri at ldc.upenn.edu
Mon Jan 17 18:00:23 UTC 2000


Professor Chandrasekar,

Thanks for your post. Someone from LDC did write to Professor Maucec
directly. We probably should have copied the whole list since the
information may be of interest to others. We will do that presently.

Thanks and best wishes,
Chris

Raman Chandrasekar wrote:

>  LDC does have transcribed broadcast news. See
> http://morph.ldc.upenn.edu/Catalog/by_type.html  under the heading
> Broadcast text . You'll see the following:
>

                              Broadcast text
                                   [text]
           LDC98T31 1996 CSR Hub-4 Language Model
           LDC97T22 1996 English Broadcast News Transcripts (Hub-4)
           LDC98T28 1997 English Broadcast News Transcripts (Hub-4)
           LDC98T24 1997 Mandarin Broadcast News Transcripts (Hub-4NE)
           LDC98T29 1997 Spanish Broadcast News Transcripts (Hub-4NE)
           LDC99T36 USC Marketplace Broadcast News Transcripts
>  However, access to these collections may require you to be a member.
> I'm cc'ing LDC on this, hopefully they'll get back to you
> directly.Regards,   -- Raman Chandrasekar
>
>      -----Original Message-----
>      From: Mirjam Sepesy Maucec [mailto:mirjam.sepesy at uni-mb.si]
>      Sent: Sunday, January 16, 2000 10:41 PM
>      To: corpora at hd.uib.no
>      Subject: Corpora: Broadcast corpus
>
>      Hi,
>
>      my research topic is domain based  adaptation of language
>      model. For my work I hardly need a text corpus
>      with topic tags.
>      Broadcast corpus seems to be appropriate. Where can I get
>      it?  I don't find it in LDC catalog. I also write 2
>      e-mails to Primary Source Media to get some information and
>      I got no answer.
>      Please, help!
>
>      Mirjam
>
>      --
>      _____________________________________________________________
>
>      Mirjam Sepesy Maucec
>      Faculty of Electrical Engineering and Computer Science
>      University of Maribor
>      Smetanova 17
>      2000 MARIBOR
>      tel: ++386 (062) 220 7225
>      e-mail: mirjam.sepesy at uni-mb.si
>
>
>
--
Christopher Cieri
Executive Director, Linguistic Data Consortium
3615 Market Street, Philadelphia, PA 19104-2608 USA
phone: 215-573-5489, fax: 215-573-2175
mailto:Christopher.Cieri at ldc.upenn.edu
http://www.ldc.upenn.edu



More information about the Corpora mailing list