<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=ISO-8859-1">
<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT color=#0000ff face=Tahoma size=2><SPAN class=279283217-17012000>LDC
does have transcribed broadcast news. See <A
href="http://morph.ldc.upenn.edu/Catalog/by_type.html">http://morph.ldc.upenn.edu/Catalog/by_type.html</A>
under the heading Broadcast text . You'll see the following:</SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Tahoma size=2><SPAN
class=279283217-17012000></SPAN></FONT> </DIV><FONT color=#0000ff
size=2><SPAN class=279283217-17012000>
<TABLE border=0 cellPadding=4 cellSpacing=0 width="100%">
<TBODY>
<TR>
<TD align=middle colSpan=2><BR><A name=text.broadcast>
<CENTER><FONT face=Tahoma><FONT size=+1>Broadcast text</FONT><BR><FONT
size=-1>[<A
href="http://morph.ldc.upenn.edu/Catalog/by_type.html#text">text</A>]</FONT></FONT></CENTER>
<TR>
<TD align=right vAlign=top width="25%"><A
href="http://morph.ldc.upenn.edu/Catalog/LDC98T31.html"><FONT
face=Tahoma>LDC98T31</FONT></A></TD>
<TD align=left vAlign=top width="75%"><FONT face=Tahoma>1996 CSR Hub-4
Language Model</FONT></TD>
<TR>
<TD align=right vAlign=top width="25%"><A
href="http://morph.ldc.upenn.edu/Catalog/LDC97T22.html"><FONT
face=Tahoma>LDC97T22</FONT></A></TD>
<TD align=left vAlign=top width="75%"><FONT face=Tahoma>1996 English
Broadcast News Transcripts (Hub-4)</FONT></TD>
<TR>
<TD align=right vAlign=top width="25%"><A
href="http://morph.ldc.upenn.edu/Catalog/LDC98T28.html"><FONT
face=Tahoma>LDC98T28</FONT></A></TD>
<TD align=left vAlign=top width="75%"><FONT face=Tahoma>1997 English
Broadcast News Transcripts (Hub-4)</FONT></TD>
<TR>
<TD align=right vAlign=top width="25%"><A
href="http://morph.ldc.upenn.edu/Catalog/LDC98T24.html"><FONT
face=Tahoma>LDC98T24</FONT></A></TD>
<TD align=left vAlign=top width="75%"><FONT face=Tahoma>1997 Mandarin
Broadcast News Transcripts (Hub-4NE)</FONT></TD>
<TR>
<TD align=right vAlign=top width="25%"><A
href="http://morph.ldc.upenn.edu/Catalog/LDC98T29.html"><FONT
face=Tahoma>LDC98T29</FONT></A></TD>
<TD align=left vAlign=top width="75%"><FONT face=Tahoma>1997 Spanish
Broadcast News Transcripts (Hub-4NE)</FONT></TD>
<TR>
<TD align=right vAlign=top width="25%"><A
href="http://morph.ldc.upenn.edu/Catalog/LDC99T36.html"><FONT
face=Tahoma>LDC99T36</FONT></A></TD>
<TD align=left vAlign=top width="75%"><FONT face=Tahoma>USC Marketplace
Broadcast News Transcripts</FONT></TD></TR></TBODY></TABLE>
<DIV> </DIV>
<DIV><FONT face=Tahoma><SPAN class=279283217-17012000>However, access to these
collections may require you to be a member. I'm cc'ing LDC on this, hopefully
they'll get back to you directly.</SPAN></FONT></DIV>
<DIV><FONT face=Tahoma><SPAN class=279283217-17012000></SPAN></FONT> </DIV>
<DIV><FONT face=Tahoma><SPAN
class=279283217-17012000>Regards,</SPAN></FONT></DIV>
<DIV><FONT face=Tahoma><SPAN class=279283217-17012000></SPAN></FONT> </DIV>
<DIV><FONT face=Tahoma><SPAN class=279283217-17012000> -- Raman
Chandrasekar</SPAN></FONT></DIV>
<DIV></SPAN></FONT> </DIV>
<BLOCKQUOTE>
<DIV align=left class=OutlookMessageHeader dir=ltr><FONT face=Tahoma
size=2>-----Original Message-----<BR><B>From:</B> Mirjam Sepesy Maucec
[mailto:mirjam.sepesy@uni-mb.si]<BR><B>Sent:</B> Sunday, January 16, 2000
10:41 PM<BR><B>To:</B> corpora@hd.uib.no<BR><B>Subject:</B> Corpora: Broadcast
corpus<BR><BR></DIV></FONT>Hi,
<P>my research topic is domain based adaptation of language model. For
my work I hardly need a text corpus <BR>with topic tags. <BR>Broadcast corpus
seems to be appropriate. Where can I get it? I don't find it in LDC
catalog. I also write 2 <BR>e-mails to Primary Source Media to get some
information and I got no answer. <BR>Please, help!
<P>Mirjam <PRE>--
_____________________________________________________________
Mirjam Sepesy Maucec
Faculty of Electrical Engineering and Computer Science
University of Maribor
Smetanova 17
2000 MARIBOR
tel: ++386 (062) 220 7225
e-mail: mirjam.sepesy@uni-mb.si</PRE> </BLOCKQUOTE></BODY></HTML>