[Corpora-List] English-language paraphrase corpora
Dragomir Radev
radev at umich.edu
Tue Feb 1 13:39:10 UTC 2005
Our system, a precursor to Google News is also active on the Web:
www.newsinessence.com
Using it, we have collected 50,000 or so clusters of related news.
--
Drago
nielsen at dcs.kcl.ac.uk wrote:
>
>
> If you don't mind collecting raw text, news.google.com does this.
>
> Leif
>
> >
> > Dear All,
> >
> > I am looking for English-language "comparable" corpora. I.e. I want,
> > e.g., 2 collections of articles from different sources describing same
> > events.
> >
> > Alternatively, would anyone know off-hand how one would go about
> > constructing such comparable collections?
> >
> > (This is to be used for automatic paraphrasing.)
> >
> > Any pointers greatly appreciated,
> >
> > Olga
> > University of Sussex NLP group
> >
> >
> >
> >
> >
> >
> >
>
>
>
>
>
--
Dragomir R. Radev radev at umich.edu
Assistant Professor of Information, Electrical Engineering and
Computer Science, and Linguistics, the University of Michigan, Ann Arbor
Phone: 734-615-5225 Fax: 734-764-2475 http://www.si.umich.edu/~radev
More information about the Corpora
mailing list