[Corpora-List] English-language paraphrase corpora

Dragomir Radev radev at umich.edu
Tue Feb 1 13:39:10 UTC 2005


Our system, a precursor to Google News, is also active on the Web:

www.newsinessence.com

Using it, we have collected 50,000 or so clusters of related news.

--
Drago


nielsen at dcs.kcl.ac.uk wrote:
>
>
> If you don't mind collecting raw text, news.google.com does this.
>
> Leif
>
> >
> > Dear All,
> >
> > I am looking for English-language "comparable" corpora. I.e. I want,
> > e.g., 2 collections of articles from different sources describing the
> > same events.
> >
> > Alternatively, would anyone know off-hand how one would go about
> > constructing such comparable collections?
> >
> > (This is to be used for automatic paraphrasing.)
> >
> > Any pointers greatly appreciated,
> >
> > Olga
> > University of Sussex NLP group


--
Dragomir R. Radev                                         radev at umich.edu
Assistant Professor of Information, Electrical Engineering and
Computer Science, and Linguistics, the University of Michigan, Ann Arbor
Phone: 734-615-5225   Fax: 734-764-2475    http://www.si.umich.edu/~radev


