Corpora: Reuters Corpus

Tony Rose Tony.Rose at reuters.com
Tue Aug 14 10:48:08 UTC 2001


Reuters, the global information, news and technology group, is for the first time making available free of charge, large quantities of archived Reuters news stories for use by research communities around the world. The first Reuters Corpus archive includes over 800,000 English language news stories, equivalent to the annual global news output of Reuters. All the news stories are fully referenced using a total of 775 different category codes for topic, geography and industry sector.

Although this Corpus has been available for some time, it has not yet been widely publicised. We are now happy to distribute it more widely within the research community. Further details can be found at:

http://about.reuters.com/researchandstandards/corpus/

For discussion and queries regarding this corpus and future Reuters releases, please refer to the ReutersCorpora mailing list, which can be found at:

http://groups.yahoo.com/group/ReutersCorpora

Best wishes,
Tony
==========
Dr TG Rose
Leader of Language Technology
Reuters Limited, 85 Fleet Street, London EC4P 4AJ
Email: Tony.Rose at reuters.com




-----------------------------------------------------------------
        Visit our Internet site at http://www.reuters.com

Any views expressed in this message are those of  the  individual
sender,  except  where  the sender specifically states them to be
the views of Reuters Ltd.



More information about the Corpora mailing list