FW: Reuters Corpus

Mon Aug 20 07:10:25 UTC 2001

-----Original Message-----
From: The LINGUIST Discussion List
[mailto:LINGUIST at LISTSERV.LINGUISTLIST.ORG]On Behalf Of The LINGUIST
List
Sent: Thursday, August 16, 2001 5:53 PM
To: LINGUIST at LISTSERV.LINGUISTLIST.ORG
Subject: 12.2055, FYI: ELRA, Disappearing accents, Reuters Corpus

Editor for this issue: Jody Huellmantel <jody at linguistlist.org>

-------------------------------- Message
3 -------------------------------

Date:  Tue, 14 Aug 2001 11:48:24 +0100
From:  Tony.Rose at reuters.com
Subject:  Reuters Corpus

Reuters, the global information, news and technology group, is for
the
first time making available free of charge, large quantities of
archived Reuters news stories for use by research communities around
the world. The first Reuters Corpus archive includes over 800,000
English language news stories, equivalent to the annual global news
output of Reuters. All the news stories are fully referenced using a
total of 775 different category codes for topic, geography and
industry sector.

Although this Corpus has been available for some time, it has not
yet
been widely publicised. We are now happy to distribute it more
widely
within the research community. Further details can be found at:

http://about.reuters.com/researchandstandards/corpus/

For discussion and queries regarding this corpus and future Reuters
releases, please refer to the ReutersCorpora mailing list, which can
be found at:

http://groups.yahoo.com/group/ReutersCorpora

Best wishes,
Tony
==========
Dr TG Rose
Leader of Language Technology
Reuters Limited, 85 Fleet Street, London EC4P 4AJ
Email: Tony.Rose at reuters.com

- ---------------------------------------------------------------
        Visit our Internet site at http://www.reuters.com

Any views expressed in this message are those of  the  individual
sender,  except  where  the sender specifically states them to be
the views of Reuters Ltd.