[Corpora-List] sports-related resources / corpora
Diana McCarthy
diana at dianamccarthy.co.uk
Thu Sep 5 09:32:16 UTC 2013
Hi William,
You may be interested in the Reuters corpus, which has documents marked
up with various topic codes, including sports:
Tony Rose, Mark Stevenson, and Miles Whitehead (2002) The Reuters Corpus
Volume 1 - from Yesterday's News to Tomorrow's Language Resources. LREC 2002.
We manually annotated a small subset of sentences from both the sports
and finance documents of this corpus using WordNet senses. The data is
described in:
Rob Koeling, Diana McCarthy, and John Carroll (2005) Domain-Specific
Sense Distributions and Predominant Sense Acquisition. In Proceedings of
the Human Language Technology Conference and Conference on Empirical
Methods in Natural Language Processing. HLT/EMNLP 2005, pp. 419-426.
The data is available for download at
http://www.dianamccarthy.co.uk/downloads/hlt2005releasev2.tgz
Best wishes,
Diana
On 04/09/13 14:38, William Li wrote:
> Hi everyone,
>
> I'm looking to get suggestions of available sports-related resources
> and corpora, including word lists, WordNet-type resources, collections
> of documents, news articles, sports statistics, etc. I'm new to this
> particular domain, so does anyone have any suggestions?
>
> Thanks,
> William
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
--
===========================================================================
Diana McCarthy,
http://www.dianamccarthy.co.uk/
===========================================================================