[Corpora-List] corpus of abstracts/papers with free-form keywords

Mark Johnson mark.johnson at mq.edu.au
Thu Dec 2 14:12:54 UTC 2010


I'm trying to evaluate unsupervised algorithms for identifying topical
collocations in document collections. One idea I've had: given a corpus
of abstracts or papers that have been manually labelled with free-form
keywords, I could measure how closely the topical collocations found for
each document match its human-annotated keywords. Can anyone point me
to a suitable corpus -- perhaps one that has already been used for this
purpose?
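
A minimal sketch of the kind of comparison described above, under stated
assumptions: collocations and keywords are compared per document by exact
match after lowercasing and whitespace normalization, and the overlap is
summarized as precision, recall and F1. The function names and the matching
rule are illustrative only; nothing in this message fixes a particular
metric, and partial or stemmed matching may be more appropriate in practice.

```python
def normalize(phrase):
    """Lowercase a phrase and collapse runs of whitespace (assumed matching rule)."""
    return " ".join(phrase.lower().split())


def keyword_match_scores(predicted, gold):
    """Compare predicted collocations with human-annotated keywords.

    Both arguments are iterables of strings for a single document.
    Returns (precision, recall, f1) under exact match after normalization.
    """
    pred = {normalize(p) for p in predicted}
    ref = {normalize(g) for g in gold}
    # Ignore phrases that are empty after normalization.
    pred.discard("")
    ref.discard("")

    hits = len(pred & ref)
    precision = hits / len(pred) if pred else 0.0
    recall = hits / len(ref) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


# Hypothetical example data, not taken from any real corpus.
example_predicted = ["Topic Model", "language  acquisition", "neural network"]
example_gold = ["topic model", "language acquisition",
                "Bayesian inference", "grammar induction"]
example_scores = keyword_match_scores(example_predicted, example_gold)
```

Per-document scores like these could then be averaged over the corpus to
compare algorithms.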

Thanks in advance,

Mark Johnson

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


