[Corpora-List] Gold standard for document similarity

Ivelina Nikolova iva at lml.bas.bg
Tue Mar 4 15:48:08 UTC 2014


Dear corpora members,

I am looking for a gold standard to train/evaluate document similarity 
metrics.
Can anyone suggest a suitable corpus for such purposes. I'm especially 
interested in similarity between newspaper articles.

Thanks in advance,
Ivelina

-- 
Ivelina Nikolova
PhD student in Computer Science
Linguistic Modelling Department
Institute of Information and Communication Technologies
Bulgarian Academy of Sciences


_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list