[Corpora-List] Anonymization or de-identification guidelines and software?
Eric Atwell
csc6ea at leeds.ac.uk
Wed Nov 2 09:11:25 UTC 2011
We are exploring a corpus of Verbal Autopsies: semi-formal interviews
about deaths, mainly mothers describing how their baby died.
Before this corpus can be used more widely, we need to anonymize
or de-identify all references to people.
Can CORPORA experts please direct me to Guidelines or Protocols
for anonymization or de-personalisation of texts?
(eg to the standard exemplified in the BNC)
And recommend software to automate this process?
thnaks in advance for help
Eric Atwell, Senior Lecturer, Language research group,
I-AIBS Institute for Artificial Intelligence and Biological Systems
School of Computing, Faculty of Engineering, UNIVERSITY OF LEEDS
Leeds LS2 9JT, England. TEL: 0113-3435430 FAX: 0113-3435468
WWW: http://www.comp.leeds.ac.uk/eric
http://www.comp.leeds.ac.uk/nlp
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list