[Corpora-List] Anonymization or de-identification guidelines and software?

Eric Atwell csc6ea at leeds.ac.uk
Wed Nov 2 09:11:25 UTC 2011


We are exploring a corpus of Verbal Autopsies: semi-formal interviews
about deaths, mainly mothers describing how their baby died.
Before this corpus can be used more widely, we need to anonymize
or de-identify all references to people.
Can CORPORA experts please direct me to Guidelines or Protocols 
for anonymization or de-personalisation of texts?
(eg to the standard exemplified in the BNC) 
And recommend software to automate this process?

thnaks in advance for help


Eric Atwell, Senior Lecturer, Language research group,
  I-AIBS Institute for Artificial Intelligence and Biological Systems
  School of Computing, Faculty of Engineering, UNIVERSITY OF LEEDS
  Leeds LS2 9JT, England.        TEL: 0113-3435430  FAX: 0113-3435468
  WWW: http://www.comp.leeds.ac.uk/eric
       http://www.comp.leeds.ac.uk/nlp

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list