[Corpora-List] Anonymization tools for patient record research methods

Uzuner, Ozlem OUzuner at uamail.albany.edu
Sat May 28 05:39:12 UTC 2011


Hi Eric,
Here are a few leads from the i2b2 de-identification challenge in 2006:

        Uzuner Ö, Juo Y, Szolovits P.  Evaluating the state-of-the-art in automatic de-identification.  J Am Med Inform Assoc. 2007, 14(5):550-63. http://www.jamia.org/cgi/content/abstract/14/5/550
        Uzuner Ö , Sibanda T, Luo Y, Szolovits P.   A De-identifier for Medical Discharge Summaries   International Journal Artificial Intelligence in Medicine. 2008; 42(1): 13-35. www.aiimjournal.com/article/SO933-3657(07)00132-7/pdf
        Hara K. Applying a SVM based chunker and a text classifier to the deid challenge.  Online only at www.jamia.org
        Wellner B, Huyck M, Mardis S, Aberdeen J, Morgan M, Peshkin L, Yeh A, Hitzeman J, Hirschman L.  Rapidly retargetable approaches to de-identification in medical records.  J Am Med Inform Assoc. 2007; 12(5):564-73. http://www.jamia.org/cgi/content/abstract/14/5/564
        Szarvas Gy, Farkas R, Busa-Fekete R.  State-of-the-art anonymisation of medical records using an iterative machine learning framewor.  J Am Med Inform Assoc.  2007; 14(5):574-80. http://www.jamia.org/cgi/content/abstract/M2441v1

Thanks,
Ozlem.
________________________________________
From: corpora-bounces at uib.no [corpora-bounces at uib.no] On Behalf Of Eric Atwell [csc6ea at leeds.ac.uk]
Sent: Friday, May 27, 2011 6:12 PM
To: corpora at uib.no
Subject: [Corpora-List] Anonymization tools for patient record research methods

We are investigating research methods for patient records.
To be available for Corpus Linguistics analysis, patient records
have to be anonymised, so individual patients cannot be identified.
Can anyone point us at tools to (semi-)automate anonymization or
deidentification of health text data (or any other text data)?

I managed to find "deid" in Physionet
http://www.physionet.org/physiotools/deid/
Neamatullah I, Douglass M, Lehman LH, Reisner A, Villarroel M, Long WJ,
Szolovits P, Moody GB, Mark RG, Clifford GD. Automated De-Identification
of Free-Text Medical Records. British Medical Council: Medical Informatics
and Decision Making, 2008, 8:32.

and a survey:
Ozlem Uzuner, Yuan Luo, Peter Szolovits. Evaluating the State-of-the-Art
in Automatic De-identification. JAMIA Journal of the American Medical
Informatics Association, 2007,14:550-563

thanks forany other recommendations

Eric Atwell, Senior Lecturer, Language research group,
  I-AIBS Institute for Artificial Intelligence and Biological Systems
  School of Computing, Faculty of Engineering, UNIVERSITY OF LEEDS
  Leeds LS2 9JT, England.        TEL: 0113-3435430  FAX: 0113-3435468
  WWW: http://www.comp.leeds.ac.uk/arabic
       http://www.comp.leeds.ac.uk/nlp

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list