2003
Conference Proceedings:
Egov OpenSource and
SecurE-biz Executive Summit
Conference Topic Map

Browse
by Topic Category:
Keyword
Author
Presentation
Organization
City
State
Country

Browse:
XTM Format

Abstract

 Berman, Jules    Open Source Confidentiality Methods   

Open Source Confidentiality Methods

Abstract

Scientific progress requires the free exchange of research data. Because  HIPAA   NIH  medical research  patient identification fields   is often conducted using confidential records, medical researchers have historically refused to share their primary data, thus denying other scientists the opportunity of using these data sets for further research. Pressured by federal regulations restricting the use of identified medical records ( and the Common Rule), and by recent data-sharing proposals from and from publishers, researchers have devised a variety of innovative technical solutions that permit researchers to obtain and share large data sets derived from medical records without breaching patient confidentiality. Some of the methods used are: one-way hashing of (such as name and social security number),data scrubbing (removing private information from free-text), threshold splitting (dividing text into multiple files, any one of which can be shared and used for scientific purposes without breaching confidentiality), and data ambiguating (ensuring non-uniqueness of records). Using these methods, large medical data sets can be safely used for research without obtaining patient consent and can be shared by the scientific community. These methods and their available open source implementations will be discussed.

Overview
Presentation