Achieving k-anonymity in DataMarts used for gene expressions exploitation
Stark, Konrad ; Eder, Johann ; Zatloukal, Kurt
Journal of Integrative Bioinformatics - JIB (ISSN 1613-4516)
Abstract:
Gene expression profiling is a sophisticated method for discovering differences in the activation patterns of genes between different patient collectives. When patient groups are defined sensibly from a medical point of view, subsequent gene expression analysis may reveal disease-related gene expression patterns that are suitable as tumor markers and for pharmacological target identification. When patient-specific data is released for medical studies, privacy protection has to be guaranteed for ethical and legal reasons. k-anonymisation may be used to generate groups of at least k data twins, so that sensitive data used in analyses is protected from being linked to individuals. We use an adapted concept of k-anonymity for distributed data sources and include various customisation parameters in the anonymisation process to guarantee that the transformed data is still suitable for further processing. We present a real-world, medically relevant use case and show how the related data is materialised, anonymised, and released in a data mart for testing the related hypotheses.
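The "data twins" idea in the abstract can be illustrated with a minimal sketch of the standard k-anonymity condition: every combination of quasi-identifying values must occur in at least k records. This is not the adapted, distributed variant the paper describes. The function name, the attribute names, and the sample records below are hypothetical and chosen only for illustration.

```python
from collections import Counter


def is_k_anonymous(records, quasi_identifiers, k):
    """Return True if every combination of quasi-identifier values
    occurs in at least k records (the standard k-anonymity condition)."""
    groups = Counter(
        tuple(record[attr] for attr in quasi_identifiers) for record in records
    )
    return all(count >= k for count in groups.values())


# Hypothetical, already-generalised patient records (not data from the paper).
patients = [
    {"age_range": "40-49", "sex": "F", "diagnosis": "A"},
    {"age_range": "40-49", "sex": "F", "diagnosis": "B"},
    {"age_range": "50-59", "sex": "M", "diagnosis": "A"},
    {"age_range": "50-59", "sex": "M", "diagnosis": "C"},
]

# Each (age_range, sex) combination appears twice, so the data is
# 2-anonymous with respect to these attributes, but not 3-anonymous.
two_anonymous = is_k_anonymous(patients, ["age_range", "sex"], 2)
three_anonymous = is_k_anonymous(patients, ["age_range", "sex"], 3)
```

A check of this kind only verifies the property. Producing k-anonymous data additionally requires generalising or suppressing attribute values, which is where the customisation parameters mentioned in the abstract would come in.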
Institution:
Faculty of Technology, Research Groups in Informatics
DDC classification:
Data processing, computer science, computer systems
Suggested Citation:
Stark, Konrad ; Eder, Johann ; Zatloukal, Kurt (2007) Achieving k-anonymity in DataMarts used for gene expressions exploitation.
Journal of Integrative Bioinformatics - JIB (ISSN 1613-4516), 4(1), 2007. Special Issue: Data Warehouse Technologies in Bioinformatics, Wittenberg, Germany, 2006
Online-Journal: http://journal.imbio.de/index.php?paper_id=58
URL:
http://biecoll.ub.uni-bielefeld.de/volltexte/2007/239