Using Variable Precision Rough Set for Selection and Classification of Biological Knowledge Integrated in DNA Gene Expression
Calvo-Dmgz, Diego ; Galvez, Juan Francisco ; Glez-Pena, Daniel ; Gomez-Meire, Silvana ; Fdez-Riverola, Florentino
Journal of Integrative Bioinformatics - JIB (ISSN 1613-4516)
DNA microarrays have contributed to the exponential growth of genomic and experimental data in the last decade. Researchers have used this large volume of gene expression data to pursue the diagnosis of diseases such as cancer with machine learning methods. In turn, explicit biological knowledge about gene functions has also grown tremendously over the same period. This work integrates explicit biological knowledge, provided as gene sets, into the classification process by means of Variable Precision Rough Set Theory (VPRS). The proposed model is able to highlight which part of the provided biological knowledge has been important for classification. This paper presents a novel model for microarray data classification that incorporates prior biological knowledge in the form of gene sets. Based on this knowledge, we transform the input microarray data into supergenes, and then apply rough set theory to select the most promising supergenes and to derive a set of easily interpretable classification rules. The proposed model is evaluated on three breast cancer microarray datasets, with successful results compared to classical classification techniques. The experimental results show no significant differences between our model and classical techniques, but the model provides a biologically interpretable explanation of how it classifies new samples.
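The pipeline described in the abstract (gene sets, then supergenes, then rough-set selection) can be illustrated with a minimal sketch. The abstract does not say how a supergene is computed or how values are discretized, so everything below is an assumption made for illustration: a supergene is taken as the mean expression of the genes in a set, values are binarized with a single threshold, and the VPRS step uses the common formulation in which an equivalence class belongs to the β-lower approximation of a target class when at least a fraction β of its samples carry that label. The function names, the toy data, and the threshold are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from statistics import mean


def to_supergenes(expression, gene_sets):
    """Collapse gene-level values into one value per gene set.

    Assumption: a supergene is the mean expression of the set's genes
    that are present in the sample (the paper may use another summary).
    """
    return {
        sample: {
            name: mean(genes[g] for g in members if g in genes)
            for name, members in gene_sets.items()
        }
        for sample, genes in expression.items()
    }


def discretize(supergenes, threshold):
    """Binarize supergene values (hypothetical single-threshold scheme)."""
    return {
        sample: {name: int(value >= threshold) for name, value in values.items()}
        for sample, values in supergenes.items()
    }


def beta_lower_approximation(table, labels, attributes, target, beta):
    """Return samples in equivalence classes whose share of `target` is >= beta."""
    # Samples with identical values on `attributes` are indiscernible.
    classes = defaultdict(list)
    for sample, row in table.items():
        classes[tuple(row[a] for a in attributes)].append(sample)

    lower = set()
    for members in classes.values():
        hits = sum(1 for s in members if labels[s] == target)
        if hits / len(members) >= beta:
            lower.update(members)
    return lower


# Toy data, invented for illustration only.
expression = {
    "s1": {"g1": 2.0, "g2": 4.0, "g3": 0.0},
    "s2": {"g1": 3.0, "g2": 3.0, "g3": 1.0},
    "s3": {"g1": 0.0, "g2": 0.0, "g3": 5.0},
    "s4": {"g1": 4.0, "g2": 2.0, "g3": 0.0},
}
gene_sets = {"setA": ["g1", "g2"], "setB": ["g3"]}
labels = {"s1": "tumor", "s2": "tumor", "s3": "normal", "s4": "normal"}

supergenes = to_supergenes(expression, gene_sets)
table = discretize(supergenes, threshold=2.0)
relaxed = beta_lower_approximation(table, labels, ["setA", "setB"], "tumor", 0.6)
strict = beta_lower_approximation(table, labels, ["setA", "setB"], "tumor", 1.0)
```

In this toy example, samples s1, s2, and s4 are indiscernible after discretization and two of the three are labeled tumor, so the class is accepted at β = 0.6 but rejected at β = 1.0, which corresponds to the classical rough set lower approximation. This tolerance for a bounded amount of label disagreement is what distinguishes the variable precision model.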
Faculty of Technology, Research Groups in Informatics
Data processing, computer science, computer systems
Using Variable Precision Rough Set for Selection and Classification of Biological Knowledge Integrated in DNA Gene Expression. Journal of Integrative Bioinformatics - JIB (ISSN 1613-4516), 9(3): Special Issue: Selected extended papers of the 6th International Conference on Practical Applications of Computational Biology and Bioinformatics, Salamanca, Spain, 2012