A Rough Set based Gene Expression Clustering Algorithm
Abstract
Problem statement: Microarray technology helps in monitoring the expression levels of thousands of genes across collections of related samples. Approach: The main goal in the analysis of large and heterogeneous gene expression datasets was to identify groups of genes that get expressed in a set of experimental conditions. Results: Several clustering techniques have been proposed for identifying gene signatures and to understand their role and many of them have been applied to gene expression data, but with partial success. The main aim of this work was to develop a clustering algorithm that would successfully indentify gene patterns. The proposed novel clustering technique (RCGED) provides an efficient way of finding the hidden and unique gene expression patterns. It overcomes the restriction of one object being placed in only one cluster. Conclusion/Recommendations: The proposed algorithm is termed intelligent because it automatically determines the optimum number of clusters. The proposed algorithm was experimented with colon cancer dataset and the results were compared with Rough Fuzzy K Means algorithm.
DOI: https://doi.org/10.3844/jcssp.2011.986.990
Copyright: © 2011 J. Jeba Emilyn and K. Ramar. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 3,404 Views
- 3,534 Downloads
- 4 Citations
Download
Keywords
- Microarray technology
- clustering algorithm
- gene expression data
- fuzzy membership
- rough clustering
- clustering technique
- knowledge discovery
- data mining
- attribute clustering