An Improved Microarray Gene Expression Classification Using Fuzzy Expert System and Ant Bee Algorithm

Authors

  • S. Deepakkumar  Computer Science and Engineering, Sri Vidya College of Engineering and Technology, Virudhunagar, Tamilnadu, India
  • M. Mohankumar  Computer Science and Engineering, Sri Vidya College of Engineering and Technology, Virudhunagar, Tamilnadu, India
  • Dr. P. Murugeswari  Computer Science and Engineering, Sri Vidya College of Engineering and Technology, Virudhunagar, Tamilnadu, India

Keywords:

Medical Diagnosis, Fuzzy expert system, Micro array data, artificial bee colony, ant colony optimization and mutual information.

Abstract

Medical diagnosis and dealing of micro array data can be done effectively by means of fuzzy expert system. The main intention of using fuzzy system is high perfection and minimum complexity in dealing with classification of medical data. The existing GSA (Genetic Swarm Algorithm) enables high precision in classification on fuzzy expert system with desired cost on medical research. The main drawback of GSA is if-then rules which is multifaceted and protracted that’s highly complicated, those are difficult for a physician to understand. In order to address the interpretability-accuracy tradeoff, a development is made in presenting the rule set by the combination of integer numbers and the task of rule generation. Ant colony optimization (ACO) generates simple rule set according to the gene expression values by which fuzzy partition is applied. But it still suffers from addressing the formless and continuous expression values of a gene. In this paper, we propose artificial bee colony (ABC) algorithm based on mutual information. This mutual information effective in analyzing the informative genes and the improved proposed mechanism hybrid Ant Bee Algorithm (ABA) using fuzzy-II logic is computed with six gene expression data sets in order to examine its effectiveness. The results shows our improved proposed mechanism achieves more perfection in fuzzy system by the combination of highly interpretable and compact rules among all the data sets thus proves its performance is far better than other traditional mechanisms.

References

  1. T. R. Golub, D.K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J.P. Mesirov, "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring," Science, vol. 286, pp. 531-537, 1999.
  2. L. Li, "Gene Selection for Sample Classification Based on Gene Expression Data: Study of Sensitivity to Choice of Parameters of the ga/knn Method," Bioinformatics, vol. 17, pp. 1131-1142, 2001.
  3. S. Dudoit, J. Fridlyand, and T.P. Speed, "Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data," J. Am. Statistics Assoc., vol. 97, no. 457, pp. 77- 87, 2000.
  4. G. Fort and S.L. Lacroix, "Classification Using Partial Least Squares with Penalized Logistic Regression," Bioinformatics, vol. 21, no. 7, pp. 1104-1111, 2005.
  5. L. Fan, K.L. Poh, and P. Zhou, "A Sequential Feature Extraction Approach for Na€ıve Bayes Classification of Microarray Data," Expert Systems with Applications, vol. 36, no. 6, pp. 9919-9923, 2009.
  6. T.S. Furey, N. Cristianini, N. Duffy, D.W. Bednarski, M. Schummer, and D. Haussler, "Support Vector Machine Classification and Validation of Cancer Tissue Samples Using Microarray Expression Data," Bioinformatics, vol. 16, pp. 906-914, 2000.
  7. J. Khan, J.S. Wei, M. Ringner, L.H. Saal, M. Ladanyi, F. Westermann, F. Berthold, M. Schwab, C.R. Antonescu, C. Peterson, and P.S. Meltzer, "Classification and Diagnostic Prediction of Cancers Using Gene Expression Profiling and Artificial Neural Networks," Nature Medicine, vol. 7, pp. 673-679, 2001.
  8. A.C. Tan and D. Gilbert, "Ensemble Machine Learning on Gene Expression Data for Cancer Classification," Applied Bioinformatics, vol. 2, pp. 75-83, 2003.
  9. D.E. Johnson, F.J. Oles, T. Zhang, and T. Goetz, "A Decision-Tree- Based Symbolic Rule Induction System for Text Categorization," IBM Systems J., vol. 41, no. 3, pp. 1-10, 2002.
  10. J.S.R. Jang, C.T. Sun, and E. Mizutani, Neuro-Fuzzy and Soft Computting. Prentice Hall, 1997.
  11. A.C. Tan, D.Q. Naiman, L. Xu, R.L. Winslow, and D. Geman, "Simple Decision Rules for Classifying Human Cancers from Gene Expression Profiles," Bioinformatics, vol. 21, pp. 3896-3904, 2005.
  12. Y. Yoon, S. Bien, and S. Park, "Microarray Data Classifier Consisting of k-Top-Scoring Rank-Comparison Decision Rules with a Variable Number of Genes," IEEE Trans. Systems, Man, and Cybernetics- Part C: Applications and Rev., vol. 40, no. 2, pp. 216-226, Mar. 2010.
  13. P. Woolf and Y. Wang, "A Fuzzy Logic Approach to Analyzing Gene Expression Data," Physiological Genomics, vol. 3, pp. 9-15, 2000.
  14. S. Vinterbo, "Small, Fuzzy and Interpretable Gene Expression Based Classifiers," Bioinformatics, vol. 21, no. 9, pp. 1964-1970, 2005.
  15. G. Schaefer, "Thermography Based Breast Cancer Analysis Using Statistical Features and Fuzzy Classification," Pattern Recognition, vol. 42, no. 6, pp. 1133-1137, 2009.
  16. X. Zong, Z. Yong, J. Li-min, and H. Wei-li, "Construct Interpretable Fuzzy Classification System Based on Fuzzy Clustering Initialization," Int’l J. Information Technology, vol. 11, no. 6, pp. 91- 107, 2005.
  17. Y. Hu, "Fuzzy Integral-Based Perceptron for Two-Class Pattern Classification Problems," Information Sciences, vol. 177, no. 7, pp. 1673-1686, 2007.
  18. Z. Wang and V. Palade, "A Comprehensive Fuzzy-Based Framework for Cancer Microarray Data Gene Expression Analysis," Proc. IEEE Int’l Conf. Bioinformatics and Bioeng., pp. 1003-1010, 2007.
  19. S.M. Chen and F.M. Tsai, "Generating Fuzzy Rules from Training Instances for Fuzzy Classification Systems," Expert Systems with Applications, vol. 35, no. 3, pp. 611-621, 2008.
  20. G. Schaefer and T. Nakashima, "Data Mining of Gene Expression Data by Fuzzy and Hybrid Fuzzy Methods," IEEE Trans. Information Technology in Biomedicine, vol. 14, no. 1, pp. 23-29, Jan. 2010.
  21. P. Maji, "f-Information Measures for Efficient Selection of Discriminative Genes from Microarray Data," IEEE Trans. Biomedicine Eng., vol. 56, no. 4, pp. 1063-1069, Apr. 2009.
  22. P. Maji and S.K. Pal, "Fuzzy-Rough Sets for Information Measures and Selection of Relevant Genes from Microarray Data," IEEE Trans. Systems, Man and Cybernetics, vol. 40, no. 3, pp. 741-752, June 2010.
  23. Y. Leung and Y. Hung, "A Multiple-Filter-Multipe-Wrapper Approach to Gene Selection and Microarray Data Classification," IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 7, no. 1, pp. 108-117, Jan./Feb. 2010.
  24. C. Lazar, J. Taminau, S. Meganck, D. Steenhoff, A. Coletta, C. Molter, V. Schaetzen, R. Duque, H. Bersini, and A. Nowe, "A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis," , vol. 9, no. 4, pp. 1106-1119, 2012.
  25. Sharma, S. Imoto, and S. Miyano, "A Top-r Feature Selection Algorithm for Microarray Gene Expression Data," IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 9, no. 3, pp. 754-764, May/June 2012.
  26. P. Maji, "f-Information Measures for Efficient Selection of Discriminative Genes from Microarray Data," IEEE Trans. Biomedicine Eng., vol. 56, no. 4, pp. 1063-1069, Apr. 2009.
  27. P. Maji and S.K. Pal, "Fuzzy-Rough Sets for Information Measures and Selection of Relevant Genes from Microarray Data," IEEE Trans. Systems, Man and Cybernetics, vol. 40, no. 3, pp. 741-752, June 2010.
  28. Y. Leung and Y. Hung, "A Multiple-Filter-Multipe-Wrapper Approach to Gene Selection and Microarray Data Classification," IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 7, no. 1, pp. 108-117, Jan./Feb. 2010.
  29. Lazar, J. Taminau, S. Meganck, D. Steenhoff, A. Coletta, C. Molter, V. Schaetzen, R. Duque, H. Bersini, and A. Nowe, "A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis," , vol. 9, no. 4, pp. 1106-1119, 2012.
  30. Sharma, S. Imoto, and S. Miyano, "A Top-r Feature Selection Algorithm for Microarray Gene Expression Data," IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 9, no. 3, pp. 754-764, May/June 2012.
  31. V.K. Mootha, C.M. Lindgren, K.F. Eriksson, A. Subramanian, S. Sihag, J. Lehar, P. Puigserver, et.al, "PGC-1a Responsive Genes Involved in Oxidative Phosphorylation are Coordinately Down Regulated in Human Diabetes," Nature Genetics, vol. 34, no. 3, pp. 267-273, 2003.
  32. U. Alon, N. Barkai, D.A. Notterman, K. Gish, S. Ybarra, D. Mack, and A.J. Levine, "Broad Patterns of Gene Expression Revealed by Clustering of Tumor and Normal Colon Tissues Probed by Oligonucleotide Arrays," Proc. Nat’l Academy of Sciences USA, vol. 96, no. 12, pp. 6745-6750, 1999.
  33. A.A. Alizadeh, M.B. Eisen, R.E. Davis, C. Ma, I.S. Lossos, A. Rosenwald, J.C. Boldrick et.al, "Distinct Types of Diffuse Large B-Cell Lymphoma Identified by Gene Expression Profiling," Nature, vol. 403, no. 3, pp. 503-511, 2000.
  34. T.C. Kraan, F.A. Gaalen, P.V. Kasperkovitz, N.L. Verbeet, T.J. Smeets, M.C. Kraan, "Rheumatoid Arthritis Is a Heterogeneous Disease: Evidence for Differences in the Activation of the STAT-1 Pathway between Rheumatoid Tissues," Arthritis and Rheumatism, vol. 48, no. 8, pp. 2132-2145, 2003.
  35. V.H. Teixeira, R. Olaso, M.L.M. Magniette, S. Lasbleiz, L. Jacq, C.R. Oliveira, P. Hilliquin, "Transcriptome Analysis Describing New Immunity and Defense Genes in Peripheral Blood Mononuclear Cells of Rheumatoid Arthritis Patients," PLoS ONE, vol. 4, no. 8, p. e6803, 2009.

Downloads

Published

2016-06-30

Issue

Section

Research Articles

How to Cite

[1]
S. Deepakkumar, M. Mohankumar, Dr. P. Murugeswari, " An Improved Microarray Gene Expression Classification Using Fuzzy Expert System and Ant Bee Algorithm, International Journal of Scientific Research in Science, Engineering and Technology(IJSRSET), Print ISSN : 2395-1990, Online ISSN : 2394-4099, Volume 2, Issue 3, pp.722-727, May-June-2016.