A survey on Improving Classification Accuracy in Data Mining
Keywords:
Classification; Pre-processing; Outliers detection; Feature Selection; Dimensionality reductionAbstract
There are various classifiers available for data classification, selecting the best classifier is one of the critical problems of data classification. Also pre-processing approach to be used is quite important. In this paper, study of various approaches to improve the classification accuracy in data mining is carried out. The purpose of the pre-processing is to gain a high degree of distinct classes before the classifier is trained or tested. Handling noise and outliers is an important aspect in data mining to improve the classification accuracy. High accuracy of classification also depends upon the quality of data being used for classification in data mining. Feature selection is also one of the aspects which can refine the dataset before providing it to the learning algorithm to improve the accuracy of the classifier.
References
- Moeinzadeh, H, Nasersharif, B, Rezaee, A., Pazhoumand-dar, H., "Improving Classification Accuracy Using Evolutionary Fuzzy Transformation", 11th Annual Conference on Genetic and Evolutionary Computation Conference (GECCO 2009), Montreal, Canada, 2009 (1)
- Bratu, C.V.; Muresan, T.; Potolea, R., "Improving classification accuracy through feature selection," Intelligent Computer Communication and Processing, 2008. ICCP 2008. 4th International Conference on , vol., no., pp.25,32, 28-30 Aug. 2008.
- Nilsson,R., Statistical Feature Selection, with Applications in Life Science, PhD Thesis, Linkoping University, 2007.
- University, Kohavi, R. Wrappers for Performance Enhancement and Oblivious Decision Graphs, PhD thesis, Stanford University, Computer Science Department, 1995. (3).
- Vidrighin C., Potolea R., „Towards a Combined Approach for Feature Selection", accepted at ICSOFT 2008.
- T. M. Khoshgoftaar, N. Seliya, and K. Gao. Rule-based noise detection for software measurement data. In Proc.of the IEEE int. conf. on inf. Reuse and Integration,pages 302-307. IEEE Syst., Man, and Cybern. Society, 2004.
- Smith, Michael R., and Tony Martinez. "Improving classification accuracy by identifying and removing instances that should be misclassified." Neural Networks (IJCNN), The 2011 International Joint Conference on. IEEE, 2011.
- Han, Jiawei, Micheline Kamber, and Jian Pei. Data mining, southeast asia edition: Concepts and techniques. Morgan kaufmann, 2006.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRSET

This work is licensed under a Creative Commons Attribution 4.0 International License.