An Improved K-Means Clustering Algorithm

Ekta Joshi; Dr. D. A. Parikh

doi:10.32628/IJSRSET184240

Authors

Ekta Joshi, Computer Engineering, L.D. College of Engineering, Ahmedabad, India
Dr. D. A. Parikh, HOD, Computer Engineering, L.D. College of Engineering, Ahmedabad, India

Keywords:

K-Means, Clustering, Data Mining, Big Data

Abstract

The vast spread of computing technologies has led to an abundance of large data sets. Thus, there is a need to find similarities and define groupings among the elements of these big data sets. One way to find these similarities is data clustering. Several data clustering algorithms currently exist, differing in their application area and efficiency. Increases in computational power and algorithmic improvements have reduced the time needed to cluster big data sets. However, big data sets often cannot be processed whole because of hardware and computational restrictions. Clustering techniques such as K-Means are useful for analyzing data in a parallel fashion. K-Means depends largely on proper initialization to produce optimal results.
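
The dependence on initialization can be illustrated with a small sketch. The code below is the standard Lloyd-style K-Means iteration with an optional k-means++ seeding step. It is not the improved algorithm proposed in the paper, which the abstract does not describe. All function names and the toy data are assumptions made for illustration.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centers):
    """Group points by the index of their nearest center."""
    clusters = [[] for _ in centers]
    for p in points:
        idx = min(range(len(centers)),
                  key=lambda i: squared_distance(p, centers[i]))
        clusters[idx].append(p)
    return clusters


def lloyd(points, centers, max_iter=100):
    """Standard K-Means iteration starting from the given centers."""
    centers = [tuple(float(v) for v in c) for c in centers]
    for _ in range(max_iter):
        clusters = assign(points, centers)
        # An empty cluster keeps its previous center.
        new_centers = [
            tuple(sum(col) / len(cl) for col in zip(*cl)) if cl else centers[i]
            for i, cl in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, assign(points, centers)


def inertia(centers, clusters):
    """Within-cluster sum of squared distances (lower is better)."""
    return sum(squared_distance(p, c)
               for c, cl in zip(centers, clusters) for p in cl)


def kmeans_pp_init(points, k, rng):
    """k-means++ seeding: later centers are drawn with probability
    proportional to squared distance from the nearest chosen center."""
    centers = [rng.choice(points)]
    while len(centers) < k:
        weights = [min(squared_distance(p, c) for c in centers) for p in points]
        if sum(weights) == 0:  # every point coincides with a chosen center
            centers.append(rng.choice(points))
        else:
            centers.append(rng.choices(points, weights=weights, k=1)[0])
    return centers


# Toy data (assumed): three small, well-separated groups in 2D.
group_a = [(0, 0), (0, 1), (1, 0), (1, 1)]
group_b = [(10, 0), (10, 1), (11, 0), (11, 1)]
group_c = [(10, 5), (10, 6), (11, 5), (11, 6)]
points = group_a + group_b + group_c

# One starting center per group versus all starting centers in one group.
good_centers, good_clusters = lloyd(points, [(0, 0), (10, 0), (10, 5)])
bad_centers, bad_clusters = lloyd(points, [(0, 0), (0, 1), (1, 0)])
good_inertia = inertia(good_centers, good_clusters)
bad_inertia = inertia(bad_centers, bad_clusters)

# Seeded start for comparison; the seed value is arbitrary.
pp_start = kmeans_pp_init(points, 3, random.Random(0))
pp_centers, pp_clusters = lloyd(points, pp_start)

print("spread-out start:", good_inertia)
print("crowded start:   ", bad_inertia)
print("k-means++ start: ", inertia(pp_centers, pp_clusters))
```

On this toy data, the start with one center per group recovers the three groups, while the start with all centers in one group converges to a worse local optimum that splits one group and merges the other two. Seeding schemes such as k-means++ aim to make the second situation less likely, though they do not rule it out.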

