Efficient Algorithm for Frequent Item Set Generation in Big Data

Priyanka Wankhede; Prof. Vijaya Kamble

doi:10.32628/IJSRSET196172

Authors

Priyanka Wankhede, Department of Computer Science and Engineering, Gurunanak Institute of Engineering and Technology, Nagpur, India
Prof. Vijaya Kamble, Assistant Professor, Department of Computer Science and Engineering, Gurunanak Institute of Engineering and Technology, Nagpur, India

Keywords:

Incremental FP-Growth Algorithm, Big Data, Data Mining, Frequent Itemset Mining.

Abstract

Data mining faces many challenges in the big data era. Association rule mining is an important research area within data mining, but traditional association rule mining algorithms are not sufficient to process large data sets. The Apriori algorithm suffers from high I/O load and low performance. The FP-Growth algorithm is limited as well, for example by the amount of main memory available. Mining frequent itemsets in dynamic scenarios, where the data keeps changing, is also a challenging task. To overcome these issues, a parallelized approach based on the MapReduce framework is used, and the mining algorithm is implemented on Hadoop.
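
To make the MapReduce idea concrete, the following is a minimal sketch of the item-counting pass that parallel frequent itemset miners typically start with. It is not the authors' implementation and does not use Hadoop: the map and reduce phases are imitated in memory with plain Python, and the names `map_phase`, `reduce_phase`, `frequent_items`, the sample transactions, and the `min_support` threshold are all hypothetical, chosen only for illustration.

```python
from collections import Counter
from itertools import chain


def map_phase(transactions):
    """Emit an (item, 1) pair for each distinct item in each transaction."""
    for transaction in transactions:
        for item in set(transaction):
            yield item, 1


def reduce_phase(pairs):
    """Sum the emitted counts per item, as a reducer would do per key."""
    counts = Counter()
    for item, count in pairs:
        counts[item] += count
    return counts


def frequent_items(partitions, min_support):
    """Count item support across all partitions and keep the frequent items."""
    # Each partition stands in for the input split of one mapper (assumption).
    mapped = chain.from_iterable(map_phase(p) for p in partitions)
    counts = reduce_phase(mapped)
    return {item: c for item, c in counts.items() if c >= min_support}


# Hypothetical data: two partitions with two transactions each.
partitions = [
    [["bread", "milk"], ["bread", "butter"]],
    [["milk", "butter", "bread"], ["milk"]],
]
result = frequent_items(partitions, min_support=3)
```

On a real cluster each partition would be processed by a separate mapper, and the per-item sums would be computed by reducers after the shuffle. The resulting frequent items are what an FP-Growth style algorithm would then use to order and prune transactions before building its trees.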

