Privacy Preservation of Big Data Using Hadoop

Authors

  • Vishal Phadtare  College of Engineer and Management Studies, Pune University, Maharashtra, India
  • Samir Kashid  College of Engineer and Management Studies, Pune University, Maharashtra, India
  • Monika Dherange  College of Engineer and Management Studies, Pune University, Maharashtra, India
  • Prof. Pramod Murkute  College of Engineer and Management Studies, Pune University, Maharashtra, India

Keywords:

Privacy, security, integrity, and protection, distributed databases, SMC, TTP

Abstract

In big data applications, the volume of collected data grows continuously, which makes it expensive to capture, manage, extract, and process the data with existing software tools. Data analysis likewise becomes more expensive as the volume of data in the data warehouse increases. Data privacy is one of the challenges of data mining with big data. To preserve user privacy, we need a method that protects the data while also increasing data utility. Existing centralized algorithms assume that all data reside at a single location for anonymization, which is not feasible for large-scale datasets. Existing distributed algorithms focus mainly on privacy preservation for large datasets rather than on scalability. The proposed system aims to maintain privacy for distributed data and to overcome the problems of the M-privacy and secrecy approaches by means of a new anonymization and slicing technique. Our main goal is to publish an anonymized view of the integrated data that protects against attacks. We use the MR-Cube approach, which addresses the challenges of large-scale cube computation with holistic measures. The slicing process comprises tuple partitioning, vertical and horizontal partitioning, generalization, slicing, and anonymization. Once slicing succeeds, users can access the anonymized data easily and effectively.
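
The slicing step named in the abstract can be illustrated with a minimal, single-machine sketch. It is not the authors' implementation and does not use Hadoop or MR-Cube; it only shows the general idea of slicing as usually described in the privacy-preserving data publishing literature: attributes are partitioned vertically into column groups, tuples are partitioned horizontally into buckets, and the values of each column group are shuffled independently inside each bucket so that the link between groups is broken. The names `slice_table`, `generalize_age`, `column_groups`, and `bucket_size`, as well as the sample records, are assumptions made for this illustration.

```python
import random


def generalize_age(age):
    """Generalize an exact age to a ten-year range (hypothetical helper)."""
    low = (age // 10) * 10
    return f"{low}-{low + 9}"


def slice_table(rows, column_groups, bucket_size, seed=0):
    """Return a sliced view of `rows` (illustrative sketch only).

    rows          -- list of dicts, one per tuple
    column_groups -- vertical partition: list of tuples of attribute names
    bucket_size   -- horizontal partition: number of tuples per bucket
    """
    rng = random.Random(seed)
    sliced = []
    for start in range(0, len(rows), bucket_size):
        bucket = rows[start:start + bucket_size]
        columns = []
        for group in column_groups:
            # Project the bucket onto one column group ...
            values = [tuple(row[attr] for attr in group) for row in bucket]
            # ... and permute it independently of the other groups.
            rng.shuffle(values)
            columns.append(values)
        # Re-assemble the bucket from the independently permuted groups.
        sliced.extend(zip(*columns))
    return sliced


# Made-up sample records; the age is generalized before slicing.
records = [
    {"age": generalize_age(23), "zip": "411001", "disease": "flu"},
    {"age": generalize_age(27), "zip": "411002", "disease": "asthma"},
    {"age": generalize_age(52), "zip": "411045", "disease": "diabetes"},
    {"age": generalize_age(58), "zip": "411046", "disease": "flu"},
]

published = slice_table(records, [("age", "zip"), ("disease",)], bucket_size=2)
for row in published:
    print(row)
```

Within a bucket, each column group keeps the same multiset of values, so aggregate statistics over a group survive, while an observer can no longer tell which sensitive value belonged to which quasi-identifier combination. In a distributed setting, each bucket could presumably be processed by a separate worker, but that mapping is an assumption and is not taken from the paper.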

Published

2016-06-30

Section

Research Articles

How to Cite

[1]
Vishal Phadtare, Samir Kashid, Monika Dherange, Prof. Pramod Murkute, "Privacy Preservation of Big Data Using Hadoop", International Journal of Scientific Research in Science, Engineering and Technology (IJSRSET), Print ISSN: 2395-1990, Online ISSN: 2394-4099, Volume 2, Issue 3, pp. 364-372, May-June 2016.