Content Based E-Mail Classification

Authors

  • Er.Sonal Chakole  Assistant Professor, Department of Computer Science and Engineering, Priyadarshini J. L. College of Engineering, Nagpur, Maharashtra, India
  • Sarita Padole  BE Scholar, Department of Computer Science and Engineering, Priyadarshini J. L. College of Engineering, Nagpur, Maharashtra, India
  • Apurva Kamble  BE Scholar, Department of Computer Science and Engineering, Priyadarshini J. L. College of Engineering, Nagpur, Maharashtra, India
  • 4Vandana Wadekar  BE Scholar, Department of Computer Science and Engineering, Priyadarshini J. L. College of Engineering, Nagpur, Maharashtra, India
  • Ankit Dhande  BE Scholar, Department of Computer Science and Engineering, Priyadarshini J. L. College of Engineering, Nagpur, Maharashtra, India

DOI:

https://doi.org//10.32628/IJSRSET218323

Keywords:

Email classifications, Content Based Filtering, Spam detector, Data mining Classification

Abstract

Electronic Mail (E-mail) has established a significant place in information user’s life. Mails are used as a major and important mode of information sharing because emails are faster and effective way of communication. Email plays its important role of communication in both personal and professional aspects of one’s life. The rapid increase in the number of account holders from last few decades and the increase in the volume of mails have generated various serious issues too. The content base mail classification can be classified into four ways namely Private, Public, Newsletter, and Anonymous. Every user has the right to choose their keyword (a semi-private password). Those contacts who know the user’s keyword will be classified as private contacts and those users who are unknown them classified anonymous contacts. A contact can be classified as public or private, upon verification of an anonymous contact. Any newsletter or group mails are classified into newsletter contacts. It is highly likely that the rests are junk mail or spam. In this project, a spam detector to identify an email as either spam or ham is built using n-gram analysis. The system involves the classification of mails based on user’s contacts. This way any mail from a contact whom the user knows very well is being displayed.

References

  1. Awad M., Foqaha M. Email spam classification using hybrid approach of RBF neural network and particle swarm optimization. Int. J. Netw. Secur. Appl. 2016;8(4) [Google Scholar]
  2. 2.Bouguila N., Amayri O. A discrete mixture-based kernel for SVMs: application to spam and image categorization. Inf. Process. Manag. 2009;45(6):631–642. [Google Scholar]
  3. Cao Y., Liao X., Li Y. International Symposium on Neural Networks. Springer Berlin Heidelberg; 2004. An e-mail filtering approach using neural network; pp. 688–694. [Google Scholar]
  4. Mason S. 2003. New Law Designed to Limit Amount of Spam in E-Mail.http://www.wral.com/technolog [Google Scholar]
  5. Wang X.L. 2005 International Conference on Machine Learning and Cybernetics (Vol. 9, pp. 5716-5719) IEEE; Aug 2005. Learning to classify email: a survey. [Google Scholar]
  6. Wang X.L. 2005 International Conference on Machine Learning and Cybernetics (Vol. 9, pp. 5716-5719) IEEE; Aug 2005. Learning to classify email: a survey. [Google Scholar]
  7. Cormack G.V. Email spam filtering: a systematic review. Found. Trends Inf. Retr. 2008;1(4):335–455. [Google Scholar]

Downloads

Published

2021-06-30

Issue

Section

Research Articles

How to Cite

[1]
Er.Sonal Chakole, Sarita Padole, Apurva Kamble, 4Vandana Wadekar, Ankit Dhande, " Content Based E-Mail Classification, International Journal of Scientific Research in Science, Engineering and Technology(IJSRSET), Print ISSN : 2395-1990, Online ISSN : 2394-4099, Volume 8, Issue 3, pp.141-144, May-June-2021. Available at doi : https://doi.org/10.32628/IJSRSET218323