Detecting Hate Speech in Tweets with Advanced Machine Learning Techniques

Authors

  • Dornipadu Karthika Chaitrika Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
  • Chillale Lalitha Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
  • Erthineni Gnanasai Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
  • Deshai Keerthi Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
  • K. Mudduswamy Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author

DOI:

https://doi.org/10.32628/IJSRSET2512312

Keywords:

Hate Speech Detection, Natural Language Processing (NLP), Machine Learning (ML), Decision Tree Classifier, Content Moderation

Abstract

Hate speech detection is a critical aspect of online content moderation, ensuring that digital platforms remain safe and inclusive. With the exponential rise of social media, harmful content such as hate speech and offensive language has increased, necessitating automated solutions for effective moderation. This project employs Natural Language Processing (NLP) and Machine Learning (ML) techniques to classify tweets into three categories: Hate Speech, Offensive Speech, and No Hate or Offensive Speech. By leveraging a Decision Tree Classifier, the system efficiently detects and categorizes harmful content while reducing manual intervention. The methodology involves data preprocessing, feature extraction using CountVectorizer, and training a classification model to achieve high accuracy. The proposed system overcomes the limitations of traditional keyword-based filtering by improving context awareness and scalability. The implementation is designed to process large volumes of data, making it highly suitable for real-world applications. This approach enhances digital safety, minimizes human effort in moderation, and ensures compliance with ethical standards. Future improvements may include the integration of deep learning models like LSTMs or Transformers and real-time social media API monitoring to enhance accuracy further. This project contributes to the growing need for robust and automated hate speech detection solutions in the digital era.

Downloads

Download data is not yet available.

References

Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 512-515.

Fortuna, P., &Nunes, S. (2018). A survey on automatic detection of hate speech in text. ACM Computing Surveys, 51(4), 85:1-85:30. https://doi.org/10.1145/3232676

Schmidt, A., &Wiegand, M. (2017). A survey on hate speech detection using natural language processing. Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media, 1-10.

Badjatiya, P., Gupta, S., Gupta, M., & Varma, V. (2017). Deep learning for hate speech detection in tweets. Proceedings of the 26th International Conference on World Wide Web Companion, 759-760.

Kwok, I., & Wang, Y. (2013). Locate the hate: Detecting tweets against blacks. Proceedings of the 27th AAAI Conference on Artificial Intelligence, 1621-1622.

Waseem, Z., &Hovy, D. (2016). Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. Proceedings of the NAACL Student Research Workshop, 88-93.

Founta, A., Chatzakou, D., Karakostas, A., &Vakali, A. (2019). A unified deep learning architecture for abuse detection. Proceedings of the ACM Transactions on Web, 13(3), 17:1-17:33. https://doi.org/10.1145/3341094

Nobata, C., Tetreault, J., Thomas, A., Mehdad, Y., & Chang, Y. (2016). Abusive language detection in online user content. Proceedings of the 25th International Conference on World Wide Web, 145-153.

Jangid, J., & Dixit, S. (2023). The AI Renaissance: Innovations, Ethics, and the Future of Intelligent Systems (Vol. 1). Technoscience Academy.

Mishra, P., Yannakoudakis, H., &Shutova, E. (2019). Tackling online abuse: A survey of automated abuse detection methods. Proceedings of the 27th International Conference on Computational Linguistics, 4155-4169.

Warner, W., & Hirschberg, J. (2012). Detecting hate speech on the world wide web. Proceedings of the Second Workshop on Language in Social Media, 19-26. .

Downloads

Published

09-05-2025

Issue

Section

Research Articles

How to Cite

[1]
Dornipadu Karthika Chaitrika, Chillale Lalitha, Erthineni Gnanasai, Deshai Keerthi, and K. Mudduswamy, “Detecting Hate Speech in Tweets with Advanced Machine Learning Techniques”, Int J Sci Res Sci Eng Technol, vol. 12, no. 3, pp. 49–55, May 2025, doi: 10.32628/IJSRSET2512312.

Similar Articles

1-10 of 203

You may also start an advanced similarity search for this article.