Detecting Hate Speech in Tweets with Advanced Machine Learning Techniques

Dornipadu Karthika Chaitrika; Chillale Lalitha; Erthineni Gnanasai; Deshai Keerthi; K. Mudduswamy

doi:10.32628/IJSRSET2512312

Authors

Dornipadu Karthika Chaitrika Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
Chillale Lalitha Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
Erthineni Gnanasai Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
Deshai Keerthi Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author
K. Mudduswamy Department of Artificial Intelligence and Machine Learning, Dr K V Subba Reddy Institute of Technology, Kurnool, Andhra Pradesh, India Author

DOI:

https://doi.org/10.32628/IJSRSET2512312

Keywords:

Hate Speech Detection, Natural Language Processing (NLP), Machine Learning (ML), Decision Tree Classifier, Content Moderation

Abstract

Hate speech detection is a critical aspect of online content moderation, ensuring that digital platforms remain safe and inclusive. With the exponential rise of social media, harmful content such as hate speech and offensive language has increased, necessitating automated solutions for effective moderation. This project employs Natural Language Processing (NLP) and Machine Learning (ML) techniques to classify tweets into three categories: Hate Speech, Offensive Speech, and No Hate or Offensive Speech. By leveraging a Decision Tree Classifier, the system efficiently detects and categorizes harmful content while reducing manual intervention. The methodology involves data preprocessing, feature extraction using CountVectorizer, and training a classification model to achieve high accuracy. The proposed system overcomes the limitations of traditional keyword-based filtering by improving context awareness and scalability. The implementation is designed to process large volumes of data, making it highly suitable for real-world applications. This approach enhances digital safety, minimizes human effort in moderation, and ensures compliance with ethical standards. Future improvements may include the integration of deep learning models like LSTMs or Transformers and real-time social media API monitoring to enhance accuracy further. This project contributes to the growing need for robust and automated hate speech detection solutions in the digital era.

📊 Article Downloads

References

Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 512-515.

Fortuna, P., &Nunes, S. (2018). A survey on automatic detection of hate speech in text. ACM Computing Surveys, 51(4), 85:1-85:30. https://doi.org/10.1145/3232676

Schmidt, A., &Wiegand, M. (2017). A survey on hate speech detection using natural language processing. Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media, 1-10.

Badjatiya, P., Gupta, S., Gupta, M., & Varma, V. (2017). Deep learning for hate speech detection in tweets. Proceedings of the 26th International Conference on World Wide Web Companion, 759-760.

Kwok, I., & Wang, Y. (2013). Locate the hate: Detecting tweets against blacks. Proceedings of the 27th AAAI Conference on Artificial Intelligence, 1621-1622.

Waseem, Z., &Hovy, D. (2016). Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. Proceedings of the NAACL Student Research Workshop, 88-93.

Founta, A., Chatzakou, D., Karakostas, A., &Vakali, A. (2019). A unified deep learning architecture for abuse detection. Proceedings of the ACM Transactions on Web, 13(3), 17:1-17:33. https://doi.org/10.1145/3341094

Nobata, C., Tetreault, J., Thomas, A., Mehdad, Y., & Chang, Y. (2016). Abusive language detection in online user content. Proceedings of the 25th International Conference on World Wide Web, 145-153.

Jangid, J., & Dixit, S. (2023). The AI Renaissance: Innovations, Ethics, and the Future of Intelligent Systems (Vol. 1). Technoscience Academy.

Mishra, P., Yannakoudakis, H., &Shutova, E. (2019). Tackling online abuse: A survey of automated abuse detection methods. Proceedings of the 27th International Conference on Computational Linguistics, 4155-4169.

Warner, W., & Hirschberg, J. (2012). Detecting hate speech on the world wide web. Proceedings of the Second Workshop on Language in Social Media, 19-26. .

Detecting Hate Speech in Tweets with Advanced Machine Learning Techniques

Authors

DOI:

Keywords:

Abstract

📊 Article Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

IssueDate

RightSideBlock

Latest publications