Online Writeup Categorization using a Machine Learning Strategy
DOI:
https://doi.org/10.32628/IJSRSET24115118Keywords:
News Articles, Machine Learning, Classification of News, Support Vector Machine, Logistic RegressionAbstract
A variety of online sources produces a huge amount of daily news; thus, it is important to categorize the news items to make the information accessible to consumers easily and quickly scraping is used to gather existing news items from news websites and then categorize them automatically using a variety of classification algorithms. As a result, news categorization is a technique for discovering untracked news themes and providing specific recommendations based on the user's historical interests. The BBC News dataset, which includes articles from five different categories including Business, Entertainment, Politics, Sport, and Technology, is used in this task to discuss various steps in news classification and implement a few algorithmic approaches such as Naive Bayes, Binary Classifier, SVM, Perceptron, and SGD. In the study, results from several classification algorithms are examined, and their accuracy is measured.
Downloads
References
Kaur, Gurmeet, and Karan Bajaj (2016). “News Classification and Its Techniques: A Review”, IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN 2278-0661.
Kaur, Sandeep, and Navdeep Kaur Khiva (2016). “Online news classification using Deep Learning Technique”, International Research Journal of Engineering and Technology (IRJET) 3.10.
Dr. R. R. Deshmukh, Mr. D. K. Kirange (2013). “Classifying News Headlines for Providing User-Centered E-Newspaper Using SVM”, International Journal of Emerging Trends & Technology in Computer Science (IJETTCS) Volume 2, Issue 3.
Zach CHASE, Nicolas Genain, and OrrenKarniol-Tambour (2014). “Learning Multi-Label Topic Classification of News Articles”.
Rana, Mazhar Iqbal, Shehzad Khalid, and Muhammad Usman Akbar (2014). “News classification based on their headlines: A review”, 17th IEEE International Multi-Topic Conference. DOI: https://doi.org/10.1109/INMIC.2014.7097339
Kowsari, Kamran (2019). “Text classification algorithms: A survey”. DOI: https://doi.org/10.3390/info10040150
B. Marakarkandy, B. Nemade, S. Kelkar, P. V. Chandrika, V. A. Shirsath, and M. Mali, "Enhancing Multi-Channel Consumer Behavior Analysis: A Data-Driven Approach using the Optimized Apriori Algorithm," Journal of Electrical Systems, vol. 20, no. 2s, pp. 700–708, 2024. DOI: https://doi.org/10.52783/jes.1536
B. Nemade, N. Phadnis, A. Desai, and K. K. Mungekar, "Enhancing connectivity and intelligence through embedded Internet of Things devices," ICTACT Journal on Microelectronics, vol. 9, no. 4, pp. 1670-1674, Jan. 2024, doi: 10.21917/ijme.2024.0289.
Sembodo, Jaka E., Erwin B. Setiawan, and Moch Arif Bijaksana (2018). “Automatic Tweet Classification Based on News Category in the Indonesian Language”, 6th International Conference on Information and Communication Technology (ICOICT). DOI: https://doi.org/10.1109/ICoICT.2018.8528788
Downloads
Published
Issue
Section
License
Copyright (c) 2024 International Journal of Scientific Research in Science, Engineering and Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.