Predictive Analysis for Big Mart Sales Using ML Algorithms
Keywords:
Linear Regression, Polynomial Regression, Ridge Regression, Xgboost RegressionAbstract
Big Marts, which are distribution centers for supermarket chains, now keep tabs on sales volume and revenue numbers for each product to anticipate domestic consumption and adjust inventory control. Examining the data warehouse's server database often reveals inconsistencies and overarching patterns. Companies like Big Mart can use the data with a variety of machine learning techniques to predict future product sales. Many different machine learning algorithms, including Linear Regression, Ridge Regression, Lasso Regression, Decision Tree Regression, Random Forest Regression, Support Vector Regressor, Adaboost Regressor, and XGBoost Regression, have been employed in this project to forecast Big Mart product sales. We find that XGBoost Regression performs the best in predicting sales volume among the listed algorithms. To this end, we have developed a model with XGBoost Regression and optimized it for maximum precision. This model is available through a flask application; users simply log in, specify the product's parameters, and receive sales forecasts.
References
- Ching Wu Chu and Guoqiang Peter Zhang, A comparative study of linear and nonlinear models for aggregate retails sales forecasting, Int. Journal Production Economics, vol. 86, pp. 217231, 2003.
- Wang, Haoxiang. "Sustainable development and management in consumer electronics using soft computation." Journal of Soft Computing Paradigm (JSCP) 1, no. 01 (2019): 56.- 2. Suma, V., and Shavige Malleshwara Hills.
- Suma, V., and Shavige Malleshwara Hills. "Data Mining based Prediction of Demand in Indian Market for Refurbished Electronics." Journal of Soft Computing Paradigm (JSCP) 2, no. 02 (2020): 101110
- Giuseppe Nunnari, Valeria Nunnari, Forecasting Monthly Sales Retail Time Series: A Case Study, Proc. of IEEE Conf. on Business Informatics (CBI), July 2017.
- https://halobi.com/blog/sales-forecasting-five-uses/. [Accessed: Oct. 3, 2018]
- Zone-Ching Lin, Wen-Jang Wu, Multiple Linear Regression Analysis of the Overlay Accuracy Model Zone, IEEE Trans. on Semiconductor Manufacturing, vol. 12, no. 2, pp. 229 237, May1999.
- O. Ajao Isaac, A. Abdullahi Adedeji, I. Raji Ismail, Polynomial Regression Model of Making Cost Prediction In Mixed Cost Analysis, Int. Journal on Mathematical Theory and Modeling, vol. 2, no. 2, pp. 14 23, 2012.
- C. Saunders, A. Gammerman and V. Vovk, Ridge Regression Learning Algorithm in Dual Variables, Proc. of Int. Conf. on Machine Learning, pp. 515 521, July 1998.IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 56, NO. 7, JULY 2010 3561.
- Robust Regression and Lasso. Huan Xu, Constantine Caramanis, Member, IEEE, and Shie Mannor, Senior Member, IEEE. 2015 International Conference on Industrial Informatics-Computing Technology, Intelligent Technology, Industrial Information Integration.An improved Adaboost algorithm based on uncertain functions.Shu Xinqing School of Automation Wuhan University of Technology.Wuhan, China Wang Pan School of the Automation Wuhan University of Technology Wuhan, China.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRSET

This work is licensed under a Creative Commons Attribution 4.0 International License.