Web Based Machine Learning Automated Pipeline

Authors

  • Prof. Sachin Sambhaji Patil  Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
  • Mahesh Manohar Sirsat  Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
  • Ajitkumar Vishwakarma Sharma  Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
  • Aashish Shahi  Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
  • Omkar Maruti Halgi   Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India

DOI:

https://doi.org/10.32628/IJSRSET231035

Keywords:

Dataset, Dataset Filtering, Client Server, Pdf Generation, Data Preprocessing.

Abstract

With the increasing volume, velocity, veracity, and variety of data, it has become critical to have efficient techniques and tools for managing and analyzing data in machine learning. Abstraction is a powerful concept that allows users to interact with machine learning algorithms without understanding their technical implementation details. In this project the user will provide the dataset in .csv format the dataset is then processed further to different machine learning preprocessing steps like removing unwanted columns, handling missing values, label encoding, outlier detection and removal, normalization, model building, model prediction, and the result can be downloaded as pdf, tracable pdf and CSV, this all processes gives a result of different model and their respective accuracy so that we can choose the best model for that particular dataset. tracable pdf will be containing all the timestamp of the processes done with their respective result, Apart from client-server model user is also provided a api so that all processes can be implemented in different platforms like c++, java, ruby etc. Overall, this paper highlights the critical role of abstraction in managing the complexity of data and machine learning algorithms, enabling more efficient and effective analysis of large and complex datasets.

References

  1. I. F. Qayyum and D.-H. Kim, “A Survey of datasets, preprocesssing, modelling mechanisms,” 2022.
  2. T. Petrou, “Pandas Cookbook”.
  3. J. Grus, “Data Science from Scratch”.
  4. IEEE, “A dataset of attributes from papers of a machine learning conference Algorithm,” 2019.
  5. IEEE, “Missing Data Analysis in Regression,” 2022.
  6. IEEE, “A survey on outlier explanations,” 2022.
  7. S. Raschka and V. Mirjalili, “Python Machine Learning”.

Downloads

Published

2023-06-30

Issue

Section

Research Articles

How to Cite

[1]
Prof. Sachin Sambhaji Patil, Mahesh Manohar Sirsat, Ajitkumar Vishwakarma Sharma, Aashish Shahi, Omkar Maruti Halgi "Web Based Machine Learning Automated Pipeline" International Journal of Scientific Research in Science, Engineering and Technology (IJSRSET), Print ISSN : 2395-1990, Online ISSN : 2394-4099, Volume 10, Issue 3, pp.43-51, May-June-2023. Available at doi : https://doi.org/10.32628/IJSRSET231035