Web Based Machine Learning Automated Pipeline

Prof. Sachin Sambhaji Patil; Mahesh Manohar Sirsat; Ajitkumar Vishwakarma Sharma; Aashish Shahi; Omkar Maruti Halgi

doi:10.32628/IJSRSET231035

Authors

Prof. Sachin Sambhaji Patil Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
Mahesh Manohar Sirsat Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
Ajitkumar Vishwakarma Sharma Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
Aashish Shahi Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India
Omkar Maruti Halgi Computer Engineering Department, Zeal College of Engineering and Research, Pune, Maharashtra, India

DOI:

https://doi.org/10.32628/IJSRSET231035

Keywords:

Dataset, Dataset Filtering, Client Server, Pdf Generation, Data Preprocessing.

Abstract

With the increasing volume, velocity, veracity, and variety of data, it has become critical to have efficient techniques and tools for managing and analyzing data in machine learning. Abstraction is a powerful concept that allows users to interact with machine learning algorithms without understanding their technical implementation details. In this project the user will provide the dataset in .csv format the dataset is then processed further to different machine learning preprocessing steps like removing unwanted columns, handling missing values, label encoding, outlier detection and removal, normalization, model building, model prediction, and the result can be downloaded as pdf, tracable pdf and CSV, this all processes gives a result of different model and their respective accuracy so that we can choose the best model for that particular dataset. tracable pdf will be containing all the timestamp of the processes done with their respective result, Apart from client-server model user is also provided a api so that all processes can be implemented in different platforms like c++, java, ruby etc. Overall, this paper highlights the critical role of abstraction in managing the complexity of data and machine learning algorithms, enabling more efficient and effective analysis of large and complex datasets.

References

I. F. Qayyum and D.-H. Kim, “A Survey of datasets, preprocesssing, modelling mechanisms,” 2022.
T. Petrou, “Pandas Cookbook”.
J. Grus, “Data Science from Scratch”.
IEEE, “A dataset of attributes from papers of a machine learning conference Algorithm,” 2019.
IEEE, “Missing Data Analysis in Regression,” 2022.
IEEE, “A survey on outlier explanations,” 2022.
S. Raschka and V. Mirjalili, “Python Machine Learning”.

Web Based Machine Learning Automated Pipeline

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite