AutoML Platform for Comparative Analysis of Machine Learning Models

doi:10.21203/rs.3.rs-4363855/v1

Download PDF

Research Article

AutoML Platform for Comparative Analysis of Machine Learning Models

https://doi.org/10.21203/rs.3.rs-4363855/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Automated Machine Learning (AutoML) platforms have emerged as indispensable tools in facilitating efficient algorithm selection for diverse machine learning tasks. In this study, we introduce a novel AutoML platform designed to empower users with seamless comparative analysis of machine learning algorithms. Our platform offers a user-friendly interface, guiding users through the process of uploading datasets, selecting algorithms, and evaluating performance metrics. Leveraging automation and in-depth analysis, users can effortlessly compare the performance of two selected algorithms, gaining insights into their data-driven projects. Through visualization tools and explainability mechanisms, our platform aids users in making informed decisions for optimal algorithm selection. By addressing the complexities of algorithm selection and enhancing accessibility, our AutoML platform contributes to advancing data-driven decision-making across various domains.

Automated Machine Learning

Machine Learning Algorithms

Algorithm Comparison

Data- Driven Decision-Making

Model Performance

Exploratory Data Analysis

Data Visualization

Machine learning has revolutionized numerous domains by providing powerful tools to analyze and extract insights from data. Central to the success of machine learning projects is the selection of appropriate algorithms tailored to the specific characteristics of the data at hand. However, with the proliferation of machine learning techniques, choosing the most suitable algorithm has become increasingly complex and consequential.

To address this challenge, Automated Machine Learning (AutoML) platforms have emerged as essential resources for practitioners and researchers alike. These platforms streamline the process of algorithm selection by automating tasks such as data preprocessing, model training, and evaluation. By providing users with the ability to compare the performance of different algorithms on their datasets, AutoML platforms facilitate informed decision-making and accelerate the development of effective machine learning models.

In this paper, we present a novel AutoML platform designed to empower users with the capability to effortlessly compare the performance of two selected machine learning algorithms. Our platform offers a user-friendly interface that guides users through the entire process, from uploading their datasets to analyzing the results. By leveraging automation, accessibility, and in-depth analysis, our platform enables users to gain insights into how different algorithms impact their data and make informed decisions to enhance the effectiveness of their data-driven projects.

The significance of our AutoML platform lies in its ability to democratize access to advanced machine learning tools and drive innovation in artificial intelligence. By simplifying the comparative analysis of machine learning models, our platform empowers users, regardless of their expertise in machine learning, to navigate the complexities of algorithm selection and accelerate the development of impactful machine learning solutions. Through this paper, we aim to showcase the capabilities of our AutoML platform and its potential to revolutionize the field of machine learning research and application.

Automated Machine Learning (AutoML) has emerged as a pivotal advancement in the field of machine learning, aiming to streamline and automate the complex process of model development. In this paper, we provide a comprehensive overview of the current state and recent advancements in AutoML, exploring its significance, practical applications, methodologies, and empirical findings from recent studies.

A. "Automated Machine Learning: The New Wave of Machine Learning" [1] presents a detailed survey of AutoML, segmenting the AutoML pipeline and reviewing contributions in each segment. It evaluates state-of-the-art AutoML tools and explores advancements often overshadowed by deep learning, offering insights into its application in the insurance industry and summarizing various AutoML frameworks and tools available in the market.

B. "Automated Machine Learning in Practice: State of the Art and Recent Results" [2] delves into practical applications and recent benchmark results of various AutoML algorithms. It highlights the impact of AutoML in real-world scenarios across domains like predictive maintenance and healthcare, while also discussing feature engineering, meta-learning, and architecture search methods.

C. "Evolving Fully Automated Machine Learning via Life-Long Knowledge Anchors" [3] introduces an AutoML framework that utilizes evolutionary algorithms and knowledge anchors to automate machine learning pipeline creation. It discusses components such as NAUC, ALC, and time normalization, showcasing performance across different modalities and datasets.

D. "A Review on Automated Machine Learning (AutoML) Systems" [4] explores the need for standardized documentation and evaluation of AutoML approaches. It categorizes AutoML into fully automated and semi-automated approaches, discussing essential components and commercial systems like Auto-WEKA and TPOT.

E. "Efficient and Robust Automated Machine Learning" [5] evaluates the strengths and limitations of AUTO-SKLEARN, comparing it with other AutoML systems and conducting detailed analyses of individual classifiers and preprocessors.

F. "A Unified Framework for Automatic Distributed Active Learning" [6] introduces AutoDAL, focusing on improving classification accuracy in active learning, especially with imbalanced datasets. It discusses loss functions, adaptation strategies, and extension to hyperparameter tuning, showcasing its effectiveness across different data modalities.

G. "D-SmartML: A Distributed Automated Machine Learning Framework" [7] presents D- SmartML, a distributed AutoML framework on Apache Spark, emphasizing its scalability and performance compared to TransmogrifAI.

H. "Adaptation Strategies for Automated Machine Learning on Evolving Data" [8] evaluates how AutoML methods handle concept drift in evolving data streams and proposes adaptation strategies to enhance their robustness, discussing various adaptation strategies and their effectiveness.

I. "An Empirical Study on the Usage of Automated Machine Learning Tools" [9] analyzes the popularity and usage patterns of AutoML tools in GitHub projects, discussing their purposes and common combinations.

J. "Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools" [10] provides a comprehensive evaluation of various AutoML tools across different datasets and tasks, covering functionalities, experimental evaluations, and tool performance.

Some common themes and highlights across the summaries include:

1. Diverse Applications: AutoML finds applications in various domains such as healthcare, finance, manufacturing, and more. Its ability to automate tedious tasks and streamline model development processes makes it valuable across industries.

2. Efficiency and Accessibility: One of the primary goals of AutoML is to make machine learning more accessible to individuals without extensive expertise in the field. By automating tasks like model selection, hyperparameter tuning, and feature engineering, AutoML tools aim to reduce the time and expertise required to build effective machine learning models.

3. Performance Evaluation: Evaluating the performance of AutoML approaches presents challenges due to the complexity of the search space, lack of standardization in evaluation procedures, and the need for benchmark datasets. Nonetheless, researchers are actively working on developing standardized evaluation procedures and benchmark datasets to facilitate comparisons across different AutoML methods.

4. Meta-Learning and Optimization Techniques: Many AutoML approaches leverage meta- learning and optimization techniques such as Bayesian optimization, evolutionary algorithms, and reinforcement learning to automate the process of model selection, hyperparameter tuning, and feature engineering.

5. Model Interpretability and Transparency*: Some AutoML frameworks emphasize model interpretability and transparency, providing insights into model performance, hyperparameters, feature importance, and prediction explanations. This is particularly important in domains where interpretability is crucial, such as healthcare and finance.

6. Challenges and Future Directions: Despite the progress made in AutoML research, there are still challenges to overcome, including scalability, robustness, interpretability, and generalization to unseen datasets. Future research directions include addressing these challenges, developing more efficient optimization algorithms, and advancing the state-of-the- art in AutoML.

Overall, these summaries provide valuable insights into the current state of AutoML research, its applications, challenges, and future directions. As AutoML continues to evolve, it is likely to play an increasingly important role in democratizing machine learning and accelerating the development of AI systems.

The AutoML platform simplifies the comparative analysis of ML models through automation. It handles data preprocessing tasks like handling missing data and optimizing features automatically. Users can choose from a variety of ML algorithms, benefiting from automatic hyperparameter tuning. The platform streamlines model training and testing processes, ensuring robustness through cross-validation techniques. It provides comprehensive comparisons of model performance metrics, aiding in informed decision-making. Interactive visualization tools offer insights into model strengths and weaknesses. With scalability and efficiency at its core, the platform accommodates large datasets seamlessly. Its user-friendly interface makes it accessible to users of all skill levels. Customization options allow users to use the platform according to their specific needs. Overall, the AutoML platform offers a powerful solution for efficient and effective ML model comparison.

The AutoML platform's architecture is a well-structured framework comprising several interlinked components, each playing a crucial role in facilitating the comparative analysis of machine learning (ML) models. At its core, the platform includes modules dedicated to data preprocessing, model selection, hyperparameter tuning, model training, evaluation, comparison generation, visualization, and user interface. The data preprocessing module ensures data integrity and quality by handling tasks like cleaning, normalization, and feature engineering. Model selection allows users to choose from a list of ML algorithms, while the hyperparameter tuning module optimizes model performance. Subsequently, the model training module trains selected models on pre-processed data, and the evaluation module assesses their performance using various metrics. The comparison generation module generates in-depth comparisons, supports users in informed decision-making. The visualization and reporting module offer interactive tools for visualizing results, while the user interface provides an intuitive platform for seamless interaction. Scalability and efficiency are ensured through parallel processing techniques, and the platform's extensibility allows users to incorporate custom algorithms and techniques. Altogether, the AutoML platform's architecture offers a robust and versatile solution for conducting thorough comparative analysis of ML models, enabling users to navigate and harness the complexities of machine learning effectively.

In the manual testing for ML models, users initiate the process by specifying target variables relevant to their analysis, a critical step stated by the objectives of the study and the domain's complexities. Once targets are identified, data preprocessing takes precedence, including tasks such as handling missing values, outliers, and feature engineering. This phase demands meticulous attention to detail as users clean and refine the dataset to ensure its integrity and reliability. Subsequently, users manually select ML models for comparison, drawing upon their understanding of the problem domain and the dataset's characteristics. This selection process involves weighing factors like model complexity, interpretability, and computational

efficiency. Following model selection, users undertake manual evaluation, where they rigorously assess the performance of each model using a range of metrics tailored to the problem at hand. This evaluation phase not only demands quantitative analysis but also qualitative considerations, including model interpretability and suitability for real-world deployment. Through a systematic approach guided by expertise and domain knowledge, manual testing allows users to make informed decisions about model selection and performance assessment, thereby advancing the field of machine learning research and application.

In automated testing of ML models, a systematic approach to data preprocessing, model training, evaluation, and comparison generation is conducted. Automated data cleaning is executed efficiently within the system, utilizing predefined algorithms to handle missing values, outliers, and feature engineering tasks. Model training is conducted automatically, with the system selecting appropriate ML algorithms and optimizing their performance through hyperparameter tuning techniques. Following training, automated evaluation processes assess model performance using predefined metrics, ensuring robustness and generalization. Finally, the system generates comprehensive comparisons between ML models, presenting insights into their relative strengths and weaknesses. Through automation, the system enhances efficiency and objectivity, empowering users to make informed decisions about model selection and deployment. This streamlined methodology accelerates the ML workflow, enabling users to derive actionable insights and drive impactful results in their data-driven endeavours.

In our experimental setup, we meticulously structured a framework allowing customers to upload their own diverse datasets, including real-world data, to ensure thorough evaluation. Using stratified sampling, we partitioned the data into training, validation, and testing sets for

robust model assessment. The manual testing system featured a range of ML models, manually fine-tuning hyperparameters and employing standard evaluation metrics for consistency. Simultaneously, the automated testing system, powered by leading AutoML frameworks, autonomously executed data preprocessing, model selection, and evaluation. High- performance computing clusters for efficient execution, supported by Python and key libraries for model development. Our setup aimed to provide a thorough comparison between manual and automated testing, exploring efficacy and scalability. By following the standardized protocols and using appropriate configurations, we ensured reliable and reproducible experimentation.

Evaluation metrics are important in measuring the performance of machine learning models across diverse tasks. Key metrics such as accuracy, precision, recall, F1 score, specificity, and ROC-AUC offer nuanced insights into model effectiveness. Accuracy provides a basic measure of correctness, while precision focuses on minimizing false positives. Recall emphasizes capturing all positive instances, even at the expense of higher false positives. The F1 score balances precision and recall, offering a comprehensive metric of overall performance. Specificity complements recall by evaluating the model's ability to minimize false alarms. ROC-AUC quantifies the model's discrimination ability across different threshold settings. These metrics are chosen for their interpretability, relevance, and ability to guide decision- making in real-world applications.

To validate the results obtained from both manual and automated testing systems, a systematic procedure is followed, ensuring the reliability and credibility of the findings. Cross-validation techniques are employed to assess the robustness and generalization of the models. This involves dividing the dataset into subsets, training the models on a subset, and evaluating their performance on the remaining data, repeated multiple times to ensure thorough assessment. Statistical analyses, including hypothesis testing and effect size estimation, are conducted to compare the performance metrics between manual and automated systems, determining their significance and practical implementation. Furthermore, validation procedures verify the consistency of results against ground truth labels or expert judgments, ensuring alignment with the study objectives. By comparing findings and investigating differences, researchers can gain confidence in the validity of the evaluation outcomes, facilitating informed decision-making in model selection and deployment.

The deployment of automated systems for ML model testing and comparison necessitates a keen awareness of ethical implications to uphold fairness and transparency. Concerns about potential biases embedded in both data and algorithms must be carefully addressed. Strategies for bias detection and mitigation are essential to rectify biases that could perpetuate discrimination or exacerbate inequalities. Ensuring diverse representation in datasets and development teams can help mitigate biases and enhance fairness. Establishing transparent processes and accountability mechanisms promotes ethical integrity and enables stakeholder scrutiny. Continuous monitoring and evaluation are crucial to identify and address biases promptly, ensuring ethical compliance throughout the system's lifecycle. Adherence to established ethical guidelines and standards is fundamental in upholding ethical principles and protecting individuals' rights. By proactively addressing ethical considerations, researchers can foster responsible and ethical practices in ML model testing and comparison.

The AutoML platform and methods used in the research offer significant advancements, yet encounter notable limitations and challenges. Algorithmic constraints may limit its applicability to complex ML tasks. Quality and availability of data pose challenges, impacting the platform's effectiveness. Substantial computational resources required hinder scalability, particularly for large datasets. Interpreting outcomes can be difficult due to lack of interpretability. Human action may still be necessary, introducing potential biases. Selection of appropriate evaluation metrics may not fully capture model fine points. Ensuring validity and generalization across datasets remains challenging. Ongoing research is needed to enhance platform capabilities and robustness. A multidisciplinary approach is crucial for addressing complex challenges and ensuring responsible deployment.

Future research and enhancement in the AutoML platform can focus on several critical areas to bolster its capabilities and address current limitations. Expanding support for a wider array of ML algorithms would enhance diversity, while advanced techniques for hyperparameter optimization could refine model performance. Improved robustness to noisy data and enhanced interpretability features would ensure more reliable and transparent outcomes. Automated bias detection and mitigation methods are essential for promoting fairness and inclusivity. Scalability and efficiency improvements are crucial for handling large datasets and complex models effectively. Integrating domain-specific knowledge would tailor recommendations to specific application domains. Continuous learning mechanisms would ensure the platform remains adaptive to evolving data and requirements, encouraging its long-term relevance and utility. Through these avenues, the AutoML platform can evolve into a more powerful and adaptable tool for streamlined ML model testing and comparison, driving innovation in AI research and applications.

Table I and Table III shows Evaluation metrics for measuring performance of machine learning models using our platform. Table II and Table IV shows Evaluation metrics for measuring performance of machine learning models by manual coding.

Table I: Automate Classification

Models	Accuracy	AUC	Recall	Precision	F1	Kappa	MCC	TT (sec)
CatBoost Classifier	0.7709	0.8326	0.6152	0.6954	0.6494	0.4809	0.4854	1.983
Gradient Boosting Classifier	0.769	0.84	0.6254	0.6927	0.6493	0.4794	0.4862	0.056
Logistic Regression	0.7672	0.8313	0.5781	0.7108	0.6286	0.4637	0.4749	1.395
Linear Discriminant Analysis	0.7652	0.8303	0.5775	0.7059	0.6281	0.4604	0.4705	0.011
Ridge Classifier	0.7615	0	0.5667	0.7011	0.6187	0.4497	0.4606	0.014
Extra Trees Classifier	0.7559	0.8192	0.583	0.6769	0.6228	0.4449	0.4501	0.085
Naive Bayes	0.7542	0.8283	0.5892	0.6756	0.6235	0.4435	0.45	0.011
Random Forest Classifier	0.7522	0.8243	0.5827	0.6743	0.6213	0.4391	0.4445	0.105
Light Gradient Boosting Machine	0.7521	0.8135	0.631	0.6542	0.6381	0.4507	0.4541	0.19
Extreme Gradient Boosting	0.7502	0.8085	0.614	0.6493	0.6299	0.4419	0.4433	0.215
Ada Boost Classifier	0.7353	0.7971	0.5661	0.6407	0.5969	0.402	0.4064	0.041
K Neighbors Classifier	0.7297	0.7524	0.5766	0.6173	0.5937	0.3926	0.3946	0.023
Decision Tree Classifier	0.7226	0.6983	0.6167	0.6116	0.6036	0.3928	0.4002	0.014
Quadratic Discriminant Analysis	0.7187	0.8119	0.5515	0.6097	0.5725	0.3661	0.3708	0.012
Dummy Classifier	0.6518	0.5	0	0	0	0	0	0.015
SVM - Linear Kernel	0.5773	0	0.3219	0.5163	0.2565	0.0415	0.0918	0.016

Table II: Manual Classification

Models	Accuracy	AUC	Recall	Precision	F1	Kappa	MCC	TT (sec)
Gradient Boosting Classifier	0.7467	0.7303	0.6727	0.6379	0.6548	0.4550	0.4554	0.170
Logistic Regression	0.7467	0.7303	0.6727	0.6379	0.6548	0.4550	0.4554	0.020
Linear Discriminant Analysis	0.7597	0.7404	0.6727	0.6607	0.6666	0.4788	0.4789	0.003
Ridge Classifier	0.7597	0.7404	0.6727	0.6607	0.6666	0.4788	0.4789	0.003
Extra Trees Classifier	0.7207	0.6898	0.5818	0.6153	0.5981	0.3844	0.3848	0.139
Naive Bayes	0.7662	0.7535	0.7090	0.6610	0.6842	0.4990	0.4997	0.001
Random Forest Classifier	0.7532	0.7353	0.6727	0.6491	0.6607	0.4669	0.4671	0.187
Light Gradient Boosting Machine	0.7207	0.7141	0.6909	0.5937	0.6386	0.4132	0.4164	0.196
Extreme Gradient Boosting	0.7077	0.6959	0.6545	0.5806	0.6153	0.3811	0.3829	0.141
Ada Boost Classifier	0.7337	0.7121	0.6363	0.625	0.6306	0.4225	0.4225	0.121
K Neighbors Classifier	0.6623	0.6444	0.5818	0.5245	0.5517	0.2820	0.2830	0.002
Decision Tree Classifier	0.7337	0.7282	0.7090	0.6093	0.6554	0.4405	0.4439	0.003
Quadratic Discriminant Analysis	0.7792	0.7636	0.7090	0.6842	0.6964	0.5230	0.5232	0.002
Dummy Classifier	0.6428	0.5	0	0	0	0	0	0
SVM - Linear Kernel	0.7532	0.7313	0.6545	0.6545	0.6545	0.4626	0.4626	3.437

Table III: Automate Regression

Models	MAE	MSE	RMSE	RMSLE	MAPE	TT (sec)
Linear Regression	5.4036	52.4600	7.2429	0.2963	16.871	1.257
Lasso Regression	5.2075	52.3334	7.2341	0.2956	16.2587	0.061
Ridge Regression	5.4000	52.4308	7.2409	0.2963	16.8597	0.045
Elastic Net	5.2080	52.1438	7.2210	0.2949	16.2603	0.04
Least Angle Regression	5.3964	52.4018	7.2389	0.2934	16.8483	0.041
Lasso Least Angle Regression	5.2075	52.3335	7.2341	0.2956	16.2587	0.05
Orthogonal Matching Pursuit	5.8425	68.1528	8.2554	0.3498	18.2412	0.043
Bayesian Ridge	5.3270	51.9136	7.2051	0.2956	16.6317	0.041
Passive Aggressive Regressor	5.9685	68.4285	8.2721	0.2134	18.6347	0.046
Huber Regressor	5.6921	54.4686	7.3802	0.2597	17.7716	0.073
K Neighbors Regressor	5.5046	52.7471	7.2627	0.2688	17.1864	0.069
Decision Tree Regressor	6.4454	73.5216	8.5744	0.1996	20.1236	0.038
Random Forest Regressor	5.2289	49.3775	7.0269	0.2785	16.3254	0.105
Extra Trees Regressor	5.1889	47.9954	6.9278	0.2628	16.2006	0.094
AdaBoost Regressor	4.9074	44.4848	6.6696	0.2564	15.3217	0.066
Gradient Boosting Regressor	5.1222	49.3154	7.0224	0.2828	15.9924	0.057
Extreme Gradient Boosting	5.4607	50.9143	7.1354	0.2864	17.0491	0.249
Light Gradient Boosting Machine	5.4666	53.6933	7.3275	0.2954	17.0676	0.14
CatBoost Regressor	5.1886	48.4705	6.9620	0.2836	16.1997	0.811
Dummy Regressor	6.0567	71.0875	8.4313	0.3569	18.9099	0.044

Table IV: Manual Regression

Models	MAE	MSE	RMSE	RMSLE	MAPE	TT (sec)
Linear Regression	5.403649	52.46006	7.242932	0.296367	16.871	0.007
Lasso Regression	5.207542	52.33341	7.234184	0.295612	16.25872	0.005
Ridge Regression	5.400056	52.43088	7.240917	0.296331	16.85978	0.002
Elastic Net	5.208069	52.14381	7.221067	0.294989	16.26037	0.001
Least Angle Regression	5.3964	52.40185	7.238912	0.293444	16.84836	0.018
Lasso Least Angle Regression	5.207544	52.33357	7.234194	0.295613	16.25873	0.002
Orthogonal Matching Pursuit	5.842533	68.15282	8.255472	0.349847	18.24126	0.0009
Bayesian Ridge	5.32702	51.91367	7.205114	0.295636	16.63175	0.002
Passive Aggressive Regressor	6.030276	69.77136	8.352925	0.230103	18.82742	0.006
Huber Regressor	5.69213	54.4686	7.380284	0.259756	17.77168	0.045
K Neighbors Regressor	5.504675	52.74719	7.262726	0.268896	17.18642	0.002
Decision Tree Regressor	6.703896	80.80506	8.989164	0.275024	20.93056	0.005
Random Forest Regressor	5.209058	48.62507	6.973168	0.278332	16.26346	0.488
Extra Trees Regressor	5.167299	46.55768	6.823319	0.261615	16.13308	0.270
AdaBoost Regressor	4.778158	42.0025	6.480934	0.245371	14.91812	0.066
Gradient Boosting Regressor	5.180781	51.20314	7.155637	0.291129	16.17517	0.170
Extreme Gradient Boosting	5.460706	50.91434	7.135428	0.286486	17.04914	0.228
Light Gradient Boosting Machine	5.466648	53.69331	7.327572	0.295453	17.06769	0.308
CatBoost Regressor	5.188665	48.47051	6.962076	0.28362	16.19978	3.579
Dummy Regressor	6.056705	71.08755	8.431343	0.356975	18.90994	0.001

In conclusion, our AutoML platform represents a significant advancement in machine learning research, offering streamlined model testing and comparison processes. By creating both manual and automated systems, we've showcased the efficiency gains and objectivity that automation brings. Our automated system not only accelerates testing but also minimizes errors and bias, empowering researchers to make informed decisions. Future improvements may focus on expanding algorithm support and enhancing interpretability, furthering the platform's utility and impact in the field. Ultimately, our AutoML platform holds promise for democratizing access to advanced machine learning tools and driving innovation in artificial intelligence.

Author Contribution

S.P, S.W., M.Y. and S.Y. conceived and designed the study. M.Y. and S.Y. conducted the experiments. S.S. analyzed the data. S.P. and S.W. contributed materials and tools. M.Y. and S.Y. wrote the main manuscript text. All authors reviewed and approved the final manuscript.

Karansingh Chauhan1, Shreena Jani1, Dhrumin Thakkar1, Riddham Dave1,Jitendra Bhatia1, Sudeep Tanwar2, Mohammad S. Obaidat, Fellow of IEEE and Fellow of SCS3: Automated Machine Learning: The New Wave of Machine Learning in IEEE Xplore Part Number: CFP20K58-ART; ISBN: 978-1-7281- 4167-1
Lukas Tuggener1,2, Mohammadreza Amirian1,3, Katharina Rombach1, Stefan L¨orwald4, Anastasia Varlet4, Christian Westermann4, and Thilo Stadelmann1: Automated Machine Learning in Practice: State of the Art and Recent Results in 2019 6th Swiss Conference on Data Science (SDS)
Xiawu Zheng, Yang Zhang, Sirui Hong, Huixia Li, Lang Tang, Youcheng Xiong, Jin Zhou, Yan Wang ,Xiaoshuai Sun, Member, IEEE, Pengfei Zhu, Chenglin Wu, and Rongrong Ji, Senior Member, IEEE: Evolving Fully Automated Machine Learning via Life-Long Knowledge Anchors in IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 43, NO. 9, SEPTEMBER 2021
Thiloshon Nagarajah University of Westminster, Guhanathan Poravi Informatics Institute of Technology: A Review on Automated Machine Learning (AutoML) Systems in 2019 IEEE 5th International Conference for Convergence in Technology (I2CT)DOI:10.1109/I2CT45611.2019 29–31 March 2019
Matthias Feure,r Aaron Klein, Katharina Eggensperger Department of Computer Science University of Freiburg, Germany: Efficient and Robust Automated Machine Learning in NIPS'15: Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 2December 2015Pages 2755–2763
Ahmed Abd Elrahman, Mohamed El Helw Nile University Giza, Egypt Radwa Elshawi, Sherif Sakr University of Tartu Tartu, Estonia: D-SmartML: A Distributed Automated Machine Learning Framework in 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS)
Xu Chen and Brett Wujek: A Unified Framework for Automatic Distributed Active Learning in IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 44, NO. 12, DECEMBER 2022
Bilge Celik and Joaquin Vanschoren:Adaptation Strategies for Automated Machine Learning on Evolving Data in IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 43, NO. 9, SEPTEMBER 2021
Forough Majidi†, Moses Openja†, Foutse Khomh†, Heng Li†: An Empirical Study on the Usage of Automated Machine Learning Tools in arXiv:2208.13116v1 [cs.SE] 28 Aug 2022
Anh Truong∗, Austin Walters∗, Jeremy Goodsitt∗, Keegan Hines∗, C. Bayan Bruss∗, Reza Farivar∗: Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools in 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)
Lu´ıs Ferreira, Andr´e Pilastri, Carlos Manuel Martins, Pedro Miguel Pires, Paulo Cortez: A Comparison of AutoML Tools for Machine Learning, Deep Learning and XGBoost in 2021 International Joint Conference on Neural Networks (IJCNN) DOI: 10.1109/IJCNN52387.2021 18–22 July 2021
Nick Erickson, Jonas Mueller, Alexander Shirkov, Hang Zhang, Pedro Larroy, Mu Li, Alexander Smola: AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data in 7th ICML Workshop on Automated Machine Learning (2020)
DOMINIK KREUZBERGER, NIKLAS KÜHL AND SEBASTIAN HIRSCHL: Machine Learning Operations (MLOps): Overview, Definition, and Architecture in IEEE Access (Volume: 11) DOI: 10.1109/ACCESS.2023.3262138 Date of Publication: 27 March 2023
Felix Mohr, Marcel Wever, Alexander Tornede, and Eyke H€ullermeier: Predicting Machine Learning Pipeline Runtimes in the Context of Automated Machine Learning in IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 43, NO. 9, SEPTEMBER 2021
Duc Anh Nguyen Anna V. Kononova, Stefan Menzelx, Bernhard Sendhoffx, and Thomas Bäck, Leiden Institute of Advanced Computer Science (LIACS), Leiden University, The Netherlands: Efficient AutoML via Combinational Sampling in 2021 IEEE Symposium Series on Computational Intelligence (SSCI) DOI: 10.1109/SSCI50451.2021 5–7 Dec. 2021
DUC ANH NGUYEN 1, ANNA V. KONONOVA, STEFAN MENZEL, BERNHARD SENDHOFF AND THOMAS BÄCK: An Efficient Contesting Procedure for AutoML Optimization in IEEE Access ( Volume: 10) DOI: 10.1109/ACCESS.2022.3192036 Date of Publication: 18 July 2022
Jack Parker-Holder, Raghu Rajan rajanr, Xingyou Song, André Biedenkapp, Yingjie Miao, Theresa Eimer, Baohe Zhang, Vu Nguyen, Roberto Calandra, Aleksandra Faust, Frank Hutter, Marius Lindauer: Automated Reinforcement Learning (AutoRL): A Survey and Open Problems in Journal of Artificial Intelligence Research 74 (2022) 517–568 Submitted 01/2022; published 06/2022
Moncef Garouani, Adeel Ahmad, Mourad Bouneffa, Mohamed Hamlich: AMLBID: An auto-explained Automated Machine Learning tool for Big Industrial Data in ELSEVIER SoftwareX 17 (2022) 100919
MarcelWever, Alexander Tornede, Felix Mohr, and Eyke H€ullermeier, Senior Member, IEEE: AutoML for Multi-Label Classification: Overview and Empirical Evaluation in IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 43, NO. 9, SEPTEMBER 2021
Zhengying Liu, Adrien Pavao, Zhen Xu, Sergio Escalera, Isabelle Guyon, Julio C. S. Jacques Junior, Meysam Madadi, Sebastien Treguer: How far are we from true AutoML: reection from winning solutions and results of AutoDL challenge in 7th ICML Workshop on Automated Machine Learning (2020)

No competing interests reported.

Download PDF

Editor assigned by journal
20 May, 2024
Submission checks completed at journal
03 May, 2024
First submitted to journal
03 May, 2024

You are reading this latest preprint version

AutoML Platform for Comparative Analysis of Machine Learning Models

Status:

Version 1

Abstract

Figures

I. INTRODUCTION

II. BACKGROUND AND RELATED WORK

III. METHODOLOGIS

IV. RESULTS

V. CONCLUSION

Declarations

Author Contribution

References

Additional Declarations

Status:

Version 1