Automated data processing and feature engineering for deep learning and big data applications: a survey

A Mumuni, F Mumuni - Journal of Information and Intelligence, 2024 - Elsevier
Modern approach to artificial intelligence (AI) aims to design algorithms that learn directly
from data. This approach has achieved impressive results and has contributed significantly …

CtxPipe: Context-aware Data Preparation Pipeline Construction for Machine Learning

H Gao, S Cai, TTA Dinh, Z Huang, BC Ooi - Proceedings of the ACM on …, 2024 - dl.acm.org
Machine learning models are only as good as their training data. Simple models trained on
well-chosen features extracted from the raw data often outperform complex models trained …

Data Processing and Optimization in the Development of Machine Learning Systems: Detailed Requirements Analysis, Model Architecture, and Anti-Data Drift …

N Boyko - Journal of Applied Data Sciences, 2024 - bright-journal.org
The research relevance is determined by the growing need to use machine learning
systems in various industries, which requires reliable data processing and optimization. The …

ReClean: Reinforcement Learning for Automated Data Cleaning in ML Pipelines

M Abdelaal, AB Yayak, K Klede… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Addressing data quality issues is a challenging task due to the labor-intensive nature of
manual data cleaning processes and the inadequacy of automated tools that lack effective …

Regression-Stratified Sampling for Optimized Algorithm Selection in Time-Constrained Tabular AutoML

M Bahrami, S Hasegawa, L Liu, WP Chen - ICML 2024 Workshop on … - openreview.net
The selection of a machine-learning (ML) algorithm is indispensable for tabular AutoML
training. Finding an optimized algorithm from a search space can be expensive for large …

Construction and Evaluation of Enterprise Operation Risk Early Warning Model Based on Decision Tree Algorithm and Electric Power Big Data

Y Zhou, J Gao, K Yang - 2023 International Conference on …, 2023 - ieeexplore.ieee.org
In the current era, electric power enterprises are facing a lot of corporate risks. How to detect
and warn the operational risks of enterprises in a timely manner has become an important …

Pre-processing Approach for Semi-Structured Medical Data

AH Ab Yazik, AM Ali, S Nordin… - … on Innovation & …, 2024 - atlantis-press.com
The exponential growth of healthcare data nowadays due to the widespread use of
electronic health records (EHRs) has presented both opportunities and challenges in patient …

Pipeline para identificação de erros lexicais e geração de sugestões de correção

LQ Garcia, MH Chinellato, HM Caseli… - Anais do XIV Simpósio …, 2023 - SBC

A Comprehensive AutoML Solution for Automated Data Preprocessing and Model Deployment

N Palanivel, B Vigneshwaraan, B Sobanraj, P Ragavan