[HTML][HTML] A survey on missing data in machine learning

T Emmanuel, T Maupong, D Mpoeleng, T Semong… - Journal of Big …, 2021 - Springer
Abstract Machine learning has been the corner stone in analysing and extracting information
from data and often a problem of missing values is encountered. Missing values occur …

Imputation methods used in missing traffic data: A literature review

P Wu, L Xu, Z Huang - … , ISICA 2019, Guangzhou, China, November 16–17 …, 2020 - Springer
The missing traffic data has caused great obstacles and interference to further research,
such as traffic flow prediction, which affects the traffic authorities' judgment for the real traffic …

Short‐term traffic forecasting using self‐adjusting k‐nearest neighbours

B Sun, W Cheng, P Goswami… - IET Intelligent Transport …, 2018 - Wiley Online Library
Short‐term traffic forecasting is becoming more important in intelligent transportation
systems. The k‐nearest neighbour (kNN) method is widely used for short‐term traffic …

A short-term household load forecasting framework using LSTM and data preparation

D Ageng, CY Huang, RG Cheng - Ieee Access, 2021 - ieeexplore.ieee.org
IoT devices are deployed in a building to instantly collect electricity load usage for next hour
load consumption forecasting so that the operation of the building can be properly managed …

ST-A-PGCL: Spatiotemporal adaptive periodical graph contrastive learning for traffic prediction under real scenarios

Y Qu, J Rong, Z Li, K Chen - Knowledge-Based Systems, 2023 - Elsevier
Exploring complicated dynamic spatiotemporal correlations has always been a challenging
issue in traffic prediction. Besides, methods that make predictions directly from data with …

A many-objective optimization based intelligent high performance data processing model for cyber-physical-social systems

Z Cui, Z Zhang, Z Hu, S Geng… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
As an emerging paradigm, Cyber-Physical Social System (CPSS) can provide humans with
efficient, convenient and personalized services. However, due to the explosive growth of …

The impact of heterogeneous distance functions on missing data imputation and classification performance

MS Santos, PH Abreu, A Fernández, J Luengo… - … Applications of Artificial …, 2022 - Elsevier
This work performs an in-depth study of the impact of distance functions on K-Nearest
Neighbours imputation of heterogeneous datasets. Missing data is generated at several …

Exploring the potential of machine learning to understand the occurrence and health risks of haloacetic acids in a drinking water distribution system

Y Yu, MM Hossain, R Sikder, Z Qi, L Huo… - Science of The Total …, 2024 - Elsevier
Determining the occurrence of disinfection byproducts (DBPs) in drinking water distribution
system (DWDS) remains challenging. Predicting DBPs using readily available water quality …

A Robust Data‐Driven Method for Multiseasonality and Heteroscedasticity in Time Series Preprocessing

B Sun, L Ma, T Shen, R Geng, Y Zhou… - … and Mobile Computing, 2021 - Wiley Online Library
Internet of Things (IoT) is emerging, and 5G enables much more data transport from mobile
and wireless sources. The data to be transmitted is too much compared to link capacity …

[PDF][PDF] Local average of nearest neighbors: Univariate time series imputation

A Flores, H Tito, C Silva - International Journal of …, 2019 - pdfs.semanticscholar.org
The imputation of time series is one of the most important tasks in the homogenization
process, the quality and precision of this process will directly influence the accuracy of the …