AI-based fog and edge computing: A systematic review, taxonomy and future directions

S Iftikhar, SS Gill, C Song, M Xu, MS Aslanpour… - Internet of Things, 2023 - Elsevier
Resource management in computing is a very challenging problem that involves making
sequential decisions. Resource limitations, resource heterogeneity, dynamic and diverse …

Machine learning-based orchestration of containers: A taxonomy and future directions

Z Zhong, M Xu, MA Rodriguez, C Xu… - ACM Computing Surveys …, 2022 - dl.acm.org
Containerization is a lightweight application virtualization technology, providing high
environmental consistency, operating system distribution portability, and resource isolation …

Machine learning (ML)-centric resource management in cloud computing: A review and future directions

T Khan, W Tian, G Zhou, S Ilager, M Gong… - Journal of Network and …, 2022 - Elsevier
Cloud computing has rapidly emerged as a model for delivering Internet-based utility
computing services. Infrastructure as a Service (IaaS) is one of the most important and …

[HTML][HTML] Machine learning for data center optimizations: feature selection using Shapley additive exPlanation (SHAP)

Y Gebreyesus, D Dalton, S Nixon, D De Chiara… - Future Internet, 2023 - mdpi.com
The need for artificial intelligence (AI) and machine learning (ML) models to optimize data
center (DC) operations increases as the volume of operations management data upsurges …

Ai for it operations (aiops) on cloud platforms: Reviews, opportunities and challenges

Q Cheng, D Sahoo, A Saha, W Yang, C Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial Intelligence for IT operations (AIOps) aims to combine the power of AI with the big
data generated by IT Operations processes, particularly in cloud infrastructures, to provide …

Thermal prediction for efficient energy management of clouds using machine learning

S Ilager, K Ramamohanarao… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Thermal management in the hyper-scale cloud data centers is a critical problem. Increased
host temperature creates hotspots which significantly increases cooling cost and affects …

Take it to the limit: peak prediction-driven resource overcommitment in datacenters

N Bashir, N Deng, K Rzadca, D Irwin, S Kodak… - Proceedings of the …, 2021 - dl.acm.org
To increase utilization, datacenter schedulers often overcommit resources where the sum of
resources allocated to the tasks on a machine exceeds its physical capacity. Setting the right …

[HTML][HTML] Internet of Intelligent Things: A convergence of embedded systems, edge computing and machine learning

F Oliveira, DG Costa, F Assis, I Silva - Internet of Things, 2024 - Elsevier
This article comprehensively reviews the emerging concept of Internet of Intelligent Things
(IoIT), adopting an integrated perspective centred on the areas of embedded systems, edge …

FLASH: Fast model adaptation in ML-centric cloud platforms

H Qiu, W Mao, A Patke, S Cui, C Wang… - Proceedings of …, 2024 - proceedings.mlsys.org
The emergence of ML in various cloud system management tasks (eg, workload autoscaling
and job scheduling) has become a core driver of ML-centric cloud platforms. However, there …

[HTML][HTML] Deep reinforcement learning-based methods for resource scheduling in cloud computing: A review and future directions

G Zhou, W Tian, R Buyya, R Xue, L Song - Artificial Intelligence Review, 2024 - Springer
With the acceleration of the Internet in Web 2.0, Cloud computing is a new paradigm to offer
dynamic, reliable and elastic computing services. Efficient scheduling of resources or …