A semisupervised autoencoder-based approach for anomaly detection in high performance computing systems

A Borghesi, A Bartolini, M Lombardi, M Milano… - … Applications of Artificial …, 2019 - Elsevier
High Performance Computing (HPC) systems are complex machines with
heterogeneous components that can break or malfunction. Automated anomaly detection in …

Adaptive scheduling of multiprogrammed dynamic-multithreading applications

Z Wang, C Xu, K Agrawal, J Li - Journal of Parallel and Distributed …, 2022 - Elsevier
Modern parallel platforms, such as clouds or servers, are often shared among many different
jobs. However, existing parallel programming runtime systems are designed and optimized …

Cloud application predictability through integrated load-balancing and service time control

T Nylander, MT Andrén, KE Årzén… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Cloud computing provides the illusion of infinite capacity to application developers.
However, data center provisioning is complex and it is still necessary to handle the risk of …

When to hedge in interactive services

M Primorac, K Argyraki, E Bugnion - 18th USENIX Symposium on …, 2021 - usenix.org
In online data-intensive (OLDI) services, each client request typically executes on multiple
servers in parallel; as a result, "system hiccups", although rare within a single server, can …

Prediction based replica selection strategy for reducing tail latency in distributed systems

SM Shithil - 2022 - lib.buet.ac.bd
The deployment of modern applications in geo-distributed systems results in performance
fluctuation which is a consequence of long-tail latency. To deliver high-quality services these …

Programming technologies for engineering quality multicore software

TTFS Kaler - 2020 - dspace.mit.edu
This thesis is concerned with the development of programming technologies that reduce the
complexity of parallel programming to make it easier for average programmers to exploit the …

Improving the Performance of Cloud Applications: A Multi-Layered Approach

HMM Bashir - 2024 - search.proquest.com
Modern cloud applications require strict performance guarantees from the underlying
systems. However, these applications suffer from several performance issues such as high …

AMCilk: A framework for multiprogrammed parallel workloads

Z Wang, C Xu, K Agrawal, J Li - 2020 IEEE 27th International …, 2020 - ieeexplore.ieee.org
Modern parallel platforms, such as clouds or servers, are often shared among many different
jobs. However, existing parallel programming runtime systems are designed and optimized …

Scheduling for High Throughput and Small Latency in Parallel and Distributed Systems

Z Wang - 2022 - search.proquest.com
Parallel and distributed systems are pervasive, such as web services, clouds, and cyber-
physical systems. We often desire high throughput and small latency in the parallel and …

Managing tail latency in large scale information retrieval systems

J Mackenzie - 2019 - core.ac.uk
As both the availability of internet access and the prominence of smart devices continue to
increase, data is being generated at a rate faster than ever before. This massive increase in …