Enabling serverless deployment of large-scale ai workloads

A Christidis, S Moschoyiannis, CH Hsu… - IEEE Access, 2020 - ieeexplore.ieee.org
We propose a set of optimization techniques for transforming a generic AI codebase so that
it can be successfully deployed to a restricted serverless environment, without compromising …

Serving machine learning workloads in resource constrained environments: A serverless deployment example

A Christidis, R Davies… - 2019 IEEE 12th …, 2019 - ieeexplore.ieee.org
Deployed AI platforms typically ship with bulky system architectures which present
bottlenecks and a high risk of failure. A serverless deployment can mitigate these factors and …

Caerus:{NIMBLE} task scheduling for serverless analytics

H Zhang, Y Tang, A Khandelwal, J Chen… - 18th USENIX Symposium …, 2021 - usenix.org
Serverless platforms facilitate transparent resource elasticity and fine-grained billing, making
them an attractive choice for data analytics. We find that while server-centric analytics …

Towards demystifying serverless machine learning training

J Jiang, S Gan, Y Liu, F Wang, G Alonso… - Proceedings of the …, 2021 - dl.acm.org
The appeal of serverless (FaaS) has triggered a growing interest on how to use it in data-
intensive applications such as ETL, query processing, or machine learning (ML). Several …

Lass: Running latency sensitive serverless computations at the edge

B Wang, A Ali-Eldin, P Shenoy - … of the 30th international symposium on …, 2021 - dl.acm.org
Serverless computing has emerged as a new paradigm for running short-lived computations
in the cloud. Due to its ability to handle IoT workloads, there has been considerable interest …

Sledge: A serverless-first, light-weight wasm runtime for the edge

PK Gadepalli, S McBride, G Peach… - Proceedings of the 21st …, 2020 - dl.acm.org
Emerging IoT applications with real-time latency constraints require new data processing
systems operating at the Edge. Serverless computing offers a new compelling paradigm …

Towards plan-aware resource allocation in serverless query processing

M Bag, A Jindal, H Patel - 12th USENIX Workshop on Hot Topics in …, 2020 - usenix.org
Resource allocation for serverless query processing is a challenge. Unfortunately, prior
approaches have treated queries as black boxes, thereby missing significant resource …

Evaluation of integrated frameworks for optimizing qos in serverless computing

A Kumari, B Sahoo, RK Behera, S Misra… - … Science and Its …, 2021 - Springer
Serverless computing is an emerging cloud deployment model where developers can
concentrate on developing application logic without worrying about the underlying …

[HTML][HTML] {SONIC}: Application-aware data passing for chained serverless applications

A Mahgoub, L Wang, K Shankar, Y Zhang… - 2021 USENIX Annual …, 2021 - s.usenix.org
The conference papers and full proceedings are available to registered attendees now and
will be available to everyone beginning Wednesday, July 14, 2021. Paper abstracts and …

Towards efficient processing of latency-sensitive serverless dags at the edge

X Lyu, L Cherkasova, R Aitken, G Parmer… - Proceedings of the 5th …, 2022 - dl.acm.org
Many emerging novel applications expect" near real-time" processing and responses, which
can not be guaranteed by today's Cloud and would require processing at the Edge …