A review of auto-scaling techniques for elastic applications in cloud environments

T Lorido-Botran, J Miguel-Alonso, JA Lozano - Journal of grid computing, 2014 - Springer
Cloud computing environments allow customers to dynamically scale their applications. The
key problem is how to lease the right amount of resources, on a pay-as-you-go basis …

Auto-scaling web applications in clouds: A taxonomy and survey

C Qu, RN Calheiros, R Buyya - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Web application providers have been migrating their applications to cloud data centers,
attracted by the emerging cloud computing paradigm. One of the appealing features of the …

{MArk}: Exploiting cloud services for {Cost-Effective},{SLO-Aware} machine learning inference serving

C Zhang, M Yu, W Wang, F Yan - 2019 USENIX Annual Technical …, 2019 - usenix.org
The advances of Machine Learning (ML) have sparked a growing demand of ML-as-a-
Service: developers train ML models and publish them in the cloud as online services to …

Cloud computing: state-of-the-art and research challenges

Q Zhang, L Cheng, R Boutaba - Journal of internet services and …, 2010 - Springer
Cloud computing has recently emerged as a new paradigm for hosting and delivering
services over the Internet. Cloud computing is attractive to business owners as it eliminates …

Machine learning techniques in emerging cloud computing integrated paradigms: A survey and taxonomy

D Soni, N Kumar - Journal of Network and Computer Applications, 2022 - Elsevier
Cloud computing offers Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and
Software as a Service (SaaS) to provide compute, network, and storage capabilities to the …

Autoscale: Dynamic, robust capacity management for multi-tier data centers

A Gandhi, M Harchol-Balter, R Raghunathan… - ACM Transactions on …, 2012 - dl.acm.org
Energy costs for data centers continue to rise, already exceeding $15 billion yearly. Sadly
much of this power is wasted. Servers are only busy 10--30% of the time on average, but …

Research on auto-scaling of web applications in cloud: survey, trends and future directions

P Singh, P Gupta, K Jyoti, A Nayyar - Scalable Computing: Practice and …, 2019 - scpe.org
Cloud computing emerging environment attracts many applications providers to deploy web
applications on cloud data centers. The primary area of attraction is elasticity, which allows …

Adaptive resource provisioning for read intensive multi-tier applications in the cloud

W Iqbal, MN Dailey, D Carrera, P Janecek - Future Generation Computer …, 2011 - Elsevier
A Service-Level Agreement (SLA) provides surety for specific quality attributes to the
consumers of services. However, current SLAs offered by cloud infrastructure providers do …

No one (cluster) size fits all: automatic cluster sizing for data-intensive analytics

H Herodotou, F Dong, S Babu - … of the 2nd ACM Symposium on Cloud …, 2011 - dl.acm.org
Infrastructure-as-a-Service (IaaS) cloud platforms have brought two unprecedented changes
to cluster provisioning practices. First, any (nonexpert) user can provision a cluster of any …

Prepare: Predictive performance anomaly prevention for virtualized cloud systems

Y Tan, H Nguyen, Z Shen, X Gu… - 2012 IEEE 32nd …, 2012 - ieeexplore.ieee.org
Virtualized cloud systems are prone to performance anomalies due to various reasons such
as resource contentions, software bugs, and hardware failures. In this paper, we present a …