A unifying framework for online optimization with long-term constraints

M Castiglioni, A Celli, A Marchesi… - Advances in Neural …, 2022 - proceedings.neurips.cc
We study online learning problems in which a decision maker has to take a sequence of
decisions subject to $ m $ long-term constraints. The goal of the decision maker is to …

DOPE: Doubly optimistic and pessimistic exploration for safe reinforcement learning

A Bura, A HasanzadeZonuzy… - Advances in neural …, 2022 - proceedings.neurips.cc
Safe reinforcement learning is extremely challenging--not only must the agent explore an
unknown environment, it must do so while ensuring no safety constraint violations. We …

Distributed bandit online convex optimization with time-varying coupled inequality constraints

X Yi, X Li, T Yang, L Xie, T Chai… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Distributed bandit online convex optimization with time-varying coupled inequality
constraints is considered, motivated by a repeated game between a group of learners and …

Online convex optimization with hard constraints: Towards the best of two worlds and beyond

H Guo, X Liu, H Wei, L Ying - Advances in Neural …, 2022 - proceedings.neurips.cc
This paper considers online convex optimization with hard constraints and analyzes
achievable regret and cumulative hard constraint violation (violation for short). The problem …

Online learning for low-latency adaptive streaming

T Karagkioules, R Mekuria, D Griffioen… - Proceedings of the 11th …, 2020 - dl.acm.org
Achieving low-latency is paramount for live streaming scenarios, that are now-days
becoming increasingly popular. In this paper, we propose a novel algorithm for bitrate …

Online primal-dual mirror descent under stochastic constraints

X Wei, H Yu, MJ Neely - Proceedings of the ACM on Measurement and …, 2020 - dl.acm.org
We consider online convex optimization with stochastic constraints where the objective
functions are arbitrarily time-varying and the constraint functions are independent and …

Online Learning under Budget and ROI Constraints via Weak Adaptivity

M Castiglioni, A Celli, C Kroer - Forty-first International Conference …, 2024 - openreview.net
We study online learning problems in which a decision maker has to make a sequence of
costly decisions, with the goal of maximizing their expected reward while adhering to budget …

Distributed online optimization with long-term constraints

D Yuan, A Proutiere, G Shi - IEEE Transactions on Automatic …, 2021 - ieeexplore.ieee.org
In this article, we consider distributed online convex optimization problems, where the
distributed system consists of various computing units connected through a time-varying …

Cache optimization models and algorithms

G Paschos, G Iosifidis, G Caire - Foundations and Trends® in …, 2020 - nowpublishers.com
Caching refers to the act of replicating information at a faster (or closer) medium with the
purpose of improving performance. This deceptively simple idea has given rise to some of …

Online ad procurement in non-stationary autobidding worlds

JCN Liang, H Lu, B Zhou - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Today's online advertisers procure digital ad impressions through interacting with
autobidding platforms: advertisers convey high level procurement goals via setting levers …