Cautious regret minimization: Online optimization with long-term budget constraints

M Castiglioni, A Celli, A Marchesi… - Advances in Neural …, 2022 - proceedings.neurips.cc

We study online learning problems in which a decision maker has to take a sequence of
decisions subject to $ m $ long-term constraints. The goal of the decision maker is to …

被引用次数：24 相关文章所有 8 个版本

[PDF] neurips.cc

DOPE: Doubly optimistic and pessimistic exploration for safe reinforcement learning

A Bura, A HasanzadeZonuzy… - Advances in neural …, 2022 - proceedings.neurips.cc

Safe reinforcement learning is extremely challenging--not only must the agent explore an
unknown environment, it must do so while ensuring no safety constraint violations. We …

被引用次数：31 相关文章所有 6 个版本

[PDF] arxiv.org

Distributed bandit online convex optimization with time-varying coupled inequality constraints

X Yi, X Li, T Yang, L Xie, T Chai… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Distributed bandit online convex optimization with time-varying coupled inequality
constraints is considered, motivated by a repeated game between a group of learners and …

被引用次数：71 相关文章所有 7 个版本

[PDF] neurips.cc

Online convex optimization with hard constraints: Towards the best of two worlds and beyond

H Guo, X Liu, H Wei, L Ying - Advances in Neural …, 2022 - proceedings.neurips.cc

This paper considers online convex optimization with hard constraints and analyzes
achievable regret and cumulative hard constraint violation (violation for short). The problem …

被引用次数：15 相关文章所有 3 个版本

Online learning for low-latency adaptive streaming

T Karagkioules, R Mekuria, D Griffioen… - Proceedings of the 11th …, 2020 - dl.acm.org

Achieving low-latency is paramount for live streaming scenarios, that are now-days
becoming increasingly popular. In this paper, we propose a novel algorithm for bitrate …

被引用次数：49 相关文章

[PDF] arxiv.org

Online primal-dual mirror descent under stochastic constraints

X Wei, H Yu, MJ Neely - Proceedings of the ACM on Measurement and …, 2020 - dl.acm.org

We consider online convex optimization with stochastic constraints where the objective
functions are arbitrarily time-varying and the constraint functions are independent and …

被引用次数：58 相关文章所有 8 个版本

[PDF] openreview.net

Online Learning under Budget and ROI Constraints via Weak Adaptivity

M Castiglioni, A Celli, C Kroer - Forty-first International Conference …, 2024 - openreview.net

We study online learning problems in which a decision maker has to make a sequence of
costly decisions, with the goal of maximizing their expected reward while adhering to budget …

被引用次数：1 相关文章

[PDF] arxiv.org

Distributed online optimization with long-term constraints

D Yuan, A Proutiere, G Shi - IEEE Transactions on Automatic …, 2021 - ieeexplore.ieee.org

In this article, we consider distributed online convex optimization problems, where the
distributed system consists of various computing units connected through a time-varying …

被引用次数：47 相关文章所有 6 个版本

[PDF] nowpublishers.com

Cache optimization models and algorithms

G Paschos, G Iosifidis, G Caire - Foundations and Trends® in …, 2020 - nowpublishers.com

Caching refers to the act of replicating information at a faster (or closer) medium with the
purpose of improving performance. This deceptively simple idea has given rise to some of …

被引用次数：40 相关文章所有 8 个版本

[PDF] neurips.cc

Online ad procurement in non-stationary autobidding worlds

JCN Liang, H Lu, B Zhou - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Today's online advertisers procure digital ad impressions through interacting with
autobidding platforms: advertisers convey high level procurement goals via setting levers …

被引用次数：1 相关文章所有 4 个版本

高级搜索

QQ 群