Tracking slowly moving clairvoyant: Optimal dynamic regret of online learning with true and...

X Li, L Xie, N Li - Annual Reviews in Control, 2023 - Elsevier

Distributed online optimization and online games have been increasingly researched in the
last decade, mostly motivated by their wide applications in sensor networks, robotics (eg …

Cited by 14 · Related articles

Online meta-learning

C Finn, A Rajeswaran, S Kakade… - … on machine learning, 2019 - proceedings.mlr.press

A central capability of intelligent systems is the ability to continuously build upon previous
experiences to speed up and enhance learning of new tasks. Two distinct research …

Cited by 482 · Related articles · All 12 versions

Distributed online optimization in dynamic environments using mirror descent

S Shahrampour, A Jadbabaie - IEEE Transactions on Automatic …, 2017 - ieeexplore.ieee.org

This work addresses decentralized online optimization in nonstationary environments. A
network of agents aim to track the minimizer of a global, time-varying, and convex function …

Cited by 291 · Related articles · All 6 versions

Dynamic regret of convex and smooth functions

P Zhao, YJ Zhang, L Zhang… - Advances in Neural …, 2020 - proceedings.neurips.cc

We investigate online convex optimization in non-stationary environments and choose the
dynamic regret as the performance measure, defined as the difference between cumulative …

Cited by 90 · Related articles · All 10 versions

No-regret learning in time-varying zero-sum games

M Zhang, P Zhao, H Luo… - … Conference on Machine …, 2022 - proceedings.mlr.press

Learning from repeated play in a fixed two-player zero-sum game is a classic problem in
game theory and online learning. We consider a variant of this problem where the game …

Cited by 37 · Related articles · All 7 versions

A new algorithm for non-stationary contextual bandits: Efficient, optimal and parameter-free

Y Chen, CW Lee, H Luo… - Conference on Learning …, 2019 - proceedings.mlr.press

We propose the first contextual bandit algorithm that is parameter-free, efficient, and optimal
in terms of dynamic regret. Specifically, our algorithm achieves $\mathcal {O}(\min\{\sqrt …

Cited by 126 · Related articles · All 7 versions

Adapting to online label shift with provable guarantees

Y Bai, YJ Zhang, P Zhao… - Advances in Neural …, 2022 - proceedings.neurips.cc

The standard supervised learning paradigm works effectively when training data shares the
same distribution as the upcoming testing samples. However, this stationary assumption is …

Cited by 22 · Related articles · All 9 versions

Distributed online optimization for multi-agent networks with coupled inequality constraints

X Li, X Yi, L Xie - IEEE Transactions on Automatic Control, 2020 - ieeexplore.ieee.org

This article investigates the distributed online optimization problem over a multi-agent
network subject to local set constraints and coupled inequality constraints, which has a lot of …

Cited by 105 · Related articles · All 5 versions

Distributed bandit online convex optimization with time-varying coupled inequality constraints

X Yi, X Li, T Yang, L Xie, T Chai… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Distributed bandit online convex optimization with time-varying coupled inequality
constraints is considered, motivated by a repeated game between a group of learners and …

Cited by 73 · Related articles · All 7 versions

Improved dynamic regret for non-degenerate functions

L Zhang, T Yang, J Yi, R Jin… - Advances in Neural …, 2017 - proceedings.neurips.cc

Recently, there has been a growing research interest in the analysis of dynamic regret,
which measures the performance of an online learner against a sequence of local …

Cited by 125 · Related articles · All 20 versions
