相关文章- 学术资源搜索

A finite-time analysis of Q-learning with neural network function approximation

P Xu, Q Gu - International Conference on Machine Learning, 2020 - proceedings.mlr.press

Q-learning with neural network function approximation (neural Q-learning for short) is
among the most prevalent deep reinforcement learning algorithms. Despite its empirical …

被引用次数：85 相关文章所有 7 个版本

[PDF] neurips.cc

A new convergent variant of Q-learning with linear function approximation

D Carvalho, FS Melo, P Santos - Advances in Neural …, 2020 - proceedings.neurips.cc

In this work, we identify a novel set of conditions that ensure convergence with probability 1
of Q-learning with linear function approximation, by proposing a two time-scale variation …

被引用次数：33 相关文章所有 5 个版本

[PDF] mlr.press

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y Xie, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press

Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

被引用次数：734 相关文章所有 9 个版本

[PDF] neurips.cc

Provably efficient Q-learning with function approximation via distribution shift error checking oracle

SS Du, Y Luo, R Wang… - Advances in Neural …, 2019 - proceedings.neurips.cc

Q-learning with function approximation is one of the most popular methods in reinforcement
learning. Though the idea of using function approximation was proposed at least 60 years …

被引用次数：104 相关文章所有 7 个版本

[PDF] mlr.press

Diagnosing bottlenecks in deep q-learning algorithms

J Fu, A Kumar, M Soh, S Levine - … Conference on Machine …, 2019 - proceedings.mlr.press

Q-learning methods are a common class of algorithms used in reinforcement learning (RL).
However, their behavior with function approximation, especially with neural networks, is …

被引用次数：151 相关文章所有 7 个版本

[PDF] github.io

[PDF][PDF] Performance of q-learning with linear function approximation: Stability and finite-time analysis

Z Chen, S Zhang, TT Doan, ST Maguluri… - arXiv preprint arXiv …, 2019 - optrl2019.github.io

In this paper, we consider the model-free reinforcement learning problem and study the
popular Q-learning algorithm with linear function approximation for finding the optimal …

被引用次数：64 相关文章

[PDF] neurips.cc

Finite-time analysis for double Q-learning

H Xiong, L Zhao, Y Liang… - Advances in neural …, 2020 - proceedings.neurips.cc

Although Q-learning is one of the most successful algorithms for finding the best action-
value function (and thus the optimal policy) in reinforcement learning, its implementation …

被引用次数：34 相关文章所有 8 个版本

[PDF] ieee.org

Q-learning algorithms: A comprehensive classification and applications

B Jang, M Kim, G Harerimana, JW Kim - IEEE access, 2019 - ieeexplore.ieee.org

Q-learning is arguably one of the most applied representative reinforcement learning
approaches and one of the off-policy strategies. Since the emergence of Q-learning, many …

被引用次数：478 相关文章所有 6 个版本

[PDF] arxiv.org

Using deep q-learning to control optimization hyperparameters

S Hansen - arXiv preprint arXiv:1602.04062, 2016 - arxiv.org

We present a novel definition of the reinforcement learning state, actions and reward
function that allows a deep Q-network (DQN) to learn to control an optimization …

被引用次数：51 相关文章所有 2 个版本

[PDF] mlr.press

Target-based temporal-difference learning

D Lee, N He - International Conference on Machine …, 2019 - proceedings.mlr.press

The use of target networks has been a popular and key component of recent deep Q-
learning algorithms for reinforcement learning, yet little is known from the theory side. In this …

被引用次数：40 相关文章所有 12 个版本

高级搜索

QQ 群

A finite-time analysis of Q-learning with neural network function approximation

A new convergent variant of Q-learning with linear function approximation

A theoretical analysis of deep Q-learning

Provably efficient Q-learning with function approximation via distribution shift error checking oracle

Diagnosing bottlenecks in deep q-learning algorithms

[PDF][PDF] Performance of q-learning with linear function approximation: Stability and finite-time analysis

Finite-time analysis for double Q-learning

Q-learning algorithms: A comprehensive classification and applications

Using deep q-learning to control optimization hyperparameters

Target-based temporal-difference learning

引用