相关文章- 学术资源搜索

Q-learning with experience replay in a dynamic environment

M Pieters, MA Wiering - 2016 IEEE Symposium Series on …, 2016 - ieeexplore.ieee.org

Most research in reinforcement learning has focused on stationary environments. In this
paper, we propose several adaptations of Q-learning for a dynamic environment, for both …

被引用次数：33 相关文章所有 7 个版本

DQL: A new updating strategy for reinforcement learning based on Q-learning

CE Mariano, EF Morales - Machine Learning: ECML 2001: 12th European …, 2001 - Springer

In reinforcement learning an autonomous agent learns an optimal policy while interacting
with the environment. In particular, in one-step Q-learning, with each action an agent …

被引用次数：39 相关文章所有 11 个版本

Improving the performance of q-learning using simultanouse q-values updating

M Pouyan, A Mousavi, S Golzari… - … and Knowledge (ICTCK …, 2014 - ieeexplore.ieee.org

Q-learning is a one of the best model-free reinforcement learning algorithms. The goal is to
find an estimate of the optimal action-value function called Q-value function. The Q-value …

被引用次数：23 相关文章所有 2 个版本

Interaction models for multiagent reinforcement learning

R Ribeiro, AP Borges… - … Intelligence for Modelling …, 2008 - ieeexplore.ieee.org

This article proposes and compares different interaction models for reinforcement learning
based on multi-agent system. The cooperation during the learning process is crucial to …

被引用次数：23 相关文章所有 4 个版本

[PDF] arxiv.org

Reinforcement learning in R

N Pröllochs, S Feuerriegel - arXiv preprint arXiv:1810.00240, 2018 - arxiv.org

Reinforcement learning refers to a group of methods from artificial intelligence where an
agent performs learning through trial and error. It differs from supervised learning, since …

被引用次数：10 相关文章所有 7 个版本

An experience replay method based on tree structure for reinforcement learning

WC Jiang, KS Hwang, JL Lin - IEEE Transactions on Emerging …, 2019 - ieeexplore.ieee.org

Q-Learning, which is a well-known model-free reinforcement learning algorithm, a learning
agent explores an environment to update a state-action function. In reinforcement learning …

被引用次数：18 相关文章所有 2 个版本

[PDF] ifaamas.org

[PDF][PDF] Addressing the policy-bias of q-learning by repeating updates

S Abdallah, M Kaisers - … of the 2013 international conference on …, 2013 - ifaamas.org

ABSTRACT Q-learning is a very popular reinforcement learning algorithm being proven to
converge to optimal policies in Markov decision processes. However, Q-learning shows …

被引用次数：21 相关文章所有 6 个版本

A distributed q-learning algorithm for multi-agent team coordination

J Huang, B Yang, D Liu - 2005 International Conference on …, 2005 - ieeexplore.ieee.org

Q-learning is an effective model-free reinforcement learning algorithm. However, Q-learning
is centralized and competent only for single agent learning but not multi-agent learning …

被引用次数：35 相关文章所有 3 个版本

[PDF] psu.edu

[PDF][PDF] An analysis of Q-learning algorithms with strategies of reward function

S Manju, M Punithavalli - International Journal on Computer Science and …, 2011 - Citeseer

Q-Learning is a Reinforcement Learning technique that works by learning an action-value
function that gives the expected utility of taking a given action in a given state and following …

被引用次数：50 相关文章

Regional cooperative multi-agent q-learning based on potential field

L Liu, L Li - 2008 Fourth International Conference on Natural …, 2008 - ieeexplore.ieee.org

More and more artificial intelligence researchers focused on the reinforcement learning (RL)-
based multi-agent system (MAS). Multi-agent learning problems can in principle be solved …

被引用次数：13 相关文章所有 4 个版本

高级搜索

QQ 群