Approximate planning for bayesian hierarchical reinforcement learning

A deep hierarchical reinforcement learning algorithm in partially observable Markov decision processes

TP Le, NA Vien, TC Chung - Ieee Access, 2018 - ieeexplore.ieee.org

In recent years, reinforcement learning (RL) has achieved remarkable success due to the
growing adoption of deep learning techniques and the rapid growth of computing power …

被引用次数：77 相关文章所有 11 个版本

[PDF] aaai.org

Hierarchical monte-carlo planning

NA Vien, M Toussaint - Proceedings of the AAAI Conference on …, 2015 - ojs.aaai.org

Abstract Monte-Carlo Tree Search, especially UCT and its POMDP version POMCP, have
demonstrated excellent performanceon many problems. However, to efficiently scale to …

被引用次数：60 相关文章所有 8 个版本

[PDF] amazonaws.com

[PDF][PDF] Reinforcement learning algorithms: survey and classification

NR Ravishankar… - Indian J. Sci …, 2017 - sciresol.s3.us-east-2.amazonaws …

Reinforcement Learning (RL) has emerged as a strong approach in the field of Artificial
intelligence, specifically, in the field of machine learning, robotic navigation, etc. In this paper …

被引用次数：33 相关文章

[PDF] strath.ac.uk

Continuous-observation partially observable semi-Markov decision processes for machine maintenance

M Zhang, M Revie - IEEE Transactions on Reliability, 2016 - ieeexplore.ieee.org

Partially observable semi-Markov decision processes (POSMDPs) provide a rich framework
for planning under both state transition uncertainty and observation uncertainty. In this …

被引用次数：26 相关文章所有 5 个版本

[PDF] aaai.org

An efficient approach to model-based hierarchical reinforcement learning

Z Li, A Narayan, TY Leong - Proceedings of the AAAI Conference on …, 2017 - ojs.aaai.org

We propose a model-based approach to hierarchical reinforcement learning that exploits
shared knowledge and selective execution at different levels of abstraction, to efficiently …

被引用次数：28 相关文章所有 6 个版本

[PDF] tu-berlin.de

POMDP manipulation via trajectory optimization

NA Vien, M Toussaint - 2015 IEEE/RSJ International …, 2015 - ieeexplore.ieee.org

Efficient object manipulation based only on force feedback typically requires a plan of
actively contact-seeking actions to reduce uncertainty over the true environmental model. In …

被引用次数：15 相关文章所有 8 个版本

A partially observable Markov-decision-process-based blackboard architecture for cognitive agents in partially observable environments

H Itoh, H Nakano, R Tokushima… - … on Cognitive and …, 2020 - ieeexplore.ieee.org

Partial observability, or the inability of an agent to fully observe the state of its environment,
exists in many real-world problem domains. However, most cognitive architectures do not …

被引用次数：6 相关文章所有 2 个版本

High-efficiency online planning using composite bounds search under partial observation

Y Chen, J Liu, Y Huang, H Zhang, Y Wang - Applied Intelligence, 2023 - Springer

Motion planning in uncertain environments is a common challenge and essential for
autonomous robot operations. Representatively, the determinized sparse partially …

被引用次数：1 相关文章所有 3 个版本

Single trajectory learning: exploration versus exploitation

Q Fu, Q Liu, S Zhong, H Luo, H Wu… - International Journal of …, 2018 - World Scientific

In reinforcement learning (RL), the exploration/exploitation (E/E) dilemma is a very crucial
issue, which can be described as searching between the exploration of the environment to …

被引用次数：4 相关文章所有 2 个版本

Bayes-adaptive hierarchical MDPs

NA Vien, SG Lee, TC Chung - Applied Intelligence, 2016 - Springer

Reinforcement learning (RL) is an area of machine learning that is concerned with how an
agent learns to make decisions sequentially in order to optimize a particular performance …

被引用次数：4 相关文章所有 7 个版本

高级搜索

QQ 群