A deep hierarchical reinforcement learning algorithm in partially observable Markov decision processes

TP Le, NA Vien, TC Chung - Ieee Access, 2018 - ieeexplore.ieee.org
In recent years, reinforcement learning (RL) has achieved remarkable success due to the
growing adoption of deep learning techniques and the rapid growth of computing power …

Hierarchical monte-carlo planning

NA Vien, M Toussaint - Proceedings of the AAAI Conference on …, 2015 - ojs.aaai.org
Abstract Monte-Carlo Tree Search, especially UCT and its POMDP version POMCP, have
demonstrated excellent performanceon many problems. However, to efficiently scale to …

[PDF][PDF] Reinforcement learning algorithms: survey and classification

NR Ravishankar… - Indian J. Sci …, 2017 - sciresol.s3.us-east-2.amazonaws …
Reinforcement Learning (RL) has emerged as a strong approach in the field of Artificial
intelligence, specifically, in the field of machine learning, robotic navigation, etc. In this paper …

Continuous-observation partially observable semi-Markov decision processes for machine maintenance

M Zhang, M Revie - IEEE Transactions on Reliability, 2016 - ieeexplore.ieee.org
Partially observable semi-Markov decision processes (POSMDPs) provide a rich framework
for planning under both state transition uncertainty and observation uncertainty. In this …

An efficient approach to model-based hierarchical reinforcement learning

Z Li, A Narayan, TY Leong - Proceedings of the AAAI Conference on …, 2017 - ojs.aaai.org
We propose a model-based approach to hierarchical reinforcement learning that exploits
shared knowledge and selective execution at different levels of abstraction, to efficiently …

POMDP manipulation via trajectory optimization

NA Vien, M Toussaint - 2015 IEEE/RSJ International …, 2015 - ieeexplore.ieee.org
Efficient object manipulation based only on force feedback typically requires a plan of
actively contact-seeking actions to reduce uncertainty over the true environmental model. In …

A partially observable Markov-decision-process-based blackboard architecture for cognitive agents in partially observable environments

H Itoh, H Nakano, R Tokushima… - … on Cognitive and …, 2020 - ieeexplore.ieee.org
Partial observability, or the inability of an agent to fully observe the state of its environment,
exists in many real-world problem domains. However, most cognitive architectures do not …

High-efficiency online planning using composite bounds search under partial observation

Y Chen, J Liu, Y Huang, H Zhang, Y Wang - Applied Intelligence, 2023 - Springer
Motion planning in uncertain environments is a common challenge and essential for
autonomous robot operations. Representatively, the determinized sparse partially …

Single trajectory learning: exploration versus exploitation

Q Fu, Q Liu, S Zhong, H Luo, H Wu… - International Journal of …, 2018 - World Scientific
In reinforcement learning (RL), the exploration/exploitation (E/E) dilemma is a very crucial
issue, which can be described as searching between the exploration of the environment to …

Bayes-adaptive hierarchical MDPs

NA Vien, SG Lee, TC Chung - Applied Intelligence, 2016 - Springer
Reinforcement learning (RL) is an area of machine learning that is concerned with how an
agent learns to make decisions sequentially in order to optimize a particular performance …