A reinforcement learning method for human-robot collaboration in assembly tasks

R Zhang, Q Lv, J Li, J Bao, T Liu, S Liu - Robotics and Computer-Integrated …, 2022 - Elsevier
The assembly process of high precision products involves a variety of delicate operations
that are time-consuming and energy-intensive. Neither the human operators nor the robots …

A graph-based reinforcement learning-enabled approach for adaptive human-robot collaborative assembly operations

R Zhang, J Lv, J Li, J Bao, P Zheng, T Peng - Journal of Manufacturing …, 2022 - Elsevier
In today's prevailing manufacturing paradigm of mass personalization, neither human
operators nor robots alone can perform all assembly tasks efficiently. To overcome it, human …

Self-teaching adaptive dynamic programming for Gomoku

D Zhao, Z Zhang, Y Dai - Neurocomputing, 2012 - Elsevier
In this paper adaptive dynamic programming (ADP) is applied to learn to play Gomoku. The
critic network is used to evaluate board situations. The basic idea is to penalize the last …

ADP with MCTS algorithm for Gomoku

Z Tang, D Zhao, K Shao, LV Le - 2016 IEEE Symposium Series …, 2016 - ieeexplore.ieee.org
Inspired by the core idea of AlphaGo, we combine a neural network, which is trained by
Adaptive Dynamic Programming (ADP), with Monte Carlo Tree Search (MCTS) algorithm for …

[PDF][PDF] Learning from noisy and delayed rewards the value of reinforcement learning to defense modeling and simulation

JK Alt - 2012 - core.ac.uk
Modeling and simulation of military operations requires human behavior models capable of
learning from experience in complex environments where feedback on action quality is …

[PDF][PDF] Dynamic Task Graphs for Teams in Collaborative Assembly Processes

AMM Macedo - 2022 - repositorio-aberto.up.pt
Collaborative robots are increasingly used in industry as they improve efficiency. Particularly
in assembly processes, collaboration, whether Human-Robot or Robot-Robot, expands the …

Prediction in Human Decision Making: A Modeling Approach

R Kianifar, F Towhidkhah… - Frontiers in Biomedical …, 2014 - fbt.tums.ac.ir
Human beings can determine optimal behaviors, which depends on the ability to make
planned and adaptive decisions. Decision making is defined as the ability to choose …

A predictive reinforcement learning framework for modeling human decision making behavior

R Kianifar, F Towhidkhah - 2009 14th International CSI …, 2009 - ieeexplore.ieee.org
Human can determine optimal behaviors which depend on the ability to make planned and
adaptive decisions. In this paper, we have proposed a predictive structure based on …

ニューロファジィ型強化学習システムを用いた群行動の獲得

呉本尭, 山野祐樹, 馮良炳, 小林邦和… - 電気学会論文誌C (電子 …, 2013 - jstage.jst.go.jp
抄録 Individuals in the swarm intelligence systems are generally designed to be able to
perform cooperative behaviors. However, those individual are usually with simple structures …