anthropomorphic robot manipulators. The proposal improves classical motion planning
algorithms by introducing a Deep Reinforcement Learning (DRL) approach trained ad hoc
for performing obstacle avoidance, while achieving a reaching task in the operative space.
More specifically, a switching mechanism is enabled whenever a condition of proximity to
the obstacles is met, thus conferring to the dual-mode architecture a self-configuring …