Automatic evaluation of excavator operators using learned reward functions

P Agarwal, M Teichmann, S Andrews… - arXiv preprint arXiv …, 2022 - arxiv.org
Training novice users to operate an excavator for learning different skills requires the
presence of expert teachers. Considering the complexity of the problem, it is comparatively …

Learning task-based instructional policy for excavator-like robots

H Maske, E Kieson, G Chowdhary… - … on Robotics and …, 2018 - ieeexplore.ieee.org
We explore beyond existing work in learning from demonstration by asking the
question:“Can robots learn to guide?”, that is, can a robot autonomously learn an …

3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints

Y Yoo, D Jung, SW Kim - arXiv preprint arXiv:2406.19848, 2024 - arxiv.org
In this paper, we propose a control algorithm based on reinforcement learning, employing
independent rewards for each joint to control excavators in a 3D space. The aim of this …

Validation of Double Transition Model by Analyzing Reward Distributions

E Santos, H Nguyen, KJ Kim, G Hyde… - 2020 IEEE/WIC/ACM …, 2020 - ieeexplore.ieee.org
To keep up with the world's increasing complexity, we need ever-more sophisticated
computational models that can help to make wiser decisions and accurately predict potential …

Robot fine-tuning made easy: Pre-training rewards and policies for autonomous real-world reinforcement learning

J Yang, MS Mark, B Vu, A Sharma, J Bohg… - arXiv preprint arXiv …, 2023 - arxiv.org
The pre-train and fine-tune paradigm in machine learning has had dramatic success in a
wide range of domains because the use of existing data or pre-trained models on the …

Visual rewards from observation for sequential tasks: Autonomous pile loading

N Strokina, W Yang, J Pajarinen… - Frontiers in Robotics …, 2022 - frontiersin.org
One of the key challenges in implementing reinforcement learning methods for real-world
robotic applications is the design of a suitable reward function. In field robotics, the absence …

Active reward learning from critiques

Y Cui, S Niekum - 2018 IEEE international conference on …, 2018 - ieeexplore.ieee.org
Learning from demonstration algorithms, such as Inverse Reinforcement Learning, aim to
provide a natural mechanism for programming robots, but can often require a prohibitive …

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

K Huang, ES Hu, D Jayaraman - arXiv preprint arXiv:2212.08961, 2022 - arxiv.org
Physical interactions can often help reveal information that is not readily apparent. For
example, we may tug at a table leg to evaluate whether it is built well, or turn a water bottle …

Discovering generalizable skills via automated generation of diverse tasks

K Fang, Y Zhu, S Savarese, L Fei-Fei - arXiv preprint arXiv:2106.13935, 2021 - arxiv.org
The learning efficiency and generalization ability of an intelligent agent can be greatly
improved by utilizing a useful set of skills. However, the design of robot skills can often be …

[PDF][PDF] Deep Learning of Robotic Tasks using Strong and Weak Human Supervision

B Hilleli, R El-Yaniv - arXiv preprint arXiv:1612.01086, 2016 - researchgate.net
We propose a scheme for training a computerized agent to perform complex human tasks
such as highway steering. The scheme resembles natural teaching-learning processes used …