相关文章- 学术资源搜索

Automatic evaluation of excavator operators using learned reward functions

P Agarwal, M Teichmann, S Andrews… - arXiv preprint arXiv …, 2022 - arxiv.org

Training novice users to operate an excavator for learning different skills requires the
presence of expert teachers. Considering the complexity of the problem, it is comparatively …

被引用次数：2 相关文章所有 5 个版本

Learning task-based instructional policy for excavator-like robots

H Maske, E Kieson, G Chowdhary… - … on Robotics and …, 2018 - ieeexplore.ieee.org

We explore beyond existing work in learning from demonstration by asking the
question:“Can robots learn to guide?”, that is, can a robot autonomously learn an …

被引用次数：18 相关文章所有 3 个版本

[PDF] arxiv.org

3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints

Y Yoo, D Jung, SW Kim - arXiv preprint arXiv:2406.19848, 2024 - arxiv.org

In this paper, we propose a control algorithm based on reinforcement learning, employing
independent rewards for each joint to control excavators in a 3D space. The aim of this …

Validation of Double Transition Model by Analyzing Reward Distributions

E Santos, H Nguyen, KJ Kim, G Hyde… - 2020 IEEE/WIC/ACM …, 2020 - ieeexplore.ieee.org

To keep up with the world's increasing complexity, we need ever-more sophisticated
computational models that can help to make wiser decisions and accurately predict potential …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

Robot fine-tuning made easy: Pre-training rewards and policies for autonomous real-world reinforcement learning

J Yang, MS Mark, B Vu, A Sharma, J Bohg… - arXiv preprint arXiv …, 2023 - arxiv.org

The pre-train and fine-tune paradigm in machine learning has had dramatic success in a
wide range of domains because the use of existing data or pre-trained models on the …

被引用次数：6 相关文章所有 5 个版本

[PDF] frontiersin.org

Visual rewards from observation for sequential tasks: Autonomous pile loading

N Strokina, W Yang, J Pajarinen… - Frontiers in Robotics …, 2022 - frontiersin.org

One of the key challenges in implementing reinforcement learning methods for real-world
robotic applications is the design of a suitable reward function. In field robotics, the absence …

被引用次数：3 相关文章所有 12 个版本

[PDF] nsf.gov

Active reward learning from critiques

Y Cui, S Niekum - 2018 IEEE international conference on …, 2018 - ieeexplore.ieee.org

Learning from demonstration algorithms, such as Inverse Reinforcement Learning, aim to
provide a natural mechanism for programming robots, but can often require a prohibitive …

被引用次数：97 相关文章所有 7 个版本

[PDF] arxiv.org

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

K Huang, ES Hu, D Jayaraman - arXiv preprint arXiv:2212.08961, 2022 - arxiv.org

Physical interactions can often help reveal information that is not readily apparent. For
example, we may tug at a table leg to evaluate whether it is built well, or turn a water bottle …

被引用次数：2 相关文章所有 4 个版本

[PDF] arxiv.org

Discovering generalizable skills via automated generation of diverse tasks

K Fang, Y Zhu, S Savarese, L Fei-Fei - arXiv preprint arXiv:2106.13935, 2021 - arxiv.org

The learning efficiency and generalization ability of an intelligent agent can be greatly
improved by utilizing a useful set of skills. However, the design of robot skills can often be …

被引用次数：12 相关文章所有 5 个版本

[PDF] researchgate.net

[PDF][PDF] Deep Learning of Robotic Tasks using Strong and Weak Human Supervision

B Hilleli, R El-Yaniv - arXiv preprint arXiv:1612.01086, 2016 - researchgate.net

We propose a scheme for training a computerized agent to perform complex human tasks
such as highway steering. The scheme resembles natural teaching-learning processes used …

被引用次数：2 相关文章

高级搜索

QQ 群

Automatic evaluation of excavator operators using learned reward functions

Learning task-based instructional policy for excavator-like robots

3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints

Validation of Double Transition Model by Analyzing Reward Distributions

Robot fine-tuning made easy: Pre-training rewards and policies for autonomous real-world reinforcement learning

Visual rewards from observation for sequential tasks: Autonomous pile loading

Active reward learning from critiques

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

Discovering generalizable skills via automated generation of diverse tasks

[PDF][PDF] Deep Learning of Robotic Tasks using Strong and Weak Human Supervision

相关搜索

引用