VideoDex: Learning dexterity from internet videos

K Shaw, S Bahl, D Pathak - Conference on Robot Learning, 2023 - proceedings.mlr.press
To build general robotic agents that can operate in many environments, it is often imperative
for the robot to collect experience in the real world. However, this is often not feasible due to …

Learning generalizable robotic reward functions from "in-the-wild" human videos

AS Chen, S Nair, C Finn - arXiv preprint arXiv:2103.16817, 2021 - arxiv.org
We are motivated by the goal of generalist robots that can complete a wide range of tasks
across many environments. Critical to this is the robot's ability to acquire some metric of task …

AVID: Learning multi-stage tasks via pixel-level translation of human videos

L Smith, N Dhawan, M Zhang, P Abbeel… - arXiv preprint arXiv …, 2019 - arxiv.org
Robotic reinforcement learning (RL) holds the promise of enabling robots to learn complex
behaviors through experience. However, realizing this promise for long-horizon tasks in the …

Reinforcement learning with videos: Combining offline observations with interaction

K Schmeckpeper, O Rybkin, K Daniilidis… - arXiv preprint arXiv …, 2020 - arxiv.org
Reinforcement learning is a powerful framework for robots to acquire skills from experience,
but often requires a substantial amount of online data collection. As a result, it is difficult to …

RoboNet: Large-scale multi-robot learning

S Dasari, F Ebert, S Tian, S Nair, B Bucher… - arXiv preprint arXiv …, 2019 - arxiv.org
Robot learning has emerged as a promising tool for taming the complexity and diversity of
the real world. Methods based on high-capacity models, such as deep networks, hold the …

Affordances from human videos as a versatile representation for robotics

S Bahl, R Mendonca, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Building a robot that can understand and learn to interact by watching humans has inspired
several vision problems. However, despite some successful results on static datasets, it …

R3M: A universal visual representation for robot manipulation

S Nair, A Rajeswaran, V Kumar, C Finn… - arXiv preprint arXiv …, 2022 - arxiv.org
We study how visual representations pre-trained on diverse human video data can enable
data-efficient learning of downstream robotic manipulation tasks. Concretely, we pre-train a …

Robotic offline RL from internet videos via value-function pre-training

C Bhateja, D Guo, D Ghosh, A Singh, M Tomar… - arXiv preprint arXiv …, 2023 - arxiv.org
Pre-training on Internet data has proven to be a key ingredient for broad generalization in
many modern ML systems. What would it take to enable such capabilities in robotic …

Graph inverse reinforcement learning from diverse videos

S Kumar, J Zamora, N Hansen… - … on Robot Learning, 2023 - proceedings.mlr.press
Research on Inverse Reinforcement Learning (IRL) from third-person videos has
shown encouraging results on removing the need for manual reward design for robotic …

Zero-shot robot manipulation from passive human videos

H Bharadhwaj, A Gupta, S Tulsiani, V Kumar - arXiv preprint arXiv …, 2023 - arxiv.org
Can we learn robot manipulation for everyday tasks, only by watching videos of humans
doing arbitrary tasks in different unstructured settings? Unlike widely adopted strategies of …