A review of robot learning for manipulation: Challenges, representations, and algorithms

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org
A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …

Where2act: From pixels to actions for articulated 3d objects

K Mo, LJ Guibas, M Mukadam… - Proceedings of the …, 2021 - openaccess.thecvf.com
One of the fundamental goals of visual perception is to allow agents to meaningfully interact
with their environment. In this paper, we take a step towards that long-term goal--we extract …

Self-supervised 6d object pose estimation for robot manipulation

X Deng, Y Xiang, A Mousavian… - … on Robotics and …, 2020 - ieeexplore.ieee.org
To teach robots skills, it is crucial to obtain data with supervision. Since annotating real world
data is time-consuming and expensive, enabling robots to learn in a self-supervised way is …

Seal: Self-supervised embodied active learning using exploration and 3d consistency

DS Chaplot, M Dalal, S Gupta, J Malik… - Advances in neural …, 2021 - proceedings.neurips.cc
In this paper, we explore how we can build upon the data and models of Internet images and
use them to adapt to robot vision without requiring any extra labels. We present a framework …

Scnet: Training inference sample consistency for instance segmentation

T Vu, H Kang, CD Yoo - Proceedings of the AAAI Conference on …, 2021 - ojs.aaai.org
Cascaded architectures have brought significant performance improvement in object
detection and instance segmentation. However, there are lingering issues regarding the …

O2o-afford: Annotation-free large-scale object-object affordance learning

K Mo, Y Qin, F Xiang, H Su… - Conference on robot …, 2022 - proceedings.mlr.press
Contrary to the vast literature in modeling, perceiving, and understanding agent-object (eg,
human-object, hand-object, robot-object) interaction in computer vision and robotics, very …

Learning physical graph representations from visual scenes

D Bear, C Fan, D Mrowca, Y Li, S Alter… - Advances in …, 2020 - proceedings.neurips.cc
Abstract Convolutional Neural Networks (CNNs) have proved exceptional at learning
representations for visual object categorization. However, CNNs do not explicitly encode …

Dance: A deep attentive contour model for efficient instance segmentation

Z Liu, JH Liew, X Chen, J Feng - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Contour-based instance segmentation methods are attractive due to their efficiency.
However, existing contour-based methods either suffer from lossy representation, complex …

Act the part: Learning interaction strategies for articulated object part discovery

SY Gadre, K Ehsani, S Song - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
People often use physical intuition when manipulating articulated objects, irrespective of
object semantics. Motivated by this observation, we identify an important embodied task …

Embodied amodal recognition: Learning to move to perceive objects

J Yang, Z Ren, M Xu, X Chen… - Proceedings of the …, 2019 - openaccess.thecvf.com
Passive visual systems typically fail to recognize objects in the amodal setting where they
are heavily occluded. In contrast, humans and other embodied agents have the ability to …