X Wang, W Wongkamjan, R Jia… - … on Machine Learning, 2023 - proceedings.mlr.press
Abstract Model-based reinforcement learning (RL) often achieves higher sample efficiency in practice than model-free RL by learning a dynamics model to generate samples for policy …
Transformer has achieved great successes in learning vision and language representation, which is general across various downstream tasks. In visual control, learning transferable …
X Chen, YM Mu, P Luo, S Li… - … Conference on Machine …, 2022 - proceedings.mlr.press
Abstract Partially Observable Markov Decision Process (POMDP) provides a principled and generic framework to model real world sequential decision making processes but yet …
Compared to model-free reinforcement learning (RL), model-based RL is often more sample efficient by leveraging a learned dynamics model to help decision making. However, the …
Z Gao, Y Mu, J Qu, M Hu, L Guo, P Luo, Y Lu - arXiv preprint arXiv …, 2024 - arxiv.org
Dual-arm robots offer enhanced versatility and efficiency over single-arm counterparts by enabling concurrent manipulation of multiple objects or cooperative execution of tasks using …
The Overfitted Brain hypothesis suggests dreams happen to allow generalization in the human brain. Here, we ask if the same is true for reinforcement learning agents as well …