Evaluations of the gap between supervised and reinforcement lifelong learning on robotic...

V Casamayor Pujol, A Morichetta, I Murturi… - Information, 2023 - mdpi.com

This article discusses four fundamental topics for future Distributed Computing Continuum
Systems: their representation, model, lifelong learning, and business model. Further, it …

被引用次数：28 相关文章所有 7 个版本

[PDF] mlr.press

Lifelong robotic reinforcement learning by retaining experiences

A Xie, C Finn - Conference on Lifelong Learning Agents, 2022 - proceedings.mlr.press

Multi-task learning ideally allows embodied agents such as robots to acquire a diverse
repertoire of useful skills. However, many multi-task reinforcement learning efforts assume …

被引用次数：41 相关文章所有 4 个版本

[PDF] arxiv.org

Policy Stitching: Learning Transferable Robot Policies

P Jian, E Lee, Z Bell, MM Zavlanos, B Chen - arXiv preprint arXiv …, 2023 - arxiv.org

Training robots with reinforcement learning (RL) typically involves heavy interactions with
the environment, and the acquired skills are often sensitive to changes in task environments …

被引用次数：7 相关文章所有 4 个版本

[PDF] archive.org

Power Norm Based Lifelong Learning for Paraphrase Generations

D Li, P Yang, Y Zhang, P Li - Proceedings of the 46th International ACM …, 2023 - dl.acm.org

Lifelong seq2seq language generation models are trained with multiple domains in a
lifelong learning manner, with data from each domain being observed in an online fashion. It …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Online Continual Learning For Interactive Instruction Following Agents

B Kim, M Seo, J Choi - arXiv preprint arXiv:2403.07548, 2024 - arxiv.org

In learning an embodied agent executing daily tasks via language directives, the literature
largely assumes that the agent learns all training data at the beginning. We argue that such …

被引用次数：6 相关文章所有 3 个版本

[PDF] mlr.press

Forgetting and imbalance in robot lifelong learning with off-policy data

W Zhou, S Bohez, J Humplik, N Heess… - Conference on …, 2022 - proceedings.mlr.press

Robots will experience non-stationary environment dynamics throughout their lifetime: the
robot dynamics can change due to wear and tear, or its surroundings may change over time …

被引用次数：6 相关文章所有 3 个版本

Efficient multitask reinforcement learning without performance loss

J Baek, S Baek, S Han - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org

We propose an iterative sparse Bayesian policy optimization (ISBPO) scheme as an efficient
multitask reinforcement learning (RL) method for industrial control applications that require …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Optimizing Robotic Manipulation with Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning

Y Dong, T Wu, C Song - Journal of Computing and …, 2024 - asmedigitalcollection.asme.org

Abstract Models based on the Transformer architecture have seen widespread application
across fields such as natural language processing (NLP), computer vision, and robotics, with …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Time-varying propensity score to bridge the gap between the past and present

R Fakoor, J Mueller, ZC Lipton, P Chaudhari… - arXiv preprint arXiv …, 2022 - arxiv.org

Real-world deployment of machine learning models is challenging because data evolves
over time. While no model can work when data evolves in an arbitrary fashion, if there is …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation

P Yang, X Wang, R Zhang, C Wang, F Oliehoek… - arXiv preprint arXiv …, 2024 - arxiv.org

Real-world environments require robots to continuously acquire new skills while retaining
previously learned abilities, all without the need for clearly defined task boundaries. Storing …

高级搜索

QQ 群