Fundamental research challenges for distributed computing continuum systems

V Casamayor Pujol, A Morichetta, I Murturi… - Information, 2023 - mdpi.com
This article discusses four fundamental topics for future Distributed Computing Continuum
Systems: their representation, model, lifelong learning, and business model. Further, it …

Lifelong robotic reinforcement learning by retaining experiences

A Xie, C Finn - Conference on Lifelong Learning Agents, 2022 - proceedings.mlr.press
Multi-task learning ideally allows embodied agents such as robots to acquire a diverse
repertoire of useful skills. However, many multi-task reinforcement learning efforts assume …

Policy Stitching: Learning Transferable Robot Policies

P Jian, E Lee, Z Bell, MM Zavlanos, B Chen - arXiv preprint arXiv …, 2023 - arxiv.org
Training robots with reinforcement learning (RL) typically involves heavy interactions with
the environment, and the acquired skills are often sensitive to changes in task environments …

Power Norm Based Lifelong Learning for Paraphrase Generations

D Li, P Yang, Y Zhang, P Li - Proceedings of the 46th International ACM …, 2023 - dl.acm.org
Lifelong seq2seq language generation models are trained with multiple domains in a
lifelong learning manner, with data from each domain being observed in an online fashion. It …

Online Continual Learning For Interactive Instruction Following Agents

B Kim, M Seo, J Choi - arXiv preprint arXiv:2403.07548, 2024 - arxiv.org
In learning an embodied agent executing daily tasks via language directives, the literature
largely assumes that the agent learns all training data at the beginning. We argue that such …

Forgetting and imbalance in robot lifelong learning with off-policy data

W Zhou, S Bohez, J Humplik, N Heess… - Conference on …, 2022 - proceedings.mlr.press
Robots will experience non-stationary environment dynamics throughout their lifetime: the
robot dynamics can change due to wear and tear, or its surroundings may change over time …

Efficient multitask reinforcement learning without performance loss

J Baek, S Baek, S Han - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org
We propose an iterative sparse Bayesian policy optimization (ISBPO) scheme as an efficient
multitask reinforcement learning (RL) method for industrial control applications that require …

Optimizing Robotic Manipulation with Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning

Y Dong, T Wu, C Song - Journal of Computing and …, 2024 - asmedigitalcollection.asme.org
Abstract Models based on the Transformer architecture have seen widespread application
across fields such as natural language processing (NLP), computer vision, and robotics, with …

Time-varying propensity score to bridge the gap between the past and present

R Fakoor, J Mueller, ZC Lipton, P Chaudhari… - arXiv preprint arXiv …, 2022 - arxiv.org
Real-world deployment of machine learning models is challenging because data evolves
over time. While no model can work when data evolves in an arbitrary fashion, if there is …

Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation

P Yang, X Wang, R Zhang, C Wang, F Oliehoek… - arXiv preprint arXiv …, 2024 - arxiv.org
Real-world environments require robots to continuously acquire new skills while retaining
previously learned abilities, all without the need for clearly defined task boundaries. Storing …