C She, C Sun, Z Gu, Y Li, C Yang… - Proceedings of the …, 2021 - ieeexplore.ieee.org
As one of the key communication scenarios in the fifth-generation and also the sixth- generation (6G) mobile communication networks, ultrareliable and low-latency …
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state …
Large Language Models (LLMs) have excelled as high-level semantic planners for sequential decision-making tasks. However, harnessing them to learn complex low-level …
J Skalse, N Howe… - Advances in Neural …, 2022 - proceedings.neurips.cc
We provide the first formal definition of\textbf {reward hacking}, a phenomenon where optimizing an imperfect proxy reward function, $\mathcal {\tilde {R}} $, leads to poor …
Reward and representation learning are two long-standing challenges for learning an expanding set of robot manipulation skills from sensory observations. Given the inherent …
The advances in reinforcement learning have recorded sublime success in various domains. Although the multi-agent domain has been overshadowed by its single-agent counterpart …
With the development of large language models (LLMs), striking a balance between the performance and safety of AI systems has never been more critical. However, the inherent …
Foundation models are first pre-trained on vast unsupervised datasets and then fine-tuned on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …
Reinforcement learning is a learning paradigm for solving sequential decision-making problems. Recent years have witnessed remarkable progress in reinforcement learning …