Scalable Oversight by Accounting for Unreliable Feedback

文章

学术资源搜索

获得 1 条结果（用时0.02秒）

我的图书馆

Scalable Oversight by Accounting for Unreliable Feedback

在引用文章中搜索

[PDF] arxiv.org

Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework

Y Metz, D Lindner, R Baur, M El-Assady - arXiv preprint arXiv:2411.11761, 2024 - arxiv.org

Reinforcement Learning from Human feedback (RLHF) has become a powerful tool to fine-
tune or train agentic machine learning models. Similar to how humans interact in social …

高级搜索

QQ 群

Scalable Oversight by Accounting for Unreliable Feedback

Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework

引用