关注
Anand Siththaranjan
Anand Siththaranjan
在 berkeley.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Open problems and fundamental limitations of reinforcement learning from human feedback
S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ...
arXiv preprint arXiv:2307.15217, 2023
2512023
Analyzing human models that adapt online
A Bajcsy, A Siththaranjan, CJ Tomlin, AD Dragan
2021 IEEE International Conference on Robotics and Automation (ICRA), 2754-2760, 2021
202021
Distributional preference learning: Understanding and accounting for hidden context in RLHF
A Siththaranjan, C Laidlaw, D Hadfield-Menell
arXiv preprint arXiv:2312.08358, 2023
152023
Inferring neuronal ionic conductances from membrane potentials using cnns
R Ben-Shalom, J Balewski, A Siththaranjan, V Baratham, H Kyoung, ...
bioRxiv, 727974, 2019
92019
Open problems and fundamental limitations of reinforcement learning from human feedback. CoRR, abs/2307.15217, 2023. doi: 10.48550
S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ...
arXiv preprint ARXIV.2307.15217, 0
7
Understanding hidden context in preference learning: Consequences for rlhf
A Siththaranjan, C Laidlaw, D Hadfield-Menell
Socially Responsible Language Modelling Research, 2023
42023
AI Alignment with Changing and Influenceable Reward Functions
M Carroll, D Foote, A Siththaranjan, S Russell, A Dragan
arXiv preprint arXiv:2405.17713, 2024
22024
On the computational consequences of cost function design in nonlinear optimal control
T Westenbroek, A Siththaranjan, M Sarwari, CJ Tomlin, S Sastry
2022 IEEE 61st Conference on Decision and Control (CDC), 7423-7430, 2022
12022
Social Planning in Population Games
A Siththaranjan, C Tomlin
2024
Intent Demonstration in General-Sum Dynamic Games via Iterative Linear-Quadratic Approximations
J Li, A Siththaranjan, S Sojoudi, C Tomlin, A Bajcsy
arXiv preprint arXiv:2402.10182, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–10