Open problems and fundamental limitations of reinforcement learning from human feedback S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ... arXiv preprint arXiv:2307.15217, 2023 | 251 | 2023 |
Analyzing human models that adapt online A Bajcsy, A Siththaranjan, CJ Tomlin, AD Dragan 2021 IEEE International Conference on Robotics and Automation (ICRA), 2754-2760, 2021 | 20 | 2021 |
Distributional preference learning: Understanding and accounting for hidden context in RLHF A Siththaranjan, C Laidlaw, D Hadfield-Menell arXiv preprint arXiv:2312.08358, 2023 | 15 | 2023 |
Inferring neuronal ionic conductances from membrane potentials using CNNs R Ben-Shalom, J Balewski, A Siththaranjan, V Baratham, H Kyoung, ... bioRxiv, 727974, 2019 | 9 | 2019 |
Open problems and fundamental limitations of reinforcement learning from human feedback S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ... arXiv preprint arXiv:2307.15217, 2023 | 7 | |
Understanding hidden context in preference learning: Consequences for RLHF A Siththaranjan, C Laidlaw, D Hadfield-Menell Socially Responsible Language Modelling Research, 2023 | 4 | 2023 |
AI Alignment with Changing and Influenceable Reward Functions M Carroll, D Foote, A Siththaranjan, S Russell, A Dragan arXiv preprint arXiv:2405.17713, 2024 | 2 | 2024 |
On the computational consequences of cost function design in nonlinear optimal control T Westenbroek, A Siththaranjan, M Sarwari, CJ Tomlin, S Sastry 2022 IEEE 61st Conference on Decision and Control (CDC), 7423-7430, 2022 | 1 | 2022 |
Social Planning in Population Games A Siththaranjan, C Tomlin | | 2024 |
Intent Demonstration in General-Sum Dynamic Games via Iterative Linear-Quadratic Approximations J Li, A Siththaranjan, S Sojoudi, C Tomlin, A Bajcsy arXiv preprint arXiv:2402.10182, 2024 | | 2024 |