On kernelized multi-armed bandits SR Chowdhury, A Gopalan International Conference on Machine Learning, 844-853, 2017 | 435 | 2017 |
Misspecified linear bandits A Ghosh, SR Chowdhury, A Gopalan Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017 | 68 | 2017 |
Online learning in kernelized Markov decision processes SR Chowdhury, A Gopalan The 22nd International Conference on Artificial Intelligence and Statistics …, 2019 | 48 | 2019 |
Bayesian optimization under heavy-tailed payoffs S Ray Chowdhury, A Gopalan Advances in Neural Information Processing Systems 32, 2019 | 26 | 2019 |
Shuffle private linear contextual bandits SR Chowdhury, X Zhou International Conference on Machine Learning, 2022 | 18 | 2022 |
No-regret algorithms for multi-task Bayesian optimization SR Chowdhury, A Gopalan International Conference on Artificial Intelligence and Statistics, 1873-1881, 2021 | 18 | 2021 |
Differentially private regret minimization in episodic Markov decision processes SR Chowdhury, X Zhou Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 6375-6383, 2022 | 14 | 2022 |
Bregman deviations of generic exponential families SR Chowdhury, P Saux, O Maillard, A Gopalan The Thirty Sixth Annual Conference on Learning Theory, 394-449, 2023 | 13 | 2023 |
Distributed Differential Privacy in Multi-Armed Bandits SR Chowdhury, X Zhou ICLR 2023, 2022 | 13 | 2022 |
Reinforcement learning in parametric MDPs with exponential families SR Chowdhury, A Gopalan, OA Maillard International Conference on Artificial Intelligence and Statistics, 1855-1863, 2021 | 13 | 2021 |
On differentially private federated linear contextual bandits X Zhou, SR Chowdhury arXiv preprint arXiv:2302.13945, 2023 | 11 | 2023 |
Value Function Approximations via Kernel Embeddings for No-Regret Reinforcement Learning SR Chowdhury, R Oliveira Asian Conference on Machine Learning, 249-264, 2023 | 10* | 2023 |
Adaptive control of differentially private linear quadratic systems SR Chowdhury, X Zhou, N Shroff 2021 IEEE International Symposium on Information Theory (ISIT), 485-490, 2021 | 8 | 2021 |
Active learning of conditional mean embeddings via Bayesian optimisation SR Chowdhury, R Oliveira, F Ramos Conference on Uncertainty in Artificial Intelligence, 1119-1128, 2020 | 8 | 2020 |
Model Selection in Reinforcement Learning with General Function Approximations A Ghosh, SR Chowdhury ECML-PKDD, 2022 | 6* | 2022 |
Provably Sample Efficient RLHF via Active Preference Optimization N Das, S Chakraborty, A Pacchiano, SR Chowdhury arXiv preprint arXiv:2402.10500, 2024 | 5 | 2024 |
On Batch Bayesian Optimization SR Chowdhury, A Gopalan arXiv preprint arXiv:1911.01032, 2019 | 5 | 2019 |
Provably Robust DPO: Aligning Language Models with Noisy Feedback SR Chowdhury, A Kini, N Natarajan arXiv preprint arXiv:2403.00409, 2024 | 2 | 2024 |
GAR-meets-RAG Paradigm for Zero-Shot Information Retrieval D Arora, A Kini, SR Chowdhury, N Natarajan, G Sinha, A Sharma arXiv preprint arXiv:2310.20158, 2023 | 2 | 2023 |
Differentially Private Reward Estimation from Preference Based Feedback SR Chowdhury, X Zhou ICML 2023 Workshop The Many Facets of Preference-Based Learning, 2023 | 2 | 2023 |