关注
QIWEI DI
QIWEI DI
Phd student, Department of Computer Science , University of California, Los Angeles
在 cs.ucla.edu 的电子邮件经过验证
标题
引用次数
引用次数
年份
Borda regret minimization for generalized linear dueling bandits
Y Wu, T Jin, H Lou, F Farnoud, Q Gu
ICML2024, 2023
62023
Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits
Q Di, T Jin, Y Wu, H Zhao, F Farnoud, Q Gu
International Conference on Learning Representations 2024, 2023
52023
Pessimistic nonlinear least-squares value iteration for offline reinforcement learning
Q Di, H Zhao, J He, Q Gu
International Conference on Learning Representations 2024, 2023
42023
Nearly optimal algorithms for contextual dueling bandits from adversarial feedback
Q Di, J He, Q Gu
arXiv preprint arXiv:2404.10776, 2024
12024
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Q Di, J He, D Zhou, Q Gu
International Conference on Machine Learning, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–5