关注
Volodymyr Tkachuk
Volodymyr Tkachuk
在 ualberta.ca 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Efficient planning in combinatorial action spaces with applications to cooperative multi-agent reinforcement learning
V Tkachuk, SA Bakhtiari, J Kirschner, M Jusup, I Bogunovic, C Szepesvári
International Conference on Artificial Intelligence and Statistics, 6342-6370, 2023
32023
Regret minimization via saddle point optimization
J Kirschner, A Bakhtiari, K Chandak, V Tkachuk, C Szepesvári
Advances in Neural Information Processing Systems 36, 2024
22024
Investigating action encodings in recurrent neural networks in reinforcement learning
MK Schlegel, V Tkachuk, AM White, M White
22023
Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear -Realizability and Concentrability
V Tkachuk, G Weisz, C Szepesvári
arXiv preprint arXiv:2405.16809, 2024
2024
On Efficient Planning in Large Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
V Tkachuk
2023
系统目前无法执行此操作,请稍后再试。
文章 1–5