CORL: Research-oriented deep offline reinforcement learning library D Tarasov, A Nikulin, D Akimov, V Kurenkov, S Kolesnikov Advances in Neural Information Processing Systems 36, 2024 | 59 | 2024 |
Anti-exploration by random network distillation A Nikulin, V Kurenkov, D Tarasov, S Kolesnikov International Conference on Machine Learning, 26228-26244, 2023 | 20 | 2023 |
Revisiting the minimalist approach to offline reinforcement learning D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov Advances in Neural Information Processing Systems 36, 2024 | 16 | 2024 |
Q-ensemble for offline RL: Don't scale the ensemble, scale the batch size A Nikulin, V Kurenkov, D Tarasov, D Akimov, S Kolesnikov arXiv preprint arXiv:2211.11092, 2022 | 15 | 2022 |
Let offline RL flow: Training conservative agents in the latent space of normalizing flows D Akimov, V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov arXiv preprint arXiv:2211.11096, 2022 | 10 | 2022 |
Prompts and Pre-Trained Language Models for Offline Reinforcement Learning D Tarasov, V Kurenkov, S Kolesnikov ICLR 2022 Workshop on Generalizable Policy Learning in Physical World, 2022 | 4 | 2022 |
Predicting perceived ethnicity with data on personal names in Russia A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ... Journal of Computational Social Science 6 (2), 589-608, 2023 | 3 | 2023 |
Katakomba: tools and benchmarks for data-driven NetHack V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov Advances in Neural Information Processing Systems 36, 2024 | 2 | 2024 |
Distilling LLMs' Decomposition Abilities into Compact Language Models D Tarasov, K Shridhar arXiv preprint arXiv:2402.01812, 2024 | 1 | 2024 |
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? D Tarasov, K Brilliantov, D Kharlapenko arXiv preprint arXiv:2406.06309, 2024 | | 2024 |
Offline RL for generative design of protein binders D Tarasov, UA Mbou Sob, M Arbesu, N Siboni, S Boyer, M Skwark, A Smit, ... bioRxiv, 2023.11.29.569328, 2023 | | 2023 |
Fixing 1-bit Adam and 1-bit LAMB algorithms D Tarasov, VA Ershov Computing 15 (4), 86-97, 2022 | | 2022 |
Predicting ethnicity with data on personal names in Russia A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ... | | 2021 |
Revisiting Behavior Regularized Actor-Critic D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov Workshop on Reincarnating Reinforcement Learning at ICLR 2023 | | |