Lior Shani: Scholar Profile - Academic Search

Citations

	Total	Since 2019
Citations	443	439
h-index	6	6
i10-index	6	6

Citations per year: 2019: 5; 2020: 40; 2021: 88; 2022: 85; 2023: 119; 2024: 102

Public access

1 article

0 articles

Available

Not available

Based on funding mandates

Co-authors

Shie Mannor, Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research (verified email at technion.ac.il)
Yonathan Efroni, Meta, New York (verified email at fb.com)
Aviv Rosenberg, Google Research (verified email at google.com)
Guy Tennenholtz, Research Scientist, Google Research (verified email at google.com)
Manan Tomar, PhD student at University of Alberta (verified email at ualberta.ca)
Mohammad Ghavamzadeh, Amazon (verified email at amazon.com)
Tom Zahavy, Staff Research Scientist, Google DeepMind (verified email at deepmind.com)
Nadav Merlis, Postdoctoral Fellow @ CREST, ENSAE Paris (verified email at ensae.fr)

Lior Shani

Google Research

Verified email at google.com

Reinforcement Learning, Machine Learning, NLP


Title	Cited by	Year
Adaptive trust region policy optimization: Global convergence and faster rates for regularized MDPs. L Shani, Y Efroni, S Mannor. Thirty-Fourth AAAI Conference on Artificial Intelligence, 5668-5675, 2020	185	2020
Optimistic Policy Optimization with Bandit Feedback. Y Efroni, L Shani, A Rosenberg, S Mannor. Proceedings of the 37th International Conference on Machine Learning 119 …, 2020	95	2020
Mirror Descent Policy Optimization. M Tomar, L Shani, Y Efroni, M Ghavamzadeh. The Tenth International Conference on Learning Representations, 2020	64	2020
Factually consistent summarization via reinforcement learning with textual entailment feedback. P Roit, J Ferret, L Shani, R Aharoni, G Cideron, R Dadashi, M Geist, ... arXiv preprint arXiv:2306.00186, 2023	42	2023
Online apprenticeship learning. L Shani, T Zahavy, S Mannor. Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8240-8248, 2022	26	2022
Exploration Conscious Reinforcement Learning Revisited. L Shani, Y Efroni, S Mannor. Proceedings of the 36th International Conference on Machine Learning, 5680-5689, 2019	19*	2019
Demystifying embedding spaces using large language models. G Tennenholtz, Y Chow, CW Hsu, J Jeong, L Shani, A Tulepbergenov, ... arXiv preprint arXiv:2310.04475, 2023	5	2023
Reinforcement learning with history dependent dynamic contexts. G Tennenholtz, N Merlis, L Shani, M Mladenov, C Boutilier. International Conference on Machine Learning, 34011-34053, 2023	3	2023
Reinforcement learning with a terminator. G Tennenholtz, N Merlis, L Shani, S Mannor, U Shalit, G Chechik, ... Advances in Neural Information Processing Systems 35, 35696-35709, 2022	3	2022
Multi instance learning for unbalanced data. M Kozdoba, E Moroshko, L Shani, T Takagi, T Katoh, S Mannor, ... arXiv preprint arXiv:1812.07010, 2018	1	2018
Offline Regularised Reinforcement Learning for Large Language Models Alignment. PH Richemond, Y Tang, D Guo, D Calandriello, MG Azar, R Rafailov, ... arXiv preprint arXiv:2405.19107, 2024		2024
Embedding-Aligned Language Models. G Tennenholtz, Y Chow, CW Hsu, L Shani, E Liang, C Boutilier. arXiv preprint arXiv:2406.00024, 2024		2024
Multi-turn Reinforcement Learning from Preference Human Feedback. L Shani, A Rosenberg, A Cassel, O Lang, D Calandriello, A Zipori, ... arXiv preprint arXiv:2405.14655, 2024		2024

Articles 1–13