Alexander Havrilla 个人学术档案

引用次数

	总计	2019 年至今
引用	279	278
h 指数	7	7
i10 指数	6	6

200

100

150

202020212022202320242 1 4 73 195

开放获取的出版物数量

查看全部

2 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

关注

Alexander Havrilla

Georgia Institute of Technology

在 gatech.edu 的电子邮件经过验证 - 首页

Machine learning Large language modeling


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Illustrating reinforcement learning from human feedback (rlhf) N Lambert, L Castricato, L von Werra, A Havrilla Hugging Face Blog 9, 2022	102	2022
Arb: Advanced reasoning benchmark for large language models T Sawada, D Paleka, A Havrilla, P Tadepalli, P Vidas, A Kranias, JJ Nay, ... arXiv preprint arXiv:2307.13692, 2023	47	2023
trlX: A framework for large scale reinforcement learning from human feedback A Havrilla, M Zhuravinskyi, D Phung, A Tiwari, J Tow, S Biderman, ... Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023	26	2023
Teaching large language models to reason with reinforcement learning A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024	25	2024
Sharp Khinchin-type inequalities for symmetric discrete uniform random variables A Havrilla, T Tkocz Israel Journal of Mathematics 246 (1), 281-297, 2021	15	2021
Glore: When, where, and how to improve llm reasoning via global and local refinements A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ... arXiv preprint arXiv:2402.10963, 2024	14	2024
Khinchin-type inequalities via Hadamard’s factorisation A Havrilla, P Nayar, T Tkocz International Mathematics Research Notices 2023 (3), 2429-2445, 2023	7	2023
trlX: A scalable framework for RLHF L Castricato, A Havrilla, S Matiana, DV Phung, A Tiwari, J Tow, ... Zenodo. DOI 10, 2023	7	2023
Robust preference learning for storytelling via contrastive reinforcement learning L Castricato, A Havrilla, S Matiana, M Pieler, A Ye, I Yang, S Frazier, ... arXiv preprint arXiv:2210.07792, 2022	7	2022
trlX: A scalable framework for RLHF, June 2023 L Castricato, A Havrilla, S Matiana, DV Phung, A Tiwari, J Tow, ... URL https://github. com/CarperAI/trlx, 0	7
Understanding the Effect of Noise in LLM Training Data with Algorithmic Chains of Thought A Havrilla, M Iyer arXiv preprint arXiv:2402.04004, 2024	6	2024
On deep generative models for approximation and estimation of distributions on manifolds B Dahal, A Havrilla, M Chen, T Zhao, W Liao Advances in Neural Information Processing Systems 35, 10615-10628, 2022	6	2022
Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness H Liu, A Havrilla, R Lai, W Liao Applied and Computational Harmonic Analysis 68, 101602, 2024	4	2024
Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness H Liu, A Havrilla, R Lai, W Liao arXiv preprint arXiv:2303.09863, 2023	4	2023
A study on improving reasoning in language models Y Du, A Havrilla, S Sukhbaatar, P Abbeel, R Raileanu I Can't Believe It's Not Better Workshop: Failure Modes in the Age of …, 2023	2	2023
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data A Havrilla, W Liao arXiv preprint arXiv:2411.06646, 2024		2024
Dual Fourier Unet: scale-robust diffusion model for zero-shot super-resolution image generation A Havrilla, K Rojas, W Liao, M Tao NeurIPS 2023 Workshop on Diffusion Models, 2023		2023
DFU: scale-robust diffusion model for zero-shot super-resolution image generation A Havrilla, K Rojas, W Liao, M Tao arXiv preprint arXiv:2401.06144, 2023		2023
On Deep Generative Models for Approximation and Estimation of Distributions on Manifolds B Dahal, A Havrilla, M Chen, T Zhao, W Liao arXiv preprint arXiv:2302.13183, 2023		2023
synthetic-instruct-gptj-pairwise A Havrilla Huggingface, 2023		2023

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

引用