Devansh Arpit: Scholar Profile

Cited by

	All	Since 2019
Citations	5010	4670
h-index	22	21
i10-index	26	22

Citations per year

Year	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	21	66	216	359	538	704	1025	1268	774

Public access

4 articles available
0 articles not available

Based on funding mandates

Co-authors

Name	Affiliation	Verified email
Yoshua Bengio	Professor of computer science, University of Montreal, Mila, IVADO, CIFAR	umontreal.ca
Stanisław Jastrzębski	Chief Technology Officer & Chief Scientist @ Molecule.One	molecule.one
Aaron Courville	Professor, DIRO, Université de Montréal, Mila, Cifar CAI chair	umontreal.ca
Venu Govindaraju	SUNY Distinguished Professor, State University of New York, Buffalo	buffalo.edu
Yingbo Zhou	Senior Research Director, Salesforce Research	salesforce.com
Hung Q. Ngo	RelationalAI	relational.ai
Chen Xing (星辰)	Scale AI	scale.com
Ifeoma Nwogu	Computer Science and Engineering, University at Buffalo, SUNY	buffalo.edu
Anoop M Namboodiri	Professor, IIIT Hyderabad	iiit.ac.in
Yun Raymond Fu	NEU, COE Distinguished Professor; MAE, FNAI, FAAAS, FIEEE, FSPIE, FOSA, FIAPR	neu.edu
Shuang Wu	Amazon.com	amazon.com
Nils Napp	Electrical and Computer Engineering, Cornell University	cornell.edu


Devansh Arpit

Rashi.ai

Verified email at rashi.ai

Deep Learning, NLP


Title	Authors	Venue	Cited by	Year
A closer look at memorization in deep networks	D Arpit, S Jastrzębski, N Ballas, D Krueger, E Bengio, MS Kanwal, ...	ICML 2017 (arXiv preprint arXiv:1706.05394)	1895	2017
On the spectral bias of deep neural networks	N Rahaman, D Arpit, A Baratin, F Draxler, M Lin, FA Hamprecht, Y Bengio, ...	ICML 2019 (arXiv preprint arXiv:1806.08734)	1211*	2018
Three factors influencing minima in SGD	S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey	ICANN 2018 (arXiv preprint arXiv:1711.04623)	504	2017
The Break-Even Point on Optimization Trajectories of Deep Neural Networks	S Jastrzebski, M Szymczak, S Fort, D Arpit, J Tabor, K Cho, K Geras	ICLR 2020 (arXiv preprint arXiv:2002.09572)	150	2020
Normalization propagation: A parametric technique for removing internal covariate shift in deep networks	D Arpit, Y Zhou, BU Kota, V Govindaraju	ICML 2016 (arXiv preprint arXiv:1603.01431)	143	2016
Residual connections encourage iterative inference	S Jastrzebski, D Arpit, N Ballas, V Verma, T Che, Y Bengio	ICLR 2018 (arXiv preprint arXiv:1710.04773)	136	2017
A walk with SGD	C Xing, D Arpit, C Tsirigotis, Y Bengio	arXiv preprint arXiv:1802.08770	111	2018
Ensemble of averages: Improving model selection and boosting performance in domain generalization	D Arpit, H Wang, Y Zhou, C Xiong	NeurIPS 2022	105	2021
Why regularized auto-encoders learn sparse representation?	D Arpit, Y Zhou, H Ngo, V Govindaraju	ICML 2016 (arXiv preprint arXiv:1505.05561)	92	2015
Deep Nets Don't Learn via Memorization	D Krueger, N Ballas, S Jastrzebski, D Arpit, MS Kanwal, T Maharaj, ...	ICLR 2017 Workshop	70	2017
Fraternal Dropout	K Zolna, D Arpit, D Suhubdy, Y Bengio	ICLR 2018 (arXiv preprint arXiv:1711.00066)	60	2017
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets	D Arpit, V Campos, Y Bengio	NeurIPS 2019	56	2019
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization	S Jastrzebski, D Arpit, O Astrand, G Kerg, H Wang, C Xiong, R Socher, ...	ICML 2021	55	2020
h-detach: Modifying the LSTM Gradient Towards Better Optimization	D Arpit, B Kanuparthi, G Kerg, NR Ke, I Mitliagkas, Y Bengio	ICLR 2019 (arXiv preprint arXiv:1810.03023)	46	2018
BOLAA: Benchmarking and orchestrating LLM-augmented autonomous agents	Z Liu, W Yao, J Zhang, L Xue, S Heinecke, R Murthy, Y Feng, Z Chen, ...	arXiv preprint arXiv:2308.05960	44	2023
Variational Bi-LSTMs	S Shabanian, D Arpit, A Trischler, Y Bengio	arXiv preprint arXiv:1711.05717	42	2017
Is joint training better for deep auto-encoders?	Y Zhou, D Arpit, I Nwogu, V Govindaraju	arXiv preprint arXiv:1405.1380	40	2014
Finding Flatter Minima with SGD	S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey	ICLR 2018 Workshop	36	2018
The benefits of over-parameterization at initialization in deep ReLU networks	D Arpit, Y Bengio	arXiv preprint arXiv:1901.03611	34	2019
Retroformer: Retrospective large language agents with policy gradient optimization	W Yao, S Heinecke, JC Niebles, Z Liu, Y Feng, L Xue, R Murthy, Z Chen, ...	arXiv preprint arXiv:2308.02151	33	2023


Articles 1–20
