Training language models to follow instructions with human feedback L Ouyang, J Wu, X Jiang, D Almeida, C Wainwright, P Mishkin, C Zhang, ... Advances in neural information processing systems 35, 27730-27744, 2022 | 7474 | 2022 |
The Caltech-UCSD Birds-200-2011 dataset C Wah, S Branson, P Welinder, P Perona, S Belongie California Institute of Technology, 2011 | 4240 | 2011 |
Hindsight experience replay M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ... Advances in neural information processing systems 30, 2017 | 2770 | 2017 |
Evaluating large language models trained on code M Chen, J Tworek, H Jun, Q Yuan, HPDO Pinto, J Kaplan, H Edwards, ... arXiv preprint arXiv:2107.03374, 2021 | 2394 | 2021 |
GPT-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 2237 | 2023 |
Caltech-UCSD birds 200 P Welinder, S Branson, T Mita, C Wah, F Schroff, S Belongie, P Perona Technical Report CNS-TR-2010-001, California Institute of Technology 2 (5), 11, 2010 | 1698 | 2010 |
Learning dexterous in-hand manipulation M Andrychowicz, B Baker, M Chociej, R Jozefowicz, B McGrew, ... The International Journal of Robotics Research 39 (1), 3-20, 2020 | 1662 | 2020 |
The multidimensional wisdom of crowds P Welinder, S Branson, P Perona, S Belongie Advances in neural information processing systems 23, 2010 | 1174 | 2010 |
Solving Rubik's Cube with a robot hand I Akkaya, M Andrychowicz, M Chociej, M Litwin, B McGrew, A Petron, ... arXiv preprint arXiv:1910.07113, 2019 | 1096 | 2019 |
Cascaded pose regression P Dollár, P Welinder, P Perona 2010 IEEE Computer Society Conference on Computer Vision and Pattern …, 2010 | 713 | 2010 |
Visual recognition with humans in the loop S Branson, C Wah, F Schroff, B Babenko, P Welinder, P Perona, ... Computer Vision–ECCV 2010: 11th European Conference on Computer Vision …, 2010 | 579 | 2010 |
Multi-goal reinforcement learning: Challenging robotics environments and request for research M Plappert, M Andrychowicz, A Ray, B McGrew, B Baker, G Powell, ... arXiv preprint arXiv:1802.09464, 2018 | 548 | 2018 |
Online crowdsourcing: rating annotators and obtaining cost-effective labels P Welinder, P Perona 2010 IEEE Computer Society Conference on Computer Vision and Pattern …, 2010 | 417 | 2010 |
Sleep-spindle detection: crowdsourcing and evaluating performance of experts, non-experts and automated methods SC Warby, SL Wendt, P Welinder, EGS Munk, O Carrillo, HBD Sorensen, ... Nature methods 11 (4), 385-392, 2014 | 382 | 2014 |
Asymmetric actor critic for image-based robot learning L Pinto, M Andrychowicz, P Welinder, W Zaremba, P Abbeel arXiv preprint arXiv:1710.06542, 2017 | 337 | 2017 |
Text and code embeddings by contrastive pre-training A Neelakantan, T Xu, R Puri, A Radford, JM Han, J Tworek, Q Yuan, ... arXiv preprint arXiv:2201.10005, 2022 | 292 | 2022 |
Crowdclustering R Gomes, P Welinder, A Krause, P Perona Advances in neural information processing systems 24, 2011 | 224 | 2011 |
Domain randomization and generative models for robotic grasping J Tobin, L Biewald, R Duan, M Andrychowicz, A Handa, V Kumar, ... 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2018 | 182 | 2018 |
Training language models to follow instructions with human feedback, 2022 L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... URL https://arxiv.org/abs/2203.02155 13, 1, 2022 | 178 | 2022 |
Sim2real in robotics and automation: Applications and challenges S Höfer, K Bekris, A Handa, JC Gamboa, M Mozifian, F Golemo, ... IEEE transactions on automation science and engineering 18 (2), 398-400, 2021 | 128 | 2021 |