Deep Speech 2: End-to-end speech recognition in English and Mandarin D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International conference on machine learning, 173-182, 2016 | 3705 | 2016 |
Megatron-LM: Training multi-billion parameter language models using model parallelism M Shoeybi, M Patwary, R Puri, P LeGresley, J Casper, B Catanzaro arXiv preprint arXiv:1909.08053, 2019 | 1539 | 2019 |
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 571 | 2022 |
Efficient large-scale language model training on GPU clusters using Megatron-LM D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ... Proceedings of the International Conference for High Performance Computing …, 2021 | 521 | 2021 |
Large calculation of the flow over a hypersonic vehicle using a GPU E Elsen, P LeGresley, E Darve Journal of Computational Physics 227 (24), 10148-10161, 2008 | 293 | 2008 |
Investigation of non-linear projection for POD based reduced order models for aerodynamics P LeGresley, J Alonso 39th aerospace sciences meeting and exhibit, 926, 2001 | 204 | 2001 |
Airfoil design optimization using reduced order models based on proper orthogonal decomposition P LeGresley, J Alonso Fluids 2000 conference and exhibit, 2545, 2000 | 203 | 2000 |
Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, and Bryan Catanzaro S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large …, 2022 | 127 | 2022 |
Aircraft design optimization JJ Alonso, P LeGresley, V Pereyra Mathematics and Computers in Simulation 79 (6), 1948-1958, 2009 | 100 | 2009 |
Application of proper orthogonal decomposition (POD) to design decomposition methods PA LeGresley Stanford University, 2006 | 81 | 2006 |
pyMDO: A framework for high-fidelity multi-disciplinary optimization J Alonso, P LeGresley, E Van Der Weide, JRRA Martins, J Reuther 10th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, 4480, 2004 | 67 | 2004 |
Dynamic domain decomposition and error correction for reduced order models P LeGresley, J Alonso 41st Aerospace Sciences Meeting and Exhibit, 250, 2003 | 64 | 2003 |
CHIMPS: A high-performance scalable module for multi-physics simulations J Alonso, S Hahn, F Ham, M Herrmann, G Iaccarino, G Kalitzin, ... 42nd AIAA/ASME/SAE/ASEE Joint Propulsion Conference & Exhibit, 5274, 2006 | 62 | 2006 |
Improving the performance of design decomposition methods with POD P LeGresley, J Alonso 10th AIAA/ISSMO multidisciplinary analysis and optimization conference, 4465, 2004 | 39 | 2004 |
GPU Enhancement of the Trigger to Extend Physics Reach at the LHC V Halyo, A Hunt, P Jindal, P LeGresley, P Lujan Journal of Instrumentation 8 (10), P10005, 2013 | 36 | 2013 |
First evaluation of the CPU, GPGPU and MIC architectures for real time particle tracking based on Hough transform at the LHC V Halyo, P LeGresley, P Lujan, V Karpusenko, A Vladimirov Journal of Instrumentation 9 (04), P04005, 2014 | 35 | 2014 |
High performance computing with CUDA M Fatica, P LeGresley, I Buck, J Stone, J Phillips, S Morton, P Micikevicius SC08, 2008 | 33 | 2008 |
Massively parallel computing and the search for jets and black holes at the LHC V Halyo, P LeGresley, P Lujan Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 2014 | 15 | 2014 |
Neural ODEs for image segmentation with level sets R Valle, F Reda, M Shoeybi, P LeGresley, A Tao, B Catanzaro arXiv preprint arXiv:1912.11683, 2019 | 9 | 2019 |
Nemotron-4 15B Technical Report J Parmar, S Prabhumoye, J Jennings, M Patwary, S Subramanian, D Su, ... arXiv preprint arXiv:2402.16819, 2024 | 4 | 2024 |