Cache-conscious wavefront scheduling TG Rogers, M O'Connor, TM Aamodt 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 72-83, 2012 | 547 | 2012 |
VIS speeds new media processing M Tremblay, JM O'Connor, V Narayanan, L He IEEE micro 16 (4), 10-20, 1996 | 429 | 1996 |
Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems K Hsieh, E Ebrahimi, G Kim, N Chatterjee, M O’Connor, N Vijaykumar, ... | 311 | 2016 |
Picojava: A direct execution engine for java bytecode H McGhan, M O'Connor Computer 31 (10), 22-30, 1998 | 275 | 1998 |
picoJava-I: The Java virtual machine in hardware JM O'Connor, M Tremblay Micro, IEEE 17 (2), 45-53, 1997 | 253 | 1997 |
Compressing DMA engine: Leveraging activation sparsity for training deep neural networks M Rhu, M O'Connor, N Chatterjee, J Pool, Y Kwon, SW Keckler 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 227 | 2018 |
Fine-grained DRAM: Energy-efficient DRAM for extreme bandwidth systems M O'Connor, N Chatterjee, D Lee, J Wilson, A Agrawal, SW Keckler, ... Proceedings of the 50th Annual IEEE/ACM International Symposium on …, 2017 | 219 | 2017 |
Understanding reduced-voltage operation in modern DRAM devices: Experimental characterization, analysis, and mechanisms KK Chang, AG Yağlıkçı, S Ghose, A Agrawal, N Chatterjee, A Kashyap, ... Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (1 …, 2017 | 216 | 2017 |
Cache coherence for GPU architectures I Singh, A Shriraman, WWL Fung, M O'Connor, TM Aamodt 2013 IEEE 19th International Symposium on High Performance Computer …, 2013 | 212 | 2013 |
Divergence-aware warp scheduling TG Rogers, M O'Connor, TM Aamodt Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013 | 192 | 2013 |
Page placement strategies for GPUs within heterogeneous memory systems N Agarwal, D Nellans, M Stephenson, M O'Connor, SW Keckler Proceedings of the Twentieth International Conference on Architectural …, 2015 | 176 | 2015 |
Scaling the Power Wall: A Path to Exascale O Villa, DR Johnson, M O’Connor, E Bolotin, D Nellans, J Luitjens, ... Supercomputing, 2014 | 174 | 2014 |
Characterizing and evaluating a key-value store application on heterogeneous CPU-GPU systems TH Hetherington, TG Rogers, L Hsu, M O'Connor, TM Aamodt 2012 IEEE International Symposium on Performance Analysis of Systems …, 2012 | 148 | 2012 |
What your DRAM power models are not telling you: Lessons from a detailed experimental study S Ghose, AG Yaglikçi, R Gupta, D Lee, K Kudrolli, WX Liu, H Hassan, ... Proceedings of the ACM on Measurement and Analysis of Computing Systems 2 (3 …, 2018 | 136 | 2018 |
Processor with accelerated array access bounds checking M Tremblay, JM O'Connor, WN Joy US Patent 6,014,723, 2000 | 136 | 2000 |
Generation isolation system and method for garbage collection JM O'Connor, M Tremblay, S Vishin US Patent 6,098,089, 2000 | 134 | 2000 |
Instruction folding for a stack-based machine JM O'Connor, M Tremblay US Patent 6,026,485, 2000 | 130 | 2000 |
Flexible software profiling of GPU architectures M Stephenson, SK Sastry Hari, Y Lee, E Ebrahimi, DR Johnson, ... ACM SIGARCH Computer Architecture News 43 (3), 185-197, 2015 | 129 | 2015 |
A mostly-clean DRAM cache for effective hit speculation and self-balancing dispatch J Sim, GH Loh, H Kim, M OConnor, M Thottethodi 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 247-257, 2012 | 127 | 2012 |
Die-stacked memory device providing data translation GH Loh, BM Beckmann, JM O'Connor, M Ignatowski, MJ Schulte, LR Hsu, ... US Patent 9,135,185, 2015 | 126 | 2015 |