Buddy-RAM: Improving the performance and efficiency of bulk bitwise operations using DRAM

V Seshadri, D Lee, T Mullins, H Hassan… - arXiv preprint arXiv …, 2016 - arxiv.org
Bitwise operations are an important component of modern day programming. Many widely-
used data structures (eg, bitmap indices in databases) rely on fast bitwise operations on …

Ambit: In-memory accelerator for bulk bitwise operations using commodity DRAM technology

V Seshadri, D Lee, T Mullins, H Hassan… - Proceedings of the 50th …, 2017 - dl.acm.org
Many important applications trigger bulk bitwise operations, ie, bitwise operations on large
bit vectors. In fact, recent works design techniques that exploit fast bulk bitwise operations to …

In-DRAM bulk bitwise execution engine

V Seshadri, O Mutlu - arXiv preprint arXiv:1905.09822, 2019 - arxiv.org
Many applications heavily use bitwise operations on large bitvectors as part of their
computation. In existing systems, performing such bulk bitwise operations requires the …

Fast bulk bitwise AND and OR in DRAM

V Seshadri, K Hsieh, A Boroum, D Lee… - IEEE Computer …, 2015 - ieeexplore.ieee.org
Bitwise operations are an important component of modern day programming, and are used
in a variety of applications such as databases. In this work, we propose a new and simple …

pluto: Enabling massively parallel computation in dram via lookup tables

JD Ferreira, G Falcao, J Gómez-Luna… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Data movement between the main memory and the processor is a key contributor to
execution time and energy consumption in memory-intensive applications. This data …

Buffered compares: Excavating the hidden parallelism inside DRAM architectures with lightweight logic

J Lee, JH Ahn, K Choi - 2016 Design, Automation & Test in …, 2016 - ieeexplore.ieee.org
We propose an approach called buffered compares, a less-invasive processing-in-memory
solution that can be used with existing processor memory interfaces such as DDR3/4 with …

[PDF][PDF] pluto: In-dram lookup tables to enable massively parallel general-purpose computation

JD Ferreira, G Falcao, J Gómez-Luna… - arXiv preprint arXiv …, 2021 - academia.edu
Data movement between main memory and the processor is a significant contributor to the
execution time and energy consumption of memory-intensive applications. This data …

Optimizing indirect memory references with milk

V Kiriansky, Y Zhang, S Amarasinghe - Proceedings of the 2016 …, 2016 - dl.acm.org
Modern applications such as graph and data analytics, when operating on real world data,
have working sets much larger than cache capacity and are bottlenecked by DRAM. To …

Chopper: A compiler infrastructure for programmable bit-serial simd processing using memory in dram

X Peng, Y Wang, MC Yang - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Increasing interests in Bit-serial SIMD Processing-Using-DRAM (PUD) architectures amplify
the needs for a compiler to automate code generation, credited to their ultra-wide SIMD …

Roc: Dram-based processing with reduced operation cycles

X Xin, Y Zhang, J Yang - Proceedings of the 56th Annual Design …, 2019 - dl.acm.org
DRAM based memory-centric computing architectures are promising solutions to tackle the
challenges of memory wall. In this paper, we develop a novel design of DRAM-based …