V Seshadri, O Mutlu - arXiv preprint arXiv:1905.09822, 2019 - arxiv.org
… bulkbitwise operations completely inside main memory. Ambit exploits the internal organization and analog operation of DRAM-… With support for fastbulkbitwise operations, we show …
S Angizi, D Fan - arXiv preprint arXiv:1904.05782, 2019 - arxiv.org
… a fast (< 100ns) in-memory copy operation within DRAM sub-arrays, rather than using ∼ 1µs conventional operation in Von-Neumann computing systems, RowClone-Fast Parallel …
X Xin, Y Zhang, J Yang - 2020 IEEE International Symposium …, 2020 - ieeexplore.ieee.org
… • We present a lightweight mechanism to implement bulkbitwise operation in DRAM, … offer fast analytics in specific applications for databases. Traditionally, the bulkbitwise operation …
F Zhang, S Angizi, D Fan - 2021 58th ACM/IEEE Design …, 2021 - ieeexplore.ieee.org
… (2) A new in-DRAM computing circuit and architecture, termed as Max-PIM, is then proposed to support complete bulkbit-wise Boolean logic and optimized for our proposed min/max-in-…
… as the underlying bulk-bitwise technology as it can operate on both bitlines and wordlines [36], enabling a richer functionality compared to other (eg, DRAM-based) bulk-bitwise PIM. …
MF Ali, A Jaiswal, K Roy - … on Circuits and Systems I: Regular …, 2019 - ieeexplore.ieee.org
… bulk copy and data initialization inside the DRAM chip. Ambit [20] exploits triple-row activation for performing bulkbit-wise … We propose a fast in-DRAM addition mechanism, where the …
… for bulkbitwise operations; (i) it falls short of maximally exploiting the bit-level parallelism of bulkbitwise … We model DRAM timing with the DDR4 interface [123] in Ramulator [124, 125], a …
R Zhou, A Roohi, D Misra, S Angizi - Proceedings of the ACM/IEEE …, 2022 - dl.acm.org
… -DRAM framework named FlexiDRAM that supports the efficient implementation of complex bulkbitwise … RowClone: Fast and energy-efficient in-DRAMbulk data copy and initialization. …
M Zarubin, P Damme, T Kissinger, D Habich… - Proceedings of the 15th …, 2019 - dl.acm.org
… Similarly to the compression experiments (cf., Section 4), we observe that the performance of NVRAM-only and interplayed schemes is never faster than that of DRAM-only allocation. …