MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data …

GF Oliveira, A Olgun, AG Yağlıkçı… - … Symposium on High …, 2024 - ieeexplore.ieee.org
Processing-using-DRAM (PUD) is a processing-in-memory (PIM) approach that uses a
DRAM array's massive internal parallelism to execute very-wide (eg, 16,384-262,144-bit …

PIM GPT a hybrid process in memory accelerator for autoregressive transformers

Y Wu, Z Wang, WD Lu - npj Unconventional Computing, 2024 - nature.com
Decoder-only Transformer models such as Generative Pre-trained Transformers (GPT) have
demonstrated exceptional performance in text generation by autoregressively predicting the …

Accelerating genome analysis via algorithm-architecture co-design

O Mutlu, C Firtina - 2023 60th ACM/IEEE Design Automation …, 2023 - ieeexplore.ieee.org
High-throughput sequencing (HTS) technologies have revolutionized the field of genomics,
enabling rapid and cost-effective genome analysis for various applications. However, the …

CASA: An Energy-Efficient and High-Speed CAM-based SMEM Seeding Accelerator for Genome Alignment

Y Huang, L Kong, D Chen, Z Chen, X Kong… - Proceedings of the 56th …, 2023 - dl.acm.org
Genome analysis is a critical tool in medical and bioscience research, clinical diagnostics
and treatment, and disease control and prevention. Seed and extension-based alignment is …

A framework for designing efficient deep learning-based genomic basecallers

G Singh, M Alser, A Khodamoradi, K Denolf… - arXiv preprint arXiv …, 2022 - arxiv.org
Nanopore sequencing generates noisy electrical signals that need to be converted into a
standard string of DNA nucleotide bases using a computational step called basecalling. The …

Ppimce: An in-memory computing fabric for privacy preserving computing

H Geng, J Mo, D Reis, J Takeshita, T Jung… - arXiv preprint arXiv …, 2023 - arxiv.org
Privacy has rapidly become a major concern/design consideration. Homomorphic
Encryption (HE) and Garbled Circuits (GC) are privacy-preserving techniques that support …

RawHash: enabling fast and accurate real-time analysis of raw nanopore signals for large genomes

C Firtina, N Mansouri Ghiasi, J Lindegger… - …, 2023 - academic.oup.com
Nanopore sequencers generate electrical raw signals in real-time while sequencing long
genomic strands. These raw signals can be analyzed as they are generated, providing an …

TALCO: Tiling Genome Sequence Alignment Using Convergence of Traceback Pointers

S Walia, C Ye, A Bera, D Lodhavia… - … Symposium on High …, 2024 - ieeexplore.ieee.org
Pairwise sequence alignment is one of the most fundamental and computationally intensive
steps in genome analysis. With the improving costs and throughput of third-generation …

NDPmulator: Enabling Full-System Simulation for Near-Data Accelerators From Caches to DRAM

J Vieira, N Roma, G Falcao, P Tomás - IEEE Access, 2024 - ieeexplore.ieee.org
The accurate simulation and performance assessment of Near-Data Accelerators (NDAccs)
is a complex challenge as it must consider the operation of the entire processing system, the …

GateSeeder: Near-memory CPU-FPGA Acceleration of Short and Long Read Mapping

J Eudine, M Alser, G Singh, C Alkan, O Mutlu - arXiv preprint arXiv …, 2023 - arxiv.org
Motivation: Read mapping is a computationally expensive process and a major bottleneck in
genomics analyses. The performance of read mapping is mainly limited by the performance …