Neural cache: Bit-serial in-cache acceleration of deep neural networks

C Eckert, X Wang, J Wang… - 2018 ACM/IEEE …, 2018 - ieeexplore.ieee.org
This paper presents the Neural Cache architecture, which re-purposes cache structures to
transform them into massively parallel compute units capable of running inferences for Deep …

High-performance and energy-efficient mobile web browsing on big/little systems

Y Zhu, VJ Reddi - 2013 IEEE 19th International Symposium on …, 2013 - ieeexplore.ieee.org
Internet web browsing has reached a critical tipping point. Increasingly, users rely more on
mobile web browsers to access the Internet than desktop browsers. Meanwhile, webpages …

Cache automaton

A Subramaniyan, J Wang… - Proceedings of the 50th …, 2017 - dl.acm.org
Finite State Automata are widely used to accelerate pattern matching in many emerging
application domains like DNA sequencing and XML parsing. Conventional CPUs and …

Navigating big data with high-throughput, energy-efficient data partitioning

L Wu, RJ Barker, MA Kim, KA Ross - Proceedings of the 40th Annual …, 2013 - dl.acm.org
The global pool of data is growing at 2.5 quintillion bytes per day, with 90% of it produced in
the last two years alone [24]. There is no doubt the era of big data has arrived. This paper …

Data-parallel finite-state machines

T Mytkowicz, M Musuvathi, W Schulte - Proceedings of the 19th …, 2014 - dl.acm.org
A finite-state machine (FSM) is an important abstraction for solving several problems,
including regular-expression matching, tokenizing text, and Huffman decoding. FSM …

HARE: Hardware accelerator for regular expressions

V Gogte, A Kolli, MJ Cafarella… - 2016 49th Annual …, 2016 - ieeexplore.ieee.org
Rapidly processing text data is critical for many technical and business applications.
Traditional software-based tools for processing large text corpora use memory bandwidth …

Scalable structural index construction for JSON analytics

L Jiang, J Qiu, Z Zhao - Proceedings of the VLDB Endowment, 2020 - dl.acm.org
JavaScript Object Notation (JSON) and its variants have gained great popularity in recent
years. Unfortunately, the performance of their analytics is often dragged down by the …

REGISTOR: A platform for unstructured data processing inside SSD storage

S Pei, J Yang, Q Yang - ACM Transactions on Storage (TOS), 2019 - dl.acm.org
This article presents REGISTOR, a platform for regular expression grabbing inside storage.
The main idea of Registor is accelerating regular expression (regex) search inside …

Architectural support for server-side PHP processing

D Gope, DJ Schlais, MH Lipasti - 2017 ACM/IEEE 44th Annual …, 2017 - ieeexplore.ieee.org
PHP is the dominant server-side scripting language used to implement dynamic web
content. Just-in-time compilation, as implemented in Facebook's state-of-the-art HipHopVM …

Microspec: Speculation-centric fine-grained parallelization for FSM computations

J Qiu, Z Zhao, B Ren - … of the 2016 International Conference on Parallel …, 2016 - dl.acm.org
Finite state machines (FSMs) are basic computation models that play essential roles in many
applications. Enabling efficient parallel FSM execution is critical to the performance of these …