Deep spoken keyword spotting: An overview

I López-Espejo, ZH Tan, JHL Hansen, J Jensen - IEEE Access, 2021 - ieeexplore.ieee.org
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …

A survey on optimization techniques for edge artificial intelligence (ai)

C Surianarayanan, JJ Lawrence, PR Chelliah… - Sensors, 2023 - mdpi.com
Artificial Intelligence (Al) models are being produced and used to solve a variety of current
and future business and technical problems. Therefore, AI model engineering processes …

Hello edge: Keyword spotting on microcontrollers

Y Zhang, N Suda, L Lai, V Chandra - arXiv preprint arXiv:1711.07128, 2017 - arxiv.org
Keyword spotting (KWS) is a critical component for enabling speech based user interactions
on smart devices. It requires real-time response and high accuracy for good user …

Intelligence beyond the edge: Inference on intermittent embedded systems

G Gobieski, B Lucia, N Beckmann - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org
Energy-harvesting technology provides a promising platform for future IoT applications.
However, since communication is very expensive in these devices, applications will require …

Model compression with adversarial robustness: A unified optimization framework

S Gui, H Wang, H Yang, C Yu… - Advances in Neural …, 2019 - proceedings.neurips.cc
Deep model compression has been extensively studied, and state-of-the-art methods can
now achieve high compression ratios with minimal accuracy loss. This paper studies model …

Personalized speech recognition on mobile devices

I McGraw, R Prabhavalkar, R Alvarez… - … , Speech and Signal …, 2016 - ieeexplore.ieee.org
We describe a large vocabulary speech recognition system that is accurate, has low latency,
and yet has a small enough memory and computational footprint to run faster than real-time …

[HTML][HTML] Compressed time delay neural network for small-footprint keyword spotting

M Sun, D Snyder, Y Gao, V Nagaraja, M Rodehorst… - 2017 - amazon.science
In this paper we investigate a time delay neural network (TDNN) for a keyword spotting task
that requires low CPU, memory and latency. The TDNN is trained with transfer learning and …

Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting

M Sun, A Raju, G Tucker… - 2016 IEEE spoken …, 2016 - ieeexplore.ieee.org
We propose a max-pooling based loss function for training Long Short-Term Memory
(LSTM) networks for small-footprint keyword spotting (KWS), with low CPU, memory, and …

Binarized-blstm-rnn based human activity recognition

M Edel, E Köppe - … conference on indoor positioning and indoor …, 2016 - ieeexplore.ieee.org
High computational complexity hinders the widespread usage of neural networks, especially
in mobile devices, which are often the basis of fine-grained localization technology for …

On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition

R Prabhavalkar, O Alsharif, A Bruguier… - … on Acoustics, Speech …, 2016 - ieeexplore.ieee.org
We study the problem of compressing recurrent neural networks (RNNs). In particular, we
focus on the compression of RNN acoustic models, which are motivated by the goal of …