Deep spoken keyword spotting: An overview

I López-Espejo, ZH Tan, JHL Hansen, J Jensen - IEEE Access, 2021 - ieeexplore.ieee.org
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …

Speech commands: A dataset for limited-vocabulary speech recognition

P Warden - arXiv preprint arXiv:1804.03209, 2018 - arxiv.org
Describes an audio dataset of spoken words designed to help train and evaluate keyword
spotting systems. Discusses why this task is an interesting challenge, and why it requires a …

Broadcasted residual learning for efficient keyword spotting

B Kim, S Chang, J Lee, D Sung - arXiv preprint arXiv:2106.04140, 2021 - arxiv.org
Keyword spotting is an important research field because it plays a key role in device wake-
up and user interaction on smart devices. However, it is challenging to minimize errors while …

Temporal convolution for real-time keyword spotting on mobile devices

S Choi, S Seo, B Shin, H Byun, M Kersner… - arXiv preprint arXiv …, 2019 - arxiv.org
Keyword spotting (KWS) plays a critical role in enabling speech-based user interactions on
smart devices. Recent developments in the field of deep learning have led to wide adoption …

A neural attention model for speech command recognition

DC De Andrade, S Leo, MLDS Viana… - arXiv preprint arXiv …, 2018 - arxiv.org
This paper introduces a convolutional recurrent network with attention for speech command
recognition. Attention models are powerful tools to improve performance on natural …

Xrbench: An extended reality (xr) machine learning benchmark suite for the metaverse

H Kwon, K Nair, J Seo, J Yik… - Proceedings of …, 2023 - proceedings.mlsys.org
Real-time multi-task multi-model (MTMM) workloads, a new form of deep learning inference
workloads, are emerging for applications areas like extended reality (XR) to support …

Asymmetric temperature scaling makes larger networks teach well again

XC Li, WS Fan, S Song, Y Li… - Advances in neural …, 2022 - proceedings.neurips.cc
Abstract Knowledge Distillation (KD) aims at transferring the knowledge of a well-performed
neural network (the {\it teacher}) to a weaker one (the {\it student}). A peculiar phenomenon …

MoCA: Memory-centric, adaptive execution for multi-tenant deep neural networks

S Kim, H Genc, VV Nikiforov, K Asanović… - … Symposium on High …, 2023 - ieeexplore.ieee.org
Driven by the wide adoption of deep neural networks (DNNs) across different application
domains, multi-tenancy execution, where multiple DNNs are deployed simultaneously on …

Convmixer: Feature interactive convolution with curriculum learning for small footprint and noisy far-field keyword spotting

D Ng, Y Chen, B Tian, Q Fu… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Building efficient architecture in neural speech processing is paramount to success in
keyword spotting deployment. However, it is very challenging for lightweight models to …

[PDF][PDF] Spatial-Temporal Self-Attention for Asynchronous Spiking Neural Networks.

Y Wang, K Shi, C Lu, Y Liu, M Zhang, H Qu - IJCAI, 2023 - ijcai.org
The brain-inspired spiking neural networks (SNNs) are receiving increasing attention due to
their asynchronous event-driven characteristics and low power consumption. As attention …