Deep spoken keyword spotting: An overview

I López-Espejo, ZH Tan, JHL Hansen, J Jensen - IEEE Access, 2021 - ieeexplore.ieee.org
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …

A 510-nW wake-up keyword-spotting chip using serial-FFT-based MFCC and binarized depthwise separable CNN in 28-nm CMOS

W Shan, M Yang, T Wang, Y Lu, H Cai… - IEEE Journal of Solid …, 2020 - ieeexplore.ieee.org
We propose a sub-μW always-ON keyword spotting (μKWS) chip for audio wake-up
systems. It is mainly composed of a neural network (NN) and a feature extraction (FE) circuit …

Learning audio-text agreement for open-vocabulary keyword spotting

HK Shin, H Han, D Kim, SW Chung… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we propose a novel end-to-end user-defined keyword spotting method that
utilizes linguistically corresponding patterns between speech and text sequences. Unlike …

Seeing wake words: Audio-visual keyword spotting

L Momeni, T Afouras, T Stafylakis, S Albanie… - arXiv preprint arXiv …, 2020 - arxiv.org
The goal of this work is to automatically determine whether and when a word of interest is
spoken by a talking face, with or without the audio. We propose a zero-shot method suitable …

Neural architecture search for keyword spotting

T Mo, Y Yu, M Salameh, D Niu, S Jui - arXiv preprint arXiv:2009.00165, 2020 - arxiv.org
Deep neural networks have recently become a popular solution to keyword spotting
systems, which enable the control of smart devices via voice. In this paper, we apply neural …

AAD-KWS: A Sub-μ W Keyword Spotting Chip With an Acoustic Activity Detector Embedded in MFCC and a Tunable Detection Window in 28-nm CMOS

W Shan, J Qian, L Zhu, J Yang… - IEEE Journal of Solid …, 2022 - ieeexplore.ieee.org
As a widely used speech-triggered interface, deep-learning-based keyword spotting (KWS)
chips require both ultra-low power and high detection accuracy. We propose a sub …

Multi-objective hardware-aware neural architecture search with Pareto rank-preserving surrogate models

H Benmeziane, H Ouarnoughi… - ACM Transactions on …, 2023 - dl.acm.org
Deep learning (DL) models such as convolutional neural networks (ConvNets) are being
deployed to solve various computer vision and natural language processing tasks at the …

Exploring TinyML Frameworks for Small-Footprint Keyword Spotting: A Concise Overview

S Garai, S Samui - 2024 International Conference on Signal …, 2024 - ieeexplore.ieee.org
Keyword spotting with a Small Footprint (SF-KWS) has gained popularity in today's
landscape of smart voice-activated devices, smartphones, and IoT applications. This surge …

Efficient Self-Attention Model for Speech Recognition-Based Assistive Robots Control

S Poirier, U Côté-Allard, F Routhier… - Sensors, 2023 - mdpi.com
Assistive robots are tools that people living with upper body disabilities can leverage to
autonomously perform Activities of Daily Living (ADL). Unfortunately, conventional control …

Autokws: Keyword spotting with differentiable architecture search

B Zhang, W Li, Q Li, W Zhuang, X Chu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Smart audio devices are gated by an always-on lightweight keyword spotting program to
reduce power consumption. It is however challenging to design models that have both high …