Detection of adventitious respiratory sounds based on convolutional neural network

R Liu, S Cai, K Zhang, N Hu - 2019 International Conference on …, 2019 - ieeexplore.ieee.org
Nowadays, the respiratory disease has become one of the most dangerous diseases that
threaten human health, especially in the developing countries. The early diagnosis of …

A unified deep learning framework for short-duration speaker verification in adverse environments

Y Jung, Y Choi, H Lim, H Kim - IEEE Access, 2020 - ieeexplore.ieee.org
Speaker verification (SV) has recently attracted considerable research interest due to the
growing popularity of virtual assistants. At the same time, there is an increasing requirement …

Convolutional neural networks for audio-based continuous infant cry monitoring at home

J Xie, X Long, RA Otte, C Shan - IEEE Sensors Journal, 2021 - ieeexplore.ieee.org
Cry is an important signal in early infancy for parents to understand needs of their baby and
thereby to provide timely parenting/soothing or to be reassured. Thanks to the recent …

Multi-task learning-based spoofing-robust automatic speaker verification system

Y Zhao, R Togneri, V Sreeram - Circuits, Systems, and Signal Processing, 2022 - Springer
Spoofing attacks posed by generating artificial speech can severely degrade the
performance of a speaker verification system. Recently, many anti-spoofing …

[HTML][HTML] PyMAiVAR: An open-source Python suit for audio-image representation in human action recognition

MB Shaikh, D Chai, SMS Islam, N Akhtar - Software Impacts, 2023 - Elsevier
We present PyMAiVAR, a versatile toolbox that encompasses the generation of image
representations for audio data including Wave plots, Spectral Centroids, Spectral Roll Offs …

Multi-Spectral and Multi-Temporal Features Fusion with SE Network for Anomalous Sound Detection

D Kong, H Yu, G Yuan - IEEE Access, 2024 - ieeexplore.ieee.org
Unsupervised anomalous sound detection (ASD) identifies anomalies by learning or
estimating normal operational patterns and detecting deviations. This capability is crucial for …

Self-adaptive soft voice activity detection using deep neural networks for robust speaker verification

Y Jung, Y Choi, H Kim - 2019 IEEE Automatic Speech …, 2019 - ieeexplore.ieee.org
Voice activity detection (VAD), which classifies frames as speech or non-speech, is an
important module in many speech applications including speaker verification. In this paper …

An unsupervised data-driven approach for wind turbine blade damage detection under passive acoustics-based excitation

J Solimine, M Inalpolat - Wind Engineering, 2022 - journals.sagepub.com
Existing passive acoustics-based techniques for wind turbine blade damage detection lack
the robustness and adaptability necessary for an operational implementation due to their …

[PDF][PDF] Real-time causal spectro-temporal voice activity detection based on convolutional encoding and residual decoding

J Wang, J Zhang, LR Dai - Proc. INTERSPEECH, 2023 - isca-archive.org
Voice activity detection (VAD) is an essential front-end in many speech applications that
aims at determining the presence or absence of speech signals in an audio frame. However …

Dual attention in time and frequency domain for voice activity detection

J Lee, Y Jung, H Kim - arXiv preprint arXiv:2003.12266, 2020 - arxiv.org
Voice activity detection (VAD) is a challenging task in low signal-to-noise ratio (SNR)
environment, especially in non-stationary noise. To deal with this issue, we propose a novel …