LEAF: A learnable frontend for audio classification

N Zeghidour, O Teboul, FDC Quitry… - arXiv preprint arXiv …, 2021 - arxiv.org
Mel-filterbanks are fixed, engineered audio features which emulate human perception and
have been used through the history of audio understanding up to today. However, their …

[BOOK] Fundamentals of music processing: Using Python and Jupyter notebooks

M Müller - 2021 - Springer
The textbook provides both profound technological knowledge and a comprehensive
treatment of essential topics in music processing and music information retrieval (MIR) …

Neural audio fingerprint for high-specific audio retrieval based on contrastive learning

S Chang, D Lee, J Park, H Lim, K Lee… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Most of existing audio fingerprinting systems have limitations to be used for high-specific
audio retrieval at scale. In this work, we generate a low-dimensional representation from a …

WearBreathing: Real world respiratory rate monitoring using smartwatches

D Liaqat, M Abdalla, P Abed-Esfahani… - Proceedings of the …, 2019 - dl.acm.org
Respiratory rate is a vital physiological signal that may be useful for a multitude of clinical
applications, especially if measured in the wild rather than controlled settings. In-the-wild …

Self-supervised audio representation learning for mobile devices

M Tagliasacchi, B Gfeller, FC Quitry… - arXiv preprint arXiv …, 2019 - arxiv.org
We explore self-supervised models that can be potentially deployed on mobile devices to
learn general purpose audio representations. Specifically, we propose methods that exploit …

Is my phone listening in? On the feasibility and detectability of mobile eavesdropping

JL Kröger, P Raschke - Data and Applications Security and Privacy XXXIII …, 2019 - Springer
Besides various other privacy concerns with mobile devices, many people suspect their
smartphones to be secretly eavesdropping on them. In particular, a large number of reports …

[PDF] Open Broadcast Media Audio from TV: A Dataset of TV Broadcast Audio with Relative Music Loudness Annotations

B Meléndez-Catalán, E Molina… - Trans. Int. Soc. Music …, 2019 - emilio-molina.github.io
Open Broadcast Media Audio from TV (OpenBMAT) is an open, annotated dataset for the
task of music detection that contains over 27 hours of TV broadcast audio from 4 countries …

Automating nearest neighbor search configuration with constrained optimization

P Sun, R Guo, S Kumar - arXiv preprint arXiv:2301.01702, 2023 - arxiv.org
The approximate nearest neighbor (ANN) search problem is fundamental to efficiently
serving many real-world machine learning applications. A number of techniques have been …

Asymmetric contrastive learning for audio fingerprinting

X Wu, H Wang - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org
Audio fingerprinting methods can compress audio contents into compact signatures so that
we can save storage and reduce query time. This technology is widely used in many fields …

Attention-based audio embeddings for query-by-example

A Singh, K Demuynck, V Arora - arXiv preprint arXiv:2210.08624, 2022 - arxiv.org
An ideal audio retrieval system efficiently and robustly recognizes a short query snippet from
an extensive database. However, the performance of well-known audio fingerprinting …