- 学术资源搜索

LEAF: A learnable frontend for audio classification

N Zeghidour, O Teboul, FDC Quitry… - arXiv preprint arXiv …, 2021 - arxiv.org

Mel-filterbanks are fixed, engineered audio features which emulate human perception and
have been used through the history of audio understanding up to today. However, their …

被引用次数：164 相关文章所有 3 个版本

[图书][B] Fundamentals of music processing: Using Python and Jupyter notebooks

M Müller - 2021 - Springer

The textbook provides both profound technological knowledge and a comprehensive
treatment of essential topics in music processing and music information retrieval (MIR) …

被引用次数：65 相关文章所有 5 个版本

[PDF] arxiv.org

Neural audio fingerprint for high-specific audio retrieval based on contrastive learning

S Chang, D Lee, J Park, H Lim, K Lee… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Most of existing audio fingerprinting systems have limitations to be used for high-specific
audio retrieval at scale. In this work, we generate a low-dimensional representation from a …

被引用次数：40 相关文章所有 6 个版本

[PDF] toronto.edu

WearBreathing: Real world respiratory rate monitoring using smartwatches

D Liaqat, M Abdalla, P Abed-Esfahani… - Proceedings of the …, 2019 - dl.acm.org

Respiratory rate is a vital physiological signal that may be useful for a multitude of clinical
applications, especially if measured in the wild rather than controlled settings. In-the-wild …

被引用次数：53 相关文章所有 5 个版本

[PDF] arxiv.org

Self-supervised audio representation learning for mobile devices

M Tagliasacchi, B Gfeller, FC Quitry… - arXiv preprint arXiv …, 2019 - arxiv.org

We explore self-supervised models that can be potentially deployed on mobile devices to
learn general purpose audio representations. Specifically, we propose methods that exploit …

被引用次数：49 相关文章所有 2 个版本

[PDF] springer.com

Is my phone listening in? On the feasibility and detectability of mobile eavesdropping

JL Kröger, P Raschke - Data and Applications Security and Privacy XXXIII …, 2019 - Springer

Besides various other privacy concerns with mobile devices, many people suspect their
smartphones to be secretly eavesdropping on them. In particular, a large number of reports …

被引用次数：39 相关文章所有 9 个版本

[PDF] github.io

[PDF][PDF] Open Broadcast Media Audio from TV: A Dataset of TV Broadcast Audio with Relative Music Loudness Annotations.

B Meléndez-Catalán, E Molina… - Trans. Int. Soc. Music …, 2019 - emilio-molina.github.io

Open Broadcast Media Audio from TV (OpenBMAT) is an open, annotated dataset for the
task of music detection that contains over 27 hours of TV broadcast audio from 4 countries …

被引用次数：26 相关文章所有 6 个版本

[PDF] arxiv.org

Automating nearest neighbor search configuration with constrained optimization

P Sun, R Guo, S Kumar - arXiv preprint arXiv:2301.01702, 2023 - arxiv.org

The approximate nearest neighbor (ANN) search problem is fundamental to efficiently
serving many real-world machine learning applications. A number of techniques have been …

被引用次数：5 相关文章所有 5 个版本

Asymmetric contrastive learning for audio fingerprinting

X Wu, H Wang - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org

Audio fingerprinting methods can compress audio contents into compact signatures so that
we can save storage and reduce query time. This technology is widely used in many fields …

被引用次数：7 相关文章所有 2 个版本

[PDF] arxiv.org

Attention-based audio embeddings for query-by-example

A Singh, K Demuynck, V Arora - arXiv preprint arXiv:2210.08624, 2022 - arxiv.org

An ideal audio retrieval system efficiently and robustly recognizes a short query snippet from
an extensive database. However, the performance of well-known audio fingerprinting …

被引用次数：6 相关文章所有 7 个版本

高级搜索

QQ 群

LEAF: A learnable frontend for audio classification

[图书][B] Fundamentals of music processing: Using Python and Jupyter notebooks

Neural audio fingerprint for high-specific audio retrieval based on contrastive learning

WearBreathing: Real world respiratory rate monitoring using smartwatches

Self-supervised audio representation learning for mobile devices

Is my phone listening in? On the feasibility and detectability of mobile eavesdropping

[PDF][PDF] Open Broadcast Media Audio from TV: A Dataset of TV Broadcast Audio with Relative Music Loudness Annotations.

Automating nearest neighbor search configuration with constrained optimization

Asymmetric contrastive learning for audio fingerprinting

Attention-based audio embeddings for query-by-example

引用