From model-based optimization algorithms to deep learning models for clustering hyperspectral images

S Huang, H Zhang, H Zeng, A Pižurica - Remote Sensing, 2023 - mdpi.com
Hyperspectral images (HSIs), captured by different Earth observation airborne and space-
borne systems, provide rich spectral information in hundreds of bands, enabling far better …

Sound-based multiple-equipment activity recognition using convolutional neural networks

B Sherafat, A Rashidi, S Asgari - Automation in Construction, 2022 - Elsevier
Automatically recognizing activities of heavy construction equipment using sound data has
recently received considerable attention as a promising research area in construction …

Listen to interpret: Post-hoc interpretability for audio networks with nmf

J Parekh, S Parekh, P Mozharovskyi… - Advances in …, 2022 - proceedings.neurips.cc
This paper tackles post-hoc interpretability for audio processing networks. Our goal is to
interpret decisions of a trained network in terms of high-level audio objects that are also …

A novel enhancement approach following MVMD and NMF separation of complex snoring signals

M Al Mawla, K Chaccour, H Fares - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Snoring is a prominent characteristic of sleep-disordered breathing, and its detection is
critical for determining the severity of the upper airway obstruction and improving daily …

Tackling interpretability in audio classification networks with non-negative matrix factorization

J Parekh, S Parekh, P Mozharovskyi… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
This article tackles two major problem settings for interpretability of audio processing
networks, post-hoc and by-design interpretation. For post-hoc interpretation, we aim to …

Unsupervised audio source separation using generative priors

V Narayanaswamy, JJ Thiagarajan, R Anirudh… - arXiv preprint arXiv …, 2020 - arxiv.org
State-of-the-art under-determined audio source separation systems rely on supervised end-
end training of carefully tailored neural network architectures operating either in the time or …

Phase retrieval with Bregman divergences and application to audio signal recovery

PH Vial, P Magron, T Oberlin… - IEEE Journal of Selected …, 2021 - ieeexplore.ieee.org
Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products.
This problem arises in many audio signal processing applications which operate on a short …

Weakly supervised representation learning for audio-visual scene analysis

S Parekh, S Essid, A Ozerov… - … on Audio, Speech …, 2019 - ieeexplore.ieee.org
Audio-visual (AV) representation learning is an important task from the perspective of
designing machines with the ability to understand complex events. To this end, we propose …

Expanding boundaries of Gap Safe screening

CF Dantas, E Soubies, C Févotte - Journal of Machine Learning Research, 2021 - jmlr.org
Sparse optimization problems are ubiquitous in many fields such as statistics, signal/image
processing and machine learning. This has led to the birth of many iterative algorithms to …

Adaptive noise reduction for sound event detection using subband-weighted NMF

Q Zhou, Z Feng, E Benetos - Sensors, 2019 - mdpi.com
Sound event detection in real-world environments suffers from the interference of non-
stationary and time-varying noise. This paper presents an adaptive noise reduction method …