A Deep Learning Architecture with Spatio-Temporal Focusing for Detecting Respiratory Anomalies

D Ngo, L Pham, H Phan, M Tran… - 2023 IEEE Biomedical …, 2023 - ieeexplore.ieee.org
This paper presents a deep learning system applied for detecting anomalies from respiratory
sound recordings. Our system initially performs audio feature extraction using Continuous …

An audio-visual dataset and deep learning frameworks for crowded scene classification

L Pham, D Ngo, T Nguyen, P Nguyen… - Proceedings of the 19th …, 2022 - dl.acm.org
In this paper, we present the task of audio-visual scene classification (SC) where input
videos are classified into one of five real-life crowded scenes:'Riot','Noise-Street','Firework …

A low-complexity deep learning framework for acoustic scene classification

L Pham, H Tang, A Jalali, A Schindler, R King… - Data Science–Analytics …, 2022 - Springer
In this paper, we presents a low-complexity deep learning frameworks for acoustic scene
classification (ASC). The proposed framework can be separated into three main steps: Front …

Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders

P Lam, L Pham, T Nguyen, T Pham, LK Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing speaker diarization systems heavily rely on large amounts of manually annotated
data, which is labor-intensive and challenging to collect in real-world scenarios. Additionally …

Deep Learning Based Multimodal with Two-phase Training Strategy for Daily Life Video Classification

L Pham, T Le, C Le, D Ngo, A Weissenfeld… - Proceedings of the 20th …, 2023 - dl.acm.org
In this paper, we present a deep learning based multimodal system for classifying daily life
videos. To train the system, we propose a two-phase training strategy. In the first training …

Low-complexity deep learning frameworks for acoustic scene classification using teacher-student scheme and multiple spectrograms

L Pham, D Ngo, C Le, A Jalali, A Schindler - arXiv preprint arXiv …, 2023 - arxiv.org
In this technical report, a low-complexity deep learning system for acoustic scene
classification (ASC) is presented. The proposed system comprises two main phases:(Phase …

Light-Weight Deep Learning Models for Acoustic Scene Classification Using Teacher-Student Scheme and Multiple Spectrograms

L Pham, T Nguyen, P Lam, D Ngo… - … Symposium on the …, 2023 - ieeexplore.ieee.org
In this paper, we present a light-weight deep learning based system for acoustic scene
classification (ASC), which is armed to be integrated into an Internet of Sound (IoS) system …

DCASE 2022: Comparative Analysis Of CNNs For Acoustic Scene Classification Under Low-Complexity Considerations

J Zaragoza-Paredes, J Naranjo-Alcazar… - arXiv preprint arXiv …, 2022 - arxiv.org
Acoustic scene classification is an automatic listening problem that aims to assign an audio
recording to a pre-defined scene based on its audio data. Over the years (and in past …