作者
Tan Doan, Hung Nguyen, Dat Thanh Ngo, Lam Pham, Ha Hoang Kha
发表日期
2019/10/10
研讨会论文
2019 International Symposium on Electrical and Electronics Engineering (ISEE)
页码范围
63-67
出版商
IEEE
简介
In this paper, we present a deep learning framework applied for acoustic scene classification (ASC) recognizing the environmental sounds. Since an audio scene related to a given location potentially contains numerous sound events, only few of these events supply helpful information on the scene, which makes the acoustic scene classification task become a very complex problem. To confront this challenge, we suggest a novel architecture consisting of two basic processes. The front-end process approaches a spectrogram feature, using Gammatone filters. Regarding the back-end classification, we propose a novel convolutional neural network (CNN) architecture that enforces the network deeply learning middle convolutional layers. Our experiments conducted over DCASE2016 task 1A dataset offer the highest classification accuracy of 84.4% as compared to 72.5% of DCASE2016 baseline.
引用总数
学术搜索中的文章
T Doan, H Nguyen, DT Ngo, L Pham, HH Kha - 2019 International Symposium on Electrical and …, 2019