查看文章

arxiv.org 中的 [PDF]

Deep Feature Embedding and Hierarchical Classification for Audio Scene Classification

作者

Lam Pham, Ian McLoughlin, Huy Phan, Ramaswamy Palaniappan, Alfred Mertins

发表日期

2020

研讨会论文

in Proc. IJCNN, 2020, pp. 1-7

出版商

IEEE

简介

In this work, we propose an approach that features deep feature embedding learning and hierarchical classification with triplet loss function for Acoustic Scene Classification (ASC). In the one hand, a deep convolutional neural network is firstly trained to learn a feature embedding from scene audio signals. Via the trained convolutional neural network, the learned embedding embeds an input into the embedding feature space and transforms it into a high-level feature vector for representation. In the other hand, in order to exploit the structure of the scene categories, the original scene classification problem is structured into a hierarchy where similar categories are grouped into meta-categories. Then, hierarchical classification is accomplished using deep neural network classifiers associated with triplet loss function. Our experiments show that the proposed system achieves good performance on both the DCASE 2018 …

引用总数

被引用次数：23

20212022202320248 10 4 1

学术搜索中的文章

Deep feature embedding and hierarchical classification for audio scene classification

L Pham, I McLoughlin, H Phan, R Palaniappan… - 2020 International Joint Conference on Neural …, 2020

被引用次数：23 相关文章所有 12 个版本