This technical report describes our method submitted to Task 4 of the DCASE 2018 challenge, which evaluates systems for the detection of sound events in domestic environments using large-scale weakly labeled data. In particular, an architecture based on the convolutional recurrent neural network (CRNN) framework is used to detect the timestamps of all events in given audio clips, where the training audio files carry only clip-level labels. To take advantage of the large-scale unlabeled in-domain training data, a deep residual network model (ResNeXt) is first employed to predict weak labels for the unlabeled data. In addition, the mixup technique is applied during model training, which is expected to provide data augmentation and improve the model's generalization capability. Finally, the system achieves a class-wise average F1 score of 22.05% for sound event detection on the provided test set.
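For reference, mixup forms each training example as a convex combination of two samples and their (clip-level) labels, with the mixing weight drawn from a Beta distribution. The following is a minimal sketch of the general technique, not the report's exact training code; the alpha value and array shapes are illustrative assumptions.

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.2):
    """Blend two training examples and their label vectors with a Beta-sampled weight.

    alpha=0.2 is a commonly used default, not the report's setting (assumption).
    """
    lam = np.random.beta(alpha, alpha)          # mixing coefficient in (0, 1)
    x = lam * x1 + (1.0 - lam) * x2             # mixed input features
    y = lam * y1 + (1.0 - lam) * y2             # mixed (soft) clip-level labels
    return x, y

# Illustrative usage with toy spectrogram patches and one-hot weak labels
x1, x2 = np.zeros((4, 3)), np.ones((4, 3))
y1, y2 = np.array([1.0, 0.0]), np.array([0.0, 1.0])
x_mix, y_mix = mixup(x1, y1, x2, y2)
```

Because the mixed labels are soft, the model is trained on interpolated targets rather than hard one-hot vectors, which tends to smooth the decision boundary.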