作者
Ivan Kiskin, Davide Zilli, Yunpeng Li, Marianne Sinka, Kathy Willis, Stephen Roberts
发表日期
2020/2
期刊
Neural Computing and Applications
卷号
32
页码范围
915-927
出版商
Springer London
简介
Many real-world time series analysis problems are characterized by low signal-to-noise ratios and compounded by scarce data. Solutions to these types of problems often rely on handcrafted features extracted in the time or frequency domain. Recent high-profile advances in deep learning have improved performance across many application domains; however, they typically rely on large data sets that may not always be available. This paper presents an application of deep learning for acoustic event detection in a challenging, data-scarce, real-world problem. We show that convolutional neural networks (CNNs), operating on wavelet transformations of audio recordings, demonstrate superior performance over conventional classifiers that utilize handcrafted features. Our key result is that wavelet transformations offer a clear benefit over the more commonly used short-time Fourier transform. Furthermore, we show …
引用总数
2019202020212022202320246132314105
学术搜索中的文章
I Kiskin, D Zilli, Y Li, M Sinka, K Willis, S Roberts - Neural Computing and Applications, 2020