A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds

F Alías, JC Socoró, X Sevillano - Applied Sciences, 2016 - mdpi.com
Endowing machines with sensing capabilities similar to those of humans is a prevalent
quest in engineering and computer science. In the pursuit of making computers sense their …

A survey of audio-based music classification and annotation

Z Fu, G Lu, KM Ting, D Zhang - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Music information retrieval (MIR) is an emerging research area that receives growing
attention from both the research community and music industry. It addresses the problem of …

A mathematical theory of deep convolutional neural networks for feature extraction

T Wiatowski, H Bölcskei - IEEE Transactions on Information …, 2017 - ieeexplore.ieee.org
Deep convolutional neural networks (DCNNs) have led to breakthrough results in numerous
practical machine learning tasks, such as classification of images in the ImageNet data set …

Deep scattering spectrum

J Andén, S Mallat - IEEE Transactions on Signal Processing, 2014 - ieeexplore.ieee.org
A scattering transform defines a locally translation invariant representation which is stable to
time-warping deformation. It extends MFCC representations by computing modulation …

Turning a mobile device into a mouse in the air

S Yun, YC Chen, L Qiu - Proceedings of the 13th Annual International …, 2015 - dl.acm.org
A mouse has been one of the most successful user interfaces due to its intuitive use. As
more devices are equipped with displays and offer rich options for users to choose from, a …

The GTZAN dataset: Its contents, its faults, their effects on evaluation, and its future use

BL Sturm - arXiv preprint arXiv:1306.1461, 2013 - arxiv.org
The GTZAN dataset appears in at least 100 published works, and is the most-used public
dataset for evaluation in machine listening research for music genre recognition (MGR). Our …

Combining visual and acoustic features for music genre classification

L Nanni, YMG Costa, A Lumini, MY Kim… - Expert Systems with …, 2016 - Elsevier
Since musical genre is one of the most common ways used by people for managing digital
music databases, music genre recognition is a crucial task, deep studied by the Music …

A survey of evaluation in music genre recognition

BL Sturm - International Workshop on Adaptive Multimedia …, 2012 - Springer
Much work is focused upon music genre recognition (MGR) from audio recordings, symbolic
data, and other modalities. While reviews have been written of some of this work before, no …

A survey of underwater acoustic data classification methods using deep learning for shoreline surveillance

LCF Domingos, PE Santos, PSM Skelton… - Sensors, 2022 - mdpi.com
This paper presents a comprehensive overview of current deep-learning methods for
automatic object classification of underwater sonar data for shoreline surveillance …

An analysis of the GTZAN music genre dataset

BL Sturm - Proceedings of the second international ACM …, 2012 - dl.acm.org
A significant amount of work in automatic music genre recognition has used a dataset whose
composition and integrity has never been formally analyzed. For the first time, we provide an …