Timbre analysis of music audio signals with convolutional neural networks

J Pons, O Slizovskaia, R Gong… - 2017 25th European …, 2017 - ieeexplore.ieee.org
The focus of this work is to study how to efficiently tailor Convolutional Neural Networks
(CNNs) towards learning timbre representations from log-mel magnitude spectrograms. We …

A tutorial on deep learning for music information retrieval

K Choi, G Fazekas, K Cho, M Sandler - arXiv preprint arXiv:1709.04396, 2017 - arxiv.org
Following their success in Computer Vision and other areas, deep learning techniques have
recently become widely adopted in Music Information Retrieval (MIR) research. However …

Automatic tagging using deep convolutional neural networks

K Choi, G Fazekas, M Sandler - arXiv preprint arXiv:1606.00298, 2016 - arxiv.org
We present a content-based automatic music tagging algorithm using fully convolutional
neural networks (FCNs). We evaluate different architectures consisting of 2D convolutional …

Transfer learning for music classification and regression tasks

K Choi, G Fazekas, M Sandler, K Cho - arXiv preprint arXiv:1703.09179, 2017 - arxiv.org
In this paper, we present a transfer learning approach for music classification and regression
tasks. We propose to use a pre-trained convnet feature, a concatenated feature vector using …

SampleCNN: End-to-end deep convolutional neural networks using very small filters for music classification

J Lee, J Park, KL Kim, J Nam - Applied Sciences, 2018 - mdpi.com
Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks
for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw …

Experimenting with musically motivated convolutional neural networks

J Pons, T Lidy, X Serra - 2016 14th international workshop on …, 2016 - ieeexplore.ieee.org
A common criticism of deep learning relates to the difficulty in understanding the underlying
relationships that the neural networks are learning, thus behaving like a black-box. In this …

Upsampling artifacts in neural audio synthesis

J Pons, S Pascual, G Cengarle… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
A number of recent advances in neural audio synthesis rely on up-sampling layers, which
can introduce undesired artifacts. In computer vision, upsampling artifacts have been …

[PDF][PDF] Automatic Stylistic Composition of Bach Chorales with Deep LSTM.

FT Liang, M Gotham, M Johnson, J Shotton - ISMIR, 2017 - microsoft.com
This paper presents “BachBot”: an end-to-end automatic composition system for composing
and completing music in the style of Bach's chorales using a deep long short-term memory …

Deep convolutional networks on the pitch spiral for musical instrument recognition

V Lostanlen, CE Cella - arXiv preprint arXiv:1605.06644, 2016 - arxiv.org
Musical performance combines a wide range of pitches, nuances, and expressive
techniques. Audio-based classification of musical instruments thus requires to build signal …

Explaining deep convolutional neural networks on music classification

K Choi, G Fazekas, M Sandler - arXiv preprint arXiv:1607.02444, 2016 - arxiv.org
Deep convolutional neural networks (CNNs) have been actively adopted in the field of music
information retrieval, eg genre classification, mood detection, and chord recognition …