MuseGAN: Multi-track sequential generative adversarial networks for symbolic music generation and accompaniment

HW Dong, WY Hsiao, LC Yang, YH Yang - Proceedings of the AAAI …, 2018 - ojs.aaai.org
Generating music has a few notable differences from generating images and videos. First,
music is an art of time, necessitating a temporal model. Second, music is usually composed …

[PDF] librosa: Audio and music signal analysis in Python.

B McFee, C Raffel, D Liang, DPW Ellis, M McVicar… - SciPy, 2015 - academia.edu
This document describes version 0.4.0 of librosa: a Python package for audio and music
signal processing. At a high level, librosa provides implementations of a variety of common …

ChoreoMaster: choreography-oriented music-driven dance synthesis

K Chen, Z Tan, J Lei, SH Zhang, YC Guo… - ACM Transactions on …, 2021 - dl.acm.org
Despite strong demand in the game and film industry, automatically synthesizing
high-quality dance motions remains a challenging task. In this paper, we present ChoreoMaster …

[PDF] Boundary Detection in Music Structure Analysis using Convolutional Neural Networks.

K Ullrich, J Schlüter, T Grill - ISMIR, 2014 - grrrr.org
The recognition of boundaries, e.g., between chorus and verse, is an important task in music
structure analysis. The goal is to automatically detect such boundaries in audio signals so …

PiRhDy: Learning pitch-, rhythm-, and dynamics-aware embeddings for symbolic music

H Liang, W Lei, PY Chan, Z Yang, M Sun… - Proceedings of the 28th …, 2020 - dl.acm.org
Definitive embeddings remain a fundamental challenge of computational musicology for
symbolic music in deep learning today. Analogous to natural language, music can be …

Unsupervised music structure annotation by time series structure features and segment similarity

J Serra, M Müller, P Grosche… - IEEE Transactions on …, 2014 - ieeexplore.ieee.org
Automatically inferring the structural properties of raw multimedia documents is essential in
today's digitized society. Given its hierarchical and multi-faceted organization, musical …

Video-to-music recommendation using temporal alignment of segments

L Prétet, G Richard, C Souchier… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
We study cross-modal recommendation of music tracks to be used as soundtracks for videos.
This problem is known as the music supervision task. We build on a self-supervised system …

Learning to segment songs with ordinal linear discriminant analysis

B McFee, DPW Ellis - 2014 IEEE International Conference on …, 2014 - ieeexplore.ieee.org
This paper describes a supervised learning algorithm which optimizes a feature
representation for temporally constrained clustering. The proposed method is applied to …

Crowdsourcing audio semantics by means of hybrid bimodal segmentation with hierarchical classification

L Vrysis, N Tsipas, C Dimoulas… - Journal of the Audio …, 2016 - aes.org
The task of general audio detection and segmentation is quite common in contemporary
audio applications where computationally intensive processes are frequently involved …

Evaluating hierarchical structure in music annotations

B McFee, O Nieto, MM Farbood, JP Bello - Frontiers in psychology, 2017 - frontiersin.org
Music exhibits structure at multiple scales, ranging from motifs to large-scale functional
components. When inferring the structure of a piece, different listeners may attend to …