Cross-modal music retrieval and applications: An overview of key methodologies

M Müller, A Arzt, S Balke, M Dorfer… - IEEE Signal Processing …, 2018 - ieeexplore.ieee.org
There has been a rapid growth of digitally available music data, including audio recordings,
digitized images of sheet music, album covers and liner notes, and video clips. This huge …

Schubert Winterreise dataset: A multimodal scenario for music analysis

C Weiß, F Zalkow, V Arifi-Müller, M Müller… - Journal on Computing …, 2021 - dl.acm.org
This article presents a multimodal dataset comprising various representations and
annotations of Franz Schubert's song cycle Winterreise. Schubert's seminal work constitutes …

Multimodal music information processing and retrieval: Survey and future challenges

F Simonetta, S Ntalampiras… - … workshop on multilayer …, 2019 - ieeexplore.ieee.org
Towards improving the performance in various music information processing tasks, recent
studies exploit different modalities able to capture diverse aspects of music. Such modalities …

[PDF][PDF] Learning Audio-Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification.

M Dorfer, J Hajic Jr, A Arzt… - Trans. Int. Soc. Music …, 2018 - pdfs.semanticscholar.org
This work addresses the problem of matching musical audio directly to sheet music, without
any higherlevel abstract representation. We propose a method that learns joint embedding …

[PDF][PDF] Vocal Melody Extraction with Semantic Segmentation and Audio-symbolic Domain Transfer Learning.

WT Lu, L Su - ISMIR, 2018 - ismir2018.ircam.fr
The melody extraction problem is analogue to semantic segmentation on a time-frequency
image, in which every pixel on the image is classified as a part of a melody object or not …

[PDF][PDF] Towards Full-Pipeline Handwritten OMR with Musical Symbol Detection by U-Nets.

J Hajic Jr, M Dorfer, G Widmer, P Pecina - ISMIR, 2018 - ismir2018.ismir.net
Detecting music notation symbols is the most immediate unsolved subproblem in Optical
Music Recognition for musical manuscripts. We show that a U-Net architecture for semantic …

Learning to listen, read, and follow: Score following as a reinforcement learning game

M Dorfer, F Henkel, G Widmer - arXiv preprint arXiv:1807.06391, 2018 - arxiv.org
Score following is the process of tracking a musical performance (audio) with respect to a
known symbolic representation (a score). We start this paper by formulating score following …

Artist similarity with graph neural networks

F Korzeniowski, S Oramas, F Gouyon - arXiv preprint arXiv:2107.14541, 2021 - arxiv.org
Artist similarity plays an important role in organizing, understanding, and subsequently,
facilitating discovery in large collections of music. In this paper, we present a hybrid …

Improved handling of repeats and jumps in audio-sheet image synchronization

M Shan, TJ Tsai - arXiv preprint arXiv:2007.14580, 2020 - arxiv.org
This paper studies the problem of automatically generating piano score following videos
given an audio recording and raw sheet music images. Whereas previous works focus on …

Passage summarization with recurrent models for audio-sheet music retrieval

L Carvalho, G Widmer - arXiv preprint arXiv:2309.12111, 2023 - arxiv.org
Many applications of cross-modal music retrieval are related to connecting sheet music
images to audio recordings. A typical and recent approach to this is to learn, via deep neural …