Understanding optical music recognition

J Calvo-Zaragoza, JH Jr, A Pacha - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
For over 50 years, researchers have been trying to teach computers to read music notation,
referred to as Optical Music Recognition (OMR). However, this field is still difficult to access …

Multi-label music genre classification from audio, text, and images using deep features

S Oramas, O Nieto, F Barbieri, X Serra - arXiv preprint arXiv:1707.04916, 2017 - arxiv.org
Music genres allow to categorize musical items that share common characteristics. Although
these categories are not mutually exclusive, most related research is traditionally focused on …

Schubert Winterreise dataset: A multimodal scenario for music analysis

C Weiß, F Zalkow, V Arifi-Müller, M Müller… - Journal on Computing …, 2021 - dl.acm.org
This article presents a multimodal dataset comprising various representations and
annotations of Franz Schubert's song cycle Winterreise. Schubert's seminal work constitutes …

Towards robust human-robot collaborative manufacturing: Multimodal fusion

H Liu, T Fang, T Zhou, L Wang - IEEE Access, 2018 - ieeexplore.ieee.org
Intuitive and robust multimodal robot control is the key toward human–robot collaboration
(HRC) for manufacturing systems. Multimodal robot control methods were introduced in …

[PDF][PDF] Learning Audio-Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification.

M Dorfer, J Hajic Jr, A Arzt… - Trans. Int. Soc. Music …, 2018 - pdfs.semanticscholar.org
This work addresses the problem of matching musical audio directly to sheet music, without
any higherlevel abstract representation. We propose a method that learns joint embedding …

End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss

M Dorfer, J Schlüter, A Vall, F Korzeniowski… - International Journal of …, 2018 - Springer
Cross-modality retrieval encompasses retrieval tasks where the fetched items are of a
different type than the search query, eg, retrieving pictures relevant to a given text query. The …

Learning audio-sheet music correspondences for score identification and offline alignment

M Dorfer, A Arzt, G Widmer - arXiv preprint arXiv:1707.09887, 2017 - arxiv.org
This work addresses the problem of matching short excerpts of audio with their respective
counterparts in sheet music images. We show how to employ neural network-based cross …

Learning to listen, read, and follow: Score following as a reinforcement learning game

M Dorfer, F Henkel, G Widmer - arXiv preprint arXiv:1807.06391, 2018 - arxiv.org
Score following is the process of tracking a musical performance (audio) with respect to a
known symbolic representation (a score). We start this paper by formulating score following …

[PDF][PDF] Flexible and robust music tracking

A Arzt - Ph. D. thesis, 2016 - epub.jku.at
Nowadays computers play an important role in all areas of music, from composition to
production and live performance. This is obvious for all kinds of popular music, where …

Improved handling of repeats and jumps in audio-sheet image synchronization

M Shan, TJ Tsai - arXiv preprint arXiv:2007.14580, 2020 - arxiv.org
This paper studies the problem of automatically generating piano score following videos
given an audio recording and raw sheet music images. Whereas previous works focus on …