Large-scale multimodal piano music identification using marketplace fingerprinting

D Yang, A Goutam, K Ji, TJ Tsai - Algorithms, 2022 - mdpi.com
This paper studies the problem of identifying piano music in various modalities using a
single, unified approach called marketplace fingerprinting. The key defining characteristic of …

FlexDTW: Dynamic Time Warping With Flexible Boundary Conditions

I Bukey, J Zhang, TJ Tsai - ISMIR, 2023 - archives.ismir.net
Alignment algorithms like DTW and subsequence DTW assume specific boundary
conditions on where an alignment path can begin and end in the cost matrix. In practice, the …

Composer Classification With Cross-Modal Transfer Learning and Musically-Informed Augmentation

D Yang, T Tsai - ISMIR, 2021 - archives.ismir.net
This paper studies composer style classification of piano sheet music, MIDI, and audio data.
We expand upon previous work in three ways. First, we explore several musically motivated …

Structure-aware audio-to-score alignment using progressively dilated convolutional neural networks

R Agrawal, D Wolff, S Dixon - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
The identification of structural differences between a music performance and the score is a
challenging yet integral step of audio-to-score alignment, an important subtask of music …

Real-Time Music Following in Score Sheet Images via Multi-Resolution Prediction

F Henkel, G Widmer - Frontiers in Computer Science, 2021 - frontiersin.org
The task of real-time alignment between a music performance and the corresponding score
(sheet music), also known as score following, poses a challenging multi-modal machine …

Just Label the Repeats for In-The-Wild Audio-to-Score Alignment

I Bukey, M Feffer, C Donahue - arXiv preprint arXiv:2411.07428, 2024 - arxiv.org
We propose an efficient workflow for high-quality offline alignment of in-the-wild
performance audio and corresponding sheet music scans (images). Recent work on audio …

Piano sheet music identification using dynamic N-gram fingerprinting

D Yang, TJ Tsai - Trans. Int. Soc. Music. Inf. Retr., 2021 - pdfs.semanticscholar.org
This article introduces a method for large-scale retrieval of piano sheet music images. We
study this problem in two different scenarios: camera-based sheet music identification and …

Segmental DTW: A parallelizable alternative to dynamic time warping

TJ Tsai - ICASSP 2021-2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org
In this work we explore parallelizable alternatives to DTW for globally aligning two feature
sequences. One of the main practical limitations of DTW is its quadratic computation and …

Improving the Robustness of DTW to Global Time Warping Conditions in Audio Synchronization

J Kraprayoon, A Pham, TJ Tsai - Applied Sciences, 2024 - mdpi.com
Dynamic time warping estimates the alignment between two sequences and is designed to
handle a variable amount of time warping. In many contexts, it performs poorly when …

A deeper look at sheet music composer classification using self-supervised pretraining

D Yang, K Ji, TJ Tsai - Applied Sciences, 2021 - mdpi.com
This article studies a composer style classification task based on raw sheet music images.
While previous works on composer recognition have relied exclusively on supervised …