Foundation Models for Music: A Survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Polyscriber: Integrated fine-tuning of extractor and lyrics transcriber for polyphonic music

X Gao, C Gupta, H Li - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Lyrics transcription of polyphonic music is challenging as the background music affects lyrics
intelligibility. Typically, lyrics transcription can be performed by a two-step pipeline, ie a …

Self-transriber: Few-shot lyrics transcription with self-training

X Gao, X Yue, H Li - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
The current lyrics transcription approaches heavily rely on supervised learning with labeled
data, but such data are scarce and manual labeling of singing is expensive. How to benefit …

Adapting pretrained speech model for mandarin lyrics transcription and alignment

JY Wang, CI Leong, YC Lin, L Su… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
The tasks of automatic lyrics transcription and lyrics alignment have witnessed significant
performance improvements in the past few years. However, most of the previous works only …

MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics and Audio

JY Wang, CC Wang, CI Leong… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
We introduce MIR-MLPop, a publicly available multilingual pop music dataset designed for
automatic lyrics transcription and lyrics alignment in polyphonic music. The dataset …

Improving Real-Time Music Accompaniment Separation with MMDenseNet

CH Wang, CC Wang, JY Wang, JSR Jang… - arXiv preprint arXiv …, 2024 - arxiv.org
Music source separation aims to separate polyphonic music into different types of sources.
Most existing methods focus on enhancing the quality of separated results by using a larger …

Automatic lyrics transcription of polyphonic music

X Gao - 2022 - search.proquest.com
Abstract Automatic Lyrics Transcription of polyphonic music (ALTP) aims to recognize the
sung lyrics from singing vocals in the presence of instrumental music accompaniment, and it …