Contrastive audio-language learning for music

I Manco, E Benetos, E Quinton, G Fazekas - arXiv preprint arXiv …, 2022 - arxiv.org
As one of the most intuitive interfaces known to humans, natural language has the potential
to mediate many tasks that involve human-computer interaction, especially in application …

[PDF][PDF] Intelligent User Interfaces for Music Discovery.

P Knees, M Schedl, M Goto - Trans. Int. Soc. Music. Inf. Retr., 2020 - staff.aist.go.jp
With its origins in Information Retrieval research, a fundamental goal of Music Information
Retrieval (MIR) as a dedicated research field in the year 2000 was to develop technology to …

Deep learning approaches in topics of singing information processing

C Gupta, H Li, M Goto - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
Singing, the vocal productionof musical tones, is one of the most important elements of
music. Addressing the needs of real-world applications, the study of technologies related to …

Generating albums with samplernn to imitate metal, rock, and punk bands

CJ Carr, Z Zukowski - arXiv preprint arXiv:1811.06633, 2018 - arxiv.org
This early example of neural synthesis is a proof-of-concept for how machine learning can
drive new types of music software. Creating music can be as simple as specifying a set of …

[PDF][PDF] Lyrics information processing: Analysis, generation, and applications

K Watanabe, M Goto - Proceedings of the 1st Workshop on NLP …, 2020 - aclanthology.org
In this paper we propose lyrics information processing (LIP) as a research field for
technologies focusing on lyrics text, which has both linguistic and musical characteristics …

[PDF][PDF] Query-by-Blending: A Music Exploration System Blending Latent Vector Representations of Lyric Word, Song Audio, and Artist.

K Watanabe, M Goto - ISMIR, 2019 - archives.ismir.net
This paper presents Query-by-Blending, a novel music exploration system that enables
users to find unfamiliar music content by flexibly combining three musical aspects: lyric word …

Tag propagation and cost-sensitive learning for music auto-tagging

YH Lin, HH Chen - IEEE Transactions on Multimedia, 2020 - ieeexplore.ieee.org
The performance of music auto-tagging depends on the quality of training data. In practice,
the links between songs and tags in the manually labeled training data can be incorrect …

[PDF][PDF] A Chorus-Section Detection Method for Lyrics Text.

K Watanabe, M Goto - ISMIR, 2020 - staff.aist.go.jp
This paper addresses the novel task of detecting chorus sections in English and Japanese
lyrics text. Although chorus-section detection using audio signals has been studied, whether …

[PDF][PDF] Intelligent User Interfaces for Music Discovery: The Past 20 Years and What's to Come.

P Knees, M Schedl, M Goto - ISMIR, 2019 - archives.ismir.net
Providing means to assist the user in finding music is one of the original motivations
underlying the research field known as Music Information Retrieval (MIR). Therefore, already …

[PDF][PDF] Unveiling the Impact of Musical Factors in Judging a Song on First Listen: Insights From a User Survey.

K Tsukuda, T Nakano, M Hamasaki, M Goto - ISMIR, 2023 - archives.ismir.net
When a user listens to a song for the first time, what musical factors (eg, melody, tempo, and
lyrics) influence the user's decision to like or dislike the song? An answer to this question …