Content-based and knowledge-enriched representations for classification across modalities: a survey

N Pittaras, G Giannakopoulos, P Stamatopoulos… - ACM Computing …, 2023 - dl.acm.org
This survey documents representation approaches for classification across different
modalities, from purely content-based methods to techniques utilizing external sources of …

An attention-based approach to hierarchical multi-label music instrument classification

Z Zhong, M Hirano, K Shimada… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Although music is typically multi-label, many works have studied hierarchical music tagging
with simplified settings such as single-label data. Moreover, there lacks a framework to …

Hierarchical classification for instrument activity detection in orchestral music recordings

M Krause, M Müller - IEEE/ACM Transactions on Audio, Speech …, 2023 - ieeexplore.ieee.org
Instrument activity detection is a fundamental task in music information retrieval, serving as a
basis for many applications, such as music recommendation, music tagging, or remixing …

Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods

Q Tang - arXiv preprint arXiv:2308.00129, 2023 - arxiv.org
This thesis focuses on representation learning for sequence data over time or space, aiming
to improve downstream sequence prediction tasks by using the learned representations …

Learning Ontology Informed Representations with Constraints for Acoustic Event Detection

A Raina, SI Sheikh, V Arora - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Acoustic Event Detection (AED) has been of great interest for nearly a decade for diverse
applications. Most open datasets contain meta information on the hierarchy of labels, which …

[PDF][PDF] Activity Detection for Sound Events in Orchestral Music Recordings

M Krause - 2023 - opus4.kobv.de
Composers of music can express emotions and communicate with their audience in a
multitude of ways. They decide on which voices or instruments to use, arrange notes into …