Explainable and interpretable multimodal large language models: A comprehensive survey

Y Dang, K Huang, J Huo, Y Yan, S Huang, D Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid development of Artificial Intelligence (AI) has revolutionized numerous fields, with
large language models (LLMs) and computer vision (CV) systems driving advancements in …

Musiclime: Explainable multimodal music understanding

T Sotirou, V Lyberatos, OM Mastromichalakis… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal models are critical for music understanding tasks, as they capture the complex
interplay between audio and lyrics. However, as these models become more prevalent, the …

CHORDONOMICON: A Dataset of 666,000 Songs and their Chord Progressions

S Kantarelis, K Thomas, V Lyberatos… - arXiv preprint arXiv …, 2024 - arxiv.org
Chord progressions encapsulate important information about music, pertaining to its
structure and conveyed emotions. They serve as the backbone of musical composition, and …

Crowdsourcing as a Pedagogical Tool in Computer Science Higher Education: a Case Study

V Lyberatos, S Kantarelis, E Kaldeli, S Bekiaris… - Human …, 2024 - hcjournal.org
The approach used and the insights gained from employing crowdsourcing techniques in a
computer science homework assignment for higher education students are described in this …

Generative Music: Seq2Seq Models for polyphonic enrichment

PE Pavlaki - 2024 - dspace.lib.ntua.gr
Abstract: Music generation aims at producing musical pieces that
combine a pleasant and harmonious result with a complex and rich …