Authors
Renato Panda, Ricardo Malheiro, Bruno Rocha, António Oliveira, Rui Pedro Paiva
Publication date
2013/10/14
Conference
10th International Symposium on Computer Music Multidisciplinary Research – CMMR’2013
Publisher
http://rppaiva.dei.uc.pt/research.html
Description
We propose a multi-modal approach to the music emotion recognition (MER) problem, combining information from three distinct sources: audio, MIDI and lyrics. We introduce a methodology for the automatic creation of a multi-modal music emotion dataset from the AllMusic database, based on the emotion tags used in the MIREX Mood Classification Task. MIDI files and lyrics corresponding to a subset of the obtained audio samples were then gathered. The dataset was organized into the same five emotion clusters defined in MIREX. From the audio data, 177 standard features and 98 melodic features were extracted; 320 features were collected from MIDI and 26 from the lyrics. We experimented with several supervised learning and feature selection strategies to evaluate the proposed multi-modal approach. Employing only standard audio features, the best attained performance was 44.3% (F-measure). With the multi-modal approach, results improved to 61.1%, using only 19 multi-modal features. Melodic audio features were particularly important to this improvement.
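As a rough illustration of the setup described above, the sketch below concatenates one song's per-modality feature vectors and scores predictions over five clusters with an F-measure. The feature counts come from the description; everything else is an assumption made for illustration: the description does not say whether fusion happens at the feature level or how the F-measure is averaged, so simple concatenation and a macro (unweighted) average are used here, and the names `FEATURE_COUNTS`, `fuse_features` and `macro_f_measure` are hypothetical.

```python
from typing import Dict, List, Sequence

# Feature counts per modality, as reported in the description.
# The dictionary keys are hypothetical labels, not names from the paper.
FEATURE_COUNTS = {
    "audio_standard": 177,
    "audio_melodic": 98,
    "midi": 320,
    "lyrics": 26,
}


def fuse_features(sample: Dict[str, Sequence[float]]) -> List[float]:
    """Concatenate one song's per-modality vectors in a fixed order.

    Assumption: feature-level (early) fusion by plain concatenation.
    """
    fused: List[float] = []
    for name, size in FEATURE_COUNTS.items():
        vector = sample[name]
        if len(vector) != size:
            raise ValueError(f"{name}: expected {size} features, got {len(vector)}")
        fused.extend(vector)
    return fused


def macro_f_measure(
    y_true: Sequence[int], y_pred: Sequence[int], n_classes: int = 5
) -> float:
    """Unweighted mean of per-class F1 over the emotion clusters.

    Assumption: macro averaging; a class with no true or predicted
    samples contributes an F1 of 0.
    """
    scores = []
    for c in range(n_classes):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / n_classes
```

Under these assumptions a fused vector has 177 + 98 + 320 + 26 = 621 entries per song, which is the pool a feature selection step would then reduce (to 19 features in the reported best result).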
Total citations
[Chart: citations per year, 2014 to 2024]
Scholar articles
RES Panda, R Malheiro, B Rocha, AP Oliveira… - 10th International symposium on computer music …, 2013