Harmonizing minds and machines: survey on transformative power of machine learning in music

J Liang - Frontiers in Neurorobotics, 2023 - frontiersin.org
This survey explores the symbiotic relationship between Machine Learning (ML) and music,
focusing on the transformative role of Artificial Intelligence (AI) in the musical sphere …

Feeding intensity assessment of aquaculture fish using Mel Spectrogram and deep learning algorithms

Z Du, M Cui, Q Wang, X Liu, X Xu, Z Bai, C Sun… - Aquacultural …, 2023 - Elsevier
Accurately and objectively analyzing fish feeding intensity is essential to guiding feeding
and production techniques. Fish feeding intensity in recirculating aquaculture systems (RAS) …

Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks

Y Yang, H Pan, QY Jiang, Y Xu, J Tang - arXiv preprint arXiv:2404.08347, 2024 - arxiv.org
Multi-modal learning aims to enhance performance by unifying models from various
modalities but often faces the" modality imbalance" problem in real data, leading to a bias …

[HTML][HTML] Combining piano performance dimensions for score difficulty classification

P Ramoneda, D Jeong, V Eremenko, NC Tamer… - Expert Systems with …, 2024 - Elsevier
Predicting the difficulty of playing a musical score is essential for structuring and exploring
score collections. Despite its importance for music education, the automatic difficulty …

[PDF][PDF] Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works

JC Martínez-Sevilla, M Alfaro-Contreras… - Proceedings of the …, 2023 - isca-archive.org
Neural end-to-end Audio-to-Score (A2S) transcription aims to retrieve a score that encodes
the music content of an audio recording in a single step. Due to the recentness of this …

Multimodal Classification via Modal-Aware Interactive Enhancement

QY Jiang, Z Chi, Y Yang - arXiv preprint arXiv:2407.04587, 2024 - arxiv.org
Due to the notorious modality imbalance problem, multimodal learning (MML) leads to the
phenomenon of optimization imbalance, thus struggling to achieve satisfactory performance …

A Two-Stage Audio-Visual Fusion Piano Transcription Model Based on the Attention Mechanism

Y Li, X Wang, R Wu, W Xu… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Piano transcription is a significant problem in the field of music information retrieval, aiming
to obtain symbolic representations of music from captured audio or visual signals. Previous …

Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset

P Ramoneda, M Lee, D Jeong, JJ Valero-Mas… - arXiv preprint arXiv …, 2024 - arxiv.org
Automatically estimating the performance difficulty of a music piece represents a key
process in music education to create tailored curricula according to the individual needs of …

Trustworthy Enhanced Multi-view Multi-modal Alzheimer's Disease Prediction with Brain-wide Imaging Transcriptomics Data

S Cong, Z Fan, H Liu, Y Zhang, X Wang, H Luo… - arXiv preprint arXiv …, 2024 - arxiv.org
Brain transcriptomics provides insights into the molecular mechanisms by which the brain
coordinates its functions and processes. However, existing multimodal methods for …

A Novel Intelligent Assessment Based on Audio-Visual Data for Chinese Zither Fingerings

W Zhao, S Wang, Y Zhao, J Wei, T Li - International Conference on Image …, 2023 - Springer
In this paper, we make a novel study on the intelligent assessment for Chinese zither
(Zheng) fingerings in the cross field of art AI. Due to the gaps between science and art, there …