Audio signals represent a wide diversity of acoustic events, from background environmental noise to spoken communication. Machine learning models such as neural networks have …
Y Zhang, B Pardo, Z Duan - IEEE/ACM Transactions on Audio …, 2018 - ieeexplore.ieee.org
Conventional methods for finding audio in databases typically search text labels, rather than the audio itself. This can be problematic as labels may be missing, irrelevant to the audio …
This report presents our audio event detection system submitted for Task 2," Detection of rare sound events", of DCASE 2017 challenge. The proposed system is based on …
Audio-Text retrieval takes a natural language query to retrieve relevant audio files in a database. Conversely, Text-Audio retrieval takes an audio file as a query to retrieve relevant …
CH Shen, JY Sung, HY Lee - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Audio Word2Vec offers vector representations of fixed dimensionality for variable-length audio segments using Sequence to-sequence Autoencoder (SA). These vector …
Despite recent progress in text-to-audio (TTA) generation, we show that the state-of-the-art models, such as AudioLDM, trained on datasets with an imbalanced class distribution, such …
MS Hussain, MA Haque - arXiv preprint arXiv:1812.00149, 2018 - arxiv.org
Speech, Music and Noise classification/segmentation is an important preprocessing step for audio processing/indexing. To this end, we propose a novel 1D Convolutional Neural …
Information retrieval from brain responses to auditory and visual stimuli has shown success through classification of song names and image classes presented to participants while …
Deep Learning has become state of the art in visual computing and continuously emerges into the Music Information Retrieval (MIR) and audio retrieval domain. In order to bring …