Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion

T Butko, C Nadeu - EURASIP Journal on Audio, Speech, and Music …, 2011 - Springer
Recently, audio segmentation has attracted research interest because of its usefulness in
several applications like audio indexing and retrieval, subtitling, monitoring of acoustic …

Open Broadcast Media Audio from TV: A Dataset of TV Broadcast Audio with Relative Music Loudness Annotations

B Meléndez-Catalán, E Molina… - Trans. Int. Soc. Music …, 2019 - emilio-molina.github.io
Open Broadcast Media Audio from TV (OpenBMAT) is an open, annotated dataset for the
task of music detection that contains over 27 hours of TV broadcast audio from 4 countries …

Empirical comparison of fast clustering algorithms for large data sets

CP Wei, YH Lee, CM Hsu - Proceedings of the 33rd annual …, 2000 - ieeexplore.ieee.org
Several fast algorithms for clustering very large data sets have been proposed in the
literature. CLARA is a combination of a sampling procedure and the classical PAM …

Study and application of acoustic information for the detection of harmful content and fusion with visual information

T Giannakopoulos - 2009 - cgi.di.uoa.gr
This thesis aims at investigating and developing techniques for content-based segmentation
and classification of multimedia files, based on audio information. Emphasis has been given …

Clean vs. overlapped speech-music detection using harmonic-percussive features and multi-task learning

M Bhattacharjee, SRM Prasanna… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
Detection of speech and music signals in isolated and overlapped conditions is an essential
preprocessing step for many audio applications. Speech signals have wavy and continuous …

3MAS: a multitask, multilabel, multidataset semi-supervised audio segmentation model

M Lebourdais, P Gimeno, T Mariotte, M Tahon… - Speaker and Language …, 2024 - hal.science
When processing audio data, multiple challenges arise, one of them being the diversity of
information present in the audio signal. Various audio segmentation subtasks appeared …

Unsupervised feature learning for speech and music detection in radio broadcasts

J Schlüter, R Sonnleitner - Proceedings of the 15th International Conference …, 2012 - dafx.de
Detecting speech and music is an elementary step in extracting information from radio
broadcasts. Existing solutions either rely on general-purpose audio features, or build on …

Partial AUC Optimisation Using Recurrent Neural Networks for Music Detection with Limited Training Data

P Gimeno, V Mingote, AO Giménez, A Miguel… - …, 2020 - researchgate.net
State-of-the-art music detection systems, whose aim is to distinguish whether or not music is
present in an audio signal, rely mainly on deep learning approaches. However, these kinds of …

Deep Neural Networks-based Classification Methodologies of Speech, Audio and Music, and its Integration for Audio Metadata Tagging

H Park, Y Chung, JH Kim - Journal of Web Engineering, 2023 - ieeexplore.ieee.org
Videos contain visual and auditory information. Visual information in a video can include
images of people, objects, and the landscape, whereas auditory information includes voices …

Relative music loudness estimation using temporal convolutional networks and a CNN feature extraction front-end

B Meléndez-Catalán, BL SL, E Molina… - Proceedings of the …, 2020 - dafx2020.mdw.ac.at
Relative music loudness estimation is a MIR task that consists in dividing audio in segments
of three classes: Foreground Music, Background Music and No Music. Given the temporal …