Speech/music classification using features from spectral peaks

M Bhattacharjee, SRM Prasanna… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Spectrograms of speech and music contain distinct striation patterns. Traditional features
represent various properties of the audio signal but do not necessarily capture such …

Making accurate formant measurements: An empirical investigation of the influence of the measurement tool, analysis settings and speaker on formant measurements

P Harrison - 2013 - etheses.whiterose.ac.uk
The aim of this thesis is to provide guidance and information that will assist forensic speech
scientists, and phoneticians generally, in making more accurate formant measurements …

A hierarchical framework for spectro-temporal feature extraction

M Heckmann, X Domont, F Joublin, C Goerick - Speech Communication, 2011 - Elsevier
In this paper we present a hierarchical framework for the extraction of spectro-temporal
acoustic features. The design of the features targets higher robustness in dynamic …

A mixture model approach for formant tracking and the robustness of student's-t distribution

H Sundar, CS Seelamantula… - IEEE transactions on …, 2012 - ieeexplore.ieee.org
We address the problem of robust formant tracking in continuous speech in the presence of
additive noise. We propose a new approach based on mixture modeling of the formant …

Acoustic features and modelling

F Eyben, F Eyben - Real-time Speech and Music Classification by Large …, 2016 - Springer
This chapter gives an overview of the methods for speech and music analysis implemented
by the author in the openSMILE toolkit. The methods described, include all the relevant …

Parametric model of spectral envelope to synthesize realistic intensity variations in singing voice

E Molina, I Barbancho, AM Barbancho… - … on Acoustics, Speech …, 2014 - ieeexplore.ieee.org
In this paper, we propose a method to synthesize the natural variations of spectral envelope
as intensity varies in singing voice. To this end, we propose a parametric model of spectral …

Spectral envelope transformation in singing voice for advanced pitch shifting

JL Santacruz, LJ Tardón, I Barbancho, AM Barbancho - Applied Sciences, 2016 - mdpi.com
The aim of the present work is to perform a step towards more natural pitch shifting
techniques in singing voice for its application in music production and entertainment …

Firing rate homeostasis for dynamic neural field formation

C Glaser, F Joublin - IEEE Transactions on Autonomous Mental …, 2011 - ieeexplore.ieee.org
Dynamic neural fields are recurrent neural networks which aim at modeling cortical activity
evolution both in space and time. A self-organized formation of these fields has been rarely …

Spectral histogram of oriented gradients (SHOGs) for Tamil language male/female speaker classification

A Muthamizh Selvan, R Rajesh - International Journal of Speech …, 2012 - Springer
Abstract Gender (Male/Female) classification plays a primary vital role to develop a robust
Automatic Tamil Speech Recognition (ASR) applications due to the diversity in the vocal …

[PDF][PDF] Robust formant detection using group delay function and stabilized weighted linear prediction.

DN Gowda, J Pohjalainen, M Kurimo, P Alku - INTERSPEECH, 2013 - isca-archive.org
In this paper, we propose a robust spectral representation for detecting formants in heavily
degraded conditions. The method combines the temporal robustness of the stabilized …