Mel frequency cepstral coefficient and its applications: A review

ZK Abdul, AK Al-Talabani - IEEE Access, 2022 - ieeexplore.ieee.org
Feature extraction and representation has significant impact on the performance of any
machine learning method. Mel Frequency Cepstrum Coefficient (MFCC) is designed to …

[HTML][HTML] An ongoing review of speech emotion recognition

J de Lope, M Graña - Neurocomputing, 2023 - Elsevier
User emotional status recognition is becoming a key feature in advanced Human Computer
Interfaces (HCI). A key source of emotional information is the spoken expression, which may …

Classifying heart sounds using images of motifs, MFCC and temporal features

DM Nogueira, CA Ferreira, EF Gomes… - Journal of medical …, 2019 - Springer
Cardiovascular disease is the leading cause of death in the world, and its early detection is
a key to improving long-term health outcomes. The auscultation of the heart is still an …

Egyptian Arabic speech emotion recognition using prosodic, spectral and wavelet features

L Abdel-Hamid - Speech Communication, 2020 - Elsevier
Speech emotion recognition (SER) has recently been receiving increased interest due to the
rapid advancements in affective computing and human computer interaction. English …

End-to-end video-to-speech synthesis using generative adversarial networks

R Mira, K Vougioukas, P Ma, S Petridis… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Video-to-speech is the process of reconstructing the audio speech from a video of a spoken
utterance. Previous approaches to this task have relied on a two-step process where an …

Emotion detection using MFCC and cepstrum features

S Lalitha, D Geyasruti, R Narayanan… - Procedia Computer …, 2015 - Elsevier
A tremendous research is being done on Speech Emotion Recognition (SER) in the recent
years with its main motto to improve human machine interaction. In this work, the effect of …

A new approach of audio emotion recognition

CS Ooi, KP Seng, LM Ang, LW Chew - Expert systems with applications, 2014 - Elsevier
A new architecture of intelligent audio emotion recognition is proposed in this paper. It fully
utilizes both prosodic and spectral features in its design. It has two main paths in parallel …

Speech emotion recognition research: an analysis of research focus

MB Mustafa, MAM Yusoof, ZM Don… - International Journal of …, 2018 - Springer
This article analyses research in speech emotion recognition (“SER”) from 2006 to 2017 in
order to identify the current focus of research, and areas in which research is lacking. The …

M2c: Concise music representation for 3d dance generation

M Marchellus, IK Park - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Generating 3D dance motions that are synchronized with music is a difficult task, as it
involves modeling the complex interplay between musical rhythms and human body …

Stressed speech emotion recognition using feature fusion of teager energy operator and MFCC

SR Bandela, TK Kumar - 2017 8th International Conference on …, 2017 - ieeexplore.ieee.org
In this paper, a novel feature fusion of Teager Energy Operator (TEO) and Mel Frequency
Cepstral Coefficients (MFCC), as Teager-MFCC (T-MFCC) feature extraction technique, is …