related:PtpPT-0pYtoJ:scholar.google.com/

A deep generative model of speech complex spectrograms

AA Nugraha, K Sekiguchi… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org

This paper proposes an approach to the joint modeling of the short-time Fourier transform
magnitude and phase spectrograms with a deep generative model. We assume that the …

被引用次数：15 相关文章所有 8 个版本

[PDF] arxiv.org

Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network

S Takamichi, Y Saito, N Takamune… - … on Acoustic Signal …, 2018 - ieeexplore.ieee.org

This paper presents a deep neural network (DNN)-based phase reconstruction from
amplitude spectrograms. In audio signal and speech processing, the amplitude spectrogram …

被引用次数：47 相关文章所有 8 个版本

[PDF] arxiv.org

STFT spectral loss for training a neural speech waveform model

S Takaki, T Nakashika, X Wang… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

This paper proposes a new loss using short-time Fourier transform (STFT) spectra for the
aim of training a high-performance neural speech waveform model that predicts raw …

被引用次数：19 相关文章所有 5 个版本

[PDF] arxiv.org

Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram

K Oyamada, H Kameoka, T Kaneko… - 2018 26th European …, 2018 - ieeexplore.ieee.org

In this paper, we address the problem of reconstructing a time-domain signal (or a phase
spectrogram) solely from a magnitude spectrogram. Since magnitude spectrograms do not …

被引用次数：42 相关文章所有 10 个版本

[PDF] mcgill.ca

A fully convolutional neural network for complex spectrogram processing in speech enhancement

Z Ouyang, H Yu, WP Zhu… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

In this paper we propose a fully convolutional neural network (CNN) for complex
spectrogram processing in speech enhancement. The proposed CNN consists of one …

被引用次数：61 相关文章所有 3 个版本

[PDF] ed.ac.uk

Generative adversarial network-based postfilter for STFT spectrograms

T Kaneko, S Takaki, H Kameoka… - Interspeech 2017, 2017 - research.ed.ac.uk

We propose a learning-based postfilter to reconstruct the high-fidelity spectral texture in
short-term Fourier transform (STFT) spectrograms. In speech-processing systems, such as …

被引用次数：78 相关文章所有 7 个版本

[PDF] rwth-aachen.de

Acoustic modeling of speech waveform based on multi-resolution, neural network signal processing

Z Tüske, R Schlüter, H Ney - 2018 IEEE international …, 2018 - ieeexplore.ieee.org

Recently, several papers have demonstrated that neural networks (NN) are able to perform
the feature extraction as part of the acoustic model. Motivated by the Gammatone feature …

被引用次数：26 相关文章所有 7 个版本

[PDF] ucsd.edu

[PDF][PDF] Binary coding of speech spectrograms using a deep auto-encoder

L Deng, ML Seltzer, D Yu, A Acero… - … annual conference of …, 2010 - dub.ucsd.edu

This paper reports our recent exploration of the layer-by-layer learning strategy for training a
multi-layer generative model of patches of speech spectrograms. The top layer of the …

被引用次数：489 相关文章所有 17 个版本

[PDF] ed.ac.uk

Multi-stream acoustic modelling using raw real and imaginary parts of the Fourier transform

E Loweimi, Z Yue, P Bell, S Renals… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org

In this paper, we investigate multi-stream acoustic modelling using the raw real and
imaginary parts of the Fourier transform of speech signals. Using the raw magnitude …

被引用次数：6 相关文章所有 9 个版本

[PDF] ed.ac.uk

Speech acoustic modelling from raw phase spectrum

E Loweimi, Z Cvetkovic, P Bell… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Magnitude spectrum-based features are the most widely employed front-ends for acoustic
modelling in automatic speech recognition (ASR) systems. In this paper, we investigate the …

被引用次数：11 相关文章所有 7 个版本

高级搜索

QQ 群

A deep generative model of speech complex spectrograms

Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network

STFT spectral loss for training a neural speech waveform model

Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram

A fully convolutional neural network for complex spectrogram processing in speech enhancement

Generative adversarial network-based postfilter for STFT spectrograms

Acoustic modeling of speech waveform based on multi-resolution, neural network signal processing

[PDF][PDF] Binary coding of speech spectrograms using a deep auto-encoder

Multi-stream acoustic modelling using raw real and imaginary parts of the Fourier transform

Speech acoustic modelling from raw phase spectrum

相关搜索

引用