S Takamichi, Y Saito, N Takamune… - … on Acoustic Signal …, 2018 - ieeexplore.ieee.org
This paper presents a deep neural network (DNN)-based phase reconstruction from amplitude spectrograms. In audio signal and speech processing, the amplitude spectrogram …
S Takaki, T Nakashika, X Wang… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
This paper proposes a new loss using short-time Fourier transform (STFT) spectra for the aim of training a high-performance neural speech waveform model that predicts raw …
In this paper, we address the problem of reconstructing a time-domain signal (or a phase spectrogram) solely from a magnitude spectrogram. Since magnitude spectrograms do not …
Z Ouyang, H Yu, WP Zhu… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
In this paper we propose a fully convolutional neural network (CNN) for complex spectrogram processing in speech enhancement. The proposed CNN consists of one …
T Kaneko, S Takaki, H Kameoka… - Interspeech 2017, 2017 - research.ed.ac.uk
We propose a learning-based postfilter to reconstruct the high-fidelity spectral texture in short-term Fourier transform (STFT) spectrograms. In speech-processing systems, such as …
Recently, several papers have demonstrated that neural networks (NN) are able to perform the feature extraction as part of the acoustic model. Motivated by the Gammatone feature …
This paper reports our recent exploration of the layer-by-layer learning strategy for training a multi-layer generative model of patches of speech spectrograms. The top layer of the …
In this paper, we investigate multi-stream acoustic modelling using the raw real and imaginary parts of the Fourier transform of speech signals. Using the raw magnitude …
Magnitude spectrum-based features are the most widely employed front-ends for acoustic modelling in automatic speech recognition (ASR) systems. In this paper, we investigate the …