Single channel phase-aware signal processing in speech communication: theory and practice

Efficiently trainable text-to-speech system based on deep convolutional networks with guided attention

H Tachibana, K Uenoyama… - 2018 IEEE international …, 2018 - ieeexplore.ieee.org

This paper describes a novel text-to-speech (TTS) technique based on deep convolutional
neural networks (CNN), without use of any recurrent units. Recurrent neural networks (RNN) …

被引用次数：392 相关文章所有 6 个版本

[PDF] grosswindhager.com

SALMA: UWB-based single-anchor localization system using multipath assistance

B Großwindhager, M Rath, J Kulmer, MS Bakr… - Proceedings of the 16th …, 2018 - dl.acm.org

Setting up indoor localization systems is often excessively time-consuming and labor-
intensive, because of the high amount of anchors to be carefully deployed or the …

被引用次数：92 相关文章所有 10 个版本

[PDF] ieee.org

A noniterative method for reconstruction of phase from STFT magnitude

Z Průša, P Balazs… - IEEE/ACM Transactions on …, 2017 - ieeexplore.ieee.org

A noniterative method for the reconstruction of the short-time fourier transform (STFT) phase
from the magnitude is presented. The method is based on the direct relationship between …

被引用次数：107 相关文章所有 7 个版本

[PDF] ieee.org

Griffin–Lim like phase recovery via alternating direction method of multipliers

Y Masuyama, K Yatabe… - IEEE Signal Processing …, 2018 - ieeexplore.ieee.org

Recovering a signal from its amplitude spectrogram, or phase recovery, exhibits many
applications in acoustic signal processing. When only an amplitude spectrogram is available …

被引用次数：52 相关文章所有 9 个版本

[PDF] openreview.net

[PDF][PDF] Neural Homomorphic Vocoder.

Z Liu, K Chen, K Yu - Interspeech, 2020 - openreview.net

In this paper, we propose the neural homomorphic vocoder (NHV), a source-filter model
based neural vocoder framework. NHV synthesizes speech by filtering impulse trains and …

被引用次数：30 相关文章所有 7 个版本

[PDF] academia.edu

On DCT-based MMSE estimation of short time spectral amplitude for single-channel speech enhancement

S Shi, K Paliwal, A Busch - Applied Acoustics, 2023 - Elsevier

Abstract This paper proposes Discrete Cosine Transform (DCT) based speech
enhancement algorithms. These algorithms utilize minimum mean square error (MMSE) …

被引用次数：10 相关文章所有 2 个版本

Analytic phase features for dysarthric speech detection and intelligibility assessment

K Gurugubelli, AK Vuppala - Speech Communication, 2020 - Elsevier

The objectives of the dysarthria assessment are to discriminate dysarthric speech from
normal speech, to estimate the severity of dysarthria in terms of the dysarthric speech …

被引用次数：24 相关文章

Acoustic application of phase reconstruction algorithms in optics

T Kobayashi, T Tanaka, K Yatabe… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Phase reconstruction from amplitude spectrograms has attracted attention in recent
acoustics because of its potential applications in speech synthesis and enhancement. The …

被引用次数：10 相关文章所有 3 个版本

[HTML] nih.gov

Impact of phase estimation on single-channel speech separation based on time-frequency masking

F Mayer, DS Williamson, P Mowlaee… - The Journal of the …, 2017 - pubs.aip.org

Time-frequency masking is a common solution for the single-channel source separation
(SCSS) problem where the goal is to find a time-frequency mask that separates the …

被引用次数：38 相关文章所有 12 个版本

[PDF] isca-archive.org

[PDF][PDF] Funnel Deep Complex U-Net for Phase-Aware Speech Enhancement.

Y Sun, L Yang, H Zhu, J Hao - Interspeech, 2021 - isca-archive.org

The emergence of deep neural networks has made speech enhancement well developed.
Most of the early models focused on estimating the magnitude of spectrum while ignoring …

被引用次数：15 相关文章所有 5 个版本

高级搜索

QQ 群