High-fidelity and pitch-controllable neural vocoder based on unified source-filter networks

文章

学术资源搜索

获得 4 条结果（用时0.03秒）

我的图书馆

High-fidelity and pitch-controllable neural vocoder based on unified source-filter networks

在引用文章中搜索

[PDF] arxiv.org

Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation

R Yoneyama, A Miyashita, R Yamamoto… - arXiv preprint arXiv …, 2024 - arxiv.org

Neural vocoders often struggle with aliasing in latent feature spaces, caused by time-domain
nonlinear operations and resampling layers. Aliasing folds high-frequency components into …

[PDF] jst.go.jp

Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances …

H Kawahara, M Morise - Acoustical Science and Technology, 2025 - jstage.jst.go.jp

We generalized a voice morphing algorithm capable of handling temporally variable,
multiple-attributes, and multiple instances. The generalized morphing provides a new …

被引用次数：1 相关文章所有 3 个版本

An Efficient Speech Synthesizer–A Hybrid Monotonic architecture for text-to-speech using VAE & LPC-net with independent sentence length

N NAVEENKUMAR - 2024 - researchsquare.com

In this research, it is suggested that a hybrid architecture for text-to-speech, which is named
it as Efficient Speech Synthesizer. ESS optimizes all the parameters through a consistent …

A Review of the Challenges of Adaptive Filtering Technology in High-fidelity Audio Signal Processing

Z Huang - Theoretical and Natural Science, 2024 - ewadirect.com

High-fidelity audio signal processing plays an important role in modern audio technology.
With the increasing demand for this technology in various audio application scenarios, the …

高级搜索

QQ 群

High-fidelity and pitch-controllable neural vocoder based on unified source-filter networks

Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation

Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances …

An Efficient Speech Synthesizer–A Hybrid Monotonic architecture for text-to-speech using VAE & LPC-net with independent sentence length

A Review of the Challenges of Adaptive Filtering Technology in High-fidelity Audio Signal Processing

引用