Fine-tuned XLSR-53 large model for speech recognition in Chinese

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

我的图书馆

Fine-tuned XLSR-53 large model for speech recognition in Chinese

在引用文章中搜索

[PDF] duke.edu

Integrating frame-level boundary detection and deepfake detection for locating manipulated regions in partially spoofed audio forgery attacks

Z Cai, M Li - Computer Speech & Language, 2024 - Elsevier

Partially fake audio, a variant of deep fake that involves manipulating audio utterances
through the incorporation of fake or externally-sourced bona fide audio clips, constitutes a …

被引用次数：8 相关文章所有 5 个版本

[PDF] arxiv.org

PMMTalk Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features

T Han, S Gui, Y Huang, B Li, L Liu… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

Speech-driven 3D facial animation has improved a lot recently while most related works only
utilize acoustic modality and neglect the influence of visual and textual cues, leading to …

被引用次数：1 相关文章所有 2 个版本

Incorporating Speaker's Speech Rate Features for Improved Voice Cloning

Q Zhe, I Katunobu - 2023 9th International Conference on …, 2023 - ieeexplore.ieee.org

We investigate a neural network-based text-to-speech (TTS) synthesis system that aims to
simulate the Mandarin voice of different speakers using short voice samples. Our system …

[PDF] duke.edu

Advancing Deep-Generated Speech and Defending against Its Misuse

Z Cai - 2023 - search.proquest.com

Deep learning has revolutionized speech generation, spanning synthesis areas such as text-
to-speech and voice conversion, leading to diverse advancements. On the one hand, when …

高级搜索

QQ 群

Fine-tuned XLSR-53 large model for speech recognition in Chinese

Integrating frame-level boundary detection and deepfake detection for locating manipulated regions in partially spoofed audio forgery attacks

PMMTalk Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features

Incorporating Speaker's Speech Rate Features for Improved Voice Cloning

Advancing Deep-Generated Speech and Defending against Its Misuse

引用