Text-to-speech synthesis using an autoencoder

V Pollet, E Zovato - US Patent 11,069,335, 2021 - Google Patents

Aspects of the disclosure are related to synthesizing speech or other audio based on input
data. Additionally, aspects of the disclosure are related to using one or more recurrent …

被引用次数：31 相关文章所有 4 个版本

[PDF] googleapis.com

Methods and systems for intent detection and slot filling in spoken dialogue systems

P Angkititrakul, R Schumann - US Patent 10,431,207, 2019 - Google Patents

A method for spoken language understanding (SLU) includes generating a first encoded
representation of words from a user based on an output of a recurrent neural network (RNN) …

被引用次数：25 相关文章所有 4 个版本

[PDF] googleapis.com

Text-to-speech processing using input voice characteristic data

RB Chicote, V Aggarwal, AP Breen… - US Patent …, 2022 - Google Patents

During text-to-speech processing, a speech model creates synthesized speech that
corresponds to input data. The speech model may include an encoder for encoding the input …

被引用次数：17 相关文章所有 4 个版本

[PDF] googleapis.com

Text-to-speech (TTS) processing with transfer of vocal characteristics

V Klimkov, TR Drugman, A Galkin… - US Patent 11,410,684, 2022 - Google Patents

Audio data from a first, source speaker is received and processed to determine linguistic
units and vocal characteristics corresponding to those linguistic units. The linguistic units …

被引用次数：17 相关文章所有 2 个版本

[PDF] googleapis.com

Voice synthesis method, model training method, device and computer device

WU Xixin, M Wang, S Kang, D Su, D Yu - US Patent 12,014,720, 2024 - Google Patents

This application relates to a speech synthesis method and apparatus, a model training
method and apparatus, and a computer device. The method includes: obtaining to-be …

被引用次数：6 相关文章所有 4 个版本

Multilingual speech synthesis and cross-language voice cloning

Y Zhang, RJ Weiss, B Chun, Y Wu, Z Chen… - US Patent …, 2023 - Google Patents

2020-04-29 Assigned to GOOGLE LLC reassignment GOOGLE LLC ASSIGNMENT OF
ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: CHEN, ZHIFENG …

被引用次数：15 相关文章所有 4 个版本

[PDF] googleapis.com

Generating expressive speech audio from text data

S Gururani, K Gupta, D Shah, Z Shakeri… - US Patent …, 2022 - Google Patents

(57) ABSTRACT A system for use in video game development to generate expressive
speech audio comprises a user interface config ured to receive user-input text data and a …

被引用次数：12 相关文章所有 4 个版本

[PDF] googleapis.com

Matching mouth shape and movement in digital video to alternative audio

TD Stratton, S Lile - US Patent 11,436,780, 2022 - Google Patents

A method for matching mouth shape and movement in digital video to alternative audio
includes deriving a sequence of facial poses including mouth shapes for an actor from a …

被引用次数：10 相关文章所有 4 个版本

[PDF] googleapis.com

Systems and methods for neural voice cloning with a few samples

C Jitong, P Kainan, P Wei, Z Yanqi - US Patent 11,238,843, 2022 - Google Patents

Voice cloning is a highly desired capability for personalized speech interfaces. Neural
network-based speech synthesis has been shown to generate high quality speech for a …

被引用次数：5 相关文章所有 4 个版本

[PDF] googleapis.com

Method and apparatus with text-to-speech conversion

H Lee - US Patent 11,138,963, 2021 - Google Patents

A processor-implemented text-to-speech method includes determining, using a sub-
encoder, a first feature vector indicating an utterance characteristic of a speaker from feature …

被引用次数：8 相关文章所有 4 个版本

高级搜索

QQ 群