Heiga Zen, Alex Graves, Helen King, Tom Walters, Dan Belov, and Demis Hassabis

A Jain, A Xie, P Abbeel - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com

Diffusion models have shown impressive results in text-to-image synthesis. Using massive
datasets of captioned images, diffusion models learn to generate raster images of highly …

Cited by 73 · Related articles · All 6 versions

[PDF] neurips.cc

The challenge of realistic music generation: modelling raw audio at scale

S Dieleman, A Van Den Oord… - Advances in neural …, 2018 - proceedings.neurips.cc

Realistic music generation is a challenging task. When building generative models of music
that are learnt from data, typically high-level representations such as scores or MIDI are …

Cited by 223 · Related articles · All 6 versions

[PDF] arxiv.org

Parallel iterative edit models for local sequence transduction

A Awasthi, S Sarawagi, R Goyal, S Ghosh… - arXiv preprint arXiv …, 2019 - arxiv.org

We present a Parallel Iterative Edit (PIE) model for the problem of local sequence
transduction arising in tasks like Grammatical error correction (GEC). Recent approaches …

Cited by 181 · Related articles · All 6 versions

[PDF] mit.edu

Insertion-based decoding with automatically inferred generation order

J Gu, Q Liu, K Cho - Transactions of the Association for Computational …, 2019 - direct.mit.edu

Conventional neural autoregressive decoding commonly assumes a fixed left-to-right
generation order, which may be sub-optimal. In this work, we propose a novel decoding …

Cited by 114 · Related articles · All 8 versions

[PDF] uliege.be

Real-time voice cloning

C Jemine - 2019 - matheo.uliege.be

Recent advances in deep learning have shown impressive results in the domain of text-to-
speech. To this end, a deep neural network is usually trained using a corpus of several …

Cited by 49 · Related articles · All 2 versions

[PDF] aps.org

Neural canonical transformation with symplectic flows

SH Li, CX Dong, L Zhang, L Wang - Physical Review X, 2020 - APS

Canonical transformation plays a fundamental role in simplifying and solving classical
Hamiltonian systems. Intriguingly, it has a natural correspondence to normalizing flows with …

Cited by 36 · Related articles · All 14 versions

[PDF] arxiv.org

Seq-u-net: A one-dimensional causal u-net for efficient sequence modelling

D Stoller, M Tian, S Ewert, S Dixon - arXiv preprint arXiv:1911.06393, 2019 - arxiv.org

Convolutional neural networks (CNNs) with dilated filters such as the Wavenet or the
Temporal Convolutional Network (TCN) have shown good results in a variety of sequence …

Cited by 35 · Related articles · All 8 versions

[PDF] arxiv.org

Complex-valued neural networks for machine learning on non-stationary physical data

JS Dramsch, M Lüthje, AN Christensen - Computers & Geosciences, 2021 - Elsevier

Deep learning has become an area of interest in most scientific areas, including physical
sciences. Modern networks apply real-valued transformations on the data. Particularly …

Cited by 43 · Related articles · All 9 versions

[PDF] ismir.net

[PDF] Fast and Flexible Neural Audio Synthesis.

L Hantrakul, JH Engel, A Roberts, C Gu - ISMIR, 2019 - archives.ismir.net

Autoregressive neural networks, such as WaveNet, have opened up new avenues for
expressive audio synthesis. High-quality speech synthesis utilizes detailed linguistic …

Cited by 32 · Related articles

[PDF] arxiv.org

Anytime sampling for autoregressive models via ordered autoencoding

Y Xu, Y Song, S Garg, L Gong, R Shu, A Grover… - arXiv preprint arXiv …, 2021 - arxiv.org

Autoregressive models are widely used for tasks such as image and audio generation. The
sampling process of these models, however, does not allow interruptions and cannot adapt …

Cited by 21 · Related articles · All 3 versions
