Toshimitsu Uesaka
Sony Research Inc.
Verified email at sony.com
Title
Cited by
Year
Consistency trajectory models: Learning probability flow ODE trajectory of diffusion
D Kim, CH Lai, WH Liao, N Murata, Y Takida, T Uesaka, Y He, Y Mitsufuji, ...
arXiv preprint arXiv:2310.02279, 2023
Cited by 55, 2023
SQ-VAE: Variational Bayes on discrete representation with self-annealed stochastic quantization
Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ...
arXiv preprint arXiv:2205.07547, 2022
Cited by 48, 2022
GibbsDDRM: A partially collapsed Gibbs sampler for solving blind inverse problems with denoising diffusion restoration
N Murata, K Saito, CH Lai, Y Takida, T Uesaka, Y Mitsufuji, S Ermon
International conference on machine learning, 25501-25522, 2023
Cited by 30, 2023
Unsupervised vocal dereverberation with diffusion-based generative models
K Saito, N Murata, T Uesaka, CH Lai, Y Takida, T Fukui, Y Mitsufuji
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 20, 2023
FP-Diffusion: Improving score-based diffusion models by enforcing the underlying score Fokker-Planck equation
CH Lai, Y Takida, N Murata, T Uesaka, Y Mitsufuji, S Ermon
International Conference on Machine Learning, 18365-18398, 2023
Cited by 19, 2023
Manifold preserving guided diffusion
Y He, N Murata, CH Lai, Y Takida, T Uesaka, D Kim, WH Liao, Y Mitsufuji, ...
arXiv preprint arXiv:2311.16424, 2023
Cited by 16, 2023
DiffRoll: Diffusion-based generative music transcription with unsupervised pretraining capability
KW Cheuk, R Sawata, T Uesaka, N Murata, N Takahashi, S Takahashi, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 15, 2023
Preventing posterior collapse induced by oversmoothing in Gaussian VAE
Y Takida, WH Liao, T Uesaka, S Takahashi, Y Mitsufuji
arXiv preprint arXiv:2102.08663 3 (5), 6, 2021
Cited by 15, 2021
Multi-view learning over retinal thickness and visual sensitivity on glaucomatous eyes
T Uesaka, K Morino, H Sugiura, T Kiwaki, H Murata, R Asaoka, ...
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge …, 2017
Cited by 12, 2017
Regularizing score-based models with score Fokker-Planck equations
CH Lai, Y Takida, N Murata, T Uesaka, Y Mitsufuji, S Ermon
NeurIPS 2022 Workshop on Score-Based Methods, 2022
Cited by 11, 2022
A versatile diffusion-based generative refiner for speech enhancement
R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ...
arXiv preprint arXiv:2210.17287, 2022
Cited by 9, 2022
Preventing oversmoothing in VAE via generalized variance parameterization
Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji
Neurocomputing 509, 137-156, 2022
Cited by 8, 2022
On the equivalence of consistency-type models: Consistency models, consistent diffusion models, and Fokker-Planck regularization
CH Lai, Y Takida, T Uesaka, N Murata, Y Mitsufuji, S Ermon
arXiv preprint arXiv:2306.00367, 2023
Cited by 6, 2023
SAN: Inducing metrizability of GAN with discriminative normalized linear layer
Y Takida, M Imaizumi, T Shibuya, CH Lai, T Uesaka, N Murata, Y Mitsufuji
arXiv preprint arXiv:2301.12811, 2023
Cited by 5, 2023
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ...
arXiv preprint arXiv:2210.17287, 2022
Cited by 4, 2022
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ...
arXiv preprint arXiv:2401.00365, 2023
Cited by 2, 2023
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher
D Kim, CH Lai, WH Liao, Y Takida, N Murata, T Uesaka, Y Mitsufuji, ...
arXiv preprint arXiv:2405.14822, 2024
Cited by 1, 2024
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
YJ Luo, KW Cheuk, W Choi, T Uesaka, K Toyama, K Saito, CH Lai, ...
arXiv preprint arXiv:2408.10807, 2024
2024
INFORMATION PROCESSING APPARATUS, SIGNAL PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM
Y Koyama, M Hentschel, K Kuroda, M Nakamura, H Ogawa, K Shibata, ...
US Patent App. 18/563,940, 2024
2024
Understanding Multimodal Contrastive Learning Through Pointwise Mutual Information
T Uesaka, T Suzuki, Y Takida, CH Lai, N Murata, Y Mitsufuji
arXiv preprint arXiv:2404.19228, 2024
2024