所有版本 - 学术资源搜索

文章

学术资源搜索

获得 2 条结果（用时0.03秒）

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

S Yang, Z Zhong, M Zhao, S Takahashi, M Ishii… - arXiv preprint arXiv …, 2024 - arxiv.org

In recent years, with the realistic generation results and a wide range of personalized
applications, diffusion-based generative models gain huge attention in both visual and …

被引用次数：1 相关文章

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

S Yang, Z Zhong, M Zhao, S Takahashi, M Ishii… - arXiv e …, 2024 - ui.adsabs.harvard.edu

In recent years, with the realistic generation results and a wide range of personalized
applications, diffusion-based generative models gain huge attention in both visual and …

高级搜索

QQ 群

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

引用