Xichen Pan
Verified email at nyu.edu
Title
Cited by
Year
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
X Pan, P Chen, Y Gong, H Zhou, X Wang, Z Lin
ACL 2022 Main Conference 1, 4491--4503, 2022
Cited by: 43; Year: 2022
Synthesizing coherent story with auto-regressive latent diffusion models
X Pan, P Qin, Y Li, H Xue, W Chen
WACV 2024 (Oral), 2920--2930, 2022
Cited by: 38; Year: 2022
Kosmos-G: Generating images in context with multimodal large language models
X Pan, L Dong, S Huang, Z Peng, W Chen, F Wei
ICLR 2024, 2023
Cited by: 32; Year: 2023
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
S Tong, E Brown, P Wu, S Woo, M Middepogu, SC Akula, J Yang, S Yang, ...
arXiv preprint arXiv:2406.16860, 2024
Cited by: 11; Year: 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
J Yenphraphai, X Pan, S Liu, D Panozzo, S Xie
CVPR 2024, 2024
Cited by: 5; Year: 2024