Zhendong Wang
Verified email at utexas.edu - Homepage
Title
Cited by
Year
Diffusion policies as an expressive policy class for offline reinforcement learning
Z Wang, JJ Hunt, M Zhou
arXiv preprint arXiv:2208.06193, 2022
Cited by: 187; Year: 2022
Diffusion-gan: Training gans with diffusion
Z Wang, H Zheng, P He, W Chen, M Zhou
arXiv preprint arXiv:2206.02262, 2022
Cited by: 166; Year: 2022
Patch diffusion: Faster and more data-efficient training of diffusion models
Z Wang, Y Jiang, H Zheng, P Wang, P He, Z Wang, W Chen, M Zhou
Advances in Neural Information Processing Systems 36, 2024
Cited by: 39; Year: 2024
In-context learning unlocked for diffusion models
Z Wang, Y Jiang, Y Lu, P He, W Chen, Z Wang, M Zhou
Advances in Neural Information Processing Systems 36, 8542-8562, 2023
Cited by: 38; Year: 2023
Thompson sampling via local uncertainty
Z Wang, M Zhou
International Conference on Machine Learning, 10115-10125, 2020
Cited by: 22; Year: 2020
Implicit Distributional Reinforcement Learning
Y Yue, Z Wang, M Zhou
Advances in Neural Information Processing Systems 33, 7135-7147, 2020
Cited by: 15; Year: 2020
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning
S Yang, Z Wang, H Zheng, Y Feng, M Zhou
arXiv preprint arXiv:2202.09673, 2022
Cited by: 14; Year: 2022
Probabilistic conformal prediction using conditional random samples
Z Wang, R Gao, M Yin, M Zhou, DM Blei
arXiv preprint arXiv:2206.06584, 2022
Cited by: 8; Year: 2022
Score identity distillation: Exponentially fast distillation of pretrained diffusion models for one-step generation
M Zhou, H Zheng, Z Wang, M Yin, H Huang
Forty-first International Conference on Machine Learning, 2024
Cited by: 5; Year: 2024
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
Y Yin, Z Wang, Y Gu, H Huang, W Chen, M Zhou
arXiv preprint arXiv:2402.10958, 2024
Cited by: 5; Year: 2024
Beta diffusion
M Zhou, T Chen, Z Wang, H Zheng
Advances in Neural Information Processing Systems 36, 2024
Cited by: 4; Year: 2024
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation
X Fan, Y Zhang, Z Wang, M Zhou
International Conference on Learning Representations 2020, 2019
Cited by: 4; Year: 2019
Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts
T Chen, Y Liu, Z Wang, J Yuan, Q You, H Yang, M Zhou
arXiv preprint arXiv:2312.01408, 2023
Cited by: 2; Year: 2023
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
H Zheng, Z Wang, J Yuan, G Ning, P He, Q You, H Yang, M Zhou
The Twelfth International Conference on Learning Representations, 2023
Cited by: 1; Year: 2023
Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation
M Zhou, Z Wang, H Zheng, H Huang
arXiv preprint arXiv:2406.01561, 2024
Year: 2024
Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization
Y Gu, Z Wang, Y Yin, Y Xie, M Zhou
arXiv preprint arXiv:2406.06382, 2024
Year: 2024
Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Y Yin, Z Wang, Y Xie, W Chen, M Zhou
arXiv preprint arXiv:2405.20830, 2024
Year: 2024
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
T Chen, Z Wang, M Zhou
arXiv preprint arXiv:2405.19690, 2024
Year: 2024