Diffusion policies as an expressive policy class for offline reinforcement learning Z Wang, JJ Hunt, M Zhou arXiv preprint arXiv:2208.06193, 2022 | 187 | 2022 |
Diffusion-gan: Training gans with diffusion Z Wang, H Zheng, P He, W Chen, M Zhou arXiv preprint arXiv:2206.02262, 2022 | 166 | 2022 |
Patch diffusion: Faster and more data-efficient training of diffusion models Z Wang, Y Jiang, H Zheng, P Wang, P He, Z Wang, W Chen, M Zhou Advances in Neural Information Processing Systems 36, 2024 | 39 | 2024 |
In-context learning unlocked for diffusion models Z Wang, Y Jiang, Y Lu, P He, W Chen, Z Wang, M Zhou Advances in Neural Information Processing Systems 36, 8542-8562, 2023 | 38 | 2023 |
Thompson sampling via local uncertainty Z Wang, M Zhou International Conference on Machine Learning, 10115-10125, 2020 | 22 | 2020 |
Implicit Distributional Reinforcement Learning Y Yue, Z Wang, M Zhou Advances in Neural Information Processing Systems 33, 7135-7147, 2020 | 15 | 2020 |
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning S Yang, Z Wang, H Zheng, Y Feng, M Zhou arXiv preprint arXiv:2202.09673, 2022 | 14 | 2022 |
Probabilistic conformal prediction using conditional random samples Z Wang, R Gao, M Yin, M Zhou, DM Blei arXiv preprint arXiv:2206.06584, 2022 | 8 | 2022 |
Score identity distillation: Exponentially fast distillation of pretrained diffusion models for one-step generation M Zhou, H Zheng, Z Wang, M Yin, H Huang Forty-first International Conference on Machine Learning, 2024 | 5 | 2024 |
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts Y Yin, Z Wang, Y Gu, H Huang, W Chen, M Zhou arXiv preprint arXiv:2402.10958, 2024 | 5 | 2024 |
Beta diffusion M Zhou, T Chen, Z Wang, H Zheng Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation X Fan, Y Zhang, Z Wang, M Zhou International Conference on Learning Representations 2020, 2019 | 4 | 2019 |
Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts T Chen, Y Liu, Z Wang, J Yuan, Q You, H Yang, M Zhou arXiv preprint arXiv:2312.01408, 2023 | 2 | 2023 |
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling H Zheng, Z Wang, J Yuan, G Ning, P He, Q You, H Yang, M Zhou The Twelfth International Conference on Learning Representations, 2023 | 1 | 2023 |
Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation M Zhou, Z Wang, H Zheng, H Huang arXiv preprint arXiv:2406.01561, 2024 | | 2024 |
Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization Y Gu, Z Wang, Y Yin, Y Xie, M Zhou arXiv preprint arXiv:2406.06382, 2024 | | 2024 |
Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment Y Yin, Z Wang, Y Xie, W Chen, M Zhou arXiv preprint arXiv:2405.20830, 2024 | | 2024 |
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning T Chen, Z Wang, M Zhou arXiv preprint arXiv:2405.19690, 2024 | | 2024 |