Learning to schedule in diffusion probabilistic models

Y Wang, X Wang, AD Dinh, B Du, C Xu - Proceedings of the 29th ACM …, 2023 - dl.acm.org
Recently, the field of generative models has seen a significant advancement with the
introduction of Diffusion Probabilistic Models (DPMs). The Denoising Diffusion Implicit Model …

Distributional pareto-optimal multi-objective reinforcement learning

XQ Cai, P Zhang, L Zhao, J Bian… - Advances in …, 2024 - proceedings.neurips.cc
Multi-objective reinforcement learning (MORL) has been proposed to learn control policies
over multiple competing objectives with each possible preference over returns. However …

Imitation learning from vague feedback

XQ Cai, YJ Zhang, CK Chiang… - Advances in Neural …, 2023 - proceedings.neurips.cc
Imitation learning from human feedback studies how to train well-performed imitation agents
with an annotator's relative comparison of two demonstrations (one demonstration is …

Imitation learning from purified demonstration

Y Wang, M Dong, B Du, C Xu - arXiv preprint arXiv:2310.07143, 2023 - arxiv.org
Imitation learning has emerged as a promising approach for addressing sequential decision-
making problems, with the assumption that expert demonstrations are optimal. However, in …

Reinforcement learning from suboptimal demonstrations based on Reward Relabeling

Y Peng, J Zeng, Y Hu, Q Fang, Q Yin - Expert Systems with Applications, 2024 - Elsevier
Deep reinforcement learning (DRL) has achieved remarkable milestones in the field of
artificial intelligence. However, the reward functions for most real-world tasks are …

PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization

Y Ye, LA Tang, H Wang, R Yu, W Yu, E He… - Proceedings of the 30th …, 2024 - dl.acm.org
Achieving carbon neutrality within industrial operations has become increasingly imperative
for sustainable development. It is both a significant challenge and a key opportunity for …

Visual Imitation Learning with Calibrated Contrastive Representation

Y Wang, L Tao, B Du, Y Lin, C Xu - arXiv preprint arXiv:2401.11396, 2024 - arxiv.org
Adversarial Imitation Learning (AIL) allows the agent to reproduce expert behavior with low-
dimensional states and actions. However, challenges arise in handling visual states due to …

Ranking-Based Generative Adversarial Imitation Learning

Z Shi, X Zhang, Y Fang, C Li, G Liu… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
In imitation learning, it is often assumed the demonstration data are optimal, even though
they are imperfect in practice. The imperfect demonstrations result from expert errors, large …