Title | Authors | Venue | Cited by | Year |
Efficient resource allocation with fairness constraints in restless multi-armed bandits | D Li, P Varakantham | Uncertainty in Artificial Intelligence, 1158-1167, 2022 | 11 | 2022 |
CLAIM: Curriculum learning policy for influence maximization in unknown social networks | D Li, M Lowalekar, P Varakantham | Uncertainty in Artificial Intelligence, 1455-1465, 2021 | 10 | 2021 |
Aligning crowd feedback via distributional preference reward modeling | D Li, C Zhang, K Dong, DGX Deik, R Tang, Y Liu | arXiv preprint arXiv:2402.09764, 2024 | 5 | 2024 |
Diversity induced environment design via self-play | D Li, W Li, P Varakantham | arXiv preprint arXiv:2302.02119, 2023 | 5 | 2023 |
Effective diversity in unsupervised environment design | W Li, P Varakantham, D Li | arXiv preprint arXiv:2301.08025, 2023 | 5 | 2023 |
Towards soft fairness in restless multi-armed bandits | D Li, P Varakantham | arXiv preprint arXiv:2207.13343, 2022 | 4 | 2022 |
Avoiding starvation of arms in restless multi-armed bandit | D Li, P Varakantham | International Foundation for Autonomous Agents and Multiagent Systems, 2023 | 3 | 2023 |
Meta-Task Planning for Language Agents | C Zhang, DDG Xin, D Li, H Zhang, Y Liu | arXiv preprint arXiv:2405.16510, 2024 | 1 | 2024 |
Generalization through diversity: improving unsupervised environment design | W Li, P Varakantham, D Li | arXiv preprint arXiv:2301.08025, 2023 | 1 | 2023 |
EduQate: Generating Adaptive Curricula through RMABs in Education Settings | S Tio, D Li, P Varakantham | arXiv preprint arXiv:2406.14122, 2024 | | 2024 |
Sequential decision learning for social good and fairness | D Li | Singapore Management University, 2024 | | 2024 |
A Hierarchical Approach to Environment Design with Generative Trajectory Modeling | D Li, P Varakantham | arXiv preprint arXiv:2310.00301, 2023 | | 2023 |
Hidden State Approximation in Recurrent Neural Networks Using Continuous Particle Filtering | D Li | arXiv preprint arXiv:2212.09008, 2022 | | 2022 |
Marginal Benefit Induced Unsupervised Environment Design | D Li, W Li, P Varakantham | | | |