PAC: Assisted Value Factorization with Counterfactual Predictions in Multi-Agent Reinforcement Learning H Zhou, T Lan, V Aggarwal Advances in Neural Information Processing Systems (NeurIPS) 36, 15757-15769, 2022 | 34 | 2022 |
MAC-PO: Multi-agent experience replay via collective priority optimization Y Mei, H Zhou, T Lan, G Venkataramani, P Wei International Conference on Autonomous Agents and Multiagent Systems 22, 466 …, 2023 | 32 | 2023 |
Value functions factorization with latent state information sharing in decentralized multi-agent policy gradients H Zhou, T Lan, V Aggarwal IEEE Transactions on Emerging Topics in Computational Intelligence 7 (5 …, 2023 | 28 | 2023 |
Projection-Optimal Monotonic Value Function Factorization in Multi-Agent Reinforcement Learning Y Mei, H Zhou, T Lan International Conference on Autonomous Agents and Multiagent Systems 23, 2024 | 21* | 2024 |
Every parameter matters: Ensuring the convergence of federated learning with dynamic heterogeneous models reduction H Zhou, T Lan, G Venkataramani, W Ding Advances in Neural Information Processing Systems (NeurIPS) 37, 2023 | 16* | 2023 |
Real-time Network Intrusion Detection via Decision Transformers J Chen, H Zhou, Y Mei, G Adam, ND Bastian, T Lan AAAI-24 Workshop on Artificial Intelligence for Cyber Security (AICS), 2023 | 8 | 2023 |
Federated Learning with Online Adaptive Heterogeneous Local Models H Zhou, T Lan, GP Venkataramani, W Ding Workshop on Federated Learning: Recent Advances and New Challenges (in …, 2022 | 8 | 2022 |
Collaborative AI teaming in unknown environments via active goal deduction Z Zhang, H Zhou, M Imani, T Lee, T Lan arXiv preprint arXiv:2403.15341, 2024 | 7 | 2024 |
Double Policy Estimation for Importance Sampling in Sequence Modeling-Based Reinforcement Learning H Zhou, T Lan, V Aggarwal NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023 | 6* | 2023 |
PT-VTON: an image-based virtual try-on network with progressive pose attention transfer H Zhou, T Lan, G Venkataramani arXiv preprint arXiv:2111.12167, 2021 | 6 | 2021 |
ConcaveQ: Non-Monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning H Li, H Zhou, Y Zou, D Yu, T Lan Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence …, 2024 | 5 | 2024 |
Hunting garbage collection related concurrency bugs through critical condition restoration H Zhou, T Lan, G Venkataramani Proceedings of the 2020 ACM Workshop on Forming an Ecosystem Around Software …, 2020 | 5 | 2020 |
Two-tiered online optimization of region-wide datacenter resource allocation via deep reinforcement learning CL Chen, H Zhou, J Chen, M Pedramfar, V Aggarwal, T Lan, Z Zhu, ... arXiv preprint arXiv:2306.17054, 2023 | 4 | 2023 |