关注
Hanhan Zhou
Hanhan Zhou
PhD student at the George Washington University
在 gwu.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
PAC: Assisted Value Factorization with Counterfactual Predictions in Multi-Agent Reinforcement Learning
H Zhou, T Lan, V Aggarwal
Advances in Neural Information Processing Systems (NeurIPS) 36, 15757-15769, 2022
342022
MAC-PO: Multi-agent experience replay via collective priority optimization
Y Mei, H Zhou, T Lan, G Venkataramani, P Wei
International Conference on Autonomous Agents and Multiagent Systems 22, 466 …, 2023
322023
Value functions factorization with latent state information sharing in decentralized multi-agent policy gradients
H Zhou, T Lan, V Aggarwal
IEEE Transactions on Emerging Topics in Computational Intelligence 7 (5 …, 2023
282023
Projection-Optimal Monotonic Value Function Factorization in Multi-Agent Reinforcement Learning
Y Mei, H Zhou, T Lan
International Conference on Autonomous Agents and Multiagent Systems 23, 2024
21*2024
Every parameter matters: Ensuring the convergence of federated learning with dynamic heterogeneous models reduction
H Zhou, T Lan, G Venkataramani, W Ding
Advances in Neural Information Processing Systems (NeurIPS) 37, 2023
16*2023
Real-time Network Intrusion Detection via Decision Transformers
J Chen, H Zhou, Y Mei, G Adam, ND Bastian, T Lan
AAAI-24 Workshop on Artificial Intelligence for Cyber Security (AICS), 2023
82023
Federated Learning with Online Adaptive Heterogeneous Local Models
H Zhou, T Lan, GP Venkataramani, W Ding
Workshop on Federated Learning: Recent Advances and New Challenges (in …, 2022
82022
Collaborative ai teaming in unknown environments via active goal deduction
Z Zhang, H Zhou, M Imani, T Lee, T Lan
arXiv preprint arXiv:2403.15341, 2024
72024
Double Policy Estimation for Importance Sampling in Sequence Modeling-Based Reinforcement Learning
H Zhou, T Lan, V Aggarwal
NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023
6*2023
Pt-vton: an image-based virtual try-on network with progressive pose attention transfer
H Zhou, T Lan, G Venkataramani
arXiv preprint arXiv:2111.12167, 2021
62021
ConcaveQ: Non-Monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning
H Li, H Zhou, Y Zou, D Yu, T Lan
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence …, 2024
52024
Hunting garbage collection related concurrency bugs through critical condition restoration
H Zhou, T Lan, G Venkataramani
Proceedings of the 2020 ACM Workshop on Forming an Ecosystem Around Software …, 2020
52020
Two-tiered online optimization of region-wide datacenter resource allocation via deep reinforcement learning
CL Chen, H Zhou, J Chen, M Pedramfar, V Aggarwal, T Lan, Z Zhu, ...
arXiv preprint arXiv:2306.17054, 2023
42023
系统目前无法执行此操作,请稍后再试。
文章 1–13