Sim-to-real transfer in deep reinforcement learning for robotics: a survey W Zhao, JP Queralta, T Westerlund 2020 IEEE symposium series on computational intelligence (SSCI), 737-744, 2020 | 743 | 2020 |
MSS U-Net: 3D segmentation of kidneys and tumors from CT images with a multi-scale supervised U-Net W Zhao, D Jiang, JP Queralta, T Westerlund Informatics in Medicine Unlocked 19, 100357, 2020 | 79 | 2020 |
Towards closing the sim-to-real gap in collaborative multi-robot deep reinforcement learning W Zhao, JP Queralta, L Qingqing, T Westerlund 2020 5th International conference on robotics and automation engineering …, 2020 | 32 | 2020 |
Multi scale supervised 3D U-Net for kidney and tumor segmentation W Zhao, Z Zeng arXiv preprint arXiv:1908.03204, 2019 | 13 | 2019 |
Ubiquitous distributed deep reinforcement learning at the edge: Analyzing byzantine agents in discrete action spaces W Zhao, JP Queralta, L Qingqing, T Westerlund Procedia Computer Science 177, 324-329, 2020 | 10 | 2020 |
Multi-scale supervised 3D U-Net for kidneys and kidney tumor segmentation W Zhao, D Jiang, JP Queralta, T Westerlund arXiv preprint arXiv:2004.08108, 2020 | 6 | 2020 |
Less is more: Robust robot learning via partially observable multi-agent reinforcement learning W Zhao, EA Rantala, J Pajarinen, JP Queralta arXiv preprint arXiv:2309.14792, 2023 | 3 | 2023 |
Simplified temporal consistency reinforcement learning Y Zhao, W Zhao, R Boney, J Kannala, J Pajarinen International Conference on Machine Learning, 42227-42246, 2023 | 3 | 2023 |
Self-Paced Multi-Agent Reinforcement Learning W Zhao, J Pajarinen arXiv preprint arXiv:2205.10016, 2022 | 1 | 2022 |
Backpropagation Through Agents Z Li, W Zhao, L Wu, J Pajarinen Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 13718 …, 2024 | | 2024 |
AgentMixer: Multi-Agent Correlated Policy Factorization Z Li, W Zhao, L Wu, J Pajarinen arXiv preprint arXiv:2401.08728, 2024 | | 2024 |
Optimistic Multi-Agent Policy Gradient for Cooperative Tasks W Zhao, Y Zhao, Z Li, J Kannala, J Pajarinen arXiv preprint arXiv:2311.01953, 2023 | | 2023 |
VHL Gene Mutation Prediction of Clear Cell Renal Cell Carcinoma Based on CT Images W Zhao, J Plosila, Y Wang, MH Haghbayan | | 2019 |
Optimistic Multi-Agent Policy Gradient W Zhao, Y Zhao, Z Li, J Kannala, J Pajarinen Forty-first International Conference on Machine Learning, 0 | | |