A utility-based analysis of equilibria in multi-objective normal-form games R Rădulescu, P Mannion, Y Zhang, DM Roijers, A Nowé The Knowledge Engineering Review 35, e32, 2020 | 31 | 2020 |
Opponent learning awareness and modelling in multi-objective normal form games R Rădulescu, T Verstraeten, Y Zhang, P Mannion, DM Roijers, A Nowé Neural Computing and Applications, 1-23, 2022 | 19 | 2022 |
Opponent Modelling for Reinforcement Learning in Multi-Objective Normal Form Games. Y Zhang, R Radulescu, P Mannion, DM Roijers, A Nowé AAMAS, 2080-2082, 2020 | 18 | 2020 |
Deep coherent exploration for continuous control Y Zhang, H Van Hoof ICML 2021, 2021 | 12 | 2021 |
Opponent modelling using policy reconstruction for multi-objective normal form games Y Zhang, R Rădulescu, P Mannion, DM Roijers, A Nowé Proceedings of the Adaptive and Learning Agents Workshop (ALA-20) at AAMAS …, 2020 | 7 | 2020 |
Scaling up q-learning via exploiting state–action equivalence Y Lyu, A Côme, Y Zhang, MS Talebi Entropy 25 (4), 584, 2023 | 4 | 2023 |
On the inconsistency of Bayesian inference for misspecified neural networks Y Zhang, E Nalisnick Third Symposium on Advances in Approximate Bayesian Inference, 2021 | 4 | 2021 |
If there is no underfitting, there is no Cold Posterior Effect Y Zhang, YS Wu, LA Ortega, AR Masegosa arXiv preprint arXiv:2310.01189, 2023 | 1 | 2023 |
Recursive PAC-Bayes: A Frequentist Approach to Sequential Prior Updates with No Information Loss YS Wu, Y Zhang, BE Chérief-Abdellatif, Y Seldin arXiv preprint arXiv:2405.14681, 2024 | | 2024 |
The Cold Posterior Effect Indicates Underfitting, and Cold Posteriors Represent a Fully Bayesian Method to Mitigate It Y Zhang, YS Wu, LA Ortega, AR Masegosa | | |