Dataset Clustering for Improved Offline Policy Learning

Q Wang, Y Deng, FR Sanchez, K Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Offline policy learning aims to discover decision-making policies from previously-collected
datasets without additional online interactions with the environment. As the training dataset …

Dataset Clustering for Improved Offline Policy Learning

Q Wang, Y Deng, F Roldan Sanchez, K Wang… - arXiv e …, 2024 - ui.adsabs.harvard.edu