Reinforcing Long-Term Performance in Recommender Systems with User-Oriented Exploration Policy

C Zhang, S Chen, X Zhang, S Dai, W Yu… - Proceedings of the 47th …, 2024 - dl.acm.org
Reinforcement learning (RL) has gained popularity in recommender systems for improving
long-term performance by effectively exploring users' interests. However, modern …