On the opportunities and challenges of offline reinforcement learning for recommender systems

X Chen, S Wang, J McAuley, D Jannach… - ACM Transactions on …, 2024 - dl.acm.org
Reinforcement learning serves as a potent tool for modeling dynamic user interests within
recommender systems, garnering increasing research attention of late. However, a …

Mgpolicy: Meta graph enhanced off-policy learning for recommendations

X Wang, Q Li, D Yu, Z Wang, H Chen… - Proceedings of the 45th …, 2022 - dl.acm.org
Off-policy learning has drawn huge attention in recommender systems (RS), which provides
an opportunity for reinforcement learning to abandon the expensive online training …

The minority matters: A diversity-promoting collaborative metric learning algorithm

S Bao, Q Xu, Z Yang, Y He, X Cao… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract Collaborative Metric Learning (CML) has recently emerged as a popular method in
recommendation systems (RS), closing the gap between metric learning and Collaborative …

Improved Diversity-Promoting Collaborative Metric Learning for Recommendation

S Bao, Q Xu, Z Yang, Y He, X Cao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Collaborative Metric Learning (CML) has recently emerged as a popular method in
recommendation systems (RS), closing the gap between metric learning and collaborative …

Limits to depth efficiencies of self-attention

Y Levine, N Wies, O Sharir, H Bata… - Advances in Neural …, 2020 - proceedings.neurips.cc
Self-attention architectures, which are rapidly pushing the frontier in natural language
processing, demonstrate a surprising depth-inefficient behavior: Empirical signals indicate …

Value penalized q-learning for recommender systems

C Gao, K Xu, K Zhou, L Li, X Wang, B Yuan… - Proceedings of the 45th …, 2022 - dl.acm.org
Scaling reinforcement learning (RL) to recommender systems (RS) is promising since
maximizing the expected cumulative rewards for RL agents meets the objective of RS, ie …

Rethinking collaborative metric learning: Toward an efficient alternative without negative sampling

S Bao, Q Xu, Z Yang, X Cao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The recently proposed Collaborative Metric Learning (CML) paradigm has aroused wide
interest in the area of recommendation systems (RS) owing to its simplicity and …

On the global convergence of fitted Q-iteration with two-layer neural network parametrization

M Gaur, V Aggarwal, M Agarwal - … Conference on Machine …, 2023 - proceedings.mlr.press
Deep Q-learning based algorithms have been applied successfully in many decision making
problems, while their theoretical foundations are not as well understood. In this paper, we …

Off-policy learning over heterogeneous information for recommendation

X Wang, Q Li, D Yu, G Xu - Proceedings of the ACM Web Conference …, 2022 - dl.acm.org
Reinforcement learning has recently become an active topic in recommender system
research, where the logged data that records interactions between items and users …

Batch-constrained distributional reinforcement learning for session-based recommendation

D Garg, P Gupta, P Malhotra, L Vig, G Shroff - arXiv preprint arXiv …, 2020 - arxiv.org
Most of the existing deep reinforcement learning (RL) approaches for session-based
recommendations either rely on costly online interactions with real users, or rely on …