Off-policy recommendation system without exploration

X Chen, S Wang, J McAuley, D Jannach… - ACM Transactions on …, 2024 - dl.acm.org

Reinforcement learning serves as a potent tool for modeling dynamic user interests within
recommender systems, garnering increasing research attention of late. However, a …

被引用次数：5 相关文章所有 4 个版本

[PDF] academia.edu

Mgpolicy: Meta graph enhanced off-policy learning for recommendations

X Wang, Q Li, D Yu, Z Wang, H Chen… - Proceedings of the 45th …, 2022 - dl.acm.org

Off-policy learning has drawn huge attention in recommender systems (RS), which provides
an opportunity for reinforcement learning to abandon the expensive online training …

被引用次数：10 相关文章所有 6 个版本

[PDF] neurips.cc

The minority matters: A diversity-promoting collaborative metric learning algorithm

S Bao, Q Xu, Z Yang, Y He, X Cao… - Advances in Neural …, 2022 - proceedings.neurips.cc

Abstract Collaborative Metric Learning (CML) has recently emerged as a popular method in
recommendation systems (RS), closing the gap between metric learning and Collaborative …

被引用次数：6 相关文章所有 8 个版本

Improved Diversity-Promoting Collaborative Metric Learning for Recommendation

S Bao, Q Xu, Z Yang, Y He, X Cao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Collaborative Metric Learning (CML) has recently emerged as a popular method in
recommendation systems (RS), closing the gap between metric learning and collaborative …

Limits to depth efficiencies of self-attention

Y Levine, N Wies, O Sharir, H Bata… - Advances in Neural …, 2020 - proceedings.neurips.cc

Self-attention architectures, which are rapidly pushing the frontier in natural language
processing, demonstrate a surprising depth-inefficient behavior: Empirical signals indicate …

被引用次数：35 相关文章所有 3 个版本

[PDF] arxiv.org

Value penalized q-learning for recommender systems

C Gao, K Xu, K Zhou, L Li, X Wang, B Yuan… - Proceedings of the 45th …, 2022 - dl.acm.org

Scaling reinforcement learning (RL) to recommender systems (RS) is promising since
maximizing the expected cumulative rewards for RL agents meets the objective of RS, ie …

被引用次数：18 相关文章所有 6 个版本

[PDF] arxiv.org

Rethinking collaborative metric learning: Toward an efficient alternative without negative sampling

S Bao, Q Xu, Z Yang, X Cao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

The recently proposed Collaborative Metric Learning (CML) paradigm has aroused wide
interest in the area of recommendation systems (RS) owing to its simplicity and …

被引用次数：10 相关文章所有 6 个版本

[PDF] mlr.press

On the global convergence of fitted Q-iteration with two-layer neural network parametrization

M Gaur, V Aggarwal, M Agarwal - … Conference on Machine …, 2023 - proceedings.mlr.press

Deep Q-learning based algorithms have been applied successfully in many decision making
problems, while their theoretical foundations are not as well understood. In this paper, we …

被引用次数：2 相关文章所有 7 个版本

[PDF] researchgate.net

Off-policy learning over heterogeneous information for recommendation

X Wang, Q Li, D Yu, G Xu - Proceedings of the ACM Web Conference …, 2022 - dl.acm.org

Reinforcement learning has recently become an active topic in recommender system
research, where the logged data that records interactions between items and users …

被引用次数：8 相关文章所有 6 个版本

[PDF] arxiv.org

Batch-constrained distributional reinforcement learning for session-based recommendation

D Garg, P Gupta, P Malhotra, L Vig, G Shroff - arXiv preprint arXiv …, 2020 - arxiv.org

Most of the existing deep reinforcement learning (RL) approaches for session-based
recommendations either rely on costly online interactions with real users, or rely on …

被引用次数：12 相关文章所有 5 个版本

高级搜索

QQ 群