Adversarial policies: Attacking deep reinforcement learning

N Akhtar, A Mian, N Kardan, M Shah - IEEE Access, 2021 - ieeexplore.ieee.org

Deep Learning is the most widely used tool in the contemporary field of computer vision. Its
ability to accurately solve complex problems is employed in vision research to learn deep …

被引用次数：217 相关文章所有 6 个版本

[PDF] arxiv.org

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

被引用次数：94 相关文章所有 3 个版本

[PDF] arxiv.org

Open problems and fundamental limitations of reinforcement learning from human feedback

S Casper, X Davies, C Shi, TK Gilbert… - arXiv preprint arXiv …, 2023 - arxiv.org

Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …

被引用次数：244 相关文章所有 6 个版本

[PDF] jmlr.org

Stable-baselines3: Reliable reinforcement learning implementations

A Raffin, A Hill, A Gleave, A Kanervisto… - Journal of Machine …, 2021 - jmlr.org

STABLE-BASELINES3 provides open-source implementations of deep reinforcement
learning (RL) algorithms in Python. The implementations have been benchmarked against …

被引用次数：1941 相关文章所有 9 个版本

[PDF] springer.com

Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer

The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

被引用次数：512 相关文章所有 8 个版本

[PDF] arxiv.org

Unsolved problems in ml safety

D Hendrycks, N Carlini, J Schulman… - arXiv preprint arXiv …, 2021 - arxiv.org

Machine learning (ML) systems are rapidly increasing in size, are acquiring new
capabilities, and are increasingly deployed in high-stakes settings. As with other powerful …

被引用次数：271 相关文章所有 6 个版本

[HTML] sciencedirect.com

[HTML][HTML] Deep reinforcement learning in recommender systems: A survey and new perspectives

X Chen, L Yao, J McAuley, G Zhou, X Wang - Knowledge-Based Systems, 2023 - Elsevier

In light of the emergence of deep reinforcement learning (DRL) in recommender systems
research and several fruitful results in recent years, this survey aims to provide a timely and …

被引用次数：63 相关文章所有 4 个版本

[PDF] arxiv.org

Single and multi-agent deep reinforcement learning for AI-enabled wireless networks: A tutorial

A Feriani, E Hossain - IEEE Communications Surveys & …, 2021 - ieeexplore.ieee.org

Deep Reinforcement Learning (DRL) has recently witnessed significant advances that have
led to multiple successes in solving sequential decision-making problems in various …

被引用次数：237 相关文章所有 3 个版本

[PDF] arxiv.org

Natural attack for pre-trained models of code

Z Yang, J Shi, J He, D Lo - … of the 44th International Conference on …, 2022 - dl.acm.org

Pre-trained models of code have achieved success in many important software engineering
tasks. However, these powerful models are vulnerable to adversarial attacks that slightly …

被引用次数：111 相关文章所有 8 个版本

[PDF] mdpi.com

Robust reinforcement learning: A review of foundations and recent advances

J Moos, K Hansel, H Abdulsamad, S Stark… - Machine Learning and …, 2022 - mdpi.com

Reinforcement learning (RL) has become a highly successful framework for learning in
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …

被引用次数：92 相关文章所有 7 个版本

高级搜索

QQ 群