Advances in adversarial attacks and defenses in computer vision: A survey

N Akhtar, A Mian, N Kardan, M Shah - IEEE Access, 2021 - ieeexplore.ieee.org
Deep Learning is the most widely used tool in the contemporary field of computer vision. Its
ability to accurately solve complex problems is employed in vision research to learn deep …

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Open problems and fundamental limitations of reinforcement learning from human feedback

S Casper, X Davies, C Shi, TK Gilbert… - arXiv preprint arXiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …

Stable-baselines3: Reliable reinforcement learning implementations

A Raffin, A Hill, A Gleave, A Kanervisto… - Journal of Machine …, 2021 - jmlr.org
STABLE-BASELINES3 provides open-source implementations of deep reinforcement
learning (RL) algorithms in Python. The implementations have been benchmarked against …

Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer
The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

Unsolved problems in ml safety

D Hendrycks, N Carlini, J Schulman… - arXiv preprint arXiv …, 2021 - arxiv.org
Machine learning (ML) systems are rapidly increasing in size, are acquiring new
capabilities, and are increasingly deployed in high-stakes settings. As with other powerful …

[HTML][HTML] Deep reinforcement learning in recommender systems: A survey and new perspectives

X Chen, L Yao, J McAuley, G Zhou, X Wang - Knowledge-Based Systems, 2023 - Elsevier
In light of the emergence of deep reinforcement learning (DRL) in recommender systems
research and several fruitful results in recent years, this survey aims to provide a timely and …

Single and multi-agent deep reinforcement learning for AI-enabled wireless networks: A tutorial

A Feriani, E Hossain - IEEE Communications Surveys & …, 2021 - ieeexplore.ieee.org
Deep Reinforcement Learning (DRL) has recently witnessed significant advances that have
led to multiple successes in solving sequential decision-making problems in various …

Natural attack for pre-trained models of code

Z Yang, J Shi, J He, D Lo - … of the 44th International Conference on …, 2022 - dl.acm.org
Pre-trained models of code have achieved success in many important software engineering
tasks. However, these powerful models are vulnerable to adversarial attacks that slightly …

Robust reinforcement learning: A review of foundations and recent advances

J Moos, K Hansel, H Abdulsamad, S Stark… - Machine Learning and …, 2022 - mdpi.com
Reinforcement learning (RL) has become a highly successful framework for learning in
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …