- 学术资源搜索

Deep learning in electron microscopy

JM Ede - Machine Learning: Science and Technology, 2021 - iopscience.iop.org

Deep learning is transforming most areas of science and technology, including electron
microscopy. This review paper offers a practical perspective aimed at developers with …

被引用次数：104 相关文章所有 8 个版本

[PDF] neurips.cc

The surprising effectiveness of ppo in cooperative multi-agent games

C Yu, A Velu, E Vinitsky, J Gao… - Advances in …, 2022 - proceedings.neurips.cc

Abstract Proximal Policy Optimization (PPO) is a ubiquitous on-policy reinforcement learning
algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent …

被引用次数：1077 相关文章所有 12 个版本

[PDF] arxiv.org

A comprehensive survey of data augmentation in visual reinforcement learning

G Ma, Z Wang, Z Yuan, X Wang, B Yuan… - arXiv preprint arXiv …, 2022 - arxiv.org

Visual reinforcement learning (RL), which makes decisions directly from high-dimensional
visual inputs, has demonstrated significant potential in various domains. However …

被引用次数：20 相关文章所有 3 个版本

[PDF] neurips.cc

Why generalization in rl is difficult: Epistemic pomdps and implicit partial observability

D Ghosh, J Rahme, A Kumar, A Zhang… - Advances in neural …, 2021 - proceedings.neurips.cc

Generalization is a central challenge for the deployment of reinforcement learning (RL)
systems in the real world. In this paper, we show that the sequential structure of the RL …

被引用次数：99 相关文章所有 10 个版本

[PDF] neurips.cc

Plastic: Improving input and label plasticity for sample efficient reinforcement learning

H Lee, H Cho, H Kim, D Gwak, J Kim… - Advances in …, 2024 - proceedings.neurips.cc

Abstract In Reinforcement Learning (RL), enhancing sample efficiency is crucial, particularly
in scenarios when data acquisition is costly and risky. In principle, off-policy RL algorithms …

被引用次数：11 相关文章所有 5 个版本

[PDF] arxiv.org

Regularizing action policies for smooth control with reinforcement learning

S Mysore, B Mabsout, R Mancuso… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org

A critical problem with the practical utility of controllers trained with deep Reinforcement
Learning (RL) is the notable lack of smoothness in the actions learned by the RL policies …

被引用次数：67 相关文章所有 9 个版本

[PDF] arxiv.org

Transient non-stationarity and generalisation in deep reinforcement learning

M Igl, G Farquhar, J Luketina, W Boehmer… - arXiv preprint arXiv …, 2020 - arxiv.org

Non-stationarity can arise in Reinforcement Learning (RL) even in stationary environments.
For example, most RL algorithms collect new data throughout training, using a non …

被引用次数：72 相关文章所有 8 个版本

[PDF] arxiv.org

Training larger networks for deep reinforcement learning

K Ota, DK Jha, A Kanezaki - arXiv preprint arXiv:2102.07920, 2021 - arxiv.org

The success of deep learning in the computer vision and natural language processing
communities can be attributed to training of very deep neural networks with millions or …

被引用次数：43 相关文章所有 2 个版本

[PDF] arxiv.org

Sampling through the lens of sequential decision making

JX Dou, AQ Pan, R Bao, HH Mao, L Luo… - arXiv preprint arXiv …, 2022 - arxiv.org

Sampling is ubiquitous in machine learning methodologies. Due to the growth of large
datasets and model complexity, we want to learn and adapt the sampling process while …

被引用次数：20 相关文章所有 3 个版本

[PDF] arxiv.org

Grow your limits: Continuous improvement with real-world rl for robotic locomotion

L Smith, Y Cao, S Levine - arXiv preprint arXiv:2310.17634, 2023 - arxiv.org

Deep reinforcement learning (RL) can enable robots to autonomously acquire complex
behaviors, such as legged locomotion. However, RL in the real world is complicated by …

被引用次数：6 相关文章所有 3 个版本

高级搜索

QQ 群

Deep learning in electron microscopy

The surprising effectiveness of ppo in cooperative multi-agent games

A comprehensive survey of data augmentation in visual reinforcement learning

Why generalization in rl is difficult: Epistemic pomdps and implicit partial observability

Plastic: Improving input and label plasticity for sample efficient reinforcement learning

Regularizing action policies for smooth control with reinforcement learning

Transient non-stationarity and generalisation in deep reinforcement learning

Training larger networks for deep reinforcement learning

Sampling through the lens of sequential decision making

Grow your limits: Continuous improvement with real-world rl for robotic locomotion

引用