deep reinforcement noisy feedback- 学术资源搜索

Deep reinforcement learning autoencoder with noisy feedback

M Goutay, FA Aoudia, J Hoydis - … International Symposium on …, 2019 - ieeexplore.ieee.org

… noisy feedback channel. Then, we design a system that learns to transmit real numbers over
an unknown channel without a preexisting feedback … the effect of noisy feedback on the end-…

被引用次数：46 相关文章所有 7 个版本

[PDF] arxiv.org

Multi-agent deep reinforcement learning with extremely noisy observations

O Kilinc, G Montana - arXiv preprint arXiv:1812.00922, 2018 - arxiv.org

… ’ observations are also extremely noisy, hence only weakly … To overcome these difficulties,
we propose a multi-agent deep … However, our environments provide no explicit feedback …

被引用次数：44 相关文章所有 4 个版本

[PDF] arxiv.org

Deep reinforcement learning: An overview

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org

… data; and in reinforcement learning, there are evaluative feedbacks, but no supervised … from
an exploration policy by adding noise sampled from a noise process to the actor policy. More …

被引用次数：1804 相关文章所有 6 个版本

[PDF] arxiv.org

Feedback control for cassie with deep reinforcement learning

Z Xie, G Berseth, P Clary, J Hurst… - 2018 IEEE/RSJ …, 2018 - ieeexplore.ieee.org

… the state must be estimated from noisy sensor measurements. We are currently extending
our framework to work directly with output (sensory) feedback. Furthermore, even though the …

被引用次数：197 相关文章所有 11 个版本

[PDF] arxiv.org

Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer

… High variance means that the model is sensitive to noise, ie, a small fluctuation in the input
will cause a large error in the output. In this situation, the model cannot be used to accurately …

被引用次数：329 相关文章所有 9 个版本

[PDF] ntnu.no

A novel approach to feedback control with deep reinforcement learning

Y Wang, K Velswamy, B Huang - IFAC-PapersOnLine, 2018 - Elsevier

… learning algorithm that can learn robust feedback control laws from … feedback control problems
within the deep reinforcement … state observations st from noisy, correlated observations. …

被引用次数：55 相关文章所有 2 个版本

[PDF] neurips.cc

Discor: Corrective feedback in reinforcement learning via distribution correction

A Kumar, A Gupta, S Levine - Advances in Neural …, 2020 - proceedings.neurips.cc

… Deep reinforcement learning can learn effective policies for a … , and poor results when learning
from noisy, sparse or delayed … that optimally induces corrective feedback, which we show …

被引用次数：109 相关文章所有 7 个版本

[PDF] arxiv.org

Robust Reinforcement Learning from Corrupted Human Feedback

A Bukharin, I Hong, H Jiang, Q Zhang, Z Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

… this alignment is Reinforcement Learning from Human Feedback (… on human-provided
feedback and preferences [14, 3, 53]. … simulate noisy human preferences, we consider three noise …

Human-feedback shield synthesis for perceived safety in deep reinforcement learning

D Marta, C Pek, GI Melsión, J Tumova… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

… Our results indicate that our framework converges to policies that are perceived as safe, is
robust against noisy feedback, and can query feedback for multiple policies at the same time. …

被引用次数：8 相关文章所有 4 个版本

[PDF] aps.org

Measurement-based feedback quantum control with deep reinforcement learning for a double-well nonlinear potential

S Borah, B Sarma, M Kewming, GJ Milburn, J Twamley - Physical review letters, 2021 - APS

… When the Bayesian feedback is instead driven by the noisy measurement current I(t) (which
is available in experiments), we find that Bayesian feedback demonstrates almost no control …

被引用次数：55 相关文章所有 11 个版本

高级搜索

QQ 群

Deep reinforcement learning autoencoder with noisy feedback

Multi-agent deep reinforcement learning with extremely noisy observations

Deep reinforcement learning: An overview

Feedback control for cassie with deep reinforcement learning

Deep reinforcement learning

A novel approach to feedback control with deep reinforcement learning

Discor: Corrective feedback in reinforcement learning via distribution correction

Robust Reinforcement Learning from Corrupted Human Feedback

Human-feedback shield synthesis for perceived safety in deep reinforcement learning

Measurement-based feedback quantum control with deep reinforcement learning for a double-well nonlinear potential

相关搜索

引用