O Kilinc, G Montana - arXiv preprint arXiv:1812.00922, 2018 - arxiv.org
… ’ observations are also extremely noisy, hence only weakly … To overcome these difficulties, we propose a multi-agent deep … However, our environments provide no explicit feedback …
Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org
… data; and in reinforcement learning, there are evaluative feedbacks, but no supervised … from an exploration policy by adding noise sampled from a noise process to the actor policy. More …
Z Xie, G Berseth, P Clary, J Hurst… - 2018 IEEE/RSJ …, 2018 - ieeexplore.ieee.org
… the state must be estimated from noisy sensor measurements. We are currently extending our framework to work directly with output (sensory) feedback. Furthermore, even though the …
SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
… High variance means that the model is sensitive to noise, ie, a small fluctuation in the input will cause a large error in the output. In this situation, the model cannot be used to accurately …
… learning algorithm that can learn robust feedback control laws from … feedback control problems within the deepreinforcement … state observations st from noisy, correlated observations. …
… Deepreinforcement learning can learn effective policies for a … , and poor results when learning from noisy, sparse or delayed … that optimally induces corrective feedback, which we show …
… this alignment is Reinforcement Learning from Human Feedback (… on human-provided feedback and preferences [14, 3, 53]. … simulate noisy human preferences, we consider three noise …
… Our results indicate that our framework converges to policies that are perceived as safe, is robust against noisyfeedback, and can query feedback for multiple policies at the same time. …
… When the Bayesian feedback is instead driven by the noisy measurement current I(t) (which is available in experiments), we find that Bayesian feedback demonstrates almost no control …