Mushroomrl: Simplifying reinforcement learning research

S Huang, RFJ Dossa, C Ye, J Braga… - Journal of Machine …, 2022 - jmlr.org

CleanRL is an open-source library that provides high-quality single-file implementations of
Deep Reinforcement Learning (DRL) algorithms. These single-file implementations are self …

被引用次数：180 相关文章所有 4 个版本

[PDF] jmlr.org

Tianshou: A highly modularized deep reinforcement learning library

J Weng, H Chen, D Yan, K You, A Duburcq… - Journal of Machine …, 2022 - jmlr.org

In this paper, we present Tianshou, a highly modularized Python library for deep
reinforcement learning (DRL) that uses PyTorch as its backend. Tianshou intends to be …

被引用次数：111 相关文章所有 3 个版本

[PDF] arxiv.org

Sharing knowledge in multi-task deep reinforcement learning

C D'Eramo, D Tateo, A Bonarini, M Restelli… - arXiv preprint arXiv …, 2024 - arxiv.org

We study the benefit of sharing representations among tasks to enable the effective use of
deep neural networks in Multi-Task Reinforcement Learning. We leverage the assumption …

被引用次数：121 相关文章所有 8 个版本

[PDF] researchgate.net

Reinforcement learning in wind energy-a review

VL Narayanan - International Journal of Green Energy, 2023 - Taylor & Francis

Today's environmental concerns, particularly those related to global warming, have sparked
a drive for the usage of renewable energy sources. One of the most significant sources of …

被引用次数：4 相关文章所有 3 个版本

[PDF] springer.com

A survey on deep reinforcement learning for audio-based applications

S Latif, H Cuayáhuitl, F Pervez, F Shamshad… - Artificial Intelligence …, 2023 - Springer

Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence
(AI) by endowing autonomous systems with high levels of understanding of the real world …

被引用次数：69 相关文章所有 10 个版本

[PDF] mlr.press

Learn2assemble with structured representations and search for robotic architectural construction

N Funk, G Chalvatzaki, B Belousov… - Conference on Robot …, 2022 - proceedings.mlr.press

Autonomous robotic assembly requires a well-orchestrated sequence of high-level actions
and smooth manipulation executions. Learning to assemble complex 3D structures remains …

被引用次数：43 相关文章所有 3 个版本

[PDF] arxiv.org

Robot learning of mobile manipulation with reachability behavior priors

S Jauhri, J Peters, G Chalvatzaki - IEEE Robotics and …, 2022 - ieeexplore.ieee.org

Mobile Manipulation (MM) systems are ideal candidates for taking up the role of personal
assistants in unstructured real-world environments. Among other challenges, Mobile …

被引用次数：35 相关文章所有 3 个版本

[PDF] mlr.press

Robust reinforcement learning using least squares policy iteration with provable performance guarantees

KP Badrinath, D Kalathil - International Conference on …, 2021 - proceedings.mlr.press

This paper addresses the problem of model-free reinforcement learning for Robust Markov
Decision Process (RMDP) with large state spaces. The goal of the RMDPs framework is to …

被引用次数：67 相关文章所有 6 个版本

[PDF] mlr.press

Robot reinforcement learning on the constraint manifold

P Liu, D Tateo, HB Ammar… - Conference on Robot …, 2022 - proceedings.mlr.press

Reinforcement learning in robotics is extremely challenging due to many practical issues,
including safety, mechanical constraints, and wear and tear. Typically, these issues are not …

被引用次数：40 相关文章所有 5 个版本

[PDF] arxiv.org

Value iteration in continuous actions, states and time

M Lutter, S Mannor, J Peters, D Fox, A Garg - arXiv preprint arXiv …, 2021 - arxiv.org

Classical value iteration approaches are not applicable to environments with continuous
states and actions. For such environments, the states and actions are usually discretized …

被引用次数：36 相关文章所有 4 个版本

高级搜索

QQ 群