Cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms

S Huang, RFJ Dossa, C Ye, J Braga… - Journal of Machine …, 2022 - jmlr.org
CleanRL is an open-source library that provides high-quality single-file implementations of
Deep Reinforcement Learning (DRL) algorithms. These single-file implementations are self …

Tianshou: A highly modularized deep reinforcement learning library

J Weng, H Chen, D Yan, K You, A Duburcq… - Journal of Machine …, 2022 - jmlr.org
In this paper, we present Tianshou, a highly modularized Python library for deep
reinforcement learning (DRL) that uses PyTorch as its backend. Tianshou intends to be …

Sharing knowledge in multi-task deep reinforcement learning

C D'Eramo, D Tateo, A Bonarini, M Restelli… - arXiv preprint arXiv …, 2024 - arxiv.org
We study the benefit of sharing representations among tasks to enable the effective use of
deep neural networks in Multi-Task Reinforcement Learning. We leverage the assumption …

Reinforcement learning in wind energy-a review

VL Narayanan - International Journal of Green Energy, 2023 - Taylor & Francis
Today's environmental concerns, particularly those related to global warming, have sparked
a drive for the usage of renewable energy sources. One of the most significant sources of …

A survey on deep reinforcement learning for audio-based applications

S Latif, H Cuayáhuitl, F Pervez, F Shamshad… - Artificial Intelligence …, 2023 - Springer
Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence
(AI) by endowing autonomous systems with high levels of understanding of the real world …

Learn2assemble with structured representations and search for robotic architectural construction

N Funk, G Chalvatzaki, B Belousov… - Conference on Robot …, 2022 - proceedings.mlr.press
Autonomous robotic assembly requires a well-orchestrated sequence of high-level actions
and smooth manipulation executions. Learning to assemble complex 3D structures remains …

Robot learning of mobile manipulation with reachability behavior priors

S Jauhri, J Peters, G Chalvatzaki - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
Mobile Manipulation (MM) systems are ideal candidates for taking up the role of personal
assistants in unstructured real-world environments. Among other challenges, Mobile …

Robust reinforcement learning using least squares policy iteration with provable performance guarantees

KP Badrinath, D Kalathil - International Conference on …, 2021 - proceedings.mlr.press
This paper addresses the problem of model-free reinforcement learning for Robust Markov
Decision Process (RMDP) with large state spaces. The goal of the RMDPs framework is to …

Robot reinforcement learning on the constraint manifold

P Liu, D Tateo, HB Ammar… - Conference on Robot …, 2022 - proceedings.mlr.press
Reinforcement learning in robotics is extremely challenging due to many practical issues,
including safety, mechanical constraints, and wear and tear. Typically, these issues are not …

Value iteration in continuous actions, states and time

M Lutter, S Mannor, J Peters, D Fox, A Garg - arXiv preprint arXiv …, 2021 - arxiv.org
Classical value iteration approaches are not applicable to environments with continuous
states and actions. For such environments, the states and actions are usually discretized …