[HTML][HTML] Vehicle-to-grid for car sharing-A simulation study for 2030

N Wiedemann, Y Xin, V Medici, L Nespoli, E Suel… - Applied Energy, 2024 - Elsevier
The proliferation of car sharing services in recent years presents a promising avenue for
advancing sustainable transportation. Beyond merely reducing car ownership rates, these …

Relu to the rescue: Improve your on-policy actor-critic with positive advantages

A Jesson, C Lu, G Gupta, N Beltran-Velez… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper proposes a step toward approximate Bayesian inference in on-policy actor-critic
deep reinforcement learning. It is implemented through three changes to the Asynchronous …

Safe Reinforcement Learning as Wasserstein Variational Inference: Formal Methods for Interpretability

Y Wang, D Boyle - arXiv preprint arXiv:2307.07084, 2023 - arxiv.org
Reinforcement Learning or optimal control can provide effective reasoning for sequential
decision-making problems with variable dynamics. Such reasoning in practical …

FAWAC: Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning

P Koirala, Z Jiang, S Sarkar, C Fleming - arXiv preprint arXiv:2412.08880, 2024 - arxiv.org
Safe offline reinforcement learning aims to learn policies that maximize cumulative rewards
while adhering to safety constraints, using only offline data for training. A key challenge is …

[图书][B] Inverse dynamic game methods for identification of cooperative system behavior

J Jairo Inga Charaja - 2021 - library.oapen.org
This work addresses inverse dynamic games, which generalize the inverse problem of
optimal control, and where the aim is to identify cost functions based on observed optimal …

GAN-MPC: Training Model Predictive Controllers with Parameterized Cost Functions using Demonstrations from Non-identical Experts

R Burnwal, A Santara, NP Bhatt, B Ravindran… - arXiv preprint arXiv …, 2023 - arxiv.org
Model predictive control (MPC) is a popular approach for trajectory optimization in practical
robotics applications. MPC policies can optimize trajectory parameters under kinodynamic …

Deep Reinforcement Learning and Transfer Learning of Robot In-hand Dexterous Manipulation

Y Kuang - 2024 - publications.aston.ac.uk
In recent years, deep reinforcement learning (RL) and imitation learning (IL) have shown
remarkable success in many robotics areas. However, the domain of in-hand dexterous …

[PDF][PDF] Inverse reinforcement learning for robotic applications: hidden variables, multiple experts and unknown dynamics

KD Bogert - 2016 - getd.libs.uga.edu
Robots deployed into many real-world scenarios are expected to face situations that their
designers could not anticipate. Machine learning is an effective tool for extending the …

Neuro-algorithmic Policies for Discrete Planning

MV Pogančić, M Rolinek, G Martius - openreview.net
Although model-based and model-free approaches to learning the control of systems have
achieved impressive results on standard benchmarks, generalization to variations in the task …

[引用][C] From One to Infinity: New Algorithms for Reinforcement Learning and Inverse Reinforcement Learning

Y Chen - 2022 - ResearchSpace@ Auckland