[HTML][HTML] Simulated autonomous driving using reinforcement learning: A comparative study on unity's ML-agents framework

Y Savid, R Mahmoudi, R Maskeliūnas, R Damaševičius - Information, 2023 - mdpi.com
Advancements in artificial intelligence are leading researchers to find use cases that were
not as straightforward to solve in the past. The use case of simulated autonomous driving …

Reconfigurable tolerant control of nonlinear Euler–Lagrange systems under actuator fault: A reinforcement learning-based fixed-time approach

M Mazare - Aerospace Science and Technology, 2023 - Elsevier
This paper presents a novel fixed-time adaptive Fault Tolerant Control (FTC) framework for
MIMO nonlinear Euler-Lagrange systems using sliding mode-based strategy, reinforcement …

Review on Fault Diagnosis and Fault-Tolerant Control Scheme for Robotic Manipulators: Recent Advances in AI, Machine Learning, and Digital Twin

MM Quamar, A Nasir - arXiv preprint arXiv:2402.02980, 2024 - arxiv.org
This comprehensive review article delves into the intricate realm of fault-tolerant control
(FTC) schemes tailored for robotic manipulators. Our exploration spans the historical …

Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback

T Bewley, J Lawry, A Richards - AIAA SCITECH 2024 Forum, 2024 - arc.aiaa.org
We propose a method to capture the handling abilities of fast jet pilots in a software model
via reinforcement learning (RL) from human preference feedback. We use pairwise …

Reinforcement learning-based fixed-time resilient control of nonlinear cyber physical systems under false data injection attacks and mismatch disturbances

M Mazare - Journal of the Franklin Institute, 2023 - Elsevier
This paper presents a fixed-time adaptive resilient control framework based on
reinforcement learning (RL), mismatch disturbance observer and nonsingular fast terminal …

TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient

X Lou, J Zhang, TJ Norman, K Huang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Multi-Agent Policy Gradient (MAPG) has made significant progress in recent years.
However, centralized critics in state-of-the-art MAPG methods still face the centralized …

Finite‐horizon Q‐learning for discrete‐time zero‐sum games with application to H∞ H _ ∞ control

M Liu, Q Cai, W Meng, D Li, M Fu - Asian Journal of Control, 2023 - Wiley Online Library
In this paper, we investigate the optimal control strategies for model‐free zero‐sum games
involving the H∞ H _ ∞ control. The key contribution is the development of a Q‐learning …

[HTML][HTML] Vehicle Simulation Algorithm for Observations with Variable Dimensions Based on Deep Reinforcement Learning

Y Liu, R Zhang, S Zhou - Electronics, 2023 - mdpi.com
Vehicle simulation algorithms play a crucial role in enhancing traffic efficiency and safety by
predicting and evaluating vehicle behavior in various traffic scenarios. Recently, vehicle …

Linear quadratic control using reinforcement learning and quadratic neural networks

S Asri - 2023 - spectrum.library.concordia.ca
This thesis focuses on the application of reinforcement learning (RL) techniques to design
optimal controllers and observers for linear time-invariant (LTI) systems, namely linear …

[PDF][PDF] Tree Models for Interpretable Agents

T Bewley, T Bewley - AI (expert), 2012 - research-information.bris.ac.uk
As progress in AI impacts all sectors of society, the world is destined to see increasingly
complex and numerous autonomous decision-making agents, which act upon their …