Application of reinforcement learning and deep learning in multiple-input and multiple-output (MIMO) systems

M Naeem, G De Pietro, A Coronato - Sensors, 2021 - mdpi.com
The current wireless communication infrastructure has to face exponential development in
mobile traffic size, which demands high data rate, reliability, and low latency. MIMO systems …

Learning agile skills via adversarial imitation of rough partial demonstrations

C Li, M Vlastelica, S Blaes, J Frey… - … on Robot Learning, 2023 - proceedings.mlr.press
Learning agile skills is one of the main challenges in robotics. To this end, reinforcement
learning approaches have achieved impressive results. These methods require explicit task …

An ai-empowered infrastructure for risk prevention during medical examination

SIH Shah, M Naeem, G Paragliola, A Coronato… - Expert Systems with …, 2023 - Elsevier
A medical examination at Nuclear Medicine Department (NMD) carries out at multiple
stages. Patients are accompanied and guided by nurses during their movements within the …

Learning and assessing optimal dynamic treatment regimes through cooperative imitation learning

SIH Shah, A Coronato, M Naeem, G De Pietro - IEEE Access, 2022 - ieeexplore.ieee.org
Dynamic Treatment Regimes (DTRs) are sets of sequential decision rules that can be
adapted over time to treat patients with a specific pathology. DTR consists of alternative …

Human-Like Implicit Intention Expression for Autonomous Driving Motion Planning Based on Learning Human Intenion Priors

J Liu, X Qi, Y Ni, J Sun, P Hang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
One of the key factors determining whether autonomous vehicles (AVs) can be seamlessly
integrated into existing traffic systems is their ability to interact smoothly and efficiently with …

A study of inverse reinforcement learning and its implementation

C Zhang, G Jing, S Zuo… - … Symposium on Computer …, 2023 - spiedigitallibrary.org
When dealing with complex tasks, such as robots imitating human actions and autonomous
vehicles driving in urban environments, it can be difficult to determine the reward function of …