Learning to drive via apprenticeship learning and deep reinforcement learning

R Rizdania, SH Riono, PU Rakhmawati… - Jurnal Inovasi Dan …, 2023 - journal.assyfa.com

Abstract The Digital Transformation Services Internship Program at PT Arkatama Multi
Solusindo aims to provide opportunities for intern students to develop themselves through …

被引用次数：24 相关文章所有 2 个版本

[PDF] arxiv.org

Goal-guided transformer-enabled reinforcement learning for efficient autonomous navigation

W Huang, Y Zhou, X He, C Lv - IEEE Transactions on Intelligent …, 2023 - ieeexplore.ieee.org

Despite some successful applications of goal-driven navigation, existing deep reinforcement
learning (DRL)-based approaches notoriously suffers from poor data efficiency issue. One of …

被引用次数：18 相关文章所有 3 个版本

[HTML] ieee-jas.net

Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects

Y Tong, H Liu, Z Zhang - IEEE/CAA Journal of Automatica …, 2024 - ieeexplore.ieee.org

This paper provides a comprehensive review of the current status, advancements, and future
prospects of humanoid robots, highlighting their significance in driving the evolution of next …

被引用次数：9 相关文章所有 3 个版本

AdaBoost maximum entropy deep inverse reinforcement learning with truncated gradient

L Song, D Li, X Wang, X Xu - Information Sciences, 2022 - Elsevier

Studying the representational capacity of neural networks to learn nonlinear rewards is
necessary in a complex and nonlinear environment. Over recent years, the maximum …

被引用次数：10 相关文章所有 2 个版本

Who is delivering my food? Detecting food delivery abusers using variational reward inference networks

DY Yoon, SS Woo - Proceedings of the 29th ACM international …, 2020 - dl.acm.org

The recent paramount success of the gig economy has introduced new business
opportunities in different areas such as food delivery service. However, there are food …

被引用次数：9 相关文章所有 2 个版本

具有优先级的深度确定性策略梯度算法在自动驾驶中的应用.

金彦亮，刘千红，季泽宇 - Journal of Shanghai University …, 2023 - search.ebscohost.com

深度确定性策略梯度(deep deterministic policy gradient, DDPG) 算法在自动驾驶领域中应用
广泛, 但DDPG 算法因采用均匀采样而导致低效率策略比例较高, 训练效率低, 收敛速度慢等 …

被引用次数：1 相关文章所有 3 个版本

逆强化学习算法, 理论与应用研究综述

宋莉，李大字，徐昕 - 自动化学报, 2023 - aas.net.cn

随着深度强化学习的研究与发展, 强化学习在博弈与优化决策, 智能驾驶等现实问题中的应用也
取得显著进展. 然而强化学习在智能体与环境的交互中存在人工设计奖励函数难的问题 …

Offline reinforcement learning via policy regularization and ensemble Q-functions

T Wang, S Xie, M Gao, X Chen… - 2022 IEEE 34th …, 2022 - ieeexplore.ieee.org

Offline reinforcement learning aims to learn effective policies from a fixed set of data
collected in advance and without further interaction with the environment during learning …

被引用次数：1 相关文章所有 2 个版本

[PDF] escholarship.org

[图书][B] Trustable Deep Reinforcement Learning with Efficient Data Utilization

ZM Poornaki - 2020 - search.proquest.com

We live in the era of big data in which the advancement of sensor and monitoring
technologies, data storage and management, and computer processing power enable us to …

[引用][C] Karyawan Magang: Pendampingan dan Penyuluhan Pada Proses Pengembangan Perangkat Lunak

SH Riono, PU Rakhmawati, R Darmayanti - Jurnal Inovasi Dan Pengembangan …, 2023

被引用次数：25 相关文章

高级搜索

QQ 群