Interns: Mentoring and Counseling on the Software Development Process

R Rizdania, SH Riono, PU Rakhmawati… - Jurnal Inovasi Dan …, 2023 - journal.assyfa.com
Abstract The Digital Transformation Services Internship Program at PT Arkatama Multi
Solusindo aims to provide opportunities for intern students to develop themselves through …

Goal-guided transformer-enabled reinforcement learning for efficient autonomous navigation

W Huang, Y Zhou, X He, C Lv - IEEE Transactions on Intelligent …, 2023 - ieeexplore.ieee.org
Despite some successful applications of goal-driven navigation, existing deep reinforcement
learning (DRL)-based approaches notoriously suffers from poor data efficiency issue. One of …

Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects

Y Tong, H Liu, Z Zhang - IEEE/CAA Journal of Automatica …, 2024 - ieeexplore.ieee.org
This paper provides a comprehensive review of the current status, advancements, and future
prospects of humanoid robots, highlighting their significance in driving the evolution of next …

AdaBoost maximum entropy deep inverse reinforcement learning with truncated gradient

L Song, D Li, X Wang, X Xu - Information Sciences, 2022 - Elsevier
Studying the representational capacity of neural networks to learn nonlinear rewards is
necessary in a complex and nonlinear environment. Over recent years, the maximum …

Who is delivering my food? Detecting food delivery abusers using variational reward inference networks

DY Yoon, SS Woo - Proceedings of the 29th ACM international …, 2020 - dl.acm.org
The recent paramount success of the gig economy has introduced new business
opportunities in different areas such as food delivery service. However, there are food …

具有优先级的深度确定性策略梯度算法在自动驾驶中的应用.

金彦亮, 刘千红, 季泽宇 - Journal of Shanghai University …, 2023 - search.ebscohost.com
深度确定性策略梯度(deep deterministic policy gradient, DDPG) 算法在自动驾驶领域中应用
广泛, 但DDPG 算法因采用均匀采样而导致低效率策略比例较高, 训练效率低, 收敛速度慢等 …

逆强化学习算法, 理论与应用研究综述

宋莉, 李大字, 徐昕 - 自动化学报, 2023 - aas.net.cn
随着深度强化学习的研究与发展, 强化学习在博弈与优化决策, 智能驾驶等现实问题中的应用也
取得显著进展. 然而强化学习在智能体与环境的交互中存在人工设计奖励函数难的问题 …

Offline reinforcement learning via policy regularization and ensemble Q-functions

T Wang, S Xie, M Gao, X Chen… - 2022 IEEE 34th …, 2022 - ieeexplore.ieee.org
Offline reinforcement learning aims to learn effective policies from a fixed set of data
collected in advance and without further interaction with the environment during learning …

[图书][B] Trustable Deep Reinforcement Learning with Efficient Data Utilization

ZM Poornaki - 2020 - search.proquest.com
We live in the era of big data in which the advancement of sensor and monitoring
technologies, data storage and management, and computer processing power enable us to …

[引用][C] Karyawan Magang: Pendampingan dan Penyuluhan Pada Proses Pengembangan Perangkat Lunak

SH Riono, PU Rakhmawati, R Darmayanti - Jurnal Inovasi Dan Pengembangan …, 2023