A new history experience replay design for model-free adaptive dynamic programming

F Li, Q Jiang, S Zhang, M Wei, R Song - Neurocomputing, 2019 - Elsevier

Uncertain factors in environments restrict the intelligence level of industrial robots. Based on
deep reinforcement learning, a skill-acquisition method is used to solve the posed problems …

Cited by 116 · Related articles · All 3 versions

A computationally efficient optimization approach for battery systems in islanded microgrid

A Das, Z Ni - IEEE Transactions on Smart Grid, 2017 - ieeexplore.ieee.org

In islanded microgrids, it is a challenge to optimize battery energy storage systems (BESSs)
with other power supply units (e.g., renewable energy and traditional power generator) and …

Cited by 91 · Related articles · All 2 versions

An improved dueling deep double-q network based on prioritized experience replay for path planning of unmanned surface vehicles

Z Zhu, C Hu, C Zhu, Y Zhu, Y Sheng - Journal of Marine Science and …, 2021 - mdpi.com

Unmanned Surface Vehicle (USV) has a broad application prospect and autonomous path
planning as its crucial technology has developed into a hot research direction in the field of …

Cited by 27 · Related articles · All 5 versions

Action-Dependent Heuristic Dynamic Programming With Experience Replay for Wastewater Treatment Processes

J Qiao, M Zhao, D Wang, M Li - IEEE Transactions on Industrial …, 2024 - ieeexplore.ieee.org

The wastewater treatment process (WWTP) is beneficial for maintaining sufficient water
resources and recycling wastewater. A crucial link of WWTP is to ensure that the dissolved …

Cited by 7 · Related articles

Skill learning for robotic assembly based on visual perspectives and force sensing

R Song, F Li, W Quan, X Yang, J Zhao - Robotics and Autonomous Systems, 2021 - Elsevier

An environment cannot be effectively described with a single perception form in skill
learning for robotic assembly. The visual perception may provide the object's apparent …

Cited by 30 · Related articles

Prioritizing useful experience replay for heuristic dynamic programming-based learning systems

Z Ni, N Malla, X Zhong - IEEE Transactions on Cybernetics, 2018 - ieeexplore.ieee.org

The adaptive dynamic programming controller usually needs a long training period because
the data usage efficiency is relatively low by discarding the samples once used. Prioritized …

Cited by 46 · Related articles · All 3 versions

Online event-triggered optimal control for multi-agent systems using simplified ADP and experience replay technique

Y Xu, T Li, W Bai, Q Shan, L Yuan, Y Wu - Nonlinear Dynamics, 2021 - Springer

This paper studies an optimal control problem for multi-agent systems under adaptive
dynamic programming (ADP) framework. To overcome the restrictions resulting from the …

Cited by 15 · Related articles · All 3 versions

Optimal control for earth pressure balance of shield machine based on action-dependent heuristic dynamic programming

X Liu, S Xu, Y Huang - ISA Transactions, 2019 - Elsevier

Earth pressure balance (EPB) shield has been widely used in underground construction.
The excavation face stability is crucial to avoid the accidents caused by EPB shield …

Cited by 16 · Related articles · All 3 versions

Event-driven-modular adaptive backstepping optimal control for strict-feedback systems through zero-sum differential games

Y Ji, H Zhou, B Bai - IEEE Access, 2020 - ieeexplore.ieee.org

This paper addresses the event-driven-modular optimal tracking control problem for
nonlinear strict-feedback systems with external disturbances. Through the backstepping …

Cited by 9 · Related articles · All 3 versions

Generative models for learning robot manipulation skills from humans

AK Tanwani - 2018 - infoscience.epfl.ch

A long standing goal in artificial intelligence is to make robots seamlessly interact with
humans in performing everyday manipulation skills. Learning from demonstrations or …

Cited by 16 · Related articles · All 4 versions
