相关文章- 学术资源搜索

Joint optimization of concave scalarized multi-objective reinforcement learning with policy gradient based algorithm

Q Bai, M Agarwal, V Aggarwal - Journal of Artificial Intelligence Research, 2022 - jair.org

Many engineering problems have multiple objectives, and the overall aim is to optimize a
non-linear function of these objectives. In this paper, we formulate the problem of …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Joint optimization of multi-objective reinforcement learning with policy gradient based algorithm

Q Bai, M Agarwal, V Aggarwal - arXiv preprint arXiv:2105.14125, 2021 - arxiv.org

Many engineering problems have multiple objectives, and the overall aim is to optimize a
non-linear function of these objectives. In this paper, we formulate the problem of …

被引用次数：8 相关文章所有 5 个版本

[PDF] vt.edu

Sample Complexity of Incremental Policy Gradient Methods for Solving Multi-Task Reinforcement Learning

Y Bai - 2024 - vtechworks.lib.vt.edu

We consider a multi-task learning problem, where an agent is presented a number of N
reinforcement learning tasks. To solve this problem, we are interested in studying the …

Finite-time complexity of incremental policy gradient methods for solving multi-task reinforcement learning

Y Bai, T Doan - 6th Annual Learning for Dynamics & Control …, 2024 - proceedings.mlr.press

We consider a multi-task learning problem, where an agent is presented a number of $ N $
reinforcement learning tasks. To solve this problem, we are interested in studying the …

A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis

M Amidzadeh - 2023 62nd IEEE Conference on Decision and …, 2023 - ieeexplore.ieee.org

Many sequential decision-making problems need optimization of different objectives which
possibly conflict with each other. The conventional way to deal with a multitask problem is to …

被引用次数：1 相关文章所有 4 个版本

[PDF] mlr.press

A hybrid stochastic policy gradient algorithm for reinforcement learning

N Pham, L Nguyen, D Phan… - International …, 2020 - proceedings.mlr.press

We propose a novel hybrid stochastic policy gradient estimator by combining an unbiased
policy gradient estimator, the REINFORCE estimator, with another biased one, an adapted …

被引用次数：25 相关文章所有 8 个版本

Multi-objective reinforcement learning method for acquiring all Pareto optimal policies simultaneously

Y Mukai, Y Kuroe, H Iima - 2012 IEEE international conference …, 2012 - ieeexplore.ieee.org

This paper studies multi-objective reinforcement learning problems in which an agent gains
multiple rewards. In ordinary multi-objective reinforcement learning methods, only a single …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

Solving the scalarization issues of Advantage-based Reinforcement Learning algorithms

FA Galatolo, MGCA Cimino, G Vaglini - Computers & Electrical Engineering, 2021 - Elsevier

In this research, some of the issues that arise from the scalarization of the multi-objective
optimization problem in the Advantage Actor–Critic (A2C) reinforcement learning algorithm …

被引用次数：3 相关文章所有 11 个版本

Multi-agent reinforcement learning for training and non-linear optimization

A Morcos, A West, B Maguire - Artificial Intelligence and …, 2022 - spiedigitallibrary.org

The field of Reinforcement Learning continues to show promise in solving old problems in
new and innovative ways. Thanks to the algorithms' ability to learn without an explicit set of …

被引用次数：2 相关文章所有 3 个版本

[PDF] vub.ac.be

[PDF][PDF] Adaptive objective selection for correlated objectives in multi-objective reinforcement learning

T Brys, K Van Moffaert, A Nowé… - Proceedings of the 2014 …, 2014 - ai.vub.ac.be

In this paper we introduce a novel scale-invariant and parameterless technique, called
adaptive objective selection, that allows a temporal-difference learning agent to exploit the …

被引用次数：5 相关文章所有 11 个版本

高级搜索

QQ 群