Joint optimization of concave scalarized multi-objective reinforcement learning with policy gradient based algorithm

Q Bai, M Agarwal, V Aggarwal - Journal of Artificial Intelligence Research, 2022 - jair.org
Many engineering problems have multiple objectives, and the overall aim is to optimize a
non-linear function of these objectives. In this paper, we formulate the problem of …

Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

Q Bai, M Agarwal, V Aggarwal - Journal of Artificial Intelligence …, 2022 - dl.acm.org
Many engineering problems have multiple objectives, and the overall aim is to optimize a
non-linear function of these objectives. In this paper, we formulate the problem of …

Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

Q Bai, M Agarwal, V Aggarwal - The Journal of Artificial …, 2022 - search.proquest.com
Many engineering problems have multiple objectives, and the overall aim is to optimize a
non-linear function of these objectives. In this paper, we formulate the problem of …

[PDF][PDF] Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

Q Bai, M Agarwal, V Aggarwal - Journal of Artificial Intelligence Research, 2022 - jair.org
Many engineering problems have multiple objectives, and the overall aim is to optimize a
non-linear function of these objectives. In this paper, we formulate the problem of …

[PDF][PDF] Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

Q Bai, M Agarwal, V Aggarwal - Journal of Artificial Intelligence …, 2022 - scholar.archive.org
Many engineering problems have multiple objectives, and the overall aim is to optimize a
non-linear function of these objectives. In this paper, we formulate the problem of …