F Yang, H Huang, W Shi, Y Ma, Y Feng… - Journal of Ambient …, 2023 - Springer
… reinforcement learning. Firstly, we generalize the problem of multi-objective reinforcement
learning, then Pareto optimization is applied to Q-learning to select an action and estimate the …
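
For illustration only, here is a minimal sketch of one common way to combine Pareto optimization with Q-learning. It is not taken from the paper, and the excerpt above does not give the authors' actual method. The sketch assumes each action has a vector of Q-values (one per objective, all maximized), picks an action from the non-dominated set, and bootstraps the update from a non-dominated next action. All names (`dominates`, `pareto_front`, `select_action`, `q_update`) and the parameters `epsilon`, `alpha`, and `gamma` are hypothetical.

```python
import random

def dominates(a, b):
    """True if vector a Pareto-dominates b (all objectives maximized)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

def pareto_front(q_vectors):
    """Indices of actions whose Q-vectors no other action dominates."""
    return [
        i for i, qi in enumerate(q_vectors)
        if not any(dominates(qj, qi) for j, qj in enumerate(q_vectors) if j != i)
    ]

def select_action(q_vectors, epsilon=0.1, rng=random):
    """Epsilon-greedy over the Pareto front (an assumed exploration scheme)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_vectors))      # explore: any action
    return rng.choice(pareto_front(q_vectors))    # exploit: a non-dominated action

def q_update(q_table, state, action, reward, next_state,
             alpha=0.1, gamma=0.9, rng=random):
    """One vector-valued Q-learning step.

    Assumption: the bootstrap target uses a randomly chosen
    non-dominated action in next_state.
    """
    next_action = rng.choice(pareto_front(q_table[next_state]))
    next_q = q_table[next_state][next_action]
    old_q = q_table[state][action]
    q_table[state][action] = [
        q + alpha * (r + gamma * qn - q)
        for q, r, qn in zip(old_q, reward, next_q)
    ]

if __name__ == "__main__":
    # Two states, two actions, two objectives (toy numbers, not from the paper).
    q_table = {0: [[0.0, 0.0], [0.0, 0.0]], 1: [[1.0, 1.0], [0.0, 0.0]]}
    q_update(q_table, state=0, action=0, reward=[1.0, 0.0], next_state=1,
             alpha=0.5, gamma=0.5)
    print(q_table[0][0])
```

Choosing uniformly at random among non-dominated actions is only one possible tie-breaking rule; scalarization or a preference ordering over objectives would be alternatives.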