G Xie, J Xu, Y Yang, S Zhang - arXiv preprint arXiv:2409.02428, 2024 - arxiv.org
Leveraging large language models (LLMs) for designing reward functions demonstrates
significant potential. However, achieving effective design and improvement of reward …