H Sun, Y Chen, S Wang, W Chen, X Deng - arXiv preprint arXiv …, 2024 - arxiv.org
Recent research on fine-tuning large language models (LLMs) through the aggregation of
multiple preferences has attracted considerable attention. However, the existing literature …