Deep Bayesian Active Learning for Preference Modeling in Large Language Models

LC Melo, P Tigas, A Abate, Y Gal - arXiv preprint arXiv:2406.10023, 2024 - arxiv.org
Leveraging human preferences for steering the behavior of Large Language Models (LLMs)
has demonstrated notable success in recent years. Nonetheless, data selection and labeling …