所有版本 - 学术资源搜索

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

Configurable Safety Tuning of Language Models with Synthetic Preference Data

V Gallego - arXiv preprint arXiv:2404.00495, 2024 - arxiv.org

State-of-the-art language model fine-tuning techniques, such as Direct Preference
Optimization (DPO), restrict user control by hard-coding predefined behaviors into the …

被引用次数：5 相关文章

Configurable Safety Tuning of Language Models with Synthetic Preference Data

V Gallego - arXiv e-prints, 2024 - ui.adsabs.harvard.edu

State-of-the-art language model fine-tuning techniques, such as Direct Preference
Optimization (DPO), restrict user control by hard-coding predefined behaviors into the …

高级搜索

QQ 群

Configurable Safety Tuning of Language Models with Synthetic Preference Data

Configurable Safety Tuning of Language Models with Synthetic Preference Data

引用