Navigating the metrics maze: Reconciling score magnitudes and accuracies

T Kocmi, V Zouhar, C Federmann, M Post - arXiv preprint arXiv …, 2024 - arxiv.org

Ten years ago a single metric, BLEU, governed progress in machine translation research.
For better or worse, there is no such consensus today, and consequently it is difficult for
researchers to develop and retain the kinds of heuristic intuitions about metric deltas that
drove earlier research and deployment decisions. This paper investigates the "dynamic
range" of a number of modern metrics in an effort to provide a collective understanding of the
meaning of differences in scores both within and among metrics; in other words, we ask …

Cited by: 20. Versions: 2.

[Citation] Navigating the metrics maze: Reconciling score magnitudes and accuracies. arXiv preprint

T Kocmi, V Zouhar, C Federmann, M Post - arXiv preprint arXiv:2401.06760, 2024

Cited by: 2
