Related articles - Academic Resource Search

A critical analysis of metrics used for measuring progress in artificial intelligence

K Blagec, G Dorffner, M Moradi, M Samwald - arXiv preprint arXiv …, 2020 - arxiv.org

Comparing model performances on benchmark datasets is an integral part of measuring
and driving progress in artificial intelligence. A model's performance on a benchmark …

Cited by 27 · Related articles · All 4 versions

[PDF] arxiv.org

Why comparing single performance scores does not allow to draw conclusions about machine learning approaches

N Reimers, I Gurevych - arXiv preprint arXiv:1803.09578, 2018 - arxiv.org

Developing state-of-the-art approaches for specific tasks is a major driving force in our
research community. Depending on the prestige of the task, publishing it can come along …

Cited by 51 · Related articles · All 2 versions

[PDF] mlsys.org

Accounting for variance in machine learning benchmarks

X Bouthillier, P Delaunay, M Bronzi… - Proceedings of …, 2021 - proceedings.mlsys.org

Strong empirical evidence that one machine-learning algorithm A outperforms another one
B, ideally calls for multiple trials optimizing the learning pipeline over sources of variation …

Cited by 138 · Related articles · All 8 versions

[PDF] theoj.org

[PDF] PerMetrics: A framework of performance metrics for machine learning models

N Van Thieu - Journal of Open Source Software, 2024 - joss.theoj.org

Performance metrics are pivotal in machine learning field, especially for tasks like
regression, classification, and clustering (Saura, 2021). They offer quantitative measures to …

Cited by 1 · Related articles · All 4 versions

[PDF] aaai.org

Performance evaluation in machine learning: the good, the bad, the ugly, and the way forward

P Flach - Proceedings of the AAAI conference on artificial …, 2019 - aaai.org

This paper gives an overview of some ways in which our understanding of performance
evaluation measures for machine-learned classifiers has improved over the last twenty …

Cited by 164 · Related articles · All 8 versions

[HTML] nature.com Full View

[HTML] Mapping global dynamics of benchmark creation and saturation in artificial intelligence

S Ott, A Barbosa-Silva, K Blagec, J Brauner… - Nature …, 2022 - nature.com

Benchmarks are crucial to measuring and steering progress in artificial intelligence (AI).
However, recent studies raised concerns over the state of AI benchmarking, reporting issues …

Cited by 21 · Related articles · All 11 versions

[PDF] arxiv.org

Insights into performance fitness and error metrics for machine learning

MZ Naser, A Alavi - arXiv preprint arXiv:2006.00887, 2020 - arxiv.org

Machine learning (ML) is the field of training machines to achieve high level of cognition and
perform human-like analysis. Since ML is a data-driven approach, it seemingly fits into our …

Cited by 79 · Related articles · All 3 versions

[PDF] jmlr.org

[PDF] On over-fitting in model selection and subsequent selection bias in performance evaluation

GC Cawley, NLC Talbot - The Journal of Machine Learning Research, 2010 - jmlr.org

Abstract Model selection strategies for machine learning algorithms typically involve the
numerical optimisation of an appropriate model selection criterion, often based on an …

Cited by 2487 · Related articles · All 14 versions

[PDF] arxiv.org

Openml benchmarking suites

B Bischl, G Casalicchio, M Feurer, P Gijsbers… - arXiv preprint arXiv …, 2017 - arxiv.org

Machine learning research depends on objectively interpretable, comparable, and
reproducible algorithm benchmarks. We advocate the use of curated, comprehensive suites …

Cited by 113 · Related articles · All 9 versions

[HTML] oup.com Full View

MLcps: machine learning cumulative performance score for classification problems

A Akshay, M Abedi, N Shekarchizadeh… - …, 2023 - academic.oup.com

Background Assessing the performance of machine learning (ML) models requires careful
consideration of the evaluation metrics used. It is often necessary to utilize multiple metrics …

Cited by 2 · Related articles · All 10 versions
