A comparison of random forest variable selection methods for classification prediction modeling

JL Speiser, ME Miller, J Tooze, E Ip - Expert Systems with Applications, 2019 - Elsevier
Random forest classification is a popular machine learning method for developing prediction
models in many research settings. Often in prediction modeling, a goal is to reduce the …

Hyperparameters and tuning strategies for random forest

P Probst, MN Wright… - … Reviews: data mining and …, 2019 - Wiley Online Library
The random forest (RF) algorithm has several hyperparameters that have to be set by the
user, for example, the number of observations drawn randomly for each tree and whether …
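The snippet above mentions hyperparameters such as the number of observations drawn for each tree and whether they are drawn with replacement. A minimal sketch of that per-tree sampling step, with hypothetical parameter names `sample_fraction` and `replace` chosen for illustration (they are not taken from the paper):

```python
import random

def draw_tree_sample(n_obs, sample_fraction=0.632, replace=True, rng=None):
    """Draw the observation indices used to grow one tree.

    sample_fraction: share of the n_obs observations drawn per tree.
    replace: True -> bootstrap sampling, False -> subsampling.
    """
    rng = rng or random.Random(0)
    k = max(1, round(sample_fraction * n_obs))
    if replace:
        return [rng.randrange(n_obs) for _ in range(k)]
    return rng.sample(range(n_obs), k)

# Each tree sees a different random subset of the training data.
bootstrap = draw_tree_sample(100, sample_fraction=1.0, replace=True)
subsample = draw_tree_sample(100, sample_fraction=0.5, replace=False)
```

With replacement, some observations repeat and others are left out (the out-of-bag cases); without replacement, the 50 drawn indices are all distinct.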

Benchmark for filter methods for feature selection in high-dimensional classification data

A Bommert, X Sun, B Bischl, J Rahnenführer… - … Statistics & Data Analysis, 2020 - Elsevier
Feature selection is one of the most fundamental problems in machine learning and has
drawn increasing attention due to high-dimensional data sets emerging from different fields …
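Filter methods of the kind benchmarked above score each feature independently of any learner and keep the top-ranked ones. A minimal sketch using absolute Pearson correlation with the class label as the score, one common filter criterion among the many the paper compares:

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def filter_rank(X, y):
    """Rank feature indices by |correlation with the label|, best first."""
    scores = [abs(pearson(col, y)) for col in zip(*X)]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)

# Feature 0 tracks the label exactly; feature 1 is only weakly related.
X = [[0, 1], [1, 0], [0, 1], [1, 1], [0, 0], [1, 0]]
y = [0, 1, 0, 1, 0, 1]
ranking = filter_rank(X, y)  # feature 0 ranked first
```

Because the score never consults a model, such filters stay cheap even for high-dimensional data, which is the setting the benchmark targets.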

Tunability: Importance of hyperparameters of machine learning algorithms

P Probst, AL Boulesteix, B Bischl - Journal of Machine Learning Research, 2019 - jmlr.org
Modern supervised machine learning algorithms involve hyperparameters that have to be
set before running them. Options for setting hyperparameters are default values from the …

To tune or not to tune the number of trees in random forest

P Probst, AL Boulesteix - Journal of Machine Learning Research, 2018 - jmlr.org
The number of trees T in the random forest (RF) algorithm for supervised learning has to be
set by the user. It is unclear whether T should simply be set to the largest computationally …
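Why more trees tend to help can be seen in a simplified analytic model (an illustration under an independence assumption, not the paper's actual analysis): if each of T trees votes correctly with probability p > 0.5 independently of the others, the majority vote errs with probability P(Binomial(T, 1-p) >= T/2), which shrinks as T grows.

```python
from math import comb

def majority_vote_error(T, p):
    """Error probability of a majority vote of T independent trees,
    each correct with probability p (odd T, so no ties)."""
    q = 1 - p
    # The vote is wrong when at least (T+1)//2 trees are wrong.
    return sum(comb(T, k) * q**k * p**(T - k)
               for k in range((T + 1) // 2, T + 1))

# Error for a single tree vs. small and large ensembles at p = 0.7.
errors = {T: majority_vote_error(T, 0.7) for T in (1, 11, 101)}
```

In this idealized setting the error only decreases with T, which is one side of the trade-off the paper examines against the computational cost of large forests.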

Visualizing the feature importance for black box models

G Casalicchio, C Molnar, B Bischl - … 10–14, 2018, Proceedings, Part I 18, 2019 - Springer
In recent years, a large number of model-agnostic methods to improve the transparency,
trustability, and interpretability of machine learning models have been developed. Based on …

Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach

C Molnar, G König, B Bischl, G Casalicchio - Data Mining and Knowledge …, 2024 - Springer
The interpretation of feature importance in machine learning models is challenging when
features are dependent. Permutation feature importance (PFI) ignores such dependencies …
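Permutation feature importance (PFI), mentioned above, scores a feature by how much the model's error grows when that feature's column is shuffled, severing its link to the target. A minimal sketch of plain PFI with a toy model; the conditional-subgroup refinement proposed in the paper for dependent features is not implemented here:

```python
import random

def permutation_importance(model, X, y, feature, rng):
    """Increase in mean squared error after shuffling one feature column."""
    def mse(rows):
        return sum((model(r) - t) ** 2 for r, t in zip(rows, y)) / len(y)

    base = mse(X)
    shuffled_col = [row[feature] for row in X]
    rng.shuffle(shuffled_col)
    X_perm = [row[:feature] + [v] + row[feature + 1:]
              for row, v in zip(X, shuffled_col)]
    return mse(X_perm) - base

# Toy model that uses only feature 0, so feature 1 gets zero importance.
model = lambda row: 2.0 * row[0]
X = [[float(i), float(i % 3)] for i in range(20)]
y = [2.0 * row[0] for row in X]
rng = random.Random(42)
imp0 = permutation_importance(model, X, y, 0, rng)  # positive
imp1 = permutation_importance(model, X, y, 1, rng)  # exactly zero
```

When features are dependent, shuffling one column produces unrealistic combinations of values, which is precisely the failure mode the paper's conditional approach addresses.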

AutoML in the age of large language models: Current challenges, future opportunities and risks

A Tornede, D Deng, T Eimer, J Giovanelli… - arXiv preprint arXiv …, 2023 - arxiv.org
The fields of both Natural Language Processing (NLP) and Automated Machine Learning
(AutoML) have achieved remarkable results over the past years. In NLP, especially Large …

OpenML benchmarking suites

B Bischl, G Casalicchio, M Feurer, P Gijsbers… - arXiv preprint arXiv …, 2017 - arxiv.org
Machine learning research depends on objectively interpretable, comparable, and
reproducible algorithm benchmarks. We advocate the use of curated, comprehensive suites …

Large-scale benchmark study of survival prediction methods using multi-omics data

M Herrmann, P Probst, R Hornung… - Briefings in …, 2021 - academic.oup.com
Multi-omics data, that is, datasets containing different types of high-dimensional molecular
variables, are increasingly often generated for the investigation of various diseases …