Hyperparameters and tuning strategies for random forest

P Probst, MN Wright… - … Reviews: data mining and …, 2019 - Wiley Online Library
The random forest (RF) algorithm has several hyperparameters that have to be set by the
user, for example, the number of observations drawn randomly for each tree and whether …

Bootstrapping the out-of-sample predictions for efficient and accurate cross-validation

I Tsamardinos, E Greasidou, G Borboudakis - Machine learning, 2018 - Springer
Abstract Cross-Validation (CV), and out-of-sample performance-estimation protocols in
general, are often employed both for (a) selecting the optimal combination of algorithms and …

Stability of clinical prediction models developed using statistical or machine learning methods

RD Riley, GS Collins - Biometrical Journal, 2023 - Wiley Online Library
Clinical prediction models estimate an individual's risk of a particular health outcome. A
developed model is a consequence of the development dataset and model‐building …

Machine learning methods to predict acute respiratory failure and acute respiratory distress syndrome

AKI Wong, PC Cheung, R Kamaleswaran… - Frontiers in big …, 2020 - frontiersin.org
Acute respiratory failure (ARF) is a common problem in medicine that utilizes significant
healthcare resources and is associated with high morbidity and mortality. Classification of …

Evaluation of random forests for short-term daily streamflow forecasting in rainfall-and snowmelt-driven watersheds

LT Pham, L Luo, A Finley - Hydrology and Earth System …, 2021 - hess.copernicus.org
In the past decades, data-driven machine-learning (ML) models have emerged as promising
tools for short-term streamflow forecasting. Among other qualities, the popularity of ML …

Approaches to regularized regression–a comparison between gradient boosting and the lasso

T Hepp, M Schmid, O Gefeller… - … of information in …, 2016 - thieme-connect.com
Background: Penalization and regularization techniques for statistical modeling have
attracted increasing attention in biomedical research due to their advantages in the …

Predicting transcriptional responses to heat and drought stress from genomic features using a machine learning approach in rice

D Smet, H Opdebeeck, K Vandepoele - Frontiers in Plant Science, 2023 - frontiersin.org
Plants have evolved various mechanisms to adapt to adverse environmental stresses, such
as the modulation of gene expression. Expression of stress-responsive genes is controlled …

[HTML][HTML] Interaction forests: Identifying and exploiting interpretable quantitative and qualitative interaction effects

R Hornung, AL Boulesteix - Computational Statistics & Data Analysis, 2022 - Elsevier
Although interaction effects can be exploited to improve predictions and allow for valuable
insights into covariate interplay, they are given limited attention in analysis. Interaction …

[HTML][HTML] Machine learning using random forest to model heavy metals removal efficiency using a zeolite-embedded sheet in water

ND Takarina, N Matsue, E Johan, A Adiwibowo… - Global Journal of …, 2024 - gjesm.net
BACKGROUND AND OBJECTIVES: Zeolite has been recognized as a potential adsorbent
for heavy metals in water. The form of zeolite that is generally available in powder has …

[HTML][HTML] Parsimonious statistical learning models for low-flow estimation

J Laimighofer, M Melcher… - Hydrology and Earth …, 2022 - hess.copernicus.org
Statistical learning methods offer a promising approach for low-flow regionalization. We
examine seven statistical learning models (Lasso, linear, and nonlinear-model-based …