Hyperparameters and tuning strategies for random forest

P Probst, MN Wright… - … Reviews: data mining and …, 2019 - Wiley Online Library
The random forest (RF) algorithm has several hyperparameters that have to be set by the
user, for example, the number of observations drawn randomly for each tree and whether …

A survey of predictive modeling on imbalanced domains

P Branco, L Torgo, RP Ribeiro - ACM computing surveys (CSUR), 2016 - dl.acm.org
Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …

DenseSPH-YOLOv5: An automated damage detection model based on DenseNet and Swin-Transformer prediction head-enabled YOLOv5 with attention mechanism

AM Roy, J Bhaduri - Advanced Engineering Informatics, 2023 - Elsevier
Objective. Computer vision-based up-to-date accurate damage classification and
localization are of decisive importance for infrastructure monitoring, safety, and the …

WilDect-YOLO: An efficient and robust computer vision-based accurate object localization model for automated endangered wildlife detection

AM Roy, J Bhaduri, T Kumar, K Raj - Ecological Informatics, 2023 - Elsevier
Objective. With climatic instability, various ecological disturbances, and human actions
threaten the existence of various endangered wildlife species. Therefore, an up-to-date …

[HTML][HTML] The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

D Chicco, G Jurman - BMC genomics, 2020 - Springer
Background To evaluate binary classifications and their confusion matrices, scientific
researchers can employ several statistical rates, accordingly to the goal of the experiment …

Probabilistic extension of precision, recall, and f1 score for more thorough evaluation of classification models

R Yacouby, D Axman - Proceedings of the first workshop on …, 2020 - aclanthology.org
In pursuit of the perfect supervised NLP classifier, razor thin margins and low-resource test
sets can make modeling decisions difficult. Popular metrics such as Accuracy, Precision …

[HTML][HTML] The impact of class imbalance in classification performance metrics based on the binary confusion matrix

A Luque, A Carrasco, A Martín, A de Las Heras - Pattern Recognition, 2019 - Elsevier
A major issue in the classification of class imbalanced datasets involves the determination of
the most suitable performance metrics to be used. In previous work using several examples …

[HTML][HTML] Evaluation of feature selection methods for text classification with small datasets using multiple criteria decision-making methods

G Kou, P Yang, Y Peng, F Xiao, Y Chen… - Applied Soft Computing, 2020 - Elsevier
The evaluation of feature selection methods for text classification with small sample datasets
must consider classification performance, stability, and efficiency. It is, thus, a multiple …

Tunability: Importance of hyperparameters of machine learning algorithms

P Probst, AL Boulesteix, B Bischl - Journal of Machine Learning Research, 2019 - jmlr.org
Modern supervised machine learning algorithms involve hyperparameters that have to be
set before running them. Options for setting hyperparameters are default values from the …

Reliable crowdsourcing and deep locality-preserving learning for expression recognition in the wild

S Li, W Deng, JP Du - … of the IEEE conference on computer …, 2017 - openaccess.thecvf.com
Past research on facial expressions have used relatively limited datasets, which makes it
unclear whether current methods can be employed in real world. In this paper, we present a …