The state of the art in enhancing trust in machine learning models with the use of visualizations

A Chatzimparmpas, RM Martins, I Jusufi… - Computer Graphics …, 2020 - Wiley Online Library
Abstract Machine learning (ML) models are nowadays used in complex applications in
various domains, such as medicine, bioinformatics, and other sciences. Due to their black …

The random forest algorithm for statistical learning

M Schonlau, RY Zou - The Stata Journal, 2020 - journals.sagepub.com
Random forests (Breiman, 2001, Machine Learning 45: 5–32) is a statistical-or machine-
learning algorithm for prediction. In this article, we introduce a corresponding new …

[HTML][HTML] Data science, machine learning and big data in digital journalism: a survey of state-of-the-art, challenges and opportunities

E Fernandes, S Moro, P Cortez - Expert Systems with Applications, 2023 - Elsevier
Digital journalism has faced a dramatic change and media companies are challenged to use
data science algorithms to be more competitive in a Big Data era. While this is a relatively …

Feature inference attack on model predictions in vertical federated learning

X Luo, Y Wu, X Xiao, BC Ooi - 2021 IEEE 37th International …, 2021 - ieeexplore.ieee.org
Federated learning (FL) is an emerging paradigm for facilitating multiple organizations' data
collaboration without revealing their private data to each other. Recently, vertical FL, where …

Neural architecture search with bayesian optimisation and optimal transport

K Kandasamy, W Neiswanger… - Advances in neural …, 2018 - proceedings.neurips.cc
Bayesian Optimisation (BO) refers to a class of methods for global optimisation of a function f
which is only accessible via point evaluations. It is typically used in settings where f is …

Causal interpretations of black-box models

Q Zhao, T Hastie - Journal of Business & Economic Statistics, 2021 - Taylor & Francis
The fields of machine learning and causal inference have developed many concepts, tools,
and theory that are potentially useful for each other. Through exploring the possibility of …

Tuning hyperparameters without grad students: Scalable and robust bayesian optimisation with dragonfly

K Kandasamy, KR Vysyaraju, W Neiswanger… - Journal of Machine …, 2020 - jmlr.org
Bayesian Optimisation (BO) refers to a suite of techniques for global optimisation of
expensive black box functions, which use introspective Bayesian models of the function to …

Fastshap: Real-time shapley value estimation

N Jethani, M Sudarshan, IC Covert, SI Lee… - International …, 2021 - openreview.net
Although Shapley values are theoretically appealing for explaining black-box models, they
are costly to calculate and thus impractical in settings that involve large, high-dimensional …

Federated forest

Y Liu, Y Liu, Z Liu, Y Liang, C Meng… - … Transactions on Big …, 2020 - ieeexplore.ieee.org
Most real-world data are scattered across different companies or government organizations,
and cannot be easily integrated under data privacy and related regulations such as the …

FeatureSelect: a software for feature selection based on machine learning approaches

Y Masoudi-Sobhanzadeh, H Motieghader… - BMC …, 2019 - Springer
Background Feature selection, as a preprocessing stage, is a challenging problem in
various sciences such as biology, engineering, computer science, and other fields. For this …