Navigating the pitfalls of applying machine learning in genomics

S Whalen, J Schreiber, WS Noble… - Nature Reviews Genetics, 2022 - nature.com
The scale of genetic, epigenomic, transcriptomic, cheminformatic and proteomic data
available today, coupled with easy-to-use machine learning (ML) toolkits, has propelled the …

Data-driven strategies for accelerated materials design

R Pollice, G dos Passos Gomes… - Accounts of Chemical …, 2021 - ACS Publications
Conspectus The ongoing revolution of the natural sciences by the advent of machine
learning and artificial intelligence sparked significant interest in the material science …

[图书][B] Feature engineering and selection: A practical approach for predictive models

M Kuhn, K Johnson - 2019 - taylorfrancis.com
The process of developing predictive models includes many stages. Most resources focus
on the modeling algorithms but neglect other critical aspects of the modeling process. This …

[PDF][PDF] Cross-validation.

D Berrar - 2019 - dberrar.github.io
Cross-validation is one of the most widely used data resampling methods for model
selection and evaluation. Cross-validation can be used to tune the hyperparameters of …

[HTML][HTML] Stability of feature selection algorithm: A review

UM Khaire, R Dhanalakshmi - Journal of King Saud University-Computer …, 2022 - Elsevier
Feature selection technique is a knowledge discovery tool which provides an understanding
of the problem through the analysis of the most relevant features. Feature selection aims at …

Feature selection using bare-bones particle swarm optimization with mutual information

X Song, Y Zhang, D Gong, X Sun - Pattern Recognition, 2021 - Elsevier
Feature selection (FS) is an important data processing method in pattern recognition and
data mining. Due to not considering characteristics of the FS problem itself, traditional …

Clinical prediction models in psychiatry: a systematic review of two decades of progress and challenges

AJ Meehan, SJ Lewis, S Fazel, P Fusar-Poli… - Molecular …, 2022 - nature.com
Recent years have seen the rapid proliferation of clinical prediction models aiming to
support risk stratification and individualized care within psychiatry. Despite growing interest …

mixOmics: An R package for 'omics feature selection and multiple data integration

F Rohart, B Gautier, A Singh… - PLoS computational …, 2017 - journals.plos.org
The advent of high throughput technologies has led to a wealth of publicly available 'omics
data coming from different sources, such as transcriptomics, proteomics, metabolomics …

A survey on evolutionary computation approaches to feature selection

B Xue, M Zhang, WN Browne… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Feature selection is an important task in data mining and machine learning to reduce the
dimensionality of the data and increase the performance of an algorithm, such as a …

Towards CRISP-ML (Q): a machine learning process model with quality assurance methodology

S Studer, TB Bui, C Drescher, A Hanuschkin… - Machine learning and …, 2021 - mdpi.com
Machine learning is an established and frequently used technique in industry and
academia, but a standard process model to improve success and efficiency of machine …