Evaluation of machine learning methods for covariate data imputation in pharmacometrics

DS Bräm, U Nahum, A Atkinson… - CPT …, 2022 - Wiley Online Library
CPT: Pharmacometrics & Systems Pharmacology, 2022Wiley Online Library
Missing data create challenges in clinical research because they lead to loss of statistical
power and potentially to biased results. Missing covariate data must be handled with
suitable approaches to prepare datasets for pharmacometric analyses, such as population
pharmacokinetic and pharmacodynamic analyses. To this end, various statistical methods
have been widely adopted. Here, we introduce two machine‐learning (ML) methods
capable of imputing missing covariate data in a pharmacometric setting. Based on a …
Abstract
Missing data create challenges in clinical research because they lead to loss of statistical power and potentially to biased results. Missing covariate data must be handled with suitable approaches to prepare datasets for pharmacometric analyses, such as population pharmacokinetic and pharmacodynamic analyses. To this end, various statistical methods have been widely adopted. Here, we introduce two machine‐learning (ML) methods capable of imputing missing covariate data in a pharmacometric setting. Based on a previously published pharmacometric analysis, we simulated multiple missing data scenarios. We compared the performance of four established statistical methods, listwise deletion, mean imputation, standard multiple imputation (hereafter “Norm”), and predictive mean matching (PMM) and two ML based methods, random forest (RF) and artificial neural networks (ANNs), to handle missing covariate data in a statistically plausible manner. The investigated ML‐based methods can be used to impute missing covariate data in a pharmacometric setting. Both traditional imputation approaches and ML‐based methods perform well in the scenarios studied, with some restrictions for individual methods. The three methods exhibiting the best performance in terms of least bias for the investigated scenarios are the statistical method PMM and the two ML‐based methods RF and ANN. ML‐based approaches had comparable good results to the best performing established method PMM. Furthermore, ML methods provide added flexibility when encountering more complex nonlinear relationships, especially when associated parameters are suitably tuned to enhance predictive performance.
Wiley Online Library
以上显示的是最相近的搜索结果。 查看全部搜索结果