EVALUATING VARIABLE-SELECTION STRATEGIES FOR EXTERNAL CONTROL ARM STUDIES: A MONTE CARLO ASSESSMENT OF DIRECTED ACYCLIC GRAPH, PROGNOSTIC RANKING, AND MACHINE-LEARNING APPROACHES
Author(s)
Mostafa Shokoohi, PhD, Paul Spin, PhD;
EVERSANA, Burlington, ON, Canada
EVERSANA, Burlington, ON, Canada
OBJECTIVES: External control arm (ECA) studies rely on covariate adjustment, yet real-world evidence (RWE) studies often follow prognostic importance or machine learning (ML) approaches for variable selection rather than causal relevance. These practices may introduce structural bias, particularly when colliders are included. This study examined how different selection strategies impact treatment-effect estimation when comparing a single-arm trial (SAT) with an ECA.
METHODS: A total of 2,000 datasets, including an SAT (n=80) and an ECA (n=800), were simulated. Each dataset included two true confounders (c1,c2), two prognostic variables (p1,p2), an instrumental variable (z1), a collider (coll1), an unmeasured confounder (u1), and two noise variables. Treatment depended on confounders, unmeasured confounder, and instrumental variable. Outcome followed a logistic model including treatment, confounders, prognostic variables, and the unmeasured confounder. The true treatment effect was set at log(OR) = -0.431. Four selection strategies were evaluated: 1) a directed acyclic graph (DAG)-based set including only true confounders; 2) prognostic strength ranking based on outcome associations; 3) cross-validated logistic LASSO; and 4) random forest (RF) variable importance. For each dataset, treatment effects were estimated using naïve logistic regression, adjusted regression, and stabilized ATT inverse-probability weighting. Bias, RMSE, and 95% interval coverage were summarized.
RESULTS: DAG-based adjustment resulted in the lowest bias and RMSE, with coverage close to nominal levels (0.94). Prognostic ranking, LASSO, or RF approaches frequently included colliders or instrumental variables, leading to bias amplification (e.g., prognostic approach bias -0.243; LASSO bias -0.275) and reduced coverage. Weighted estimators improved performance (lower bias, greater coverage) but were less accurate (more deviation from true effect) than the DAG-based sets.
CONCLUSIONS: Adjustment sets grounded in causal theory, where back-door paths are blocked with minimal sufficient adjustment sets, yielded more accurate and stable estimates. These findings support the use of causal frameworks when designing ECAs and conducting comparative analyses in RWE research.
METHODS: A total of 2,000 datasets, including an SAT (n=80) and an ECA (n=800), were simulated. Each dataset included two true confounders (c1,c2), two prognostic variables (p1,p2), an instrumental variable (z1), a collider (coll1), an unmeasured confounder (u1), and two noise variables. Treatment depended on confounders, unmeasured confounder, and instrumental variable. Outcome followed a logistic model including treatment, confounders, prognostic variables, and the unmeasured confounder. The true treatment effect was set at log(OR) = -0.431. Four selection strategies were evaluated: 1) a directed acyclic graph (DAG)-based set including only true confounders; 2) prognostic strength ranking based on outcome associations; 3) cross-validated logistic LASSO; and 4) random forest (RF) variable importance. For each dataset, treatment effects were estimated using naïve logistic regression, adjusted regression, and stabilized ATT inverse-probability weighting. Bias, RMSE, and 95% interval coverage were summarized.
RESULTS: DAG-based adjustment resulted in the lowest bias and RMSE, with coverage close to nominal levels (0.94). Prognostic ranking, LASSO, or RF approaches frequently included colliders or instrumental variables, leading to bias amplification (e.g., prognostic approach bias -0.243; LASSO bias -0.275) and reduced coverage. Weighted estimators improved performance (lower bias, greater coverage) but were less accurate (more deviation from true effect) than the DAG-based sets.
CONCLUSIONS: Adjustment sets grounded in causal theory, where back-door paths are blocked with minimal sufficient adjustment sets, yielded more accurate and stable estimates. These findings support the use of causal frameworks when designing ECAs and conducting comparative analyses in RWE research.
Conference/Value in Health Info
2026-05, ISPOR 2026, Philadelphia, PA, USA
Value in Health, Volume 29, Issue S6
Code
MSR240
Topic
Methodological & Statistical Research
Topic Subcategory
Confounding, Selection Bias Correction, Causal Inference
Disease
No Additional Disease & Conditions/Specialized Treatment Areas