Comparing the Impact of Random Forest Vs Bayesian G-Computation on Matching-Adjusted Indirect Comparisons of Treatments Between Trials: A Simulation Study

Author(s)

Moradian H1, Tremblay G2, Heeg B2
1Cytel Inc., Coquitlam, Canada, 2Cytel Inc., Waltham, MA, USA

OBJECTIVES: Matching-adjusted indirect comparison (MAIC) is a common method of population-adjusted indirect treatment comparison between two studies. It uses a propensity score (PS) weighting approach which is sensitive to poor effect modifier (EM) overlap and small sample size of the index trial as reweighting often leads to significantly smaller effective sample sizes. G-computation is a marginalization method that can achieve more accurate estimates than MAIC when EM overlap is poor. Random forest (RF) is a non-parametric ensemble technique that averages outcomes from multiple decision trees and can weight patient characteristics based how many times any pair of subjects ends up in the same terminal nodes. This study aimed to evaluate and compare the convergence and fitting of RF and G-computation in sample sizes <100.

METHODS: Data were simulated for an anchored two-study comparison (AB and AC) with three treatment levels. The MAIC included five covariates: two EMs (age, time since diagnosis) and three prognostic variables (smoking status, race, sex). Weights were estimated to match the between-trial EM distributions. MAICs using RF, G-computation, and PS approaches were applied over 1,000 iterations.

RESULTS: MAIC with RF converged in all iterations (sample size = 50), while MAICs with PS and G-computation converged in zero and 34 out of 1,000 iterations, respectively. Mean absolute error (MAE, i.e., absolute difference between the point estimates of the log odds ratio of treatment C vs B) was significantly lower using the RF approach and their true value averaged over the converging G-computation iterations (MAE=0.94 vs. 1.42, p<0.001).

CONCLUSIONS: This simulated study demonstrated that RF was an alternative, highly accurate MAIC method when there is a small sample size and poor imbalance. Additional simulation and patient-level data studies should be conducted to explore results with varied sample sizes, sparse data, and number of covariates.

Conference/Value in Health Info

2023-11, ISPOR Europe 2023, Copenhagen, Denmark

Value in Health, Volume 26, Issue 11, S2 (December 2023)

Code

PT15

Topic

Clinical Outcomes, Methodological & Statistical Research, Study Approaches

Topic Subcategory

Artificial Intelligence, Machine Learning, Predictive Analytics, Comparative Effectiveness or Efficacy, Meta-Analysis & Indirect Comparisons

Disease

No Additional Disease & Conditions/Specialized Treatment Areas

Explore Related HEOR by Topic


Your browser is out-of-date

ISPOR recommends that you update your browser for more security, speed and the best experience on ispor.org. Update my browser now

×