Utilizing Generative AI to Automate Model Selection and Network Meta-Analyses

Author(s)

Tim Reason, BSc, MSc1, Yunchou Wu, PhD1, Cheryl Jones, Phd2, Emma Benbow, PhD1, Kasper Johannesen, PhD3, Bill Malcolm, MSc4;
1Estima Scientific, London, United Kingdom, 2Estima-Scientific, London, United Kingdom, 3Bristol Myers-Squibb, Stockholm, Sweden, 4Bristol Myers-Squibb, Uxbridge, United Kingdom
OBJECTIVES: The automation of network meta-analyses (NMAs) through the use of large language models (LLMs) provides a significant opportunity for healthcare technology developers to optimize workflows critical for health technology assessment (HTA). This is particularly pertinent to ensure requirements of the joint clinical assessment (JCA) are fulfilled. The objective was to develop a system leveraging LLMs to automate substantial elements involved with conducting de novo NMAs.
METHODS: An automated system, utilising Claude 3.5 Sonnet [V2] LLM, was designed to process analysis-ready datasets. Based on each dataset, the LLM was prompted to select a suitable statistical model, write code and execute analyses, evaluate outputs, and interpret results. The automated results were validated by: 1) replicating examples from the National Institute for Health and Care Excellence (NICE) Technical Support Document (TSD 2); 2) reproducing results from two non-DSU published NMAs; and 3) generating and assessing comprehensive outputs, including heterogeneity, inconsistency, and convergence.
RESULTS: For all 14 TSD 2 examples (seven models, fixed and random effects), the LLM produced executable code without the need for human correction or intervention. The mean values produced were within 0.02 of the published results and credible interval limits were within the expected range. For non-DSU published NMA examples, the automated process replicated consistent results. Each model (DSU and non-DSU) was run five times and the LLM consistently selected the appropriate statistical model and generated correct results. Finally, the LLM successfully generated and interpreted heterogeneity, inconsistency, and convergence.
CONCLUSIONS: Starting with analysis-ready datasets, this study has demonstrated LLMs can execute essential steps required for de novo NMAs, paving the way for an automated NMA system. Automation of NMAs can streamline workflows and significantly reduce the amount of time and resource required. This is especially relevant when considering the extensive analyses to be completed within tight timeframes to comply with JCA requirements.

Conference/Value in Health Info

2025-05, ISPOR 2025, Montréal, Quebec, CA

Value in Health, Volume 28, Issue S1

Code

RWD66

Topic

Real World Data & Information Systems

Disease

No Additional Disease & Conditions/Specialized Treatment Areas

Your browser is out-of-date

ISPOR recommends that you update your browser for more security, speed and the best experience on ispor.org. Update my browser now

×