AUTOMATING STATISTICAL ANALYSIS PLAN DEVELOPMENT AND DEMOGRAPHIC DESCRIPTIVE ANALYSES IN CLINICAL TRIAL DATA USING GENERATIVE AI

Author(s)

Ritesh Dubey, PharmD, Shubhram Pandey, MSc, Rajdeep Kaur, PhD, Gagandeep Kaur, M.Pharm, Barinder Singh, RPh;
Pharmacoevidence, Mohali, India
OBJECTIVES: Healthcare decision-making increasingly relies on large and complex datasets, making traditional analytical approaches time-consuming and resource-intensive. This study evaluated whether a generative AI-based analytical framework could support the development of a statistical analysis plan (SAP) by generating SAP table shells and accurately populating them with study data, while producing analytically rigorous, reproducible, and regulatory-aligned outputs
METHODS: A generative AI-enabled analytical framework incorporating a large language model (LLM) was implemented to support structured analysis and the development of an SAP. Simulated individual patient-level clinical data (~10,000 records) were uploaded to the tool in a CSV format and pre-processed prior to analysis. SAP table shells were generated in accordance with predefined specifications, and plain-language user queries were translated into executable statistical code to produce descriptive analyses and populate SAP-aligned tables and visualisations. All outputs were independently reviewed by a human subject-matter expert (SME) to assess analytical accuracy, completeness, and reproducibility
RESULTS: The AI-based framework successfully generated 20 SAP table shells and produced complete descriptive statistics for approximately 20 predefined baseline variables across all table shells. SME validation confirmed the accuracy of all numerical outputs and statistical computations. Manual human refinement was required in approximately 3-4% of cases, where some tables and figures needed to be refined to align with SAP presentation standards. These refinements were addressed through iterative review, resulting in finalised SAP-ready outputs. Compared with conventional manual workflows, the AI-enabled approach substantially reduced analytical development and reporting timelines by almost 85%
CONCLUSIONS: The generative AI-based analytical framework demonstrated the ability to support SAP development by generating and populating SAP table shells with high analytical accuracy under continuous human oversight. These findings highlight the potential of AI-enabled analytics to enhance efficiency and reduce analytical timelines in healthcare research while maintaining methodological rigor

Conference/Value in Health Info

2026-05, ISPOR 2026, Philadelphia, PA, USA

Value in Health, Volume 29, Issue S6

Code

MSR243

Topic

Methodological & Statistical Research

Topic Subcategory

Artificial Intelligence, Machine Learning, Predictive Analytics

Disease

No Additional Disease & Conditions/Specialized Treatment Areas

Your browser is out-of-date

ISPOR recommends that you update your browser for more security, speed and the best experience on ispor.org. Update my browser now

×