APPLYING ARTIFICIAL INTELLIGENCE TO REAL-WORLD DATA IN NSCLC - TRANSPARENT AND REPRODUCIBLE METHODS FOR PRECISION EVIDENCE
Author(s)
Ankit Pahwa, MS1, Nathalie A. Waser, PhD2, Banaz Al-khalidi, BSc, MSc, PhD3;
1ICON plc, Biostatistician, Bengaluru, India, 2ICON, Vancouver, BC, Canada, 3ICON plc, Toronto, ON, Canada
1ICON plc, Biostatistician, Bengaluru, India, 2ICON, Vancouver, BC, Canada, 3ICON plc, Toronto, ON, Canada
OBJECTIVES: To outline methodological strategies for applying artificial intelligence (AI) and machine learning (ML) to real‑world data (RWD) in non-small‑cell lung cancer (NSCLC) to improve stratification based on molecular biomarkers and clinical characteristics for treatment selection while meeting credibility standards for real‑world evidence (RWE), including prespecified protocols, transparent reporting, and reproducibility requirements outlined by FDA, NICE, and ISPOR/ISPE frameworks.
METHODS: We outline a structured framework for applying AI/ML to NSCLC real-world data, guided by regulatory and methodological standards. Key components include:
RESULTS: With rigorous design and validation, AI/ML models such as gradient boosting and regularized regression applied to curated NSCLC RWD (e.g., EHR and registry data) can achieve discrimination and calibration suitable for decision support. The framework highlights performance metrics (e.g., area under the curve, calibration curves) and visualizations showing how clinical and biomarker data inform treatment decisions. Published evaluations of AI/ML models in oncology show that open, well-documented workflows, predefined analysis plans, and model diagnostics facilitate credibility and clinical interpretation without reliance on proprietary tools.
CONCLUSIONS: AI‑enabled analyses of NSCLC RWD can complement evidence from randomized trials by providing timely, context‑specific insights to support personalized treatment decisions. Adherence to recognized frameworks and reporting standards strengthens validity, reproducibility, and confidence, supporting responsible translation of AI into clinical decision‑making.
METHODS: We outline a structured framework for applying AI/ML to NSCLC real-world data, guided by regulatory and methodological standards. Key components include:
- Prespecified analysis protocols defining cohort selection, covariates, and endpoints;
- Documentation of data provenance and quality checks;
- Bias mitigation strategies (e.g., confounding control, handling missingness);
- Model development principles emphasizing internal validation and interpretability;
- Transparent reporting to support reproducibility.
RESULTS: With rigorous design and validation, AI/ML models such as gradient boosting and regularized regression applied to curated NSCLC RWD (e.g., EHR and registry data) can achieve discrimination and calibration suitable for decision support. The framework highlights performance metrics (e.g., area under the curve, calibration curves) and visualizations showing how clinical and biomarker data inform treatment decisions. Published evaluations of AI/ML models in oncology show that open, well-documented workflows, predefined analysis plans, and model diagnostics facilitate credibility and clinical interpretation without reliance on proprietary tools.
CONCLUSIONS: AI‑enabled analyses of NSCLC RWD can complement evidence from randomized trials by providing timely, context‑specific insights to support personalized treatment decisions. Adherence to recognized frameworks and reporting standards strengthens validity, reproducibility, and confidence, supporting responsible translation of AI into clinical decision‑making.
Conference/Value in Health Info
2026-05, ISPOR 2026, Philadelphia, PA, USA
Value in Health, Volume 29, Issue S6
Code
RWD85
Topic
Real World Data & Information Systems
Topic Subcategory
Reproducibility & Replicability
Disease
No Additional Disease & Conditions/Specialized Treatment Areas, SDC: Oncology