Using Chat-GPT for Completing Quality Assessment Questionnaires in Systematic Literature Reviews

Author(s)

Aggarwal S¹, Kumar S², Topaloglu O³
¹NOVEL Health Strategies, Chevy Chase, MD, USA, ²NOVEL HEALTH STRATEGIES, COLUMBIA, MD, USA, ³NOVEL Health Strategies, Bethesda, MD, USA

OBJECTIVES: The objective of this study was to conduct a feasibility assessment for using Chat-GPT to complete Quality Assessment (QA) questionnaires.

METHODS: A custom Chat-GPT-4 application was developed in Python 3 to take inputs from Rinvoq study/trial reports and the ROB 2.0 questionnaire to produce a completed assessment. The questionnaires were also completed by experienced systematic review experts. The two sets of questionnaires were compared to assess accuracy and estimate time savings. All data were reviewed to identify opportunities and limitations in using generative artificial intelligence (AI) for completing QA tools.

RESULTS: The custom Chat-GPT application's output for QA tools was 90-95% accurate. The quantitative questions were completed with 95-100% accuracy. The time savings for completing the ROB 2.0 questionnaire averaged 15.5 hours per study. However, Chat-GPT was unable to successfully complete two subjective questions related to methods for analysis and reporting of outcomes. These subjective questions required highly experienced HEOR experts to review study reports and search for additional supporting documents to correctly complete those sections. Overall, Chat-GPT shows promise in saving significant time and resources.

CONCLUSIONS: Custom Chat-GPT applications have the potential to save significant time, effort, and monetary resources. However, outputs from these AI tools need to be corrected and/or validated by experienced HEOR experts.

Conference/Value in Health Info

2024-11, ISPOR Europe 2024, Barcelona, Spain

Value in Health, Volume 27, Issue 12, S2 (December 2024)

Code

MSR173

Topic

Clinical Outcomes, Methodological & Statistical Research, Study Approaches

Topic Subcategory

Artificial Intelligence, Machine Learning, Predictive Analytics, Clinical Outcomes Assessment, Comparative Effectiveness or Efficacy, Literature Review & Synthesis

Disease

No Additional Disease & Conditions/Specialized Treatment Areas

Explore Related HEOR by Topic

Presentation