Automated Report Generation with Chat GPT API for Cardiology: Performance Evaluation and Impact on Physician Burnout

Author(s)

Cossio M1, Gilardino R2
1Universitat de Barcelona, Barcelona, Spain, 2MSD, Dubendorf, ZH, Switzerland

Presentation Documents

OBJECTIVES: The presence of administrative tasks significantly contributes to physician burnout, impacting healthcare costs and compromising quality. Therefore, the development of tools to reduce clerical burden can optimize healthcare pathways. This study aimed to construct an automated report generator for a cardiology service using a language model and evaluate its performance using algorithmic and human metrics.

METHODS: A Guiding Multiple Choice and Field Completing System (MCFCS) was developed with Python 3.9. It integrated patient identification, gender, age, blood pressure, breath and heart sounds, and ECG/echocardiogram results. Variables were compiled using the Chat GPT API (text-davinci-003) to generate plain text reports with varying temperature values. Word count and generation time were measured. Evaluation involved cardiologist assessment and models such as BERT, BIO CLINICAL BERT, ROBERTA, and Chat GPT, measuring time and assigning scores (1-10). Additional information inclusion was also evaluated (score: 0-3).

RESULTS: Reports generated using the Chat GPT API demonstrated successful outcomes, with an average generation time of 8.7 seconds and an average word count of 70. Evaluation by cardiologists required an average of 22 seconds, yielding an average score of 8. Chat GPT performed closest to cardiologists, with an average score of 7.45 and an evaluation time of 0.13 seconds. In contrast, other models (BERT, BIO CLINICAL BERT, ROBERTA) took longer (above 1.6 seconds on average) and achieved inferior performance with scores of 3.2, 2.5, and 2.5, respectively. Notably, the highest score assigned by physicians was attained using a temperature value of 0.5 in the Chat GPT API.

CONCLUSIONS: The implementation of an automated reporting application enables the rapid generation of high-quality medical reports. Further advancements and refinements hold significant potential for alleviating the burdensome administrative tasks faced by physicians, thus mitigating burnout and optimizing overall healthcare system efficiency.

Conference/Value in Health Info

2023-11, ISPOR Europe 2023, Copenhagen, Denmark

Value in Health, Volume 26, Issue 11, S2 (December 2023)

Code

HSD81

Topic

Medical Technologies, Methodological & Statistical Research

Topic Subcategory

Artificial Intelligence, Machine Learning, Predictive Analytics

Disease

Cardiovascular Disorders (including MI, Stroke, Circulatory), No Additional Disease & Conditions/Specialized Treatment Areas

Explore Related HEOR by Topic


Your browser is out-of-date

ISPOR recommends that you update your browser for more security, speed and the best experience on ispor.org. Update my browser now

×