Artificial Intelligence Applied on Administrative Big Data to Predict the Severity of SARS-COV-2 Infection

Author(s)

Iacolare B1, Perrone V1, Sangiorgi D2, Ghigi A1, Giacomini E1, Nappi C1, Paoli D1, Ancona DD3, Andretta M4, Barbieri A5, Bartolini F6, Cavaliere A7, Ciaccia A8, Citraro R9, Dell'Orco S10, Ferrante F11, Gentile S12, Grego S13, Procacci C3, Ubertazzo L14, Vercellone A15, Degli Espositi L16
1CliCon S.r.l. Health, Economics & Outcomes Research, Bologna, Italy, 2CliCon S.r.l. Health, Economics & Outcomes Research, Bologna, BO, Italy, 3ASL BAT, Trani, Italy, 4Azienda ULSS 8 Berica, Vicenza, Italy, 5ASL Vercelli, Vercelli, Italy, 6USL Umbria 2, Terni, Italy, 7ASL Viterbo, Viterbo, Italy, 8ASL Foggia, Foggia, Italy, 9Azienda ospedaliero-universitaria Mater Domini, Catanzaro, Italy, 10ASL Roma 6, Albano Laziale, Italy, 11ASL Frosinone, Frosinone, Italy, 12Direzione Generale per la Salute Regione Molise, Campobasso, Italy, 13ASL 3 Azienda sociosanitaria ligure 3, Genova, Italy, 14ASL Roma 4, Civitavecchia (RM), Italy, 15ASL Napoli 3 SUD, Torre del Greco, Italy, 16CliCon S.r.l. Health, Economics & Outcomes Research, Ravenna, Italy

Presentation Documents

OBJECTIVES. To estimate the prognostic factors underlying severity of Sars-Cov-2 infection using a machine learning approach.

METHODS. The analysis is based on administrative databases of Italian Entities. Patients who were hospitalized with COVID-19 diagnosis (ICD-9 078.89) after 1st January 2020 were included into the dataset together with 13 relevant features representing age, sex and clinical history of each patient. Each record was labelled as 0 (hospitalized patients) or 1 (patients in intensive care or deceased). KerasTuner was used to define the architecture of the Neural Network achieving good accuracy score. To identify prognostic factors underlying severity of Sars Cov-2 infection, feature’s importance was evaluated starting from a Random Forest Classifier.

RESULTS. The preliminary dataset built contains 10.448 records from 9.346 hospitalized patients. The selected neural network is made of 13 input nodes, each one representing a feature, 1024 nodes in the hidden layer, processing information that comes from the input layer, and 2 nodes in the output layer, each one representing a label to define patient’s condition. The neural network obtained was able to achieve 64% of accuracy on the testing set. The condition of approximately 2 out of 3 patients was correctly predicted just by analysing their features. The feature’s importance computed from the Random Forest Classifier indicated that patient’s age is the primary prognostic factor underlying severity of Sars Cov-2 infection. The combination of the other features slightly improved model’s performance.

CONCLUSIONS. The preliminary analysis shows that age is a prognostic factor of fundamental importance in defining the severity of Sars Cov-2 infection. The model obtained could be used to predict disease progression in patients most at risk by analysing their information in the databases. The model will be further improved through a process of feature selection to increase its accuracy and to allow the identification of other prognostic factors.

Conference/Value in Health Info

2021-11, ISPOR Europe 2021, Copenhagen, Denmark

Value in Health, Volume 24, Issue 12, S2 (December 2021)

Code

POSA310

Topic

Methodological & Statistical Research

Topic Subcategory

Artificial Intelligence, Machine Learning, Predictive Analytics

Disease

Infectious Disease (non-vaccine)

Explore Related HEOR by Topic


Your browser is out-of-date

ISPOR recommends that you update your browser for more security, speed and the best experience on ispor.org. Update my browser now

×