A Comparison of Machine Learning (ML) Algorithms Performance Against Medical Professionals in Diagnosing LUNG Cancer: A Targeted Review and Meta-Analysis
Author(s)
Mangat G1, Singh B2
1Parexel International, Chandigarh, CH, India, 2Parexel International, Mohali, India
OBJECTIVES : The purpose of our review was to assess the performance of ML algorithms in detecting lung cancer and compare it against healthcare professionals. METHODS : In this review, we searched for peer-reviewed journal publications that developed an ML model for lung cancer diagnosis and additionally compared the performance indices made by algorithms and physicians, including radiologists. Due to different cutoff thresholds, the meta-analysis using inverse variance weighting was conducted with the ROC curve metric only. RESULTS : A total of 11 studies were identified that compared the sensitivity, specificity, diagnostic effectiveness, and/or AUC. These studies retrospectively collected data and were published between 2017 and 2020. The method for predictor measurement was consistent with 9/11 (82%) studies investigating computed tomography and one study, each inspecting chest radiograph, and histopathology images. Seven studies used varying models of expert consensus, and four used histopathology as reference standards. Within deep learning models, all studies evaluated the performance of the convolutional neural network. The sensitivity and specificity of neural networks ranged 0.628-0.96 and 0.778-0.98, while clinicians reported 0.683-0.90 and 0.628-0.993, respectively. The average diagnostic effectiveness of human observation was 75.45%, which improved to 81.05% with the aid of neural networks. The AUC scores ranged from 0.771-0.997, while the pooled-effect estimates were 0.925 (95% CI: 0.924-0.926) and 0.907 (95% CI: 0.858-0.956) for fixed and random effects models, respectively. The high AUC value of 0.907 (preferred due to high I2) signifies neural networks an excellent classifier, with a 90.7% chance that it will correctly distinguish a normal from a lung cancer affected patient. CONCLUSIONS : Based on the findings of our review and exploratory meta-analysis, we can cautiously state that the diagnostic performance of neural networks is at least equivalent to healthcare experts. However, additional studies considering the integration of such algorithms are needed to support the implementation of this promising technology further.
Conference/Value in Health Info
2020-09, ISPOR Asia Pacific 2020, Seoul, South Korea
Value in Health Regional, Volume 22S (September 2020)
Code
PCN93
Topic
Medical Technologies, Methodological & Statistical Research
Topic Subcategory
Artificial Intelligence, Machine Learning, Predictive Analytics, Diagnostics & Imaging
Disease
Oncology