Detecting Generalized Pustular Psoriasis Exacerbations in Unstructured Clinical Data Using Deep Neural Networks
Author(s)
Kumar V1, Rasouliyan L1, Althoff AG1, Long S1, Zema C2, Rao MB1
1OMNY Health, Atlanta, GA, USA, 2Zema Consulting, Huntsville, AL, USA
OBJECTIVES:
Our objective was to identify generalized pustular psoriasis (GPP) patients experiencing exacerbations by analyzing the unstructured clinical patient data with machine learning (ML) methodologies.METHODS:
GPP patients were identified in 5 dermatology specialty networks using clinical codes. All notes for these patients were deidentified using a licensed software package. Valid notes were labeled as “exacerbation” if they were associated with certain Current Procedural Terminology codes or if the physician indicated worsening diagnosis statuses. All labeled notes were split into training, validation, and test sets. Notes were tokenized, further preprocessed by removing stopwords and numbers, and represented using publicly available pretrained word embeddings. An L2- and dropout-regularized deep neural network (DNN) consisting of 3 convolutional max-pooling layers followed by a dense neural layer was trained and validated on respective sets, and this model was used to predict exacerbation status on the test set. Results were compared with a string search model on preselected terms.RESULTS:
Valid unstructured clinical data was identified in 846/1298 GPP patients, representing 1630 distinct notes. The “exacerbation” label was given to 667 (41%) of the notes. The DNN model consisted of approximately 373 thousand trainable parameters. After 10 training epochs, test set accuracy was observed as 58%. Similar results were obtained with the string search model.CONCLUSIONS:
Although we were unable to observe higher than 58% accuracy with our DNN model, this analysis represents an important initial step in developing a more robust algorithm to identify GPP exacerbations. Lack of a validated ground truth label in real-world data could have contributed to this low observed accuracy. Further studies are required to clarify whether augmenting rare dermatologic disease data (e.g., GPP) with that of common dermatologic diseases (e.g., plaque psoriasis) or using more traditional ML algorithms would improve accuracy in identifying exacerbations.Conference/Value in Health Info
2022-05, ISPOR 2022, Washington, DC, USA
Value in Health, Volume 25, Issue 6, S1 (June 2022)
Code
MSR55
Topic
Methodological & Statistical Research, Real World Data & Information Systems, Study Approaches
Topic Subcategory
Artificial Intelligence, Machine Learning, Predictive Analytics, Distributed Data & Research Networks, Electronic Medical & Health Records
Disease
Sensory System Disorders