Program

In-person AND virtual! – We are pioneering a new conference format that will connect in-person and virtual audiences to create a unique experience. Matching the innovation that comes through our members’ work, ISPOR is pushing the boundaries of innovation to design an event that works in today’s quickly changing environment.

In-person registration included the full virtual experience, and virtual-only attendees will be able to tune into live in-person sessions and/or watch captured in-person sessions on-demand in addition to having a variety of virtual-only sessions to attend.

Detecting Generalized Pustular Psoriasis Exacerbations in Unstructured Clinical Data Using Deep Neural Networks

Speaker(s)

Kumar V¹, Rasouliyan L¹, Althoff AG¹, Long S¹, Zema C², Rao MB¹
¹OMNY Health, Atlanta, GA, USA, ²Zema Consulting, Huntsville, AL, USA

OBJECTIVES: Our objective was to identify generalized pustular psoriasis (GPP) patients experiencing exacerbations by analyzing the unstructured clinical patient data with machine learning (ML) methodologies.

METHODS: GPP patients were identified in 5 dermatology specialty networks using clinical codes. All notes for these patients were deidentified using a licensed software package. Valid notes were labeled as “exacerbation” if they were associated with certain Current Procedural Terminology codes or if the physician indicated worsening diagnosis statuses. All labeled notes were split into training, validation, and test sets. Notes were tokenized, further preprocessed by removing stopwords and numbers, and represented using publicly available pretrained word embeddings. An L2- and dropout-regularized deep neural network (DNN) consisting of 3 convolutional max-pooling layers followed by a dense neural layer was trained and validated on respective sets, and this model was used to predict exacerbation status on the test set. Results were compared with a string search model on preselected terms.

RESULTS: Valid unstructured clinical data was identified in 846/1298 GPP patients, representing 1630 distinct notes. The “exacerbation” label was given to 667 (41%) of the notes. The DNN model consisted of approximately 373 thousand trainable parameters. After 10 training epochs, test set accuracy was observed as 58%. Similar results were obtained with the string search model.

CONCLUSIONS: Although we were unable to observe higher than 58% accuracy with our DNN model, this analysis represents an important initial step in developing a more robust algorithm to identify GPP exacerbations. Lack of a validated ground truth label in real-world data could have contributed to this low observed accuracy. Further studies are required to clarify whether augmenting rare dermatologic disease data (e.g., GPP) with that of common dermatologic diseases (e.g., plaque psoriasis) or using more traditional ML algorithms would improve accuracy in identifying exacerbations.

Code

MSR55

Topic

Methodological & Statistical Research, Real World Data & Information Systems, Study Approaches

Topic Subcategory

Artificial Intelligence, Machine Learning, Predictive Analytics, Distributed Data & Research Networks, Electronic Medical & Health Records

Disease

Sensory System Disorders

ISPOR 2022

May 15-18, 2022