Estimating Optimal Personalized Treatment Sequencing for Patients with Multiple Myeloma Using Reinforcement Learning

Author(s)

Tang F1, Haider M2, Dittmar A3, Simeone J4, Merinopoulou E5, Gupta A6, Fuchs A7
1Cytel Inc., Arlington, VA, USA, 2Cytel Inc., Toronto, ON, Canada, 3IPAM e.V., University of Wismar, Wismar, MV, Germany, 4Cytel Inc, Waltham, MA, USA, 5Cytel Inc., London, LON, UK, 6Cytel, Toronto, ON, Canada, 7AOK PLUS, Dresden, Saxony, Germany

OBJECTIVES: Multiple myeloma (MM) is a chronic disease with considerable patient heterogeneity, and the optimal treatment sequences for patients with MM are unclear. Reinforcement learning (RL) is a branch of machine learning for estimating optimal decision rules over time that can maximize a future outcome, such as improvement in patient health. While RL has been suggested for identifying optimal treatment sequences by learning from real-world data, its application in administrative claims research is limited. This study sought to use RL to identify patient specific optimal treatment sequences that maximize time to refraction or relapse among MM patients using claims data.

METHODS: This study used the AOK PLUS claims database, covering approximately 3.5 million insured persons in Saxony and Thuringia, Germany. Our approach involved a targeted literature review to identify features of MM patients that are clinically useful for tailoring treatment decisions. Treatment sequences were developed by utilizing MM specific therapies such as proteasome inhibitors and immunomodulators. The outcome of interest was the duration between diagnosis to refraction or relapse status, which was defined as the initiation of the third line of therapy. An RL framework with generalized survival random forest as an estimator was used to identify optimal treatment sequences.

RESULTS: A total of 1,844 patients (mean age: 72 years; 53% male) diagnosed with MM were eligible for inclusion into this study. Commonly occurring comorbidities (e.g., hypertension), symptoms (e.g., back pain) and procedures (e.g., MRI) among MM patients were identified. Preliminary empirical validation using simulated data showed that the method performed well at recovering the correct optimal treatment regimen. Analyses of real-world data are underway.

CONCLUSIONS: The established framework may provide a robust and clinically meaningful approach for estimating personalized treatment regimens that can improve patient outcomes and address heterogeneity in MM and similar disease settings using real-world claims data.

Conference/Value in Health Info

2023-11, ISPOR Europe 2023, Copenhagen, Denmark

Value in Health, Volume 26, Issue 11, S2 (December 2023)

Code

MSR107

Topic

Methodological & Statistical Research, Real World Data & Information Systems, Study Approaches

Topic Subcategory

Artificial Intelligence, Machine Learning, Predictive Analytics, Health & Insurance Records Systems

Disease

Drugs, Oncology

Explore Related HEOR by Topic


Your browser is out-of-date

ISPOR recommends that you update your browser for more security, speed and the best experience on ispor.org. Update my browser now

×