Advancing the Development of Real-World Data for Cancer Care in China: Challenges and Opportunities
Author(s)
Wentian Lu, PhD1, Danyang Gui, MS1, Shuo Yang, PhD1, Jing Li, MSc1, Long Pang, MSc1, Can Chen, MSc1, Cheng Tong, MSc2, Yulan Guo, MSc2, Yang Xie, PhD1, Zheng Yin, PhD1;
1IQVIA Greater China Real World Evidence, Shanghai, China, 2Beijing Yiyong Technology Co., Ltd., Beijing, China
1IQVIA Greater China Real World Evidence, Shanghai, China, 2Beijing Yiyong Technology Co., Ltd., Beijing, China
Presentation Documents
OBJECTIVES: The utilization of secondary database in China has significantly advanced oncology research. We aimed to investigate the trends, current utilization and potentials of databases in oncology in China.
METHODS: A mixed method of systematic literature review and database interviews was applied. Database containing Chinese population were identified through systematic literature review of real-world studies (RWS). Articles in oncology published from January 2020 to December 2024 in PubMed/Medline, EMBASE, China National Knowledge Infrastructure and Wanfang were extracted. Databases were categorized into registries, single-/multi-site electronic medical record (EMR), claims, regional electronic health record (rEHR), and the National Anti-Tumor Drug Surveillance System (NATDSS). We also interviewed managers of the multi-center databases.
RESULTS: We identified 509 English and 145 Chinese RWS articles of oncology, containing 56.6% (n=370) used registries, 19.3% (n=126) used EMR, 6.7% (n=44) used claims, 0.8% (n=5) NATDSS and 0.6% (n=4) used rEHR. Databases in China have been used to describe disease epidemiology and natural history (43.4%; 284/654), clinical effectiveness, drug safety (12.3%; 80/654), treatment pattern and outcomes (31.1%; 204/654) and health economics (4.4%; 29/654). Advantages and limitations of large multi-site databases in data access, governance, quality and linkage were discussed via interviews. Most multi-site databases cover one city/selected provinces. Few databases have regular linkage to CDC death registry. Information related to biomarker, stage and disease progression has started to be processed using natural language processing (NLP) in a few databases without published methodology. NATDSS is the largest oncology database in China covering over 15 million cancer patients (2013 until now, 1400+ hospitals, 31 provinces), including 1.6 million lung cancer, 0.9 million colon, 0.85 million breast cancer and 0.7 million prostate cancer patients.
CONCLUSIONS: The integration of these databases enables robust analysis of patient demographics, drug treatments, biomarkers, and survival outcomes, thereby enhancing the understanding of cancer epidemiology and treatment efficacy in the Chinese population.
METHODS: A mixed method of systematic literature review and database interviews was applied. Database containing Chinese population were identified through systematic literature review of real-world studies (RWS). Articles in oncology published from January 2020 to December 2024 in PubMed/Medline, EMBASE, China National Knowledge Infrastructure and Wanfang were extracted. Databases were categorized into registries, single-/multi-site electronic medical record (EMR), claims, regional electronic health record (rEHR), and the National Anti-Tumor Drug Surveillance System (NATDSS). We also interviewed managers of the multi-center databases.
RESULTS: We identified 509 English and 145 Chinese RWS articles of oncology, containing 56.6% (n=370) used registries, 19.3% (n=126) used EMR, 6.7% (n=44) used claims, 0.8% (n=5) NATDSS and 0.6% (n=4) used rEHR. Databases in China have been used to describe disease epidemiology and natural history (43.4%; 284/654), clinical effectiveness, drug safety (12.3%; 80/654), treatment pattern and outcomes (31.1%; 204/654) and health economics (4.4%; 29/654). Advantages and limitations of large multi-site databases in data access, governance, quality and linkage were discussed via interviews. Most multi-site databases cover one city/selected provinces. Few databases have regular linkage to CDC death registry. Information related to biomarker, stage and disease progression has started to be processed using natural language processing (NLP) in a few databases without published methodology. NATDSS is the largest oncology database in China covering over 15 million cancer patients (2013 until now, 1400+ hospitals, 31 provinces), including 1.6 million lung cancer, 0.9 million colon, 0.85 million breast cancer and 0.7 million prostate cancer patients.
CONCLUSIONS: The integration of these databases enables robust analysis of patient demographics, drug treatments, biomarkers, and survival outcomes, thereby enhancing the understanding of cancer epidemiology and treatment efficacy in the Chinese population.
Conference/Value in Health Info
2025-05, ISPOR 2025, Montréal, Quebec, CA
Value in Health, Volume 28, Issue S1
Code
RWD34
Topic
Real World Data & Information Systems
Topic Subcategory
Distributed Data & Research Networks
Disease
SDC: Oncology