The Official News & Technical Journal Of The International Society For Pharmacoeconomics And Outcomes Research

OUTCOMES ASSESSMENT

Multi-National Assessment of Outcomes Via Retrospective Databases

Vittorio Maio PharmD, MS, MSPH, Elaine J. Yuen, Kenneth D. Smith; Jefferson Medical College, Philadelphia, PA, USA; Diana I. Brixner, Gary M. Oderda, and Carl V. Asche, University of Utah, Salt Lake City, UT, USA; Steve Morgan, University of British Columbia, Vancouver, NC, Canada


Introduction
With the increased use of technology, data are becoming the most mined and useful commodity in any field, including health care. In recent years, there has been an enormous growth of populationbased health care databases, which represent an important resource for clinical epidemiologists and health care managers to answer a wide spectrum of questions. These databases are usually made up of documented hospitalizations, prescriptions, and professional services. Over the last decade health care claims databases have expanded to include lab tests, value, vital signs and other critical information.

With large sample sizes, at relatively low cost, these databases have a great potential for research applications. Population-based databases may be used to study “real world” effectiveness and utilization patterns of health services; to investigate utilization and outcomes of therapeutics; to detect adverse drug events; and to assess the impact of health care policy implementation.

Despite these important benefits, the extent to which individual countries have access to these data is not readily known. In addition, the temptation to utilize the combined information in these databases for outcomes research on healthcarerelated issues of global significance is strong; however, there are few examples of such work. This article will present health care databases from the United States, Canada, and Italy, describing the characteristics of these databases as well as their similarities and differences. Operational, methodological, and analytical challenges to conduct multi-national retrospective outcomes research evaluations will then be reviewed.

U.S. Data
An up and coming source for outcomes research in the U.S. is the Electronic Medical Record (EMR) database, GE Centricity, which is developed and sold by General Electric Clinical Data Systems. There are about 4,600 providers using this EMR product in 35 states, which is growing rapidly at a rate of roughly 20% per year. Physicians who use the medical record system decide whether they want their patients’ data to be available for research purposes. The Medical Quality Improvement Consortium is the group of physicians that allow their data to be used for research. The Consortium’s primary purpose is to improve patient care and to be involved in quality care management and practice profiling.

There is data on about 3.7 million patients in the research database, which is about a quarter of the patients whose providers use the EMR. There are a large number of patients with most diseases. For instance, there are approximately 421,000 patients with hypertension and 623,000 with hyperlipidemia. The data are sent nightly to Portland, OR, USA where it is placed into a format that can be used electronically for research purposes.

Regarding characteristics of this database, the data you would get is very similar to what you would see going through a paper medical record. Physician’s notes are not included due to the 1996 Health Insurance Portability and Accountability Act (HIPAA) concerns that personal health information may be included in the notes.

There are advantages and disadvantages to getting information from an EMR. In this particular case, the first advantage is that the database is quite up to date. Data from physicians’ offices are downloaded on a daily basis to a central computer. Another major advantage is that lab data are available, a fact that distinguishes this database from commercial insurance claims databases available in the United States. There is clinical data available related to diagnosis. Data on all patients are included, regardless of whether the payor is government, commercial insurance, or private. Patients tend to stay with their physicians longer than with an individual commercial insurance plan, which lends to increased longitudinal data. The median time that a patient is in the database is three years. The database includes a record of all vital signs. When a patient goes into a physician’s office and has his/her vital signs taken, this information is included as part of the research database. All collected data is cleaned by an informatics team and extracted for research purposes.

As with any retrospective research database, there are also limitations. A system like this is quite expensive, both in terms of creating the software and the technology infrastructure. In addition, conversion of the data to a research oriented dataset is difficult. The product was originally developed for use in physician offices to help them with the administrative aspects of their practice. There are no patient identifiers, as required by HIPAA regulations, and there is also no way to identify physicians or to distinguish primary care providers from specialists. Patient compliance is not tracked in the system. Physician prescription orders are captured; however, whether the prescription was filled or refilled is not documented. Finally, the amount of inpatient data is quite limited and would only be there if documented by the physician in the outpatient medical record.

The University of Utah Pharmacotherapy Outcomes Research Center (PORC) has been involved in a number of studies using this EMR database, taking advantage of the availability of lab information. An initial study evaluated Body Mass Index (BMI) as a side effect among different second generation antipsychotics [1, 2]. A second study determined the predictors of Chronic Obstructive Pulmonary Disease (COPD) to assess the impact of early diagnosis on outcomes [3]. A final study conducted a risk assessment of OTC non-steroidal anti-inflammatory drugs for perforations, ulcers and gastro-intestinal bleeds [4]. Other disease states such as metabolic syndrome and its impact on chronic disease, blood pressure outcomes related to mono and combination hypertension therapy, and an assessment of mono vs. combination therapy in the treatment of diabetes via HbA1c outcomes are also well suited to the EMR database and are currently under study.

Canadian Data
The Canadian database prototype is provided by British Columbia (BC), a province comprised of over 4.1 million residents located on the West Coast of Canada. The dataset is population-based and patient specific. It contains demographic information about the entire population, as well as utilization records for prescription drugs, medical, and hospital services. The denominators are assembled by the UBC Centre for Health Services and Policy Research. The study cohort is every BC resident eligible for the provincially administered, universal public health insurance, excluding members of the Royal Canadian Mounted Police, First Nations (aboriginal peoples of BC), and Veterans, which represent 4% of the entire population.

Information on prescription drugs is provided by the BC PharmaNet System, which is a computer network linking all pharmacies in the province. The primary use of this system is for patient-specific drug interaction monitoring and insurance claim adjudication. By law, it captures every prescription dispensed in BC regardless of payment type (public drug plan, private drug plan, or out-of-pocket).All individuals, regardless of age, are included in this database. PharmaNet does not capture prescriptions in acute care and rehabilitation hospitals, or prescriptions for residents of penitentiaries.

Diagnostic information of indicated and potentially confounding conditions is captured through Medical Services Plan and Hospital Discharge records. The former pertains to outpatient fee-forservice billings and contains one diagnosis per physician visit or related service. The latter describes patient stays at the point of hospital discharge, and contains up to 16 diagnoses. The team at the UBC Centre for Health Services and Policy Research uses these diagnoses for research purposes, identifying and aggregating conditions using Expanded Diagnostic Clusters (EDCs), a system developed by the Johns Hopkins Adjudicated Clinical Groups (ACGs) team.

The BC database has already been used to study a wide range of health services research questions, including numerous drug utilization reviews (hypertension drugs, antidepressants, cholesterol agents, asthma treatments) and drug policy analyses. Using the BC databases, detailed information about prescription drug utilization has been compiled in the BC Rx Atlas, available online from the UBC Centre for Health Services and Policy Research [5].

Italian Data
In collaboration with the Agenzia Sanitaria Regionale of the Regione Emilia Romagna (RER) of Italy (in Northeast Italy), the Center for Research in Medical Education and Health Care at Jefferson Medical College in Philadelphia, PA, USA, has constructed a population-based, longitudinal health care utilization database for the region. The nature of the Italian health care system makes the resulting database a valuable resource for planning and research. Italy has a universal health care system which is very similar to Great Britain. The RER database contains, therefore, records for the entire population of the region, approximately 4 million residents. The population of RER is increasing at a very slow rate, owing to a positive but small net migration rate that makes up for a low birth rate. As a result, the population in RER database is highly stable, yielding a dataset that is not subject to problems of censored observations or other statistical problems. This provides a great advantage over other utilization databases.

The RER database includes demographic information on all residents, including age, gender, birth and death date, location of residence, primary care physician, with linkages to information about the resident’s primary care physician (a general practitioner in Italy); day and ordinary hospital discharge abstract data, including ICD-9_CM coded diagnosis and procedure codes, admission and discharge dates, and DRG-based payments for RER residents hospitalized either in hospitals in RER or in other regions of Italy; and outpatient pharmacy data at the individual prescription level, including drug codes, pharmacy payments, and patient co-payments. There are also data describing use and costs of specialty care (including lab, diagnostics, therapeutic procedures, visits to specialists); home health data (physician, nurse, therapist, etc.); physician information (i.e. payments received, specialty, years in practice, and patient load); and ad hoc registries of specific medical devices or surgical procedures.

All of this information, fully linkable at both patient and physician levels, currently is available for 5 consecutive years (2000-2004), and there is ongoing data collection. The database can be indefinitely updated with information from additional years, enabling retrospective analyses across an ever-increasing sample frame and prospective analysis as new years of data are added to the database.

The value of the database has been increased by adding clinical classifications mapped from the hospital and pharmacy data. Diagnostic codes from day and acute hospital admissions have been used to classify admissions using the Disease Staging classification which, in turn, has been used to identify the subset of individuals who may be at higher risk for utilizing more extensive or expensive health services in the future [6]. Another set of indicators (Chronic Condition Drug Groups - CCDGs) uses outpatient pharmacy data and the Italian national formulary to identify individuals with chronic disease [7]. Using both hospital and pharmacy data, staging groups and CCDGs have been used to identify chronic diseases in a body system-etiology framework.

The database is being used in a variety of projects including assessment of the appropriateness of acute hospital admissions [8], evaluation of potentially inappropriate prescribing patterns for patients over age 65 [9], assessment of the impact of copayment changes on pharmacy and hospital use, and analyses of concentration and persistence of cost in sub-populations of the region [10]. In addition, the data are being used as a part of a “riskadjustment” model that identifies individuals with chronic disease and predicts future year costs [11]. This information can then be used to rationalize the allocation of resources to health districts based on the needs of the local population.

Similarities and Differences among the Three Databases
There are important similarities and differences among the three databases (Table 1). Of course the most striking difference is that the GE Centricity database is actually an EMR, which is a rich source of data, used primarily by providers to manage their outpatient encounters. The databases in Canada and in Italy are claims data, used for administrative purposes. The Canadian and the Italian datasets are actually quite similar. In terms of available sources, all of the different databases have difficulties in getting specific inpatient data, including pharmacy use. All have out-patient prescription data, and the sources of individual providers vary. The GE Centricity data is based on practice groups and not individual practitioners. Cost data is defined on the BC and Italian datasets. The largest difference is the population described by the data, although in all three datasets there are roughly 4 million people. The GE dataset is scattered over a large geographic area. Obviously, it is more in-depth by being an electronic medical record. The Canadian and Italian datasets include data for a captured population. For morbidity case identification and classification, all have ICD-9 codes and drug use, of course, but these codes are used in different ways. For instance, in Canada researchers are using the EDCs based on the Hopkins work, whereas at Jefferson, researchers have developed the CCDGs with the Italian database. From a research perspective, the detailed clinical outcomes information in the GE Centricity data is of great value. In the Canadian and Italian datasets, claims data can be examined to detect occurrences of adverse events in terms of hospitalization or discontinued prescription use, which cannot be assessed in the EMR.

Challenges and Opportunities to Conduct Multi-National Retrospective Outcomes Research Evaluations
Due to the various differences within the datasets and across global markets, there are benefits as well as challenges of using data from multiple international sources. Using multiple datasets from different countries enables the verification of findings from one country based on information from other countries. For example, suppose a relationship between an adverse event and a drug agent is identified in country A. There might be a unique characteristic of the population that might limit the credibility of the finding. However, if this relationship also exists in other countries, researchers and policy makers are more apt to believe the finding. An additional benefit is the opportunity to compare practice patterns across global regions and countries. For instance, by looking at differences on prescribing patterns of certain medications across countries, we can determine whether physicians in some countries are better able to adhere to current guidelines than physicians in other countries. This enables us to identify potential best practices and policies. For example, if the rate of adherence is greater in some countries than others, and all those countries have continuing medical education (CME) focused exclusively on the guidelines, one might hypothesize that the use of CME might be an approach other countries should follow.

However, making comparisons among databases is not an easy task. First of all, one should always ask whether or not each individual dataset has had a formal evaluation of its data quality. Administrative data, for example, sometimes has problems in quality assurance. It is therefore important to understand the data-generating process, including the method of data collection and the population from which the data is collected. If the administrative data is used for reimbursement purposes, like the United States Medicare data, there are often government regulations to check for upcoding, etc. Although the Italian database is not used for reimbursement, the health department in RER takes time to ensure data quality. Other data systems may not have quality assurance programs. For example, pharmaceutical claims data may not have a reconciliation process for re-submitted claims. As a result, one would see multiple claims for the same prescription. Bulk billing may make it difficult to link claims with actual services.

One major weakness is inevitable churning in and out of commercial insurance based databases in the United States. The United States has a rather fragmented health insurance system so there may be individuals moving in and out of the database. The Italian population utilization database from RER does not have this problem as the population is extremely stable. The database captures utilization of persons in the population until they die or leave the region-which is somewhat rare, except for the student population. Canada has a single payer, national health insurance model, and the populations in their databases are relatively stable as well. The stability of the investigated population is important when observing and comparing differences across countries, though it can potentially be corrected through the choice of statistical models.

There is a further need to understand the structural differences in health care provision across countries in order to interpret cross-country differences. Consider drug availability. Often, drugs are introduced for the first time in the United States, but it takes a while before they are introduced in other countries, or vice versa. So it may not be possible to compare outcomes on specific drugs until the drug has been around for awhile. This may also explain why certain physicians in one country adopt a new procedure or certain new drug agents sooner than another. This can be ascertained through making these cross-country comparisons.

It is very important not only to understand the context, but also to recognize that there will be different error structures in the data as a result of the differences in the data generating processing. And so, it is important to utilize several different measures, or “indicators,” instead of using just one measure to look at an outcome. It is also useful to use ratios and to compare trends rather than just using simple numbers. For example, instead of looking at differences in volume across countries, sometimes it is useful to look at the volume per person across different countries. If there is interest in the volume of a new antihypertensive medication you can compare volume trends in the new antihypertensive across two countries. This may require interpretation in two parallel lines to indicate no real difference in uptake between the two countries. However, a better analysis would compare the ratio of the volume of the new antihypertensive to the volume of the standard hypertensive medication. Even if volume trends are parallel, the trends in the ratio may not be. For example, the ratio may increase faster in one country compared to another. This would indicate that, even though volume was increasing at the same rate for countries A and B, there was a greater level of substitution in country B from the old to the new hypertensive medication. Thus, wherever possible, it is important to look at trends using ratios.

In looking at cost data, it is also important to make sure that a common currency is used. Deflate the data to use a common year, and then use purchasing power parity so that the same buying power throughout the different countries can be seen.

And finally, there should be consideration of various specification tests particularly in multivariate models. For example, one can estimate a model for each country in a regression and perform something like a Chow test to determine whether or not there are structural differences across countries.

Conclusions
Conducting multi-national retrospective outcomes research is appealing. Certainly, there are numerous challenges to be overcome, yet the potential information that can be drawn from such studies is of great value. Therefore, we strongly encourage institutions and organizations to increase the sharing of data in order to understand better the population needs in different countries. This will further assist in measuring important cross national outcomes and setting global health care policy accordingly.


REFERENCES
1. Brixner DI, Said Q, Newcomer J, et al. Real world usage of weight gain drugs in patients on second-generation anti-psychotics. American Diabetes Association. San Diego, CA, June 10-14th, 2005.
2. Brixner DI, Blonde L, Said Q, et al. Real World Impact of Second Generation Antipsychotics on Weight Gain. American Diabetes Association, San Diego, CA, June 10-14th, 2005.
3. Joish VN, Brady E, Stockdale WA, et al. Identification and diagnosis of COPD in a primary care setting. International Conference American Thoracic Society; San Diego, CA, 2005.
4. Biskupiak JE, Brixner DI, Oderda GM, Howard K. Gastrointestinal Complications of over-the-counter non-steroidal anti-inflammatory drugs. ACG 70th Annual Scientific Meeting; October Honolulu, HI, 2005.
5. UBC Centre for Health Services and Policy Research web site. Publications. Available at:
http://www.chspr.ubc.ca  Last Accessed February 16, 2006.
6. Louis DZ, Gonnella JS, McCracken SB, Barnes CA. Patient classification and severity of illness: Disease Staging, DRGs, and APR-DRGs. Patient Classification Systems/Europe, Budapest, Hungary, October 2004.
7. Maio V, Yuen EJ, Rabinowitz C, et al. Using Pharmacy Data to Identify Those with Chronic Conditions in Emilia Romagna, Italy. J Health Serv Res Policy 2005;10:232-8.
8. Louis DZ, Donatini A, Jimbo M, et al. An atlas of appropriateness of hospital admissions in the Emilia-Romagna Region, Italy. 132nd American Public Health Association, Washington, DC, November 2004.
9. Maio V, Yuen EJ, Novielli KD, et al. Potentially Inappropriate Prescribing for Elderly Ambulatory Patients in Regione Emilia Romagna, Italy. 10th Annual Meeting of the International Society Pharmacoeconomics and Outcomes Research, Washington, DC, May 16, 2005,
10. Yuen EJ, Maio V, Louis DZ, et al.Using an Italian Population Database to Profile Prevalence and Costs of Chronic Conditions. 8th Annual European Congress of the International Society Pharmacoeconomics and Outcomes Research, Florence, Italy, November 8, 2005.
11. Smith KD, Yuen EJ, Donatini A, Louis DZ, Rabinowitz C, Maio V, Taroni F. Risk Adjustment in a Non-Market-Based System: The Case of Emilia Romagna, Italy. Intl J Healthcare Technol Managt 2006;7:100-16.


  Issues Index | 2006 Issues Index  

Contact ISPOR @ info@ispor.org  |  View Legal Disclaimer
©2010 International Society for Pharmacoeconomics and Outcomes Research.
All rights reserved under International and Pan-American Copyright Conventions.
 
Website design by Eagle Systems USA, Inc.