SINGLE DISTRIBUTION, TWO-PART, AND TWO-COMPONENT FINITE MIXTURE MODELS FOR PREDICTING SMOKING-RELATED INDIRECT COSTS IN US WORKING ADULTS

Author(s)

Li VW1, Goren A2, Baker CL3, Bruno MC3, Emir B3
1Kantar Health, New York, NJ, USA, 2Kantar Health, New York, NY, USA, 3Pfizer Inc., New York, NY, USA

OBJECTIVES:

Indirect costs data typically include a high proportion of zeros that cannot be adequately modeled with a single distribution. The current study examined predicted total costs associated with work impairments using different models applicable to such distributions.

METHODS:

Data on employed US adults (18-64 years old) were analyzed from the 2013 National Health and Wellness Survey. Self-report was used to define smoking status (never smoked, quit, attempting to quit, and currently smoke) as a predictor. Costs due to work productivity loss were derived from Work Productivity and Activity Impairment questionnaire-based measures on percentage absenteeism and presenteeism, and calculated using weekly wages by age and sex from the US Bureau of Labor Statistics (2014). Given excessive zeros (60%) in the cost data, two-part (first part logit, second part negative binomial [NB]) and two-component finite mixture (first component constant, second component truncated NB) models were used to predict costs as a function of smoking status, controlling for respondent demographics and health characteristics. Model fit statistics (Akaike and Bayesian Information Criterion [AIC and BIC, respectively] and mean squared error [MSE]) were compared with those from a single-distribution generalized linear model (GLM) with NB distribution, which is also suited to highly skewed, count-like distributions.

RESULTS:

Among 36,883 working adults, the two-part model had the best fit statistics (AIC=359159; BIC=359355) compared with the mixture (AIC=394788; BIC=395001) and the GLM (AIC=391201; BIC=391312) models, and also the smallest MSE (105454117 compared with 105482560 and 21486386573, respectively). Overestimation of costs among those with zero cost was greatest in the single-distribution GLM (average predicted costs=$5306.76) compared with those from two-part ($5293.13) and mixture ($5293.04) models.

CONCLUSIONS:

In a broadly representative US population of working adults, two-part modeling was found to better represent high zero-skewed indirect cost data compared with two-component finite mixture and single-distribution models.

Conference/Value in Health Info

2017-11, ISPOR Europe 2017, Glasgow, Scotland

Value in Health, Vol. 20, No. 9 (October 2017)

Code

PRM47

Topic

Economic Evaluation, Methodological & Statistical Research

Topic Subcategory

Cost/Cost of Illness/Resource Use Studies, Modeling and simulation

Disease

Respiratory-Related Disorders

Explore Related HEOR by Topic


Your browser is out-of-date

ISPOR recommends that you update your browser for more security, speed and the best experience on ispor.org. Update my browser now

×