Parametric modelling of cost data: some simulation evidence

Health Econ. 2005 Apr;14(4):421-8. doi: 10.1002/hec.941.


Recently, commentators have suggested that the distributional form of cost data should be explicitly modelled to gain efficiency in estimating the population mean. We perform a series of simulation experiments to evaluate the usual sample mean and the mean estimator of a lognormal distribution, in the context of both theoretical distributions and three large empirical datasets. The sample mean is always unbiased, but is somewhat less efficient when the population distribution is truly lognormal. However the lognormal estimator can perform appallingly when the true distribution is not lognormal. In practical situations, where the true distribution is unknown, the sample mean generally remains the estimator of choice, especially when limited sample size prohibits detailed modelling of the cost data distribution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Clinical Trials as Topic
  • Computer Simulation
  • Confidence Intervals
  • Cost-Benefit Analysis / methods
  • Data Interpretation, Statistical
  • Health Care Costs / statistics & numerical data*
  • Humans
  • Models, Econometric*
  • Statistical Distributions*