Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2012 Jan-Feb;19(1):39-44.
doi: 10.1136/amiajnl-2011-000256. Epub 2011 Jun 29.

The use of count data models in biomedical informatics evaluation research

Affiliations
Comparative Study

The use of count data models in biomedical informatics evaluation research

Jing Du et al. J Am Med Inform Assoc. 2012 Jan-Feb.

Abstract

Objectives: Studies on the impact and value of health information technology (HIT) have often focused on outcome measures that are counts of such things as hospital admissions or the number of laboratory tests per patient. These measures with their highly skewed distributions (high frequency of 0s and 1s) are more appropriately analyzed with count data models than the much more frequently used variations of ordinary least squares (OLS). Use of a statistical procedure that does not properly fit the distribution of the data can result in significant findings being overlooked. The objective of this paper is to encourage greater use of count data models by demonstrating their utility with an example based on the authors' current work.

Target audience: Researchers conducting impact and outcome studies related to HIT.

Scope: We review and discuss count data models and illustrate their value in comparison to OLS using an example from a study of the impact of an electronic health record (EHR) on laboratory test orders. The best count data model reveals significant relationships that OLS does not detect. We conclude that comprehensive model checking is highly recommended to identify the most appropriate analytic model when the dependent variable being examined contains count data. This strategy can lead to more valid and precise findings in HIT evaluation studies.

PubMed Disclaimer

Conflict of interest statement

Competing interests: None.

Figures

Figure 1
Figure 1
Observed frequency distribution of the number of laboratory tests ordered during index emergency department (ED) visits for asthma patients.
Figure 2
Figure 2
A graphical comparison of the fit of ordinary least squares and count data models to the number of observed laboratory test orders.

Similar articles

Cited by

References

    1. Cameron AC, Trivedi PK. Regression Analysis of Count Data. Cambridge: Cambridge University Press, 1998
    1. Deb P, Manning W, Norton E. Modeling Health Care Costs and Counts. 2006. http://www.unc.edu/∼enorton/DebManningNortonPresentation.pdf (accessed 16 Apr 2011).
    1. Mosteller F, Tukey JW. Data Analysis and Regression: A Second Course in Statistics. Reading: Addison-Wesley, 1977
    1. Cohen J, Cohen P, West SG, et al. Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences. 3rd edn Mahwah, NJ: Lawrence Erlbaum, 2003
    1. Vest JR. Health information exchange and healthcare utilization. J Med Syst 2009;33:223–31 - PubMed

Publication types

MeSH terms