The use of count data models in biomedical informatics evaluation research
- PMID: 21715429
- PMCID: PMC3240756
- DOI: 10.1136/amiajnl-2011-000256
Abstract
Objectives: Studies on the impact and value of health information technology (HIT) have often focused on outcome measures that are counts, such as hospital admissions or the number of laboratory tests per patient. These measures, with their highly skewed distributions (high frequency of 0s and 1s), are more appropriately analyzed with count data models than with the much more frequently used variations of ordinary least squares (OLS). Using a statistical procedure that does not properly fit the distribution of the data can result in significant findings being overlooked. The objective of this paper is to encourage greater use of count data models by demonstrating their utility with an example based on the authors' current work.
Target audience: Researchers conducting impact and outcome studies related to HIT.
Scope: We review and discuss count data models and illustrate their value in comparison to OLS using an example from a study of the impact of an electronic health record (EHR) on laboratory test orders. The best count data model reveals significant relationships that OLS does not detect. We conclude that comprehensive model checking is highly recommended to identify the most appropriate analytic model when the dependent variable being examined contains count data. This strategy can lead to more valid and precise findings in HIT evaluation studies.
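As a rough illustration of the kind of model checking described above, the sketch below compares a Poisson fit with a negative binomial fit on a small, skewed sample of counts. It is not the authors' analysis: the `counts` values are hypothetical, the helper names are invented for this example, the models have no covariates, and the negative binomial size parameter is a method-of-moments estimate rather than a maximum likelihood fit, so the AIC values are only approximate.

```python
import math

# Hypothetical laboratory test counts per patient; illustrative values only,
# not data from the study.
counts = [0, 0, 0, 0, 1, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4, 6, 9]

n = len(counts)
mean = sum(counts) / n
variance = sum((y - mean) ** 2 for y in counts) / (n - 1)  # sample variance

# A Poisson model assumes variance == mean; a ratio well above 1
# signals overdispersion.
dispersion_ratio = variance / mean


def poisson_loglik(ys, mu):
    """Log-likelihood of an intercept-only Poisson model with mean mu."""
    return sum(y * math.log(mu) - mu - math.lgamma(y + 1) for y in ys)


def negbin_loglik(ys, mu, r):
    """Log-likelihood of a negative binomial with mean mu and size r
    (variance = mu + mu**2 / r)."""
    log_p = math.log(r / (r + mu))
    log_q = math.log(mu / (r + mu))
    return sum(
        math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
        + r * log_p + y * log_q
        for y in ys
    )


def aic(loglik, n_params):
    """Akaike information criterion; lower values indicate a better trade-off."""
    return 2 * n_params - 2 * loglik


# Method-of-moments estimate of the size parameter (assumption: this shortcut
# replaces a full maximum likelihood fit and requires variance > mean).
r_hat = mean ** 2 / (variance - mean)

pois_ll = poisson_loglik(counts, mean)
nb_ll = negbin_loglik(counts, mean, r_hat)

pois_aic = aic(pois_ll, 1)  # one parameter: mu
nb_aic = aic(nb_ll, 2)      # two parameters: mu and r

print(f"dispersion ratio: {dispersion_ratio:.2f}")
print(f"Poisson AIC: {pois_aic:.2f}")
print(f"Negative binomial AIC: {nb_aic:.2f}")
```

In a real evaluation the models would include covariates (for example, an indicator for EHR use) and would be fitted with a statistical package; the point here is only that the variance-to-mean ratio and an information criterion offer a quick check on whether the chosen model matches the distribution of the counts.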
