[Control group formation using propensity score matching: The role of primary and secondary data - Results of prevention studies]

Gerhard Müller; Marco Giurgiu; Monika Heinzel-Gutenbrunner; Klaus Bös; Thomas Kohlmann; Manuela Bombana

doi:10.1016/j.zefq.2020.07.004

[Control group formation using propensity score matching: The role of primary and secondary data - Results of prevention studies]

Z Evid Fortbild Qual Gesundhwes. 2020 Nov:156-157:68-74. doi: 10.1016/j.zefq.2020.07.004. Epub 2020 Aug 25.

[Article in German]

Authors

Gerhard Müller¹, Marco Giurgiu², Monika Heinzel-Gutenbrunner³, Klaus Bös⁴, Thomas Kohlmann⁵, Manuela Bombana⁶

Affiliations

¹ Fachbereich Gesundheitsförderung, AOK Baden-Württemberg, Stuttgart, Deutschland. Electronic address: gerhard.mueller@bw.aok.de.
² Institut für Sport und Sportwissenschaft, Karlsruher Institut für Technologie, Karlsruhe, Deutschland; Institut für Psychiatrische und Psychosomatische Psychotherapie, Zentralinstitut für Seelische Gesundheit, Universität Heidelberg, Mannheim, Deutschland.
³ MH Statistikberatung, Marburg, Deutschland.
⁴ Institut für Sport und Sportwissenschaft, Karlsruher Institut für Technologie, Karlsruhe, Deutschland.
⁵ Community Medicine, Universität Greifswald, Greifswald, Deutschland.
⁶ Fachbereich Gesundheitsförderung, AOK Baden-Württemberg, Stuttgart, Deutschland; Abteilung Allgemeinmedizin und Versorgungsforschung, Universitätsklinikum Heidelberg, Heidelberg, Deutschland.

PMID: 32855075
DOI: 10.1016/j.zefq.2020.07.004

Abstract

Background: The creation of control groups in the evaluation of statutory health insurances is a key issue. Randomization represents both an ethical and a legal problem with legally guaranteed services. Matching procedures are relevant alternatives in the construction of control groups. Matchings are mostly based on secondary data from statutory health insurances (for example age, gender, cost of illness, days of incapacity to work). In this study, we examined whether matching based on secondary data alone can cause selection bias.

Methods: We used data from three large prevention studies and applied sensitivity analyses to compare the results of propensity score matchings used to create control groups on the basis of secondary data, with those obtained on the basis of both primary and secondary data. Analysis of covariance was used to investigate the impact of potential selection bias on cost effects.

Results: Matchings based on secondary data alone lead to control groups with similar characteristics captured by secondary data. However, the control group participants are significantly healthier (they have, for example, lower levels of pain, lower levels of psychological stress, a higher degree of quality of life) than the patients in intervention groups. This selection bias would lead to a systematic underestimation of the cost reduction produced by preventive interventions.

Discussion: Prevention course participants seem to have characteristics that differ from the average population (higher health orientation level, preference for prevention over medical treatment services, etc.) and cannot be captured by secondary data; therefore, matchings based on secondary data alone cause selection bias.

Conclusions: Including both primary and secondary data reduces the risk of selection bias in matching procedures for prevention studies. The E-value can be used to evaluate the robustness of results with regard to selection bias.

Keywords: Data linkage; Datenlinkage; Matched-Pair-Analysen; Matched-pair analysis; Secondary data; Sekundärdaten; Selection bias; Selektionsbias.

MeSH terms

Control Groups
Germany
Humans
Propensity Score
Quality of Life*
Selection Bias