Toward a Clearer Definition of Selection Bias When Estimating Causal Effects

Epidemiology. 2022 Sep 1;33(5):699-706. doi: 10.1097/EDE.0000000000001516. Epub 2022 Jun 6.

Abstract

Selection bias remains a subject of controversy. Existing definitions of selection bias are ambiguous. To improve communication and the conduct of epidemiologic research focused on estimating causal effects, we propose to unify the various existing definitions of selection bias in the literature by considering any bias away from the true causal effect in the referent population (the population before the selection process), due to selecting the sample from the referent population, as selection bias. Given this unified definition, selection bias can be further categorized into two broad types: type 1 selection bias owing to restricting to one or more level(s) of a collider (or a descendant of a collider) and type 2 selection bias owing to restricting to one or more level(s) of an effect measure modifier. To aid in explaining these two types-which can co-occur-we start by reviewing the concepts of the target population, the study sample, and the analytic sample. Then, we illustrate both types of selection bias using causal diagrams. In addition, we explore the differences between these two types of selection bias, and describe methods to minimize selection bias. Finally, we use an example of "M-bias" to demonstrate the advantage of classifying selection bias into these two types.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Bias
  • Causality
  • Humans
  • Selection Bias*