Socioeconomic status and health in blacks and whites: the problem of residual confounding and the resiliency of race

Epidemiology. 1997 Nov;8(6):621-8.


A large number of epidemiologic studies have focused on racial/ethnic differences, particularly between blacks and whites. Because health endpoints and racial categorizations are associated with socioeconomic status, investigators generally adjust for socioeconomic indicators. The intention is usually to control for confounding, thereby making groups comparable and excluding socioeconomic status as an alternative explanation to hypotheses of innate physiologic differences. A threat to the validity of these analyses is therefore the presence of residual confounding. We identify four potential sources of residual confounding in this analytical design: categorization of socioeconomic status variables, measurement error in socioeconomic indicators, use of aggregated socioeconomic status measures, and incommensurate socioeconomic indicators. Using simulations and examples from the literature, we demonstrate that the effect of residual confounding is to bias interpretation of data toward the conclusion of independent racial/ethnic group effects. Investigators often refer to possible "genetic" differences on the basis of models that control for socioeconomic status. We propose that such conclusions on the basis of this analytical strategy are generally unwarranted. Racial/ethnic differences in disease are a pressing public health concern, but the current approach does not often provide a basis for inference about putative biological factors in the etiology of this disparity.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Bias*
  • Black or African American / statistics & numerical data*
  • Computer Simulation
  • Confounding Factors, Epidemiologic*
  • Demography
  • Diabetes Mellitus / epidemiology
  • Epidemiologic Methods
  • Genetic Predisposition to Disease
  • Health Status*
  • Humans
  • Income / classification
  • Income / statistics & numerical data
  • Logistic Models
  • Odds Ratio
  • Residence Characteristics / classification
  • Residence Characteristics / statistics & numerical data
  • Risk Assessment
  • Social Class*
  • United States / epidemiology
  • White People / statistics & numerical data*