Universal adaptability: Target-independent inference that competes with propensity scoring

Proc Natl Acad Sci U S A. 2022 Jan 25;119(4):e2108097119. doi: 10.1073/pnas.2108097119.

Abstract

The gold-standard approaches for gleaning statistically valid conclusions from data involve random sampling from the population. Collecting properly randomized data, however, can be challenging, so modern statistical methods, including propensity score reweighting, aim to enable valid inferences when random sampling is not feasible. We put forth an approach for making inferences based on available data from a source population that may differ in composition in unknown ways from an eventual target population. Whereas propensity scoring requires a separate estimation procedure for each target population, we show how to build a single estimator, based on source data alone, that allows for efficient and accurate estimates on any downstream target data. We demonstrate, theoretically and empirically, that our target-independent approach to inference, which we dub "universal adaptability," is competitive with target-specific approaches that rely on propensity scoring. Our approach builds on a surprising connection between the problem of inference in unspecified target populations and the multicalibration problem, studied in the burgeoning field of algorithmic fairness. We show how the multicalibration framework can be employed to yield valid inferences from a single source population across a diverse set of target populations.
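
To make the contrast concrete, below is a minimal, hypothetical sketch in Python. It is not the authors' algorithm: a gradient-boosted regressor stands in for the multicalibrated predictor the paper actually requires (multicalibration asks that a predictor be simultaneously calibrated across a rich collection of subpopulations), and the simulated data, shifts, and variable names are invented for illustration. What it demonstrates is the structural difference described in the abstract: the universal-adaptability-style estimator is fit once on source data and then merely averaged over each target's covariates, while propensity-score reweighting refits a source-vs-target classifier for every new target.

```python
# Illustrative sketch only: gradient boosting is a stand-in for a genuinely
# multicalibrated predictor, and the simulation is invented for this example.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def draw(n, shift):
    """Draw covariates X (under a mean shift) and outcomes Y."""
    X = rng.normal(loc=shift, size=(n, 2))
    Y = X[:, 0] + 0.5 * X[:, 1] ** 2 + rng.normal(size=n)
    return X, Y

X_src, Y_src = draw(5000, shift=0.0)          # source: outcomes observed

# --- Target-independent step: done once, from source data alone ---
f = GradientBoostingRegressor().fit(X_src, Y_src)

for shift in [0.5, 1.0, 1.5]:                 # several downstream targets
    X_tgt, Y_tgt = draw(5000, shift=shift)    # target outcomes kept only as ground truth

    # Universal-adaptability-style estimate: average the fixed predictor
    # over the target covariates; no per-target refitting.
    est_ua = f.predict(X_tgt).mean()

    # Propensity-score reweighting: fit a fresh source-vs-target classifier
    # for *this* target, then weight source outcomes by the estimated odds
    # (a self-normalized importance-weighting estimate of the target mean).
    Z = np.vstack([X_src, X_tgt])
    lbl = np.r_[np.zeros(len(X_src)), np.ones(len(X_tgt))]
    p = LogisticRegression().fit(Z, lbl).predict_proba(X_src)[:, 1]
    w = p / (1 - p)
    est_ps = np.average(Y_src, weights=w)

    print(f"shift={shift}: truth={Y_tgt.mean():.3f}  "
          f"universal={est_ua:.3f}  propensity={est_ps:.3f}")
```

On this simulation both estimators track the shifted target means; the practical difference is that f never sees any target during fitting, so the same fitted object can be reused across targets, which is the "target-independent" property the abstract claims. The paper's guarantee applies to genuinely multicalibrated predictors, not to the stand-in regressor used in this sketch.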

Keywords: algorithmic fairness; propensity scoring; statistical validity.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, Non-U.S. Gov't