Model-based multiplicity estimation of population size

Eugene M Laska; Morris Meisner; Joseph Wanderling

doi:10.1002/sim.3614

Model-based multiplicity estimation of population size

Stat Med. 2009 Jul 30;28(17):2230-52. doi: 10.1002/sim.3614.

Authors

Eugene M Laska¹, Morris Meisner, Joseph Wanderling

Affiliation

¹ Statistics and Services Research Division, The Nathan S. Kline Institute for Psychiatric Research, New York University School of Medicine, Orangeburg, NY 10962, USA. laska@nki.rfmh.org

PMID: 19572380
DOI: 10.1002/sim.3614

Abstract

A survey is conducted at w of K selection units or lists, e.g. health care institutions or weeks in a year, to estimate N, the total number of individuals with particular characteristics. Our estimator utilizes two items determined for each survey participant: the number, u, among the w lists in S and the number, j, among all K lists on which each survey participant appears. In its traditional form, selection units are chosen using probability sampling and the statistical properties of the estimator derive from the sampling mechanism. Here, selection units are purposively chosen to maximize the chance that they are 'typical' and a model-based analysis is used for inference. If the sample is typical, the ML estimators of N and E(J) are unbiased. If a condition on the second moment of U/J is satisfied, the model-based variance of the estimator of N based on a purposively chosen typical sample is smaller than one based on a randomly chosen sample. Methods to test whether the typical assumption is valid using data from the survey are not yet available. The importance of proper selection of the sample to maximize the chance that it is typical and model breakdown does not occur must be emphasized.

MeSH terms

Bias
Biometry
Confidence Intervals
Data Interpretation, Statistical
Humans
Likelihood Functions
Models, Statistical*
Population Density*
Sample Size