Background: Randomized trials aimed at improving the quality of medical care often randomize the provider. Such trials are frequently embedded in health care systems with available automated records, which can be used to enhance the design of the trial.
Methods: We consider how available information from automated records can address each of the following concerns in the design of a trial: whether to randomize individual providers or practices; clustering of outcomes among patients in the same practice and its impact on study size; expected heterogeneity in adherence and the response to the intervention; eligibility criteria and the trade-offs between generalizability and internal validity; and blocking or matching to alleviate covariate imbalance across practices.
Results: Investigators can use available information from an automated database to estimate the amount of clustering of patients within providers and practices, and these estimates can inform the decision on whether to randomize at the level of the patient, the provider, or the practice. We illustrate calculation of the anticipated design effect for a proposed cluster-randomized trial and its implications for sample size. With available claims data, investigators can apply focused eligibility criteria to exclude subjects and providers with expected low compliance or lower likelihood of benefit, although possibly at some loss of generalizability. Chance imbalances in covariates are more likely when randomization occurs at the level of the practice than at the level of the patient, so we propose a matching score to limit such imbalances by design.
Conclusions: Challenges to compliance, expected small effects, and covariate imbalances are particularly likely in cluster-randomized trials of quality improvement interventions. When such trials are embedded in medical systems with available automated records, use of these data can enhance the design of the trial.