BMC Med Res Methodol. 2008 Sep 11;8:61.
doi: 10.1186/1471-2288-8-61.

Data management for prospective research studies using SAS software


Robin L Kruse et al. BMC Med Res Methodol. 2008.

Abstract

Background: Maintaining data quality and integrity is important for research studies involving prospective data collection. Data must be entered, erroneous or missing data must be identified and corrected if possible, and an audit trail must be created.

Methods: Using a large prospective study, the Missouri Lower Respiratory Infection (LRI) Project, as an example, we present an approach to data management predominantly using SAS software. The Missouri LRI Project was a prospective cohort study of nursing home residents who developed an LRI. Enrollment, data collection, and follow-up continued for over three years. Data were collected on twenty different forms. Forms were inspected visually and sent off-site for data entry. SAS software was used to read the entered data files, check for potential errors, apply corrections to data sets, and combine batches into analytic data sets. The data management procedures are described.
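
The read, check, correct, and combine cycle described above can be illustrated with a minimal sketch. The study itself used SAS; the Python below is only an analogue of the pattern, and every name in it (the `RULES` table, the `Correction` record, the field names `age` and `temp_f`, and the sample values) is a hypothetical assumption, not something taken from the Missouri LRI Project.

```python
from dataclasses import dataclass

# Hypothetical range rules; the study's actual checks are not given in the abstract.
RULES = {
    "age": lambda v: v is not None and 0 <= v <= 120,
    "temp_f": lambda v: v is not None and 90.0 <= v <= 110.0,
}


@dataclass(frozen=True)
class Correction:
    """One reviewed change: which record, which field, old and new value."""
    record_id: str
    field: str
    old: object
    new: object


def find_errors(batch):
    """Return (record_id, field, value) for every value that fails its rule."""
    return [
        (rec["id"], field, rec.get(field))
        for rec in batch
        for field, ok in RULES.items()
        if not ok(rec.get(field))
    ]


def apply_corrections(batch, corrections, audit):
    """Return a corrected copy of the batch, logging each applied change."""
    by_id = {rec["id"]: dict(rec) for rec in batch}
    for c in corrections:
        rec = by_id.get(c.record_id)
        # Apply only when the stored value still matches what was reviewed.
        if rec is not None and rec.get(c.field) == c.old:
            rec[c.field] = c.new
            audit.append(c)
    return list(by_id.values())


def combine(batches):
    """Concatenate batches into one data set, rejecting duplicate record ids."""
    seen, out = set(), []
    for batch in batches:
        for rec in batch:
            if rec["id"] in seen:
                raise ValueError(f"duplicate record id: {rec['id']}")
            seen.add(rec["id"])
            out.append(rec)
    return out


# Made-up sample batches standing in for files returned from data entry.
batch1 = [{"id": "A1", "age": 84, "temp_f": 99.1},
          {"id": "A2", "age": 840, "temp_f": 101.3}]
batch2 = [{"id": "B1", "age": 77, "temp_f": None}]

audit = []
errors1 = find_errors(batch1)
clean1 = apply_corrections(batch1, [Correction("A2", "age", 840, 84)], audit)
analytic = combine([clean1, batch2])
```

In this sketch the corrections are kept as separate records and applied to a copy, so the entered data stay untouched and the `audit` list doubles as the audit trail. A missing value that cannot be recovered, such as `temp_f` in `batch2`, is flagged by `find_errors` but remains missing in the combined data set.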

Results: Study data collection resulted in over 20,000 completed forms. Data management was successful, resulting in clean, internally consistent data sets for analysis. The amount of time required for data management was substantially underestimated.

Conclusion: Data management for prospective studies should be planned well in advance of data collection. An ongoing process with data entered and checked as they become available allows timely recovery of errors and missing data.

Figures

Figure 1. Flowchart of organizational tasks (Note: some tasks such as obtaining IRB approval, obtaining facility participation, and interacting with attending physicians are not included).
Figure 2. Overview of data editing process.
