The what, why, and how of born-open data

Jeffrey N Rouder

doi:10.3758/s13428-015-0630-z

Behav Res Methods. 2016 Sep;48(3):1062-9. doi: 10.3758/s13428-015-0630-z.

Author

Jeffrey N Rouder¹

Affiliation

¹ University of Missouri, Columbia, MO, 65211, USA. rouderj@missouri.edu.

PMID: 26428912

Abstract

Although many researchers agree that scientific data should be open to scrutiny to ferret out poor analyses and outright fraud, most raw data sets are not available on demand. There are many reasons researchers do not open their data, and one is technical: it is often time-consuming to prepare and archive data. In response, my laboratory has automated the process so that our data are archived the night they are created, without any human approval or action. All data are versioned, logged, time-stamped, and uploaded, including aborted runs and data from pilot subjects. The archive is GitHub (github.com), the world's largest collection of open-source materials. Data archived in this manner are called born open. In this paper, I discuss the benefits of born-open data and provide a brief technical overview of the process. I also address some of the common concerns about opening data before publication.
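
As a rough illustration of the kind of automation the abstract describes, the sketch below stages, commits, and pushes everything in a local data directory with a time-stamped message. It is not the author's actual script: it assumes the data directory is already a Git repository with a GitHub remote, and the function names, the remote name `origin`, and the branch name `main` are hypothetical choices made for this example.

```python
import subprocess
from datetime import datetime, timezone
from pathlib import Path


def build_archive_commands(stamp: str, remote: str = "origin",
                           branch: str = "main") -> list[list[str]]:
    """Return the git commands for one archive run (names are assumptions)."""
    return [
        # Stage every file, so aborted runs and pilot data are included too.
        ["git", "add", "--all"],
        # --allow-empty records a time-stamped commit even on nights with no new data.
        ["git", "commit", "--allow-empty", "-m", f"Nightly archive {stamp}"],
        # Upload the new commit to the remote repository.
        ["git", "push", remote, branch],
    ]


def archive_data(repo_dir: Path, dry_run: bool = False) -> list[list[str]]:
    """Run one archive pass in repo_dir; with dry_run=True, only build the commands."""
    stamp = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    commands = build_archive_commands(stamp)
    if not dry_run:
        for cmd in commands:
            # check=True stops the run if any git step fails.
            subprocess.run(cmd, cwd=repo_dir, check=True)
    return commands
```

Calling `archive_data` from a nightly scheduled job (for example, a cron entry) would remove the need for any human action; Git itself supplies the versioning and the commit log. The scheduling mechanism is likewise an assumption, not a detail taken from the abstract.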

Keywords: Data integrity; Data sharing; Open data; Open science.

MeSH terms

Data Interpretation, Statistical*
Databases, Factual
Information Dissemination
Internet
Publishing
Research Personnel
Scientific Misconduct / statistics & numerical data*
Software