Quantifying the impact of public omics data

Nat Commun. 2019 Aug 5;10(1):3512. doi: 10.1038/s41467-019-11461-w.


The amount of omics data in the public domain is increasing every year, and modern science has become a data-intensive discipline. Innovative solutions for data management, data sharing, and the discovery of novel datasets are therefore increasingly required. In 2016, we released the first version of the Omics Discovery Index (OmicsDI) as a lightweight system to aggregate datasets across multiple public omics data resources. OmicsDI aggregates genomics, transcriptomics, proteomics, metabolomics and multiomics datasets, as well as computational models of biological processes. Here, we propose a set of novel metrics to quantify the attention and impact of biomedical datasets. A complete framework, now integrated into OmicsDI, has been implemented to provide and evaluate those metrics. Finally, we propose a set of recommendations for authors, journals and data resources to promote an optimal quantification of the impact of datasets.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Access to Information*
  • Computational Biology / statistics & numerical data
  • Datasets as Topic*
  • Gene Expression Profiling / statistics & numerical data
  • Genomics / statistics & numerical data
  • Humans
  • Information Dissemination*
  • Metabolomics / statistics & numerical data
  • Proteomics / statistics & numerical data