A radiogenomic dataset of non-small cell lung cancer

Sci Data. 2018 Oct 16;5:180202. doi: 10.1038/sdata.2018.202.

Abstract

Medical image biomarkers of cancer promise improvements in patient care through advances in precision medicine. Compared to genomic biomarkers, image biomarkers provide the advantages of being non-invasive, and characterizing a heterogeneous tumor in its entirety, as opposed to limited tissue available via biopsy. We developed a unique radiogenomic dataset from a Non-Small Cell Lung Cancer (NSCLC) cohort of 211 subjects. The dataset comprises Computed Tomography (CT), Positron Emission Tomography (PET)/CT images, semantic annotations of the tumors as observed on the medical images using a controlled vocabulary, and segmentation maps of tumors in the CT scans. Imaging data are also paired with results of gene mutation analyses, gene expression microarrays and RNA sequencing data from samples of surgically excised tumor tissue, and clinical data, including survival outcomes. This dataset was created to facilitate the discovery of the underlying relationship between tumor molecular and medical image features, as well as the development and evaluation of prognostic medical image biomarkers.

Publication types

  • Dataset
  • Research Support, N.I.H., Extramural

MeSH terms

  • Carcinoma, Non-Small-Cell Lung* / diagnostic imaging
  • Carcinoma, Non-Small-Cell Lung* / genetics
  • Carcinoma, Non-Small-Cell Lung* / physiopathology
  • Humans
  • Lung Neoplasms* / diagnostic imaging
  • Lung Neoplasms* / genetics
  • Lung Neoplasms* / physiopathology
  • Positron-Emission Tomography
  • Sequence Analysis, RNA
  • Survival Analysis
  • Tomography, X-Ray Computed