Engineered in-vitro cell line mixtures and robust evaluation of computational methods for clonal decomposition and longitudinal dynamics in cancer

Sci Rep. 2017 Oct 18;7(1):13467. doi: 10.1038/s41598-017-13338-8.

Abstract

Characterization and quantification of tumour clonal populations over time via longitudinal sampling are essential components in understanding and predicting the response to therapeutic interventions. Computational methods for inferring tumour clonal composition from deep-targeted sequencing data are ubiquitous, however due to the lack of a ground truth biological data, evaluating their performance is difficult. In this work, we generate a benchmark data set that simulates tumour longitudinal growth and heterogeneity by in vitro mixing of cancer cell lines with known proportions. We apply four different algorithms to our ground truth data set and assess their performance in inferring clonal composition using different metrics. We also analyse the performance of these algorithms on breast tumour xenograft samples. We conclude that methods that can simultaneously analyse multiple samples while accounting for copy number alterations as a factor in allelic measurements exhibit the most accurate predictions. These results will inform future functional genomics oriented studies of model systems where time series measurements in the context of therapeutic interventions are becoming increasingly common. These studies will need computational models which accurately reflect the multi-factorial nature of allele measurement in cancer including, as we show here, segmental aneuploidies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Breast Neoplasms / etiology
  • Breast Neoplasms / pathology
  • Cell Line, Tumor
  • Computational Biology / methods
  • Computer Simulation*
  • DNA Copy Number Variations
  • Disease Models, Animal
  • Exome Sequencing
  • Female
  • Heterografts
  • Humans
  • Mice
  • Models, Biological*
  • Neoplasms / etiology*
  • Neoplasms / pathology*
  • Polymorphism, Single Nucleotide
  • Reproducibility of Results

Grants and funding