PREFACE: In silico pipeline for accurate cell-free fetal DNA fraction prediction

Prenat Diagn. 2019 Sep;39(10):925-933. doi: 10.1002/pd.5508. Epub 2019 Jul 11.

Abstract

Objective: During routine noninvasive prenatal testing (NIPT), cell-free fetal DNA fraction is ideally derived from shallow-depth whole-genome sequencing data, preventing the need for additional experimental assays. The fraction of aligned reads to chromosome Y enables proper quantification for male fetuses, unlike for females, where advanced predictive procedures are required. This study introduces PREdict FetAl ComponEnt (PREFACE), a novel bioinformatics pipeline to establish fetal fraction in a gender-independent manner.

Methods: PREFACE combines the strengths of principal component analysis and neural networks to model copy number profiles.

Results: For sets of roughly 1100 male NIPT samples, a cross-validated Pearson correlation of 0.9 between predictions and fetal fractions according to Y chromosomal read counts was noted. PREFACE enables training with both male and unlabeled female fetuses. Using our complete cohort (nfemale = 2468, nmale = 2723), the correlation metric reached 0.94.

Conclusions: Allowing individual institutions to generate optimized models sidelines between-laboratory bias, as PREFACE enables user-friendly training with a limited amount of retrospective data. In addition, our software provides the fetal fraction based on the copy number state of chromosome X. We show that these measures can predict mixed multiple pregnancies, sex chromosomal aneuploidies, and the source of observed aberrations.

Publication types

  • Validation Study

MeSH terms

  • Cell-Free Nucleic Acids / analysis*
  • Chromosome Disorders / diagnosis*
  • Chromosome Disorders / epidemiology
  • Cohort Studies
  • Computational Biology / methods
  • Computer Simulation
  • Female
  • Fetus / metabolism*
  • Fetus / physiology
  • Genetic Testing / methods
  • Genetic Testing / statistics & numerical data
  • High-Throughput Nucleotide Sequencing / methods
  • High-Throughput Nucleotide Sequencing / statistics & numerical data
  • Humans
  • Infant, Newborn
  • Male
  • Noninvasive Prenatal Testing* / methods
  • Noninvasive Prenatal Testing* / statistics & numerical data
  • Pregnancy
  • Principal Component Analysis / methods*
  • Prognosis
  • Reproducibility of Results
  • Retrospective Studies
  • Sequence Analysis, DNA / methods
  • Sequence Analysis, DNA / statistics & numerical data
  • Sex Factors
  • Software*

Substances

  • Cell-Free Nucleic Acids