peakC: a flexible, non-parametric peak calling package for 4C and Capture-C data

Nucleic Acids Res. 2018 Sep 6;46(15):e91. doi: 10.1093/nar/gky443.

Abstract

It is becoming increasingly clear that chromosome organization plays an important role in gene regulation. High-resolution methods such as 4C, Capture-C and promoter capture Hi-C (PCHiC) enable the study of chromatin loops such as those formed between promoters and enhancers or CTCF/cohesin binding sites. An important aspect of 4C/Capture-C/PCHiC analyses is the reliable identification of chromatin loops, preferably not based on visual inspection of a DNA contact profile, but on reproducible statistical analysis that robustly scores interaction peaks in the non-uniform contact background. Here, we present peakC, an R package for the analysis of 4C/Capture-C/PCHiC data. We generated 4C data for 13 viewpoints in two tissues in at least triplicate to test our methods. We developed a non-parametric peak caller based on rank-products. Sampling analysis shows that not read depth but template quality is the most important determinant of success in 4C experiments. By performing peak calling on single experiments we show that the peak calling results are similar to the replicate experiments, but that false positive rates are significantly reduced by performing replicates. Our software is user-friendly and enables robust peak calling for one-vs-all chromosome capture experiments. peakC is available at: https://github.com/deWitLab/peakC.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Binding Sites / genetics
  • CCCTC-Binding Factor / metabolism
  • Chromatin / genetics
  • Chromatin / metabolism
  • Chromosomes, Mammalian / genetics
  • Chromosomes, Mammalian / metabolism
  • Computational Biology / methods*
  • Embryonic Stem Cells / metabolism
  • High-Throughput Nucleotide Sequencing / methods*
  • Liver / embryology
  • Liver / metabolism
  • Mice
  • Promoter Regions, Genetic / genetics*
  • Reproducibility of Results
  • Sequence Analysis, DNA / methods*
  • Software*

Substances

  • CCCTC-Binding Factor
  • Chromatin