Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug 13;19(1):604.
doi: 10.1186/s12864-018-4989-y.

Unsupervised correction of gene-independent cell responses to CRISPR-Cas9 targeting

Affiliations

Unsupervised correction of gene-independent cell responses to CRISPR-Cas9 targeting

Francesco Iorio et al. BMC Genomics. .

Abstract

Background: Genome editing by CRISPR-Cas9 technology allows large-scale screening of gene essentiality in cancer. A confounding factor when interpreting CRISPR-Cas9 screens is the high false-positive rate in detecting essential genes within copy number amplified regions of the genome. We have developed the computational tool CRISPRcleanR which is capable of identifying and correcting gene-independent responses to CRISPR-Cas9 targeting. CRISPRcleanR uses an unsupervised approach based on the segmentation of single-guide RNA fold change values across the genome, without making any assumption about the copy number status of the targeted genes.

Results: Applying our method to existing and newly generated genome-wide essentiality profiles from 15 cancer cell lines, we demonstrate that CRISPRcleanR reduces false positives when calling essential genes, correcting biases within and outside of amplified regions, while maintaining true positive rates. Established cancer dependencies and essentiality signals of amplified cancer driver genes are detectable post-correction. CRISPRcleanR reports sgRNA fold changes and normalised read counts, is therefore compatible with downstream analysis tools, and works with multiple sgRNA libraries.

Conclusions: CRISPRcleanR is a versatile open-source tool for the analysis of CRISPR-Cas9 knockout screens to identify essential genes.

Keywords: Bias correction; CRISPR-Cas9; Cancer; Gene copy number; Genetic screens.

PubMed Disclaimer

Conflict of interest statement

Not applicable.

Not applicable.

E.A.S. is an employee of GlaxoSmithKline.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
Heterogeneous gene-independent responses to CRISPR-Cas9 targeting. a Average logFC values of sgRNA within segments of equal CN (excluding FE and histones) for three cell lines. Each circle represents a CN segment of the indicated copy number. Asterisks mark the CN at which a significance difference (Welchs t-test, p < 0.05) is initially (starting point) and continuously (critical point) observed compared to logFC values at CN = 2. Box-plots show the median, inter-quartile ranges and 95% confidence intervals. b Same as for A but considering only non-expressed genes (FPKM < 0.05). (c,d,e) Segments of equal gene copy number and segments of equal sgRNA logFCs for selected chromosomes in three cell lines
Fig. 2
Fig. 2
Unsupervised detection of segments of equal sgRNA logFCs and their correction. a and b Example segments of equal gene copy number and equal sgRNA logFC values detected and corrected by CRISPRcleanR in two cell lines. c logFC values of sgRNAs of the entire library for all cell lines grouped according to the copy number of their targeted gene before (left) and after (right) CRISPRcleanR correction. Box-plots show the median, inter-quartile ranges and 95% confidence intervals. d Recall curves of sgRNA when classified as targeting amplified genes, amplified non-expressed genes, FE genes, and non-essential genes before and after CRISPRcleanR correction, for an example cell line (EPLC−272H). e Assessment of CRISPRcleanR correction comparing Recall at 5% FDR (top row) or area under the Recall curve (AURC, bottom row) of genes in six predefined gene sets based on their uncorrected or corrected logFCs (averaged across targeting sgRNAs)
Fig. 3
Fig. 3
CRISPRcleanR is effective with multiple different sgRNA libraries. a Recall curves for three sgRNA libraries when classifying sgRNAs targeting amplified genes, amplified non-expressed genes, FE genes, and non-essential genes using sgRNA logFCs before (first row of plots) and after (second row of plots) CRISPRcleanR correction. b Variation of the area under the recall curve for sgRNAs targeting genes in six predefined sets, based on their uncorrected/corrected logFCs, across the three different libraries (one circle per library). c Segments within chromosome 8 of equal gene copy number juxtaposed to segments of equal sgRNA logFCs before and after CRISPRcleanR in HT-29 cells screened with three different sgRNA libraries. The position of MYC is shown with a blue line
Fig. 4
Fig. 4
CRISPRcleanR retains overall essentiality profiles. a Example precision/recall curves in HuP-T3 cells for the indicated number of top depleted sgRNAs after CRISPRcleanR correction, classified based on their un-corrected sgRNAs logFC rank position. b Area under the precision/recall curves defined as for A for all cell lines. Box-plots show the median, inter-quartile ranges and 95% confidence intervals
Fig. 5
Fig. 5
CRISPRcleanR corrected sgRNA counts and downstream analysis with MAGeCK. a and b Normalised counts of sgRNAs of the transfected libraries versus the control plasmid for FE and non-essential genes (first two rows of plots), CN amplified genes (third row) and CN non-expressed genes (fourth row), for two example cell lines before (first and third column) and after (second and fourth column) CRISPRcleanR correction. Essentialities for CN-amplified cancer driver genes such as MYC, ERBB2 and CCND1 are retained post correction. For the sake of readability only genes with at least 10 copies have been highlighted. c Comparison of recall using MAGeCK for sgRNAs targeting genes in six predefined gene sets when using as input CRISPRcleanR uncorrected and corrected sgRNAs counts
Fig. 6
Fig. 6
CRISPRcleanR enables detection of cancer gene dependencies. a Detection of EGFR and PIK3CA dependencies at the level of targeting sgRNAs in mutant cancer cell lines. Rank position of sgRNAs targeting the indicated genes before (top) and after (bottom) CRISPRcleanR correction. FE and non-essential genes are shown for comparison. b A CN-amplified region of chromosome 8 in HT-29 cell line including MYC and 3 surrounding up-streaming/down-streaming genes. Expanded view of sgRNAs targeting MYC and its surrounding genes, with each gene identified by a different colour. The heatmaps (first 7 columns) show ranked positions of the sgRNAs targeting the 7 considered genes (blue bars) before (top heatmap) and after (bottom heatmap) CRISPRcleanR correction. The last two columns show rank positions for the sgRNAs targeting FE genes (second last column) and non-essential genes (last column). c Same as for B but considering a region on chromosome 16 in the NCI-H2170 cell line, including ERBB2 and four flanking upstreaming/downstreaming genes and CDK12

Similar articles

Cited by

References

    1. Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, et al. RNA-guided human genome engineering via Cas9. Science. 2013;339:823–826. doi: 10.1126/science.1232033. - DOI - PMC - PubMed
    1. Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. - DOI - PMC - PubMed
    1. Koike-Yusa H, Li Y, Tan E-P, Velasco-Herrera MDC, Yusa K. Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library. Nat Biotechnol. 2014;32:267–273. doi: 10.1038/nbt.2800. - DOI - PubMed
    1. Shalem O, Sanjana NE, Hartenian E, Shi X, Scott DA, Mikkelson T, et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science. 2014;343:84–87. doi: 10.1126/science.1247005. - DOI - PMC - PubMed
    1. Wang T, Wei JJ, Sabatini DM, Lander ES. Genetic screens in human cells using the CRISPR-Cas9 system. Science. 2014;343:80–84. doi: 10.1126/science.1246981. - DOI - PMC - PubMed