Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 14;32(2):107882.
doi: 10.1016/j.celrep.2020.107882.

High Frequency of Shared Clonotypes in Human T Cell Receptor Repertoires

Affiliations

High Frequency of Shared Clonotypes in Human T Cell Receptor Repertoires

Cinque Soto et al. Cell Rep. .

Abstract

The collection of T cell receptors (TCRs) generated by somatic recombination is large but unknown. We generate large TCR repertoire datasets as a resource to facilitate detailed studies of the role of TCR clonotypes and repertoires in health and disease. We estimate the size of individual human recombined and expressed TCRs by sequence analysis and determine the extent of sharing between individual repertoires. Our experiments reveal that each blood sample contains between 5 million and 21 million TCR clonotypes. Three individuals share 8% of TCRβ- or 11% of TCRα-chain clonotypes. Sorting by T cell phenotypes in four individuals shows that 5% of naive CD4+ and 3.5% of naive CD8+ subsets share their TCRβ clonotypes, whereas memory CD4+ and CD8+ subsets share 2.3% and 0.4% of their clonotypes, respectively. We identify the sequences of these shared TCR clonotypes that are of interest for studies of human T cell biology.

Keywords: CDR3; HLA; TCR; adaptive immunity; clonotypes; immune repertoire sequencing.

PubMed Disclaimer

Conflict of interest statement

Declaration of Interests J.E.C. has served as a consultant for Takeda Vaccines, Sanofi Pasteur, Pfizer, and Novavax; is on the scientific advisory boards of CompuVax and Meissa Vaccines; and is founder of IDBiologics, Inc.

Figures

Figure 1.
Figure 1.. TCRβ V3J Clonotype Diversity for Subjects HIP1, HIP2, and HIP3
(A) Rarefaction curves for species richness of V3J clonotypes generated using the program RTK (Saary et al., 2017). HIP1 had an endpoint species richness value of 5.9 million V3J clonotypes (left panel), HIP2 had a value of 17.6 million V3J clonotypes (middle panel), and HIP3 had a value of 21.1 million V3J clonotypes (right panel). The endpoint estimates for species richness appear on each plot with a filled-in circle. (B) Recon (Kaplinsky and Arnaout, 2016) estimates suggested about 38.9 million V3J clonotypes were not observed at this depth of sequencing for HIP1 (left panel), 134 million for HIP2 (middle panel), or 120 million for HIP3 (right panel). The observed values for clonotypes binned by their repeat frequency (clonotype group size) is represented by an open circle, and theoretical fits obtained using Recon are represented by an X. For clarity, only the first 25 clonotype group sizes are shown on the plot. See also Figure S1.
Figure 2.
Figure 2.. Shared V3J Clonotypes for TCRβ Chains Belonging to Subjects HIP1, HIP2, and HIP3
(A) Shared V3J clonotypes for experimentally determined TCRβ chains. (B) Shared V3DJ clonotypes for experimentally determined TCRβ chains. (C) Shared V3DJ clonotypes for synthetic repertoires generated using the program IGoR (Marcou et al., 2018). The pairwise percentage overlaps were based on the average computed for 50,000 comparisons. The average and standard error of the mean (SEM) for the pairwise percentage overlaps were 0.2% (2.0 × 10−5) between simHIP1 and simHIP2, 0.3% (2.0 × 10−5) between simHIP1 and simHIP3, and 0.2% (1.0 × 10−5) between simHIP2 and simHIP3. The average (SEM) for the percentage overlaps among all three synthetic repertoires was 0.01% (4.0 × 10−6). Reducing the CDR3 lengths of the V3DJ clonotypes in (B) so that the maximum length is 19 amino acids did not change the value of the percentage overlaps. (D) Histogram of overlap counts among all three synthetic repertoires, simHIP1, simHIP2, and simHIP3, for 50,000 comparisons. Ranking the overlap count among the three experimentally determined TCRβ repertoires (n = 89,831 shared V3DJ clonotypes) against those obtained from the three synthetic repertoires gave an estimated p = 0.002. See also Figure S2.
Figure 3.
Figure 3.. TCRβ Repertoire Statistics for Subjects HIP1, HIP2, and HIP3 Using the mRNA or gDNA Sequencing Methods
(A) Boxplot showing CDR3 length distribution of repertoires obtained from the mRNA or gDNA sequencing methods. Sequencing from the mRNA method resulted in a median CDR3 length of 13 amino acids for HIP1 (n = 3,161,410 unique CDR3s), HIP2 (n = 5,104,666 unique CDR3s), and HIP3 (n = 9,182,164 unique CDR3s). Sequencing from the gDNA method resulted in median CDR3 lengths of 13 amino acids for HIP1 (n = 2,443,117 unique CDR3s), HIP2 (n = 9,967,538 unique CDR3s), and HIP3 (n = 9,018,345 unique CDR3s). Each box represents the interquartile range (IQR) from 25% to 75%. The median is represented by a line within each box, and the whiskers are each within 1.5 of the IQR. There was no statistical difference in the median values according to a Kruskal-Wallis test (p < 2.2E–16). (B) Morisita-Horn indices for subject HIP1, HIP2, or HIP3 between mRNA and gDNA sequencing methods. (C) Distributions of clonotype abundances for the mRNA and gDNA methods represented as the cumulative fraction of the repertoire versus the cumulative fraction of unique TCRβ V3J clonotypes. The clonotypes were ordered from largest to smallest based on the total number of unique somatic variants associated with each clonotype. (D) TCRβ V3J clonotype overlap percentages for subject HIP1, HIP2, or HIP3 between mRNA and gDNA sequencing methods. Subsampling yielded median percentage overlaps of 6.8% ± 1.1% × 10−2, 5.6% ± 7.5% × 10−3 or 6.6% ± 5.5% × 10−3 for subjects HIP1, HIP2 or HIP3, respectively. See also Figure S3.
Figure 4.
Figure 4.. Pairwise Percentage Overlaps of T Cell Subsets Based on gDNA Sequencing for Subject HIP2, HIP3, HIP4, or HIP5
(A) Pairwise percentage clonotype overlaps for naive or memory CD4+ T cell subsets. (B) Pairwise percentage clonotype overlaps for naive or memory CD8+ T cell subsets. See also Tables S3 and S4 and Figure S4.
Figure 5.
Figure 5.. TCRβ Clonotype Sharing for T Cell Subsets
The total number of shared clonotypes in each T cell subset belonging to subjects HIP2, HIP3, HIP4, and HIP5 was determined. The percentage overlaps for all five subjects were also determined in each T cell subset and are represented as white text in each figure. (A) CD4+ naive cell subset. (B) CD4+ memory cell subset. (C) CD8+ naive subset. (D) CD8+ memory cell subset. See also Figure S5.
Figure 6.
Figure 6.. TCRβ Clonotypes from HIP Subjects Appearing in GenBank
TCRβ V3J clonotypes from HIP subjects were used to search against the entire GenBank database for possible matches. (A) Exact matches were grouped either as patented sequences or into one of five categories that focused on the target: viral target, bacterial target, autoimmune target, cancer target, and other. (B) Representative amino acid sequence alignments between the V region from GenBank and the V region from the HIP subject. In cases in which the sequence was missing in the framework region or regions, the closest-matching germline sequence was used to fill in the missing region. The filled-in portion of the sequence is highlighted by the thick gray line of the alignment. See also Table S5.

Similar articles

Cited by

References

    1. Alanio C, Nicoli F, Sultanik P, Flecken T, Perot B, Duffy D, Bianchi E, Lim A, Clave E, van Buuren MM, et al. (2015). Bystander hyperactivation of preimmune CD8+ T cells in chronic HCV patients. eLife 4, e07916. - PMC - PubMed
    1. Andrews S (2012). FastQC: A quality control tool for high throughput sequence data https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
    1. Arnaout R, Lee W, Cahill P, Honan T, Sparrow T, Weiand M, Nusbaum C, Rajewsky K, and Koralov SB (2011). High-resolution description of antibody heavy-chain repertoires in humans. PLoS ONE 6, e22365. - PMC - PubMed
    1. Bolotin DA, Poslavsky S, Mitrophanov I, Shugay M, Mamedov IZ, Putintseva EV, and Chudakov DM (2015). MiXCR: software for comprehensive adaptive immunity profiling. Nat. Methods 12, 380–381. - PubMed
    1. Boyd SD, Marshall EL, Merker JD, Maniar JM, Zhang LN, Sahaf B, Jones CD, Simen BB, Hanczaruk B, Nguyen KD, et al. (2009). Measurement and clinical monitoring of human lymphocyte clonality by massively parallel VDJ pyrosequencing. Sci. Transl. Med 1, 12ra23. - PMC - PubMed

Publication types