Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 13;13(1):7777.
doi: 10.1038/s41598-023-34467-3.

Immune gene variation associated with chromosome-scale differences among individual zebrafish genomes

Affiliations

Immune gene variation associated with chromosome-scale differences among individual zebrafish genomes

Sean C McConnell et al. Sci Rep. .

Abstract

Immune genes have evolved to maintain exceptional diversity, offering robust defense against pathogens. We performed genomic assembly to examine immune gene variation in zebrafish. Gene pathway analysis identified immune genes as significantly enriched among genes with evidence of positive selection. A large subset of genes was absent from analysis of coding sequences due to apparent lack of reads, prompting us to examine genes overlapping zero coverage regions (ZCRs), defined as 2 kb stretches without mapped reads. Immune genes were identified as highly enriched within ZCRs, including over 60% of major histocompatibility complex (MHC) genes and NOD-like receptor (NLR) genes, mediators of direct and indirect pathogen recognition. This variation was most highly concentrated throughout one arm of chromosome 4 carrying a large cluster of NLR genes, associated with large-scale structural variation covering more than half of the chromosome. Our genomic assemblies uncovered alternative haplotypes and distinct complements of immune genes among individual zebrafish, including the MHC Class II locus on chromosome 8 and the NLR gene cluster on chromosome 4. While previous studies have shown marked variation in NLR genes between vertebrate species, our study highlights extensive variation in NLR gene regions between individuals of the same species. Taken together, these findings provide evidence of immune gene variation on a scale previously unknown in other vertebrate species and raise questions about potential impact on immune function.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Sequence variants and evidence of positive selection. (A) Single nucleotide variants (SNVs) or (B) small insertions/deletions (Indels) were identified using GATK haplotype caller and reported as a percentage of each genome. Both heterozygous (Het.) and homozygous (Hom.) variants are shown. (C) Percentage of base pairs in each genome covered by Zero Coverage Regions (ZCRs), defined as no reads mapped over ≥ 2 kb intervals. Manhattan plots of the ratio of non-synonymous to synonymous mutations (dN/dS) per allele for three human (D) or zebrafish (E) individuals. Each dot represents the log10 ratio of nonsynonymous to synonymous SNVs of one gene with variants. The black horizontal line at ‘0’ indicates alleles under neutral selection, i.e. those having a dN/dS ratio of 1 (the ratio for each allele is plotted on a log10 scale). A large fraction of genes throughout the right arm of zebrafish chromosome 4 (indicated by arrow) have evidence of positive selection (dN/dS > 1). CHM, CEU, and YRI are samples from the 1000 Genomes Project representing: a haploid complete hydatidiform mole, CHM1; Utah Resident (CEPH) with European Ancestry, NA12878; Yoruba in Ibadan, Nigeria, 19240; respectively. CG2 and CG1 are clonal (homozygous diploid) zebrafish lines, and AB3 is a partially inbred fish, all on the AB genetic background.
Figure 2
Figure 2
Chromosomal distribution of zero coverage regions. Comparison of (A) human chromosome 6 (location of the human MHC locus), (B) zebrafish chromosome 19 (location of the zebrafish core MHC locus), (C) human chromosome 11 (location of 4 out of 25 human NLR gene family members), and (D) zebrafish chromosome 4 (location of over 300 zebrafish NLR genes). Zero coverage regions (no mapped reads over ≥ 2 kb intervals) are displayed in red. Gene annotation is shown in blue with a small number of genes labeled. ZCRs are found more densely in zebrafish chromosomes compared with human chromosomes and a large concentration of ZCRs is distributed throughout the heterochromatic right arm of zebrafish chromosome 4 with evidence of haplotypic differences between individuals.
Figure 3
Figure 3
Gene annotation enrichment analysis. Genes with evidence of positive selection (dN/dS > 1) or genes with at least one exon overlapping zero coverage regions (ZCRs, without any mapped reads over ≥ 2 kb intervals) were analyzed using GO (Gene Ontology) annotation to identify genes enriched in specific biological processes. GO enrichments were summarized using REVIGO. Remaining terms after adjustment for redundancy are represented as scatterplots, following semantic similarities. Bubble color indicates the log10 q-value/enrichment (see legend on right) and bubble size indicates the GO term frequency, where smaller bubbles imply more specific terms. Data are shown for zebrafish genes under positive selection (A), zebrafish genes overlapping ZCRs (B), human genes under positive selection (C), and human genes overlapping ZCRs (D). Lists of enriched pathways with genes are provided in Table S4.
Figure 4
Figure 4
Zero coverage regions highlight unique haplotypes. Selected 100 kb region plots including (A) HLA-DRB5 gene on human chromosome 6, (B) LCE3C gene on human chromosome 1, (C) mhc2dab gene on zebrafish chromosome 8, and (D) NLR gene (si:dkeyp-90h9.1) on zebrafish chromosome 4. Plots show mapped sequence read coverage across each region in green. Zero Coverage Regions (no mapped reads over ≥ 2 kb intervals) are displayed in red. Representative examples shown here were selected due to different patterns of coverage for individuals of the same species, indicating the presence of an alternative haplotype at that locus. We note similar findings indicating alternative haplotypes for additional immune gene loci throughout the zebrafish genome, including MHC Class I genes on chromosome 25 and NLR genes on chromosomes beyond chromosome 4, as highlighted in Fig. S2.
Figure 5
Figure 5
MHC Class II genes on zebrafish chromosome 8. (A) Read coverage across the zebrafish Class II MHC locus shows marked variability between individuals. Unlike the CG2 and CG1 fish, the AB3 zebrafish genome has a cluster of zero coverage regions (ZCRs, without any mapped reads over ≥ 2 kb intervals) in the region surrounding mhc2dab (highlighted in light red). In contrast, the CG2 and CG1 fish have a cluster of ZCRs in the region surrounding mhc2dgb (highlighted in light blue). (B) A detailed view of the region highlighted above in light blue (A) showing ZCRs overlapping the neighboring mhc2dgb and mhc2dga genes. (C) A detailed view of the region highlighted above in light red (A) showing ZCRs overlapping the neighboring mhc2dab and mhc2daa genes. Read coverage is depicted in green, ZCRs are in red, and scaffolds from Discovar assemblies that align to the reference genome are grey.
Figure 6
Figure 6
MHC and NLR genes associated with zero coverage regions (ZCRs). The percentage of (A) major histocompatibility complex (MHC) genes, (B) NOD-like receptor (NLR) genes, or (C) all genes in each of three human or zebrafish genomes, with at least one exon overlapping ZCRs. ‘Comb.’ refers to the combined list of genes from all three individuals that overlaps with ZCRs. CHM, CEU, and YRI are human samples from the 1000 Genomes Project representing: a haploid complete hydatidiform mole, CHM1; Utah Resident (CEPH) with European Ancestry, NA12878; Yoruba in Ibadan, Nigeria, 19240; respectively. CG2 and CG1 are clonal zebrafish lines, and AB3 is a partially inbred fish, all on the AB genetic background (the gene lists used for (A,B) are found in Table S9. Total number of zebrafish MHC genes = 42; total number of human MHC genes = 37. Total number of zebrafish NLR genes = 368. The data used to make the bar graphs are found in Table S11).

Similar articles

Cited by

References

    1. Litman GW, Cannon JP, Dishaw LJ. Reconstructing immune phylogeny: New perspectives. Nat. Rev. Immunol. 2005;5:866–879. doi: 10.1038/nri1712. - DOI - PMC - PubMed
    1. Criscitiello MF, de Figueiredo P. Fifty shades of immune defense. Heitman J, editor. PLoS Pathog. 2013;9:e1003110. doi: 10.1371/journal.ppat.1003110. - DOI - PMC - PubMed
    1. Flajnik MF. A cold-blooded view of adaptive immunity. Nat. Rev. Immunol. 2018;18:438–453. doi: 10.1038/s41577-018-0003-9. - DOI - PMC - PubMed
    1. Turvey SE, Broide DH. Innate immunity. J. Allergy Clin. Immunol. 2010;125:S24–32. doi: 10.1016/j.jaci.2009.07.016. - DOI - PMC - PubMed
    1. Maekawa T, Kufer TA, Schulze-Lefert P. NLR functions in plant and animal immune systems: So far and yet so close. Nat. Immunol. 2011;12:817–826. doi: 10.1038/ni.2083. - DOI - PubMed

Publication types

LinkOut - more resources