Maximum entropy models for antibody diversity
- PMID: 20212159
- PMCID: PMC2851784
- DOI: 10.1073/pnas.1001705107
Abstract
Recognition of pathogens relies on families of proteins showing great diversity. Here we construct maximum entropy models of the sequence repertoire, building on recent experiments that provide a nearly exhaustive sampling of the IgM sequences in zebrafish. These models are based solely on pairwise correlations between residue positions but correctly capture the higher order statistical properties of the repertoire. By exploiting the interpretation of these models as statistical physics problems, we make several predictions for the collective properties of the sequence ensemble: The distribution of sequences obeys Zipf's law, the repertoire decomposes into several clusters, and there is a massive restriction of diversity because of the correlations. These predictions are completely inconsistent with models in which amino acid substitutions are made independently at each site and are in good agreement with the data. Our results suggest that antibody diversity is not limited by the sequences encoded in the genome and may reflect rapid adaptation to antigenic challenges. This approach should be applicable to the study of the global properties of other protein families.
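The Zipf's law prediction above can be illustrated with a rank-frequency check. The sketch below is not the authors' analysis: the helper names (`rank_frequency`, `zipf_slope`) and the toy sequence labels are assumptions made for illustration, and the toy counts are constructed to fall off exactly as 1/rank. It counts how often each distinct sequence occurs, sorts the counts by rank, and fits a least-squares line to log(count) against log(rank); a slope near -1 is what Zipf's law would give.

```python
from collections import Counter
import math


def rank_frequency(sequences):
    """Return the counts of distinct sequences, most frequent first."""
    counts = Counter(sequences)
    return sorted(counts.values(), reverse=True)


def zipf_slope(counts):
    """Least-squares slope of log(count) versus log(rank).

    `counts` must be sorted in decreasing order and contain at least
    two positive entries.
    """
    if len(counts) < 2:
        raise ValueError("need at least two distinct sequences")
    xs = [math.log(rank) for rank in range(1, len(counts) + 1)]
    ys = [math.log(count) for count in counts]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy data (hypothetical, not from the paper): six distinct "sequences"
# whose abundances are 1200 / rank, i.e. an exact Zipf distribution.
toy_sample = []
for rank, count in enumerate([1200, 600, 400, 300, 240, 200], start=1):
    toy_sample.extend([f"SEQ{rank}"] * count)

toy_counts = rank_frequency(toy_sample)
toy_slope = zipf_slope(toy_counts)
print(toy_counts)
print(round(toy_slope, 6))
```

On real repertoire data the fit would normally be restricted to the range of ranks where sampling noise is small; that choice is left open here.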
Conflict of interest statement
The authors declare no conflict of interest.
