Computational identification of CDR3 sequence archetypes among immunoglobulin sequences in chronic lymphocytic leukemia

Leuk Res. 2009 Mar;33(3):368-76. doi: 10.1016/j.leukres.2008.05.022. Epub 2008 Jul 21.

Abstract

The leukemia cells of unrelated patients with chronic lymphocytic leukemia (CLL) display a restricted repertoire of immunoglobulin (Ig) gene rearrangements with preferential usage of certain Ig gene segments. We developed a computational method to rigorously quantify biases in Ig sequence similarity in large patient databases and to identify groups of patients with unusual levels of sequence similarity. We applied our method to sequences from 1577 CLL patients through the CLL Research Consortium (CRC), and identified 67 similarity groups into which roughly 20% of all patients could be assigned. Immunoglobulin light chain class was highly correlated within all groups and light chain gene usage was similar within sets. Surprisingly, over 40% of the identified groups were composed of somatically mutated genes. This study significantly expands the evidence that antigen selection shapes the Ig repertoire in CLL.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Complementarity Determining Regions*
  • Computational Biology*
  • Humans
  • Leukemia, Lymphocytic, Chronic, B-Cell / immunology*

Substances

  • Complementarity Determining Regions