Reference long-read isoform-aware transcriptomes of 4 human peripheral blood lymphocyte subsets

G3 (Bethesda). 2022 Nov 4;12(11):jkac253. doi: 10.1093/g3journal/jkac253.


Long-read sequencing technologies such as isoform sequencing can generate highly accurate sequences of full-length mRNA transcript isoforms. Such long-read transcriptomics may be especially useful in investigations of lymphocyte functional plasticity as it relates to human health and disease. However, no long-read isoform-aware reference transcriptomes of human circulating lymphocytes are readily available despite being valuable as benchmarks in a variety of transcriptomic studies. To begin to fill this gap, we purified 4 lymphocyte populations (CD4+ T, CD8+ T, NK, and Pan B cells) from the peripheral blood of a healthy male donor and obtained high-quality RNA (RIN > 8) for isoform sequencing and parallel RNA-Seq analyses. Many novel polyadenylated transcript isoforms, supported by both isoform sequencing and RNA-Seq data, were identified within each sample. The datasets met several metrics of high quality and have been deposited to the Gene Expression Omnibus database (GSE202327, GSE202328, GSE202329) as both raw and processed files to serve as long-read reference transcriptomes for future studies of human circulating lymphocytes.

Keywords: B cell; Iso-Seq; NK cell; RNA-Seq; T cell; illumina; isoform; lymphocyte; pacific biosciences; transcriptome.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Gene Expression Profiling*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Lymphocyte Subsets / metabolism
  • Male
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • Sequence Analysis, RNA
  • Transcriptome*


  • Protein Isoforms