PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries

PLoS One. 2018 Feb 23;13(2):e0193332. doi: 10.1371/journal.pone.0193332. eCollection 2018.

Abstract

The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bacteriophages*
  • Computational Biology
  • Escherichia coli
  • Oligonucleotides / genetics
  • Oligonucleotides / metabolism
  • Peptide Library*
  • Software*

Substances

  • Oligonucleotides
  • Peptide Library