Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Jun;14(6):1188-90.
doi: 10.1101/gr.849004.

WebLogo: A Sequence Logo Generator

Affiliations
Free PMC article

WebLogo: A Sequence Logo Generator

Gavin E Crooks et al. Genome Res. .
Free PMC article

Abstract

WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization.

Figures

Figure 1
Figure 1
(A) CAP (Catabolite Activator Protein, also known as CRP) acts as a transcription promoter by binding at more than 100 sites within the Escherichia coli genome. We rendered the PDB structure 1CGP (Schultz et al. 1991) using Chimera (Huang et al. 1996). (B) The two DNA recognition helices of the CAP homodimer insert themselves into consecutive turns of the major groove. Several consequences can be observed in this CAP binding-site logo. The logo is approximately palindromic, which provides two very similar recognition sites, one for each subunit of the dimer. However, the binding site lacks perfect symmetry, possibly due to the inherent asymmetry of the operon promoter region. The displacement of the two halves is 11 bp, or approximately one full turn of the DNA helix. Additional interactions occur between the protein and the first and last two bases within the DNA minor groove, where the protein cannot easily distinguish A from T, or G from C (Seeman et al. 1976). The data for this logo consists of 59 binding sites determined by DNA footprinting (Robison et al. 1998). (C) The helix-turn-helix motif from the CAP family of homodimeric DNA binding proteins (Brennan and Matthews 1989; Schultz et al. 1991). Positions 180, 181, and 185 are known to interact directly with bases in the major groove (Schultz et al. 1991; Parkinson et al. 1996) and are critical to the sequence-specific binding of the protein. The conserved glycine at position 177 is located inside of the turn between the helices, where packing effects prevent the insertion of a side chain. Partially or completely buried positions (labeled B) frequently contain hydrophobic amino acids, which are colored black. The data for this logo consists of 100 sequences from the full Pfam (Bateman et al. 2002) alignment of this family (Accession no. PF00325). We removed a few sequences with rare insertions for convenience.

Similar articles

See all similar articles

Cited by 4,235 articles

See all "Cited by" articles

Publication types

Feedback