SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes

BMC Genomics. 2019 Sep 2;20(1):694. doi: 10.1186/s12864-019-6051-0.

Abstract

Background: Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome duplications. Although genome assemblies now exist for a number of salmonid species, the lack of regulatory annotations are holding back our mechanistic understanding of how genetic variation in non-coding regulatory regions affect gene expression and the downstream phenotypic effects.

Results: We present SalMotifDB, a database and associated web and R interface for the analysis of transcription factors (TFs) and their cis-regulatory binding sites in five salmonid genomes. SalMotifDB integrates TF-binding site information for 3072 non-redundant DNA patterns (motifs) assembled from a large number of metazoan motif databases. Through motif matching and TF prediction, we have used these multi-species databases to construct putative regulatory networks in salmonid species. The utility of SalMotifDB is demonstrated by showing that key lipid metabolism regulators are predicted to regulate a set of genes affected by different lipid and fatty acid content in the feed, and by showing that our motif database explains a significant proportion of gene expression divergence in gene duplicates originating from the salmonid specific whole genome duplication.

Conclusions: SalMotifDB is an effective tool for analyzing transcription factors, their binding sites and the resulting gene regulatory networks in salmonid species, and will be an important tool for gaining a better mechanistic understanding of gene regulation and the associated phenotypes in salmonids. SalMotifDB is available at https://salmobase.org/apps/SalMotifDB .

Keywords: Gene regulation; Regulatory networks; Salmonid genomics; Transcription factor binding sites; Web tool.

MeSH terms

  • Animals
  • DNA / chemistry
  • Databases, Genetic*
  • Gene Duplication / genetics
  • Gene Regulatory Networks
  • Genomics / methods*
  • Lipid Metabolism / genetics
  • Nucleotide Motifs
  • Protein Binding
  • Salmonidae / genetics*
  • Transcription Factors / metabolism*

Substances

  • Transcription Factors
  • DNA