Characterization of a de novo assembled transcriptome of the Common Blackbird (Turdus merula)

PeerJ. 2017 Dec 13:5:e4045. doi: 10.7717/peerj.4045. eCollection 2017.

Abstract

Background: In recent years, next generation high throughput sequencing technologies have proven to be useful tools for investigations concerning the genomics or transcriptomics also of non-model species. Consequently, ornithologists have adopted these technologies and the respective bioinformatics tools to survey the genomes and transcriptomes of a few avian non-model species. The Common Blackbird is one of the most common bird species living in European cities, which has successfully colonized urban areas and for which no reference genome or transcriptome is publicly available. However, to target questions like genome wide gene expression analysis, a reference genome or transcriptome is needed.

Methods: Therefore, in this study two Common Blackbirds were sacrificed, their mRNA was isolated and analyzed by RNA-Seq to de novo assemble a transcriptome and characterize it. Illumina reads (125 bp paired-end) and a Velvet/Oases pipeline led to 162,158 transcripts. For the annotation (using Blast+), an unfiltered protein database was used. SNPs were identified using SAMtools and BCFtools. Furthermore, mRNA from three single tissues (brain, heart and liver) of the same two Common Blackbirds were sequenced by Illumina (75 bp single-end reads). The draft transcriptome and the three single tissues were compared by their BLAST hits with the package VennDiagram in R.

Results: Following the annotation against protein databases, we found evidence for 15,580 genes in the transcriptome (all well characterized hits after annotation). On 18% of the assembled transcripts, 144,742 SNPs were identified which are, consequently, 0.09% of all nucleotides in the assembled transcriptome. In the transcriptome and in the single tissues (brain, heart and liver), 10,182 shared genes were found.

Discussion: Using a next-generation technology and bioinformatics tools, we made a first step towards the genomic investigation of the Common Blackbird. The de novo assembled transcriptome is usable for downstream analyses such as differential gene expression analysis and SNP identification. This study shows the importance of the approach to sequence single tissues to understand functions of tissues, proteins and the phenotype.

Keywords: Common blackbird; Ornithology; RNAseq; SNP; Transcriptomics.

Grants and funding

The authors received financial support from the Deutsche Forschungsgemeinschaft and Ruprecht-Karls-Universität Heidelberg within the funding programme Open Access Publishing. The authors acknowledge support by the state of Baden-Württemberg through bwHPC and the German Research Foundation (DFG) through grant INST 35/1134-1 FUGG. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.