Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan;31(1):159-169.
doi: 10.1101/gr.266932.120. Epub 2020 Nov 25.

Accessing NCBI data using the NCBI Sequence Viewer and Genome Data Viewer (GDV)

Affiliations

Accessing NCBI data using the NCBI Sequence Viewer and Genome Data Viewer (GDV)

Sanjida H Rangwala et al. Genome Res. 2021 Jan.

Abstract

The National Center for Biotechnology Information (NCBI) is an archive providing free access to a wide range and large volume of biological sequence data and literature. Staff scientists at NCBI analyze user-submitted data in the archive, producing gene and SNP annotation and generating sequence alignment tools. NCBI's flagship genome browser, Genome Data Viewer (GDV), displays our in-house RefSeq annotation; is integrated with other NCBI resources such as Gene, dbGaP, and BLAST; and provides a platform for customized analysis and visualization. Here, we describe how members of the biomedical research community can use GDV and the related NCBI Sequence Viewer (SV) to access, analyze, and disseminate NCBI and custom biomedical sequence data. In addition, we report how users can add SV to their own web pages to create a custom graphical sequence display without the need for infrastructure investments or back-end deployments.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
NCBI Sequence Viewer (SV) is the graphical component of many NCBI pages and is configured to display data appropriate to the resource page. (A) SV located on an NCBI gene record page. Brown labels indicate tracks for sequence, gene annotation, and RNA-seq data. (B) SV located on an NCBI SNP record page. Brown labels indicate tracks for gene annotation, RefSNPs, Clinical (ClinVar), missense, and frameshift variants. A marker is automatically placed over the variant described on the page.
Figure 2.
Figure 2.
Genome Data Viewer (GDV) home page (https://www.ncbi.nlm.nih.gov/genome/gdv/). Users can search for an organism (A) or browse the taxonomic tree (B). There is also an option to navigate the available genome assemblies in table form (C). The panel on the right side provides information and links for the selected assembly. Adding a location or gene symbol in the search box (D), clicking on the “browse genome” button (E), or clicking on a chromosome in the ideogram (F) will take the user directly to the genome browser view. There are also options to BLAST the genome (G), download information using the NCBI Datasets interface (H), and obtain information about the annotation (I).
Figure 3.
Figure 3.
GDV. (A) NCBI SV embedded application; (B) search box; (C) region selector ideogram; (D) exon navigator; (E) zoom and pan options from SV; (F) tracks configuration menu from SV to add NCBI-provided tracks; (G) user data and track hubs and BLAST widgets to add custom tracks; (H) download data options from SV; (I) buttons to download track data, change track settings, or hide tracks (x); (J) share this page link; and (K) news banner.
Figure 4.
Figure 4.
Tracks supplied by NCBI can be added using the track configuration panel (A), which can be accessed from the tracks menu located in the SV toolbar. The tracks menu on the toolbar also provides the option to add predetermined sets of tracks from the NCBI recommended track sets menu (B) or private track sets saved in “My NCBI Track Collections” (C). Track display settings can be changed within the track configuration panel (D).
Figure 5.
Figure 5.
Track data displays for different track types. (A) Gene annotation track showing two transcript variants in a gene selected using mouse clicks. Green indicates gene; purple, transcript; red, coding sequence (CDS). Red hairlines denote positions where the transcript variants show differences in splicing. (B) MultiWig files displayed as overlay (top) or stacked (bottom set) graphs. (C) Pile-up graphs and tables for alignment data, for example, BAM files. (D) Example of data available in the tooltip of an alignment, along with a pop-up view showing sequence of unaligned nucleotides.
Figure 6.
Figure 6.
GDV session displaying the BLAST Alignment Inspector. When the cursor hovers over an aligned region in the BLAST Alignment Inspector (A), the corresponding region in the SV is highlighted as well. Note the full list of alignments is reported in a table in the BLAST panel (B). BLAST alignments can also be viewed as tracks added to the SV (C).
Figure 7.
Figure 7.
Organization of data flow for SV to NCBI in-house tools (such as the NCBI GDV browser) and non-NCBI web pages. Both GDV and non-NCBI web pages communicate with the NCBI back-end to obtain sequence and annotation data hosted at NCBI. Non-NCBI streaming data must also access NCBI back-end servers in order to be displayed in the NCBI graphical viewer application.

Similar articles

  • Database resources of the National Center for Biotechnology Information.
    Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, Federhen S, Geer LY, Kapustin Y, Khovayko O, Landsman D, Lipman DJ, Madden TL, Maglott DR, Ostell J, Miller V, Pruitt KD, Schuler GD, Sequeira E, Sherry ST, Sirotkin K, Souvorov A, Starchenko G, Tatusov RL, Tatusova TA, Wagner L, Yaschenko E. Wheeler DL, et al. Nucleic Acids Res. 2007 Jan;35(Database issue):D5-12. doi: 10.1093/nar/gkl1031. Epub 2006 Dec 14. Nucleic Acids Res. 2007. PMID: 17170002 Free PMC article.
  • A collection of plant-specific genomic data and resources at NCBI.
    Tatusova T, Smith-White B, Ostell J. Tatusova T, et al. Methods Mol Biol. 2007;406:61-87. doi: 10.1007/978-1-59745-535-0_3. Methods Mol Biol. 2007. PMID: 18287688
  • Database resources of the National Center for Biotechnology Information.
    NCBI Resource Coordinators. NCBI Resource Coordinators. Nucleic Acids Res. 2014 Jan;42(Database issue):D7-17. doi: 10.1093/nar/gkt1146. Epub 2013 Nov 19. Nucleic Acids Res. 2014. PMID: 24259429 Free PMC article.
  • Database resources of the National Center for Biotechnology Information.
    Sayers EW, Beck J, Bolton EE, Bourexis D, Brister JR, Canese K, Comeau DC, Funk K, Kim S, Klimke W, Marchler-Bauer A, Landrum M, Lathrop S, Lu Z, Madden TL, O'Leary N, Phan L, Rangwala SH, Schneider VA, Skripchenko Y, Wang J, Ye J, Trawick BW, Pruitt KD, Sherry ST. Sayers EW, et al. Nucleic Acids Res. 2021 Jan 8;49(D1):D10-D17. doi: 10.1093/nar/gkaa892. Nucleic Acids Res. 2021. PMID: 33095870 Free PMC article. Review.
  • Education resources of the National Center for Biotechnology Information.
    Cooper PS, Lipshultz D, Matten WT, McGinnis SD, Pechous S, Romiti ML, Tao T, Valjavec-Gratian M, Sayers EW. Cooper PS, et al. Brief Bioinform. 2010 Nov;11(6):563-9. doi: 10.1093/bib/bbq022. Epub 2010 Jun 22. Brief Bioinform. 2010. PMID: 20570844 Free PMC article. Review.

Cited by

References

    1. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, et al. 2013. NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res 41: D991–D995. 10.1093/nar/gks1193 - DOI - PMC - PubMed
    1. Boratyn GM, Camacho C, Cooper PS, Coulouris G, Fong A, Ma N, Madden TL, Matten WT, McGinnis SD, Merezhuk Y, et al. 2013. BLAST: a more efficient report with usability improvements. Nucleic Acids Res 41: W29–W33. 10.1093/nar/gkt282 - DOI - PMC - PubMed
    1. Buels R, Yao E, Diesh CM, Hayes RD, Munoz-Torres M, Helt G, Goodstein DM, Elsik CG, Lewis SE, Stein L, et al. 2016. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol 17: 66 10.1186/s13059-016-0924-1 - DOI - PMC - PubMed
    1. Church DM, Schneider VA, Graves T, Auger K, Cunningham F, Bouk N, Chen HC, Agarwala R, McLaren WM, Ritchie GR, et al. 2011. Modernizing reference genome assemblies. PLoS Biol 9: e1001091 10.1371/journal.pbio.1001091 - DOI - PMC - PubMed
    1. Computational Pan-Genomics Consortium. 2018. Computational pan-genomics: status, promises and challenges. Brief Bioinform 19: 118–135. 10.1093/bib/bbw089 - DOI - PMC - PubMed

Publication types