Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 21:10:e13146.
doi: 10.7717/peerj.13146. eCollection 2022.

State of biodiversity documentation in the Philippines: Metadata gaps, taxonomic biases, and spatial biases in the DNA barcode data of animal and plant taxa in the context of species occurrence data

Affiliations

State of biodiversity documentation in the Philippines: Metadata gaps, taxonomic biases, and spatial biases in the DNA barcode data of animal and plant taxa in the context of species occurrence data

Carmela Maria P Berba et al. PeerJ. .

Abstract

Anthropogenic changes in the natural environment have led to alarming rates of biodiversity loss, resulting in a more urgent need for conservation. Although there is an increasing cognizance of the importance of incorporating biodiversity data into conservation, the accuracy of the inferences generated from these records can be highly impacted by gaps and biases in the data. Because of the Philippines' status as a biodiversity hotspot, the assessment of potential gaps and biases in biodiversity documentation in the country can be a critical step in the identification of priority research areas for conservation applications. In this study, we systematically assessed biodiversity data on animal and plant taxa found in the Philippines by examining the extent of metadata gaps, taxonomic biases, and spatial biases in DNA barcode data while using species occurrence data as a backdrop of the 'Philippines' biodiversity. These barcode and species occurrence datasets were obtained from public databases, namely: GenBank, Barcode of Life Data System and Global Biodiversity Information Facility. We found that much of the barcode data had missing information on either records and publishing, geolocation, or taxonomic metadata, which consequently, can limit the usability of barcode data for further analyses. We also observed that the amount of barcode data can be directly associated with the amount of species occurrence data available for a particular taxonomic group and location-highlighting the potential sampling biases in the barcode data. While the majority of barcode data came from foreign institutions, there has been an increase in local efforts in recent decades. However, much of the contribution to biodiversity documentation only come from institutions based in Luzon.

Keywords: Comparative analysis; Conservation; Genetic diversity; Online biodiversity database; Sampling biases; Spatial analysis; Species diversity.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1. Summary of barcode records associated with specific gene markers and issues encountered while manually parsing through the descriptive information on sampling locality.
For graph A, the genetic summary of the available barcode records focuses on the gene markers of interest used in the examination for metadata gaps, taxonomic biases, and spatial biases in DNA barcode data on animal and plant taxa sampled in the Philippines were the following: cytochrome b (CYTB), cytochrome oxidase c subunit I (COI), internal transcribed spacer 2 (ITS2), ribulose-1,5-biphosphate carboxylase (rbcL), and maturase K (matK). For graph B, the geolocation issues resulted in the descriptions of the sampling location (particularly in terms of administrative units) being unclear or in some cases, inconclusive. The categories include misspelled (incorrect spelling), none (no major issue), mixed (more than one issue), unspecified (somewhat informative but still vague), unknown (completely not informative), multiple (provided more than one location), and mismatch (discrepancies between the administrative units provided). This dataset includes the records with NA entries for country sampled (for A and B) and those that had additional information on the geolocation other than the coordinates (for B only).
Figure 2
Figure 2. Relationship between the percentage of barcode records identified at the species level and the proportion of documented species (represented in species occurrence data) that currently have DNA barcode data available.
This relationship was evaluated for each known animal (orange) and plant (green) taxonomic group represented in the Philippine barcode data at the phylum/division (A), class (B), order (C), and family (D) levels. This dataset includes the records with NA entries for country sampled.
Figure 3
Figure 3. Relationship between the amount of genetic and species data associated with each known animal and plant taxonomic group represented in the Philippine biodiversity data at different taxonomic levels.
This relationship was evaluated for each known animal (orange) and plant (green) taxonomic group represented in the Philippine barcode data at the phylum/division (A), class (B), order (C), and family (D) levels. Values were transformed logarithmically prior to plotting however, taxa with zero (0) records in either genetic or species data were assigned the value of negative one (−1). Dashed lines represent the 5th and 95th percentiles for genetic (horizontal) and species (vertical) data. This dataset includes the records with NA entries for country sampled.
Figure 4
Figure 4. Maps of the sampling distribution of barcode and species occurrence data on animal and plant taxa across the Philippines and the relationship between the two datasets in terms of province.
For both maps (A – barcode data and B – species occurrence data), records on marine specimens were assigned to a specific province based on which corresponding centroid has the shortest distance from the given sampling coordinates (if available). Also, values presented in the maps represent the number of records in the thousands. In the scatter plot (C), values were transformed logarithmically and provinces with zero (0) records in either genetic or species data were assigned the value of negative one (−1). Dashed lines represent the 5th and 95th percentiles for genetic (horizontal) and species (vertical) data. The barcode dataset includes the records with NA entries for country sampled.
Figure 5
Figure 5. Map of the distribution of barcode data on Philippine animal and plant biodiversity contributed by different countries across the world and their contribution to documenting efforts across the years.
For map A, contribution was based on the institution that holds the copyright to the image associated with the records while for the graphs, it was based on the collection of samples, starting from the 1990s (B) and submission of barcode data, starting from the 2000s (C) by foreign countries (violet) and the Philippines (red). Trendlines in the graphs represent the average, “best” fitted line. This dataset includes the records with NA entries for country sampled.
Figure 6
Figure 6. Heatmap matrix showcasing the relationship between the number of barcode records associated with regions that have been sampled and the regions of local institutions that contributed the data.
There are officially seventeen regions in the Philippines, represented by the Philippine map (A), with non-numerical regions labelled as follows: ca, Cordillera Administrative Region (CAR); mm, National Capital Region (NCR or also referred to as Metro Manila); and br, Bangsamoro Autonomous Region in Muslim Mindanao (BARMM). Regions are also divided based on their island groups – namely Luzon (red), Visayas (yellow), and Mindanao (blue). For matrix B, contribution was based on the institution that holds the copyright to the image associated with the records. Regions along the x- and y-axis are sorted to provide spatial context, with the map as a reference. The diagonal line represents the “ideal” scenario wherein the region serving as the processing center of barcode data can sufficiently sample its own local area. This dataset includes the records with NA entries for country sampled.

Similar articles

Cited by

  • Towards global insect biomonitoring with frugal methods.
    Brydegaard M, Pedales RD, Feng V, Yamoa AS, Kouakou B, Månefjord H, Wührl L, Pylatiuk C, Amorim DS, Meier R. Brydegaard M, et al. Philos Trans R Soc Lond B Biol Sci. 2024 Jun 24;379(1904):20230103. doi: 10.1098/rstb.2023.0103. Epub 2024 May 6. Philos Trans R Soc Lond B Biol Sci. 2024. PMID: 38705174 Free PMC article.

References

    1. Amano T, Lamming JDL, Sutherland WJ. Spatial gaps in global biodiversity information and the role of citizen science. BioScience. 2016;66(5):393–400. doi: 10.1093/biosci/biw022. - DOI
    1. AMEC Earth & Environmental Best practices for sharing sensitive environmental geospatial data. Natural Resources Canada. 2010. https://publications.gc.ca/site/eng/9.694895/publication.html https://publications.gc.ca/site/eng/9.694895/publication.html
    1. Andrews J. Sensitive species data policy. Department of environment, climate change and water NSW. 2009. https://www.environment.nsw.gov.au/-/media/OEH/Corporate-Site/Documents/... https://www.environment.nsw.gov.au/-/media/OEH/Corporate-Site/Documents/...
    1. Arayata MC. PH needs more scientists: NAST. Philippine News Agency. 2019. https://www.pna.gov.ph/articles/1074747 https://www.pna.gov.ph/articles/1074747
    1. Aureo WA, Reyes TD, Francis FC, Jose RP, Sarnowski MB. Diversity and composition of plant species in the forest over limestone of Rajah Sikatuna Protected Landscape, Bohol, Philippines. Biodiversity Data Journal. 2020;8(142):1–23. doi: 10.3897/BDJ.8.E55790. - DOI - PMC - PubMed

Grants and funding

The authors received no funding for this work.

LinkOut - more resources