Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep 21;8(1):653.
doi: 10.1038/s41467-017-00413-x.

Establishing multiple omics baselines for three Southeast Asian populations in the Singapore Integrative Omics Study

Affiliations

Establishing multiple omics baselines for three Southeast Asian populations in the Singapore Integrative Omics Study

Woei-Yuh Saw et al. Nat Commun. .

Abstract

The Singapore Integrative Omics Study provides valuable insights on establishing population reference measurement in 364 Chinese, Malay, and Indian individuals. These measurements include > 2.5 millions genetic variants, 21,649 transcripts expression, 282 lipid species quantification, and 284 clinical, lifestyle, and dietary variables. This concept paper introduces the depth of the data resource, and investigates the extent of ethnic variation at these omics and non-omics biomarkers. It is evident that there are specific biomarkers in each of these platforms to differentiate between the ethnicities, and intra-population analyses suggest that Chinese and Indians are the most biologically homogeneous and heterogeneous, respectively, of the three groups. Consistent patterns of correlations between lipid species also suggest the possibility of lipid tagging to simplify future lipidomics assays. The Singapore Integrative Omics Study is expected to allow the characterization of intra-omic and inter-omic correlations within and across all three ethnic groups through a systems biology approach.The Singapore Genome Variation projects characterized the genetics of Singapore's Chinese, Malay, and Indian populations. The Singapore Integrative Omics Study introduced here goes further in providing multi-omic measurements in individuals from these populations, including genetic, transcriptome, lipidome, and lifestyle data, and will facilitate the study of common diseases in Asian communities.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Fig. 1
Fig. 1
PCAs of omics and clinical/lifestyle/diet data. Biplots are shown for five distinct PCAs using the respective first two axes of variations from each PCA. The five PCAs correspond to the analysis of: a 101,099 autosomal SNPs pseudo-randomly chosen to minimize linkage disequilibrium between the SNPs; b 21,649 gene transcript probesets; c 274 miRNAs; d 282 lipid species; e a set of 284 clinical, lifestyle, and dietary variables; and f only the 199 dietary variables. Each circle represents an individual from the iOmics and is assigned a color corresponding to the self-reported ethnicity of the subject, according to the color legend on the top right panel in a
Fig. 2
Fig. 2
Distribution of the Wright FST value across three ethnic groups at the eight HLA loci. The distribution of the Wright FST value across three populations at the eight HLA loci. The alleles shown in the plot are the top three FST alleles at each HLA loci. The triangular shape indicates the HLA alleles, where the FST values are driven by differences between Chinese and Indians. The diamond shape indicates the HLA alleles, where the FST values are driven by the differences between Chinese and Malays. The square shape indicates the HLA alleles, where the FST values are driven by the differences between Malays and Indians. Shapes with red color outline are representing drug-associated HLA alleles (Table 3)
Fig. 3
Fig. 3
A combined boxplot and scatter plot of the top three significant transcript probesets across three populations. The combined plot showing distribution of transcript intensities of a 7912136:UTS2 gene, b 8041061:PLB1 gene, c 8102362:TIFA gene across three populations. P-values were calculated using ANOVA, adjusted for batch effect and gender, and corrected for Bonferroni. The upper whisker represents either the maximum value observed or is 1.5 times the interquartile range greater than the third quartile, whichever is smaller. The lower whisker represents either the minimum value observed or is 1.5 times lower than the first quartile, whichever is greater. The details of the significant transcript probesets across three populations can be found in Supplementary Data 3
Fig. 4
Fig. 4
Correlation heatmap of the 282 lipids in each ethnic group. The correlation heatmap between 282 lipids in the a Chinese; b Malays; and c Indians. The correlation was calculated by using concentration of the lipids via Pearson’s correlation, r 2. The lipids are first categorized into lipid category and followed by lipid classes. In each of the lipid class, the lipid species are ordered according to their carbon chain length and degree of unsaturation (number of double bonds). The intensity of the color reflects the magnitude of the correlation, in which the white color means r 2 = 0 and red color means r 2 = 1
Fig. 5
Fig. 5
Distribution of the differences and correlation of the 16 significant differentiated clinical phenotypes across three populations. The distribution of the differences of the 16 clinical significant phenotypes across three populations, with a correlation heatmap of the phenotypes. The correlation was calculated using Pearson’s correlation, , across 358 samples. The intensity of the color reflects the magnitude of the correlation, in which the white color means  = 0 and red color means  = 1. The details of the differences can be found in Supplementary Table 9

Similar articles

Cited by

References

    1. Welter D, et al. The NHGRI GWAS catalog: a curated resource of SNP-trait associations. Nucleic Acids Res. 2014;42:D1001–D1006. doi: 10.1093/nar/gkt1229. - DOI - PMC - PubMed
    1. Visscher PM, Brown MA, McCarthy MI, Yang J. Five years of GWAS discovery. Am. J. Hum. Genet. 2012;90:7–24. doi: 10.1016/j.ajhg.2011.11.029. - DOI - PMC - PubMed
    1. Wellcome Trust Case Control, C Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447:661–678. doi: 10.1038/nature05911. - DOI - PMC - PubMed
    1. International HapMap, C The international HapMap project. Nature. 2003;426:789–796. doi: 10.1038/nature02168. - DOI - PubMed
    1. Frazer KA, et al. A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007;449:851–861. doi: 10.1038/nature06258. - DOI - PMC - PubMed

Publication types

LinkOut - more resources