Molecular patterns and pathways in idiopathic pulmonary fibrosis (IPF) have been extensively investigated, but few studies have assimilated multiomic platforms to provide an integrative understanding of molecular patterns that are relevant in IPF. Herein, we combine the coding and noncoding transcriptomes, DNA methylomes, and proteomes from IPF and healthy lung tissue to identify molecules and pathways associated with this disease. RNA sequencing, Illumina MethylationEPIC array, and liquid chromatography-mass spectrometry proteomic data were collected on lung tissue from 24 subjects with IPF and 14 control subjects. Significant differential features were identified by using linear models adjusting for age and sex, inflation, and bias when appropriate. Data Integration Analysis for Biomarker Discovery Using a Latent Component Method for Omics Studies was used for integrative multiomic analysis. We identified 4,643 differentially expressed transcripts aligning to 3,439 genes, 998 differentially abundant proteins, 2,500 differentially methylated regions, and 1,269 differentially expressed long noncoding RNAs (lncRNAs) that were significant after correcting for multiple tests (false discovery rate < 0.05). Unsupervised hierarchical clustering using 20 coding mRNA, protein, methylation, and lncRNA features with the highest loadings on the top latent variable from the four data sets demonstrates perfect separation of IPF and control lungs. Our analysis confirmed previously validated molecules and pathways known to be dysregulated in disease and implicated novel molecular features as potential drivers and modifiers of disease. For example, 4 proteins, 18 differentially methylated regions, and 10 lncRNAs were found to have strong correlations (|r| > 0.8) with MMP7 (matrix metalloproteinase 7). Therefore, by using a system biology approach, we have identified novel molecular relationships in IPF.
Keywords: methylome; multiomics; proteome; systems biology; transcriptome.