PBHMDA: Path-Based Human Microbe-Disease Association Prediction

Front Microbiol. 2017 Feb 22:8:233. doi: 10.3389/fmicb.2017.00233. eCollection 2017.

Abstract

With advances in sequencing technology and microbiology, microorganisms have been found to be closely related to various important human diseases. The growing number of identified human microbe-disease associations offers important insights into disease mechanisms from the perspective of human microbes, which are greatly helpful for investigating pathogenesis, promoting early diagnosis, and improving precision medicine. However, current knowledge in this domain is still limited and far from complete. Here, we present a computational model, Path-Based Human Microbe-Disease Association prediction (PBHMDA), based on the integration of known microbe-disease associations and the Gaussian interaction profile kernel similarity for microbes and diseases. A special depth-first search algorithm was implemented to traverse all possible paths between microbes and diseases in order to infer the most likely disease-related microbes. As a result, PBHMDA obtained reliable prediction performance, with AUCs (areas under the ROC curve) of 0.9169 and 0.8767 in global and local leave-one-out cross validation, respectively. In 5-fold cross validation, an average AUC of 0.9082 ± 0.0061 further demonstrated the effectiveness of the proposed model. In the case studies of liver cirrhosis, type 1 diabetes, and asthma, 9, 7, and 9 of the top 10 predicted microbes, respectively, have been confirmed by previously published experimental literature. We have publicly released the prioritized microbe-disease associations, which may help to select the most promising pairs for experimental confirmation. In conclusion, PBHMDA has the potential to boost the discovery of novel microbe-disease associations and to aid future research into microbe involvement in human disease mechanisms. The code and data of PBHMDA are freely available at http://www.escience.cn/system/file?fileId=85214.
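
The two ingredients named above, the Gaussian interaction profile kernel similarity and a depth-first traversal of paths between a microbe and a disease, can be sketched in plain Python as follows. This is an illustrative sketch under stated assumptions, not the released PBHMDA code: the function names, the similarity threshold, the maximum path length, and the length-dependent decay exponent `alpha` are hypothetical choices made here, since the abstract does not specify them. The kernel follows the commonly used definition with bandwidth normalized by the mean squared norm of the interaction profiles.

```python
import math

def gip_kernel(profiles):
    """Gaussian interaction profile kernel over the rows of a 0/1 matrix."""
    n = len(profiles)
    mean_sq_norm = sum(sum(x * x for x in row) for row in profiles) / n
    gamma = 1.0 / mean_sq_norm if mean_sq_norm > 0 else 1.0

    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return [[math.exp(-gamma * sq_dist(profiles[i], profiles[j]))
             for j in range(n)] for i in range(n)]

def build_graph(assoc, threshold=0.5):
    """Build a weighted graph from assoc[i][j] = 1 (microbe i, disease j).

    Similarity edges below `threshold` are dropped (assumed cutoff);
    known associations get weight 1.0.
    """
    n_m, n_d = len(assoc), len(assoc[0])
    km = gip_kernel(assoc)                              # microbe similarity
    kd = gip_kernel([list(c) for c in zip(*assoc)])     # disease similarity
    graph = {("m", i): {} for i in range(n_m)}
    graph.update({("d", j): {} for j in range(n_d)})

    def add_edge(u, v, w):
        graph[u][v] = w
        graph[v][u] = w

    for kind, sim, n in (("m", km, n_m), ("d", kd, n_d)):
        for i in range(n):
            for k in range(i + 1, n):
                if sim[i][k] >= threshold:
                    add_edge((kind, i), (kind, k), sim[i][k])
    for i, row in enumerate(assoc):
        for j, known in enumerate(row):
            if known:
                add_edge(("m", i), ("d", j), 1.0)
    return graph

def path_score(graph, source, target, max_len=3, alpha=1.0):
    """Sum, over simple paths of at most `max_len` edges found by DFS,
    of (product of edge weights) ** (alpha * path length)."""
    total = 0.0

    def dfs(node, product, length, visited):
        nonlocal total
        if node == target:
            total += product ** (alpha * length)
            return
        if length == max_len:
            return
        for nxt, w in graph[node].items():
            if nxt not in visited:
                visited.add(nxt)
                dfs(nxt, product * w, length + 1, visited)
                visited.remove(nxt)

    dfs(source, 1.0, 0, {source})
    return total

# Toy data: 3 microbes x 2 diseases (made up for illustration only).
toy_assoc = [[1, 0],
             [1, 1],
             [0, 1]]
toy_graph = build_graph(toy_assoc)
toy_score = path_score(toy_graph, ("m", 0), ("d", 1))
```

Ranking all microbes by `path_score` for a fixed disease gives a prioritized candidate list in the spirit of the model described above. Since similarity weights lie in (0, 1], a larger `alpha` or a longer path shrinks the contribution of that path, so short paths through strong edges dominate the score.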

Keywords: association network; computational prediction model; diseases; microbes; path-based measure.