中文

In silico prediction of potential miRNA‐disease association using an integrative bioinformatics approach based on kernel fusion

2021-09-06

2021-09-06

NN Guan, CC Wang, L Zhang, L Huang, JQ Li, X Piao. In silico prediction of potential miRNA‐disease association using an integrative bioinformatics approach based on kernel fusion.. Journal of cellular and molecular medicine 24 (1), 573-587.  https://doi.org/10.1111/jcmm.14765


Abstract

Accumulating experimental evidence has demonstrated that microRNAs (miRNAs) have a huge impact on numerous critical biological processes and they are associated with different complex human diseases. Nevertheless, the task to predict potential miRNAs related to diseases remains difficult. In this paper, we developed a Kernel Fusion-based Regularized Least Squares for MiRNA-Disease Association prediction model (KFRLSMDA), which applied kernel fusion technique to fuse similarity matrices and then utilized regularized least squares to predict potential miRNA-disease associations. To prove the effectiveness of KFRLSMDA, we adopted leave-one-out cross-validation (LOOCV) and 5-fold cross-validation and then compared KFRLSMDA with 10 previous computational models (MaxFlow, MiRAI, MIDP, RKNNMDA, MCMDA, HGIMDA, RLSMDA, HDMP, WBSMDA and RWRMDA). Outperforming other models, KFRLSMDA achieved AUCs of 0.9246 in global LOOCV, 0.8243 in local LOOCV and average AUC of 0.9175 ± 0.0008 in 5-fold cross-validation. In addition, respectively, 96%, 100% and 90% of the top 50 potential miRNAs for breast neoplasms, colon neoplasms and oesophageal neoplasms were confirmed by experimental discoveries. We also predicted potential miRNAs related to hepatocellular cancer by removing all known related miRNAs of this cancer and 98% of the top 50 potential miRNAs were verified. Furthermore, we predicted potential miRNAs related to lymphoma using the data set in the old version of the HMDD database and 80% of the top 50 potential miRNAs were confirmed. Therefore, it can be concluded that KFRLSMDA has reliable prediction performance.