The lncLocator: a subcellular localization predictor for long non-coding RNAs based on a stacked ensemble classifier
- PMID: 29462250
- DOI: 10.1093/bioinformatics/bty085
The lncLocator: a subcellular localization predictor for long non-coding RNAs based on a stacked ensemble classifier
Abstract
Motivation: The long non-coding RNA (lncRNA) studies have been hot topics in the field of RNA biology. Recent studies have shown that their subcellular localizations carry important information for understanding their complex biological functions. Considering the costly and time-consuming experiments for identifying subcellular localization of lncRNAs, computational methods are urgently desired. However, to the best of our knowledge, there are no computational tools for predicting the lncRNA subcellular locations to date.
Results: In this study, we report an ensemble classifier-based predictor, lncLocator, for predicting the lncRNA subcellular localizations. To fully exploit lncRNA sequence information, we adopt both k-mer features and high-level abstraction features generated by unsupervised deep models, and construct four classifiers by feeding these two types of features to support vector machine (SVM) and random forest (RF), respectively. Then we use a stacked ensemble strategy to combine the four classifiers and get the final prediction results. The current lncLocator can predict five subcellular localizations of lncRNAs, including cytoplasm, nucleus, cytosol, ribosome and exosome, and yield an overall accuracy of 0.59 on the constructed benchmark dataset.
Availability and implementation: The lncLocator is available at www.csbio.sjtu.edu.cn/bioinf/lncLocator.
Supplementary information: Supplementary data are available at Bioinformatics online.
Similar articles
-
lncLocator 2.0: a cell-line-specific subcellular localization predictor for long non-coding RNAs with interpretable deep learning.Bioinformatics. 2021 Aug 25;37(16):2308-2316. doi: 10.1093/bioinformatics/btab127. Bioinformatics. 2021. PMID: 33630066
-
iLoc-lncRNA: predict the subcellular location of lncRNAs by incorporating octamer composition into general PseKNC.Bioinformatics. 2018 Dec 15;34(24):4196-4204. doi: 10.1093/bioinformatics/bty508. Bioinformatics. 2018. PMID: 29931187
-
Lnclocator-imb: An Imbalance-tolerant Ensemble Deep Learning Framework for Predicting Long Non-coding RNA Subcellular Localization.IEEE J Biomed Health Inform. 2023 Oct 16;PP. doi: 10.1109/JBHI.2023.3324709. Online ahead of print. IEEE J Biomed Health Inform. 2023. PMID: 37843994
-
LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property.Brief Bioinform. 2019 Nov 27;20(6):2009-2027. doi: 10.1093/bib/bby065. Brief Bioinform. 2019. PMID: 30084867 Free PMC article. Review.
-
Long non-coding RNAs and complex diseases: from experimental results to computational models.Brief Bioinform. 2017 Jul 1;18(4):558-576. doi: 10.1093/bib/bbw060. Brief Bioinform. 2017. PMID: 27345524 Free PMC article. Review.
Cited by
-
Ensemble learning for integrative prediction of genetic values with genomic variants.BMC Bioinformatics. 2024 Mar 21;25(1):120. doi: 10.1186/s12859-024-05720-x. BMC Bioinformatics. 2024. PMID: 38515026
-
Identification and clinical value of a new ceRNA axis (TIMP3/hsa-miR-181b-5p/PAX8-AS1) in thyroid cancer.Health Sci Rep. 2024 Feb 25;7(2):e1859. doi: 10.1002/hsr2.1859. eCollection 2024 Feb. Health Sci Rep. 2024. PMID: 38410497 Free PMC article.
-
Predicting the incidence of infectious diarrhea with symptom surveillance data using a stacking-based ensembled model.BMC Infect Dis. 2024 Feb 26;24(1):265. doi: 10.1186/s12879-024-09138-x. BMC Infect Dis. 2024. PMID: 38408967 Free PMC article.
-
Uncovering the ceRNA Network Related to the Prognosis of Stomach Adenocarcinoma Among 898 Patient Samples.Biochem Genet. 2024 Feb 15. doi: 10.1007/s10528-023-10656-7. Online ahead of print. Biochem Genet. 2024. PMID: 38361095
-
Identification and Characterization of a ceRNA Regulatory Network Involving LINC00482 and PRRC2B in Peripheral Blood Mononuclear Cells: Implications for COPD Pathogenesis and Diagnosis.Int J Chron Obstruct Pulmon Dis. 2024 Feb 8;19:419-430. doi: 10.2147/COPD.S437046. eCollection 2024. Int J Chron Obstruct Pulmon Dis. 2024. PMID: 38348310 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
