Towards reliable named entity recognition in the biomedical domain
- PMID: 31218364
- PMCID: PMC6956779
- DOI: 10.1093/bioinformatics/btz504
Towards reliable named entity recognition in the biomedical domain
Abstract
Motivation: Automatic biomedical named entity recognition (BioNER) is a key task in biomedical information extraction. For some time, state-of-the-art BioNER has been dominated by machine learning methods, particularly conditional random fields (CRFs), with a recent focus on deep learning. However, recent work has suggested that the high performance of CRFs for BioNER may not generalize to corpora other than the one it was trained on. In our analysis, we find that a popular deep learning-based approach to BioNER, known as bidirectional long short-term memory network-conditional random field (BiLSTM-CRF), is correspondingly poor at generalizing. To address this, we evaluate three modifications of BiLSTM-CRF for BioNER to improve generalization: improved regularization via variational dropout, transfer learning and multi-task learning.
Results: We measure the effect that each strategy has when training/testing on the same corpus ('in-corpus' performance) and when training on one corpus and evaluating on another ('out-of-corpus' performance), our measure of the model's ability to generalize. We found that variational dropout improves out-of-corpus performance by an average of 4.62%, transfer learning by 6.48% and multi-task learning by 8.42%. The maximal increase we identified combines multi-task learning and variational dropout, which boosts out-of-corpus performance by 10.75%. Furthermore, we make available a new open-source tool, called Saber that implements our best BioNER models.
Availability and implementation: Source code for our biomedical IE tool is available at https://github.com/BaderLab/saber. Corpora and other resources used in this study are available at https://github.com/BaderLab/Towards-reliable-BioNER.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.
Figures
Similar articles
-
Transfer learning for biomedical named entity recognition with neural networks.Bioinformatics. 2018 Dec 1;34(23):4087-4094. doi: 10.1093/bioinformatics/bty449. Bioinformatics. 2018. PMID: 29868832 Free PMC article.
-
Cross-type biomedical named entity recognition with deep multi-task learning.Bioinformatics. 2019 May 15;35(10):1745-1752. doi: 10.1093/bioinformatics/bty869. Bioinformatics. 2019. PMID: 30307536
-
DTranNER: biomedical named entity recognition with deep learning-based label-label transition model.BMC Bioinformatics. 2020 Feb 11;21(1):53. doi: 10.1186/s12859-020-3393-1. BMC Bioinformatics. 2020. PMID: 32046638 Free PMC article.
-
GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text.Bioinformatics. 2018 May 1;34(9):1547-1554. doi: 10.1093/bioinformatics/btx815. Bioinformatics. 2018. PMID: 29272325 Free PMC article.
-
An Open Medical Platform to Share Source Code and Various Pre-Trained Weights for Models to Use in Deep Learning Research.Korean J Radiol. 2021 Dec;22(12):2073-2081. doi: 10.3348/kjr.2021.0170. Epub 2021 Oct 26. Korean J Radiol. 2021. PMID: 34719891 Free PMC article. Review.
Cited by
-
Pathway Commons 2019 Update: integration, analysis and exploration of pathway data.Nucleic Acids Res. 2020 Jan 8;48(D1):D489-D497. doi: 10.1093/nar/gkz946. Nucleic Acids Res. 2020. PMID: 31647099 Free PMC article.
-
Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach.PeerJ Comput Sci. 2022 Mar 7;8:e913. doi: 10.7717/peerj-cs.913. eCollection 2022. PeerJ Comput Sci. 2022. PMID: 35494817 Free PMC article.
-
AIONER: all-in-one scheme-based biomedical named entity recognition using deep learning.Bioinformatics. 2023 May 4;39(5):btad310. doi: 10.1093/bioinformatics/btad310. Bioinformatics. 2023. PMID: 37171899 Free PMC article.
-
Identifying stroke diagnosis-related features from medical imaging reports to improve clinical decision-making support.BMC Med Inform Decis Mak. 2022 Oct 20;22(1):275. doi: 10.1186/s12911-022-02012-3. BMC Med Inform Decis Mak. 2022. PMID: 36266650 Free PMC article.
-
HunFlair2 in a cross-corpus evaluation of biomedical named entity recognition and normalization tools.Bioinformatics. 2024 Oct 1;40(10):btae564. doi: 10.1093/bioinformatics/btae564. Bioinformatics. 2024. PMID: 39302686 Free PMC article.
References
-
- Baxter J. et al. (2000) A model of inductive bias learning. J. Artif. Intell. Res., 12, 3.
-
- Bayer J. et al. (2013) On fast dropout and its applicability to recurrent networks. arXiv preprint arXiv: 1311.0701.
-
- Caruana R. (1993) Multitask learning: a knowledge-based source of inductive bias. In: Proceedings of the Tenth International Conference on Machine Learning, pp. 41–48. Morgan Kaufmann, Citeseer.
