Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection
- PMID: 21709163
- PMCID: PMC3168300
- DOI: 10.1136/amiajnl-2010-000022
Abstract
Objective: The US Vaccine Adverse Event Reporting System (VAERS) collects spontaneous reports of adverse events following vaccination. Medical officers review the reports and often apply standardized case definitions, such as those developed by the Brighton Collaboration. Our objective was to demonstrate a multi-level text mining approach for automated text classification of VAERS reports that could potentially reduce human workload.
Design: We selected 6034 VAERS reports for H1N1 vaccine that were classified by medical officers as potentially positive (N(pos)=237) or negative for anaphylaxis. We created a categorized corpus of text files that included the class label and the symptom text field of each report. A validation set of 1100 labeled text files was also used. Text mining techniques were applied to extract three feature sets: important keywords, low-level patterns, and high-level patterns. A rule-based classifier processed the high-level feature representation, while several machine learning classifiers were trained on the remaining two feature representations.
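As a rough illustration of the keyword-level representation described in the Design, the sketch below turns a symptom text into binary keyword-presence features and applies a simple threshold rule. The keyword list, the function names `keyword_features` and `rule_classify`, and the `min_hits` threshold are all hypothetical. They are not the features or rules used in the study, and they are not the Brighton Collaboration case definition.

```python
import re

# Hypothetical keyword list for illustration only; NOT taken from the paper.
KEYWORDS = ["hives", "wheezing", "swelling", "epinephrine"]


def keyword_features(text, keywords=KEYWORDS):
    """Return one 0/1 flag per keyword, set when the keyword occurs as a token."""
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    return [1 if keyword in tokens else 0 for keyword in keywords]


def rule_classify(features, min_hits=2):
    """Toy stand-in rule (an assumption): flag a report when enough keywords are present."""
    return "pos" if sum(features) >= min_hits else "neg"


report = "Patient developed HIVES and wheezing; epinephrine given."
features = keyword_features(report)
label = rule_classify(features)
```

A real system would need the richer low- and high-level patterns the Design mentions. Exact token matching like this misses negation, misspellings, and multi-word expressions.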
Measurements: Classifier performance was evaluated using macro-averaged recall, precision, and F-measure, together with Friedman's test; misclassification error rate analysis was also performed.
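Macro-averaging computes each metric per class and then takes the unweighted mean, so the rare positive class counts as much as the large negative class. The sketch below is a minimal pure-Python version. The function name `macro_metrics` and the toy labels are assumptions, and the F-measure here is the mean of per-class F values; some authors instead compute F from the macro-averaged precision and recall, and the abstract does not say which variant was used.

```python
from collections import Counter


def macro_metrics(y_true, y_pred):
    """Compute macro-averaged recall, precision, and F-measure over all classes."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    recalls, precisions, f_values = [], [], []
    for c in labels:
        recall = tp[c] / (tp[c] + fn[c]) if tp[c] + fn[c] else 0.0
        precision = tp[c] / (tp[c] + fp[c]) if tp[c] + fp[c] else 0.0
        f_value = (2 * precision * recall / (precision + recall)
                   if precision + recall else 0.0)
        recalls.append(recall)
        precisions.append(precision)
        f_values.append(f_value)

    n = len(labels)
    return {
        "macro_recall": sum(recalls) / n,
        "macro_precision": sum(precisions) / n,
        "macro_f": sum(f_values) / n,
    }


# Toy labels for illustration only; not data from the study.
y_true = ["pos", "pos", "neg", "neg", "neg", "neg"]
y_pred = ["pos", "neg", "neg", "neg", "neg", "pos"]
metrics = macro_metrics(y_true, y_pred)
```

With a class imbalance like the one in the Design (237 positives among 6034 reports), plain accuracy can look high even when most positives are missed, which is why macro-averaged measures are informative here.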
Results: The rule-based classifier, boosted trees, and weighted support vector machines performed well in terms of macro-recall, but at the expense of a higher mean misclassification error rate. The rule-based classifier performed very well in terms of average sensitivity and specificity (79.05% and 94.80%, respectively).
Conclusion: Our validated results showed the possibility of developing effective medical text classifiers for VAERS reports by combining text mining with informative feature selection; this strategy has the potential to reduce reviewer workload considerably.
Similar articles
- Vaccine adverse event text mining system for extracting features from vaccine safety reports. J Am Med Inform Assoc. 2012 Nov-Dec;19(6):1011-8. doi: 10.1136/amiajnl-2012-000881. Epub 2012 Aug 25. PMID: 22922172. Free PMC article.
- Network analysis of possible anaphylaxis cases reported to the US vaccine adverse event reporting system after H1N1 influenza vaccine. Stud Health Technol Inform. 2011;169:564-8. PMID: 21893812.
- Application of information retrieval approaches to case classification in the vaccine adverse event reporting system. Drug Saf. 2013 Jul;36(7):573-82. doi: 10.1007/s40264-013-0064-4. PMID: 23703591.
- Elective termination of pregnancy after vaccination reported to the Vaccine Adverse Event Reporting System (VAERS): 1990-2006. Vaccine. 2008 May 2;26(19):2428-32. doi: 10.1016/j.vaccine.2008.02.052. Epub 2008 Mar 17. PMID: 18406499. Review.
- Data mining in the US using the Vaccine Adverse Event Reporting System. Drug Saf. 2006;29(5):375-84. doi: 10.2165/00002018-200629050-00002. PMID: 16689554. Review.
Cited by
- The Impact of Artificial Intelligence on Allergy Diagnosis and Treatment. Curr Allergy Asthma Rep. 2024 Jul;24(7):361-372. doi: 10.1007/s11882-024-01152-y. Epub 2024 Jul 2. PMID: 38954325. Review.
- Trust but Verify: Lessons Learned for the Application of AI to Case-Based Clinical Decision-Making From Postmarketing Drug Safety Assessment at the US Food and Drug Administration. J Med Internet Res. 2024 Jun 6;26:e50274. doi: 10.2196/50274. PMID: 38842929. Free PMC article.
- Improving long COVID-related text classification: a novel end-to-end domain-adaptive paraphrasing framework. Sci Rep. 2024 Jan 2;14(1):85. doi: 10.1038/s41598-023-48594-4. PMID: 38168099. Free PMC article.
- Generalizability of machine learning methods in detecting adverse drug events from clinical narratives in electronic medical records. Front Pharmacol. 2023 Jul 12;14:1218679. doi: 10.3389/fphar.2023.1218679. eCollection 2023. PMID: 37502211. Free PMC article.
- Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques. Drug Saf. 2023 Aug;46(8):781-795. doi: 10.1007/s40264-023-01323-2. Epub 2023 Jun 17. PMID: 37330415. Free PMC article.
