Interrater reliability of sleep stage scoring: a meta-analysis
- PMID: 34310277
- PMCID: PMC8807917
- DOI: 10.5664/jcsm.9538
Interrater reliability of sleep stage scoring: a meta-analysis
Abstract
Study objectives: We evaluated the interrater reliabilities of manual polysomnography sleep stage scoring. We included all studies that employed Rechtschaffen and Kales rules or American Academy of Sleep Medicine standards. We sought the overall degree of agreement and those for each stage.
Methods: The keywords were "Polysomnography (PSG)," "sleep staging," "Rechtschaffen and Kales (R&K)," "American Academy of Sleep Medicine (AASM)," "interrater (interscorer) reliability," and "Cohen's kappa." We searched PubMed, OVID Medline, EMBASE, the Cochrane library, KoreaMed, KISS, and the MedRIC. The exclusion criteria included automatic scoring and pediatric patients. We collected data on scorer histories, scoring rules, numbers of epochs scored, and the underlying diseases of the patients.
Results: A total of 101 publications were retrieved; 11 satisfied the selection criteria. The Cohen's kappa for manual, overall sleep scoring was 0.76, indicating substantial agreement (95% confidence interval, 0.71-0.81; P < .001). By sleep stage, the figures were 0.70, 0.24, 0.57, 0.57, and 0.69 for the W, N1, N2, N3, and R stages, respectively. The interrater reliabilities for stage N2 and N3 sleep were moderate, and that for stage N1 sleep was only fair.
Conclusions: We conducted a meta-analysis to generalize the variation in manual scoring of polysomnography and provide reference data for automatic sleep stage scoring systems. The reliability of manual scorers of polysomnography sleep stages was substantial. However, for certain stages, the results were poor; validity requires improvement.
Citation: Lee YJ, Lee JY, Cho JH, Choi JH. Interrater reliability of sleep stage scoring: a meta-analysis. J Clin Sleep Med. 2022;18(1):193-202.
Keywords: interrater reliability; meta-analysis; sleep stage scoring.
© 2022 American Academy of Sleep Medicine.
Conflict of interest statement
All authors have seen and approved the manuscript. Work for this study was performed in the Department of Otorhinolaryngology—Head and Neck Surgery, College of Medicine, Soonchunhyang University, Bucheon Hospital, Bucheon, Korea. This study was funded by the Soonchunhyang University Research Fund. The authors report no conflicts of interest.
Figures
Similar articles
-
Interrater agreement between American and Chinese sleep centers according to the 2014 AASM standard.Sleep Breath. 2019 Jun;23(2):719-728. doi: 10.1007/s11325-019-01801-x. Epub 2019 Feb 19. Sleep Breath. 2019. PMID: 30783913
-
Scoring accuracy of automated sleep staging from a bipolar electroocular recording compared to manual scoring by multiple raters.Sleep Med. 2013 Nov;14(11):1199-207. doi: 10.1016/j.sleep.2013.04.022. Epub 2013 Aug 16. Sleep Med. 2013. PMID: 24047533 Clinical Trial.
-
The 2007 AASM recommendations for EEG electrode placement in polysomnography: impact on sleep and cortical arousal scoring.Sleep. 2011 Jan 1;34(1):73-81. doi: 10.1093/sleep/34.1.73. Sleep. 2011. PMID: 21203376 Free PMC article. Clinical Trial.
-
The Visual Scoring of Sleep in Infants 0 to 2 Months of Age.J Clin Sleep Med. 2016 Mar;12(3):429-45. doi: 10.5664/jcsm.5600. J Clin Sleep Med. 2016. PMID: 26951412 Free PMC article. Review.
-
The AASM scoring manual: a critical appraisal.Curr Opin Pulm Med. 2009 Nov;15(6):540-9. doi: 10.1097/MCP.0b013e328331a2bf. Curr Opin Pulm Med. 2009. PMID: 19738470 Review.
Cited by
-
The Virtual Sleep Lab-A Novel Method for Accurate Four-Class Sleep Staging Using Heart-Rate Variability from Low-Cost Wearables.Sensors (Basel). 2023 Feb 21;23(5):2390. doi: 10.3390/s23052390. Sensors (Basel). 2023. PMID: 36904595 Free PMC article.
-
Are pictures worth a thousand sleep signals?Sleep. 2023 Dec 11;46(12):zsad268. doi: 10.1093/sleep/zsad268. Sleep. 2023. PMID: 37843473 Free PMC article. No abstract available.
-
Automated sleep staging algorithms: have we reached the performance limit due to manual scoring?Sleep. 2022 Sep 8;45(9):zsac159. doi: 10.1093/sleep/zsac159. Sleep. 2022. PMID: 35866932 Free PMC article. No abstract available.
-
Looking for a balance between visual and automatic sleep scoring.NPJ Digit Med. 2023 Sep 5;6(1):165. doi: 10.1038/s41746-023-00915-7. NPJ Digit Med. 2023. PMID: 37670135 Free PMC article.
-
Spotlight on Sleep Stage Classification Based on EEG.Nat Sci Sleep. 2023 Jun 29;15:479-490. doi: 10.2147/NSS.S401270. eCollection 2023. Nat Sci Sleep. 2023. PMID: 37405208 Free PMC article.
References
-
- Javaheri S, Redline S. Sleep, slow-wave sleep, and blood pressure. Curr Hypertens Rep. 2012; 14( 5): 442– 448. - PubMed
-
- Pillai JA, Leverenz JB. Sleep and neurodegeneration: a critical appraisal. Chest. 2017; 151( 6): 1375– 1386. - PubMed
-
- Kales A, Rechtschaffen A. A Manual of Standardized Terminology, Techniques and Scoring System for Sleep Stages in Human Subjects. Washington, DC: U.S. Government Printing Office; 1968. - PubMed
-
- Iber C, Ancoli-Israel S, Chesson AL Jr, Quan SF; for the American Academy of Sleep Medicine. The AASM Manual for the Scoring of Sleep and Associated Events: Rules, Terminology and Technical Specifications. 1st ed. Westchester, IL: American Academy of Sleep Medicine; 2007.
-
- Norman RG, Pal I, Stewart C, Walsleben JA, Rapoport DM. Interobserver agreement among sleep scorers from different centers in a large dataset. Sleep. 2000; 23( 7): 901– 908. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
