BAGLS, a multihospital Benchmark for Automatic Glottis Segmentation

Pablo Gómez; Andreas M Kist; Patrick Schlegel; David A Berry; Dinesh K Chhetri; Stephan Dürr; Matthias Echternach; Aaron M Johnson; Stefan Kniesburges; Melda Kunduk; Youri Maryn; Anne Schützenberger; Monique Verguts; Michael Döllinger

doi:10.1038/s41597-020-0526-3

BAGLS, a multihospital Benchmark for Automatic Glottis Segmentation

Sci Data. 2020 Jun 19;7(1):186. doi: 10.1038/s41597-020-0526-3.

Authors

Pablo Gómez^#¹, Andreas M Kist^#², Patrick Schlegel³, David A Berry⁴, Dinesh K Chhetri⁴, Stephan Dürr³, Matthias Echternach⁵, Aaron M Johnson⁶, Stefan Kniesburges³, Melda Kunduk⁷, Youri Maryn^{8

9

10

11

12}, Anne Schützenberger³, Monique Verguts^{8

13}, Michael Döllinger³

Affiliations

¹ Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany. pablo.gomez@tum.de.
² Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany. andreas.kist@uk-erlangen.de.
³ Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.
⁴ Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California, Los Angeles, Los Angeles, California, USA.
⁵ Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany.
⁶ NYU Voice Center, Department of Otolaryngology - Head and Neck Surgery, New York University School of Medicine, New York, New York, USA.
⁷ Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge, Louisiana, USA.
⁸ European Institute for ORL-HNS, Department of Otorhinolaryngology and Head & Neck Surgery, Sint-Augustinus GZA, Wilrijk, Belgium.
⁹ Department of Speech, Language and Hearing sciences, University of Ghent, Ghent, Belgium.
¹⁰ Faculty of Education, Health and Social Work, University College Ghent, Ghent, Belgium.
¹¹ Faculty of Psychology and Educational Sciences, School of Logopedics, Université Catholique de Louvain, Louvain-la-Neuve, Belgium.
¹² Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium.
¹³ Department of Otorhinolaryngology and Voice Disorders, Diest General Hospital, Diest, Belgium.

^# Contributed equally.

Abstract

Laryngeal videoendoscopy is one of the main tools in clinical examinations for voice disorders and voice research. Using high-speed videoendoscopy, it is possible to fully capture the vocal fold oscillations, however, processing the recordings typically involves a time-consuming segmentation of the glottal area by trained experts. Even though automatic methods have been proposed and the task is particularly suited for deep learning methods, there are no public datasets and benchmarks available to compare methods and to allow training of generalizing deep learning models. In an international collaboration of researchers from seven institutions from the EU and USA, we have created BAGLS, a large, multihospital dataset of 59,250 high-speed videoendoscopy frames with individually annotated segmentation masks. The frames are based on 640 recordings of healthy and disordered subjects that were recorded with varying technical equipment by numerous clinicians. The BAGLS dataset will allow an objective comparison of glottis segmentation methods and will enable interested researchers to train their own models and compare their methods.

Publication types

Dataset
Research Support, Non-U.S. Gov't

MeSH terms

Endoscopy*
Glottis / diagnostic imaging
Glottis / physiology*
Humans
Video Recording*
Vocal Cords / diagnostic imaging
Vocal Cords / physiology*
Voice Disorders / diagnosis*

Abstract

Publication types

MeSH terms

Grants and funding