Construct validity of the endoscopic sinus surgery simulator: II. Assessment of discriminant validity and expert benchmarking

Marvin P Fried; Babak Sadoughi; Suzanne J Weghorst; Michael Zeltsan; Hernando Cuellar; José I Uribe; Clarence T Sasaki; Douglas A Ross; Joseph B Jacobs; Richard A Lebowitz; Richard M Satava

doi:10.1001/archotol.133.4.350

Construct validity of the endoscopic sinus surgery simulator: II. Assessment of discriminant validity and expert benchmarking

Arch Otolaryngol Head Neck Surg. 2007 Apr;133(4):350-7. doi: 10.1001/archotol.133.4.350.

Authors

Marvin P Fried¹, Babak Sadoughi, Suzanne J Weghorst, Michael Zeltsan, Hernando Cuellar, José I Uribe, Clarence T Sasaki, Douglas A Ross, Joseph B Jacobs, Richard A Lebowitz, Richard M Satava

Affiliation

¹ Department of Otorhinolaryngology-Head and Neck Surgery, Montefiore Medical Center, Albert Einstein College of Medicine, Bronx, NY 10467, USA. mfried@montefiore.org

PMID: 17438249
DOI: 10.1001/archotol.133.4.350

Abstract

Objectives: To establish discriminant validity of the endoscopic sinus surgery simulator (ES3) (Lockheed Martin, Akron, Ohio) between various health care provider experience levels and to define benchmarking criteria for skills assessment.

Design: Prospective multi-institutional comparison study.

Setting: University-based tertiary care institution.

Participants: Ten expert otolaryngologists, 14 otolaryngology residents, and 10 medical students.

Interventions: Subjects completed the ES3's virtual reality curriculum (10 novice mode, 10 intermediate mode, and 3 advanced mode trials). Performance scores were recorded on each trial. Performance differences were analyzed using analysis of variance for repeated measures (experience level as between-subjects factor).

Main outcome measures: Simulator performance scores, accuracy, time to completion, and hazard disruption.

Results: The novice mode accurately distinguished the 3 groups, particularly at the onset of training (mean scores: senior otolaryngologists, 66.0; residents, 42.7; students, 18.3; for the paired comparisons between groups 1 and 2 and groups 1 and 3, P = .04 and .03, respectively). Subjects were not distinguished beyond trial 5. The intermediate mode only discriminated students from other subjects (P = .008). The advanced mode did not show performance differences between groups. Scores on the novice mode predicted those on the intermediate mode, which predicted advanced mode scores (r = 0.687), but no relationship was found between novice and advanced scores. All groups performed equally well and with comparable consistency at the outset of training. Expert scores were used to define benchmark criteria of optimal performance.

Conclusions: This study completes the construct validity assessment of the ES3 by demonstrating its discriminant capabilities. It establishes expert surgeon benchmark performance criteria and shows that the ES3 can train novice subjects to attain those. The refined analysis of trial performance scores could serve educational and skills assessment purposes. Current studies are evaluating the transfer of surgical skills acquired on the ES3 to the operating room (predictive validity).

Publication types

Comparative Study
Multicenter Study
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Analysis of Variance
Benchmarking
Clinical Competence
Computer Simulation
Computer-Assisted Instruction / methods*
Educational Measurement
Educational Technology
Endoscopy / education*
Endoscopy / methods*
Humans
Paranasal Sinus Diseases / surgery*
Prospective Studies
User-Computer Interface*

Grants and funding

R18HS11866-03/HS/AHRQ HHS/United States