Artificial Intelligence-Assisted Left Ventricular Diastolic Function Assessment and Grading: Multiview Versus Single View

J Am Soc Echocardiogr. 2023 Oct;36(10):1064-1078. doi: 10.1016/j.echo.2023.07.001. Epub 2023 Jul 10.


Background: Clinical assessment and grading of left ventricular diastolic function (LVDF) requires quantification of multiple echocardiographic parameters interpreted according to established guidelines, which depends on experienced clinicians and is time consuming. The aim of this study was to develop an artificial intelligence (AI)-assisted system to facilitate the clinical assessment of LVDF.

Methods: In total, 1,304 studies (33,404 images) were used to develop a view classification model to select six specific views required for LVDF assessment. A total of 2,238 studies (16,794 two-dimensional [2D] images and 2,198 Doppler images) to develop 2D and Doppler segmentation models, respectively, to quantify key metrics of diastolic function. We used 2,150 studies with definite LVDF labels determined by two experts to train single-view classification models by AI interpretation of strain metrics or video. The accuracy and efficiency of these models were tested in an external data set of 388 prospective studies.

Results: The view classification model identified views required for LVDF assessment with good sensitivity (>0.9), and view segmentation models successfully outlined key regions of these views with intersection over union > 0.8 in the internal validation data set. In the external test data set of 388 cases, AI quantification of 2D and Doppler images showed narrow limits of agreement compared with the two experts (e.g., left ventricular ejection fraction, -12.02% to 9.17%; E/e' ratio, -3.04 to 2.67). These metrics were used to detect LV diastolic dysfunction (DD) and grade DD with accuracy of 0.9 and 0.92, respectively. Concerning the single-view method, the overall accuracy of DD detection was 0.83 and 0.75 by strain-based and video-based models, and the accuracy of DD grading was 0.85 and 0.8, respectively. These models could achieve diagnosis and grading of LVDD in a few seconds, greatly saving time and labor.

Conclusion: AI models successfully achieved LVDF assessment and grading that compared favorably with human experts reading according to guideline-based algorithms. Moreover, when Doppler variables were missing, AI models could provide assessment by interpreting 2D strain metrics or videos from a single view. These models have the potential to save labor and cost and to facilitate work flow of clinical LVDF assessment.

Keywords: Deep learning; Echocardiography; Left ventricular diastolic function.