Evaluation of the prediagnosis and management of ChatGPT-4.0 in clinical cases in cardiology

Future Cardiol. 2024 Mar 11;20(4):197-207. doi: 10.1080/14796678.2024.2348898. Epub 2024 May 17.

Abstract

Aim: To have expert cardiologists evaluate the performance of ChatGPT-4.0 in providing prediagnoses and treatment plans for cardiology clinical cases. Methods: Twenty cardiology clinical cases developed by experienced cardiologists were divided into two groups according to preparation method. ChatGPT-4.0 reviewed and analyzed the cases, and its analyses were then sent to cardiologists. Eighteen expert cardiologists evaluated the quality of the ChatGPT-4.0 responses using Likert and Global Quality scales. Results: Physicians rated case difficulty at a median of 2.00 and expressed high agreement with the differential diagnoses of ChatGPT-4.0 (median 5.00). Management plans received a median score of 4, indicating good quality. ChatGPT-4.0 performed similarly in differential diagnosis (p = 0.256) and treatment plans (p = 0.951) regardless of case difficulty. Conclusion: ChatGPT-4.0 excels at delivering accurate management plans and demonstrates its potential as a valuable clinical decision support tool in cardiology.

Keywords: ChatGPT; artificial intelligence; cardiology; clinical decision support systems; large language models.

Plain language summary

Have you ever wondered if an artificial intelligence (AI) program could help doctors figure out what the problem is when someone has heart complaints? Our research examined this by testing an AI program called ChatGPT-4.0 on clinical cases. We wanted to see if it could help doctors by giving good advice on what might be wrong with patients who have heart issues and what should be done to help them. To test this, we used ChatGPT-4.0 to look at 20 different stories about patients with heart problems. These stories were written to cover a variety of common heart conditions that heart doctors face. Then, we asked 18 heart doctors to check if the advice from ChatGPT-4.0 was good and made sense. What we found was quite interesting! Most of the time, the doctors agreed that the computer gave good advice on what might be wrong with the patients and how to help them. This means that this smart computer program could be a helpful tool for doctors, especially when they are trying to figure out tricky heart problems. But it's important to say that computers like ChatGPT-4.0 are not ready to replace doctors. They are tools that can offer suggestions. Doctors still need to use their knowledge and experience to make the final call on what's best for their patients. In simple terms, our study shows that with more development and testing, AI like ChatGPT-4.0 could be a helpful assistant to doctors in treating heart disease, helping patients get the best care possible.

MeSH terms

  • Cardiology* / methods
  • Clinical Decision-Making / methods
  • Diagnosis, Differential
  • Female
  • Heart Diseases / diagnosis
  • Heart Diseases / therapy
  • Humans
  • Male
  • Middle Aged