AI-Generated Draft Replies Integrated Into Health Records and Physicians' Electronic Communication
- PMID: 38619840
- PMCID: PMC11019394
- DOI: 10.1001/jamanetworkopen.2024.6565
AI-Generated Draft Replies Integrated Into Health Records and Physicians' Electronic Communication
Abstract
Importance: Timely tests are warranted to assess the association between generative artificial intelligence (GenAI) use and physicians' work efforts.
Objective: To investigate the association between GenAI-drafted replies for patient messages and physician time spent on answering messages and the length of replies.
Design, setting, and participants: Randomized waiting list quality improvement (QI) study from June to August 2023 in an academic health system. Primary care physicians were randomized to an immediate activation group and a delayed activation group. Data were analyzed from August to November 2023.
Exposure: Access to GenAI-drafted replies for patient messages.
Main outcomes and measures: Time spent (1) reading messages, (2) replying to messages, (3) length of replies, and (4) physician likelihood to recommend GenAI drafts. The a priori hypothesis was that GenAI drafts would be associated with less physician time spent reading and replying to messages. A mixed-effects model was used.
Results: Fifty-two physicians participated in this QI study, with 25 randomized to the immediate activation group and 27 randomized to the delayed activation group. A contemporary control group included 70 physicians. There were 18 female participants (72.0%) in the immediate group and 17 female participants (63.0%) in the delayed group; the median age range was 35-44 years in the immediate group and 45-54 years in the delayed group. The median (IQR) time spent reading messages in the immediate group was 26 (11-69) seconds at baseline, 31 (15-70) seconds 3 weeks after entry to the intervention, and 31 (14-70) seconds 6 weeks after entry. The delayed group's median (IQR) read time was 25 (10-67) seconds at baseline, 29 (11-77) seconds during the 3-week waiting period, and 32 (15-72) seconds 3 weeks after entry to the intervention. The contemporary control group's median (IQR) read times were 21 (9-54), 22 (9-63), and 23 (9-60) seconds in corresponding periods. The estimated association of GenAI was a 21.8% increase in read time (95% CI, 5.2% to 41.0%; P = .008), a -5.9% change in reply time (95% CI, -16.6% to 6.2%; P = .33), and a 17.9% increase in reply length (95% CI, 10.1% to 26.2%; P < .001). Participants recognized GenAI's value and suggested areas for improvement.
Conclusions and relevance: In this QI study, GenAI-drafted replies were associated with significantly increased read time, no change in reply time, significantly increased reply length, and some perceived benefits. Rigorous empirical tests are necessary to further examine GenAI's performance. Future studies should examine patient experience and compare multiple GenAIs, including those with medical training.
Conflict of interest statement
Figures
Similar articles
-
Large Language Model-Based Responses to Patients' In-Basket Messages.JAMA Netw Open. 2024 Jul 1;7(7):e2422399. doi: 10.1001/jamanetworkopen.2024.22399. JAMA Netw Open. 2024. PMID: 39012633 Free PMC article.
-
Artificial Intelligence-Generated Draft Replies to Patient Inbox Messages.JAMA Netw Open. 2024 Mar 4;7(3):e243201. doi: 10.1001/jamanetworkopen.2024.3201. JAMA Netw Open. 2024. PMID: 38506805 Free PMC article.
-
Decoding medical educators' perceptions on generative artificial intelligence in medical education.J Investig Med. 2024 Oct;72(7):633-639. doi: 10.1177/10815589241257215. Epub 2024 Jun 7. J Investig Med. 2024. PMID: 38785310
-
Promises and challenges of generative artificial intelligence for human learning.Nat Hum Behav. 2024 Oct;8(10):1839-1850. doi: 10.1038/s41562-024-02004-5. Epub 2024 Oct 22. Nat Hum Behav. 2024. PMID: 39438686 Review.
-
Preliminary Evidence of the Use of Generative AI in Health Care Clinical Services: Systematic Narrative Review.JMIR Med Inform. 2024 Mar 20;12:e52073. doi: 10.2196/52073. JMIR Med Inform. 2024. PMID: 38506918 Free PMC article. Review.
Cited by
-
Can we ensure a safe and effective integration of language models in oncology?Lancet Reg Health Eur. 2024 Sep 20;46:101081. doi: 10.1016/j.lanepe.2024.101081. eCollection 2024 Nov. Lancet Reg Health Eur. 2024. PMID: 39381545 Free PMC article. No abstract available.
-
Large language models in biomedicine and health: current research landscape and future directions.J Am Med Inform Assoc. 2024 Sep 1;31(9):1801-1811. doi: 10.1093/jamia/ocae202. J Am Med Inform Assoc. 2024. PMID: 39169867 Free PMC article. No abstract available.
-
Prompt engineering with a large language model to assist providers in responding to patient inquiries: a real-time implementation in the electronic health record.JAMIA Open. 2024 Aug 20;7(3):ooae080. doi: 10.1093/jamiaopen/ooae080. eCollection 2024 Oct. JAMIA Open. 2024. PMID: 39166170 Free PMC article.
-
Large Language Model Influence on Management Reasoning: A Randomized Controlled Trial.medRxiv [Preprint]. 2024 Aug 7:2024.08.05.24311485. doi: 10.1101/2024.08.05.24311485. medRxiv. 2024. Update in: JAMA Netw Open. 2024 Oct 1;7(10):e2440969. doi: 10.1001/jamanetworkopen.2024.40969. PMID: 39148822 Free PMC article. Updated. Preprint.
-
Large Language Model-Based Responses to Patients' In-Basket Messages.JAMA Netw Open. 2024 Jul 1;7(7):e2422399. doi: 10.1001/jamanetworkopen.2024.22399. JAMA Netw Open. 2024. PMID: 39012633 Free PMC article.
References
-
- McClellan SR, Panattoni L, Chan AS, Tai-Seale M. Patient-initiated electronic messages and quality of care for patients with diabetes and hypertension in a large fee-for-service medical group: results from a natural experiment. Med Care. 2016;54(3):287-295. doi:10.1097/MLR.0000000000000483 - DOI - PubMed
