Early-stage medical AI struggles to diagnose, study finds

TL;DR Summary
A Jama Network Open study testing 21 large language models across 29 clinical vignettes finds AI chatbots fail to propose multiple differential diagnoses when patient information is incomplete, with failure rates over 80% for differential diagnoses; accuracy improves with more complete data, but the results underscore that AI should support—not replace—clinical judgment, especially in early, uncertain cases.
Topics:business#artificial-intelligence#clinical-decision-support#healthcare#large-language-models#medical-diagnosis
- AI chatbots misdiagnose in over 80% of early medical cases, study finds Financial Times
- How reliable is medical advice from ChatGPT and other chatbots? New Mass General Brigham study gives answers. The Boston Globe
- Using AI for health questions? Here are 4 tips for the most accurate answers. Mashable
- Ready or Not, LLMs Are Coming for Medicine Medscape
- The ChatGPT Symptom Spiral The Atlantic
Reading Insights
Total Reads
0
Unique Readers
29
Time Saved
4 min
vs 5 min read
Condensed
93%
855 → 57 words
Want the full story? Read the original article
Read on Financial Times