Study: Off-the-Shelf LLMs Not Ready for Clinical Prime Time
Chatbots Getting Better Making Final Diagnoses, But Clinical Reasoning Still Weak
General purpose large language model chatbots are getting better at coming up with patients' final diagnoses but are still weak in clinical reasoning, including generating differential diagnoses to identify and rule out other potential conditions and causes of symptoms.
General purpose large language model chatbots are getting better at coming up with patients' final diagnoses but are still weak in clinical reasoning, including generating differential diagnoses to identify and rule out other potential conditions and causes of symptoms.