Omission and hallucination prevalence of clinical guidelines in diagnostic large language model outputs

Objective: Meaningful assessments of how large language models (LLMs) incorporate clinical guidelines require large-scale testing over many queries. Here, we evaluate the prevalence of clinical guideline omissions and hallucinati... ...

请注册登录后继续浏览