Omission and hallucination prevalence of clinical guidelines in diagnostic large language model outputs
{{output}}
Objective: Meaningful assessments of how large language models (LLMs) incorporate clinical guidelines require large-scale testing over many queries. Here, we evaluate the prevalence of clinical guideline omissions and hallucinati... ...