Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review
Background: The release of large language models (LLMs) since November 30, 2022, most notably ChatGPT, has shifted attention to their use in medicine, particularly for supporting clinical decision-making. However, there... ...