Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review

Background: The large language models (LLMs), most notably ChatGPT, released since November 30, 2022, have prompted shifting attention to their use in medicine, particularly for supporting clinical decision-making. However, there... ...

请注册登录后继续浏览