Performance and reliability of large language models on the European Board of Hand Surgery examination: a multi-model evaluation study
Introduction: Artificial intelligence (AI) has demonstrated transformative potential in medical education and assessment, with large language models achieving competitive results across multiple high-stakes examinations. In this ... ...