首页 正文

Journal of pediatric surgery. 2024 May;59(5):783-790. doi: 10.1016/j.jpedsurg.2024.01.033 Q22.42024

Using Artificial Intelligence to Label Free-Text Operative and Ultrasound Reports for Grading Pediatric Appendicitis

使用人工智能标记自由文本的手术和超声波报告以对儿童阑尾炎进行分级 翻译改进

Waseem Abu-Ashour  1, Sherif Emil  2, Dan Poenaru  2

作者单位 +展开

作者单位

  • 1 Harvey E. Beardmore Division of Pediatric Surgery, The Montreal Children's Hospital, McGill University Health Centre, Montreal, Quebec, Canada; McGill University Health Centre Research Institute, Montreal, Quebec, Canada. Electronic address: waseem.abu-ashour@muhc.mcgill.ca.
  • 2 Harvey E. Beardmore Division of Pediatric Surgery, The Montreal Children's Hospital, McGill University Health Centre, Montreal, Quebec, Canada; McGill University Health Centre Research Institute, Montreal, Quebec, Canada.
  • DOI: 10.1016/j.jpedsurg.2024.01.033 PMID: 38383177

    摘要 Ai翻译

    Purpose: Data science approaches personalizing pediatric appendicitis management are hampered by small datasets and unstructured electronic medical records (EMR). Artificial intelligence (AI) chatbots based on large language models can structure free-text EMR data. We compare data extraction quality between ChatGPT-4 and human data collectors.

    Methods: To train AI models to grade pediatric appendicitis preoperatively, several data collectors extracted detailed preoperative and operative data from 2100 children operated for acute appendicitis. Collectors were trained for the task based on satisfactory Kappa scores. ChatGPT-4 was prompted to structure free text from 103 random anonymized ultrasound and operative records in the dataset using the set variables and coding options, and to estimate appendicitis severity grade from the operative report. A pediatric surgeon then adjudicated all data, identifying errors in each method.

    Results: Within the 44 ultrasound (42.7%) and 32 operative reports (31.1%) discordant in at least one field, 98% of the errors were found in the manual data extraction. The appendicitis grade was erroneously assigned manually in 29 patients (28.2%), and by ChatGPT-4 in 3 (2.9%). Across datasets, the use of the AI chatbot was able to avoid misclassification in 59.2% of the records including both reports and extracted data approximately 40 times faster.

    Conclusion: AI chatbot significantly outperformed manual data extraction in accuracy for ultrasound and operative reports, and correctly assigned the appendicitis grade. While wider validation is required and data safety concerns must be addressed, these AI tools show significant promise in improving the accuracy and efficiency of research data collection.

    Levels of evidence: Level III.

    Keywords: Appendicitis grade; Artificial intelligence; Comparative study; Diagnosis; Pediatric appendicitis.

    Keywords:Artificial Intelligence; Free-Text Operative; Ultrasound Reports; Grading Pediatric Appendicitis

    Copyright © Journal of pediatric surgery. 中文内容为AI机器翻译,仅供参考!

    相关内容

    期刊名:Journal of pediatric surgery

    缩写:J PEDIATR SURG

    ISSN:0022-3468

    e-ISSN:1531-5037

    IF/分区:2.4/Q2

    文章目录 更多期刊信息

    全文链接
    引文链接
    复制
    已复制!
    推荐内容
    Using Artificial Intelligence to Label Free-Text Operative and Ultrasound Reports for Grading Pediatric Appendicitis