首页 正文

Journal of the American Medical Informatics Association : JAMIA. 2019 Mar 1;26(3):228-241. doi: 10.1093/jamia/ocy142 Q14.72024

Synthesizing electronic health records using improved generative adversarial networks

基于改进生成对抗网络的电子健康记录数据合成方法研究 翻译改进

Mrinal Kanti Baowaly  1  2, Chia-Ching Lin  3  4, Chao-Lin Liu  2, Kuan-Ta Chen  4

作者单位 +展开

作者单位

  • 1 Social Networks and Human-Centered Computing, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan.
  • 2 Department of Computer Science, National Chengchi University, Taipei, Taiwan.
  • 3 Graduate Institute of Electrical Engineering, National Taiwan University, Taipei, Taiwan.
  • 4 Institute of Information Science, Academia Sinica, Taipei, Taiwan.
  • DOI: 10.1093/jamia/ocy142 PMID: 30535151

    摘要 Ai翻译

    Objective: The aim of this study was to generate synthetic electronic health records (EHRs). The generated EHR data will be more realistic than those generated using the existing medical Generative Adversarial Network (medGAN) method.

    Materials and methods: We modified medGAN to obtain two synthetic data generation models-designated as medical Wasserstein GAN with gradient penalty (medWGAN) and medical boundary-seeking GAN (medBGAN)-and compared the results obtained using the three models. We used 2 databases: MIMIC-III and National Health Insurance Research Database (NHIRD), Taiwan. First, we trained the models and generated synthetic EHRs by using these three 3 models. We then analyzed and compared the models' performance by using a few statistical methods (Kolmogorov-Smirnov test, dimension-wise probability for binary data, and dimension-wise average count for count data) and 2 machine learning tasks (association rule mining and prediction).

    Results: We conducted a comprehensive analysis and found our models were adequately efficient for generating synthetic EHR data. The proposed models outperformed medGAN in all cases, and among the 3 models, boundary-seeking GAN (medBGAN) performed the best.

    Discussion: To generate realistic synthetic EHR data, the proposed models will be effective in the medical industry and related research from the viewpoint of providing better services. Moreover, they will eliminate barriers including limited access to EHR data and thus accelerate research on medical informatics.

    Conclusion: The proposed models can adequately learn the data distribution of real EHRs and efficiently generate realistic synthetic EHRs. The results show the superiority of our models over the existing model.

    Keywords:electronic health records

    Copyright © Journal of the American Medical Informatics Association : JAMIA. 中文内容为AI机器翻译,仅供参考!

    相关内容

    期刊名:Journal of the american medical informatics association

    缩写:J AM MED INFORM ASSN

    ISSN:1067-5027

    e-ISSN:1527-974X

    IF/分区:4.7/Q1

    文章目录 更多期刊信息

    全文链接
    引文链接
    复制
    已复制!
    推荐内容
    Synthesizing electronic health records using improved generative adversarial networks