首页 正文

Journal of the American Medical Informatics Association : JAMIA. 2021 Dec 28;29(1):52-61. doi: 10.1093/jamia/ocab222 Q14.72024

A cost-effective chart review sampling design to account for phenotyping error in electronic health records (EHR) data

一种经济有效的图表审查抽样设计,以解决电子健康记录(EHR)数据中的表型误差问题 翻译改进

Ziyan Yin  1, Jiayi Tong  2, Yong Chen  2, Rebecca A Hubbard  2, Cheng Yong Tang  1

作者单位 +展开

作者单位

  • 1 Department of Statistical Science, Temple University, Philadelphia, Pennsylvania, USA.
  • 2 Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, Pennsylvania, USA.
  • DOI: 10.1093/jamia/ocab222 PMID: 34718618

    摘要 Ai翻译

    Objectives: Electronic health records (EHR) are commonly used for the identification of novel risk factors for disease, often referred to as an association study. A major challenge to EHR-based association studies is phenotyping error in EHR-derived outcomes. A manual chart review of phenotypes is necessary for unbiased evaluation of risk factor associations. However, this process is time-consuming and expensive. The objective of this paper is to develop an outcome-dependent sampling approach for designing manual chart review, where EHR-derived phenotypes can be used to guide the selection of charts to be reviewed in order to maximize statistical efficiency in the subsequent estimation of risk factor associations.

    Materials and methods: After applying outcome-dependent sampling, an augmented estimator can be constructed by optimally combining the chart-reviewed phenotypes from the selected patients with the error-prone EHR-derived phenotype. We conducted simulation studies to evaluate the proposed method and applied our method to data on colon cancer recurrence in a cohort of patients treated for a primary colon cancer in the Kaiser Permanente Washington (KPW) healthcare system.

    Results: Simulations verify the coverage probability of the proposed method and show that, when disease prevalence is less than 30%, the proposed method has smaller variance than an existing method where the validation set for chart review is uniformly sampled. In addition, from design perspective, the proposed method is able to achieve the same statistical power with 50% fewer charts to be validated than the uniform sampling method, thus, leading to a substantial efficiency gain in chart review. These findings were also confirmed by the application of the competing methods to the KPW colon cancer data.

    Discussion: Our simulation studies and analysis of data from KPW demonstrate that, compared to an existing uniform sampling method, the proposed outcome-dependent method can lead to a more efficient chart review sampling design and unbiased association estimates with higher statistical efficiency.

    Conclusion: The proposed method not only optimally combines phenotypes from chart review with EHR-derived phenotypes but also suggests an efficient design for conducting chart review, with the goal of improving the efficiency of estimated risk factor associations using EHR data.

    Keywords: association study; augmented estimation; cost-effective chart review; outcome-dependent sampling.

    Keywords:electronic health records; phenotyping error; sampling design成本效益; chart review sampling设计

    Copyright © Journal of the American Medical Informatics Association : JAMIA. 中文内容为AI机器翻译,仅供参考!

    相关内容

    期刊名:Journal of the american medical informatics association

    缩写:J AM MED INFORM ASSN

    ISSN:1067-5027

    e-ISSN:1527-974X

    IF/分区:4.7/Q1

    文章目录 更多期刊信息

    全文链接
    引文链接
    复制
    已复制!
    推荐内容
    A cost-effective chart review sampling design to account for phenotyping error in electronic health records (EHR) data