首页 正文

Bioinformatics (Oxford, England). 2024 Jan 2;40(1):btae020. doi: 10.1093/bioinformatics/btae020 Q15.42025

scMAE: a masked autoencoder for single-cell RNA-seq clustering

scMAE:单细胞RNA测序聚类的掩码自动编码器 翻译改进

Zhaoyu Fang  1, Ruiqing Zheng  1, Min Li  1

作者单位 +展开

作者单位

  • 1 School of Computer Science and Engineering, Central South University, 932 South Lushan Road, Yuelu District, Changsha 410083, China.
  • DOI: 10.1093/bioinformatics/btae020 PMID: 38230824

    摘要 Ai翻译

    Motivation: Single-cell RNA sequencing has emerged as a powerful technology for studying gene expression at the individual cell level. Clustering individual cells into distinct subpopulations is fundamental in scRNA-seq data analysis, facilitating the identification of cell types and exploration of cellular heterogeneity. Despite the recent development of many deep learning-based single-cell clustering methods, few have effectively exploited the correlations among genes, resulting in suboptimal clustering outcomes.

    Results: Here, we propose a novel masked autoencoder-based method, scMAE, for cell clustering. scMAE perturbs gene expression and employs a masked autoencoder to reconstruct the original data, learning robust and informative cell representations. The masked autoencoder introduces a masking predictor, which captures relationships among genes by predicting whether gene expression values are masked. By integrating this masking mechanism, scMAE effectively captures latent structures and dependencies in the data, enhancing clustering performance. We conducted extensive comparative experiments using various clustering evaluation metrics on 15 scRNA-seq datasets from different sequencing platforms. Experimental results indicate that scMAE outperforms other state-of-the-art methods on these datasets. In addition, scMAE accurately identifies rare cell types, which are challenging to detect due to their low abundance. Furthermore, biological analyses confirm the biological significance of the identified cell subpopulations.

    Availability and implementation: The source code of scMAE is available at: https://zenodo.org/records/10465991.

    Keywords:single-cell RNA-seq; masked autoencoder; clustering

    Copyright © Bioinformatics (Oxford, England). 中文内容为AI机器翻译,仅供参考!

    相关内容

    期刊名:Bioinformatics

    缩写:BIOINFORMATICS

    ISSN:1367-4803

    e-ISSN:1367-4811

    IF/分区:5.4/Q1

    文章目录 更多期刊信息

    全文链接
    引文链接
    复制
    已复制!
    推荐内容
    scMAE: a masked autoencoder for single-cell RNA-seq clustering