
Data mining and knowledge discovery. 2024;38(3):813-839. doi: 10.1007/s10618-023-00979-9

SOMTimeS: self organizing maps for time series clustering and its application to serious illness conversations


Ali Javed 1,2, Donna M Rizzo 2,3, Byung Suk Lee 2, Robert Gramling 4

Author affiliations

  • 1 Department of Medicine, Stanford University, 300 Pasteur Dr, Stanford, CA 94305 USA.
  • 2 Department of Computer Science, University of Vermont, Burlington, VT USA.
  • 3 Department of Civil and Environmental Engineering, University of Vermont, Burlington, VT USA.
  • 4 Department of Family Medicine, University of Vermont, Burlington, VT USA.
  • DOI: 10.1007/s10618-023-00979-9 PMID: 38711534

    Abstract

    There is demand for scalable algorithms capable of clustering and analyzing large time series data. The Kohonen self-organizing map (SOM) is an unsupervised artificial neural network for clustering, visualizing, and reducing the dimensionality of complex data. Like all clustering methods, it requires a measure of similarity between input data (in this work, time series). Dynamic time warping (DTW) is one such measure, and a top performer that accommodates distortions when aligning time series. Despite its popularity in clustering, DTW is limited in practice because its runtime complexity is quadratic in the length of the time series. To address this, we present a new self-organizing map for clustering TIME Series, called SOMTimeS, which uses DTW as the distance measure. The method achieves accuracy similar to that of other DTW-based clustering algorithms, yet scales better and runs faster. The computational performance stems from the pruning of unnecessary DTW computations during the SOM's training phase. For comparison, we implement a similar pruning strategy for K-means, and call the latter K-TimeS. SOMTimeS and K-TimeS pruned 43% and 50% of the total DTW computations, respectively. Pruning effectiveness, accuracy, execution time and scalability are evaluated using 112 benchmark time series datasets from the UC Riverside classification archive. The results show, for similar accuracy, an average speed-up of 1.8× for SOMTimeS and K-TimeS, with rates varying between 1× and 18× depending on the dataset. We also apply SOMTimeS to a healthcare study of patient-clinician serious illness conversations to demonstrate the algorithm's utility with complex, temporally sequenced natural language.
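
    The pruning idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which bounds SOMTimeS uses, so the code below is an assumption-laden illustration rather than the authors' method: it swaps in a simple first/last-point lower bound on DTW, and the names `dtw_distance`, `first_last_lower_bound`, and `best_matching_unit` are hypothetical. It shows the quadratic DTW recurrence and how a cheap lower bound can skip full DTW calls when searching for the closest SOM unit, without changing the result.

```python
from math import inf


def dtw_distance(a, b):
    """Unconstrained DTW with absolute-difference cost: O(len(a) * len(b)) time."""
    n, m = len(a), len(b)
    prev = [inf] * (m + 1)
    prev[0] = 0.0  # empty prefix aligned with empty prefix
    for i in range(1, n + 1):
        curr = [inf] * (m + 1)
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible predecessor cells.
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[m]


def first_last_lower_bound(a, b):
    """Cheap O(1) lower bound on dtw_distance (an assumed, illustrative bound).

    Every warping path must align the two first points and the two last
    points, so their costs are always part of the total.
    """
    if len(a) == 1 and len(b) == 1:
        return abs(a[0] - b[0])  # first and last cell coincide
    return abs(a[0] - b[0]) + abs(a[-1] - b[-1])


def best_matching_unit(series, units):
    """Return (index, distance, dtw_calls) for the unit closest to `series`.

    A unit is skipped when its lower bound already meets or exceeds the best
    distance found so far, because its true DTW distance cannot be smaller.
    """
    best_index, best_dist, dtw_calls = -1, inf, 0
    for index, unit in enumerate(units):
        if first_last_lower_bound(series, unit) >= best_dist:
            continue  # pruned: the full DTW computation is unnecessary
        dtw_calls += 1
        dist = dtw_distance(series, unit)
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index, best_dist, dtw_calls


if __name__ == "__main__":
    sample = [0, 1, 2, 1, 0]
    weights = [[0, 1, 2, 1, 0], [5, 5, 5, 5, 5], [0, 2, 0, 2, 0]]
    print(best_matching_unit(sample, weights))
```

    Because the bound never exceeds the true DTW distance, the pruned search returns the same unit as an exhaustive one; only the number of full DTW evaluations changes. How much is saved depends on how tight the bound is and on the order in which units are visited.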

    Supplementary information: The online version contains supplementary material available at 10.1007/s10618-023-00979-9.

    Keywords: Clustering; Dynamic time warping; Self-organizing maps; Serious illness conversations; Time series clustering.


    Copyright © Data mining and knowledge discovery.

    Related content

    Journal: Data mining and knowledge discovery

    Abbreviation: DATA MIN KNOWL DISC

    ISSN: 1384-5810

    e-ISSN: 1573-756X

    IF/Quartile: 2.8/Q2
