Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence

Abbreviation: IEEE T PATTERN ANAL

ISSN: 0162-8828

e-ISSN: 1939-3539

IF/Quartile: 18.6/Q1

Indexed articles from this journal: 6618
Juncheng Li, Minghe Gao, Xiangnan He et al.
Large Language Models (LLMs) exhibit remarkable proficiency in understanding and managing text-based tasks. Many works try to transfer these capabilities to the video domain; the resulting models are referred to as Video-LLMs. However, current Video-LLMs ...
Xun Jiang, Xing Xu, Zheng Wang et al.
Egocentric Task Verification (ETV) aims to determine if the operation flows of procedural tasks in egocentric videos align with the logic of given rules. Early works adopt the video-based verification paradigm that compares a reference vide...
Bingbing Jiang, Jie Wen, Zidong Wang et al.
Semi-supervised learning can leverage both labeled and unlabeled samples simultaneously to improve performance. However, existing methods often present the following issues: (1) The emphasis of learning is put on either the similarity struc...
Jinhui Yang, Ming Jiang, Qi Zhao
Large Vision-Language Models (LVLMs) with "multimodal distractibility," where plausible but irrelevant visual or textual inputs cause significant drops in reasoning consistency and lead to unreliable outputs. This paper introduces a compreh...
Zhendong Mao, Mengqi Huang, Yijing Lin et al.
Existing generative image transformers follow a two-stage generation paradigm, where the first stage learns a codebook to encode images into discrete codes via vector quantization, and the second stage completes the image generation based o...
Wu Wang, Liang-Jian Deng, Qi Cao et al.
The goal of a deep learning-based general image fusion method is to solve multiple image fusion tasks with a single model, thereby facilitating the deployment of models in practical applications. However, existing methods fail to provide an...
Penglei Wang, Jitao Lu, Danyang Wu et al.
Recently, Multi-View Graph Clustering (MVGC) methods have achieved significant progress, leading to their wide adoption in various applications. However, most MVGC methods merely pursue consistent information by simply fusing multi-view gra...
De Cheng, Yubo Li, Chaowei Fang et al.
Cloth-Changing Person Re-Identification (CC-ReID) aims to recognize individuals across camera views despite clothing variations, a crucial task for surveillance and security systems. Existing methods typically frame it as a cross-modal alig...
Fangjinhua Wang, Qingtian Zhu, Di Chang et al.
3D reconstruction aims to recover the dense 3D structure of a scene. It plays an essential role in various applications such as Augmented/Virtual Reality (AR/VR), autonomous driving and robotics. Leveraging multiple views of a scene capture...
Di Wu, Shihui Li, Yi He et al.
High-dimensional and incomplete (HDI) data are ubiquitous in various Big Data-related industrial applications, such as drug innovation and recommender systems. Hash learning is the most efficient representation learning approach to extract ...