Xinyi Wu,Cuiqun Chen,Hui Zeng et al.
Sketch-based Person Retrieval (SBPR) aims to identify and retrieve a target individual across non-overlapping camera views using professional sketches as queries. In practice, sketches drawn by different artists often present diverse painti...
DrivingEditor: 4D Composite Gaussian Splatting for Reconstruction and Edition of Dynamic Autonomous Driving Scenes
Wang Xu,Yeqiang Qian,Yun-Fu Liu et al.
In recent years, with the development of autonomous driving, 3D reconstruction for unbounded large-scale scenes has attracted researchers' attention. Existing methods have achieved outstanding reconstruction accuracy in autonomous driving s...
Nimrod Shabtay,Eli Schwartz,Raja Giryes
In Deep Image Prior (DIP), a Convolutional Neural Network (CNN) is fitted to map a latent space to a degraded (e.g., noisy) image, but in the process it learns to reconstruct the clean image. This phenomenon is attributed to the CNN's internal image...
Double Nonconvex Tensor Robust Kernel Principal Component Analysis and Its Visual Applications
Liang Wu,Jianjun Wang,Wei-Shi Zheng et al.
Tensor robust principal component analysis (TRPCA), as a popular linear low-rank method, has been widely applied to various visual tasks. The mathematical process of the low-rank prior is derived from the linear latent variable model. Howev...
Procedure-Aware Hierarchical Alignment for Open Surgery Video-Language Pretraining
Boqiang Xu,Jinlin Wu,Jian Liang et al.
Recent advances in surgical robotics and computer vision have greatly improved intelligent systems' autonomy and perception in the operating room (OR), especially in endoscopic and minimally invasive surgeries. However, for open surgery, wh...
Hao Yang,Yue Sun,Hui Xie et al.
The synthesis of computed tomography images can supplement electron density information and eliminate MR-CT image registration errors. Consequently, an increasing number of MR-to-CT image translation approaches are being proposed for MR-onl...
Foundation Model Empowered Real-Time Video Conference with Semantic Communications
Mingkai Chen,Wenbo Ma,Mujian Zeng et al.
With the development of real-time video conferences, interactive multimedia services have proliferated, leading to a surge in traffic. Interactivity is becoming one of the main features of future multimedia services, which brings a new challeng...
High-Confident Block Diagonal Analysis for Multi-View Palmprint Recognition in Unrestrained Environment
Shuping Zhao,Lunke Fei,Tingting Cai et al.
Unrestrained palmprint recognition refers to a comprehensive identity authentication technology that performs personal authentication based on palmprint images captured in uncontrolled environments, i.e., smartphone cameras, surveillan...
Improving Unsupervised Ultrasonic Image Anomaly Detection via Frequency-Spatial Feature Filtering and Gaussian Mixture Modeling
Wenjing Zhang,Ke Lu,Jinbao Wang et al.
Ultrasonic image anomaly detection faces significant challenges due to limited labeled data, strong structural and random noise, and highly diverse defect manifestations. To overcome these obstacles, we introduce UltraChip, a new large-scal...
Complementary Mixture-of-Experts and Complementary Cross-Attention for Single Image Reflection Separation in the Wild
Jonghyuk Park,Jae-Young Sim
Single Image Reflection Separation (SIRS) aims to reconstruct both the transmitted and reflected images from a single image that contains a superimposition of both, captured through a glass-like reflective surface. Recent learning-based met...