Zhihao Duan, Zhan Ma, Fengqing Zhu
Advances in both lossy image compression and semantic content understanding have been greatly fueled by deep learning techniques, yet these two tasks have been developed separately for the past decades. In this work, we address the problem ...
Liqi Yan, Siqi Ma, Qifan Wang et al.
Video captioning is a challenging task as it needs to accurately transform visual understanding into natural language description. To date, state-of-the-art methods inadequately model global-local vision representation for sentence generati...
Cohesive Multi-Modality Feature Learning and Fusion for COVID-19 Patient Severity Prediction
Jinzhao Zhou, Xingming Zhang, Ziwei Zhu et al.
The outbreak of coronavirus disease (COVID-19) has been a nightmare to citizens, hospitals, healthcare practitioners, and the economy in 2020. The overwhelming number of confirmed cases and suspected cases put forward an unprecedented chall...
Cong Shi, Gang Luo
This paper proposes a bio-inspired visual motion estimation algorithm based on motion energy, along with its compact very-large-scale integration (VLSI) architecture using low-cost embedded systems. The algorithm mimics motion perception fu...
Single image super-resolution via an iterative reproducing kernel Hilbert space method
Liang-Jian Deng, Weihong Guo, Ting-Zhu Huang
Image super-resolution, a process to enhance image resolution, has important applications in satellite imaging, high definition television, medical imaging, etc. Many existing approaches use multiple low-resolution images to recover one hig...
Structured Set Intra Prediction With Discriminative Learning in a Max-Margin Markov Network for High Efficiency Video Coding
Wenrui Dai, Hongkai Xiong, Xiaoqian Jiang et al.
This paper proposes a novel model on intra coding for High Efficiency Video Coding (HEVC), which simultaneously predicts blocks of pixels with optimal rate distortion. It utilizes the spatial statistical correlation for the optimal predicti...