首页 正文

Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA

{{output}}
To increase the generalization capability of VQA systems, many recent studies have tried to de-bias spurious language or vision associations that shortcut the question or image to the answer. Despite these efforts, the literature fails to address the confoundi... ...