Skip to yearly menu bar Skip to main content


Poster

Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding

Zining Wang ⋅ Tongkun Guan ⋅ Pei Fu ⋅ Chen Duan ⋅ Qianyi Jiang ⋅ Zhentao Guo ⋅ Shan Guo ⋅ Junfeng Luo ⋅ Wei Shen ⋅ Xiaokang Yang
2025 Poster

Abstract

Chat is not available.