Skip to yearly menu bar Skip to main content


Poster

VinQA: Visual Elements Interleaved Long-form Answer Generation for Real-World Multimodal Document QA

Young Rok Jang ⋅ Hyesoo Kong ⋅ Kyunghwan An ⋅ Jae Sub Huh ⋅ Gyeonghun KIM ⋅ Stanley Jungkyu Choi

Abstract

Log in and register to view live content