Poster

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

Mahtab Bigverdi ⋅ Zelun Luo ⋅ Cheng-Yu Hsieh ⋅ Ethan Shen ⋅ Dongping Chen ⋅ Linda Shapiro ⋅ Ranjay Krishna
2025 Poster

Abstract
