Skip to yearly menu bar Skip to main content


Poster Fri, Jun 13, 2025 • 8:30 AM – 10:30 AM PDT

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

Mahtab Bigverdi · Zelun Luo · Cheng-Yu Hsieh · Ethan Shen · Dongping Chen · Linda Shapiro · Ranjay Krishna

Abstract

Chat is not available.