Skip to yearly menu bar Skip to main content


Poster

Don’t Show Pixels, Show Cues: Unlocking Visual Tool Reasoning in Language Models via Perception Programs

Muhammad Kamran Janjua ⋅ Hugo Silva ⋅ Di Niu ⋅ Bahador Rashidi

Abstract

Log in and register to view live content