Skip to yearly menu bar Skip to main content


Poster

AXG-Reasoner: Error Detection and Explanation in Long Task Videos with Vision–Language Models

Shih-Po Lee ⋅ Ehsan Elhamifar

Abstract

Log in and register to view live content