Poster

FAVE: A Structured Benchmark for Fine-Grained Audio-Visual Temporal Evaluation in Multimodal LLMs

Weiheng Lu ⋅ An Yu ⋅ Jian Li ⋅ Zhenfei Zhang ⋅ Felix X. Ye ⋅ Ming-Ching Chang

Abstract
