Skip to yearly menu bar Skip to main content


Poster

TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model

Cheng Yang ⋅ Yang Sui ⋅ Jinqi Xiao ⋅ Lingyi Huang ⋅ Yu Gong ⋅ Chendi Li ⋅ Jinghua Yan ⋅ Yu Bai ⋅ Ponnuswamy Sadayappan ⋅ Xia Hu ⋅ Bo Yuan
2025 Poster

Abstract

Chat is not available.