AdaSVD: Singular Value Decomposition with Adaptive Mechanisms for Large Multimodal Models
Abstract
Large Multimodal Models (LMMs) have attained impressive achievements in multimodal processing tasks, yet their massive memory demands pose major obstacles to deployment on resource-limited devices. Singular Value Decomposition (SVD) has emerged as a promising compression technique for LMMs, delivering substantial reductions in memory overhead. However, existing SVD-based methods often struggle to effectively alleviate the errors caused by SVD truncation, resulting in a noticeable performance gap when compared to the original models. Moreover, adopting a uniform compression ratio across all transformer layers fails to consider the varying importance of different layers. To tackle these challenges, we propose AdaSVD, an adaptive SVD-based LMM compression approach. Specifically, AdaSVD introduces adaComp, which adaptively compensates for SVD truncation errors by alternately updating the singular matricesand. Additionally, AdaSVD introduces adaCR, which adaptively assigns layer-specific compression ratios according to the relative importance of each layer. Comprehensive experiments across multiple LMM families show the effectiveness of AdaSVD, achieving better performance while significantly reducing memory requirements. We will make all the code and models of AdaSVD publicly available.