Poster

Proxy3D: Efficient 3D Representations for Vision-Language Models via Semantic Clustering and Alignment

Jerry Jiang ⋅ Haowen Sun ⋅ Denis Gudovskiy ⋅ Yohei Nakata ⋅ Tomoyuki Okuno ⋅ Kurt Keutzer ⋅ Wenzhao Zheng

Abstract
