Skip to yearly menu bar Skip to main content


Poster

Scenes as Tokens: Multi-Scale Normal Distributions Transform Tokenizer for General 3D Vision–Language Understanding

Yutao Tang ⋅ Cheng Zhao ⋅ Gaurav Mittal ⋅ Rohith Kukkala ⋅ Rama Chellappa ⋅ Cheng Peng ⋅ Mei Chen

Abstract

Log in and register to view live content