Poster

β-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment

Fatimah Zohra ⋅ Chen Zhao ⋅ Hani Itani ⋅ Bernard Ghanem

Abstract
