(704 events)
The 2026 schedule is still incomplete
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 1
Evidential Neural Radiance Fields
Ruxiao Duan ⋅ Alex Wong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 2
Global-Aware Edge Prioritization for Pose Graph Initialization
Tong Wei ⋅ Giorgos Tolias ⋅ Jiri Matas ⋅ Daniel Barath
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 3
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
Christopher Clark ⋅ Jieyu Zhang ⋅ Zixian Ma ⋅ Jae Sung Park ⋅ Rohun Tripathi ⋅ Sangho Lee ⋅ Reza Salehi ⋅ Jason Ren ⋅ Chris Dongjoo Kim ⋅ Yinuo Yang ⋅ Vincent Shao ⋅ Yue Yang ⋅ Weikai Huang ⋅ Ziqi Gao ⋅ Taira Anderson ⋅ Jianrui Zhang ⋅ Jitesh Jain ⋅ George Stoica ⋅ Ali Farhadi ⋅ Ranjay Krishna
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 4
Optical Flow Matching: Reframing Optical Flow as Continuous Transport Dynamics
Ao Luo ⋅ XIN LI ⋅ Fan Yang ⋅ Yuezun Li ⋅ Zhaoquan Yuan ⋅ SHAN ZHAO ⋅ Bing Su ⋅ Xiao WU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 5
SEATrack: Simple, Efficient, and Adaptive Multimodal Tracker
Junbin Su ⋅ Ziteng Xue ⋅ Shihui Zhang ⋅ Kun Chen ⋅ Weiming Hu ⋅ Zhipeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 6
U^2Flow: Uncertainty-Aware Unsupervised Optical Flow Estimation
Xunpei Sun ⋅ Wenwei Lin ⋅ Yi Chang ⋅ Gang Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 7
AToken: A Unified Tokenizer for Vision
Jiasen Lu ⋅ Liangchen Song ⋅ Mingze Xu ⋅ Byeongjoo Ahn ⋅ Yanjun Wang ⋅ Chen Chen ⋅ Afshin Dehghan ⋅ Yinfei Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 8
Confusion-Aware Spectral Regularizer for Long-Tailed Recognition
Ziquan Zhu ⋅ Gaojie Jin ⋅ Hanruo Zhu ⋅ Si-Yuan Lu ⋅ Yunxiao Zhang ⋅ ZEYU FU ⋅ Ronghui Mu ⋅ Guoqiang Zhang ⋅ Zhao Sun ⋅ Yuhang Xia ⋅ Jiaxing Shang ⋅ Xiang Li ⋅ Lu Liu ⋅ Tianjin Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 9
Learning Latent Concepts for Detecting Out-of-Distribution Objects
Ting Peng ⋅ Junhao Dong ⋅ Yew-Soon Ong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 10
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
Jizhou Han ⋅ Chenhao Ding ⋅ Yuhang He ⋅ Qiang Wang ⋅ Shaokun Wang ⋅ SongLin Dong ⋅ Yihong Gong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 11
Understanding and Enforcing Weight Disentanglement in Task Arithmetic
Shangge Liu ⋅ Yuehan Yin ⋅ Lei Wang ⋅ Qi Fan ⋅ Yinghuan Shi ⋅ Wenbin Li ⋅ Yang Gao ⋅ Dacheng Tao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 12
Understanding Task Transfer in Vision-Language Models
Bhuvan Sachdeva ⋅ Karan Uppal ⋅ Abhinav Java ⋅ Vineeth Balasubramanian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 13
AT-VLA: Adaptive Tactile Injection for Enhanced Feedback Reaction in Vision-Language-Action Models
Xiaoqi Li ⋅ Muhe Cai ⋅ Jiadong Xu ⋅ Juan Zhu ⋅ Hongwei Fan ⋅ Yan Shen ⋅ Guangrui Ren ⋅ Hao Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 14
Learning Diffeomorphism for Medical Image Registration with Time-Embedded Architectures Using Semigroup Regularization
Mohammadjavad Matinkia ⋅ Nilanjan Ray
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 15
QuadSync: Quadrifocal Tensor Synchronization via Tucker Decomposition
Daniel Miao ⋅ Gilad Lerman ⋅ Joe Kileel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 16
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
Ziyi Chen ⋅ Yingnan Guo ⋅ Zedong Chu ⋅ Minghua Luo ⋅ Yanfen Shen ⋅ Mingchao Sun ⋅ Junjun Hu ⋅ Shichao Xie ⋅ Yang Kuan ⋅ Pei Shi ⋅ Zhining Gu ⋅ Lu Liu ⋅ Honglin Han ⋅ Xiaolong Wu ⋅ Mu Xu ⋅ Yu Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 17
Structural Action Transformer for 3D Dexterous Manipulation
Xiaohan Lei ⋅ Min Wang ⋅ Bohong Weng ⋅ Wengang Zhou ⋅ Houqiang Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 18
TESO: Online Tracking of Essential Matrix by Stochastic Optimization
Jaroslav Moravec ⋅ Radim Sara ⋅ Akihiro Sugimoto
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 19
BoostSLT: Boosting Sign Language Translation via a Plug-and-Play Diffusion-Based Semantic Enhancer
Changzhou Han ⋅ Wanlun Ma ⋅ XI TANG ⋅ Kun Hu ⋅ Sheng Wen ⋅ Yang Xiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 20
ImmerIris: A Large-Scale Dataset and Benchmark for Off-Axis and Unconstrained Iris Recognition in Immersive Applications
Yuxi Mi ⋅ Qiuyang Yuan ⋅ Zhizhou Zhong ⋅ Xuan Zhao ⋅ Jiaogen Zhou ⋅ Fubao Zhu ⋅ Jihong Guan ⋅ Shuigeng Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 21
OLATverse: A Large-scale Real-world Object Dataset with Precise Lighting Control
Xilong Zhou ⋅ Jianchun Chen ⋅ Pramod Rao ⋅ Timo Teufel ⋅ Linjie Lyu ⋅ Tigran Minasian ⋅ Oleksandr Sotnychenko ⋅ Xiaoxiao Long ⋅ Marc Habermann ⋅ Christian Theobalt
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 22
OpenDance: Multimodal Controllable 3D Dance Generation with Large-scale Internet Data
Jinlu Zhang ⋅ Zixi Kang ⋅ Libin Liu ⋅ Jianlong Chang ⋅ Qi Tian ⋅ Feng Gao ⋅ Yizhou Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 23
POLAR: A Portrait OLAT Dataset and Generative Framework for Illumination-Aware Face Modeling
Zhuo Chen ⋅ Chengqun Yang ⋅ Zhuo Su ⋅ Zheng Lv ⋅ Jingnan Gao ⋅ Xiaoyuan Zhang ⋅ Xiaokang Yang ⋅ Yichao Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 24
Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
Kunwar Maheep Singh ⋅ Jianchun Chen ⋅ Vladislav Golyanik ⋅ Stephan Garbin ⋅ Thabo Beeler ⋅ Rishabh Dabral ⋅ Marc Habermann ⋅ Christian Theobalt
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 25
Scaling View Synthesis Transformers
Evan Kim ⋅ Hyunwoo Ryu ⋅ Thomas W. Mitchel ⋅ Vincent Sitzmann
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 26
WildPose: A Unified Framework for Robust Pose Estimation in the Wild
Jianhao Zheng ⋅ Liyuan Zhu ⋅ Zihan Zhu ⋅ Iro Armeni
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 27
MoRe: Motion-aware Feed-forward 4D Reconstruction Transformer
Juntong Fang ⋅ Zequn Chen ⋅ Weiqi Zhang ⋅ Donglin Di ⋅ Xuancheng Zhang ⋅ Chengmin Yang ⋅ Yu-Shen Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 28
Revisiting Monocular SLAM with Spatio-Temporal Scene Modeling
Valter Piedade ⋅ Lalit Manam ⋅ Masashi Yamazaki ⋅ Pedro Miraldo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 29
Minimal Constraint Relaxation for Multiview Autocalibration
Norio Kosaka ⋅ Timothy Duff ⋅ Tomas Pajdla
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 30
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis
hongyuan chen ⋅ Xingyu Chen ⋅ Zexiang Xu ⋅ Anpei Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 31
GGPT: Geometry-Grounded Point Transformer
Yutong Chen ⋅ Yiming Wang ⋅ Xucong Zhang ⋅ Sergey Prokudin ⋅ Siyu Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 32
MERG3R: A Divide-and-Conquer Approach to Large-Scale Neural Visual Geometry
Leo Kaixuan Cheng ⋅ Abdus Shaikh ⋅ Ruofan Liang ⋅ Zhijie Wu ⋅ Yushi Guan ⋅ Nandita Vijaykumar
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 33
Unlocking the Power of Critical Factors for 3D Visual Geometry Estimation
Guangkai Xu ⋅ Hua Geng ⋅ Huanyi Zheng ⋅ Songyi Yin ⋅ Yanlong Sun ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 34
KV-Tracker: Real-Time Pose Tracking with Transformers
Marwan Taher ⋅ Ignacio Alzugaray ⋅ Kirill Mazur ⋅ Xin Kong ⋅ Andrew J. Davison
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 35
InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization
Daniel Gilo ⋅ Or Litany
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 36
From Rays to Projections: Better Inputs for Feed-Forward View Synthesis
Zirui Wu ⋅ Zeren Jiang ⋅ Martin R. Oswald ⋅ Jie Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 37
SLARM: Streaming and Language-Aligned Reconstruction Model for Dynamic Scenes
ZhiCheng Qiu ⋅ Jiarui Meng ⋅ Tong-an Luo ⋅ Yican Huang ⋅ Xuan Feng ⋅ Xuanfu Li ⋅ Zhan Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 38
Parallel Rigidity Matters for Bundle Adjustment
Lalit Manam ⋅ Venu Madhav Govindu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 39
Simple but Effective Triplet-Based Compression Strategies for Compact Visual Localization
Torsten Sattler ⋅ Zuzana Kukelova
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 40
VIAFormer: Voxel-Image Alignment Transformer for High-Fidelity Voxel Refinement
Tiancheng Fang ⋅ Bowen Pan ⋅ Lingxi Chen ⋅ Jiangjing Lyu ⋅ Chengfei Lv ⋅ Chaoyue Niu ⋅ Fan Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 41
Mining Attribute Subspaces for Efficient Fine-tuning of 3D Foundation Models
Yu Jiang ⋅ Hanwen Jiang ⋅ Ahmed Abdelkader ⋅ Wen-Sheng Chu ⋅ Brandon Y. Feng ⋅ Zhangyang Wang ⋅ Qixing Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 42
DualPrim: Compact 3D Reconstruction with Positive and Negative Primitives
Xiaoxu Meng ⋅ Zhongmin Chen ⋅ Bo Yang ⋅ Weikai Chen ⋅ Weixiao Liu ⋅ Lin Gao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 43
StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References
Boyu He ⋅ Yunfan Ye ⋅ Chang Liu ⋅ Weishang Wu ⋅ FANG LIU ⋅ Zhiping Cai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 44
DynFusion: Rethinking Condition Fusion for Adaptive Multi-Conditional Text-to-Image Generation
Zheng Fang ⋅ Lichuan Xiang ⋅ Xu Cai ⋅ Bing Wang ⋅ Bo Yang ⋅ Hongkai Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 45
Agentic Retoucher for Text-To-Image Generation
Shaocheng Shen ⋅ Jianfeng Liang ⋅ Chunlei Cai ⋅ Cong Geng ⋅ Huiyu Duan ⋅ Xiaoyun Zhang ⋅ Qiang Hu ⋅ Guangtao Zhai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 46
StyleDoctor: Towards Specialist Reward Model for Style-centric Generation Tasks
Xilin He ⋅ Xiaole Xian ⋅ Xiangyu Yue ⋅ Muhammad Haris Khan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 47
SwitchCraft: Training-Free Multi-Event Video Generation with Attention Controls
Qianxun Xu ⋅ Chenxi Song ⋅ Yujun Cai ⋅ Chi Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 48
Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation
Zihao Wang ⋅ Yuxiang Wei ⋅ Xinpeng Zhou ⋅ Tianyu Zhang ⋅ Tao Liang ⋅ Yalong Bai ⋅ Hongzhi Zhang ⋅ Wangmeng Zuo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 49
Paper2Figure: A Multi-Agent Collaborative System for Figure Generation Towards Academic Research Paper
Siwei Han ⋅ Haonian Ji ⋅ Siyang Xin ⋅ Juanquan Shi ⋅ Shi Qiu ⋅ Xinyu Ye ⋅ Peng Xia ⋅ Jiaqi Liu ⋅ Zhaorun Chen ⋅ Yiyang Zhou ⋅ Linjie Li ⋅ Lijuan Wang ⋅ Huaxiu Yao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 50
Adapting In-context Generation for Enhanced Composed Image Retrieval
Haiwen Li ⋅ Zining Chen ⋅ Delong Liu ⋅ Zhaohui Hou ⋅ Zhicheng Zhao ⋅ Fei Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 51
Transition Models: Rethinking the Generative Learning Objective
ZiDong Wang ⋅ Yiyuan Zhang ⋅ Xiaoyu Yue ⋅ Xiangyu Yue ⋅ Yangguang Li ⋅ Wanli Ouyang ⋅ Lei Bai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 52
Rethinking Glyph Spatial Information in Font Generation
Peng Su ⋅ Xi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 53
StreamDiT: Real-Time Streaming Text-to-Video Generation
Akio Kodaira ⋅ Tingbo Hou ⋅ Ji Hou ⋅ Markos Georgopoulos ⋅ Felix Juefei-Xu ⋅ Masayoshi Tomizuka ⋅ Yue Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 54
ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control
Shishi Xiao ⋅ Tongyu Zhou ⋅ David H. Laidlaw ⋅ Gromit Yeuk-Yin Chan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 55
Camera Control for Text-to-Image Generation via Learning Viewpoint Tokens
Xinxuan Lu ⋅ Charless Fowlkes ⋅ Alex Berg
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 56
3D Space as a Scratchpad for Editable Text-to-Image Generation
Oindrila Saha ⋅ Vojtech Krs ⋅ Radomir Mech ⋅ Subhransu Maji ⋅ Matheus Gadelha ⋅ Kevin Blackburn-Matzen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 57
Aligning Multi-Character Narrative Image Generation with Multi-Aspect Human Preferences
Ziyi Gao ⋅ Zhipeng Wei ⋅ Jingjing Chen ⋅ Stewart Tan ⋅ Hao li ⋅ Yi-Ping Phoebe Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 58
FoleyDirector: Directing Temporal Controllable Video-to-Audio Generation via Fine-Grained Temporal Scripts
You Li ⋅ Dewei Zhou ⋅ Fan Ma ⋅ Fu Li ⋅ Dongliang He ⋅ Yi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 59
DCoAR: Deep Concept Injection into Unified Autoregressive Models for Personalized Text-to-Image Generation
Fangtai Wu ⋅ Mushui Liu ⋅ Weijie He ⋅ Zhao Wang ⋅ Yunlong Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 60
DreamOmni2: Multimodal Instruction-based Generation and Editing
Bin Xia ⋅ Bohao Peng ⋅ Yuechen Zhang ⋅ Junjia Huang ⋅ Jiyang Liu ⋅ Jingyao Li ⋅ Haoru Tan ⋅ WU Sitong ⋅ Chengyao Wang ⋅ Yitong Wang ⋅ Bei Yu ⋅ Jiaya Jia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 61
AutoDebias: An Automated Framework for Detecting and Mitigating Backdoor Biases in Text-to-Image Models
Hongyi Cai ⋅ HONGYI CAI ⋅ MingKang Dong ⋅ Muxin Pu ⋅ Moayad Aloqaily ⋅ jie li ⋅ Xinfeng Li ⋅ Jialie Shen ⋅ Meikang Qiu ⋅ Qingsong Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 62
PosterIQ: A Design Perspective Benchmark for Poster Understanding and Generation
Yuheng Feng ⋅ Wen Zhang ⋅ Haodong Duan ⋅ Xingxing Zou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 63
IVAAN: Instance-level Vision-Language Alignment via Attribute-Guided Text Prompts Generation for Nuclei Analysis
Jaehoon Jeong ⋅ Yi Hu ⋅ Soopil Kim ⋅ Jongseong Jang ⋅ Soonyoung Lee ⋅ Sang Hyun Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 64
IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
Simone Magistri ⋅ Dipam Goswami ⋅ Marco Mistretta ⋅ Bartłomiej Twardowski ⋅ Joost van de Weijer ⋅ Andrew Bagdanov
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 65
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment
Bingyi Cao ⋅ Koert Chen ⋅ Kevis-kokitsi Maninis ⋅ Kaifeng Chen ⋅ Arjun Karpur ⋅ Ye Xia ⋅ Sahil Dua ⋅ Tanmaya Dabral ⋅ Guangxing Han ⋅ Bohyung Han ⋅ Joshua Ainslie ⋅ Alex Bewley ⋅ Mithun Jacob ⋅ René Wagner ⋅ Washington Ramos ⋅ Krzysztof Choromanski ⋅ Mojtaba Seyedhosseini ⋅ Howard Zhou ⋅ André Araujo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 66
BioVITA: Biological Dataset, Model, and Benchmark for Visual-Textual-Acoustic Alignment
Risa Shinoda ⋅ Kaede Shiohara ⋅ Nakamasa Inoue ⋅ Kuniaki Saito ⋅ Hiroaki Santo ⋅ Fumio Okura
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 67
Boosting Visual Reprogramming for CLIP with Dual Granularity Alignment
Jiayang Wu ⋅ Xinyang Chen ⋅ Ke Lv ⋅ Weili Guan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 68
Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning
Tingyu Li ⋅ Zheng Sun ⋅ Jingxuan Wei ⋅ Conghui He ⋅ Lijun Wu ⋅ Cheng Tan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 69
UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in RL
Rui Tian ⋅ Mingfei Gao ⋅ Haiming Gang ⋅ Jiasen Lu ⋅ Zhe Gan ⋅ Yinfei Yang ⋅ Zuxuan Wu ⋅ Afshin Dehghan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 70
PolySLGen: Online Multimodal Speaking-Listening Reaction Generation in Polyadic Interaction
Zhi-Yi Lin ⋅ Thomas Markhorst ⋅ Jouh Yeong Chew ⋅ Xucong Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 71
Label What Matters: Modality-Balanced and Difficulty-Aware Multimodal Active Learning
Yuqiao Zeng ⋅ Xu Wang ⋅ Tengfei Liang ⋅ Yiqing Hao ⋅ Yi Jin ⋅ Hui Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 72
Unified Personalized Understanding, Generating and Editing
Yu Zhong ⋅ Tianwei Lin ⋅ Ruike Zhu ⋅ Yuqian Yuan ⋅ Haoyu Zheng ⋅ Liang Liang ⋅ Wenqiao Zhang ⋅ Feifei Shao ⋅ Haoyuan Li ⋅ Wanggui He ⋅ Hao Jiang ⋅ Yueting Zhuang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 73
MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning
Chenglong Wang ⋅ Yifu Huo ⋅ Yang Gan ⋅ Qiaozhi He ⋅ Qi Meng ⋅ Bei Li ⋅ Yan Wang ⋅ Junfu Liu ⋅ Tianjua Zhou ⋅ JingBo Zhu ⋅ Tong Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 74
Towards Uncertainty-aware Unsupervised Domain Adaptation for Videos and Time-Series with Causal Optimal Transport
Khushboo Mishra ⋅ Varun Trivedi ⋅ Tanima Dutta
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 75
Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection
Sairam Rebbapragada ⋅ Rishabh Lalla ⋅ Aveen Dayal ⋅ Tejal Kulkarni ⋅ Anuj Lalla ⋅ Vineeth Balasubramanian ⋅ Muhammad Haris Khan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 76
Decision Boundary-aware Generation for Long-tailed Learning
jiacheng yang ⋅ Ruichi Zhang ⋅ Chikai Shang ⋅ Mengke Li ⋅ Xinyi Shang ⋅ Junlong Gao ⋅ Yonggang Zhang ⋅ Yang Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 77
Towards Stable Federated Continual Test-Time Adaptation in Wild World
Liwen Wang ⋅ Xingbo Dong ⋅ Yi Liao ⋅ Zhe Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 78
HyCal: A Training-Free Prototype Calibration Method for Cross-Discipline Few-Shot Class-Incremental Learning
Eunju Lee ⋅ MiHyeon Kim ⋅ Junehyoung Kwon ⋅ Yoonji Lee ⋅ JiHyun Kim ⋅ Soojin Jang ⋅ YoungBin Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 79
ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation
Bo Xu ⋅ Haotian Wu ⋅ Hehai Lin ⋅ Weiquan Huang ⋅ Beier Zhu ⋅ Yao Shu ⋅ Chengwei Qin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 80
CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection
Xinlin Zhuang ⋅ Yichen Li ⋅ Xiwei Liu ⋅ Haolin Yang ⋅ Yifan Lu ⋅ Ziyun Zou ⋅ Yulong Li ⋅ Huifa Li ⋅ Dongliang Chen ⋅ Qinglei Wang ⋅ Weiyang Liu ⋅ Ying Qian ⋅ Jiangming Shi ⋅ Imran Razzak
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 81
Addressing Exacerbated Attention Sink for Source-Free Cross-Domain Few-Shot Learning
Shuai Yi ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 82
Depth Hypothesis Guided Iterative Refinement for Event–Image Monocular Depth Estimation
Daikun Liu ⋅ Teng Wang ⋅ Changyin Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 83
High-Quality and Efficient Turbulence Mitigation with Events
Xiaoran Zhang ⋅ Jian Ding ⋅ Yuxing Duan ⋅ Haoyue Liu ⋅ Gang Chen ⋅ Yi Chang ⋅ Luxin Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 84
Tracking through Severe Occlusion via Event-Derived Transient Cues
Hao Dong ⋅ Yujin Liu ⋅ Haoyue Liu ⋅ Zhenyu Wang ⋅ Shihan Peng ⋅ Zhiwei Shi ⋅ Yi Chang ⋅ Luxin Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 85
FastEventDGS: Deformable Gaussian Splatting for Fast Dynamic Scenes from a Single Event Camera
Zijia Dai ⋅ Nico Messikommer ⋅ Rong Zou ⋅ Nikola Zubic ⋅ Davide Scaramuzza ⋅ Laurent Kneip
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 86
Event-Based Motion Deblurring Using Task-Oriented 3D Gaussian Event Representations
Shengdong Xue ⋅ Haoxiang Ma ⋅ Hao Chen ⋅ Zhen Yang ⋅ Yongjian Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 87
From Corners to Fiducial Tags: Revisiting Checkerboard Calibration for Event Cameras
Taehun Ryu ⋅ Changwoo Kang ⋅ Kyungdon Joo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 88
Extending Embodied Question Answering from Perception to Decision
Xicheng Gong ⋅ Qiwei Li ⋅ Peiran Xu ⋅ Yadong Mu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 89
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
Shaokai Wu ⋅ Yanbiao Ji ⋅ Qiuchang Li ⋅ Zhiyi Zhang ⋅ Qichen He ⋅ Wenyuan XIE ⋅ Guodong Zhang ⋅ Bayram Bayramli ⋅ Yue Ding ⋅ Hongtao Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 90
Demo2Tutorial: From Human Experience to Multimodal Software Tutorials
Zechen Bai ⋅ Zhiheng Chen ⋅ Yiqi Lin ⋅ Kevin Qinghong Lin ⋅ Difei Gao ⋅ Xiangwu Guo ⋅ Xin Wang ⋅ Mike Zheng Shou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 91
MaskDexGrasp: Generative Masked Modeling for Part-Aware Dexterous Grasp Synthesis
Binghui Zuo ⋅ Lin Zhou ⋅ Haoxuan Xu ⋅ Jianan Yan ⋅ ZhiPeng Yu ⋅ Zekai Liu ⋅ Yangang Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 92
Predict Before You Explore: Predictive Planning with Specialized Memory for Embodied Question Answering
Bowen Yuan ⋅ Sisi You ⋅ Bing-Kun Bao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 93
VideoWeaver: Multimodal Multi-View Video-to-Video Transfer for Embodied Agents
George Eskandar ⋅ Fengyi Shen ⋅ Mohammad Altillawi ⋅ Dong Chen ⋅ Yang Bai ⋅ Liudi Yang ⋅ Ziyuan Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 94
MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents
Ruoxuan Zhang ⋅ Qiyun Zheng ⋅ Zhiyu Zhou ⋅ Ziqi Liao ⋅ Siyu Wu ⋅ Jian-Yu Jiang-Lin ⋅ Bin Wen ⋅ Hongxia Xie ⋅ Jianlong Fu ⋅ Wen-Huang Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 95
Align While Search: Belief-Guided Exploratory Inference for World-Grounded Embodied Agents
Seohui Bae ⋅ Jeonghye Kim ⋅ Youngchul Sung ⋅ Woohyung Lim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 96
Rethinking Intermediate Representation for VLM-based Robot Manipulation
Weiliang Tang ⋅ Jialin Gao ⋅ Jia-Hui Pan ⋅ Gang Wang ⋅ Li Erran Li ⋅ Yun-Hui Liu ⋅ Mingyu Ding ⋅ Pheng-Ann Heng ⋅ Chi-Wing Fu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 97
Dexterous World Models
Byungjun Kim ⋅ Taeksoo Kim ⋅ Junyoung Lee ⋅ Hanbyul Joo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 98
FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-and-Language Navigation
Jing Zuo ⋅ Lingzhou Mu ⋅ Fan Jiang ⋅ Chengcheng Ma ⋅ Mu Xu ⋅ Yonggang Qi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 99
UniLight: A Unified Representation for Lighting
Zitian Zhang ⋅ Iliyan Georgiev ⋅ Michael Fischer ⋅ Yannick Hold-Geoffroy ⋅ Jean-François Lalonde ⋅ Valentin Deschaintre
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 100
MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Xinyu Wei ⋅ Kangrui Cen ⋅ Hongyang Wei ⋅ Zhen Guo ⋅ Bairui Li ⋅ Zeqing Wang ⋅ Jinrui Zhang ⋅ Lei Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 101
Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling
Minseok Seo ⋅ Mark Hamilton ⋅ Changick Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 102
Hist2Style: Histogram-Guided Stylization with Bilateral Grids
Dekel Galor ⋅ Adam Pikielny ⋅ Zhoutong Zhang ⋅ Ke Wang ⋅ Laura Waller ⋅ Jiawen Chen ⋅ Ilya Chugunov
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 103
Harmonic Canvas: Inversion-Free Editing for Visually-Guided Music Style Transfer
Yue Lei ⋅ Siqi Yang ⋅ Ting Zhong ⋅ Fan Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 104
How to Take a Memorable Picture? Empowering Users with Actionable Feedback
Francesco Laiti ⋅ Davide Talon ⋅ Jacopo Staiano ⋅ Elisa Ricci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 105
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying
Bai Chengyu ⋅ Jintao Chen ⋅ Xiang Bai ⋅ Yilong Chen ⋅ Qi She ⋅ Ming Lu ⋅ Shanghang Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 106
SCIEval: Evaluating and Benchmarking the Faithfulness of Scientific Image Generation and Interpretation with Large Multimodal Models
Guanghui Ye ⋅ Huan Zhao ⋅ Zhixue Zhao ⋅ Tengfei Ma ⋅ Kehan Wang ⋅ Steffen Eger ⋅ Zhihua Jiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 107
GeoRelight: Learning Joint Geometrical Reconstruction and Relighting with Flexible Multi-Modal Diffusion Transformers
Yuxuan Xue ⋅ Ruofan Liang ⋅ Egor Zakharov ⋅ Timur Bagautdinov ⋅ Chen Cao ⋅ Giljoo Nam ⋅ Shunsuke Saito ⋅ Gerard Pons-Moll ⋅ Javier Romero
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 108
HAD: Hallucination-Aware Diffusion Priors for 3D Reconstruction
Xi Liu ⋅ Weiwei Sun ⋅ Joe Ren ⋅ Christopher Broaddus ⋅ Siyu Huang ⋅ Laurent Guigues
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 109
Catalyst4D: High-Fidelity 3D-to-4D Scene Editing via Dynamic Propagation
Shifeng Chen ⋅ Yihui Li ⋅ Jun Liao ⋅ Hongyu Yang ⋅ Di Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 110
ReFlow: Self-correction Motion Learning for Dynamic Scene Reconstruction
Yanzhe Liang ⋅ Ruijie Zhu ⋅ Hanzhi Chang ⋅ Zhuoyuan Li ⋅ Jiahao Lu ⋅ Tianzhu Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 111
Semantic Foam: Unifying Spatial and Semantic Scene Decomposition
Amr Sharafeldin ⋅ Aryan Mikaeili ⋅ Thomas Walker ⋅ Shrisudhan Govindarajan ⋅ Daniel Rebain ⋅ Kwang Moo Yi ⋅ Andrea Tagliasacchi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 112
NVGS: Neural Visibility for Occlusion Culling in 3D Gaussian Splatting
Brent Zoomers ⋅ Florian Hahlbohm ⋅ Joni Vanherck ⋅ Lode Jorissen ⋅ Marcus Magnor ⋅ Nick Michiels
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 113
NeAR: Coupled Neural Asset–Renderer Stack
Hong Li ⋅ Chongjie Ye ⋅ Houyuan Chen ⋅ Weiqing Xiao ⋅ Ziyang Yan ⋅ Lixing Xiao ⋅ Zhaoxi Chen ⋅ Jianfeng XIANG ⋅ Shaocong Xu ⋅ Xuhui Liu ⋅ Yikai Wang ⋅ Baochang Zhang ⋅ Xiaoguang Han ⋅ Jiaolong Yang ⋅ Hao Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 114
Thermal is Always Wild: Characterizing and Addressing Challenges in Thermal-Only Novel View Synthesis
M. Kerem Aydin ⋅ Vishwanath Saragadam ⋅ Emma Alexander
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 115
PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis
chunji lv ⋅ Zequn Chen ⋅ Donglin Di ⋅ Weinan Zhang ⋅ Hao Li ⋅ Wei Chen ⋅ Yinjie Lei ⋅ Changsheng Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 116
Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling
Tang Long ⋅ Huiyu Duan ⋅ Guoquan Zheng ⋅ Jianbo Zhang ⋅ Jie Hao ⋅ Liang Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 117
TM-BSN: Triangular-Masked Blind-Spot Network for Real-World Self-Supervised Image Denoising
Junyoung Park ⋅ Youngjin Oh ⋅ Nam Ik Cho
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 118
Multinex: Lightweight Low-light Image Enhancement via Multi-prior Retinex
Alexandru Brateanu ⋅ Tingting Mu ⋅ Codruta O. Ancuti ⋅ Cosmin Ancuti
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 119
Beyond Ground-Truth: Leveraging Image Quality Priors for Real-World Image Restoration
Fengyang Xiao ⋅ Peng Hu ⋅ Lei Xu ⋅ XingE Guo ⋅ Guanyi Qin ⋅ Yuqi Shen ⋅ Chengyu Fang ⋅ Rihan Zhang ⋅ Chunming He ⋅ Sina Farsiu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 120
ExpoCM: Exposure-Aware One-Step Generative Single-Image HDR Reconstruction
Aoyu Liu ⋅ Zhen Liu ⋅ Ziyi Wang ⋅ Dian Chen ⋅ Bing Zeng ⋅ Shuaicheng Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 121
Physically-Grounded Turbulence Mitigation with Frame-Shared Degradation Parameters
Dongxin Xie ⋅ Yan Huang ⋅ Yong Xu ⋅ Hui Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 122
Convexity-Aware Noise Calibration: A Self-Supervised Framework for Noise-Level-Unknown Image Denoising
Zhan Wang ⋅ Wang Leiquan ⋅ Chunlei Wu ⋅ Yu Meng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 123
UCMNet: Uncertainty-Aware Context Memory Network for Under-Display Camera Image Restoration
DAEHYUN KIM ⋅ Youngmin Kim ⋅ Yoon Ju Oh ⋅ Tae Hyun Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 124
Beyond the Ground Truth: Enhanced Supervision for Image Restoration
Donghun Ryou ⋅ Inju Ha ⋅ Sanghyeok Chu ⋅ Bohyung Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 125
ShiftLUT: Spatial Shift Enhanced Look-Up Tables for Efficient Image Restoration
ZENG XIAOLONG ⋅ Yitong Yu ⋅ Shiyao Xiong ⋅ Jinhua Hao ⋅ Ming Sun ⋅ Chao Zhou ⋅ Bin Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 126
Bilevel Layer-Positioning LoRA for Real Image Dehazing
Yan Zhang ⋅ Long Ma ⋅ Yuxin Feng ⋅ Zhe Huang ⋅ Fan Zhou ⋅ Zhuo Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 127
SD-FSMIS: Adapting Stable Diffusion for Few-Shot Medical Image Segmentation
Meihua Li ⋅ Yang Zhang ⋅ Weizhao He ⋅ Hu Qu ⋅ Yisong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 128
GeoSemba: Reconstructing State Space Model for Cross Paradigm Representation in Medical Image Segmentation
Xutao Sun ⋅ Jiarui Li ⋅ Junwen Liu ⋅ Yonggong Ren
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 129
SHAPE: Structure-aware Hierarchical Unsupervised Domain Adaptation with Plausibility Evaluation for Medical Image Segmentation
Linkuan Zhou ⋅ Yinghao Xia ⋅ Yufei Shen ⋅ Xiangyu Li ⋅ Wenjie Du ⋅ Cong Cong ⋅ leyi wei ⋅ Ran Su ⋅ Qiangguo Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 130
Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models
Ruiyang Li ⋅ Fang Liu ⋅ Licheng Jiao ⋅ Xinglin Xie ⋅ Jiayao Hao ⋅ Shuo Li ⋅ Xu Liu ⋅ Jingyi yang ⋅ Lingling Li ⋅ Puhua Chen ⋅ Wenping Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 131
Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification
Han Liu ⋅ Bogdan Georgescu ⋅ Yanbo Zhang ⋅ Youngjin Yoo ⋅ Michael Baumgartner ⋅ Riqiang Gao ⋅ Jianing Wang ⋅ Gengyan Zhao ⋅ Eli Gibson ⋅ Dorin Comaniciu ⋅ Sasa Grbic
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 132
Focus on Background: Exploring SAM's Potential in Few-shot Medical Image Segmentation with Background-centric Prompting
Yuntian Bo ⋅ Yazhou Zhu ⋅ Piotr Koniusz ⋅ Haofeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 133
Simple-ViLMedSAM: Simple Text Prompts Meet Vision-Language Models for Medical Image Segmentation
Chengcan Qian ⋅ Dong Nie ⋅ Geng Chen ⋅ Daoqiang Zhang ⋅ Xuyun Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 134
NeuroSeg Meets DINOv3: Transferring 2D Self-Supervised Visual Priors to 3D Neuron Segmentation via DINOv3 Initialization
Yik San Cheng ⋅ Runkai Zhao ⋅ Weidong Cai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 135
Multi-Paradigm Collaborative Adversarial Attack Against Multi-Modal Large Language Models
Yuanbo Li ⋅ Tianyang Xu ⋅ Cong Hu ⋅ Tao Zhou ⋅ Xiao-Jun Wu ⋅ Josef Kittler
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 136
TINA: Text-Free Inversion Attack for Unlearned Text-to-Image Diffusion Models
Qianlong Xiang ⋅ Miao Zhang ⋅ Haoyu Zhang ⋅ Kun Wang ⋅ Junhui Hou ⋅ Liqiang Nie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 137
Jailbreaking Vision-Language Models via Dissonance-Guided Suffix Optimization and Image–Phrase Injection
Jiacheng Pi ⋅ Zhiguo Yang ⋅ Xingxing Huang ⋅ Dongsheng Xu ⋅ Ruizhi Zhong ⋅ Wenjie Ruan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 138
BlackMirror: Black-Box Backdoor Detection for Text-to-Image Models via Instruction-Response Deviation
Feiran Li ⋅ Qianqian Xu ⋅ Shilong Bao ⋅ Zhiyong Yang ⋅ Xilin Zhao ⋅ Xiaochun Cao ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 139
VCP-Attack: Visual-Contrastive Projection for Transferable Black-Box Targeted Attacks on Large Vision-Language Models
Jiawei Zhao ⋅ Minjie Du ⋅ Zihan Qin ⋅ Zhuoran Wang ⋅ Lizhe Xie ⋅ Yining Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 140
Adapter Shield: A Unified Framework with Built-in Authentication for Preventing Unauthorized Zero-Shot Image-to-Image Generation
Jun Jia ⋅ Hongyi Miao ⋅ Yingjie Zhou ⋅ Wangqiu Zhou ⋅ Jianbo Zhang ⋅ Linhan Cao ⋅ Dandan Zhu ⋅ Hua Yang ⋅ Xiongkuo Min ⋅ Wei Sun ⋅ Guangtao Zhai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 141
LLaVAShield: Safeguarding Multimodal Multi-Turn Dialogues in Vision-Language Models
Guolei Huang ⋅ Qinzhi Peng ⋅ Gan Xu ⋅ Yao Huang ⋅ Yuxuan Lu ⋅ Yongjun Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 142
Transform to Transfer: Boosting Adversarial Attack Transferability on Vision-Language Pre-training Models
Yang Li ⋅ Jia-Li Yin ⋅ Luojun Lin ⋅ Wei Lin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 143
Mask to Align, Weight to Disambiguate: Reliable Unsupervised Cross-Modal Hashing with Masked-Weight Contrast
Fan Yang ⋅ Yuanzhi Zhao ⋅ Haimei Zhao ⋅ Yudong Zhao ⋅ Haikun Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 144
Reliable Clustering Number Estimation for Contrastive Multi-View Clustering
Zhengzhong Zhu ⋅ Pei Zhou ⋅ Lanxi Bai ⋅ Li Cheng ⋅ Jia Nie ⋅ Shiquan min ⋅ Jiangping Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 145
Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning
Apoorv Vyas ⋅ Heng-Jui Chang ⋅ Cheng-Fu Yang ⋅ Po-Yao Huang ⋅ Luya Gao ⋅ Julius Richter ⋅ Sanyuan Chen ⋅ Matthew Le ⋅ Piotr Dollár ⋅ Christoph Feichtenhofer ⋅ Ann Lee ⋅ Wei-Ning Hsu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 146
Enhance-then-Balance Modality Collaboration for Robust Multimodal Sentiment Analysis
Kang He ⋅ Yuzhe Ding ⋅ Xinrong Wang ⋅ Fei Li ⋅ Chong Teng ⋅ Donghong Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 147
SonoWorld: From One Image to a 3D Audio-Visual Scene
Derong Jin ⋅ Xiyi Chen ⋅ Ming C. Lin ⋅ Ruohan Gao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 148
MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
yushi Huang ⋅ Zining Wang ⋅ Zhihang Yuan ⋅ Yifu Ding ⋅ RUIHAO GONG ⋅ Jinyang Guo ⋅ Xianglong Liu ⋅ Jun Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 149
EXOTIC: External Vision-driven Incomplete Multi-view Classification
Shilin Xu ⋅ Dezhong Peng ⋅ Zhenwen Ren ⋅ Yuan Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 150
Easy2Hard: From Partially to Fully Unmatched Modalities as Negative Samples in Contrastive Learning
Zhicheng Yang ⋅ Yichen Liu ⋅ Chang Ge ⋅ Xiaopeng Jiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 151
OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Han Li ⋅ Xinyu Peng ⋅ Yaoming Wang ⋅ Zelin Peng ⋅ Xin Chen ⋅ Rongxiang Weng ⋅ Jingang Wang ⋅ Xunliang Cai ⋅ Wenrui Dai ⋅ Hongkai Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 152
BALM: A Model-Agnostic Framework for Balanced Multimodal Learning under Imbalanced Missing Rates
Phuong-Anh Nguyen ⋅ Tien Anh Pham ⋅ Duc-Trong Le ⋅ Van Nguyen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 153
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling
Leon Liangyu Chen ⋅ Haoyu Ma ⋅ Zhipeng Fan ⋅ Ziqi Huang ⋅ Animesh Sinha ⋅ Xiaoliang Dai ⋅ Jialiang Wang ⋅ Zecheng He ⋅ Jianwei Yang ⋅ Chunyuan Li ⋅ Junzhe Sun ⋅ Chu Wang ⋅ Serena Yeung ⋅ Felix Juefei-Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 154
Multi-modal Test-time Adaptation via Adaptive Probabilistic Gaussian Calibration
Jinglin Xu ⋅ Yi Li ⋅ Chuxiong Sun ⋅ Xiao Xu ⋅ Jiangmeng Li ⋅ Fanjiang Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 155
Information-Theoretic Decomposition for Multimodal Interaction Learning
Zequn Yang ⋅ Yake Wei ⋅ HaoTian Ni ⋅ Zhihao Xu ⋅ Di Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 156
Is the Modality Gap a Bug or a Feature? A Robustness Perspective
Rhea Chowers ⋅ Oshri Naparstek ⋅ Udi Barzelay ⋅ Yair Weiss
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 157
Omni-Fake: Benchmarking Unified Multimodal Social Media Deepfake Detection
Tianxiao Li ⋅ Zhenglin Huang ⋅ Haiquan Wen ⋅ Yiwei He ⋅ Xinze Li ⋅ BINGYU ZHU ⋅ WUHUI DUAN ⋅ Congang CHEN ⋅ ZEYU FU ⋅ Yi Dong ⋅ Baoyuan Wu ⋅ Xiangtai Li ⋅ Guangliang Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 158
MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality
Kyungwon Kim ⋅ Dosik Hwang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 159
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
SiNan Du ⋅ JiaHao Guo ⋅ Bo Li ⋅ Shuhao Cui ⋅ Zhengzhuo Xu ⋅ Yifu Luo ⋅ Yongxian Wei ⋅ Kun Gai ⋅ Xinggang Wang ⋅ Kai Wu ⋅ Chun Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 160
MOS: Mitigating Optical-SAR Modality Gap for Cross-Modal Ship Re-Identification
Yujian Zhao ⋅ Hankun Liu ⋅ Guanglin Niu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 161
SeD-UD: An Influence-Driven and Hierarchically-Decoupled Information Bottleneck for Multimodal Intent Recognition
Qin Li ⋅ Wenbo Zhang ⋅ Limei Liu ⋅ Han Peng ⋅ Junfeng Yang ⋅ Guanying Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 162
MultiModalPFN: Extending Prior-Data Fitted Networks for Multimodal Tabular Learning
Wall Kim ⋅ Chaeyoung Song ⋅ Hanul Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 163
LacTokGen: Latent Consistency Tokenizer for 1024-pixel Image Generation by 256 Tokens
Qingsong Xie ⋅ Luyuan Zhang ⋅ Zhao Zhang ⋅ Siyuan Li ⋅ Zhe Huang ⋅ Zhenyu Yang ⋅ Haonan Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 164
FlowSteer: Guiding Few-Step Image Synthesis with Authentic Trajectories
Lei Ke ⋅ Hubery Yin ⋅ Gongye Liu ⋅ Zhengyao Lv ⋅ Jingcai Guo ⋅ Chen Li ⋅ Wenhan Luo ⋅ Yujiu Yang ⋅ Jing LYU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 165
Visual Autoregressive Modeling via Next Focus Prediction
Xiaofan Li ⋅ Chenming Wu ⋅ Yanpeng Sun ⋅ Jiaming Zhou ⋅ Delin Qu ⋅ Yansong Qu ⋅ Weihao Bo ⋅ Haibao Yu ⋅ Dingkang Liang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 166
Semantic Context Matters: Improving Conditioning for Autoregressive Models
Dongyang Jin ⋅ Ryan Xu ⋅ Jianhao Zeng ⋅ Rui Lan ⋅ Yancheng Bai ⋅ Lei Sun ⋅ Xiangxiang Chu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 167
TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction
Yukuo Ma ⋅ Cong Liu ⋅ Junke Wang ⋅ Junqi Liu ⋅ Haibin Huang ⋅ Zuxuan Wu ⋅ Chi Zhang ⋅ Xuelong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 168
FlashIn: Fast and Accurate Image Inversion for Real-time Image Editing
Guangzhi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 169
EasyV2V: A High-quality Instruction-based Video Editing Framework
Jinjie Mai ⋅ Chaoyang Wang ⋅ Gordon Guocheng Qian ⋅ Willi Menapace ⋅ Sergey Tulyakov ⋅ Bernard Ghanem ⋅ Peter Wonka ⋅ Ashkan Mirzaei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 170
One Algorithm to Align Them All
Boyi Pang ⋅ Savva Ignatyev ⋅ Vladimir Ippolitov ⋅ Ramil Khafizov ⋅ Yurii Melnik ⋅ Oleg Voynov ⋅ Maksim Nakhodnov ⋅ Aibek Alanov ⋅ Xiaopeng Fan ⋅ Peter Wonka ⋅ Evgeny Burnaev
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 171
VGA-Bench: A Unified Benchmark and Multi-Model Framework for Video Aesthetics and Generation Quality Evaluation
Longteng Jiang ⋅ DanDan Zheng ⋅ Qianqian Qiao ⋅ Heng Huang ⋅ Huaye Wang ⋅ Yihang Bo ⋅ Bao Peng ⋅ Jingdong Chen ⋅ JUN ZHOU ⋅ Xin Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 172
Improved Mean Flows: On the Challenges of Fastforward Generative Models
ZHENGYANG GENG ⋅ Yiyang Lu ⋅ Zongze Wu ⋅ Eli Shechtman ⋅ Zico Kolter ⋅ Kaiming He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 173
SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation
Shuai Tan ⋅ Biao Gong ⋅ Yujie Wei ⋅ Shiwei Zhang ⋅ Zhuoxin Liu ⋅ Ke Ma ⋅ Yan Wang ⋅ Kecheng Zheng ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Hengshuang Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 174
Match-and-Fuse: Consistent Generation from Unstructured Image Sets
Kate Feingold ⋅ Omri Kaduri ⋅ Tali Dekel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 175
Mixture of Style Experts for Diverse Image Stylization
Shihao Zhu ⋅ Ziheng Ouyang ⋅ Yijia Kang ⋅ Qilong Wang ⋅ Mi Zhou ⋅ Bo Li ⋅ Mingming Cheng ⋅ Qibin Hou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 176
Mirai: Autoregressive Visual Generation Needs Foresight
Yonghao Yu ⋅ Lang Huang ⋅ Zerun Wang ⋅ Runyi Li ⋅ Toshihiko Yamasaki
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 177
Align Images Before You Generate
Shihua Zhang ⋅ Qiuhong Shen ⋅ Xinchao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 178
Bridging the Perception Gap in Image Super-Resolution Evaluation
Shaolin Su ⋅ Josep M. ⋅ Danna Xue ⋅ David Serrano-Lozano ⋅ Lei Sun ⋅ Javier Vazquez-Corral
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 179
Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution
Tianyi Zhang ⋅ Zheng-Peng Duan ⋅ Chunle Guo ⋅ Peng-Tao Jiang ⋅ Bo Li ⋅ Mingming Cheng ⋅ Chongyi Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 180
Restore Text First, Enhance Image Later: Two-Stage Scene Text Image Super-Resolution with Glyph Structure Guidance
Minxing Luo ⋅ Linlong Fan ⋅ Qiushi Wang ⋅ Ge Wu ⋅ Yiyan Luo ⋅ Yuhang Yu ⋅ Jinwei Chen ⋅ Yaxing Wang ⋅ Qingnan Fan ⋅ Jian Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 181
IAFMNet: Information-Aware Feature Modulation for Efficient Super-Resolution
Junwei Xu ⋅ Mengzu Liu ⋅ Zhenyu Wang ⋅ Fangfang Wu ⋅ Sijia Wu ⋅ Tao Huang ⋅ Weisheng Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 182
Physics-Consistent Diffusion for Efficient Fluid Super-Resolution via Multiscale Residual Correction
Zhihao LI ⋅ Shengwei Dong ⋅ Chuang Yi ⋅ Junxuan Gao ⋅ Zhilu Lai ⋅ Zhiqiang Liu ⋅ Wei Wang ⋅ Guangtao Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 183
Bridging Fidelity-Reality with Controllable One-Step Diffusion for Image Super-Resolution
Hao Chen ⋅ Junyang Chen ⋅ Jinshan Pan ⋅ Jiangxin Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 184
Omni-Supervised Motion Editing: Balancing Change and Invariance through Positive-Negative Learning
Zhenwu Shi ⋅ Jingyu Gong ⋅ Peiwei Wang ⋅ Xingzan Wang ⋅ Tianwen Qian ⋅ Wenxi Li ⋅ Yuan Fang ⋅ Jiao Xie ⋅ Lizhuang Ma ⋅ Shaohui Lin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 185
FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning
Weijie Lyu ⋅ Ming-Hsuan Yang ⋅ ZHIXIN SHU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 186
Cross-Axis Feature Fusion with Joint-Wise Motion Difference Prediction for Text-Based 3D Human Motion Editing
Gyojin Han ⋅ Junmo Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 187
MotionMaster: Generalizable Text-Driven Motion Generation and Editing
Nan Jiang ⋅ yunhao li ⋅ Lexi Pang ⋅ Zimo He ⋅ Siyuan Huang ⋅ Yixin Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 188
OpenT2M: No-frill Motion Generation with Open-source, Large-scale, High-quality Data
Bin Cao ⋅ Sipeng Zheng ⋅ Hao Luo ⋅ Boyuan Li ⋅ Jing Liu ⋅ Zongqing Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 189
Towards Decompositional Human Motion Generation with Energy-Based Diffusion Models
Jianrong Zhang ⋅ Hehe Fan ⋅ Yi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 190
PAMotion: Physics-Aware Motion Generation for Full-Body Interaction with Multiple Objects
Yan Di ⋅ Yuheng Li ⋅ Yaoxing Wang ⋅ Mengge Liu ⋅ Shan Gao ⋅ Xiangyang Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 191
Sketch2Colab: Sketch-Conditioned Multi-Human Animation via Controllable Flow Distillation
Divyanshu Daiya ⋅ Aniket Bera
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 192
ViHOI: Human-Object Interaction Synthesis with Visual Priors
Songjin Cai ⋅ Linjie Zhong ⋅ Ling Guo ⋅ Changxing Ding
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 193
CLEP: Contrastive Language-Pose Pretraining
Sen Jia ⋅ Huayu Wang ⋅ Hsiang-Wei Huang ⋅ Zhaochong An ⋅ Jenq-Neng Hwang ⋅ Huaping Zhang ⋅ Lei Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 194
OpenFS: Multi-Hand-Capable Fingerspelling Recognition with Implicit Signing-Hand Detection and Frame-Wise Letter-Conditioned Synthesis
Junuk Cha ⋅ Jihyeon Kim ⋅ Han-Mu Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 195
ARMFlow: AutoRegressive MeanFlow for Online 3D Human Reaction Generation
Zichen Geng ⋅ Zeeshan Hayder ⋅ Wei Liu ⋅ Hesheng Wang ⋅ Ajmal Mian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 196
InterPhys: Physics-aware Human Motion Synthesis in a Dynamic Scene
Chaoyue Xing ⋅ Wei Mao ⋅ Miaomiao Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 197
Beyond Mimicry: Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations
Wei-Jin Huang ⋅ Yue-Yi Zhang ⋅ Yi-Lin Wei ⋅ Zhi-Wei Xia ⋅ Juantao Tan ⋅ Yuanming Li ⋅ Zhilin Zhao ⋅ Wei-Shi Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 198
PHAC: Promptable Human Amodal Completion
Seung Young ⋅ Ju Yong Chang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 199
CoordSpeaker: Exploiting Gesture Captioning for Coordinated Caption-Empowered Co-Speech Gesture Generation
Fengyi Fang ⋅ Sicheng Yang ⋅ Wenming Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 200
IntrinsicWeather: Controllable Weather Editing in Intrinsic Space
Yixin Zhu ⋅ Zuo-Liang Zhu ⋅ Jian Yang ⋅ Milos Hasan ⋅ Jin Xie ⋅ Beibei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 201
Outlier-Robust Diffusion Solvers for Inverse Problems
Yang Zheng ⋅ Jiahua Liu ⋅ Tongyao Pang ⋅ Wen Li ⋅ Zhaoqiang Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 202
Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models
Zhirong Shen ⋅ Rui Huang ⋅ Jiacheng Liu ⋅ Chang Zou ⋅ Peiliang Cai ⋅ Shikang Zheng ⋅ zhengyi shi ⋅ Liang Feng ⋅ Linfeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 203
ReasonX: MLLM-Guided Intrinsic Image Decomposition
Alara Dirik ⋅ Tuanfeng Wang ⋅ Duygu Ceylan ⋅ Stefanos Zafeiriou ⋅ Anna Frühstück
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 204
Diff-SemiER: Transparency-Aware Adaptive Fusion Diffusion Model with Generative Prior for Semi-Transparent Eyeglasses Removal
Jiahao Li ⋅ Shiqi Yin ⋅ Zhenxiang Lian ⋅ jingtao guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 205
KLIP: Localized Distribution Shift Detection via KL-Divergence with Diffusion Priors in Inverse Problems
Alireza Kheirandish ⋅ Jihoon Hong ⋅ Sara Fridovich-Keil
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 206
Elucidating the Design Space of Arbitrary-Noise-Based Diffusion Models
Xingyu Qiu ⋅ Mengying Yang ⋅ Xinghua Ma ⋅ Dong Liang ⋅ Fanding Li ⋅ Gongning Luo ⋅ wei wang ⋅ Kuanquan Wang ⋅ Shuo Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 207
Taming Generative Diffusion Model for Task-Oriented Infrared Imaging
Tengyu Ma ⋅ Zhilong Dai ⋅ Yubo Diao ⋅ Guanming An ⋅ Long Ma ⋅ Jinyuan Liu ⋅ Risheng Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 208
Attention, May I Have Your Decision? Localizing Generative Choices in Diffusion Models
Katarzyna Zaleska ⋅ Łukasz Popek ⋅ Monika Wysoczańska ⋅ Kamil Deja
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 209
RxnCaption: Reformulating Reaction Diagram Parsing as Visual Prompt Guided Captioning
Jiahe Song ⋅ Chuang Wang ⋅ Bowen Jiang ⋅ Yinfan Wang ⋅ Hao Zheng ⋅ Xingjian Wei ⋅ Chengjin Liu ⋅ Rui Nie ⋅ Junyuan Gao ⋅ Jiaxing Sun ⋅ Yubin Wang ⋅ Lijun Wu ⋅ Zhenhua Huang ⋅ Jiang Wu ⋅ Qian Yu ⋅ Conghui He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 210
More than the Sum: Panorama-Language Models for Adverse Omni-Scenes
Weijia Fan ⋅ Ruiping Liu ⋅ Jiale Wei ⋅ Yufan Chen ⋅ Junwei Zheng ⋅ Zichao Zeng ⋅ Jiaming Zhang ⋅ Qiufu Li ⋅ Linlin Shen ⋅ Rainer Stiefelhagen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 211
DiGraphHal-Bench: Evaluating Multimodal Large Language Models on Complex Directed Graphs
Yixin Fan ⋅ He Zhao ⋅ Yuxin Hou ⋅ Changhua Zhou ⋅ Zihao Liu ⋅ Peng Wang ⋅ Lu ChengLong ⋅ Xu Zhang ⋅ Wei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 212
SEA-Vision: A Multilingual Benchmark for Comprehensive Document and Scene Text Understanding in Southeast Asia
Pengfei Yue ⋅ Xingran Zhao ⋅ Juntao Chen ⋅ Peng Hou ⋅ Wang Longchao ⋅ Jianghang Lin ⋅ Shengchuan Zhang ⋅ Anxiang Zeng ⋅ Liujuan Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 213
Time Blindness: Why Video-Language Models Can’t See What Humans Can?
Ujjwal Upadhyay ⋅ Mukul Ranjan ⋅ Zhiqiang Shen ⋅ Mohamed Elhoseiny
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 214
Spot The Ball: A Benchmark for Visual Social Inference
Neha Balamurugan ⋅ Sarah Wu ⋅ Cristobal Eyzaguirre ⋅ Tobias Gerstenberg
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 215
MM-SeR: Multimodal Self-Refinement for Lightweight Image Captioning
Junha Song ⋅ Yongsik Jo ⋅ So Yeon Min ⋅ Quanting Xie ⋅ Taehwan Kim ⋅ Yonatan Bisk ⋅ Jaegul Choo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 216
E-comIQ-ZH: A Human-Aligned Dataset and Benchmark for Fine-Grained Evaluation of E-commerce Posters with Chain-of-Thought
Meiqi Sun ⋅ mingyu Li ⋅ Junxiong Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 217
GeoWorld: Geometric World Models
Zeyu Zhang ⋅ Danning Li ⋅ Ian Reid ⋅ Richard Hartley
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 218
ORD: Object-Relation Decoupling for Generalized 3D Visual Grounding
Ronggang Huang ⋅ FanSen Meng ⋅ Huaidong Zhang ⋅ Xuemiao Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 219
Benchmarking PhD-Level Coding in 3D Geometric Computer Vision
Wenyi Li ⋅ Renkai Luo ⋅ Yue Yu ⋅ Huan-ang Gao ⋅ Mingju Gao ⋅ Li Yuan ⋅ Chaoyou Fu ⋅ Hao Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 220
MonoVLM: Monocular 3D Visual Grounding with Vision Language Models
Huaizhi Qu ⋅ Hossein Nourkhiz Mahjoub ⋅ Vaishnav Tadiparthi ⋅ Kwonjoon Lee ⋅ Tianlong Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 221
Curvature-Aware Captioning: Leveraging Geodesic Attention for 3D Scene Understanding
Ziyao He ⋅ Yingjie Liu ⋅ Zhang Yangrui ⋅ Mingsong Chen ⋅ Xuan Tang ⋅ Xian Wei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 222
SPREAD: Spatial-Physical REasoning via geometry Aware Diffusion
Minzhang Li ⋅ Kuixiang Shao ⋅ xuebing li ⋅ Yuyang Jiao ⋅ Yinuo Bai ⋅ Hengan Zhou ⋅ Sixian Shen ⋅ Jiayuan Gu ⋅ Jingyi Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 223
ExtrinSplat: Decoupling Geometry and Semantics for Open-Vocabulary Understanding in 3D Gaussian Splatting
Jiayu Ding ⋅ Xinpeng Liu ⋅ Zhiyi Pan ⋅ Shiqiang Long ⋅ Ge Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 224
SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence
Haoning Wu ⋅ Xiao Huang ⋅ Yaohui Chen ⋅ Ya Zhang ⋅ Yanfeng Wang ⋅ Weidi Xie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 225
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation
Chiao-An Yang ⋅ Ryo Hachiuma ⋅ Sifei Liu ⋅ Subhashree Radhakrishnan ⋅ Raymond A. Yeh ⋅ Yu-Chiang Frank Wang ⋅ Min-Hung Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 226
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
Zhiwen Fan ⋅ Jian Zhang ⋅ Renjie Li ⋅ Junge Zhang ⋅ Runjin Chen ⋅ Hezhen Hu ⋅ Kevin Wang ⋅ Peihao Wang ⋅ Huaizhi Qu ⋅ Shijie Zhou ⋅ Dilin Wang ⋅ Zhicheng Yan ⋅ Hongyu Xu ⋅ Justin Theiss ⋅ Tianlong Chen ⋅ Jiachen Li ⋅ Zhengzhong Tu ⋅ Zhangyang Wang ⋅ Rakesh Ranjan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 227
Merge3D: Efficient 3D Multimodal LLMs via Joint 2D-3D Token Merging
Tianbo Pan ⋅ Xingyi Yang ⋅ Xinchao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 228
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Runsen Xu ⋅ Weiyao Wang ⋅ Hao Tang ⋅ Xingyu Chen ⋅ Xiaodong Wang ⋅ Fu-Jen Chu ⋅ Matt Feiszli ⋅ Kevin J Liang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 229
LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight
Yunze Man ⋅ Shihao Wang ⋅ Guowen Zhang ⋅ Johan Bjorck ⋅ Liang-Yan Gui ⋅ Jim Fan ⋅ Jan Kautz ⋅ Yu-Xiong Wang ⋅ Zhiding Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 230
Quota-Calibrated Fine-Grained Alignment with Context-Aware Marginals for Text-based Person Retrieval
Dongsheng Li ⋅ Xinyuan Guo ⋅ Huijie Zhang ⋅ Pingting Hao ⋅ Qiushi Xia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 231
Evo-Retriever: LLM-Guided Curriculum Evolution with Viewpoint-Pathway Collaboration for Multimodal Document Retrieval
Li Weiqing ⋅ Jinyue Guo ⋅ Yaqi Wang ⋅ HAIYANG XIAO ⋅ Yuewei Zhang ⋅ Guohua Liu ⋅ Hao Henry Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 232
Taxonomy-Aware Representation Alignment for Hierarchical Visual Recognition with Large Multimodal Models
Hulingxiao He ⋅ Zhi Tan ⋅ Yuxin Peng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 233
FAAR: Efficient Frequency-Aware Multi-Task Fine-Tuning via Automatic Rank Selection
Maxime Fontana ⋅ Michael Spratling ⋅ Miaojing Shi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 234
Model Merging in the Essential Subspace
Longhua Li ⋅ Lei Qi ⋅ Qi Tian ⋅ Xin Geng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 235
Beyond Semantic Search: Towards Referential Anchoring in Composed Image Retrieval
Yuxin Yang ⋅ Yinan Zhou ⋅ Yuxin Chen ⋅ Ziqi Zhang ⋅ Zongyang Ma ⋅ Chunfeng Yuan ⋅ Bing Li ⋅ Jun Gao ⋅ Weiming Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 236
SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval
Ruixiang Zhao ⋅ Zhihao Xu ⋅ Bangxiang Lan ⋅ Zijie Xin ⋅ Jingyu Liu ⋅ Xirong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 237
MarkushGrapher-2: End-to-end Multimodal Recognition of Chemical Structures
Tim Strohmeyer ⋅ Lucas Morin ⋅ Gerhard Ingmar Meijer ⋅ Valery Weber ⋅ Ahmed Nassar ⋅ Peter Staar
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 238
Progressive Cross-Modal Causal Intervention for Long-Term Action Recognition
Shaowu Xu ⋅ Xibin Jia ⋅ Chao Fan ⋅ Junyu Gao ⋅ Jing Chang ⋅ Qianmei Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 239
EthoCLIP: Ontology-Enhanced Video-Language Pretraining for Animal Behavior Understanding
Yinuo Jing ⋅ Jinyan Wu ⋅ Zixi Yang ⋅ Kongming Liang ⋅ Xiatian Zhu ⋅ Zhanyu Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 240
TrajTok: Learning Trajectory Tokens Enhances Video Understanding
Chenhao Zheng ⋅ Jieyu Zhang ⋅ Jianing Zhang ⋅ Weikai Huang ⋅ Ashutosh Kumar ⋅ Quan Kong ⋅ Oncel Tuzel ⋅ Chun-Liang Li ⋅ Ranjay Krishna
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 241
Streaming Video Instruction Tuning
Jiaer Xia ⋅ Peixian Chen ⋅ Mengdan Zhang ⋅ Xing Sun ⋅ Kaiyang Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 242
VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer
Rui Lin ⋅ Chuanming Wang ⋅ Huadong Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 243
ViterbiPlanNet: Injecting Procedural Knowledge via Differentiable Viterbi for Planning in Instructional Videos
Luigi Seminara ⋅ Davide Moltisanti ⋅ Antonino Furnari
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 244
From Static to Dynamic: Exploring Self-supervised Image-to-Video Representation Transfer Learning
Yang Liu ⋅ Qianqian Xu ⋅ Peisong Wen ⋅ Siran Dai ⋅ Xilin Zhao ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 245
Learnable Motion-Focused Tokenization for Effective and Efficient Video Unsupervised Domain Adaptation
Tzu Ling Liu ⋅ Ian Stavness ⋅ Mrigank Rochan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 246
FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding
Yiweng Xie ⋅ Bo He ⋅ Junke Wang ⋅ Xiangyu Zheng ⋅ Ziyi Ye ⋅ Zuxuan Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 247
Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos
Sontao Jiang ⋅ Sibo Song ⋅ Chenyi Zhou ⋅ Yuan Wang ⋅ Ruizhe Chen ⋅ Tongkun Guan ⋅ Ruilin Luo ⋅ Yan Zhang ⋅ Zhihang Tang ⋅ Yuchong Sun ⋅ Hang Zhang ⋅ Zhibo Yang ⋅ Shuai Bai ⋅ Junyang Lin ⋅ Zuozhu Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 248
Video Panels for Long Video Understanding
Lars Doorenbos ⋅ Federico Spurio ⋅ Jürgen Gall
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 249
Gaze Target Estimation Anywhere with Concepts
Xu Cao ⋅ Houze Yang ⋅ Vipin Gunda ⋅ Zhongyi Zhou ⋅ Tianyu Xu ⋅ Adarsh Kowdle ⋅ Inki Kim ⋅ James M.
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 250
Select, Hypothesize and Verify: Towards Verified Neuron Concept Interpretation
ZeBin Ji ⋅ Yang Hu ⋅ Xiuli Bi ⋅ Bo Liu ⋅ Bin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 251
Finding Distributed Object-Centric Properties in Self-Supervised Transformers
Samyak Rawlekar ⋅ Amitabh Swain ⋅ Yujun Cai ⋅ Yiwei Wang ⋅ Ming-Hsuan Yang ⋅ Narendra Ahuja
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 252
Explaining CLIP Zero-shot Predictions Through Concepts
Onat Ozdemir ⋅ Anders Christensen ⋅ Stephan Alaniz ⋅ Zeynep Akata ⋅ Emre Akbas
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 253
See Through the Noise: Improving Domain Generalization in Gaze Estimation
Yanming Peng ⋅ Shijing Wang ⋅ Yaping Huang ⋅ Yi Tian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 254
Mechanisms of Object Localization in Vision–Language Models
Timothy Schaumlöffel ⋅ Martina G. Vilas ⋅ Gemma Roig
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 255
mmWaveFlow: Unified Enhancement and Generation of mmWave Human Point Clouds
Chang Su ⋅ Beihong Jin ⋅ Qiwen Shi ⋅ Zhi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 256
From Feature Learning to Spectral Basis Learning: A Unifying and Flexible Framework for Efficient and Robust Shape Matching
Feifan Luo ⋅ Hongyang Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 257
Topology-aware Feature Propagation for Unsupervised Non-rigid Point Cloud Correspondence
Haozhe Chen ⋅ Rui Li ⋅ 正宝 王 ⋅ Xinhao Zhu ⋅ Linjie Li ⋅ Tianyu Xiong ⋅ Xuan Ouyang ⋅ Jiaqi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 258
BEV-SLD: Self-Supervised Scene Landmark Detection for Global Localization with LiDAR Bird’s-Eye View Images
David Skuddis ⋅ Vincent Ress ⋅ Wei Zhang ⋅ Vincent Ofosu Nyako ⋅ Norbert Haala
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 259
SAG-GNN: Semantic-Aware Guided GNN for Descriptor-Free 2D-3D Matching
Shihua Zhang ⋅ Tianhao Xu ⋅ Zizhuo Li ⋅ Qing Ma ⋅ Jiayi Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 260
LiREC-Net: A Target-Free and Learning-Based Network for LiDAR, RGB, and Event Calibration
Aditya Ranjan Dash ⋅ Ramy Battrawy ⋅ René Schuster ⋅ Didier Stricker
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 261
GM-R^2: Generative Matching Learning for Unsupervised Geometric Representation and Registration
Haobo Jiang ⋅ Liang Yu ⋅ Jianmin Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 262
4D Local Modeling Toward Dynamic Global Perception for Ambiguity-free Rotation-Invariant Point Cloud Analysis
JIAXUN GUO ⋅ Wentao Fan ⋅ Manar Amayri ⋅ Nizar Bouguila
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 263
PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction
Ziqiao Meng ⋅ Qichao Wang ⋅ Zhiyang Dou ⋅ Zixing Song ⋅ Zhipeng Zhou ⋅ Irwin King ⋅ Peilin Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 264
MORE-STEM: Long-Short MemOry REcall and Spatio-TEmporal Consistency Model for Query-Driven 3D/4D Point Cloud Segmentation
Chade Li ⋅ Haida Feng ⋅ Pengju Zhang ⋅ Yihong Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 265
Low-Rank Test-Time Training for Pre-Trained Point Cloud Models
Ouyangzi Ye ⋅ Feifei Shao ⋅ Kexin Li ⋅ Yawei Luo ⋅ Zikai Song ⋅ Ping Liu ⋅ Fengda Zhang ⋅ Hongwei Wang ⋅ Jun Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 266
STAR: Test-Time Adaptation Can Enhance Universal Prompt Learning for Vision-Language Models
Yiwei Fu ⋅ Hui Wan ⋅ Xiao Luo ⋅ Minghua Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 267
Exploring Visual Pretraining for Learning Language Intelligence
Zhonghan Zhao ⋅ Yiming Zhang ⋅ Wenwei Zhang ⋅ Haiteng Zhao ⋅ Xingguang Wei ⋅ Zhangwei Gao ⋅ Kuikun Liu ⋅ Yuzhe Gu ⋅ Size Wu ⋅ Haian Huang ⋅ Jianfei Gao ⋅ haijun Lv ⋅ Demin Song ⋅ Yunhua Zhou ⋅ Qipeng Guo ⋅ Gaoang Wang ⋅ Kai Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 268
VL-Eraser: Vacuum Distillation for Machine Unlearning in Vision-Language Models
Yili Wang ⋅ Lu Dai ⋅ Tairan Huang ⋅ Yijie Xu ⋅ Hui Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 269
DeAR: Fine-Grained VLM Adaptation by Decomposing Attention Head Roles
Yiming Ma ⋅ Hongkun Yang ⋅ Lionel Z. Wang ⋅ BIN CHEN ⋅ Weizhi Xian ⋅ Jianzhi Teng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 270
SynCLIP: Synonym-Coherent Language-Image Pretraining for Robust Open-Vocabulary Dense Perception
Mingjie Xie ⋅ Guangjun He ⋅ Dongli Xu ⋅ Youtian Lin ⋅ Hongjue Li ⋅ Pengming Feng ⋅ Jian Guan ⋅ Yue Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 271
MODIX: A Training-Free Multimodal Information-Driven Positional Index Scaling for Vision-Language Models
Ruoxiang Huang ⋅ Zhen Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 272
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Xinlei Yu ⋅ Chengming Xu ⋅ Guibin Zhang ⋅ Zhangquan Chen ⋅ Yudong Zhang ⋅ Yongbo He ⋅ Peng-Tao Jiang ⋅ Jiangning Zhang ⋅ Xiaobin Hu ⋅ Shuicheng Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 273
ORION: ORthonormal Text Encoding for Universal VLM AdaptatION
Omprakash Chakraborty ⋅ Jose Dolz ⋅ Ismail Ben Ayed
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 274
CASPA: Graph-Structured Concept Anchors for Modality-Agnostic Adaptation in Vision–Language Models
Abhiroop Chatterjee ⋅ Susmita Ghosh ⋅ Ashish Ghosh ⋅ Emmett Ientilucci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 275
Mirror Illusion Art
Xiaopei Zhu ⋅ Zeyuan Li ⋅ Jun Zhu ⋅ Xiaolin Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 276
HOG-Layout: Hierarchical 3D Scene Generation, Optimization and Editing via Vision-Language Models
Haiyan Jiang ⋅ Deyu Zhang ⋅ dongdong weng ⋅ Weitao Song ⋅ Henry Been-Lirn Duh
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 277
Towards Human-Like Robot Handwriting via Contour-Aware Generation
Yutao Qin ⋅ Gang Dai ⋅ Yifan Zhang ⋅ Youwei Han ⋅ Qisheng He ⋅ Shuangping Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 278
MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts
Zilong Huang ⋅ Jun He ⋅ Xiaobin Huang ⋅ Ziyi Xiong ⋅ Yang Luo ⋅ Junyan Ye ⋅ Weijia Li ⋅ Yiping Chen ⋅ Ting Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 279
VectorArk: Learning Practical Image Vectorization with Rounded Polygon Representation
Tarun Gehlaut ⋅ Difan Liu ⋅ Charu Bansal ⋅ Krutik Malani ⋅ Souymodip Chakraborty ⋅ Ankit Phogat ⋅ Matthew Fisher ⋅ Vineet Batra
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 280
OctoT2I: A Self-Evolving Agentic Text-to-Image Router
Jiang Xu ⋅ Bin Chen ⋅ Gehui Li ⋅ Yule Duan ⋅ Ronggang Wang ⋅ Jian Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 281
LottieGPT: Tokenizing Vector Animation for Autoregressive Generation
Junhao Chen ⋅ Gao Kejun ⋅ Yuehan Cui ⋅ Mingze Sun ⋅ Mingjin Chen ⋅ Shaohui Wang ⋅ Xiaoxiao Long ⋅ Fei Ma ⋅ Qi Tian ⋅ Hao Zhao ⋅ Ruqi Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 282
SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering
Jiho Park ⋅ Sieun Choi ⋅ Jaeyoon Seo ⋅ Minho Sohn ⋅ Yeana Kim ⋅ Jihie Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 283
Selective Amnesia using Contrastive Subnet Erasure for Class Level Unlearning in Vision Models
Vishal Pramanik ⋅ Maisha Maliha ⋅ Susmit Jha ⋅ Alvaro Velasquez ⋅ Olivera Kotevska ⋅ Sumit Jha
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 284
A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks
Tangzheng Lian ⋅ Guanyu Hu ⋅ Yijing Ren ⋅ Dimitrios Kollias ⋅ Oya Celiktutan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 285
Rank-Guided Pseudo-Bias Learning for Robust Black-Box Adaptation
Rajeev Ranjan Dwivedi ⋅ Anshuman Dangwal ⋅ Vinod Kurmi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 286
Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection
Jinhu Fu ⋅ Yihang Lou ⋅ Qingyi Si ⋅ Shudong Zhang ⋅ Sen Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 287
WaTeRFlow: Watermark Temporal Robustness via Flow Consistency
Utae Jeong ⋅ Sumin In ⋅ Hyunju Ryu ⋅ Jaewan Choi ⋅ Feng Yang ⋅ Jongheon Jeong ⋅ Seungryong Kim ⋅ Sangpil Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 288
DSO: Direct Steering Optimization for Bias Mitigation
Lucas Monteiro Paes ⋅ Nivedha Sivakumar ⋅ Yinong Oliver Wang ⋅ Masha Fedzechkina ⋅ Barry-John Theobald ⋅ Luca Zappella ⋅ Nicholas Apostoloff
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 289
SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video Attribution
Chao Wang ⋅ Zijin Yang ⋅ Yaofei Wang ⋅ Yuang Qi ⋅ Weiming Zhang ⋅ Nenghai Yu ⋅ Kejiang Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 290
SineProject: Machine Unlearning for Stable Vision-Language Alignment
Arpit Garg ⋅ Hemanth Saratchandran ⋅ Simon Lucey
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 291
HiLoRA: Hierarchical Low-Rank Adaptation for Personalized Federated Learning
Zihao Peng ⋅ Nan Zou ⋅ Jiandian Zeng ⋅ Guo Li ⋅ Ke Chen ⋅ Boyuan Li ⋅ Tian Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 292
OS-Fed: One Snapshot Is All You Need
Xuwei Qian ⋅ Jinghui Zhang ⋅ Yuchuan Tan ⋅ Wenbo Huang ⋅ Zhen Wu ⋅ Shen Zhou ⋅ LiSha Gao ⋅ Ding Ding ⋅ Fang Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 293
FedAlign: Differentially Private Distribution Alignment for Non-IID Federated Learning
Peng Wu ⋅ Jiapeng Zhang ⋅ Yingjie Song ⋅ Xiong Xiao ⋅ Zhuo Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 294
Guiding Diffusion Models with Fine-Grained Conditions and Semantics-Preserving Sampling for One-Shot Federated Learning
Xiaojun Deng ⋅ Tianchi Liao ⋅ Zhiyuan Liu ⋅ Chuan Chen ⋅ Zibin Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 295
Personalized Federated Training of Diffusion Models with Privacy Guarantees
Kumar Kshitij Patel ⋅ Bingqing Jiang ⋅ A F M Mahfuzul Kabir ⋅ Weitong Zhang ⋅ Difan Zou ⋅ Lingxiao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 296
FedRAC: Rolling Submodel Allocation for Collaborative Fairness in Federated Learning
Zihui Wang ⋅ Yuhang Fu ⋅ Mengmeng Du ⋅ Zhimin Yuan ⋅ Yachen Liu ⋅ Weisheng Liao ⋅ Kaiyu Wang ⋅ Zheng Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 297
Understanding Temporal Logic Consistency in Video-Language Models through Cross-Modal Attention Discriminability
Chengzhi Li ⋅ Heyan Huang ⋅ Ping Jian ⋅ Zhen Yang ⋅ Yaning Tian ⋅ Zhongbin Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 298
Small Object, Great Challenge: A Benchmark for Small Object Visual Grounding
Wenqi Jia ⋅ Ruifan Li ⋅ Pengyue Lin ⋅ Fangxiang Feng ⋅ Zhanyu Ma ⋅ Xiaojie Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 299
UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
Hewen Pan ⋅ Cong Wei ⋅ Dashuang Liang ⋅ Zepeng Huang ⋅ Pengfei Gao ⋅ Ziqi Zhou ⋅ Lulu Xue ⋅ Pengfei Yan ⋅ Xiaoming Wei ⋅ Minghui Li ⋅ Shengshan Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 300
ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding
Daichi Yashima ⋅ Shuhei Kurita ⋅ Yusuke Oda ⋅ Komei Sugiura
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 301
CaST-Bench: Benchmarking Causal Chain-Grounded Spatio-Temporal Reasoning for Video Question Answering
Mingfang Zhang ⋅ Jingjing Pan ⋅ Ashutosh Kumar ⋅ Rajat Saini ⋅ Mustafa Erdogan ⋅ Hsuan-Kung Yang ⋅ Caixin Kang ⋅ Yifei Huang ⋅ Yoichi Sato ⋅ Quan Kong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 302
HERO: Hierarchical Embedding-Refinement for Open-Vocabulary Temporal Sentence Grounding in Videos
Tingting Han ⋅ Xinsong Tao ⋅ Yufei Yin ⋅ Min Tan ⋅ Sicheng Zhao ⋅ Zhou Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 303
Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism
Tao Chen ⋅ Kun Zhang ⋅ Qiong Wu ⋅ Xiao Chen ⋅ Chao Chang ⋅ Xiaoshuai Sun ⋅ Yiyi Zhou ⋅ Rongrong Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 304
Hybrid Token Compression for Vision-Language Models
jusheng zhang ⋅ Xiaoyang Guo ⋅ Kaitong Cai ⋅ Qinhan Lv ⋅ Yijia Fan ⋅ Wenhao Chai ⋅ Jian Wang ⋅ Keze Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 305
Focus, Don’t Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding
Mincheol Kwon ⋅ MINSEUNG LEE ⋅ Seonga Choi ⋅ Miso Choi ⋅ Kyeongjin Oh ⋅ Hyunyoung Lee ⋅ Cheonyoung Park ⋅ Yongho Song ⋅ Seunghyun Park ⋅ Jinkyu Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 306
When Token Pruning is Worse than Random: Understanding Visual Token Information in VLLMs
Yahong Wang ⋅ Juncheng Wu ⋅ Zhangkai Ni ⋅ Longzhen Yang ⋅ Yihang Liu ⋅ Chengmei Yang ⋅ Ying Wen ⋅ Lianghua He ⋅ Xianfeng Tang ⋅ Hui Liu ⋅ Yuyin Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 307
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions
Adrian Bulat ⋅ Alberto Baldrati ⋅ Ioannis Maniadis Metaxas ⋅ Yassine Ouali ⋅ Georgios Tzimiropoulos
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 308
BiGain: Unified Token Compression for Joint Generation and Classification
Jiacheng Liu ⋅ Shengkun Tang ⋅ Jiacheng Cui ⋅ Dongkuan Xu ⋅ Zhiqiang Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 309
Hi-Lo Prune: Look at What You'll Lose before Pruning with Hierarchical Token Selection
Zixun Sun ⋅ Yubo Dong ⋅ Hehe Fan ⋅ Yi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 310
VLM-Pruner: Buffering for Spatial Sparsity in an Efficient VLM Centrifugal Token Pruning Paradigm
Zhenkai Wu ⋅ Xiaowen Ma ⋅ ZHENLIANG NI ⋅ Dengming Zhang ⋅ Han Shu ⋅ Xin Jiang ⋅ Xinghao Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 311
Bridge: Basis-Driven Causal Inference Marries VFMs for Domain Generalization
Mingbo Hong ⋅ Feng Liu ⋅ Caroline Gevaert ⋅ George Vosselman ⋅ Hao Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 312
In Pursuit of Pixel Supervision for Visual Pre-training
Lihe Yang ⋅ Shang-Wen Li ⋅ Yang Li ⋅ Xinjie Lei ⋅ Dong Wang ⋅ Abdelrahman Mohamed ⋅ Saining Xie ⋅ Hengshuang Zhao ⋅ Kaiming He ⋅ Hu Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 313
GaussianMatch: Semi-Supervised Regression with Pseudo-Label Filtering via Multi-View Gaussian Consistency
Yin Wang ⋅ Hao Lu ⋅ Zixuan Wang ⋅ Zhen Qin ⋅ Li Kuang ⋅ Mengchu Zhou ⋅ Shuiguang Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 314
TAR: Token-Aware Refinement for Fine-grained Generalized Category Discovery
XingYu Yang ⋅ Yu Zhang ⋅ Siya Mi ⋅ Xiu-Shen Wei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 315
Semantic Noise Reduction via Teacher-Guided Dual-Path Audio-Visual Representation Learning
Linge Wang ⋅ Yingying Chen ⋅ Bingke Zhu ⋅ Lu Zhou ⋅ Jinqiao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 316
The Universal Normal Embedding
Chen Tasker ⋅ Roy Betser ⋅ Eyal Gofer ⋅ Meir Yossef Levi ⋅ Guy Gilboa
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 317
Bypassing the Transport Plan: Dynamic Reweighting for Out-of-Distribution Detection with Optimal Transport
Yang Xiao ⋅ Weiming Liu ⋅ Jun Dan ⋅ Tengyue Xu ⋅ Fan Wang ⋅ Hua Yu ⋅ Junhao Dong ⋅ Jiao Liu ⋅ Shunjie Dong ⋅ Lianyong Qi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 318
Cross-domain Dual-stream Feature Disentanglement for Brain Disorder Prediction with Sparsely Labeled PET
Huabin Wang ⋅ Xinyu Chen ⋅ Yuan Zhou ⋅ Fei Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 319
Debiased Sample Selection for Learning with Noisy Labels
Weiran Pan ⋅ Wei Wei ⋅ Wenfeng xie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 320
Driving on Registers
Ellington Kirby ⋅ Alexandre Boulch ⋅ Yihong Xu ⋅ Yuan Yin ⋅ Gilles Puy ⋅ Éloi Zablocki ⋅ Andrei Bursuc ⋅ Spyros Gidaris ⋅ Renaud Marlet ⋅ Florent Bartoccioni ⋅ Anh Quan Cao ⋅ Nermin Samet ⋅ Vu ⋅ Matthieu Cord
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 321
Open-Ended Instruction Realization with LLM-Enabled Multi-Planner Scheduling in Autonomous Vehicles
Jiawei Liu ⋅ Xun Gong ⋅ Fen Fang ⋅ Muli Yang ⋅ Bohao Qu ⋅ Yunfeng hu ⋅ Hong Chen ⋅ Xulei Yang ⋅ Qing Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 322
EE-RL: Vision Language Guided Reinforcement Learning with Explorer and Expert model for End-to-End Autonomous Driving
Xiaolong Li ⋅ Lan Yang ⋅ Ruyang Li ⋅ Shan Fang ⋅ Yang Liu ⋅ Xiangmo Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 323
Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving
Jiahao Wang ⋅ Bo Sun ⋅ Yijing Bai ⋅ Vincent Casser ⋅ Songyou Peng ⋅ Zehao Zhu ⋅ Meng-Li Shih ⋅ Xander Masotto ⋅ Shih-Yang Su ⋅ Kanaad Parvate ⋅ Tiancheng Ge ⋅ Linn Bieske ⋅ Dragomir Anguelov ⋅ Mingxing Tan ⋅ Chiyu “Max” Jiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 324
SHARP: Short-Window Streaming for Accurate and Robust Prediction in Motion Forecasting
Alexander Prutsch ⋅ Christian Fruhwirth-Reisinger ⋅ David Schinagl ⋅ Horst Possegger
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 325
DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving
Enhui Ma ⋅ Jiahuan Zhang ⋅ Guantian Zheng ⋅ Tao Tang ⋅ Shengbo Eben Li ⋅ Yuhang Lu ⋅ xia zhou ⋅ Xueyang Zhang ⋅ Yifei Zhan ⋅ Kun Zhan ⋅ Zhihui Hao ⋅ XianPeng Lang ⋅ Kaicheng Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 326
CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention
Jiacheng Tang ⋅ Zhiyuan Zhou ⋅ Zhuolin He ⋅ Jia Zhang ⋅ Kai Zhang ⋅ Jian Pu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 327
Reliable Policy Transfer for Safety-Aware End-to-End Driving with Deep Reinforcement Learning
Uddin Md. Borhan ⋅ Arif Raza ⋅ Zhiliang Lin ⋅ Lu Wang ⋅ Jianqiang Li ⋅ Jie Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 328
Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos
Matthew Strong ⋅ Wei-Jer Chang ⋅ Quentin HERAU ⋅ Jiezhi Yang ⋅ Yihan Hu ⋅ Chensheng Peng ⋅ Wei Zhan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 329
WhisperNet: A Scalable Solution for Bandwidth-Efficient Collaboration
Gong Chen ⋅ Chaokun Zhang ⋅ Xinyan Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 330
Efficient Equivariant Transformer for Self-Driving Agent Modeling
Scott Xu ⋅ Dian Chen ⋅ Kelvin Wong ⋅ Chris Zhang ⋅ Kion Fallah ⋅ Raquel Urtasun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 331
Generalizable Co-Salient Object Detection via Mixed Content-Style Modulation
Guanting Guo ⋅ Shenglong Hu ⋅ Kaihua Zhang ⋅ Guangcan Liu ⋅ Min Xia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 332
Saliency-Driven Token Merging for Vision Transformers
Weiying Xie ⋅ Xiaoyu Chen ⋅ Xin Zhang ⋅ Chenhe Hao ⋅ Jitao Ma ⋅ Yunsong Li ⋅ Leyuan Fang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 333
RISE: Single Static Radar-based Indoor Scene Understanding
Kaichen Zhou ⋅ Laura Dodds ⋅ Sayed Saad Afzal ⋅ Fadel Adib
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 334
Mixture-of-Experts based Feature Decoupling for Open Vocabulary Scene Graph Generation
Yiming Li ⋅ Sisi You ⋅ Bing-Kun Bao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 335
TF-SSD: A Strong Pipeline via Synergic Mask Filter for Training-free Co-salient Object Detection
Zhijin He ⋅ Shuo Jin ⋅ Siyue Yu ⋅ Shuwei Wu ⋅ Bingfeng Zhang ⋅ Li Yu ⋅ Jimin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 336
Denoise and Align: Towards Source-Free UDA for Robust Panoramic Semantic Segmentation
Yaowen Chang ⋅ Zhen Cao ⋅ Xu Zheng ⋅ Xiaoxin Mi ⋅ Zhen Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 337
SPOT: Spatiotemporal Prompt Optimization for Motion-Stabilized MLLM-Guided Video Segmentation
Jiayi Fan ⋅ Zheyun Qin ⋅ Xiaoming Xi ⋅ Xiushan Nie ⋅ Yilong Yin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 338
Changes in Real Time: Online Scene Change Detection with Multi-View Fusion
Chamuditha Jayanga Galappaththige ⋅ Jason Lai ⋅ Lloyd Windrim ⋅ Donald Dansereau ⋅ Niko Suenderhauf ⋅ Dimity Miller
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 339
Subspace Alignment for CLIP-based Continual Learning via Canonical Correlation Analysis
Huan Zhang ⋅ Shuyu Dong ⋅ Yujin Zheng ⋅ Dingwen Wang ⋅ Shenghua Fan ⋅ Fan Lyu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 340
DGS: Dual Gradient and Semantic-Shift Guided Low-Rank Adaptation for Class Incremental Learning
KAI LI ⋅ Jiafeng Li ⋅ Lianghua He ⋅ Ying Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 341
Dynamic Magic: Unleashing Restricted Knowledge for Lifelong Person Re-Identification
Jinjia Peng ⋅ Jican Tan ⋅ Jiazuo Yu ⋅ Zeze Tao ⋅ Huibing Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 342
Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models
Hyundong Jin ⋅ Dongyoon Han ⋅ Eunwoo Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 343
Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning
Jinge Ma ⋅ Fengqing Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 344
Forging a Dynamic Memory: Retrieval-Guided Continual Learning for Generalist Medical Foundation Models
Zizhi Chen ⋅ Yizhen Gao ⋅ Minghao Han ⋅ Yizhou Liu ⋅ Zhaoyu Chen ⋅ Dingkang Yang ⋅ Lihua Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 345
Dance Across Shifts: Forward-Facilitation Continual Test-Time Adaptation through Dynamic Style Bridging
Zhilin Zhu ⋅ Yabin Wang ⋅ Zhiheng Ma ⋅ Yaguang Song ⋅ Yaowei Wang ⋅ Xiaopeng Hong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 346
Few-Shot Hybrid Incremental Learning: Continually Learning under Data Scarcity and Task Uncertainty
Yan Li ⋅ Yuzhu Shi ⋅ Kan Zhou ⋅ Shu Zhang ⋅ Diqi He ⋅ Dingwen Zhang ⋅ Junwei Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 347
High-Fidelity Mobile Avatars with Pruned Local Blendshapes
Youyi Zhan ⋅ He Wang ⋅ Tianjia Shao ⋅ Kun Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 348
PhysSkin: Real-Time and Generalizable Physics-Based Animation via Self-Supervised Neural Skinning
Yuanhang Lei ⋅ Tao Cheng ⋅ Xingxuan Li ⋅ Boming Zhao ⋅ Siyuan Huang ⋅ Ruizhen Hu ⋅ Peter Yichen Chen ⋅ Hujun Bao ⋅ Zhaopeng Cui
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 349
Bridging Privacy and Provenance: Traceable Virtual Identity Generation
Xianhan Zeng ⋅ Xiaoxiao Hu ⋅ Sheng Li ⋅ Zhenxing Qian ⋅ Xinpeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 350
PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment
Chaonan Ji ⋅ Jinwei Qi ⋅ Sheng Xu ⋅ Peng Zhang ⋅ Bang Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 351
Dynamic Label Noise Suppression with Optimal Teacher Pool for Facial Expression Recognition
Yuzhuang Yang ⋅ Xiaolin Tian ⋅ Qigong Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 352
MimicTalker: A Multimodal Interactive and Memory-Enhanced Framework for Real-Time Dyadic 3D Head Generation
Yinuo Wang ⋅ Yanbo Fan ⋅ Xuan Wang ⋅ Boyao Zhou ⋅ Yu Guo ⋅ Yujun Shen ⋅ Fei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 353
DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation
zihao xin ⋅ Wentong Li ⋅ Yixuan Jiang ⋅ Bin Wang ⋅ Runmin Cong ⋅ Jie Qin ⋅ Shengjun Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 354
HybridDriveVLA: Vision-Language-Action Model with Visual CoT reasoning and ToT Evaluation for Autonomous Driving
Yipene Cedric Francois Bassole ⋅ Sungwoo Kim ⋅ Jiwoo Jung ⋅ Yunsick Sung
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 355
NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
Fei Liu ⋅ Shichao Xie ⋅ Minghua Luo ⋅ Zedong Chu ⋅ Junjun Hu ⋅ Xiaolong Wu ⋅ Mu Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 356
LookasideVLN: Direction-Aware Aerial Vision-and-Language Navigation
Yuwei Ning ⋅ Ganlong Zhao ⋅ Yipeng Qin ⋅ Si Liu ⋅ Yang Liu ⋅ Liang Lin ⋅ Guanbin Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 357
MAPS: Preserving Vision-Language Representations via Module-Wise Proximity Scheduling for Better Vision-Language-Action Generalization
Chengyue Huang ⋅ Mellon M. Zhang ⋅ Robert Azarcon ⋅ Glen Chou ⋅ Zsolt Kira
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 358
D3D-VLP: Dynamic 3D Vision-Language-Planning Model for Embodied Grounding and Navigation
Zihan Wang ⋅ Seungjun Lee ⋅ Guangzhao Dai ⋅ Gim Hee Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 359
FreeForm: Reduced-Order Deformable Simulation from Particle-Based Skinning Eigenmodes
Donglai Xiang ⋅ Vismay Modi ⋅ Rishit Dagli ⋅ Ty Trusty ⋅ Gilles Daviet ⋅ Anka Chen ⋅ Nicholas Sharp ⋅ David I. W. Levin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 360
GeoDiff4D: Geometry-Aware Diffusion for 4D Head Avatar Reconstruction
Chao Xu ⋅ Xiaochen Zhao ⋅ xiang deng ⋅ Jingxiang Sun ⋅ Donglin Di ⋅ Zhuo Su ⋅ Yebin Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 361
4DEquine: Disentangling Motion and Appearance for 4D Equine Reconstruction from Monocular Video
Jin Lyu ⋅ Liang An ⋅ Pujin Cheng ⋅ Yebin Liu ⋅ Xiaoying Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 362
PhysHO: Physics-Based Dynamic 3D Gaussian Human and Object from Monocular Video
Suyi Jiang ⋅ Gim Hee Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 363
ProgressiveAvatars: Progressive Animatable 3D Gaussian Avatars
Kaiwen Song ⋅ Jinkai Cui ⋅ Juyong Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 364
ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Yuiga Wada ⋅ Kazuki Matsuda ⋅ Komei Sugiura ⋅ Graham Neubig
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 365
Mitigating Multimodal Hallucinations via Gradient-based Self-Reflection
Shan Wang ⋅ Maying Shen ⋅ Nadine Chang ⋅ Chuong Nguyen ⋅ Hongdong Li ⋅ Jose M. Alvarez
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 366
HalluGen: Synthesizing Realistic and Controllable Hallucinations for Evaluating Image Restoration
Seunghoi Kim ⋅ Henry F. J. Tregidgo ⋅ Chen Jin ⋅ Matteo Figini ⋅ Daniel C. Alexander
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 367
KVSmooth: Mitigating Hallucination in Multi-modal Large Language Models through Key-Value Smoothing
Siyu Jiang ⋅ Feiyang Chen ⋅ Xiaojin Zhang ⋅ Kun He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 368
ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Video Understanding
Hao Lu ⋅ Jiahao Wang ⋅ Yaolun Zhang ⋅ Ruohui Wang ⋅ Xuanyu Zheng ⋅ Yepeng Tang ⋅ Dahua Lin ⋅ Lewei Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 369
Tell Model Where to Look: Mitigating Hallucinations in MLLMs by Vision-Guided Attention
Jianfei Zhao ⋅ Feng Zhang ⋅ Xin Sun ⋅ Chong Feng ⋅ Zhixing Tan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 370
Circular-DPO: Aligning Multi-Stage 3D Generative Models via Preference Feedback Loop
Zejian Li ⋅ Jiarui Ma ⋅ Han Xu ⋅ Weiting Zheng ⋅ Yangrui Zhu ⋅ Chenye Meng ⋅ Pei Chen ⋅ Ling Yang ⋅ Zhiyuan Yang ⋅ Changyuan Yang ⋅ Guang Yang ⋅ Immanuel Koh ⋅ Lingyun Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 371
Cloning Deterministic Worlds: The Critical Role of Latent Geometry in Long-Horizon World Models
Zaishuo Xia ⋅ Yukuan Lu ⋅ Xinyi Li ⋅ Yifan Xu ⋅ Yubei Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 372
PrITTI: Primitive-based Generation of Controllable and Editable 3D Semantic Urban Scenes
Christina Ourania Tze ⋅ Daniel Dauner ⋅ Yiyi Liao ⋅ Dzmitry Tsishkou ⋅ Andreas Geiger
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 373
CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
Lingen Li ⋅ Guangzhi Wang ⋅ Xiaoyu Li ⋅ Zhaoyang Zhang ⋅ Qi Dou ⋅ Jinwei Gu ⋅ Tianfan Xue ⋅ Ying Shan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 374
ExPose: Reinforcing Video Generation Models for Extreme Pose Estimation
Youngho Yoon ⋅ Wonjune Cho ⋅ Hyunho Ha ⋅ Sujung Kim ⋅ Kuk-Jin Yoon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 375
Choreographing a World of Dynamic Objects
Yanzhe Lyu ⋅ Chen Geng ⋅ Karthik Dharmarajan ⋅ Yunzhi Zhang ⋅ Hadi Alzayer ⋅ Shangzhe Wu ⋅ Jiajun Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 376
SounDiT: Geo-Contextual Soundscape-to-Landscape Generation
Junbo Wang ⋅ Haofeng Tan ⋅ Bowen Liao ⋅ Albert Jiang ⋅ Teng Fei ⋅ Qixing Huang ⋅ Bing Zhou ⋅ Zhengzhong Tu ⋅ Shan Ye ⋅ Yuhao Kang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 377
Vista4D: Video Reshooting with 4D Point Clouds
Kuan Heng Lin ⋅ Zhizheng Liu ⋅ Pablo Salamanca ⋅ Yash Kant ⋅ Ryan Burgert ⋅ Yuancheng Xu ⋅ Koichi Namekata ⋅ Yiwei Zhao ⋅ Bolei Zhou ⋅ Micah Goldblum ⋅ Paul Debevec ⋅ Ning Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 378
CamDirector: Towards Long-Term Coherent Video Trajectory Editing
Kejia Yin ⋅ Zhihao Shi ⋅ Weilin Wan ⋅ Yuhongze Zhou ⋅ YUANHAO YU ⋅ Xinxin Zuo ⋅ Qiang Sun ⋅ Juwei Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 379
Elastic3D: Controllable Stereo Video Conversion with Guided Latent Decoding
Nando Metzger ⋅ Prune Truong ⋅ Goutam Bhat ⋅ Konrad Schindler ⋅ Federico Tombari
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 380
Decoupling Bias, Aligning Distributions: Synergistic Fairness Optimization for Deepfake Detection
Feng Ding ⋅ Wenhui Yi ⋅ Yunpeng Zhou ⋅ Xinan He ⋅ Hong Rao ⋅ Shu Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 381
Target-Aware Invertible Encoder with Reconstruction Guidance for Infrared Small Target Detection
Shule Yan ⋅ Zetian Zhang ⋅ Xiao Ma ⋅ Zexuan Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 382
BDNet: Bio-Inspired Dual-Backbone Small Object Detection Network
Wenchao Guan ⋅ Chuan Lin ⋅ Sihan Huang ⋅ Xiongzhen Wang ⋅ Xintao Pang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 383
ElasticFormer: Detecting Objects in HRW Shots via Elastic Computing Vision Transformer
Wenxi Li ⋅ Jingchen Huang ⋅ Chenyang Lyu ⋅ Moran Liu ⋅ Haozhe Lin ⋅ Guiguang Ding ⋅ Yuchen Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 384
RGB-Event based Pedestrian Attribute Recognition: A Benchmark Dataset and An Asymmetric RWKV Fusion Framework
Xiao Wang ⋅ Haiyang Wang ⋅ Shiao Wang ⋅ Qiang Chen ⋅ Jiandong Jin ⋅ Haoyu Song ⋅ Bo Jiang ⋅ Chenglong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 385
FusionAgent: A Multimodal Agent with Dynamic Model Selection for Human Recognition
Jie Zhu ⋅ Xiao Guo ⋅ Yiyang Su ⋅ Anil Kumar Jain ⋅ Xiaoming Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 386
Free-Grained Hierarchical Visual Recognition
Seulki Park ⋅ Zilin Wang ⋅ Stella X. Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 387
URICA: A Uniformity Region Affine Identifier Capture Algorithm for Arbitrary Region Retrieval in Pathology Images
Ri Su ⋅ Zhao CHEN ⋅ Caleb Chen Cao ⋅ Lei Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 388
Online Data Curation for Object Detection via Marginal Contributions to Dataset-level Average Precision
Zitang Sun ⋅ Masakazu Yoshimura ⋅ Junji Otsuka ⋅ Atsushi Irie ⋅ Takeshi Ohashi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 389
DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video
Jiawei Hou ⋅ Shenghao Zhang ⋅ Can Wang ⋅ Zheng Gu ⋅ Yonggen Ling ⋅ Taiping Zeng ⋅ Xiangyang Xue ⋅ Jingbo Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 390
Follow the Saliency: Supervised Saliency for Retrieval-augmented Dense Video Captioning
Seung hee Choi ⋅ minju Jeon ⋅ Hyunwoo Oh ⋅ Jihwan Lee ⋅ Dong-Jin Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 391
Video-CoE: Reinforcing Video Event Prediction via Chain of Events
Qile Su ⋅ Jing Tang ⋅ Rui Chen ⋅ Lei Sun ⋅ Xiangxiang Chu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 392
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
Shuming Liu ⋅ Mingchen Zhuge ⋅ Changsheng Zhao ⋅ Jun Chen ⋅ Lemeng Wu ⋅ Zechun Liu ⋅ Chenchen Zhu ⋅ zhipeng cai ⋅ Chong Zhou ⋅ Haozhe Liu ⋅ Ernie Chang ⋅ Saksham Suri ⋅ Hongyu Xu ⋅ Qi Qian ⋅ Wei Wen ⋅ Balakrishnan Varadarajan ⋅ Zhuang Liu ⋅ Hu Xu ⋅ Florian Bordes ⋅ Raghuraman Krishnamoorthi ⋅ Bernard Ghanem ⋅ Vikas Chandra ⋅ Yunyang Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 393
VRR-QA: Visual Relational Reasoning in Videos Beyond Explicit Cues
Sirnam Swetha ⋅ Rohit Gupta ⋅ Parth Parag Kulkarni ⋅ David G. ⋅ Jeffrey A. Chan-Santiago ⋅ Nyle Siddiqui ⋅ Joseph Fioresi ⋅ Mubarak Shah
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 394
Question-guided Visual Compression with Memory Feedback for Long-Term Video Understanding
Sosuke Yamao ⋅ Natsuki Miyahara ⋅ Yuankai Qi ⋅ Shun Takeuchi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 395
CURVE: A Benchmark for Cultural and Multilingual Long Video Reasoning
Darshan Singh S ⋅ Arsha Nagrani ⋅ Kawshik Manikantan ⋅ Harman Singh ⋅ Dinesh Tewari ⋅ Tobias Weyand ⋅ Cordelia Schmid ⋅ Anelia Angelova ⋅ Shachi Dave
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 396
SVBench: Evaluation of Video Generation Models on Social Reasoning
Wenshuo Peng ⋅ Gongxuan Wang ⋅ Tianmeng Yang ⋅ Chuanhao Li ⋅ Xiaojie Xu ⋅ Hui He ⋅ Kaipeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 397
Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search
Xinlei Yin ⋅ Xiulian Peng ⋅ Xiao Li ⋅ Zhiwei Xiong ⋅ Yan Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 398
LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks
Hengjian Gao ⋅ Kaiwei Zhang ⋅ Shibo Wang ⋅ Mingjie Chen ⋅ Qihang Cao ⋅ Xianfeng Wang ⋅ Yucheng Zhu ⋅ Xiongkuo Min ⋅ Wei Sun ⋅ Dandan Zhu ⋅ Guangtao Zhai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 399
Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning
Haoji Zhang ⋅ Xin Gu ⋅ Jiawen Li ⋅ Chixiang Ma ⋅ Sule Bai ⋅ Chubin Zhang ⋅ bowen zhang ⋅ zhichao zhou ⋅ Dongliang He ⋅ Yansong Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 400
Attention Surgery: An Efficient Recipe to Linearize Your Video Diffusion Transformer
Mohsen Ghafoorian ⋅ Denis Korzhenkov ⋅ Amir Habibian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 401
YOSE: You Only Select Essential Tokens for Efficient DiT-based Video Object Removal
wu chenyang ⋅ Lina Lei ⋅ Fan Li ⋅ Chunle Guo ⋅ Dehong Kong ⋅ Xinran Qin ⋅ Zhixin Wang ⋅ Mingming Cheng ⋅ Chongyi Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 402
CADC: Content Adaptive Diffusion-Based Generative Image Compression
Xihua Sheng ⋅ lingyu ZHU ⋅ Tianyu Zhang ⋅ Dong Liu ⋅ Shiqi Wang ⋅ Jing Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 403
FG-Portrait: 3D Flow Guided Editable Portrait Animation
Yating Xu ⋅ Yunqi Miao ⋅ Evangelos Ververas ⋅ Jiankang Deng ⋅ Jifei Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 404
ResCa: Residual Caching for Diffusion Transformers Acceleration
Haipeng Fang ⋅ Yu Li ⋅ Fan Tang ⋅ Yixing Lu ⋅ Juan Cao ⋅ Sheng Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 405
IP-Adapter Is All You Need: Towards Fine-Tuning-Free Diffusion-Based Talking Face Generation
Hao Wu ⋅ Xiangyang Luo ⋅ Hao Wang ⋅ Jiawei Zhang ⋅ Yi Zhang ⋅ Jinwei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 406
SRA 2: Variational Autoencoder Self-Representation Alignment for Efficient Diffusion Training
Mengmeng Wang ⋅ Dengyang Jiang ⋅ Liuzhuozheng Li ⋅ Yucheng Lin ⋅ Guojiang Shen ⋅ Xiangjie Kong ⋅ Yong Liu ⋅ Guang Dai ⋅ Jingdong Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 407
InnoAds-Composer: Efficient Condition Composition for E-Commerce Poster Generation
Yuxin Qin ⋅ Ke Cao ⋅ Haowei Liu ⋅ Ao Ma ⋅ Fengheng Li ⋅ Honghe Zhu ⋅ Zheng Zhang ⋅ Run Ling ⋅ Wei Feng ⋅ Xuanhua He ⋅ Zhanjie Zhang ⋅ Zhen Guo ⋅ Haoyi Bian ⋅ Jingjing Lv ⋅ Junjie Shen ⋅ Ching Law
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 408
Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
Minh Quan Dao ⋅ Dimitris Metaxas
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 409
SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion Transformer
Tong Shao ⋅ Yusen Fu ⋅ Guoying Sun ⋅ Jingde Kong ⋅ Zhuotao Tian ⋅ Jingyong Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 410
DSERT-RoLL: Robust Multi-Modal Perception for Diverse Driving Conditions with Stereo Event-RGB-Thermal Cameras, 4D Radar, and Dual-LiDAR
Hoonhee Cho ⋅ Jae-Young Kang ⋅ Yuhwan Jeong ⋅ Yunseo Yang ⋅ Wonyoung Lee ⋅ Youngho Kim ⋅ Kuk-Jin Yoon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 411
A Semantically Disentangled Unified Model for Multi-category 3D Anomaly Detection
SuYeon Kim ⋅ Wongyu Lee ⋅ MyeongAh Cho
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 412
ReManNet: A Riemannian Manifold Network for Monocular 3D Lane Detection
Chengzhi Hong ⋅ Bijun Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 413
PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving
Yining Pan ⋅ Shijie Li ⋅ Yuchen Wu ⋅ Xulei Yang ⋅ Na Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 414
STUR3D: Spatio-Temporal Unified Representation Learning for 3D Object Detection
Huijie Fan ⋅ Pengrui huang ⋅ Qiang Wang ⋅ Baojie Fan ⋅ Jiahua Dong ⋅ Liangqiong Qu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 415
Exploring 6D Object Pose Estimation with Deformation
Zhiqiang Liu ⋅ Rui Song ⋅ Duanmu Chuangqi ⋅ Jiaojiao Li ⋅ David Ferstl ⋅ Yinlin Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 416
SearchAD: Large-Scale Rare Image Retrieval Dataset for Autonomous Driving
Felix Embacher ⋅ Jonas Uhrig ⋅ Marius Cordts ⋅ Markus Enzweiler
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 417
Improving Vision-language Models with Perception-centric Process Reward Models
Yingqian Min ⋅ Kun Zhou ⋅ Yifan Li ⋅ Yuhuan Wu ⋅ Han Peng ⋅ Yifan Du ⋅ Wayne Xin Zhao ⋅ Min Yang ⋅ Ji-Rong Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 418
X-PCR: A Benchmark for Cross-modality Progressive Clinical Reasoning in Ophthalmic Diagnosis
Gui Wang ⋅ Zehao Zhong ⋅ YongSong Zhou ⋅ Yudong Li ⋅ Ende Wu ⋅ Wooi Ping Cheah ⋅ Rong Qu ⋅ Jianfeng Ren ⋅ Linlin Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 419
Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction
Jiazhen Liu ⋅ Mingkuan Feng ⋅ Long Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 420
PhysInOne: Visual Physics Learning and Reasoning in One Suite
Siyuan Zhou ⋅ Hejun Wang ⋅ Hu Cheng ⋅ Jinxi Li ⋅ Dongsheng Wang ⋅ Junwei Jiang ⋅ Yixiao Jin ⋅ Jiayue Huang ⋅ Shiwei Mao ⋅ Shangjia Liu ⋅ Yafei Yang ⋅ Hongkang Song ⋅ Shenxing Wei ⋅ Zihui Zhang ⋅ DataTeam vLAR ⋅ Bing Wang ⋅ Zhihua Wang ⋅ Chuhang Zou ⋅ Bo Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 421
AviaSafe: A Physics-Informed Data-Driven Model for Aviation Safety–Critical Cloud Forecasts
ZIJIAN ZHU ⋅ Huang Qiusheng ⋅ Anboyu Guo ⋅ Xiaohui Zhong ⋅ Hao li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 422
TTRV: Test-Time Reinforcement Learning for Vision Language Models
Akshit Singh ⋅ Shyam Marjit ⋅ Wei Lin ⋅ Paul Gavrikov ⋅ Serena Yeung ⋅ Hilde Kuehne ⋅ Rogerio Feris ⋅ Sivan Doveh ⋅ James Glass ⋅ M. Jehanzeb Mirza
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 423
Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR
Yufeng Zhong ⋅ Lei Chen ⋅ Zhixiong Zeng ⋅ Xuanle Zhao ⋅ Deyang Jiang ⋅ Liming Zheng ⋅ Jing Huang ⋅ Haibo Qiu ⋅ Peng Shi ⋅ Siqi Yang ⋅ Lin Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 424
QUANTIPHY: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models
Puyin Li ⋅ Tiange Xiang ⋅ Ella Mao ⋅ Shirley Wei ⋅ Xinye Chen ⋅ Adnan Masood ⋅ Li Fei-Fei ⋅ Ehsan Adeli
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 425
VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs
Brigitta Malagurski Törtei ⋅ Yasser Dahou ⋅ Ngoc Dung Huynh ⋅ Wamiq Reyaz Para ⋅ Phúc H. Lê Khắc ⋅ Ankit Singh ⋅ Sofian Chaybouti ⋅ Sanath Narayan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 426
TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
JUNYUAN ZHANG ⋅ Bin Wang ⋅ Qintong Zhang ⋅ Fan Wu ⋅ Zichen Wen ⋅ Jialin Lu ⋅ Junjie Shan ⋅ Ziqi Zhao ⋅ Shuya Yang ⋅ Ziling Wang ⋅ Ziyang Miao ⋅ Huaping Zhong ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Ka-Ho Chow ⋅ Conghui He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 427
Urban-GS: A Unified 3D Gaussian Splatting Framework for Compact and High-Fidelity Aerial-to-Street Reconstruction
Meng Wang ⋅ Changqun Xia ⋅ Yuze Wang ⋅ Junyi Wang ⋅ Wantong Duan ⋅ Xinxiong Xie ⋅ Yue Qi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 428
Generalizable Sparse-View 3D Reconstruction from Unconstrained Images
Vinayak Gupta ⋅ Chih-Hao Lin ⋅ Shenlong Wang ⋅ Anand Bhattad ⋅ Jia-Bin Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 429
RemedyGS: Defend 3D Gaussian Splatting Against Computation Cost Attacks
Yanping LI ⋅ Zhening Liu ⋅ Zijian Li ⋅ Zehong Lin ⋅ Jun Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 430
SparseCam4D: Spatio-Temporally Consistent 4D Reconstruction from Sparse Cameras
Weihong Pan ⋅ XiaoYu Zhang ⋅ Zhuang Zhang ⋅ Zhichao Ye ⋅ Nan Wang ⋅ Haomin Liu ⋅ Guofeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 431
IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting
Wei Long ⋅ Haifeng Wu ⋅ SHIYIN JIANG ⋅ Jinhua Zhang ⋅ Xinchun Ji ⋅ Shuhang Gu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 432
GS^2: Graph-based Spatial Distribution Optimization for Compact 3D Gaussian Splatting
Xianben Yang ⋅ Tao Wang ⋅ Yuxuan Li ⋅ Yi Jin ⋅ Haibin Ling
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 433
OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting
Hongjia Zhai ⋅ Qi Zhang ⋅ Xiaokun Pan ⋅ Xiyu Zhang ⋅ Yitong Dong ⋅ Huaqi Zhang ⋅ Dan Xu ⋅ Guofeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 434
Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images
Xiangyu Sun ⋅ Haoyi Jiang ⋅ Liu Liu ⋅ Seungtae Nam ⋅ Gyeongjin Kang ⋅ Xinjie wang ⋅ Wei Sui ⋅ Zhizhong Su ⋅ Wenyu Liu ⋅ Xinggang Wang ⋅ Eunbyung Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 435
Learning Explicit Continuous Motion Representation for Dynamic Gaussian Splatting from Monocular Videos
Xuankai Zhang ⋅ Junjin Xiao ⋅ Shangwei Huang ⋅ Wei-Shi Zheng ⋅ Qing Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 436
MLLMSplat: A 2D MLLM-Powered Framework for 3D Gaussian Splatting Understanding, Generation, and Editing
Jingqiao Xiu ⋅ Can Wang ⋅ Dong Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 437
Dropping Anchor and Spherical Harmonics for Sparse-view Gaussian Splatting
Shuangkang Fang ⋅ I-Chao Shen ⋅ Xuanyang Zhang ⋅ Zesheng Wang ⋅ Yufeng Wang ⋅ Wenrui Ding ⋅ Gang Yu ⋅ Takeo Igarashi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 438
RAP: Fast Feedforward Rendering-Free Attribute-Guided Primitive Importance Score Prediction for Efficient 3D Gaussian Splatting Processing
Kaifa Yang ⋅ Qi Yang ⋅ Yiling Xu ⋅ Zhu Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 439
Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction
Yifan Mo ⋅ Youcheng Cai ⋅ Ligang Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 440
PointGS: Semantic-Consistent Unsupervised 3D Point Cloud Segmentation with 3D Gaussian Splatting
Yixiao Song ⋅ Qingyong Li ⋅ Wen Wang ⋅ Zhicheng Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 441
Scene Grounding in the Wild
Tamir Cohen ⋅ Leo Segre ⋅ Shay Shomer-Chai ⋅ Shai Avidan ⋅ Hadar Averbuch-Elor
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 442
Flow4DGS-SLAM: Optical Flow-Guided 4D Gaussian Splatting SLAM
Yunsong Wang ⋅ Gim Hee Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 443
Revisiting 3D Reconstruction Kernels as Low-Pass Filters
Shengjun Zhang ⋅ Min Chen ⋅ Yibo Wei ⋅ Mingyu Dong ⋅ Yueqi Duan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 444
SR3R: Rethinking Super-Resolution 3D Reconstruction With Feed-Forward Gaussian Splatting
Xiang Feng ⋅ Xiangbo Wang ⋅ Tieshi Zhong ⋅ Chengkai Wang ⋅ Yiting Zhao ⋅ Tianxiang Xu ⋅ Zhenzhong Kuang ⋅ Feiwei Qin ⋅ Xuefei Yin ⋅ Yanming Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 445
GP-4DGS: Probabilistic 4D Gaussian Splatting from Monocular Video via Variational Gaussian Processes
Mijeong Kim ⋅ Jungtaek Kim ⋅ Bohyung Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 446
VisRef: Visual Refocusing while Thinking Improves Test-Time Scaling in Multi-Modal Large Reasoning Models
Soumya Suvra Ghosal ⋅ Youngeun Kim ⋅ Zhuowei Li ⋅ Ritwick Chaudhry ⋅ Linghan Xu ⋅ Hongjing Zhang ⋅ Jakub Zablocki ⋅ Yifan Xing ⋅ Qin ZHANG
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 447
IPR-1: Interactive Physical Reasoner
Mingyu Zhang ⋅ lifeng zhuo ⋅ Tianxi Tan ⋅ Guocan Xie ⋅ Xian Nie ⋅ Yan Li ⋅ Renjie Zhao ⋅ Zizhu He ⋅ Ziyu Wang ⋅ Jiting Cai ⋅ Yonglu Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 448
VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension
Hyejin Park ⋅ Junhyuk Kwon ⋅ Suha Kwak ⋅ Jungseul Ok
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 449
Fuel Gauge: Estimating Chain-of-Thought Length Ahead of Time in Large Multimodal Models
Yuedong Yang ⋅ Xiwen Wei ⋅ Mustafa Munir ⋅ Radu Marculescu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 450
Thinking in Dynamics: How Multimodal Large Language Models Perceive, Track, and Reason Dynamics in Physical 4D World
Yuzhi Huang ⋅ Kairun Wen ⋅ Rongxin Gao ⋅ Dongxuan Liu ⋅ Yibin Lou ⋅ Jie Wu ⋅ Jing Xu ⋅ Jian Zhang ⋅ Zheng Yang ⋅ yunlong lin ⋅ Chenxin Li ⋅ Panwang Pan ⋅ Junbin Lu ⋅ Jingyan Jiang ⋅ Xinghao Ding ⋅ Yue Huang ⋅ Zhi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 451
Latent Implicit Visual Reasoning
Kelvin Li ⋅ Chuyi Shang ⋅ Leonid Karlinsky ⋅ Rogerio Feris ⋅ Trevor Darrell ⋅ Roei Herzig
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 452
Thinking with Programming Vision: Towards a Unified View for Thinking with Images
Zirun Guo ⋅ Minjie Hong ⋅ Feng Zhang ⋅ Kai Jia ⋅ Tao Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 453
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs
Lidong Lu ⋅ Guo Chen ⋅ Wei Zhu ⋅ Zhiqi Li ⋅ Yicheng Liu ⋅ Tong Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 454
All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models
Xinyu Tian ⋅ Shu Zou ⋅ Zhaoyuan Yang ⋅ Mengqi He ⋅ Peter Henry Tu ⋅ Jing Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 455
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
Shuoshuo Zhang ⋅ Yizhen Zhang ⋅ JINGJING FU ⋅ Lei Song ⋅ Jiang Bian ⋅ Yujiu Yang ⋅ Rui Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 456
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
Zeyuan Yang ⋅ Xueyang Yu ⋅ Delin Chen ⋅ Maohao Shen ⋅ Chuang Gan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 457
ReaGEN: Adaptive Generation of Structured Chains-of-Thought for Efficient Multimodal Reasoning
Ruiqing Tian ⋅ Mohan Sai Singamsetti ⋅ Di Niu ⋅ Bahador Rashidi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 458
Breaking the Regional Perception Bottleneck of Multimodal Large Language Models via External Reasoning Framework
Jinrong Zhang ⋅ Zhaoyang Xu ⋅ Xusheng He ⋅ Xinrui Li ⋅ Na Zheng ⋅ Jianlong Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 459
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
Tongkun Guan ⋅ Zhibo Yang ⋅ Jianqiang Wan ⋅ Mingkun Yang ⋅ Zhentao Guo ⋅ Zijian Hu ⋅ Ruilin Luo ⋅ Ruizhe Chen ⋅ Sontao Jiang ⋅ Peng Wang ⋅ Wei Shen ⋅ Junyang Lin ⋅ Xiaokang Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 460
TableMix: Enhancing Multimodal Table Reasoning in MLLMs from a Data-Centric Perspective
Chaohu Liu ⋅ Shida Wang ⋅ Yubo Wang ⋅ Linli Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 461
Harnessing Chain-of-Thought Reasoning in Multimodal Large Language Models for Face Anti-Spoofing
Honglu Zhang ⋅ Zhiqin Fang ⋅ Ningning Zhao ⋅ Saihui Hou ⋅ Long Ma ⋅ Renwang Pei ⋅ Zhaofeng He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 462
Grounded Chain-of-Thought for Multimodal Large Language Models
Qiong Wu ⋅ Xiangcong Yang ⋅ Yiyi Zhou ⋅ Chenxin Fang ⋅ Baiyang Song ⋅ Xiaoshuai Sun ⋅ Rongrong Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 463
LS-ViT: Least-Squares Hessian Based Block Reconstruction for Low-Bit Post-Training Quantization of Vision Transformers
Hyunha Hwang ⋅ Xuan Truong Nguyen ⋅ Hyuk-Jae Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 464
SegMo: Co-Designing Content-Aware Sparsity and Locally-Cohesive Segment Parallelism for Efficient VLM Inference
Haojuan Li ⋅ Ruohan Tang ⋅ Dongzhou Cheng ⋅ Zongpu Zhang ⋅ Jian Li ⋅ Jiaqi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 465
Rethinking Asymmetric Quantization: Hidden Symmetry in Vision Model Weights
Masafumi Mori ⋅ Shinya Gongyo ⋅ Mitsuru Ambai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 466
Compressed-Domain-Aware Online Video Super-Resolution
Yuhang Wang ⋅ Hai Li ⋅ Shujuan Hou ⋅ Zhetao Dong ⋅ Xiaoyao Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 467
CAR-SAM: Cross-Attention Reconstruction for Post-Training Quantization of the Segment Anything Model
Houji Wen ⋅ Jiangyong Yu ⋅ Dawei Yang ⋅ Jun Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 468
Is Bin Generation Indispensable? A Bin-Generation-Free Dataset Quantization via Semantic Perspective
Maijie Deng ⋅ Yuhua Li ⋅ Yixiong Zou ⋅ Yao Wu ⋅ Chenru Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 469
High Resolution Neural Video Coding with Bi-directional Confidence-Guided Reference Information Modeling
Feng Ye ⋅ Kai Zhang ⋅ Li zhang ⋅ Chuanmin Jia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 470
Distributed Image Compression with Multimodal Side Information at Extremely Low Bitrates
Guojun Xu ⋅ Mingyang Zhang ⋅ Jianwen Xiang ⋅ Cheng Tan ⋅ Yanchao Yang ⋅ Junwei Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 471
Task-Aware Image Signal Processor for Advanced Visual Perception
CHEN KAI ⋅ Jin Xiao ⋅ Leheng Zhang ⋅ Kexuan Shi ⋅ Shuhang Gu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 472
Enhancing Video Vision Language Model with Hippocampal Sensing
Xu Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 473
VIRD: View-Invariant Representation through Dual-Axis Transformation for Cross-View Pose Estimation
Juhye Park ⋅ Wooju Lee ⋅ Dasol Hong ⋅ Changki Sung ⋅ Youngwoo Seo ⋅ DongWan Kang ⋅ Hyun Myung
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 474
WRIVINDER: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery
Chandrakanth Gudavalli ⋅ Tajuddin Manhar Mohammed ⋅ Abhay Yadav ⋅ Ananth Vishnu Bhaskar ⋅ Hardik Prajapati ⋅ Cheng Peng ⋅ Rama Chellappa ⋅ Shivkumar Chandrasekaran ⋅ B.S. Manjunath
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 475
SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs
Koonting Yip ⋅ Qiyan Zhao ⋅ Wenhao Yu ⋅ Liangyu Yuan ⋅ Mingkai LI ⋅ Xiaofeng Zhang ⋅ Jianmin Ji ⋅ Yanyong Zhang ⋅ Qing Jiang ⋅ Ka-Veng Yuen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 476
RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization
Junwei Zheng ⋅ Ruize Dai ⋅ Ruiping Liu ⋅ Zichao Zeng ⋅ Yufan Chen ⋅ Fangjinhua Wang ⋅ Kunyu Peng ⋅ Kailun Yang ⋅ Jiaming Zhang ⋅ Rainer Stiefelhagen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 477
EfficientVPR: Toward Efficient Visual Place Recognition via Scene-Aware Prompt Tuning and Adaptive Feature Enhancement
Wenjing Tang ⋅ Chuanguang Yang ⋅ Zhulin An ⋅ Libo Huang ⋅ boyu diao ⋅ Yongjun Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 478
Universal Guideline-Driven Image Clustering via a Hybrid LLM Agent
Wenliang Zhong ⋅ Rob Barton ⋅ Lucas Goncalves ⋅ Kushal Kumar ⋅ Feng Jiang ⋅ Hehuan Ma ⋅ Yuzhi Guo ⋅ Vidit Bansal ⋅ Karim Bouyarmane ⋅ Junzhou Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 479
ReLaX: Reasoning with Latent Exploration for Large Reasoning Models
Shimin Zhang ⋅ Xianwei Chen ⋅ Yufan Shen ⋅ Ziyuan Ye ⋅ Jibin Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 480
VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning
Boyu Chen ⋅ Zikang Wang ⋅ Zhengrong Yue ⋅ Kainan Yan ⋅ Chenyun Yu ⋅ Yi Huang ⋅ Zijun Liu ⋅ Yafei Wen ⋅ Xiaoxin Chen ⋅ Yang Liu ⋅ Peng Li ⋅ Yali Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 481
Think, Then Verify: A Hypothesis–Verification Multi-Agent Framework for Long Video Understanding
Zheng Wang ⋅ Haoran Chen ⋅ Haoxuan Qin ⋅ Zhipeng Wei ⋅ Tianwen Qian ⋅ Cong Bai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 482
Reinforce to Learn, Elect to Reason: A Dual Paradigm for Video Reasoning
Songyuan Yang ⋅ Weijiang Yu ⋅ Jilin Ma ⋅ Ziyu Liu ⋅ Guijian Tang ⋅ Wenjing Yang ⋅ Huibin Tan ⋅ Nong Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 483
Graph-to-Frame RAG: Visual-Space Knowledge Fusion for Training-Free and Auditable Video Reasoning
Songyuan Yang ⋅ Weijiang Yu ⋅ Ziyu Liu ⋅ Guijian Tang ⋅ Wenjing Yang ⋅ Huibin Tan ⋅ Nong Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 484
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
Zuhao Yang ⋅ Sudong Wang ⋅ Kaichen Zhang ⋅ Keming Wu ⋅ Sicong Leng ⋅ Yifan Zhang ⋅ Bo Li ⋅ Chengwei Qin ⋅ Shijian Lu ⋅ Xingxuan Li ⋅ Lidong Bing
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 485
Multi-Modal Image Fusion via Intervention-Stable Feature Learning
Xue Wang ⋅ Zheng Guan ⋅ Wenhua Qian ⋅ Chengchao Wang ⋅ Runzhuo MA
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 486
ReCoFuse: Ultra-Robust Image Fusion via Restorative Multi-Modal Diffusion Reciprocal Coupling
HAO ZHANG ⋅ Shuhan Yang ⋅ Linfeng Tang ⋅ Xunpeng Yi ⋅ Jiayi Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 487
Degradation-Robust Fusion: An Efficient Degradation-Aware Diffusion Framework for Multimodal Image Fusion in Arbitrary Degradation Scenarios
Yu Shi ⋅ Yu Liu ⋅ Zhong-Cheng Wu ⋅ Juan Cheng ⋅ Huafeng Li ⋅ Xun Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 488
DF^2-VB: Dual-level Fuzzy Fusion with View-specific Boosting for Multi-view Multi-label Classification
Yuena Lin ⋅ Haichun Cai ⋅ Yi Shan ⋅ Hao Wei ⋅ Yongjian Deng ⋅ Zhen Yang ⋅ Gengyu Lyu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 489
UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation
Xingyuan Li ⋅ Songcheng Du ⋅ Yang Zou ⋅ HaoYuan Xu ⋅ Zhiying Jiang ⋅ Jinyuan Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 490
Self-guided Semantic Inspection for Zero-Shot Composed Image Retrieval
Jingjing Zhang ⋅ Lei Zhang ⋅ Zheren Fu ⋅ Bo Hu ⋅ Zhendong Mao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 491
G-MIXER: Geodesic Mixup-based Implicit Semantic Expansion and Explicit Semantic Re-ranking for Zero-Shot Composed Image Retrieval
jiyoung lim ⋅ Heejae Yang ⋅ Jee-Hyong Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 492
No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degrading Zero-shot Capabilities of Contrastive Models
Hai X. Pham ⋅ David T. ⋅ Ricardo Guerrero ⋅ Brais Martinez
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 493
MUSE: Harnessing Precise and Diverse Semantics for Few-Shot Whole Slide Image Classification
Jiahao Xu ⋅ Sheng Huang ⋅ Xin Zhang ⋅ Zhixiong Nan ⋅ Jiajun Dong ⋅ Nankun Mu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 494
Pointing at Parts: Training-Free Few-Shot Grounding in Multimodal LLMs
Shiang-Feng Tsai ⋅ Yuan-Hong Liao ⋅ Jin-Cheng Jhang ⋅ Nan Qiao ⋅ Min Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 495
Graph Attention Prototypical Network for Robust Few-Shot Classification
Tingyun Liu ⋅ Licheng Liu ⋅ Qibin Zhang ⋅ Qiying Feng ⋅ C. L. Philip Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 496
Mitigating The Distribution Shift of Diffusion-based Dataset Distillation
Yue Xu ⋅ Chenyu Hu ⋅ Pengyu An ⋅ Yonglu Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 497
EVLF: Early Vision-Language Fusion for Generative Dataset Distillation
WENQI CAI ⋅ Yawen Zou ⋅ Guang Li ⋅ Chunzhi Gu ⋅ Chao Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 498
Fixed Anchors Are Not Enough: Dynamic Retrieval and Persistent Homology for Dataset Distillation
Muquan Li ⋅ Hang Gou ⋅ Yingyi Ma ⋅ Rongzheng Wang ⋅ Ke Qin ⋅ Tao He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 499
Flow Map Distillation Without Data
Shangyuan Tong ⋅ Nanye Ma ⋅ Saining Xie ⋅ Tommi Jaakkola
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 500
F^2HDR: Two-Stage HDR Video Reconstruction via Flow Adapter and Physical Motion Modeling
Huanjing Yue ⋅ Dawei Li ⋅ Shaoxiong Tu ⋅ Jingyu Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 501
Learning Latent Transmission and Glare Maps for Lens Veiling Glare Removal
Xiaolong Qian ⋅ Qi Jiang ⋅ Lei Sun ⋅ Zongxi Yu ⋅ Kailun Yang ⋅ Peixuan Wu ⋅ Jiacheng Zhou ⋅ Yao Gao ⋅ Yaoguang Ma ⋅ Ming-Hsuan Yang ⋅ Kaiwei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 502
Inter-Photon-Limited Videography
Andrew Xie ⋅ Dongyu Du ⋅ Sotiris Nousias ⋅ David B. Lindell ⋅ Kiriakos N. Kutulakos
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 503
A Bit is All You Need! Efficient Video Capture via Single Bit Imaging
Kanchana Vaishnavi Gandikota ⋅ Michael Moeller ⋅ Andreas Kolb ⋅ Bhaskar Choubey ⋅ Paramanand Chandramouli
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 504
From Events to Clarity: The Event-Guided Diffusion Framework for Dehazing
Ling Wang ⋅ Yunfan Lu ⋅ Wenzong Ma ⋅ Huizai Yao ⋅ Pengteng Li ⋅ Hui Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 505
Electromagnetic Inverse Scattering from a Single Transmitter
Yizhe Cheng ⋅ Chunxun Tian ⋅ Haoru Wang ⋅ Wentao Zhu ⋅ Xiaoxuan Ma ⋅ Yizhou Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 506
Statistical Characteristic-Guided Denoising for Rapid High-Resolution Transmission Electron Microscopy Imaging
Hesong Li ⋅ Ziqi Wu ⋅ Ruiwen Shao ⋅ Ying Fu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 507
Physics-Guided Multistep Deformation Reversal for Ancient Bamboo Slip Restoration
Qianqian Tang ⋅ Jinchi Zhu ⋅ Xiaolu Zhou ⋅ Yongchao Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 508
cryoSENSE: Compressive Sensing Enables High-throughput Microscopy with Sparse and Generative Priors on the Protein Cryo-EM Image Manifold
Zain Shabeeb ⋅ Daniel Saeedi ⋅ Darin Tsui ⋅ Vida Jamali ⋅ Amirali Aghazadeh
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 509
SGDE: Self-supervised Geometry Degradation Estimation Framework for Coded Aperture Compressive Spectral Imaging
Yuqiao He ⋅ Xiaoyan LIU ⋅ Jianxu Mao ⋅ Yaonan Wang ⋅ Hui Zhang ⋅ Lizhu Liu ⋅ Yurong Chen ⋅ Wenbin He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 510
Factorized Context Aggregation for Robust Cancer Risk Estimation via Soft Re-Ranked Retrieval and Hierarchical Anchors
Puria Azadi Moghadam ⋅ Ali Khajegili Mirabadi ⋅ Behnam Maneshgar ⋅ Hossein Farahani ⋅ Ali Bashashati
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 511
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Zhuangcheng Gu ⋅ Guang Liang ⋅ Bin Wang ⋅ Zhiyuan Zhao ⋅ Qintong Zhang ⋅ Weijia Li ⋅ Chao Xu ⋅ Bo Zhang ⋅ Botian Shi ⋅ Jiang Wu ⋅ Wentao Zhang ⋅ Conghui He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 512
GeneVAR: Causal MeanFlow for Autoregressive Gene-to-WSI Tile Synthesis
Jianwei Zhao ⋅ Fan Yang ⋅ XIN LI ⋅ Qiang Zhai ⋅ Ao Luo ⋅ Ziqi Ren ⋅ Zhicheng Jiao ⋅ Hong Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 513
Depth Any Endoscopy: Towards Self-Supervised Generalizable Depth Estimation in Monocular Endoscopy
Shuwei Shao ⋅ Kejin Zhu ⋅ Shixing Ma ⋅ Xinzhe Du ⋅ Baochang Zhang ⋅ Zhe Min
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 514
RoSAMDepth: Robust Self-supervised Depth Estimation Leveraging Segment Anything Model
Xuanang Gao ⋅ Ning Zhiwei ⋅ Gengming Zhang ⋅ Jiaxi Cao ⋅ Runze Yang ⋅ Zhonglong Zheng ⋅ JIE YANG ⋅ Rong Xiao ⋅ Wei Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 515
AdaSFormer: Adaptive Serialized Transformers for Monocular Semantic Scene Completion from Indoor Environments
xuzhi wang ⋅ Xinran Wu ⋅ Song Wang ⋅ Lingdong Kong ⋅ Ziping Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 516
Dark3R: Learning Structure from Motion in the Dark
Andrew Y. Guo ⋅ Anagh Malik ⋅ SaiKiran Tedla ⋅ Yutong Dai ⋅ Yiqian Qin ⋅ Zach Salehe ⋅ Benjamin Attal ⋅ Sotiris Nousias ⋅ Kiriakos N. Kutulakos ⋅ David B. Lindell
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 517
What Makes Good Synthetic Training Data for Zero-Shot Stereo Matching?
David Yan ⋅ Alexander Raistrick ⋅ Jia Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 518
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Dual-Level Scale-Oriented Contrast
Beilei Cui ⋅ Yiming Huang ⋅ Long Bai ⋅ Hongliang Ren
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 519
Iris: Integrating Language into Diffusion-based Monocular Depth Estimation
Ziyao Zeng ⋅ Jingcheng Ni ⋅ Daniel Wang ⋅ Patrick Rim ⋅ Younjoon Chung ⋅ Fengyu Yang ⋅ Byung-Woo Hong ⋅ Alex Wong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 520
Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos
ZIREN GONG ⋅ Xiaohan Li ⋅ Fabio Tosi ⋅ Jiawei Han ⋅ Stefano Mattoccia ⋅ Jianfei Cai ⋅ Matteo Poggi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 521
M3DLayout: A Multi-Source Dataset of 3D Indoor Layouts and Structured Descriptions for 3D Generation
Yiheng Zhang ⋅ Zhuojiang Cai ⋅ Mingdao Wang ⋅ Meitong Guo ⋅ Tianxiao Li ⋅ Li Lin ⋅ Yuwang Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 522
UniPart: Part-Level 3D Generation with Unified 3D Geom–Seg Latents
Xufan He ⋅ Yushuang Wu ⋅ Xiaoyang Guo ⋅ Chongjie Ye ⋅ Jiaqing Zhou ⋅ Tianlei Hu ⋅ Xiaoguang Han ⋅ Dong Du
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 523
Photo3D: Advancing Photorealistic 3D Generation through Structure-Aligned Detail Enhancement
Xinyue Liang ⋅ Zhiyuan Ma ⋅ Lingchen Sun ⋅ Yanjun Guo ⋅ Lei Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 524
Mesh-Pro: Asynchronous Advantage-guided Ranking Preference Optimization for Artist-style Quadrilateral Mesh Generation
Zhen Zhou ⋅ Jian Liu ⋅ Biwen Lei ⋅ Jing Xu ⋅ Haohan Weng ⋅ Yiling Zhu ⋅ Zhuo Chen ⋅ Junfeng Fan ⋅ Yunkai Ma ⋅ Dazhao Du ⋅ Song Guo ⋅ Fengshui Jing ⋅ Chunchao Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 525
Order Matters: 3D Shape Generation from Sequential VR Sketches
Yizi Chen ⋅ Sidi Wu ⋅ Tianyi Xiao ⋅ Nina Wiedemann ⋅ Loic Landrieu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 526
Think-Then-Generate: Structural Chain-of-Thought Reasoning for Consistent 3D Generation
Xinyue Liu ⋅ Jin Liu ⋅ Hongbo Wang ⋅ Ran He ⋅ Huaibo Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 527
ArtLLM: Generating Articulated Assets via 3D LLM
Penghao Wang ⋅ Siyuan Xie ⋅ Jiawei Zhou ⋅ Xianghui Yang ⋅ Jingwei Huang ⋅ Chunchao Guo ⋅ Jiayuan Gu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 528
PoseMaster: A Unified 3D Native Framework for Stylized Pose Generation
Hongyu Yan ⋅ Kunming Luo ⋅ Weiyu Li ⋅ Kaiyi Zhang ⋅ Yixun Liang ⋅ Jingwei Huang ⋅ Chunchao Guo ⋅ Ping Tan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 529
2D-LFM: Lifting Foundation Model without 3D Supervision
Mosam Dabhi ⋅ Irhas Gill ⋅ László A. Jeni ⋅ Simon Lucey
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 530
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion
Remy Sabathier ⋅ David Novotny ⋅ Niloy J. Mitra ⋅ Tom Monnier
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 531
4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models
Yiting Lu ⋅ Wei Luo ⋅ Peiyan Tu ⋅ Haoran Li ⋅ Hanxin Zhu ⋅ Zihao Yu ⋅ Xingrui Wang ⋅ Xinyi Chen ⋅ Xinge Peng ⋅ Xin Li ⋅ Zhibo Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 532
FabricGen: Microstructure-Aware Woven Fabric Generation
Yingjie Tang ⋅ Di Luo ⋅ Zixiong Wang ⋅ Xiaoli Ling ⋅ Jian Yang ⋅ Beibei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 533
Leveraging Verifier-Based Reinforcement Learning in Image Editing
Hanzhong Guo ⋅ Jie Wu ⋅ Jie Liu ⋅ Yu Gao ⋅ Zilyu Ye ⋅ Linxiao Yuan ⋅ Xionghui Wang ⋅ Yizhou Yu ⋅ Weilin Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 534
PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling
Bowen Ping ⋅ Chengyou Jia ⋅ Minnan Luo ⋅ Changliang Xia ⋅ Xin Shen ⋅ Zhuohang Dang ⋅ Hangwei Qian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 535
VIVA: VLM-Guided Instruction-Based Video Editing with Reward Optimization
Xiaoyan Cong ⋅ Haotian Yang ⋅ Angtian Wang ⋅ Yizhi Wang ⋅ Yiding Yang ⋅ Canyu Zhang ⋅ Chongyang Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 536
MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models
Chieh-Yun Chen ⋅ Zhonghao Wang ⋅ Qi Chen ⋅ Zhifan Ye ⋅ Min Shi ⋅ Yue Zhao ⋅ Yinan Zhao ⋅ Hui Qu ⋅ Wei-An Lin ⋅ Yiru Shen ⋅ Ajinkya Kale ⋅ Irfan Essa ⋅ Humphrey Shi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 537
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
Yunhong Lu ⋅ Yanhong Zeng ⋅ Haobo Li ⋅ Hao Ouyang ⋅ Qiuyu Wang ⋅ Ka Leong Cheng ⋅ Jiapeng Zhu ⋅ Hengyuan Cao ⋅ Zhipeng Zhang ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Min Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 538
C^2FG: Control Classifier-Free Guidance via Score Discrepancy Analysis
Jiayang Gao ⋅ Tianyi Zheng ⋅ Jiayang Zou ⋅ Fengxiang Yang ⋅ Shice Liu ⋅ Luyao Fan ⋅ Zheyu Zhang ⋅ Hao Zhang ⋅ Jinwei Chen ⋅ Peng-Tao Jiang ⋅ Bo Li ⋅ Jia Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 539
Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generation
Ruiying Liu ⋅ Yuanzhi Liang ⋅ Haibin Huang ⋅ Tianshu Yu ⋅ Chi Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 540
Unified Customized Generation by Disentangled Reward Modeling
Shaojin Wu ⋅ Mengqi Huang ⋅ Yufeng Cheng ⋅ wenxu wu ⋅ Jiahe Tian ⋅ Yiming Luo ⋅ Fei Ding ⋅ Qian HE
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 541
Region-Aware Instance Consistency Learning for Micro-Expression Recognition
Yaomin Cai ⋅ C.L.Philip Chen ⋅ Shiting Xu ⋅ Haiqi Liu ⋅ Tong Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 542
MPL: Match-guided Prototype Learning for Few-shot Action Recognition
Feng Yang ⋅ Jie Zhao ⋅ Fulin Luo ⋅ Anyong Qin ⋅ Tiecheng Song ⋅ Yue Zhao ⋅ CHENQIANG GAO ⋅ Junwei Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 543
LaDy: Lagrangian-Dynamic Informed Network for Skeleton-based Action Segmentation via Spatial-Temporal Modulation
Haoyu Ji ⋅ Xueting Liu ⋅ Yu Gao ⋅ Wenze Huang ⋅ Zhihao Yang ⋅ Weihong Ren ⋅ Zhiyong Wang ⋅ Honghai LIU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 544
LA-Pose: Latent Action Pretraining Meets Pose Estimation
Zhengqing Wang ⋅ Saurabh Nair ⋅ Prajwal Chidananda ⋅ Pujith Kachana ⋅ Samuel Li ⋅ Matthew Brown ⋅ Yasutaka Furukawa
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 545
RAAS: LLM Agentic System Architecture Search with GRPO
Jiayi Yang ⋅ Guancheng Wan ⋅ Man Zhang ⋅ Mang Ye
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 546
Temporal Representation Enhancement (TRE): Learning to Forget Dominant Patterns for Enhanced Temporal Spiking Features
Wei Liu ⋅ Li Yang ⋅ Yufei Wang ⋅ Han Xiao ⋅ Boyu Cai ⋅ Weiming Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 547
Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models
Jiawei Fan ⋅ Shigeng Wang ⋅ Chao Li ⋅ Xiaolong Liu ⋅ Anbang Yao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 548
Unlocking Pre-trained Weights: Parameter Inheritance for Zero-Shot Initialization
Jiaze Xu ⋅ Shiyu Xia ⋅ Jiaqi Lv ⋅ Xin Geng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 549
Deconstructing the Failure of Ideal Noise Correction: A Three-Pillar Diagnosis
Chen Feng ⋅ Zhuo ZHI ⋅ Zhao Huang ⋅ Jiawei Ge ⋅ Ling Xiao ⋅ Nicu Sebe ⋅ Georgios Tzimiropoulos ⋅ Ioannis Patras
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 550
Progressive Neural Architecture Generation
Caiyang Yu ⋅ Chen Huang ⋅ Yun Liu ⋅ Chenwei Tang ⋅ Wei Ju ⋅ Jiancheng Lv
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 551
A Unified Framework for Knowledge Transfer in Bidirectional Model Scaling
Jianlu Shen ⋅ Fu Feng ⋅ Jiaze Xu ⋅ Yucheng Xie ⋅ Jiaqi Lv ⋅ Xin Geng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 552
When Do Models Actually Decide? Mapping the Layer-Wise Decision Timeline in Pretrained Neural Networks
Minhyeok Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 553
Temporal Interaction in Spiking Transformers with Multi-Delay Mixer
Kexin Shi ⋅ Hanwen Liu ⋅ Zeyang Song ⋅ Yang Liu ⋅ Jieyuan Zhang ⋅ Shuai Wang ⋅ Jibin Wu ⋅ Malu Zhang ⋅ Yang Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 554
Consensus vs. Controversy: Mapping the Decision Space Where Architectures Diverge
Minhyeok Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 555
Sparsely Timing the Change: A Spiking Temporal Framework for Remote Sensing Interpretation
Shilong Li ⋅ Xiurui Xie ⋅ Qiugang Zhan ⋅ Luochao Wang ⋅ Yong Deng ⋅ Guisong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 556
ProSoftArena: Benchmarking Hierarchical Capabilities of Multi-modal Agents in Professional Software Environments
Jiaxin Ai ⋅ Yukang Feng ⋅ Fanrui Zhang ⋅ Jianwen Sun ⋅ Zizhen Li ⋅ Chuanhao Li ⋅ Yifan Chang ⋅ Wenxiao Wu ⋅ Ruoxi Wang ⋅ Mingliang Zhai ⋅ Kaipeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 557
BAMI: Training-Free Bias Mitigation in GUI Grounding
Borui Zhang ⋅ Bo Zhang ⋅ Bo Wang ⋅ Wenzhao Zheng ⋅ Yuhao Cheng ⋅ Liang Tang ⋅ Yiqiang Yan ⋅ Jie Zhou ⋅ Jiwen Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 558
DRS-GUI: Dynamic Region Search for Training-Free GUI Grounding
Yichao Liu ⋅ Huawen Shen ⋅ Liu Yu ⋅ Shiyu Liu ⋅ Zeyu Chen ⋅ Yu ZHOU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 559
Consistency Beyond Contrast: Enhancing Open-Vocabulary Object Detection Robustness via Contextual Consistency Learning
bozhao Li ⋅ Shaocong Wu ⋅ Tong Shao ⋅ Senqiao Yang ⋅ Qiben Shan ⋅ Zhuotao Tian ⋅ Jingyong Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 560
Thermal-Det: Language-Guided Cross-Modal Distillation for Open-Vocabulary Thermal Object Detection
Yasiru Ranasinghe ⋅ Elim Schenck ⋅ Florence Yellin ⋅ Shuowen Hu ⋅ Christopher Funk ⋅ Vishal M. Patel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 561
Geometry-driven OOD Detectors Are Class-Incremental Learners
Wangwang Jia ⋅ Zijian Gao ⋅ Tianjiao Wan ⋅ Yuan Cao ⋅ Yong Dou ⋅ Kele Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 562
Mind the Way You Select Negative Texts: Pursuing the Distance Consistency in OOD Detection with VLMs
Zhikang Xu ⋅ Qianqian Xu ⋅ Zitai Wang ⋅ Cong Hua ⋅ Sicong Li ⋅ Zhiyong Yang ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 563
Prompt-Free Unknown Label Generation for Open World Detection in Remote Sensing
Abdullah Azeem ⋅ Ruisheng Wang ⋅ Qingquan Li ⋅ Abubakar Siddique
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 564
Learning to Diversify and Focus: A Reinforcement Framework for Open-Vocabulary HOI Detection
Yongchao Xu ⋅ Jiawei Liu ⋅ Junfeng Wang ⋅ Sen Tao ⋅ Na Jiang ⋅ Zheng-Jun Zha
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 565
RINO: Rotation-Invariant Non-Rigid Correspondences
Maolin Gao ⋅ Shao Jie Hu-Chen ⋅ Congyue Deng ⋅ Riccardo Marin ⋅ Leonidas Guibas ⋅ Daniel Cremers
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 566
Hyperbolic Prototype Learning with Uncertainty-Aware Consistency for Continual Test-Time Segmentation
Siddhant Gole ⋅ Akash Pal ⋅ Amit Popat More ⋅ S Divakar Bhat ⋅ Subhasis Chaudhuri ⋅ Biplab Banerjee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 567
DINO Eats CLIP: Adapting Beyond Knowns for Open-set 3D Object Retrieval
Xinwei He ⋅ Yansong Zheng ⋅ Qianru Han ⋅ Zhichuan Wang ⋅ Yuxuan Cai ⋅ Yang Zhou ⋅ Jingbo Xia ⋅ Yulong Wang ⋅ Jinhai Xiang ⋅ Xiang Bai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 568
Leveraging Class Distributions in CLIP for Weakly Supervised Semantic Segmentation
Ziqian Yang ⋅ Xinqiao Zhao ⋅ Xiaolei Wang ⋅ Quan Zhang ⋅ Jimin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 569
CompetitorFormer: Mitigating Query Conflicts for 3D Instance Segmentation via Competitive Strategy
wang duanchu ⋅ Junjie Yang ⋅ Haoran Gong ⋅ Jing Liu ⋅ Di Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 570
D2Dewarp: Dual Dimensions Geometric Representation Learning Based Document Image Dewarping
Heng Li ⋅ Xiangping Wu ⋅ Qingcai Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 571
Discover, Segment, and Select: A Progressive Mechanism for Zero-shot Camouflaged Object Segmentation
Yilong Yang ⋅ Jianxin Tian ⋅ Shengchuan Zhang ⋅ Liujuan Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 572
D-Convexity: A Unified Differentiable Convex Shape Prior via Quasi-Concavity for Data-driven Image Segmentation
Shengzhe Chen ⋅ Hao Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 573
Fast Reasoning Segmentation for Images and Videos
Yiqing Shen ⋅ Mathias Unberath
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 574
Structure-Aware Representation Distillation for Tiny-Dense Object Segmentation
Xuesong Liu ⋅ Anke Xu ⋅ Wenbo Cao ⋅ Emmett Ientilucci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 575
CRFT: Consistent–Recurrent Feature Flow Transformer for Cross-Modal Image Registration
Xuecong Liu ⋅ Mengzhu Ding ⋅ Zixuan Sun ⋅ Zhang Li ⋅ Xichao Teng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 576
FireScope: Wildfire Risk Raster Prediction With a Chain-of-Thought Oracle
Mario Markov ⋅ Stefan Ailuro ⋅ Luc Van Gool ⋅ Konrad Schindler ⋅ Danda Paudel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 577
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation
Henry Herzog ⋅ Favyen Bastani ⋅ Yawen Zhang ⋅ Gabriel Tseng ⋅ Joseph Redmon ⋅ Hadrien Sablon ⋅ Ryan Park ⋅ Jacob Morrison ⋅ Alexandra Buraczynski ⋅ Karen Farley ⋅ Josh Hansen ⋅ Andrew Howe ⋅ Patrick Alan Johnson ⋅ Mark Otterlee ⋅ Ted Schmitt ⋅ Hunter Pitelka ⋅ Stephen Daspit ⋅ Rachel Ratner ⋅ Christopher Wilhelm ⋅ Sebastian Wood ⋅ Mike Jacobi ⋅ Hannah Kerner ⋅ Evan Shelhamer ⋅ Ali Farhadi ⋅ Ranjay Krishna ⋅ Patrick Beukema
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 578
TESSERA: Temporal Embeddings of Surface Spectra for Earth Representation and Analysis
Zhengpeng Feng ⋅ Clement Atzberger ⋅ Sadiq Jaffer ⋅ Jovana Knezevic ⋅ Silja Sormunen ⋅ Robin Young ⋅ Madeline C. Lisaius ⋅ Markus Immitzer ⋅ Toby Jackson ⋅ James Ball ⋅ David A. Coomes ⋅ Anil Madhavapeddy ⋅ Andrew Blake ⋅ Srinivasan Keshav
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 579
Regulating Rather than Constraining: Adaptive Guidance for Complex Spectral Reconstruction in Pansharpening
Zhuwei Wen ⋅ Zimin Xia ⋅ He Chen ⋅ Linwei Yue ⋅ Xianwei Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 580
GeoMMBench and GeoMMAgent: Toward Expert-Level Multimodal Intelligence in Geoscience and Remote Sensing
Aoran Xiao ⋅ Shihao Cheng ⋅ Yonghao Xu ⋅ Yexian Ren ⋅ Hongruixuan Chen ⋅ Naoto Yokoya
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 581
Revisiting the Necessity of Full Accuracy: Weakly Supervised Object-Level Offset Correction for Misaligned Building Labels
Junda Xu ⋅ Yanmeng Liu ⋅ Xiangqiang Zeng ⋅ Jinrong Wu ⋅ Ying Qu ⋅ Libao Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 582
UniGeoSeg: Towards Unified Open-World Segmentation for Geospatial Scenes
Shuo Ni ⋅ Di Wang ⋅ He Chen ⋅ Haonan Guo ⋅ Ning Zhang ⋅ Jing Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 583
ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks
Ruixun Liu ⋅ Bowen Fu ⋅ Jiayi Song ⋅ Kaiyu Li ⋅ Wanchen Li ⋅ Lanxuan Xue ⋅ Hui Qiao ⋅ Weizhan Zhang ⋅ Deyu Meng ⋅ Xiangyong Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 584
Unleashing Stealthy Backdoor Pandemic by Infecting a Single Diffusion Model
Mohaiminul Al Nahian ⋅ Abeer Matar Almalky ⋅ Sabbir Ahmed ⋅ Abdullah Al Arafat ⋅ Mamshad Nayeem Rizve ⋅ Adnan Rakin Rakin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 585
Taming the Long Tail: Rebalancing Adversarial Training via Adaptive Perturbation
Lilin Zhang ⋅ Yimo Guo ⋅ Yue Li ⋅ Jiancheng Shi ⋅ Xianggen Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 586
Robustness Under Data Scarcity: Few-Shot Continual Adversarial Training for Evolving Threats
Wenxuan Wang ⋅ Chenglei Wang ⋅ Chengzhi Yan ⋅ Xuelin Qian ⋅ Yanning Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 587
Logit-Margin Repulsion for Backdoor Defense
Zhiguo Yang ⋅ Dongsheng Xu ⋅ Ruizhi Zhong ⋅ Jiacheng Pi ⋅ Xingxing Huang ⋅ Wenjie Ruan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 588
Thermally Activated Dual-Modal Adversarial Clothing against AI Surveillance Systems
Jiahuan Long ⋅ Tingsong Jiang ⋅ Hanqing Liu ⋅ Chao Ma ⋅ Weien Zhou ⋅ Yang Yang ⋅ Wen Yao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 589
Immunizing Models Against Harmful Long-Horizon Fine-Tuning via Contractive Optimization Dynamics
Najibul Haque Sarker ⋅ Zaber Ibn Abdul Hakim ⋅ Ali Asgarov ⋅ Chia-Wei Tang ⋅ Alvi Md Ishmam ⋅ Chris Thomas
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 590
Towards Stealthy and Effective Backdoor Attacks on Lane Detection: A Naturalistic Data Poisoning Approach
YIFAN LIAO ⋅ Yuxin Cao ⋅ Yedi Zhang ⋅ Wentao He ⋅ Yan XIAO ⋅ Xianglong Du ⋅ Zhiyong Huang ⋅ Jin Song Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 591
Red-teaming Retrieval-Augmented Diffusion Models via Poisoning Knowledge Bases
Xinqi Lyu ⋅ Liu of second author ⋅ Dong Wang ⋅ Bin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 592
Latent Diffusion Inversion Requires Understanding the Latent Space
Mingxing Rao ⋅ Bowen Qu ⋅ Daniel Moyer
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 593
Fractal Camouflage: A Bio-Inspired Approach for Multi-Scale Adversarial Attacks in the Infrared Domain
Chengyin Hu ⋅ Xin wang ⋅ Rui Qiu ⋅ Zhe Jia ⋅ Yingying Zhao ⋅ Kai Wang ⋅ Xu Kang ⋅ Yiwei Wei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 594
EgoRoC: Towards Egocentric Robotic Control via Task-Agnostic Visual Alignment
Wei Feng ⋅ Chi Zhang ⋅ Nan Li ⋅ Qian Zhang ⋅ Qi Zhang ⋅ Mingyan Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 595
Describe Anything Anywhere At Any Moment
Nicolas Gorlo ⋅ Lukas Schmid ⋅ Luca Carlone
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 596
StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation
Mingyu Liu ⋅ Jiuhe Shu ⋅ Hui Chen ⋅ Zeju Li ⋅ Canyu Zhao ⋅ Jiange Yang ⋅ Shenyuan Gao ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 597
VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling
weiqi li ⋅ Quande Zhang ⋅ ruifeng zhai ⋅ Liang Lin ⋅ Guangrun Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 598
Action–Geometry Prediction with 3D Geometric Prior for Bimanual Manipulation
Chongyang Xu ⋅ Li Haipeng ⋅ Shen Cheng ⋅ Haoqiang Fan ⋅ Ziliang Feng ⋅ Shuaicheng Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 599
Joint-Aligned Latent Action: Towards Scalable VLA Pretraining in the Wild
Hao Luo ⋅ Ye Wang ⋅ Wanpeng Zhang ⋅ Haoqi Yuan ⋅ Yicheng Feng ⋅ Haiweng Xu ⋅ Sipeng Zheng ⋅ Zongqing Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 600
Rethinking Camera Choice: An Empirical Study on Fisheye Camera Properties in Robotic Manipulation
Han Xue ⋅ Nan Min ⋅ Xiaotong Liu ⋅ Wendi Chen ⋅ Fang Yuan ⋅ Jun Lv ⋅ Cewu Lu ⋅ Chuan Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 601
INSIGHT Bench: Towards Grounded IN-SItu Guidance for Robotic ManipulaTion
Seonho Kim ⋅ Junhyeong Hong ⋅ Kyungjae Lee ⋅ Yoonseon Oh
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 602
MM-ACT: Learn from Multimodal Parallel Generation to Act
Haotian Liang ⋅ Xinyi Chen ⋅ Bin Wang ⋅ MingKang Chen ⋅ Yitian Liu ⋅ Yuhao Zhang ⋅ Zanxin Chen ⋅ Tianshuo Yang ⋅ Yilun Chen ⋅ Jiangmiao Pang ⋅ Dong Liu ⋅ Xiaokang Yang ⋅ Yao Mu ⋅ Wenqi Shao ⋅ Ping Luo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 603
HQC-NBV: A Hybrid Quantum-Classical View Planning Approach
Xiaotong Yu ⋅ Chang Wen Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 604
Motus: A Unified Latent Action World Model
Hongzhe Bi ⋅ Hengkai Tan ⋅ Shenghao Xie ⋅ Zeyuan Wang ⋅ Shuhe Huang ⋅ Haitian Liu ⋅ Ruowen Zhao ⋅ Yao Feng ⋅ Chendong Xiang ⋅ Yinze Rong ⋅ Hongyan Zhao ⋅ Hanyu Liu ⋅ Zhizhong Su ⋅ Lei Ma ⋅ Hang Su ⋅ Jun Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 605
SE(3)-Equivariance with Geometric and Topological Guidance for Category-Level Object Pose Estimation
Sheng Yu ⋅ Di-Hua Zhai ⋅ Yuanqing Xia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 606
SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding
Nikolay Nikolov ⋅ Giuliano Albanese ⋅ Sombit Dey ⋅ Aleksandar Yanev ⋅ Luc Van Gool ⋅ Jan-Nico Zaech ⋅ Danda Paudel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 607
Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation
Zaijing Li ⋅ Bing Hu ⋅ Rui Shao ⋅ Gongwei Chen ⋅ Dongmei Jiang ⋅ Pengwei Xie ⋅ Jianye Hao ⋅ Liqiang Nie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 608
RoboTAG: End-to-end Robot Pose Estimation via Topological Alignment Graph
Yifan Liu ⋅ Fangneng Zhan ⋅ Wanhua Li ⋅ Haowen Sun ⋅ Katerina Fragkiadaki ⋅ Hanspeter Pfister
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 609
MVLM: Template-Free Tracking via Vision–Language Margin Confidence and Memory-Gated Tracking
Dae-Hyeon Park ⋅ Mina Baek ⋅ Jeong-Hun Ha ⋅ Chan-Seop Park ⋅ Jamshidjon Ganiev ⋅ Seung-Hwan Bae
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 610
Interactive Tracking: A Human-in-the-Loop Paradigm with Memory-Augmented Adaptation
Yuqing Huang ⋅ Guotian Zeng ⋅ Zhenqiao Yuan ⋅ Zhenyu He ⋅ Xin Li ⋅ Yaowei Wang ⋅ Ming-Hsuan Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 611
VidEoMT: Your ViT is Secretly Also a Video Segmentation Model
Narges Norouzi ⋅ Idil Esen Zulfikar ⋅ Niccolò Cavagnero ⋅ Tommie Kerssies ⋅ Bastian Leibe ⋅ Gijs Dubbelman ⋅ Daan de Geus
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 612
Matching Every Pair to Track Every Point: PairFormer for All-Pairs Tracking and Video Trajectory Fields
Guangyang Wu ⋅ Youran Ding ⋅ Xinyu Che ⋅ BENYUAN SUN ⋅ Yi Yang ⋅ Xiaohong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 613
Boosting Self-Supervised Tracking with Contextual Prompts and Noise Learning
Yaozong Zheng ⋅ Qihua Liang ⋅ Bineng Zhong ⋅ Shuimu Zeng ⋅ Yuanliang Xue ⋅ Ning Li ⋅ Shuxiang Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 614
Progressive Multi-cue Alignment for Unaligned RGBT Tracking
Jiandong Jin ⋅ Chenglong Li ⋅ Hao Feng ⋅ Andong Lu ⋅ Lili Huang ⋅ Jin Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 615
Real-Time Neural Video Compression with Unified Intra and Inter Coding
Hui Xiang ⋅ Yifan Bian ⋅ Li Li ⋅ Jingran Wu ⋅ Xianguo Zhang ⋅ Dong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 616
Adapting Lightweight Image-based Counting Models for Video Crowd Counting
Weibo Shu ⋅ Antoni B. Chan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 617
Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis
Pei Liu ⋅ xiangxiang Zeng ⋅ Tengfei Ma ⋅ Yucheng Xing ⋅ Xuanbai Ren ⋅ Yiping Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 618
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis
Yuting Zhang ⋅ Kaishen Yuan ⋅ Hao Lu ⋅ Yutao Yue ⋅ Jintai Chen ⋅ Kaishun Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 619
MedKCO: Medical Vision-Language Pretraining via Knowledge-Driven Cognitive Orchestration
Chenran Zhang ⋅ Ruiqi Wu ⋅ Tao Zhou ⋅ Yi Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 620
Toward Generalizable Whole Brain Representations with High-Resolution Light-Sheet Data
Minyoung E. Kim ⋅ Dae Hee Yun ⋅ Aditi V. Patel ⋅ Madeline Hon ⋅ Webster Guan ⋅ Taegeon Lee ⋅ Brian Nguyen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 621
CryoHype: Reconstructing a thousand cryo-EM structures with transformer-based hypernetworks
Jeffrey Gu ⋅ Minkyu Jeon ⋅ Ambri Ma ⋅ Serena Yeung ⋅ Ellen D. Zhong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 622
GenTract: Generative Global Tractography
Alec Sargood ⋅ Lemuel Puglisi ⋅ Elinor Thompson ⋅ Mirco Musolesi ⋅ Daniel C. Alexander
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 623
LUMINA: A Multi-Vendor Mammography Benchmark with Energy Harmonization Protocol
Hongyi Pan ⋅ Gorkem Durak ⋅ Halil Ertugrul Aktas ⋅ Andrea M. Bejar ⋅ Baver Tutun ⋅ Emre Uysal ⋅ Ezgi Bülbül ⋅ Mehmet Faith Dogan ⋅ Berrin Erok ⋅ Berna Yildirim ⋅ Sukru Mehmet Erturk ⋅ Ulas Bagci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 624
Virtual Immunohistochemistry Staining with Dual-Aligned Multi-Task Feature Guidance
Shigeng Xie ⋅ Hongming Xu ⋅ Guiyang Jiang ⋅ Tuomo Rossi ⋅ Tommi Kärkkäinen ⋅ Fengyu Cong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 625
Can Natural Image Autoencoders Compactly Tokenize fMRI Volumes for Long-Range Dynamics Modeling?
Peter Yongho Kim ⋅ Juhyeon Park ⋅ Jungwoo Park ⋅ Jubin Choi ⋅ Jungwoo Seo ⋅ Jiook Cha ⋅ Taesup Moon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 626
IEBGL: An Interpretability-Enhanced Brain Graph Learning Framework with LLM-Instructed Topology and Literature-Augmented Semantics
Yihang Duan ⋅ Shuo Huang ⋅ Lizhang Lizhang ⋅ Meiling Wang ⋅ Li Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 627
F^2-Assist: Multi-Phase Fetal Growth Forecast and Report Generation from Ultrasound Examination
Bin Pu ⋅ XUSHENG LIANG ⋅ Xinpeng Ding ⋅ Jinlin Wu ⋅ Zhen Lei ⋅ Shengli Li ⋅ Kenli Li ⋅ Jiawei Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 628
Sparse Spectral LoRA: Routed Experts for Medical VLMs
Omid Nejatimanzari ⋅ Hojat Asgariandehkordi ⋅ Taha Koleilat ⋅ Yiming Xiao ⋅ Hassan Rivaz
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 629
SAT-RRG: LLM-Guided Self-Adaptive Training for Radiology Report Generation with Token-Level Push–Pull Optimization
YUNYI LIU ⋅ Yingshu Li ⋅ Tong Chen ⋅ Lingqiao Liu ⋅ Lei Wang ⋅ Luping Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 630
OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis
Yuxuan Fan ⋅ JING HAO ⋅ Hong Chen ⋅ Jiahao Bao ⋅ Yihua Shao ⋅ Yuci Liang ⋅ Kuo Feng Hung ⋅ Hao Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 631
Structural–Semantic Perception for Diffusion-Guided Temporal Forgery Localization
Ligong Cao ⋅ Yeting Guo ⋅ Haoang Chi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 632
Forensic-Friendly Image Manipulation via Controllable Latent Diffusion
Hanyu Chen ⋅ Haiwei Wu ⋅ Jinyu Tian ⋅ Jianqing Li ⋅ Jiantao Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 633
IncreFA: Breaking the Static Wall of Generative Model Attribution
Haotian Qin ⋅ Dongliang Chang ⋅ Yueying Gao ⋅ Yuexuan Tan ⋅ Lei Chen ⋅ Zhanyu Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 634
AVFakeBench: A Comprehensive Audio-Video Forgery Detection Benchmark for AV-LMMs
Shuhan Xia ⋅ Peipei Li ⋅ Xuannan Liu ⋅ Dongsen Zhang ⋅ Xinyu Guo ⋅ Zekun Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 635
Detecting Compressed AI-Generated Images via Phase Spectrum Robustness
Kai Li ⋅ Wenqi Ren ⋅ Wei Wang ⋅ Xiaochun Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 636
Detect Any AI-Counterfeited Text Image
Chenfan Qu ⋅ Yiwu Zhong ⋅ Xuekang Zhu ⋅ Junchi Li ⋅ Changjiang Jiang ⋅ Jian liu ⋅ Lianwen Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 637
DeepfakeImpact: A Two-Stage Benchmark with Real-World Impact in Deepfake Detection
Chaoyu Gong ⋅ Han Zhang ⋅ Siqiang Luo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 638
Enhancing the Security of Visual Speaker Authentication Based on Dynamic Lip-Print Analysis
Yi He ⋅ Lei Yang ⋅ Bofan Chen ⋅ Shilin Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 639
SimLBR: Learning to Detect Fake Images by Learning to Detect Real Images
Aayush Dhakal ⋅ Subash Khanal ⋅ Srikumar Sastry ⋅ Jacob Arndt ⋅ Philipe Ambrozio Dias ⋅ Dalton Lunga ⋅ Nathan Jacobs
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 640
Editprint: General Digital Image Forensics via Editing Fingerprint with Self-Augmentation Training
Haiwei Wu ⋅ Kemou Li ⋅ Yuanman Li ⋅ Jiantao Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 641
Detecting AI-Generated Forgeries via Iterative Manifold Deviation Amplification
Jiangling Zhang ⋅ Shuxuan Gao ⋅ Bofan Liu ⋅ Siqiang Feng ⋅ Jirui Huang ⋅ Yaxiong Chen ⋅ Ziyu Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 642
Goldilocks Test Sets for Face Verification
Haiyu Wu ⋅ Sicong Tian ⋅ Aman Bhatta ⋅ Jacob Gutierrez ⋅ Grace Bezold ⋅ Genesis Argueta ⋅ Karl Ricanek ⋅ Michael C. King ⋅ Kevin W. Bowyer
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 643
Fine-VAD: Towards Fine-Grained Video Anomaly Detection via Progressive Cross-Granularity Learning
Menghao Zhang ⋅ Yiyan Zhu ⋅ Pengfei Ren ⋅ Haifeng Sun ⋅ Qi Qi ⋅ Zirui Zhuang ⋅ Huazheng Wang ⋅ Lei Zhang ⋅ Jianxin Liao ⋅ Jingyu Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 644
DLVP-CLIP: Enhancing Fine-Grained Zero-Shot Anomaly Detection via Dynamic Local Visual Prompting
Gaowei Zhang ⋅ Lihe Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 645
MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection
Jun Yeong Park ⋅ JunYoung Seo ⋅ Minji Kang ⋅ Yu Rang Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 646
Alert-CLIP: Abnormality-aware Latent-Enhanced Representation Tuning of CLIP for Video Anomaly Detection
Yiyan Zhu ⋅ Menghao Zhang ⋅ Haifeng Sun ⋅ Pengfei Ren ⋅ Xianao Chu ⋅ Chenye Xu ⋅ Hong Tan ⋅ Jinghan Wang ⋅ Qi Qi ⋅ Jingyu Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 647
AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors
Matic Fučka ⋅ Vitjan Zavrtanik ⋅ Danijel Skočaj
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 648
LayoutAD: Exploring Semantic-Geometric Misalignment Reasoning for Scene Layout Anomaly Detection
Zhichao Zeng ⋅ Jiasheng Zhang ⋅ Jiyun Sun ⋅ Jiangtao Cui ⋅ Xiaotian Qiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 649
Bidirectional Multimodal Prompt Learning with Scale-Aware Training for Few-Shot Multi-Class Anomaly Detection
Yujin Lee ⋅ Sewon Kim ⋅ Daeun Moon ⋅ Seoyoon Jang ⋅ Hyunsoo Yoon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 650
GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning
Zehao Deng ⋅ An Liu ⋅ Yan Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 651
TLMA: Mitigating the Impact of Weakly Labeled Information for Video Anomaly Detection
Rong Xu ⋅ Runqi Wang ⋅ Yingjun Zhang ⋅ Tao Tao ⋅ Xiaomeng Li ⋅ Liping Jing
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 652
Defect Cue-Preserved Structural Feature Refinement for Few-Shot Anomaly Detection
Le Jiang ⋅ Yan Huang ⋅ Zhen Xu ⋅ Yong Xu ⋅ Hau San Wong ⋅ Si Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 653
Anomaly-Related Residual Fields for Cross-domain Anomaly Detection
Kewei Gao ⋅ Jiayi Xie ⋅ Zhengda Shen ⋅ Weijun Qin ⋅ Lingxiang Jia ⋅ Kejia Chen ⋅ Zunlei Feng ⋅ Yijun Bei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 654
From Attraction to Equilibrium: Physics-Inspired Semantic Gravitons for Zero-Shot Anomaly Detection
Yuwen Pan ⋅ Yuan Wang ⋅ Shaohui Li ⋅ Zhi Li ⋅ Yu LIU ⋅ You He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 655
Joint Learning of General and Diverse Patterns with Mixture of Memory Experts for Weakly-Supervised Video Anomaly Detection
Bo Sun ⋅ Junxi Chen ⋅ Zhe Wu ⋅ Feng Gao ⋅ Fan Yang ⋅ Li Su ⋅ Yaowei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 656
No Need For Real Anomaly: MLLM Empowered Zero-Shot Video Anomaly Detection
Zunkai Dai ⋅ Ke Li ⋅ JIAJIA LIU ⋅ Jie Yang ⋅ Yuanyuan Qiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 657
FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement
Ming Hu ⋅ Yongsheng Huo ⋅ Mingyu Dou ⋅ Jianfu Yin ⋅ Peng Zhao ⋅ Yao Wang ⋅ Cong Hu ⋅ Bingliang Hu ⋅ Quan Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 658
DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving
Zhuolin He ⋅ Jing Li ⋅ Guanghao Li ⋅ Xiaolei Chen ⋅ Jiacheng Tang ⋅ Siyang Zhang ⋅ Zhounan Jin ⋅ Feipeng Cai ⋅ Bin Li ⋅ Jian Pu ⋅ Jia Cai ⋅ Xiangyang Xue
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 659
GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation
Zhenya Yang ⋅ Zhe Liu ⋅ Yuxiang Lu ⋅ Liping Hou ⋅ Chenxuan Miao ⋅ peng siyi ⋅ Bailan Feng ⋅ Xiang Bai ⋅ Hengshuang Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 660
Test-Time 3D Occupancy Prediction
Fengyi Zhang ⋅ Xiangyu Sun ⋅ Huitong Yang ⋅ Zheng Zhang ⋅ Zi Huang ⋅ Yadan Luo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 661
Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration
Sicheng Mo ⋅ Thao Nguyen ⋅ Richard Zhang ⋅ Nick Kolkin ⋅ Siddharth Srinivasan Iyer ⋅ Eli Shechtman ⋅ Krishna Kumar Singh ⋅ Yong Jae Lee ⋅ Bolei Zhou ⋅ Yuheng Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 662
Diffusion Mental Averages
Phonphrm Thawatdamrongkit ⋅ Sukit Seripanitkarn ⋅ Supasorn Suwajanakorn
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 663
dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Yi Xin ⋅ Siqi Luo ⋅ Tianxiang Xu ⋅ Qi Qin ⋅ Haoxing Chen ⋅ Kaiwen Zhu ⋅ Zhiwei Zhang ⋅ Yangfan He ⋅ Rongchao Zhang ⋅ Jinbin Bai ⋅ Shuo Cao ⋅ Bin Fu ⋅ Junjun He ⋅ Yihao Liu ⋅ Yuewen Cao ⋅ Xiaohong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 664
RegionRoute: Regional Style Transfer with Diffusion Model
Bowen Chen ⋅ Jake Zuena ⋅ Alan C. ⋅ Divya Kothandaraman
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 665
Low-Rank Residual Diffusion Models
Junfu Tan ⋅ Jiang Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 666
RDF-MIG: A Robust Diffusion Framework for Masked Image Generation to Augment Semantic Segmentation and Change Detection
Zian Cao ⋅ Wei Wei ⋅ QINGSHAN GAO ⋅ Yuanyuan Fu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 667
TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration
Shaoxuan He ⋅ Benlei Cui ⋅ Bukun Huang ⋅ Zhizeng Ye ⋅ Yunyun Sun ⋅ Longtao Huang ⋅ Hui Xue ⋅ Yang Yang ⋅ Haiwen Hong ⋅ Jingqun Tang ⋅ Zhou Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 668
Bi-directional Autoregressive Diffusion for Large Complex Motion Interpolation
Yongrui Ma ⋅ Shijie Zhao ⋅ Mingde Yao ⋅ Junlin Li ⋅ Li zhang ⋅ Xiaohong Liu ⋅ Qi Dou ⋅ Jinwei Gu ⋅ Tianfan Xue
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 669
Guiding Token-Sparse Diffusion Models
Felix Krause ⋅ Stefan Andreas Baumann ⋅ Johannes Schusterbauer ⋅ Olga Grebenkova ⋅ Ming Gui ⋅ Vincent Tao Hu ⋅ Björn Ommer
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 670
Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep
Tianyi Liu ⋅ Ye Lu ⋅ Linfeng Zhang ⋅ Chen Cai ⋅ Jianjun Gao ⋅ Yi Wang ⋅ Kim-Hui Yap ⋅ Lap-Pui Chau
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 671
See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis
Jaehyun Park ⋅ Minyoung Ahn ⋅ Minkyu Kim ⋅ Jonghyun Lee ⋅ Jae-Gil Lee ⋅ Dongmin Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 672
High-Fidelity Virtual Try-On beyond Paired Data Scarcity via Diffusion-based Cycle-Consistent Learning
Jia Wu ⋅ Yijing Dai ⋅ Tingfeng Cao ⋅ Meiling Wu ⋅ Tao Luo ⋅ Jian Dong Zhang ⋅ Guangming Lu ⋅ Xiaoyi Zeng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 673
Sampling-Aware Quantization for Diffusion Models
Qian Zeng ⋅ Jie Song ⋅ Yuanyu Wan ⋅ Huiqiong Wang ⋅ Mingli Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 674
CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think
Zening Sun ⋅ Zhengpeng Xie ⋅ Lichen Bai ⋅ Shitong Shao ⋅ Shuo Yang ⋅ Zeke Xie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 675
Scale Space Diffusion
Soumik Mukhopadhyay ⋅ Prateksha Udhayanan ⋅ Abhinav Shrivastava
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 676
Making Training-Free Diffusion Segmentors Scale with the Generative Power
Benyuan Meng ⋅ Qianqian Xu ⋅ Zitai Wang ⋅ Xiaochun Cao ⋅ Longtao Huang ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 677
Roots Beneath the Cut: Uncovering the Risk of Concept Recovery in Pruning-Based Unlearning for Diffusion Models
Ci Zhang ⋅ Zhaojun Ding ⋅ Chence Yang ⋅ Jun Liu ⋅ Xiaoming Zhai ⋅ Shaoyi Huang ⋅ Beiwen Li ⋅ Xiaolong Ma ⋅ Jin Lu ⋅ Geng Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 678
Few-Step Diffusion Sampling Through Instance-Aware Discretizations
Liangyu Yuan ⋅ Ruoyu Wang ⋅ Tong Zhao ⋅ Dingwen Fu ⋅ Mingkun Lei ⋅ Beier Zhu ⋅ Chi Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 679
SpeeDiff: Scalable Pixel-Anchored End-to-End Latent Diffusion Model
Bingliang Zhang ⋅ Wenda Chu ⋅ Yizhuo Li ⋅ Linjie Yang ⋅ Yisong Yue ⋅ Katherine L. Bouman ⋅ Yang Song ⋅ Qiushan Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 680
Structure-to-Intensity Diffusion for Adverse-Weather LiDAR Generation
Peiyang Ni ⋅ Longyu Yang ⋅ Lu Zhang ⋅ Kuniaki Saito ⋅ Yap-Peng Tan ⋅ Fumin Shen ⋅ Heng Tao Shen ⋅ Xiaofeng Zhu ⋅ Ping Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 681
Focal–General Diffusion Model with Semantic Consistent Guidance for Sign Language Production
Yiheng Yu ⋅ Sheng Liu ⋅ Yuan Feng ⋅ Zhelun Jin ⋅ Yining Jiang ⋅ Min Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 682
Diffusion Probe: Generated Image Result Prediction Using CNN Probes
Bukun Huang ⋅ Benlei Cui ⋅ Zhizeng Ye ⋅ Xuemei Dong ⋅ Tuo Chen ⋅ Hui Xue ⋅ Dingkang Yang ⋅ Longtao Huang ⋅ Haiwen Hong ⋅ Jingqun Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 683
Content-Aware Dynamic Patchification for Efficient Video Diffusion
Sheng Li ⋅ Connelly Barnes ⋅ Mamshad Nayeem Rizve ⋅ Hongwu Peng ⋅ Zhengang Li ⋅ Ohi Dibua ⋅ Alireza Ganjdanesh ⋅ Xulong Tang ⋅ Yan Kang ⋅ Yifan Gong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 684
PixelRush: Ultra-Fast, Training-Free High-Resolution Image Generation via One-step Diffusion
Hong-Phuc Lai ⋅ Phong Nguyen ⋅ Anh Tran
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 685
Diffusion-Based sRGB Real Noise Generation via Prompt-Driven Noise Representation Learning
Jaekyun Ko ⋅ Dongjin Kim ⋅ Soomin Lee ⋅ Guanghui Wang ⋅ Tae Hyun Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 686
Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation
Ziyue Lin ⋅ Jiahe Hou ⋅ Xia Hongyu ⋅ Xinrui Xie ⋅ Feifei Wang ⋅ Yuyin Zhou ⋅ Wei Wang ⋅ Jiawei Liu ⋅ Liangqiong Qu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 687
GROW: Watermark Generation with Progressive Guidance for Diffusion Models
Pengcheng Luo ⋅ Zexi Jia ⋅ Yijia Zhong ⋅ Jinchao Zhang ⋅ Jie Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 688
MotionV2V: Editing Motion in a Video
Ryan Burgert ⋅ Charles Herrmann ⋅ Forrester Cole ⋅ Michael Ryoo ⋅ Neal Wadhwa ⋅ Andrey Voynov ⋅ Nataniel Ruiz
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 689
Mind the Generative Details: Direct Localized Detail Preference Optimization for Video Diffusion Models
Zitong Huang ⋅ Kaidong Zhang ⋅ Yukang Ding ⋅ Chao Gao ⋅ Rui Ding ⋅ Ying Chen ⋅ Wangmeng Zuo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 690
OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
Ali Aliev ⋅ Kamil Garifullin ⋅ Nikolay Yudin ⋅ Vera Soboleva ⋅ Alexander Molozhavenko ⋅ Ivan Oseledets ⋅ Aibek Alanov ⋅ Maxim Rakhuba
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 691
DreamStyle: A Unified Framework for Video Stylization
Mengtian Li ⋅ Jinshu Chen ⋅ Songtao Zhao ⋅ Wanquan Feng ⋅ Pengqi Tu ⋅ Qian HE
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 692
Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering
SIXIAN WANG ⋅ Zhiwei Tang ⋅ Tsung-Hui Chang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 693
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
Peiyu Yu ⋅ Suraj Kothawade ⋅ Sirui Xie ⋅ Ying Nian Wu ⋅ Hongliang Fei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 694
Reward Sharpness-Aware Fine-Tuning for Diffusion Models
Kwanyoung Kim ⋅ Byeongsu Sim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 695
DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation
SANKARSHANA VENUGOPAL ⋅ Mohammad Mostafavi ⋅ Jonghyun Choi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 696
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
Yuqing Wang ⋅ Chuofan Ma ⋅ Zhijie Lin ⋅ Yao Teng ⋅ Lijun Yu ⋅ Shuai Wang ⋅ Jiaming Han ⋅ Jiashi Feng ⋅ Yi Jiang ⋅ Xihui Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 697
TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration
Haowei Zhu ⋅ Tingxuan Huang ⋅ XING WANG ⋅ Tianyu Zhao ⋅ Jiexi Wang ⋅ Weifeng Chen ⋅ Xurui Peng ⋅ Fangmin Chen ⋅ Junhai Yong ⋅ Bin Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 698
Cross-modal Representation Learning for Diffusion-generated Image Detection
Tao Gong ⋅ Dayong Wang ⋅ Qi Chu ⋅ Bin Liu ⋅ Nenghai Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 699
Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models
Shufan Li ⋅ Jiuxiang Gu ⋅ Kangning Liu ⋅ Zhe Lin ⋅ Zijun Wei ⋅ Aditya Grover ⋅ Jason Kuen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 700
Back to Basics: Let Denoising Generative Models Denoise
Tianhong Li ⋅ Kaiming He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 701
CaricHarmony: Contrastive Diffusion Paths for Identity-Preserving Caricature Synthesis
Dongyu Wang ⋅ Dar-Yen Chen ⋅ Yi-Zhe Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 702
DiP: Taming Diffusion Models in Pixel Space
Zhennan Chen ⋅ junwei zhu ⋅ Xu Chen ⋅ Jiangning Zhang ⋅ Xiaobin Hu ⋅ Hanzhen Zhao ⋅ Chengjie Wang ⋅ Jian Yang ⋅ Ying Tai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 703
RAPID: Reusing Attention Sparsity with Inter-step Adaptation for Efficient Video Diffusion
Shangran Lin ⋅ Lu Lu ⋅ Jian Chen ⋅ Qiang Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 704
Efficient and Training-Free Single-Image Diffusion Models
Haojun Qiu ⋅ Kiriakos N. Kutulakos ⋅ David B. Lindell