(704 events)
The 2026 schedule is still incomplete
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 1
Differentiable Laplacian Matrix Guided Superpixel Segmentation
Jeremy Juybari ⋅ Joshua Hamilton ⋅ Shuvra Das ⋅ Chaofan Chen ⋅ Andre Khalil ⋅ Yifeng Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 2
FILTR: Extracting Topological Features from Pretrained 3D Models
Louis Martinez ⋅ Maks Ovsjanikov
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 3
Learning Convex Decomposition via Feature Fields
Yuezhi Yang ⋅ Qixing Huang ⋅ Mikaela Angelina Uy ⋅ Nicholas Sharp
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 4
Learning Eigenstructures of Unstructured Data Manifolds
Roy Velich ⋅ Arkadi Piven ⋅ David Bensaid ⋅ Daniel Cremers ⋅ Thomas Dagès ⋅ Ron Kimmel
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 5
Mapping Networks
Lord Sen ⋅ Shyamapada Mukherjee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 6
CineBrain: A Large-Scale Multi-Modal Audiovisual Brain Dataset for Brain-Conditioned Video Generation
Jianxiong Gao ⋅ Yichang Liu ⋅ baofeng yang ⋅ Jianfeng Feng ⋅ Yanwei Fu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 7
Hearing the Room Through the Shape of the Drum: Modal-Guided Sound Recovery from Multi-Point Surface Vibrations
Shai Bagon ⋅ Matan Kichler ⋅ Mark Sheinin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 8
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan ⋅ Zhenbang Ren ⋅ Haodi Wu ⋅ Wenjie Wei ⋅ Rui-Jie Zhu ⋅ Shuai Wang ⋅ Dehao Zhang ⋅ Yichen Xiao ⋅ Jieyuan Zhang ⋅ Kexin Shi ⋅ Jingzhinan Wang ⋅ Jason K. Eshraghian ⋅ Haicheng Qu ⋅ Malu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 9
Thinking with Drafts: Speculative Temporal Reasoning for Efficient Long Video Understanding
Pengfei Hu ⋅ Meng Cao ⋅ Yingyao Wang ⋅ Yi Wang ⋅ Jiahua Dong ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Xiaodan Liang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 10
Wan-Weaver: Interleaved Multi-modal Generation via Decoupled Training
Jinbo Xing ⋅ Zeyinzi Jiang ⋅ Yuxiang Tuo ⋅ Chaojie Mao ⋅ Xiaotang Gai ⋅ Xi Chen ⋅ Jingfeng Zhang ⋅ Yulin Pan ⋅ Zhen Han ⋅ Jie Xiao ⋅ Keyu Yan ⋅ Chenwei Xie ⋅ Chongyang Zhong ⋅ Kai Zhu ⋅ Tong Shen ⋅ Lianghua Huang ⋅ Yu Liu ⋅ Yujiu Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 11
CURE: Curriculum-guided Multi-task Training for Reliable Anatomy Grounded Report Generation
Pablo Messina ⋅ Andrés Villa ⋅ Juan León Alcázar ⋅ Karen Sanchez ⋅ Carlos Hinojosa ⋅ Denis Parra ⋅ Alvaro Soto ⋅ Bernard Ghanem
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 12
DK-DDIL: Adaptive Knowledge Retention for Dynamic Domain-Incremental Learning in Medical Imaging
Yuxi Ma ⋅ Sujie Liu ⋅ Jing Yang ⋅ Jiacheng Wang ⋅ Yiping Chen ⋅ Baptiste Magnier ⋅ Liansheng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 13
Dual-level Adapter Boosting Prompt-free Curvilinear Structure Segmentation
Kai Zhu ⋅ Li Chen ⋅ Jun Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 14
LATA: Laplacian-Assisted Transductive Adaptation for Conformal Uncertainty in Medical VLMs
Behzad Bozorgtabar ⋅ Dwarikanath Mahapatra ⋅ Sudipta Roy ⋅ Muzammal Naseer ⋅ Imran Razzak ⋅ Zongyuan Ge
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 15
Medic-AD: Towards Medical Vision-Language Model's Clinical Intelligence
Woohyeon Park ⋅ Jaeik Kim ⋅ Sunghwan Steve Cho ⋅ Pa Hong ⋅ Wookyoung Jeong ⋅ Yoojin Nam ⋅ Namjoon Kim ⋅ Ginny Y. Wong ⋅ Ka Chun Cheung ⋅ Jaeyoung Do
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 16
SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation
Yujie Lu ⋅ Jingwen Li ⋅ Sibo Ju ⋅ Yanzhou Su ⋅ He Yao ⋅ Yisong Liu ⋅ Min Zhu ⋅ Junlong Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 17
Efficient Unrolled Networks for Large-Scale 3D Inverse Problems
Romain Vo ⋅ Julián Tachella
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 18
FedAdamom: Adaptive Momentum for Improved Generalization in Federated Optimization
Wenjie Hou ⋅ Tianxiang Chen ⋅ Feng Wang ⋅ Tiantong Wu ⋅ Zhiming Zheng ⋅ Shaoting Tang ⋅ Wei Yang Bryan Lim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 19
SimScale: Learning to Drive via Real-World Simulation at Scale
Haochen Tian ⋅ Tianyu Li ⋅ Haochen Liu ⋅ Jiazhi Yang ⋅ Yihang Qiu ⋅ Guang Li ⋅ junli wang ⋅ Yinfeng Gao ⋅ Zhang Zhang ⋅ Liang Wang ⋅ Hangjun Ye ⋅ Long Chen ⋅ Hongyang Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 20
Texvent: Asynchronous Event Data Simulation via Text Prompt
Ruofei Wang ⋅ Peiqi Duan ⋅ Ka Chun Cheung ⋅ Simon See ⋅ Boxin Shi ⋅ Renjie Wan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 21
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Ao Liang ⋅ Lingdong Kong ⋅ Tianyi Yan ⋅ Hongsi Liu ⋅ Yu Yang ⋅ Ziqi Huang ⋅ Wei Yin ⋅ Jialong Zuo ⋅ Yixuan Hu ⋅ Dekai Zhu ⋅ Dongyue Lu ⋅ Youquan Liu ⋅ Guangfeng Jiang ⋅ Linfeng Li ⋅ Xiangtai Li ⋅ Long Zhuo ⋅ Lai Xing Ng ⋅ Benoit R. Cottereau ⋅ Changxin Gao ⋅ Liang Pan ⋅ Wei Tsang Ooi ⋅ Ziwei Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 22
BuildingGPT: Auto-Regressive Building Wireframe Reconstruction Model with Reinforcement Learning
Yuzhou Liu ⋅ Lingjie Zhu ⋅ Hanqiao Ye ⋅ Yujun Liu ⋅ Shangfeng Huang ⋅ Xiang Gao ⋅ Ruisheng Wang ⋅ Shuhan Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 23
Emergent Extreme-View Geometry in 3D Foundation Models
Yiwen Zhang ⋅ Joseph Tung ⋅ Ruojin Cai ⋅ David Fouhey ⋅ Hadar Averbuch-Elor
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 24
LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
Zhijian Shu ⋅ Cheng Lin ⋅ Tao Xie ⋅ Wei Yin ⋅ Ben Li ⋅ Zhiyuan Pu ⋅ Weize Li ⋅ Yao Yao ⋅ Xun Cao ⋅ Xiaoyang Guo ⋅ Xiaoxiao Long
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 25
LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction
Tianye Ding ⋅ Yiming Xie ⋅ Yiqing Liang ⋅ Moitreya Chatterjee ⋅ Pedro Miraldo ⋅ Huaizu Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 26
PanoVGGT: Feed-Forward 3D Reconstruction from Panoramic Imagery
Yijing Guo ⋅ Mengjun Chao ⋅ Luo Wang ⋅ Tianyang Zhao ⋅ Haizhao Dai ⋅ Yingliang Zhang ⋅ Jingyi Yu ⋅ Yujiao Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 27
Rascene: High-Fidelity 3D Scene Imaging with mmWave Communication Signals
Kunzhe Song ⋅ Geo Jie Zhou ⋅ Xiaoming Liu ⋅ Huacheng Zeng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 28
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale
Sven Elflein ⋅ Ruilong Li ⋅ Sérgio Agostinho ⋅ Žan Gojčič ⋅ Laura Leal-Taixe ⋅ Qunjie Zhou ⋅ Aljoša Ošep
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 29
SEA-Flow3D: Simplified, Efficient, and Accurate Scene Flow via Spatial Vector Sampling and Multi-scale Refinement
Han Ling ⋅ Quansen Sun ⋅ Yinghua Yao ⋅ Ivor Tsang ⋅ Yinghui Sun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 30
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
Hao Li ⋅ Hao Li ⋅ Yalun Dai ⋅ Yushi Lan ⋅ Yihang Luo ⋅ Tianyu Qi ⋅ Zhengshen Zhang ⋅ Yufeng Zhan ⋅ Junfei Zhang ⋅ Wenchao Xu ⋅ Ziwei Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 31
DROID-SLAM in the Wild
Moyang Li ⋅ Zihan Zhu ⋅ Marc Pollefeys ⋅ Daniel Barath
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 32
HeSS: Head Sensitivity Score for Sparsity Redistribution in VGGT
Yongsung Kim ⋅ Wooseok Song ⋅ Jaihyun Lew ⋅ Hun Hwangbo ⋅ Jaehoon Lee ⋅ Sungroh Yoon
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 33
Dense Metric Depth Completion from Sparse Direct Time-of-Flight Sensors
Hakyeong Kim ⋅ Ruicheng Wang ⋅ Chengtang Yao ⋅ Jiaolong Yang ⋅ Min H. Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 34
Online3R: Online Learning for Consistent Sequential Reconstruction Based on Geometry Foundation Model
Shunkai Zhou ⋅ Zike Yan ⋅ fei xue ⋅ Dong Wu ⋅ Yuchen Deng ⋅ Hongbin Zha
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 35
Neu-PiG: Neural Preconditioned Grids for Fast Dynamic Surface Reconstruction on Long Sequences
Julian Kaltheuner ⋅ Hannah Dröge ⋅ Markus Plack ⋅ Patrick Stotko ⋅ Reinhard Klein
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 36
Learning 3D Reconstruction with Priors in Test Time
Lei Zhou ⋅ Haoyu Wu ⋅ Akshat Dave ⋅ Dimitris Samaras
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 37
ArchSym: Detecting 3D-Grounded Architectural Symmetries in the Wild
Hanyu Chen ⋅ Ruojin Cai ⋅ Steve Marschner ⋅ Noah Snavely
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 38
PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding
Siyuan Liu ⋅ Chaoqun Zheng ⋅ Xin Zhou ⋅ Tianrui Feng ⋅ Dingkang Liang ⋅ Xiang Bai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 39
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction
Chen Wang ⋅ Hao Tan ⋅ Wang Yifan ⋅ Zhiqin Chen ⋅ Yuheng Liu ⋅ Kalyan Sunkavalli ⋅ Sai Bi ⋅ Lingjie Liu ⋅ Yiwei Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 40
Hint2Gen: Bridging Understanding and Generation via Code-structured Hints
Yuanpeng Tu ⋅ Yunpeng Chen ⋅ Xi Chen ⋅ Liang Li ⋅ Hengshuang Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 41
Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization
Zhuohan Liu ⋅ Wujian Peng ⋅ Yitong Chen ⋅ Zuxuan Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 42
Learning by Analogy: A Causal Framework for Compositional Generalization
Lingjing Kong ⋅ Shaoan Xie ⋅ Yang Jiao ⋅ Yetian Chen ⋅ Yanhui Guo ⋅ Simone Shao ⋅ Yan Gao ⋅ Guangyi Chen ⋅ Kun Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 43
ID-Crafter: VLM-Grounded Online RL for Compositional Multi-Subject Video Generation
Panwang Pan ⋅ Jingjing Zhao ⋅ Yuchen Lin ⋅ Chenguo Lin ⋅ Chenxin Li ⋅ Hengyu Liu ⋅ Tingting Shen ⋅ Yadong Mu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 44
GenColorBench: A Color Evaluation Benchmark for Text-to-Image Generation
Muhammad Atif Butt ⋅ Alexandra Gomez-Villa ⋅ Tao Wu ⋅ Javier Vazquez-Corral ⋅ Joost van de Weijer ⋅ Kai Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 45
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
Chenxi Zhao ⋅ Chen Zhu ⋅ Xiaokun Feng ⋅ Aiming Hao ⋅ Jiashu Zhu ⋅ Jiachen Lei ⋅ Jiahong Wu ⋅ Xiangxiang Chu ⋅ Jufeng Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 46
When Pretty Isn’t Useful: Investigating Why Modern Text-to-Image Models Fail as Reliable Training Data Generators
Krzysztof Adamkiewicz ⋅ Brian B. Moser ⋅ Stanislav Frolov ⋅ Tobias Christian Nauen ⋅ Federico Raue ⋅ Andreas Dengel
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 47
TempoControl: Temporal Attention Guidance for Text-to-Video Models
Shira Schiber ⋅ Ofir Lindenbaum ⋅ Idan Schwartz
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 48
Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
Junwon Lee ⋅ Juhan Nam ⋅ Jiyoung Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 49
MultiCrafter: High-Fidelity Multi-Subject Generation via Disentangled Attention and Identity-Aware Preference Alignment
Tao Wu ⋅ Yibo Jiang ⋅ Yehao Lu ⋅ Zhizhong Wang ⋅ Zeyi Huang ⋅ Zequn Qin ⋅ Xi Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 50
Resolving the Identity Crisis in Text-to-Image Generation
Shubhankar Borse ⋅ Farzad Farhadzadeh ⋅ Munawar Hayat ⋅ Fatih Porikli
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 51
DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation
Zhuoling Li ⋅ Hossein Rahmani ⋅ Jiarui Zhang ⋅ Yu Xue ⋅ Majid Mirmehdi ⋅ Jason Kuen ⋅ Jiuxiang Gu ⋅ Jun Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 52
Gloria: Consistent Character Video Generation via Content Anchors
Yuhang Yang ⋅ Fan Zhang ⋅ Huaijin Pi ⋅ Ailing Zeng ⋅ Shuai Guo ⋅ Guowei Xu ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 53
DreamShot: Personalized Storyboard Synthesis with Video Diffusion Prior
Junjia Huang ⋅ Binbin Yang ⋅ Pengxiang Yan ⋅ Jiyang Liu ⋅ Bin Xia ⋅ Zhao Wang ⋅ Yitong Wang ⋅ Liang Lin ⋅ Guanbin Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 54
M4V: Multimodal Mamba for Efficient Text-to-Video Generation
Jiancheng Huang ⋅ Gengwei Zhang ⋅ Zequn Jie ⋅ Siyu Jiao ⋅ Yinlong Qian ⋅ Ling Chen ⋅ Yunchao Wei ⋅ Lin Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 55
Property-Informed Diffusion-Based Text-to-Microstructure Generation
Bingxuan Dai ⋅ Hongsong Wang ⋅ Jie Gui
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 56
DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
Patrick Kwon ⋅ Chen Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 57
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
Haozhe Liu ⋅ Ding Liu ⋅ Mingchen Zhuge ⋅ Zijian Zhou ⋅ Tian Xie ⋅ Sen He ⋅ Yukang Yang ⋅ Shuming Liu ⋅ Yuren Cong ⋅ Jiadong Guo ⋅ Hongyu Xu ⋅ Ke Xu ⋅ Kam-Woh Ng ⋅ Juan C. Perez ⋅ Juan-Manuel Pérez-Rúa ⋅ Tao Xiang ⋅ Wei Liu ⋅ Shikun Liu ⋅ Jürgen Schmidhuber
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 58
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning
Hongji Yang ⋅ Yucheng Zhou ⋅ Wencheng Han ⋅ Runzhou Tao ⋅ Zhongying Qiu ⋅ Jianfei Yang ⋅ Jianbing Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 59
TherA: Thermal-Aware Visual-Language Prompting for Controllable RGB-to-Thermal Infrared Translation
Dong-Guw Lee ⋅ Tai Hyoung Rhee ⋅ Hyunsoo Jang ⋅ Young-Sik Shin ⋅ Ukcheol Shin ⋅ Ayoung Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 60
See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding
Bo-Yuan Sun ⋅ Bowen Yin ⋅ Yuanming Li ⋅ Xihan Wei ⋅ Qibin Hou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 61
CoV-Align: Efficient Fine-grained Cross-Modal Alignment with Cohesive Visual Semantics Priority
Hengqi Liu ⋅ Wanting Zhou ⋅ Longteng Kong ⋅ Fangxiang Feng ⋅ Lei Ren ⋅ Wei Chen ⋅ Xiaojie Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 62
TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment
Qin Chunxia ⋅ Chenyu Liu ⋅ Pengcheng Xia ⋅ Jun Du ⋅ Baocai Yin ⋅ Bing Yin ⋅ Cong Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 63
A Mixed Diet Makes DINO An Omnivorous Vision Encoder
Rishabh Kabra ⋅ Maks Ovsjanikov ⋅ Drew A Hudson ⋅ Ye Xia ⋅ Skanda Koppula ⋅ André Araujo ⋅ Joao Carreira ⋅ Niloy J. Mitra
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 64
Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models
Hayeon Kim ⋅ Ji Ha Jang ⋅ Junghun James Kim ⋅ Se Young Chun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 65
TaskForce: Cooperative Multi-agent Reinforcement Learning for Multi-task Optimization
Wonhyeok Choi ⋅ Kyumin Hwang ⋅ Jihun Park ⋅ Kyoungmin Lee ⋅ Seunghun Lee ⋅ Jaeyeul Kim ⋅ Minwoo Choi ⋅ Sunghoon Im
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 66
PhyCritic: Multimodal Critic Models for Physical AI
Tianyi Xiong ⋅ Shihao Wang ⋅ Guilin Liu ⋅ Yi Dong ⋅ Ming Li ⋅ Heng Huang ⋅ Jan Kautz ⋅ Zhiding Yu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 67
R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning
Zirui Zhang ⋅ Haoyu Dong ⋅ Kexin Pei ⋅ Chengzhi Mao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 68
Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image
Yushi Hu ⋅ Reyhane Askari ⋅ Melissa Hall ⋅ Emily Dinan ⋅ Luke Zettlemoyer ⋅ Marjan Ghazvininejad
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 69
Unified Generation and Self-Verification for Vision-Language Models via Advantage Decoupled Preference Optimization
Xinyu Qiu ⋅ Heng Jia ⋅ Zhengwen Zeng ⋅ Shuheng Shen ⋅ Changhua Meng ⋅ Yi Yang ⋅ Linchao Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 70
Anchoring the Mind of Multimodal Reasoners: Cognitive Bias as a Vector for Jailbreak Attacks
Linhua Cong ⋅ Bingrui Sima ⋅ Kun He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 71
InsCal: Calibrated Multi-Source Fully Test-Time Prompt Tuning for Object Detection
Xiaofan Que ⋅ Dingrong Wang ⋅ Xumin Liu ⋅ Qi Yu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 72
Why Not Hyperparameter-Friendly Optimisation? A Monotonic Adaptive Norm Rescaling Approach For Long-Tailed Recognition
Shuo Zhang ⋅ Chenqi Li ⋅ Tingting Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 73
Decoupling Vision and Language: Codebook Anchored Visual Adaptation
Jason Wu ⋅ Tianchen Zhao ⋅ Chang Liu ⋅ Jiarui Cai ⋅ Zheng Zhang ⋅ Zhuowei Li ⋅ Aaditya Singh ⋅ Xiang Xu ⋅ Mani Srivastava ⋅ Jonathan Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 74
MemFlow: A Lightweight Forward Memorizing Framework for Quick Domain Adaptive Feature Mapping
Jianming Lv ⋅ Chengjun Wang ⋅ Depin Liang ⋅ Qianli Ma ⋅ Wei Chen ⋅ Xueqi Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 75
Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learning
ZHENYU ZHANG ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li ⋅ Guangyao Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 76
Vision-Language Model Guided Source-Free Domain Adaptation via Optimal Transport
Shuo Han ⋅ Xu Tang ⋅ Jingjing Ma ⋅ Xiangrong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 77
Masked Representation Modeling for Domain-Adaptive Segmentation
Wenlve Zhou ⋅ Zhiheng Zhou ⋅ Tiantao Xian ⋅ Yikui Zhai ⋅ Weibin Wu ⋅ Biyun MA
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 78
TaskIT: Memory-Efficient Fine-Tuning of Multi-LoRA LLMs via Cross-Task Importance Transfer
Cheng Fang ⋅ Zimu Zhou ⋅ Ke Ma ⋅ Bin Guo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 79
ARES: Unifying Asymmetric RGB-Event Stereo for Probabilistic Scene Flow Estimation
Jie Long Lee ⋅ Gim Hee Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 80
MER-Tracker: Towards High-Speed 3D Point Tracking via Multi-View Event-RGB Hybrid Cameras
Yiqian Chang ⋅ Qinghong Ye ⋅ Haoran Xu ⋅ Jianing Li ⋅ Dongyang Ma ⋅ Xuan Wang ⋅ Wei Zhang ⋅ Yonghong Tian ⋅ Peixi Peng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 81
Moving Border Ownership for Event-based Motion Segmentation
Zhiyuan Hua ⋅ Cornelia Fermuller ⋅ Yiannis Aloimonos
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 82
TTAPFormer: Robust Arbitrary Point Tracking via Transient Asynchronous Fusion of Frames and Events
Jiaxiong Liu ⋅ Zhen Tan ⋅ Jinpu Zhang ⋅ Yi Zhou ⋅ Hui Shen ⋅ Xieyuanli Chen ⋅ Dewen Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 83
EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors
Luca Bartolomei ⋅ Fabio Tosi ⋅ Matteo Poggi ⋅ Stefano Mattoccia ⋅ Guillermo Gallego
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 84
Seeing Motion Through Polarity for Event-based Action Recognition
Meiqi Cao ⋅ Jiachao Zhang ⋅ Xin Jiang ⋅ Rui Yan ⋅ Yazhou Yao ⋅ Zechao Li ⋅ Xiangbo Shu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 85
Multi-Scale Gaussian-Language Map for Zero-shot Embodied Navigation and Reasoning
Sixian Zhang ⋅ Yiyao Wang ⋅ Xinhang Song ⋅ Keming Zhang ⋅ Zijian Xu ⋅ Shuqiang Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 86
Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration
sen wang ⋅ Bangwei Liu ⋅ Zhenkun Gao ⋅ Lizhuang Ma ⋅ Xuhong Wang ⋅ Yuan Xie ⋅ Xin Tan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 87
SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL
Siyi Chen ⋅ Mikaela Angelina Uy ⋅ Chan Hee Song ⋅ Faisal Ladhak ⋅ Adithya Murali ⋅ Qing Qu ⋅ Stan Birchfield ⋅ Valts Blukis ⋅ Jonathan Tremblay
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 88
TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size
Stefan Lionar ⋅ Gim Hee Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 89
AREA3D: Active Reconstruction Agent with Unified Feed-Forward 3D Perception and Vision-Language Guidance
Tianling Xu ⋅ Shengzhe GAN ⋅ Leslie Gu ⋅ Yuelei Li ⋅ Fangneng Zhan ⋅ Hanspeter Pfister
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 90
Experience Transfer for Multimodal LLM Agents in Minecraft Game
Chenghao Li ⋅ Jun Liu ⋅ Songbo Zhang ⋅ HuaDong Jian ⋅ Hao Ni ⋅ LIK-HANG LEE ⋅ SUNG BAE BAE ⋅ Guoqing Wang ⋅ Yang Yang ⋅ Chaoning Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 91
MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
Xun Huang ⋅ Shijia Zhao ⋅ Yunxiang Wang ⋅ Xin Lu ⋅ Wanfa Zhang ⋅ Rongsheng Qu ⋅ Weixin Li ⋅ Yunhong Wang ⋅ Chenglu Wen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 92
SaPaVe: Towards Active Perception and Manipulation in Vision-Language Action Models for Robotics
Mengzhen Liu ⋅ Enshen Zhou ⋅ Cheng Chi ⋅ Yi Han ⋅ Shanyu Rong ⋅ Liming Chen ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 93
MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks
Lirong Che ⋅ Shuo Wen ⋅ Huang Shan ⋅ wang chuang ⋅ yuzhe yang ⋅ Gregory Dudek ⋅ Chuang Wang ⋅ Jian Su
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 94
RealAppliance: Let High-fidelity Appliance Assets Controllable and Workable as Aligned Real Manuals
Yuzheng Gao ⋅ Yuxing Long ⋅ Lei Kang ⋅ Yuchong Guo ⋅ Ziyan Yu ⋅ Shangqing Mao ⋅ Jiyao Zhang ⋅ Ruihai Wu ⋅ Dongjiang Li ⋅ Hui Shen ⋅ Hao Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 95
ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
Zhuoyang Zhang ⋅ Shang Yang ⋅ Qinghao Hu ⋅ Luke J. Huang ⋅ James Hou ⋅ Yufei Sun ⋅ Yao Lu ⋅ Song Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 96
Affordance Field Intervention: Enabling VLAs to Escape Memory Traps in Robotic Manipulation
Siyu Xu ⋅ Zijian Wang ⋅ Yunke Wang ⋅ Chenghao Xia ⋅ Tao Huang ⋅ Chang Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 97
MERIT: Multi-domain Efficient RAW Image Translation
Wenjun Huang ⋅ Shenghao Fu ⋅ Yian Jin ⋅ Yang Ni ⋅ Ziteng Cui ⋅ Hanning Chen ⋅ Yirui He ⋅ Yezi Liu ⋅ Sanggeon Yun ⋅ SungHeon Jeong ⋅ Ryozo Masukawa ⋅ William Youngwoo Chung ⋅ Mohsen Imani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 98
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
Yusu Qian ⋅ Eli Bocek-Rivele ⋅ Liangchen Song ⋅ Jialing Tong ⋅ Yinfei Yang ⋅ Jiasen Lu ⋅ Wenze Hu ⋅ Zhe Gan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 99
Probabilistic Prompt Adaptation for Unified Image Aesthetics and Quality Assessment
Takayuki Hara ⋅ Yuya Otsuka
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 100
EMMA: Concept Erasure Benchmark with Comprehensive Semantic Metrics and Diverse Categories
Lu Wei ⋅ Yuta Nakashima ⋅ Noa Garcia
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 101
Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity
Zhengyao Fang ⋅ Zexi Jia ⋅ Yijia Zhong ⋅ Pengcheng Luo ⋅ Jinchao Zhang ⋅ Guangming Lu ⋅ Jun Yu ⋅ Wenjie Pei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 102
WiseEdit: Benchmarking Cognition- and Creativity-Informed Image Editing
Kaihang Pan ⋅ Weile Chen ⋅ Haiyi Qiu ⋅ Qifan Yu ⋅ Wendong Bu ⋅ zehan wang ⋅ Yun Zhu ⋅ Juncheng Li ⋅ Siliang Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 103
UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits
Keming Ye ⋅ Zhipeng Huang ⋅ Canmiao Fu ⋅ Qingyang Liu ⋅ Jiani Cai ⋅ Zheqi Lv ⋅ Chen Li ⋅ Jing LYU ⋅ Zhou Zhao ⋅ Shengyu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 104
Inter-Edit: First Benchmark for Interactive Instruction-Based Image Editing
Delong Liu ⋅ Haotian Hou ⋅ Zhaohui Hou ⋅ Zhiyuan Huang ⋅ Shihao Han ⋅ Mingjie Zhan ⋅ Zhicheng Zhao ⋅ Fei Su
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 105
PR-IQA: Partial-Reference Image Quality Assessment for Diffusion-Based Novel View Synthesis
Inseong Choi ⋅ Siwoo Lee ⋅ Seung-Hun Nam ⋅ Soohwan Song
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 106
LumiMotion: Improving Gaussian Relighting with Scene Dynamics
Joanna Kaleta ⋅ Piotr Wójcik ⋅ Kacper Marzol ⋅ Tomasz Trzciński ⋅ Kacper Kania ⋅ Marek Kowalski
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 107
Let it Snow! Animating 3D Gaussian Scenes with Dynamic Weather Effects via Physics-Guided Score Distillation
Gal Fiebelman ⋅ Hadar Averbuch-Elor ⋅ Sagie Benaim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 108
iLRM: An Iterative Large 3D Reconstruction Model
Gyeongjin Kang ⋅ Seungtae Nam ⋅ Seung kwon Yang ⋅ Xiangyu Sun ⋅ Sameh Khamis ⋅ Abdelrahman Mohamed ⋅ Eunbyung Park
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 109
MVInverse: Feed-forward Multiview Inverse Rendering in Seconds
Xiangzuo Wu ⋅ Chengwei Ren ⋅ Jun Zhou ⋅ Xiu Li ⋅ Yuan Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 110
From None to All: Self-Supervised 3D Reconstruction via Novel View Synthesis
Ranran Huang ⋅ Weixun Luo ⋅ Ye Mao ⋅ Krystian Mikolajczyk
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 111
MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification
Sangwoon Kwak ⋅ WEEYOUN KWON ⋅ Jun Young Jeong ⋅ Geonho Kim ⋅ Won-Sik Cheong ⋅ Jihyong Oh
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 112
Multi-view Pyramid Transformer: Look Coarser to See Broader
Gyeongjin Kang ⋅ Seung kwon Yang ⋅ Seungtae Nam ⋅ Younggeun Lee ⋅ Jungwoo Kim ⋅ Eunbyung Park
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 113
CaT-GS: Efficient 3DGS Rendering for Large Scale Scenes via Inter-frame Caching and Tile Scheduling
TingJia Zhang ⋅ Bo Chen ⋅ Shengzhong Liu ⋅ Fan Wu ⋅ Guihai Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 114
RL-ScanIQA: Reinforcement-Learned Scanpaths for Blind 360° Image Quality Assessment
yujia wang ⋅ Yuyan Li ⋅ Jiuming Liu ⋅ Fang-Lue Zhang ⋅ Xinhu Zheng ⋅ Neil A. Dodgson
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 115
Benchmarking Endoscopic Surgical Image Restoration and Beyond
Jialun Pei ⋅ Diandian Guo ⋅ Donghui Yang ⋅ Zhixi Li ⋅ Yuxin Feng ⋅ Long Ma ⋅ Bo Du ⋅ Pheng-Ann Heng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 116
SDUIE: Semi-Supervised Diffusion for Underwater Image Enhancement with Quant-Text Dual Control
Xiaofeng Cong ⋅ Yu-Xin Zhang ⋅ Hao Shen ⋅ Yeying Jin ⋅ Junming Hou ⋅ Jie Gui
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 117
HiDRA: Hierarchical Degradation Representation and Adaptation with Generative Priors for Enhancing Infrared Vision
Zihang Chen ⋅ Zhu Liu ⋅ Changbo Yan ⋅ Jinyuan Liu ⋅ Risheng Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 118
BluRef: Unsupervised Image Deblurring with Dense-Matching References
Bang-Dang Pham ⋅ Anh Tran ⋅ Cuong Pham ⋅ Minh Nguyen Nguyen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 119
Bi-Bridge: Bidirectional Diffusion Bridges for Low-Light Image Enhancement
Zeyu Hua ⋅ HUI LI ⋅ Yu Wang ⋅ Song Wang ⋅ Congchao Zhu ⋅ Caixia Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 120
UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration
Zihan Cheng ⋅ Liangtai Zhou ⋅ Dian Chen ⋅ Ni Tang ⋅ Xiaotong Luo ⋅ Yuan Xie ⋅ Yanyun Qu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 121
MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator
Peiqing Yang ⋅ Shangchen Zhou ⋅ Kai Hao ⋅ Qingyi Tao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 122
SelfHVD: Self-Supervised Handheld Video Deblurring
Honglei Xu ⋅ Zhilu Zhang ⋅ Junjie Fan ⋅ Xiaohe Wu ⋅ Wangmeng Zuo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 123
Spatio-Temporal Difference Guided Motion Deblurring with the Complementary Vision Sensor
Yapeng Meng ⋅ Lin Yang ⋅ Yuguo Chen ⋅ Xiangru Chen ⋅ Taoyi Wang ⋅ Lijian Wang ⋅ Zheyu Yang ⋅ Yihan Lin ⋅ Rong Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 124
Learning Where to Look and How to Judge: Resolution-agnostic Image Quality Assessment with Quality-aware Saliency
Hakan Emre Gedik ⋅ Shashank Gupta ⋅ Alan C.
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 125
Bridging RGB and Hematoxylin Components: An Interleaved Guidance and Fusion Framework for Point Supervised Nuclei Segmentation
Zihan Huan ⋅ Xipeng Pan ⋅ Hualong Zhang ⋅ Siyang Feng ⋅ Rushi Lan ⋅ Huadeng Wang ⋅ Haoxiang Lu ⋅ Zhenbing Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 126
Virtual Nodes Guided Dynamic Graph Neural Network for Brain Tumor Segmentation with Missing Modalities
Sha Tao ⋅ Jiao PAN ⋅ Yu Guo ⋅ Chao Yao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 127
VoxTell: Free-Text Promptable Universal 3D Medical Image Segmentation
Maximilian Rokuss ⋅ Moritz Langenberg ⋅ Yannick Kirchhoff ⋅ Fabian Isensee ⋅ Benjamin Hamm ⋅ Constantin Ulrich ⋅ Sebastian Regnery ⋅ Lukas Bauer ⋅ Efthimios Katsigiannopulos ⋅ Tobias Norajitra ⋅ Klaus Maier-Hein
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 128
Photo-Guided Tooth Segmentation on 3D Oral Scan Model
Shaojie Zhuang ⋅ Guangshun Wei ⋅ Jiangxin He ⋅ Yuanfeng Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 129
Breaking the Continuum: Discrete Distribution Learning for Structural MRI Reconstruction
Tianle Lyu ⋅ Mengjingcheng Mo ⋅ Ting Wen ⋅ Zhen Song ⋅ Zinan Xiong ⋅ Yanjie Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 130
Uni-Hema: Unified Model for Digital Hematopathology
Abdul Rehman ⋅ Iqra Rasool ⋅ Ayisha Imran ⋅ Mohsen Ali ⋅ Waqas Sultani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 131
Post-training Feature Pruning for Fundus Images Classification
Van-Nguyen Pham ⋅ Duc-Tai Le ⋅ Junghyun Bum ⋅ Hyunseung Choo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 132
Sketch2CT: Multimodal Diffusion for Structure-Aware 3D Medical Volume Generation
Delin An ⋅ Chaoli Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 133
SafeLogo: Turning Your Logos into Jailbreak Shields via Micro-Regional Adversarial Training
Zhiyi Duan ⋅ Xiaoyue Zhang ⋅ Tianxing Man
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 134
Anti-I2V: Safeguarding your Photos from Malicious Image-to-video Generation
Hong Duc Vu ⋅ Anh Nguyen ⋅ Chi Tran ⋅ Anh Tran
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 135
UniGame: Turning a Unified Multimodal Model Into Its Own Adversary
Zhaolong Su ⋅ Wang Lu ⋅ Hao Chen ⋅ Yixuan Li ⋅ Jindong Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 136
Hierarchically Robust Zero-shot Vision-language Models
Junhao Dong ⋅ Yifei Zhang ⋅ Hao Zhu ⋅ Yew-Soon Ong ⋅ Piotr Koniusz
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 137
Beyond Text Prompts: Precise Concept Erasure through Text–Image Collaboration
Jun Li ⋅ Lizhi Xiong ⋅ Ziqiang Li ⋅ Weiwei Jiang ⋅ Zhangjie Fu ⋅ Yong Li ⋅ Guo-Sen Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 138
AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions
Zonghao Ying ⋅ Le Wang ⋅ Yisong Xiao ⋅ Jiakai Wang ⋅ Yuqing Ma ⋅ Jinyang Guo ⋅ Zhenfei Yin ⋅ Mingchuan Zhang ⋅ Aishan Liu ⋅ Xianglong Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 139
ReMoE: Region-Mixture Experts for Adversarially-Robust Vision Transformers
Qinghao Zhong ⋅ Bingzhi Chen ⋅ Yishu Liu ⋅ Minhua Lu ⋅ Guangming Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 140
TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration
Chunxiao Li ⋅ Lijun Li ⋅ Jing Shao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 141
SO-Bench: A Structural Output Evaluation of Multimodal LLM
Di Feng ⋅ Kaixin Ma ⋅ Feng Nan ⋅ Haofeng Chen ⋅ Bohan Zhai ⋅ David Griffiths ⋅ Mingfei Gao ⋅ Zhe Gan ⋅ Eshan Verma ⋅ Yinfei Yang ⋅ Zhifeng Chen ⋅ Afshin Dehghan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 142
Chain-of-Thought Guided Multi-Modal Object Re-Identification
Ya Gao ⋅ Shihao Li ⋅ ZhaoJun Liu ⋅ AIHUA ZHENG ⋅ Chenglong Li ⋅ Jin Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 143
When Lines Meet Textures: Spatial-Frequency Aligned Diffusion Features for Cross-Sparsity Correspondence
Mingrui Zhu ⋅ Fengzhi Wang ⋅ Xin Wei ⋅ Jun Wang ⋅ Nannan Wang ⋅ Xinbo Gao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 144
CountGD++: Generalized Prompting for Open-World Counting
Niki Amini-Naieni ⋅ Andrew Zisserman
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 145
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
Yuxin Guo ⋅ Teng Wang ⋅ Yuying Ge ⋅ Shijie Ma ⋅ Yixiao Ge ⋅ Wei Zou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 146
Parameter-Efficient Adaptation for MLLMs via Implicit Modality Decomposition
Mingfang Zhang ⋅ Yunhong Wang ⋅ Lu Wang ⋅ Jiaxin Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 147
Hyperbolic Gramian Volumes for Multimodal Alignment
Saiyang Na ⋅ Feng Jiang ⋅ Qifeng Zhou ⋅ Wenliang Zhong ⋅ Thao M. Dang ⋅ Yuzhi Guo ⋅ Hehuan Ma ⋅ Chunyuan Li ⋅ Weizhi An ⋅ Junzhou Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 148
Venus: Benchmarking and Empowering Multimodal Large Language Models for Aesthetic Guidance and Cropping
Tianxiang Du ⋅ Hulingxiao He ⋅ Yuxin Peng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 149
AutoCut: End-to-end advertisement video editing based on multimodal discretization and controllable generation
Milton Zhou ⋅ Sizhong Qin ⋅ Yongzhi Li ⋅ Quan Chen ⋅ Peng Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 150
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
Anh Quan Cao ⋅ Ivan Lopes ⋅ Raoul de Charette
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 151
CaReFlow: Cyclic Adaptive Rectified Flow for Multimodal Fusion
Sijie Mai ⋅ Shiqin Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 152
Lenses: Toward Polysemous Vision–Language Understanding
Hani Alomari ⋅ Ali Asgarov ⋅ Chris Thomas
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 153
CoRiM: Conflict-driven Risk Minimization for Dynamic Multimodal Fusion
shihao Zou ⋅ Wei Wei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 154
Uncertainty-Aware Exploratory Direct Preference Optimization for Multimodal Large Language Models
Huatian Zhang ⋅ Zhendong Mao ⋅ Lei Zhang ⋅ Yongdong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 155
CICA: Coupling Confidence-Aware Pretraining with Confidence-Informed Attention for Robust Multimodal Sentiment Analysis
Haoyu Jiang ⋅ Xiaoliang Chen ⋅ Duoqian Miao ⋅ Xiaolin Qin ⋅ Xianyong Li ⋅ Yajun Du
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 156
SAMTok: Representing Any Mask with Two Words
yikang zhou ⋅ Tao Zhang ⋅ Dengxian Gong ⋅ Yuanzheng Wu ⋅ Ye Tian ⋅ Haochen Wang ⋅ Haobo Yuan ⋅ Jiacong Wang ⋅ Lu Qi ⋅ Hao Fei ⋅ Shunping Ji ⋅ Anran Wang ⋅ Zhuochen Wang ⋅ Yujing Wang ⋅ Cheng CHEN ⋅ Xiangtai Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 157
Multi-Metric Representation Learning Strategy Based on Clustering for Fine-Grained Multimodal Sentiment Analysis
Yidan Wang ⋅ Zongheng Wang ⋅ Hongjie Xing ⋅ Chunguo Li ⋅ Xiaoxiao Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 158
Cinematic Audio Source Separation Using Visual Cues
Kang Zhang ⋅ Suyeon Lee ⋅ Arda Senocak ⋅ Joon Chung
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 159
MMSD3.0: A Multi-Image Benchmark for Real-World Multimodal Sarcasm Detection
HAOCHEN ZHAO ⋅ Yuyao Kong ⋅ Yongxiu Xu ⋅ Gaopeng Gou ⋅ Hongbo Xu ⋅ Yubin Wang ⋅ Haoliang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 160
Anchor-Guided Gradient Alignment for Incomplete Multimodal Learning
Zhi-Hao Guan ⋅ Longfei Huang ⋅ Yang Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 161
PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation
Onkar Susladkar ⋅ Tushar Prakash ⋅ Adheesh Juvekar ⋅ Kiet A. Nguyen ⋅ Dong-Hwan Jang ⋅ Inderjit S Dhillon ⋅ Ismini Lourentzou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 162
VDE: Training-Free Accelerating Rectified Flow Model via Velocity Decomposition and Estimation
Junwen Tan ⋅ Jinglin Liang ⋅ Hongyuan Chen ⋅ Shuangping Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 163
Kontinuous Kontext: Continuous Strength Control for Instruction-based Image Editing
Rishubh Parihar ⋅ Or Patashnik ⋅ Daniil Ostashev ⋅ R. Venkatesh Babu ⋅ Daniel Cohen-Or ⋅ Kuan-Chieh Jackson Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 164
VideoCoF: Unified Video Editing with Temporal Reasoner
xiangpeng yang ⋅ Ji Xie ⋅ Yiyuan Yang ⋅ Yue Ma ⋅ Yan Huang ⋅ Min Xu ⋅ Qiang Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 165
Progressive Supernet Training for Efficient Visual Autoregressive Modeling
Xiaoyue Chen ⋅ Yuling Shi ⋅ kaiyuan Li ⋅ Huandong Wang ⋅ Yong Li ⋅ Xiaodong Gu ⋅ Xinlei Chen ⋅ Mingbao Lin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 166
CoT-Edit: Let CoT Guide Instruction Video Editing
Sen Liang ⋅ Fengbin Guan ⋅ Youliang Zhang ⋅ Xin Li ⋅ Zhibo Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 167
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Qingyan Bai ⋅ Qiuyu Wang ⋅ Hao Ouyang ⋅ Yue Yu ⋅ Hanlin Wang ⋅ Wen Wang ⋅ Ka Leong Cheng ⋅ Shuailei Ma ⋅ Yanhong Zeng ⋅ Zichen Liu ⋅ Yinghao Xu ⋅ Yujun Shen ⋅ Qifeng Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 168
Test-Time Instance-Specific Parameter Composition: A New Paradigm for Adaptive Generative Modeling
Minh-Tuan Tran ⋅ Xuan-May Le ⋅ Quan Hung Tran ⋅ Mehrtash Harandi ⋅ Dinh Phung ⋅ Trung Le
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 169
Understanding, Accelerating, and Improving MeanFlow Training
Jin-Young Kim ⋅ Hyojun Go ⋅ Lea Bogensperger ⋅ Julius Erbach ⋅ Nikolai Kalischek ⋅ Federico Tombari ⋅ Konrad Schindler ⋅ Dominik Narnhofer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 170
Meta-CoT: Enhancing Granularity and Generalization in Image Editing
Shiyi Zhang ⋅ YIJI CHENG ⋅ Tiankai Hang ⋅ Zijin Yin ⋅ Runze He ⋅ Yu Xu ⋅ Wenxun Dai ⋅ yunlong lin ⋅ Chunyu Wang ⋅ qinglin lu ⋅ Yansong Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 171
Dual-Granularity Memory for Efficient Video Generation
Hongjun Wang ⋅ Lin Liu ⋅ Jianguo Li ⋅ Tao Lin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 172
Unified Camera Positional Encoding for Controlled Video Generation
Cheng Zhang ⋅ Boying Li ⋅ Meng Wei ⋅ Yan-Pei Cao ⋅ Camilo Cruz Gambardella ⋅ Dinh Phung ⋅ Jianfei Cai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 173
EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing
Wei Chow ⋅ Linfeng Li ⋅ Lingdong Kong ⋅ Zefeng Li ⋅ Qi Xu ⋅ Hang Song ⋅ Tian Ye ⋅ Xian Wang ⋅ Jinbin Bai ⋅ Shilin Xu ⋅ Xiangtai Li ⋅ Junting Pan ⋅ Shaoteng Liu ⋅ Ran Zhou ⋅ Tianshu Yang ⋅ Songhua Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 174
MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene
wenjie mu ⋅ Zhan Li ⋅ Chuanzhou su ⋅ XUANYI SHEN ⋅ Ziniu Liu ⋅ Fan Lu ⋅ Yujian Mo ⋅ Junqiao Zhao ⋅ Tiantian Feng ⋅ chen ye ⋅ Guang Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 175
PLACID: Identity-Preserving Multi-Object Compositing via Video Diffusion with Synthetic Trajectories
Gemma Canet Tarrés ⋅ Manel Baradad ⋅ Francesc Moreno-Noguer ⋅ Yumeng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 176
Object-WIPER: Training-Free Object and Associated Effect Removal in Videos
Saksham Singh Kushwaha ⋅ Sayan Nag ⋅ Yapeng Tian ⋅ Kuldeep Kulkarni
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 177
Mobile-VTON: High-Fidelity On-Device Virtual Try-On
Zhenchen Wan ⋅ Ce Chen ⋅ Runqi Lin ⋅ Jiaxin Huang ⋅ Tianxi Chen ⋅ Yanwu Xu ⋅ Tongliang Liu ⋅ Mingming Gong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 178
Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
Joonhyung Park ⋅ Hyeongwon Jang ⋅ Joowon Kim ⋅ Eunho Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 179
Towards Robust Sequential Decomposition for Complex Image Editing
Zilai Zeng ⋅ Mingdeng Cao ⋅ Zijie Li ⋅ Xiaochen Lian ⋅ Yichun Shi ⋅ Peihao Zhu ⋅ Chen Sun ⋅ Peng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 180
Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection
Yawen Yang ⋅ Feng Li ⋅ Shuqi Kong ⋅ Yunfeng Diao ⋅ Xinjian Gao ⋅ Zenglin Shi ⋅ Meng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 181
Chain of Event-Centric Causal Thought for Physically Plausible Video Generation
Zixuan Wang ⋅ Yixin Hu ⋅ Haolan Wang ⋅ Feng Chen ⋅ Yan Liu ⋅ Wen Li ⋅ Yinjie Lei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 182
LoL: Longer than Longer, Scaling Video Generation to Hour
Jiaxing Cui ⋅ Jie Wu ⋅ Ming Li ⋅ Tao Yang ⋅ Xiaojie Li ⋅ Rui Wang ⋅ Andrew Bai ⋅ Yuanhao Ban ⋅ Cho-Jui Hsieh
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 183
FlowMotion: Training-Free Flow Guidance for Video Motion Transfer
Zhen Wang ⋅ Youcan Xu ⋅ Jun Xiao ⋅ Long Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 184
Learning Straight Flows: Variational Flow Matching for Efficient Generation
Chenrui Ma ⋅ Xi Xiao ⋅ Tianyang Wang ⋅ Xiao Wang ⋅ Yanning Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 185
SIGMA: Selective-Interleaved Generation with Multi-Attribute Tokens
Xiaoyan Zhang ⋅ Zechen Bai ⋅ Haofan Wang ⋅ Yiren Song
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 186
DNF-SR: Dual-Input and Negative-Aware Feature Fine-Tuning for Real-World Image Super-Resolution
Shuhao Han ⋅ Wenjie Liao ⋅ Hayden Vance ⋅ Hang Dong ⋅ Rui Zhang ⋅ Chunle Guo ⋅ Chongyi Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 187
IFCSR: Inference-Free Fidelity-Realism Control for One-Step Diffusion-based Real-World Image Super-Resolution
Jonghee Back ⋅ Jongju Kim ⋅ Jeong-Uk Kim ⋅ Eunjin Kim ⋅ Minyong Jeon
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 188
Edge-Focused Super-Resolution for Omnidirectional Images with Spherical Geometric Augmentation
Shaolin Wang ⋅ Yuying Li ⋅ Lei Zhong ⋅ Shigang Li ⋅ Jianfeng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 189
TUDSR: Twice Upsampling-Diffusion for Higher Super-Resolution
Zhiqiang Wu ⋅ Yitong Dong ⋅ Xian Wei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 190
PS-SR: Pseudo-Single-Step Video Super-Resolution via Speculative Diffusion
Aiqiu Wu ⋅ Zhaofan Qiu ⋅ Ting Yao ⋅ Tao Mei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 191
Disentangled Textual Priors for Diffusion-based Image Super-Resolution
Lei Jiang ⋅ Xin Liu ⋅ Xinze Tong ⋅ Zhiliang Li ⋅ Jie Liu ⋅ Jie Tang ⋅ Gangshan Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 192
Remote Sensing Image Super-Resolution for Imbalanced Textures: A Texture-Aware Diffusion Framework
Enzhuo Zhang ⋅ Sijie Zhao ⋅ Dilxat Muhtar ⋅ Zhenshi Li ⋅ Xueliang Zhang ⋅ Pengfeng Xiao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 193
Rethinking Diffusion Model-Based Video Super-Resolution: Leveraging Dense Guidance from Aligned Features
Jingyi Xu ⋅ Meisong Zheng ⋅ Ying Chen ⋅ Minglang Qiao ⋅ Xin Deng ⋅ Mai Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 194
DreamSR: Towards Ultra-High-Resolution Image Super-Resolution via a Receptive-Field Enhanced Diffusion Transformer
Qingji Dong ⋅ Hang Dong ⋅ Mingqin Chen ⋅ Rui Zhang ⋅ Yitong Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 195
FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution
Aro Kim ⋅ Myeongjin Jang ⋅ Chaewon Moon ⋅ Youngjin Shin ⋅ Jinwoo Jeong ⋅ Sang-hyo Park
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 196
STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution
Junyang Chen ⋅ Jiangxin Dong ⋅ Long Sun ⋅ Yixin Yang ⋅ Jinshan Pan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 197
Towards Highly-Constrained Human Motion Generation with Retrieval-Guided Diffusion Noise Optimization
Hanchao Liu ⋅ Fang-Lue Zhang ⋅ Shining Zhang ⋅ Tai-Jiang Mu ⋅ Shi-Min Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 198
Learning to Control Physically-simulated 3D Characters via Generating and Mimicking 2D Motions
Jianan Li ⋅ Xiao Chen ⋅ Tao Huang ⋅ Tien-Tsin Wong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 199
Human Geometry Distribution for 3D Animation Generation
Xiangjun Tang ⋅ Biao Zhang ⋅ Peter Wonka
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 200
A Temporal and Content Co-Awareness Latent Diffusion for Controllable Hand Image Generation
Shuang Hao ⋅ Pengfei Ren ⋅ Haifeng Sun ⋅ Ting Pan ⋅ Qi Qi ⋅ Lei Zhang ⋅ Cong Liu ⋅ Jianxin Liao ⋅ Jingyu Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 201
Superman: Unifying Skeleton and Vision for Human Motion Perception and Generation
Xinshun Wang ⋅ Peiming Li ⋅ Ziyi Wang ⋅ Zhongbin Fang ⋅ Zhichao Deng ⋅ Songtao Wu ⋅ Xiangtai Li ⋅ Mengyuan Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 202
Learning to Assist: Physics-Grounded Human-Human Control via Multi-Agent Reinforcement Learning
Yuto Shibata ⋅ Kashu Yamazaki ⋅ Lalit Jayanti ⋅ Yoshimitsu Aoki ⋅ Mariko Isogawa ⋅ Katerina Fragkiadaki
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 203
Stability-Driven Motion Generation for Object-Guided Human-Human Co-Manipulation
Jiahao Xu ⋅ Xiaohan Yuan ⋅ Xingchen Wu ⋅ Chongyang Xu ⋅ Kun Li ⋅ Buzhen Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 204
Causal Motion Diffusion Models for Autoregressive Motion Generation
Qing Yu ⋅ Akihisa Watanabe ⋅ Kent Fujiwara
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 205
Towards Storytelling Animations: Joint Synthesis of Human and Camera Motions
Boyuan Cheng ⋅ Yingjie Xi ⋅ Rui He ⋅ Jinhe Na ⋅ Ying Cao ⋅ Pengjie Wang ⋅ Jian Jun Zhang ⋅ Xiaosong Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 206
MoLingo: Motion–Language Alignment for Text-to-Human Motion Generation
Yannan He ⋅ Garvita Tiwari ⋅ Xiaohan Zhang ⋅ Pankaj Bora ⋅ Tolga Birdal ⋅ Jan Lenssen ⋅ Gerard Pons-Moll
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 207
End-to-End Language-Action Model for Humanoid Whole Body Control
Yuxuan Wang ⋅ Haobin Jiang ⋅ Shiqing Yao ⋅ Ziluo Ding ⋅ Zongqing Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 208
Toward Early Quality Assessment of Text-to-Image Diffusion Models
Huanlei Guo ⋅ Hongxin Wei ⋅ Bingyi Jing
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 209
CoD: A Diffusion Foundation Model for Image Compression
Zhaoyang Jia ⋅ Zihan Zheng ⋅ Naifu Xue ⋅ Jiahao Li ⋅ Bin Li ⋅ Zongyu Guo ⋅ Xiaoyi Zhang ⋅ Houqiang Li ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 210
Diffusion MRI Transformer with a Diffusion Space Rotary Positional Embedding (D-RoPE)
Gustavo Chau Loo Kung ⋅ Mohammad H. Abbasi ⋅ Camila Blank ⋅ Juze Zhang ⋅ Alan Q. Wang ⋅ Sophie Ostmeier ⋅ Akshay Chaudhari ⋅ Kilian Pohl ⋅ Ehsan Adeli
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 211
Language-Guided One-Step Diffusion Model for Nighttime Flare Removal
Aoxiang Ning ⋅ Kailong Yu ⋅ Minglong Xue ⋅ Liyuan Pan ⋅ Jinhong He ⋅ Wenchao Yan ⋅ Mingliang Zhou ⋅ Yirui Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 212
SpiralDiff: Spiral Diffusion with LoRA for RGB-to-RAW Conversion Across Cameras
Huanjing Yue ⋅ Shangbin Xie ⋅ Cong Cao ⋅ Qian Wu ⋅ Lei Zhang ⋅ Zhao Lei ⋅ Jingyu Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 213
PnP-CM: Consistency Models as Plug-and-Play Priors for Inverse Problems
Merve Gulle ⋅ junno yun ⋅ Yasar Utku Alcalar ⋅ Mehmet Akcakaya
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 214
Landscape-Awareness for Geometric View Diffusion Model
Yan-Ting Chen ⋅ Hao-Wei Chen ⋅ Tsu-Ching Hsiao ⋅ Chun-Yi Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 215
Otil: Accelerating Diffusion Model Inference via Communication-Efficient Multi-GPU Parallelism
Xin Li ⋅ Shujun Tian ⋅ Tao Lu ⋅ Han Bao ⋅ Zonghui Wang ⋅ Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 216
REACH: Explicit Recovery Behavior for Diffusion Policies
zundong Ke ⋅ Junlin Chen ⋅ Jiayi Zhu ⋅ Kuanhao Xia ⋅ Jiayuan Gu ⋅ boyi zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 217
OralGPT-Omni: A Versatile Dental Multimodal Large Language Model
JING HAO ⋅ Yuci Liang ⋅ Lizhuo Lin ⋅ Yuxuan Fan ⋅ Wenkai Zhou ⋅ Kaixin Guo ⋅ Zanting Ye ⋅ Yanpeng Sun ⋅ Xinyu Zhang ⋅ Yanqi Yang ⋅ Qiankun Li ⋅ Hao Tang ⋅ James Kit-Hon Tsoi ⋅ Linlin Shen ⋅ Kuo Feng Hung
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 218
CrossHOI-Bench: A Unified Benchmark for HOI Evaluation across Vision-Language Models and HOI-Specific Methods
Qinqian Lei ⋅ Bo Wang ⋅ Robby T. Tan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 219
The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition
Yuwen Tan ⋅ Yuan Qing ⋅ Boqing Gong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 220
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
Fenfen Lin ⋅ Yesheng Liu ⋅ Haiyu Xu ⋅ Yue Chen ⋅ Zheqi He ⋅ Mingxuan Zhao ⋅ Miguel Hu Chen ⋅ JG Yao ⋅ Xi Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 221
KαLOS finds Consensus: A Meta-Algorithm for Evaluating Inter-Annotator Agreement in Complex Vision Tasks
David Tschirschwitz ⋅ Volker Rodehorst
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 222
Beyond Single Images: A Comprehensive Benchmark for Album-Level Vision-Language Understanding
Shawn Huang ⋅ Brian Price ⋅ Yifei Fan ⋅ Bryan Morse
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 223
LIBERO-Plus: A Progressive Robustness Benchmark for Visual-Language-Action Models
Senyu Fei ⋅ Siyin Wang ⋅ Junhao Shi ⋅ Zihao Dai ⋅ Jikun Cai ⋅ Pengfang Qian ⋅ Li Ji ⋅ Xinzhe He ⋅ Shiduo Zhang ⋅ Zhaoye Fei ⋅ Jinlan Fu ⋅ Jingjing Gong ⋅ Xipeng Qiu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 224
Scenes as Tokens: Multi-Scale Normal Distributions Transform Tokenizer for General 3D Vision–Language Understanding
Yutao Tang ⋅ Cheng Zhao ⋅ Gaurav Mittal ⋅ Rohith Kukkala ⋅ Rama Chellappa ⋅ Cheng Peng ⋅ Mei Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 225
LangRef3DGS: Natural Language-Guided 3D Referential Segmentation from Partial Observations via 3D Gaussian Splatting
xulun ye ⋅ Qin Zhang ⋅ Kun Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 226
Hear you are: Teaching LLMs Spatial Reasoning with Vision and Spatial Sound
Hyeonggon Ryu ⋅ Joon Chung ⋅ David Harwath
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 227
EgoMind: Activating Spatial Cognition through Linguistic Reasoning in MLLMs
Zhenghao Chen ⋅ Huiqun Wang ⋅ Di Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 228
SAQN: Semantic-based Adaptive Query Network for 3D Referring Expression Segmentation
Jiale Huang ⋅ Shangfei Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 229
EagleVision: A Dual-Stage Framework with BEV-grounding-based Chain-of-Thought for Spatial Intelligence
Jiaxu Wan ⋅ Xu Wang ⋅ Mengwei Xie ⋅ Hang Zhang ⋅ Mu Xu ⋅ Yang Han ⋅ Ding Yuan ⋅ Hong Zhang ⋅ Yifan Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 230
Abstract 3D Perception for Spatial Intelligence in Vision-Language Models
Yifan Liu ⋅ Fangneng Zhan ⋅ Kaichen Zhou ⋅ Yilun Du ⋅ Paul Pu Liang ⋅ Hanspeter Pfister
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 231
PV-Ground: Text-Guided Point-Voxel Interaction for 3D Visual Grounding
Junpeng Shang ⋅ Feifei Shao ⋅ Jun Xiao ⋅ Lin Li ⋅ Hongwei Wang ⋅ Dongfang Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 232
Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding
Yerim Jeon ⋅ Miso Lee ⋅ WonJun Moon ⋅ Jae-Pil Heo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 233
SpatialStack: Layered Geometry-Language Fusion for 3D VLM Spatial Reasoning
Jian Zhang ⋅ Shijie Zhou ⋅ Bangya LIU ⋅ Achuta Kadambi ⋅ Zhiwen Fan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 234
Geometrically-Constrained Agent for Spatial Reasoning
Zeren Chen ⋅ Xiaoya Lu ⋅ Zhijie Zheng ⋅ Pengrui Li ⋅ Lehan He ⋅ Yijin Zhou ⋅ Jing Shao ⋅ Bohan Zhuang ⋅ Lu Sheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 235
PARSE: Part-Aware Relational Spatial Modeling
Yinuo Bai ⋅ Peijun Xu ⋅ Kuixiang Shao ⋅ Yuyang Jiao ⋅ Jingxuan Zhang ⋅ Kaixin Yao ⋅ Jiayuan Gu ⋅ Jingyi Yu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 236
R4: Retrieval-Augmented Reasoning for Vision-Language Models in 4D Spatio-Temporal Space
Tin Stribor Sohn ⋅ Maximilian Dillitzer ⋅ Jason J. Corso ⋅ Eric Sax
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 237
MCHDoc: A Comprehensive Benchmark for Reading Multi-Carrier Chinese Historical Documents
YiJun Sheng ⋅ Shipeng Zhu ⋅ Ruijia Zuo ⋅ Na Nie ⋅ Hui Xue
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 238
Cross-modal Fuzzy Alignment Network for Text-Aerial Person Retrieval and A Large-scale Benchmark
Yifei Deng ⋅ Chenglong Li ⋅ YUYANG ZHANG ⋅ Guyue Hu ⋅ Jin Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 239
CodeMMR: Bridging Natural Language, Code, and Image for Unified Retrieval
Jiahui Geng ⋅ Qing Li ⋅ Fengyu Cai ⋅ Fakhri Karray
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 240
DiT-Distill: Open-Set Fine-Grained Retrieval via Generative Curriculum Knowledge
Xin Jiang ⋅ Hao Tang ⋅ Meiqi Cao ⋅ Junyao Gao ⋅ Fei Shen ⋅ Zechao Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 241
ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval
tianyu yang ⋅ ChenWei He ⋅ xiangzhao hao ⋅ Tianyue Wang ⋅ Jiarui Guo ⋅ Haiyun Guo ⋅ Leigang Qu ⋅ Jinqiao Wang ⋅ Tat-seng Chua
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 242
Love Me, Love My Label: Rethinking the Role of Labels in Prompt Retrieval for Visual In-Context Learning
Tianci Luo ⋅ Haohao Pan ⋅ Jinpeng Wang ⋅ Niu Lian ⋅ Xinrui Chen ⋅ Bin Chen ⋅ Shu-Tao Xia ⋅ Chun Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 243
Rethinking BCE Loss for Multi-Label Image Recognition with Fine-Tuning
Ao Zhou ⋅ Zhiwei Jiang ⋅ Zifeng Cheng ⋅ Cong Wang ⋅ Yafeng Yin ⋅ Shufan Yang ⋅ Qing Gu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 244
CAST: Context-Aware Dynamic Latent Space Transformation for Interactive Text-to-Image Retrieval
Xuanzuo Lin ⋅ Min Zhang ⋅ Daizong Liu ⋅ Zhiwen Zuo ⋅ Xun Yang ⋅ Changting Lin ⋅ Xun Wang ⋅ Jianfeng Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 245
PriVi: Towards a General-Purpose Video Model for Primate Behavior in the Wild
Felix B. Mueller ⋅ Jan F. Meier ⋅ Timo Lüddecke ⋅ Richard Vogg ⋅ Roger L. Freixanet ⋅ Valentin Hassler ⋅ Tiffany Bosshard ⋅ Elif Karakoc ⋅ William O'Hearn ⋅ Sofia M. Pereira ⋅ Sandro Sehner ⋅ Kaja Wierucka ⋅ Judith Burkart ⋅ Claudia Fichtel ⋅ Julia Fischer ⋅ Alexander Gail ⋅ Catherine Hobaiter ⋅ Julia Ostner ⋅ Liran Samuni ⋅ Oliver Schülke ⋅ Neda Shahidi ⋅ Erin G. Wessling ⋅ Alexander S. Ecker
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 246
Seeing Conversations: Communication Context Identification in Egocentric Video
Tobias Dorszewski ⋅ Jens Hjortkjær
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 247
Interactive Episodic Memory with User Feedback
Nikesh Subedi ⋅ Loris Bazzani ⋅ Ziad Al-Halah
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 248
Seeing without Pixels: Perception from Camera Trajectories
Zihui Xue ⋅ Kristen Grauman ⋅ Dima Damen ⋅ Andrew Zisserman ⋅ Tengda Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 249
PFGNet: A Fully Convolutional Frequency-Guided Peripheral Gating Network for Efficient Spatiotemporal Predictive Learning
Xinyong Cai ⋅ Changbin Sun ⋅ Yong Wang ⋅ Hongyu Yang ⋅ Yuankai Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 250
Minerva-Ego: Spatiotemporal Hints for Egocentric Video Understanding
Arsha Nagrani ⋅ Jasper Uijlings ⋅ Shyamal Buch ⋅ Tobias Weyand ⋅ Sudheendra Vijayanarasimhan ⋅ Bo Hu ⋅ Ramin Mehran ⋅ David A. Ross ⋅ Cordelia Schmid
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 251
StreamRAG: Enhancing Real-Time Video Understanding with Retrieval Augmentation
Junlin Xie ⋅ Quanlong Zheng ⋅ Ruifei Zhang ⋅ Kuo Wang ⋅ Yanhao Zhang ⋅ Jinguo Luo ⋅ Haonan Lu ⋅ Xiang Wan ⋅ Guanbin Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 252
ViKey: Enhancing Temporal Understanding in Videos via Visual Prompting
Yeonkyung Lee ⋅ Dayun Ju ⋅ Youngmin Kim ⋅ seil kang ⋅ Seong Jae Hwang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 253
SkillSight: Efficient First-Person Skill Assessment with Gaze
Chi Hsuan Wu ⋅ Kumar Ashutosh ⋅ Kristen Grauman
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 254
BriMA: Bridged Modality Adaptation for Multi-Modal Continual Action Quality Assessment
Kanglei Zhou ⋅ Chang Li ⋅ Qingyi Pan ⋅ Liyuan Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 255
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
JUNHAO CHENG ⋅ Liang Hou ⋅ Xin Tao ⋅ Jing Liao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 256
MedLIME: A Distribution-Aligned and Evidence-Supported Framework for Medical Saliency Explanations
Raghav Magazine ⋅ Xingjian Li ⋅ Min Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 257
Inside-Out: Measuring Generalization in Vision Transformers Through Inner Workings
Yunxiang Peng ⋅ Mengmeng Ma ⋅ Ziyu Yao ⋅ Xi Peng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 258
Language Models Can Explain Visual Features via Steering
Javier Ferrando ⋅ Enrique Lopez-Cuena ⋅ Pablo Agustin Martin-Torres ⋅ Daniel Hinjos ⋅ Anna Arias Duart ⋅ Dario Garcia-Gasulla
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 259
Making the Classification Explanation Faithful to the Confidence Score
Jian-Xun Mi ⋅ Lu Pan ⋅ Weisheng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 260
Intrinsic Concept Extraction Based on Compositional Interpretability
Hanyu Shi ⋅ Hong Tao ⋅ Guoheng Huang ⋅ Jianbin Jiang ⋅ Xuhang Chen ⋅ Chi-Man Pun ⋅ Shanhu Wang ⋅ Pan Pan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 261
Attribution-Guided Model Rectification of Unreliable Neural Network Behaviors
Peiyu Yang ⋅ Naveed Akhtar ⋅ Jiantong Jiang ⋅ Ajmal Mian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 262
Measuring the (Un)Faithfulness of Concept-Based Explanations
Shubham Kumar ⋅ Narendra Ahuja
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 263
Deformation-based In-Context Learning for Point Cloud Understanding
Chengxing Lin ⋅ Jinhong Deng ⋅ Yinjie Lei ⋅ Wen Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 264
ELiC: Efficient LiDAR Geometry Compression via Cross-Bit-depth Feature Propagation and Bag-of-Encoders
Junsik Kim ⋅ Gun Bang ⋅ Soowoong Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 265
ESAM++: Efficient Online 3D Perception on the Edge
Qin Liu ⋅ Lavisha Aggarwal ⋅ Saptarashmi Bandyopadhyay ⋅ Vikas Bahirwani ⋅ Marc Niethammer ⋅ Ehsan Adeli ⋅ Andrea Colaco
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 266
DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration
Jiayi Li ⋅ Yuxin Yao ⋅ Qiuhang Lu ⋅ Juyong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 267
Hg-I2P: Bridging Modalities for Generalizable Image-to-Point-Cloud Registration via Heterogeneous Graphs
Pei An ⋅ Junfeng Ding ⋅ Jiaqi Yang ⋅ Yulong Wang ⋅ Jie Ma ⋅ Liangliang Nan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 268
Rethinking 2D-3D Registration: A Novel Network for High-Value Zone Selection and Representation Consistency Alignment
Zhixin Cheng ⋅ Bohao Liao ⋅ Jiacheng Deng ⋅ Xiaotian Yin ⋅ Xinjun Li ⋅ Yujia Chen ⋅ Baoqun Yin ⋅ Tianzhu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 269
Adaptive 3D Perception for Small Aerial Targets Under Sparse Sampling via Reinforcement Learning
Shenghai Yuan ⋅ Yihan Wei ⋅ Jason Yee ⋅ Zhuoran Qiao ⋅ boyang lou ⋅ Enwen Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 270
3D sans 3D Scans: Scalable Pre-training from Video-Generated Point Clouds
Ryousuke Yamada ⋅ Kohsuke Ide ⋅ Yoshihiro Fukuhara ⋅ Hirokatsu Kataoka ⋅ Gilles Puy ⋅ Andrei Bursuc ⋅ Yuki M Asano
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 271
StreamVLO: Streaming Visual–LiDAR Odometry with Cumulative Drift Compensation
Mengmeng Liu ⋅ Jiuming Liu ⋅ Michael Ying Yang ⋅ Chaokang Jiang ⋅ Jiangtao Li ⋅ Yunpeng Zhang ⋅ Hesheng Wang ⋅ Francesco Nex ⋅ Hao Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 272
Mamba Learns in Context: Structure-Aware Domain Generalization for Multi-Task Point Cloud Understanding
Jincen Jiang ⋅ Qianyu Zhou ⋅ Yuhang Li ⋅ Kui Su ⋅ Meili Wang ⋅ Jian Chang ⋅ Jian Jun Zhang ⋅ Xuequan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 273
Routing on Demand: DSNet for Efficient Progressive Point Cloud Denoising
Xiaoqian Cheng ⋅ Dong Xiao ⋅ Husen Li ⋅ Zheng Liu ⋅ Renjie Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 274
Hyper-PCN: Hypergraph-Based Point Cloud Completion via High-Order Correlation Modeling
Linfei Li ⋅ Pei Tan ⋅ Siqi Li ⋅ Changqing Zou ⋅ Yue Gao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 275
Towards Calibrating Prompt Tuning of Vision-Language Models
Ashshak Sharifdeen ⋅ Fahad Shamshad ⋅ Muhammad Akhtar Munir ⋅ Abhishek Basu ⋅ Mohamed Ismithdeen ⋅ Jeyapriyan Jeyamohan ⋅ Chathurika Silva ⋅ Karthik Nandakumar ⋅ Muhammad Haris Khan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 276
DEVA: Fine-tuning Multimodal Large Language Models for Visual Perception Tasks
Debasmit Das ⋅ Munawar Hayat ⋅ Fatih Porikli
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 277
LOREAL: Mitigating Low-Resolution Challenges in Vision-Language Models with Attribute-driven Prompt Self-Distillation
Xucong Wang ⋅ Pengkun Wang ⋅ Zhe Zhao ⋅ Liheng Yu ⋅ Rui Mao ⋅ Yang Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 278
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning
Yanqing Liu ⋅ Xianhang Li ⋅ Letian Zhang ⋅ Zirui Wang ⋅ Zeyu Zheng ⋅ Yuyin Zhou ⋅ Cihang Xie
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 279
Language-guided Frequency Modulation for Large Vision-Language Models
Shuyi Ouyang ⋅ Gongfan Fang ⋅ Xinyin Ma ⋅ Yen-Wei Chen ⋅ Lanfen Lin ⋅ Xinchao Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 280
TANGO: Text-Anchored Guided Optimization for Robust Fine-tuning Vision-Language Models under Label Noise
Tengfei Ma ⋅ Weiran Pan ⋅ Wei Wei
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 281
Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining
Weijun Zhuang ⋅ Yuqing Huang ⋅ Weikang Meng ⋅ Xin Li ⋅ Ming Liu ⋅ Xiaopeng Hong ⋅ Yaowei Wang ⋅ Wangmeng Zuo
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 282
Reconstructing CLIP for Open-Vocabulary Dense Perception
Yajie Liu ⋅ Jinjin Zhang ⋅ Qingjie Liu ⋅ Di Huang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 283
DPL: Decoupled Prototype Learning for Enhancing Robustness of Vision–Language Transformers to Missing Modalities
Jueqing Lu ⋅ Yuanyuan Qi ⋅ Xiaohao Yang ⋅ Shuaicheng Niu ⋅ Fucai Ke ⋅ Shujie Zhou ⋅ Wei Tan ⋅ Jionghao Lin ⋅ Wray Buntine ⋅ Hamid Rezatofighi ⋅ Lan Du
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 284
BrepVGAE: Variational Graph Autoencoder with Unified Latent Representation for B-rep
Hao Guo ⋅ Liyuan Deng ⋅ Yongkang Dai ⋅ Ruohan Wang ⋅ Jiahao Li ⋅ Yunpeng Bai ⋅ Yilei Shi
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 285
NeuROK: Generative 4D Neural Object Kinematics
Chen Geng ⋅ Guangzhao He ⋅ Yue Gao ⋅ Yunzhi Zhang ⋅ Shangzhe Wu ⋅ Jiajun Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 286
BrickNet: Graph-Backed Generative Brick Assembly
Peter Kulits ⋅ Cordelia Schmid
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 287
Unified Vector Floorplan Generation via Markup Representation
Kaede Shiohara ⋅ Toshihiko Yamasaki
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 288
CME-CAD: Heterogeneous Collaborative Multi-Expert Reinforcement Learning for CAD Code Generation
Ke Niu ⋅ Haiyang Yu ⋅ Zhuofan Chen ⋅ Zhengtao Yao ⋅ Weitao Jia ⋅ Xiaodong Ge ⋅ Jingqun Tang ⋅ Benlei Cui ⋅ Bin Li ⋅ Xiangyang Xue
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 289
Robo-SGG: Exploiting Layout-Oriented Normalization and Restitution Can Improve Robust Scene Graph Generation
Changsheng Lv ⋅ Zijian Fu ⋅ Mengshi Qi
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 290
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens
Yiying Yang ⋅ Wei Cheng ⋅ Sijin Chen ⋅ Honghao Fu ⋅ Xianfang Zeng ⋅ Yujun Cai ⋅ Gang Yu ⋅ Xingjun Ma
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 291
EpiAgent: An Agent-Centric System for Ancient Inscription Restoration
Shipeng Zhu ⋅ Ang Chen ⋅ Na Nie ⋅ Pengfei Fang ⋅ Min-Ling Zhang ⋅ Hui Xue
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 292
The Invisible Gorilla Effect in Out-of-distribution Detection
Harry Anthony ⋅ Ziyun Liang ⋅ Hermione Warr ⋅ Konstantinos Kamnitsas
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 293
Interpretable Debiasing of Vision-Language Models for Social Fairness
Na Min An ⋅ Yoonna Jang ⋅ Yusuke Hirota ⋅ Ryo Hachiuma ⋅ Isabelle Augenstein ⋅ Hyunjung Shim
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 294
Image-based Outlier Synthesis With Training Data
Sudarshan Regmi
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 295
SALMUBench: A Benchmark for Sensitive Association-Level Multimodal Unlearning
Cai Selvas-Sala ⋅ Lei Kang ⋅ Lluis Gomez
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 296
Scaling Test-Time Robustness of Vision-Language Models via Self-Critical Inference Framework
Kaihua Tang ⋅ Jiaxin Qi ⋅ Jinli Ou ⋅ Yuhua Zheng ⋅ Jianqiang Huang
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 297
When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm
Ye Leng ⋅ Junjie Chu ⋅ Mingjie Li ⋅ Chenhao Lin ⋅ Chao Shen ⋅ Michael Backes ⋅ Yun Shen ⋅ Yang Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 298
IrisFP: Adversarial-Example-based Model Fingerprinting with Enhanced Uniqueness and Robustness
Ziye Geng ⋅ Guang Yang ⋅ Yihang Chen ⋅ Changqing Luo
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 299
Mark4D: Temporally-Consistent Watermarking for 4D Gaussian Splatting
Jaejin Lee ⋅ Minjae Jeong ⋅ Joonhyuk Park ⋅ Yechan Hwang ⋅ Seunghun Baek ⋅ Won Hwa Kim
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 300
Machine Unlearning via Adaptive Gradient Reweighting and Multi-stage Objective Optimization
Juxin Lu ⋅ Haoyu Shi ⋅ Mengyao Wang ⋅ Huaiwen Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 301
Taming Noise-Induced Prototype Degradation for Privacy-Preserving Personalized Federated Fine-Tuning
Yuhua Wang ⋅ Qinnan Zhang ⋅ Xiaodong Li ⋅ Huan Zhang ⋅ Yifan Sun ⋅ Wangjie Qiu ⋅ Hainan Zhang ⋅ Yongxin Tong ⋅ Zhiming Zheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 302
FedMOP: Achieving Enhanced Privacy and Performance in Federated Learning via Momentum Orthogonal Projection
Yunlong Zhao ⋅ Xiaoheng Deng ⋅ Hongyan Xu ⋅ Zhuohua Qiu ⋅ Xiaowen Hu ⋅ Shan You ⋅ Yi Chen ⋅ Chang Xu ⋅ Xiu Su
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 303
HFedATM: Hierarchical Federated Domain Generalization via Optimal Transport and Regularized Mean Aggregation
Thinh Nguyen ⋅ Le Trung Phan ⋅ Binh Nguyen ⋅ Khoa D Doan ⋅ Kok Seng Wong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 304
Single-Round Scalable Analytic Federated Learning
Alan T. L. Bacellar ⋅ Mustafa Munir ⋅ Felipe M.G. França ⋅ Priscila Machado Vieira Lima ⋅ Radu Marculescu ⋅ Lizy Kurian John
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 305
Controllable Federated Prompt Learning at Test Time
Rui Zhu ⋅ Liang Bai ⋅ Yanming Guo ⋅ Yirun Ruan ⋅ Tianyuan Yu ⋅ Zhihe Lu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 306
FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning
Yuan Yao ⋅ Lixu Wang ⋅ Jiaqi Wu ⋅ Jin Song ⋅ Simin Chen ⋅ Zehua Wang ⋅ Zijian Tian ⋅ Wei Chen ⋅ Huixia Li ⋅ Xiaoxiao Li
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 307
Conversational Image Segmentation: Grounding Abstract Concepts with Scalable Supervision
Aadarsh Sahoo ⋅ Georgia Gkioxari
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 308
Spatial Matters: Position-Guided 3D Referring Expression Segmentation
Yabing Wang ⋅ Zhuotao Tian ⋅ Le Wang ⋅ Zheng Qin ⋅ Sanping Zhou
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 309
Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
Tianming Liang ⋅ Haichao Jiang ⋅ Yuting Yang ⋅ Chaolei Tan ⋅ Shuai Li ⋅ Wei-Shi Zheng ⋅ Jian-Fang Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 310
Refer-Agent: A Collaborative Multi-Agent System with Reasoning and Reflection for Referring Video Object Segmentation
Haichao Jiang ⋅ Tianming Liang ⋅ Wei-Shi Zheng ⋅ Jian-Fang Hu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 311
CaptionFormer: Unified Segmentation, Tracking, and Captioning for Spatio-Temporal Objects
Gabriel Fiastre ⋅ Antoine Yang ⋅ Cordelia Schmid
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 312
TransPrune: Token Transition Pruning for Efficient Large Vision-Language Model
Ao Li ⋅ Yuxiang Duan ⋅ Jinghui Zhang ⋅ Congbo Ma ⋅ Yutong Xie ⋅ Gustavo Carneiro ⋅ Mohammad Yaqub ⋅ Hu Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 313
QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models
Jingxuan Zhang ⋅ Yun-Ta Hsieh ⋅ Zhongwei Wan ⋅ Haokun Lin ⋅ Xin Wang ⋅ Ziqi Wang ⋅ Yingtie Lei ⋅ Mi Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 314
Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach
Yaoxin Yang ⋅ Peng Ye ⋅ Xudong Tan ⋅ Chongjun Tu ⋅ Maosen Zhao ⋅ Jia Hao ⋅ Tao Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 315
Collaborative Multi-Mode Pruning for Vision-Language Models
Zimeng Wu ⋅ Yunhong Wang ⋅ Donghao Wang ⋅ Jiaxin Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 316
ZOO-Prune: Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models
Youngeun Kim ⋅ Youjia Zhang ⋅ Huiling Liu ⋅ Aecheon Jung ⋅ Sunwoo Lee ⋅ Sungeun Hong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 317
HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models
Qihui Zhu ⋅ Tao Zhang ⋅ Yuchen Wang ⋅ Shuangwu Chen ⋅ Xiaobin Tan ⋅ Jian Yang ⋅ Yang Liu ⋅ Yinfei Pan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 318
CORE: Compact Object-centric REpresentations as a New Paradigm for Token Merging in LVLMs
Jingyu Lei ⋅ Gaoang Wang ⋅ Der-Horng Lee
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 319
Imbalanced View Contribution Evaluation and Refinement for Deep Incomplete Multi-View Clustering
Taichun Zhou ⋅ Zhibin Dong ⋅ Hao Tan ⋅ Siwei Wang ⋅ Xinwang Liu ⋅ En Zhu ⋅ Di Hu ⋅ Tianrui Liu ⋅ Chuankun Li ⋅ Kunlun He
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 320
Multi-Hierarchical Contrastive Spectral Fusion for Multi-View Clustering
Bing Cai ⋅ Xiaoli Wang ⋅ Gui-Fu Lu ⋅ Zechao Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 321
SECOS: Semantic Capture for Rigorous Classification in Open-World Semi-Supervised Learning
Hezhao Liu ⋅ Jiacheng Yang ⋅ Junlong Gao ⋅ Mengke Li ⋅ Yiqun Zhang ⋅ Shreyank Gowda Gowda ⋅ Yang Lu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 322
Multi-Modal Representation Learning via Semi-Supervised Rate Reduction for Generalized Category Discovery
Wei He ⋅ Xianghan Meng ⋅ Zhiyuan Huang ⋅ Xianbiao Qi ⋅ Rong Xiao ⋅ Chunguang Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 323
TimeBridge: Self-Supervised Video Representation Learning via Start-End Joint Embedding and In-Between Frame Prediction
Qin Wang ⋅ Abigail Morrison ⋅ Hanno Scharr ⋅ Kai Krajsek
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 324
Mitigating Instance Entanglement in Instance-Dependent Partial Label Learning
Rui Zhao ⋅ Bin Shi ⋅ Kai Sun ⋅ Bo Dong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 325
Residual Connections Harm Generative Representation Learning
Xiao Zhang ⋅ Ruoxi Jiang ⋅ William Gao ⋅ Rebecca Willet ⋅ Michael Maire
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 326
Neural Mixture Density Processes
Yi Ding ⋅ Qi Tao ⋅ Xingxing Liang ⋅ Longfei Zhang ⋅ Yiqin Lv ⋅ Weitao Song ⋅ Fangjie Yang ⋅ Qi Wang ⋅ Guangquan Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 327
Large-scale Robust Enhanced Ensemble Clustering via Outlier Decoupling
Jiaxuan Xu ⋅ Lei Duan ⋅ Xinye Wang ⋅ Liang Du
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 328
DriveLaW: Unifying Planning and Video Generation in a Latent Driving World
Tianze Xia ⋅ Yongkang Li ⋅ Lijun Zhou ⋅ Jingfeng Yao ⋅ Kaixin Xiong ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Kun Ma ⋅ Guang Chen ⋅ Hangjun Ye ⋅ Wenyu Liu ⋅ Xinggang Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 329
DLWM: Dual Latent World Models enable Holistic Gaussian-centric Pre-training in Autonomous Driving
Yiyao Zhu ⋅ Ying Xue ⋅ Haiming Zhang ⋅ Guangfeng Jiang ⋅ Wending Zhou ⋅ Xu Yan ⋅ Jiantao Gao ⋅ Yingjie Cai ⋅ Bingbing Liu ⋅ Zhen Li ⋅ Shaojie Shen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 330
Latent Chain-of-Thought World Modeling for End-to-End Driving
Shuhan Tan ⋅ Kashyap Chitta ⋅ Yuxiao Chen ⋅ Thomas Tian ⋅ Yurong You ⋅ Yan Wang ⋅ Wenjie Luo ⋅ Yulong Cao ⋅ Philipp Krähenbühl ⋅ Marco Pavone ⋅ Boris Ivanovic
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 331
RLFTSim: Realistic and Controllable Multi-Agent Traffic Simulation via Reinforcement Learning Fine-Tuning
Ehsan Ahmadi ⋅ Hunter Schofield ⋅ Behzad Khamidehi ⋅ Fazel Arasteh ⋅ Jinjun Shan ⋅ Lili Mou ⋅ Dongfeng Bai ⋅ Kasra Rezaee
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 332
TrafficAlign: Aligning Large Language Models for Traffic Scenario Generation
Zhi Tu ⋅ Liangkun Niu ⋅ Tianyi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 333
Failure Modes for Deep Learning–Based Online Mapping: How to Measure and Address Them
Michael Hubbertz ⋅ Qi Han ⋅ Tobias Meisen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 334
Linking Modality Isolation in Heterogeneous Collaborative Perception
Changxing Liu ⋅ Zichen Chao ⋅ Siheng Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 335
LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving
Long Nguyen ⋅ Micha Fauth ⋅ Bernhard Jaeger ⋅ Daniel Dauner ⋅ Maximilian Igl ⋅ Andreas Geiger ⋅ Kashyap Chitta
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 336
DriverGaze360: OmniDirectional Driver Attention with Object-Level Guidance
Shreedhar Govil ⋅ Didier Stricker ⋅ Jason Rambach
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 337
Diffusion Forcing Planner: History-Annealed Planning with Time-Dependent Guidance for Autonomous Driving
Zehan Zhang ⋅ Yaoyi Li ⋅ Neng Zhang ⋅ Jia Cai
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 338
DIMOS: Disentangling Instance-level Moving Object Segmentation
Hongxiang Huang ⋅ Hongwei Ren ⋅ Xiaopeng Lin ⋅ Yulong Huang ⋅ Zeke Xie ⋅ Bojun Cheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 339
EvObj: Learning Evolving Object-centric Representations for 3D Instance Segmentation without Scene Supervision
Jiahao Chen ⋅ Zihui Zhang ⋅ Yafei Yang ⋅ Jinxi Li ⋅ Shenxing Wei ⋅ Zhixuan Sun ⋅ Bo Yang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 340
Live Interactive Training for Video Segmentation
Xinyu Yang ⋅ Haozheng Yu ⋅ Yihong Sun ⋅ Bharath Hariharan ⋅ Jennifer J. Sun
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 341
Robust Promptable Video Object Segmentation
Sohyun Lee ⋅ Yeho Gwon ⋅ Lukas Hoyer ⋅ Konrad Schindler ⋅ Christos Sakaridis ⋅ Suha Kwak
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 342
Scene-VLM: Multimodal Video Scene Segmentation via Vision-Language Models
Nimrod Berman ⋅ Adam Botach ⋅ Emanuel Ben-Baruch ⋅ Shunit Haviv Hakimi ⋅ Asaf Gendler ⋅ Ilan Naiman ⋅ Erez Yosef ⋅ Igor Kviatkovsky
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 343
Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation
Minho Park ⋅ Sunghyun Park ⋅ Jungsoo Lee ⋅ Hyojin Park ⋅ Kyuwoong Hwang ⋅ Fatih Porikli ⋅ Jaegul Choo ⋅ Sungha Choi
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 344
BEV-CAR: Enhancing Monocular Bird’s Eye View Segmentation with Context-Aware Rasterization
Yixin Xiong ⋅ Ke Wang ⋅ Tongtong Cheng ⋅ Chunhui Liu ⋅ Kai Liu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 345
Exploring the Underwater World Segmentation without Extra Training
Bingyu Li ⋅ Tao Huo ⋅ Da Zhang ⋅ Zhiyuan Zhao ⋅ Junyu Gao ⋅ Xuelong Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 346
Learning from Oblivion: Predicting Knowledge-Overflowed Weights via Retrodiction of Forgetting
Jinhyeok Jang ⋅ Jaehong Kim ⋅ Jung Uk Kim
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 347
Cross-Architecture Adaptation: Cloud-Edge Continual Test-Time Adaptation with Dynamic Sampling and Heterogeneous Distillation
Zirui Xu ⋅ Xianhang Chu ⋅ Jiahao Li ⋅ Xu Yang ⋅ Cheng Deng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 348
Towards Dynamic Modality Alignment in Multimodal Continual Learning
Jiayao Tan ⋅ Fan Lyu ⋅ Tianle Liu ⋅ Fuyuan Hu ⋅ Wei Feng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 349
ϕ-DPO: Fairness Direct Preference Optimization Approach to Continual Learning in Large Multimodal Models
Thanh-Dat Truong ⋅ Huu-Thien Tran ⋅ Jackson Cothren ⋅ Bhiksha Raj ⋅ Khoa Luu
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 350
Incremental Object Detection via Future-Aware Decoupled Cross-Head Distillation
Chenfeng Yin ⋅ De Cheng ⋅ Wenlong Luo ⋅ Mingyue Zeng ⋅ Shizhou Zhang ⋅ Nannan Wang ⋅ Xinbo Gao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 351
Smart Replay: Adaptive Scheduling of Memory Rehearsal for Computational Resource-Aware Incremental Learning
Jianting Chen ⋅ Dianzhi Yu ⋅ Irwin King
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 352
ReBaPL: Repulsive Bayesian Prompt Learning
Yassir Bendou ⋅ Omar Ezzahir ⋅ Remove middle name Fernandes ⋅ Gabriel Mahuas ⋅ Victoria Shevchenko ⋅ Mike Gartrell
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 353
Spectral Mixture-of-Experts for Continual Learning
Chen Yin ⋅ Xingbo Dong ⋅ Xuelin Shen ⋅ Zhe Jin
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 354
ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars
Ziqiao Peng ⋅ Yi Chen ⋅ Yifeng Ma ⋅ Guozhen Zhang ⋅ Zhiyao Sun ⋅ Zixiang Zhou ⋅ Youliang Zhang ⋅ Zhengguang Zhou ⋅ Zhaoxin Fan ⋅ Hongyan Liu ⋅ Yuan Zhou ⋅ Qinglin Lu ⋅ Jun He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 355
ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body
Juze Zhang ⋅ Changan Chen ⋅ Xin Chen ⋅ Heng Yu ⋅ Tiange Xiang ⋅ Ali Khan ⋅ Shrinidhi K. Lakshmikanth ⋅ Ehsan Adeli
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 356
DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations
Yuxiang Shi ⋅ Zhe Li ⋅ Yanwen Wang ⋅ Hao Zhu ⋅ Xun Cao ⋅ Ligang Liu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 357
SketchFaceGS: Real-Time Sketch-Driven Face Editing and Generation with Gaussian Splatting
Bo Li ⋅ Jiahao Kang ⋅ Yubo Ma ⋅ Feng-Lin Liu ⋅ Bin Liu ⋅ Fang-Lue Zhang ⋅ Lin Gao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 358
MIBURI: Towards Expressive Interactive Gesture Synthesis
M. Hamza Mughal ⋅ Rishabh Dabral ⋅ Vera Demberg ⋅ Christian Theobalt
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 359
Personalized Image Descriptions from Attention Sequences
Ruoyu Xue ⋅ Hieu Le ⋅ Jingyi Xu ⋅ Sounak Mondal ⋅ Abe Leite ⋅ Gregory Zelinsky ⋅ Minh Nguyen Nguyen ⋅ Dimitris Samaras
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 360
GA-VLN: Geometry-Aware BEV Representation for Efficient Vision-Language Navigation
Jiahao Yang ⋅ Zihan Wang ⋅ Xiangyang Li ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Yinghao Xu ⋅ Shuqiang Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 361
IMAIA: Interactive Maps AI Assistant for Travel Planning and Geo-Spatial Intelligence
Jieren Deng ⋅ Zhizhang Hu ⋅ Ziyan He ⋅ Aleksandar Cvetkovic ⋅ Pak Kiu Chung ⋅ Dragomir Yankov ⋅ Chiqun Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 362
OctoNav: Towards Generalist Embodied Navigation
Chen Gao ⋅ Liankai Jin ⋅ Xingyu Peng ⋅ Jiazhao Zhang ⋅ Yue Deng ⋅ Annan Li ⋅ He Wang ⋅ Si Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 363
WalkGPT: Grounded Vision–Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation
Rafi Ibn Sultan ⋅ Hui Zhu ⋅ Xiangyu Zhou ⋅ Chengyin Li ⋅ Prashant Khanduri ⋅ Marco Brocanelli ⋅ Dongxiao Zhu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 364
SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving
Peizheng Li ⋅ Zhenghao Zhang ⋅ David Holtz ⋅ Hang Yu ⋅ Yutong Yang ⋅ Yuzhi Lai ⋅ Rui Song ⋅ Andreas Geiger ⋅ Andreas Zell
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 365
SMAP: Semantic Route Planning with Map-Grounded Multimodal Alignment
Wenjie Zhang ⋅ Chen Yang ⋅ Xin Lu ⋅ Zhen Wang ⋅ Yue Liu ⋅ Bobo Xi ⋅ Pengbo Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 366
IDperturb: Enhancing Variation in Synthetic Face Generation via Angular Perturbations
Fadi Boutros ⋅ Eduarda Caldeira ⋅ Tahar Chettaoui ⋅ Naser Damer
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 367
Fresco: Frequency–Spatial Consistent Optimization for Fine-Grained Head Avatar Modeling
Shikun Zhang ⋅ Yong Li ⋅ Yiqun Wang ⋅ Qiuhong Ke ⋅ Cunjian Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 368
Motion-Aware Animatable Gaussian Avatars Deblurring
Muyao Niu ⋅ Yifan Zhan ⋅ Qingtian Zhu ⋅ Zhuoxiao Li ⋅ Wei Wang ⋅ Zhihang Zhong ⋅ Xiao Sun ⋅ Yinqiang Zheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 369
ELITE: Efficient Gaussian Head Avatar from a Monocular Video via Learned Initialization and Test-time Generative Adaptation
Kim Youwang ⋅ Lee Hyoseok ⋅ Park Subin ⋅ Gerard Pons-Moll ⋅ Tae-Hyun Oh
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 370
Multi-view Consistent 3D Gaussian Head Avatars 'without' Multi-view Generation
Aviral Chharia ⋅ Fernando De la Torre
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 371
MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models
Sang Yun Chung ⋅ Se Yeon Kim ⋅ Youngchae Chee ⋅ Yong Man Ro
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 372
Cross-Modal Attention Calibration for LVLM Hallucination Mitigation
Jiaming Li ⋅ Jiacheng Zhang ⋅ Zequn Jie ⋅ Lin Ma ⋅ Ming Li ⋅ Xiaonan Luo ⋅ Guanbin Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 373
3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding
Makanjuola Ogunleye ⋅ Eman Abdelrahman ⋅ Ismini Lourentzou
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 374
Exposing and Evaluating Hallucinations for GUI Grounding
Zicheng Zhang ⋅ Hongyi Jing ⋅ Rui Lv ⋅ Shuo Fang ⋅ Shiai Zhu ⋅ Junying Wang ⋅ Chunyi Li ⋅ Xiaohong Liu ⋅ Chenguang Ma ⋅ Guangtao Zhai
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 375
Understanding and Mitigating Hallucinations in Multimodal Chain-of-Thought Models
Ji Ma ⋅ Wei Suo ⋅ Peng Wang ⋅ Yanning Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 376
Beyond the Global Scores: Fine-Grained Token Grounding as a Robust Detector of LVLM Hallucinations
Tuan Dung Nguyen ⋅ Minh Khoi Ho ⋅ Qi Chen ⋅ Yutong Xie ⋅ Cam-Tu Nguyen ⋅ Minh Khoi Nguyen ⋅ Dang Huy Pham Nguyen ⋅ Anton van den Hengel ⋅ Johan Verjans ⋅ Le Nguyen ⋅ Vu Minh Hieu Phan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 377
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Ke Xing ⋅ Longfei Li ⋅ Yuyang Yin ⋅ Hanwen Liang ⋅ Guixun Luo ⋅ Chen Fang ⋅ Jue Wang ⋅ Konstantinos N. Plataniotis ⋅ Xiaojie Jin ⋅ Yao Zhao ⋅ Yunchao Wei
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 378
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout
Hidir Yesiltepe ⋅ Tuna Han Salih Meral ⋅ Adil Kaan Akan ⋅ Kaan Oktay ⋅ Pinar Yanardag
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 379
AniMimic: Imitating 3D Animation from Video Priors
Tianyi Xie ⋅ Yunuo Chen ⋅ Yaowei Guo ⋅ Yin Yang ⋅ Bolei Zhou ⋅ Demetri Terzopoulos ⋅ Ying Jiang ⋅ Chenfanfu Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 380
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Sixiao Zheng ⋅ Minghao Yin ⋅ Wenbo Hu ⋅ Xiaoyu Li ⋅ Ying Shan ⋅ Yanwei Fu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 381
ScenDi: 3D-to-2D Scene Diffusion Cascades for Urban Generation
Hanlei Guo ⋅ Jiahao Shao ⋅ Xinya Chen ⋅ Xiyang Tan ⋅ Sheng Miao ⋅ Yujun Shen ⋅ Yiyi Liao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 382
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
Ruijie Zhu ⋅ Jiahao Lu ⋅ Wenbo Hu ⋅ Xiaoguang Han ⋅ Jianfei Cai ⋅ Ying Shan ⋅ Chuanxia Zheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 383
GeodesicNVS: Probability Density Geodesic Flow Matching for Novel View Synthesis
Xuqin Wang ⋅ Tao Wu ⋅ Yanfeng Zhang ⋅ Lu Liu ⋅ Mingwei Sun ⋅ Yongliang Wang ⋅ Niclas Zeller ⋅ Daniel Cremers
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 384
WorldStereo: Bridging Controllable Video Generation and Scene Reconstruction via 3D Geometric Memories
Yisu Zhang ⋅ Chenjie Cao ⋅ Tengfei Wang ⋅ Xuhui Zuo ⋅ Junta Wu ⋅ Jianke Zhu ⋅ Chunchao Guo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 385
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Yuxue Yang ⋅ Lue Fan ⋅ Ziqi Shi ⋅ Junran Peng ⋅ Feng Wang ⋅ Zhaoxiang Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 386
Taming Video Models for 3D and 4D Generation via Zero-Shot Camera Control
Chenxi Song ⋅ Yanming Yang ⋅ Tong Zhao ⋅ Ruibo Li ⋅ Chi Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 387
Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance
William June Suk Choi ⋅ Kyungmin Lee ⋅ Sihyun Yu ⋅ Yisol Choi ⋅ Jinwoo Shin ⋅ Kimin Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 388
SANER: Switchable Adapter with Non-parametric Enhanced Routing for Person De-Reidentification
Yimin Liu ⋅ Nan Pu ⋅ Fengxiang Yang ⋅ Wenjing Li ⋅ Zhihui Li ⋅ Zhun Zhong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 389
BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification
Haoxuan Xu ⋅ Guanglin Niu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 390
Vision-Language Attribute Disentanglement and Reinforcement for Lifelong Person Re-Identification
Kunlun Xu ⋅ Haotong Cheng ⋅ Jiangmeng Li ⋅ Xu Zou ⋅ Jiahuan Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 391
Diversity over Uniformity: Rethinking Representation in Generated Image Detection
Qinghui He ⋅ Haifeng Zhang ⋅ Qiao Qin ⋅ Bo Liu ⋅ Xiuli Bi ⋅ Bin Xiao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 392
Mining Instance-Centric Vision–Language Contexts for Human–Object Interaction Detection
Soo Won Seo ⋅ Kyungchae Lee ⋅ Hyungchan Cho ⋅ Taein Son ⋅ Nam Ik Cho ⋅ Jun Won Choi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 393
FSLoRA: Harmonizing Detection and Re-Identification via Freq-Spatial Low-Rank Adapter for One-Stage Person Search
Yanling Tian ⋅ Shanshan Zhang ⋅ Di Chen ⋅ Jian Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 394
EEGiT: Teaching Vision Transformers to Understand the EEG signal
Jiahao Zhou ⋅ Chenghao Xu ⋅ Wei Wang ⋅ Erkun Yang ⋅ Cheng Deng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 395
FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual Prompts
Xin Xu ⋅ Weilong Li ⋅ Wei Liu ⋅ Wenke Huang ⋅ Zhixi Yu ⋅ Bin Yang ⋅ Xiaoying Liao ⋅ Kui Jiang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 396
Pose-guided Enriched Feature Learning for Federated-by-camera Person Re-identification
JooHyung Oh ⋅ Minyoung Oh ⋅ Sung Whan Yoon ⋅ Jae-Young Sim
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 397
UAV-CB: A Complex-Background RGB–T Dataset and Local Frequency Bridge Network for UAV Detection
Shenghui Huang ⋅ Menghao Hu ⋅ Longkun Zou ⋅ Hongyu Chi ⋅ Zekai Li ⋅ Feng Gao ⋅ Fan Yang ⋅ Qingyao Wu ⋅ Ke Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 398
TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding
Boshen Xu ⋅ Zihan Xiao ⋅ Jiaze Li ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Jian Luan ⋅ Qin Jin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 399
StreamReady: Learning What to Answer and When in Long Streaming Videos
Shehreen Azad ⋅ Vibhav Vineet ⋅ Yogesh Rawat
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 400
LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
Jihao Qiu ⋅ Lingxi Xie ⋅ Xinyue Huo ⋅ Qi Tian ⋅ Qixiang Ye
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 401
Agentic Video Summarization via Self-Reflecting Multimodal Understanding
Miaotian Guo ⋅ Shuguang Dou ⋅ Yin Li ⋅ Aidong Men ⋅ Dongsheng Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 402
Self-Critical Distillation Network for Video-based Commonsense Captioning
Mengqi Yuan ⋅ Gengyun Jia ⋅ Bing-Kun Bao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 403
Ego-Grounding for Personalized Question-Answering in Egocentric Videos
Junbin Xiao ⋅ Shenglang Zhang ⋅ Pengxiang Zhu ⋅ Angela Yao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 404
AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding
Handong Li ⋅ Zikang Liu ⋅ Longteng Guo ⋅ Tongtian Yue ⋅ Yepeng Tang ⋅ Xinxin Zhu ⋅ Chuanyang Zheng ⋅ Ziming Wang ⋅ Zhibin Wang ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Jing Liu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 405
EarlyTom: Early Token Compression Completes Fast Video Understanding
Hesong Wang ⋅ Xin Jin ⋅ Lu Lu ⋅ Chenhaowen Li ⋅ Jian Chen ⋅ Qiang Liu ⋅ Huan Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 406
VideoWorld 2: Learning Transferable Knowledge from Real-world Videos
Zhongwei Ren ⋅ Yunchao Wei ⋅ Xiao Yu ⋅ Guixun Luo ⋅ Yao Zhao ⋅ Bingyi Kang ⋅ Jiashi Feng ⋅ Xiaojie Jin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 407
VirtueBench: Evaluating Trustworthiness under Uncertainty in Long Video Understanding
Xueqing Yu ⋅ Bohan Li ⋅ Yan Li ⋅ Zhenheng Yang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 408
DiverseDiT: Towards Diverse Representation Learning in Diffusion Transformers
Mengping Yang ⋅ Stewart Tan ⋅ Binglei Li ⋅ Xiaomeng Yang ⋅ Hesen Chen ⋅ Hao Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 409
RenderFlow: Single-Step Neural Rendering via Flow Matching
Shenghao Zhang ⋅ Runtao Liu ⋅ Christopher Schroers ⋅ Yang Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 410
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
Yiyang Ma ⋅ Feng Zhou ⋅ Xuedan Yin ⋅ Pu Cao ⋅ Yonghao Dang ⋅ Jianqin Yin
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 411
Masked Region Transformer for Layered Image Generation and Editing at Scale
Zhicong Tang ⋅ Jingye Chen ⋅ Zhao Zhang ⋅ Mohan Zhou ⋅ Yuchi Liu ⋅ Yifan Pu ⋅ Yalong Bai ⋅ Ethan Smith ⋅ Yuhui Yuan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 412
DDT: Decoupled Diffusion Transformer
Shuai Wang ⋅ Zhi Tian ⋅ Weilin Huang ⋅ Limin Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 413
Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers
Wenhao Sun ⋅ Ji Li ⋅ Zhaoqiang Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 414
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
Zekai Luo ⋅ Zongze Du ⋅ Zhouhang Zhu ⋅ Hao Zhong ⋅ Muzhi Zhu ⋅ Wen Wang ⋅ Yuling Xi ⋅ Chenchen Jing ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 415
ShapeAR: Generating Editable Shape Layers via Autoregressive Diffusion
Souymodip Chakraborty ⋅ Ankur Singh ⋅ Amit Vikram Singh ⋅ Vineet Batra ⋅ Ankit Phogat
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 416
ReHyAt: Recurrent Hybrid Attention for Video Diffusion Transformers
Mohsen Ghafoorian ⋅ Amir Habibian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 417
RecTok: Reconstruction Distillation along Rectified Flow
Qingyu Shi ⋅ Size Wu ⋅ Jinbin Bai ⋅ Kaidong Yu ⋅ Yujing Wang ⋅ Yunhai Tong ⋅ Xiangtai Li ⋅ Xuelong Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 418
EgoXtreme: A Dataset for Robust Object Pose Estimation in Egocentric Views under Extreme Conditions
Taegyoon Yoon ⋅ Yegyu Han ⋅ Seojin Ji ⋅ Jaewoo Park ⋅ Sojeong Kim ⋅ Taein Kwon ⋅ Hyung-Sin Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 419
CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection
Zhaonian Kuang ⋅ Rui Ding ⋅ Haotian Wang ⋅ Xinhu Zheng ⋅ Meng Yang ⋅ Gang Hua
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 420
H^2A^2: Homogeneity-Aware and Heterogeneity-Aware Feature Perception for Unified Indoor 3D Object Detection
Tao Xie ⋅ Tao An ⋅ Feng Liu ⋅ Jin Wensheng ⋅ Zhengyu Li ⋅ lijun zhao ⋅ Ruifeng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 421
Cov2Pose: Leveraging Spatial Covariance for Direct Manifold-aware 6-DoF Object Pose Estimation
Nassim Ali Ousalah ⋅ Peyman Rostami ⋅ Vincent Gaudillière ⋅ Emmanuel Koumandakis ⋅ Anis Kacem ⋅ Enjie Ghorbel ⋅ Djamila Aouada
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 422
Towards Intrinsic-Aware Monocular 3D Object Detection
Zhihao Zhang ⋅ Abhinav Kumar ⋅ Xiaoming Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 423
SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection
Sandro Papais ⋅ lezhou feng ⋅ Charles Cossette ⋅ Lingting Ge
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 424
SPAN: Spatial-Projection Alignment for Monocular 3D Object Detection
Yifan Wang ⋅ Yian Zhao ⋅ Fanqi Pu ⋅ Xiaochen Yang ⋅ YANG TANG ⋅ Xi Chen ⋅ Wenming Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 425
DSCA: Dynamic Subspace Concept Alignment for Lifelong VLM Editing
Gyanendra Das ⋅ Sai Jena
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 426
FailureAtlas: Mapping the Failure Landscape of T2I Models via Active Exploration
Muxi Chen ⋅ Zhaohua Zhang ⋅ Chenchen Zhao ⋅ Mingyang Chen ⋅ Wenyu Jiang ⋅ Tianwen Jiang ⋅ Jianhuan Zhuo ⋅ Yu Tang ⋅ Qiuyong Xiao ⋅ Jihong Zhang ⋅ Qiang Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 427
HDR-VLM: HDR-Domain Adaptation of VLMs and Preference-Aligned Quality Assessment for HDR Video Color Grading
Hao Yuan ⋅ Jiabin Zhang ⋅ Yajing Wu ⋅ Ruixuan Pang ⋅ Jing Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 428
RobustVisRAG: Causality-Aware Vision-Based Retrieval-Augmented Generation under Visual Degradations
I-Hsiang (Aaron) Chen ⋅ Yu-Wei Liu ⋅ Tse-Yu Wu ⋅ Yu-Chien Chiang ⋅ Jen-Chieh Yang ⋅ Wei-Ting Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 429
BiomedCCPL: Causal Conditional Prompt Learning for Biomedical Vision-Language Models
Xueliang Cui ⋅ Juncai Zhang ⋅ Jiacheng Hou ⋅ Dan Lu ⋅ Hao Zhang ⋅ Ruxin Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 430
DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs
Yanbin Wei ⋅ Jiangyue Yan ⋅ Chun Kang ⋅ Yang Chen ⋅ Hua Liu ⋅ James Kwok ⋅ Yu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 431
VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes
Paul Gavrikov ⋅ Wei Lin ⋅ M. Jehanzeb Mirza ⋅ Soumya Jahagirdar ⋅ Muhammad Huzaifa ⋅ Sivan Doveh ⋅ James Glass ⋅ Serena Yeung ⋅ Hilde Kuehne
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 432
Revisiting Visual Corruptions in LVLMs: A Shape–Texture Perspective on Model Failures
Xinkuan Qiu ⋅ Meina Kan ⋅ Zhenliang He ⋅ Yongbin Zhou ⋅ Shiguang Shan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 433
From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing
Haoyuan Zhang ⋅ Keyao Wang ⋅ Guosheng Zhang ⋅ Haixiao Yue ⋅ Zhiwen Tan ⋅ Siran Peng ⋅ Tianshuo Zhang ⋅ Xiao Tan ⋅ Kunbin Chen ⋅ Wei He ⋅ Jingdong Wang ⋅ Ajian Liu ⋅ Xiangyu Zhu ⋅ Zhen Lei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 434
Trust-calibrated Collaborative Learning for Long-Tailed Visual Recognition
Hao Zhou ⋅ Tingjin Luo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 435
SunFaded: Illumination-Aware Gaussian Splatting for Dark Scenes with Camera-Mounted Active Lighting
Wenjie Chang ⋅ Tianle Ding ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 436
TokenSplat: Token-aligned 3D Gaussian Splatting for Feed-forward Pose-free Reconstruction
Yihui Li ⋅ Chengxin Lv ⋅ Zichen Tang ⋅ Hongyu Yang ⋅ Di Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 437
GOR-IS: 3D Gaussian Object Removal In the Intrinsic Space
Yonghao Zhao ⋅ Yupeng Gao ⋅ Jian Yang ⋅ Jin Xie ⋅ Beibei Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 438
AeroGS: Scale-Aware Gaussian Splatting for Pose-Free Dynamic UAV Scene Reconstruction
Tingyun Li ⋅ Xinyi Liu ⋅ Yongjun Zhang ⋅ Yi Wan ⋅ Xiaoan Liu ⋅ Weiwei Fan ⋅ Jiahao Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 439
Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting
Kaiqiang Xiong ⋅ Rui Peng ⋅ Jiahao Wu ⋅ Zhanke Wang ⋅ Jie Liang ⋅ Xiaoyun Zheng ⋅ Feng Gao ⋅ Ronggang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 440
AERGS-SLAM: Auto-Exposure-Robust Stereo 3D Gaussian Splatting SLAM
Zhiyu Zhou ⋅ Feng Hui ⋅ Yu Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 441
Learning Differentiable Hierarchies in 3D Gaussian Splatting
Youqi Pan ⋅ Wugen Zhou ⋅ Hongbin Zha
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 442
WeatherCity: Urban Scene Reconstruction with Controllable Multi-Weather Transformation
Wenhua Wu ⋅ Huai Guan ⋅ Zhe Liu ⋅ Hesheng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 443
Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images
Matias Turkulainen ⋅ Akshay Krishnan ⋅ Filippo Aleotti ⋅ Mohamed Sayed ⋅ Guillermo Garcia-Hernando ⋅ Juho Kannala ⋅ Arno Solin ⋅ Gabriel Brostow ⋅ Daniyar Turmukhambetov
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 444
TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking
Hanzhi Guo ⋅ dongdong weng ⋅ Mo Su ⋅ Yixiao Chen ⋅ Xiaonuo Dongye ⋅ Chenyu Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 445
Hierarchical Visual Relocalization with Nearest View Synthesis from Feature Gaussian Splatting
Huaqi Tao ⋅ Bingxi Liu ⋅ Guangcheng Chen ⋅ Fulin Tang ⋅ Li He ⋅ Hong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 446
Tracking-Guided 4D Generation: Foundation-Tracker Motion Priors for 3D Model Animation
Su Sun ⋅ Cheng Zhao ⋅ Himangi Mittal ⋅ Gaurav Mittal ⋅ Rohith Kukkala ⋅ Yingjie Chen ⋅ Mei Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 447
3D Gaussian Splatting from Unposed Spike Stream
Yijia Guo ⋅ Tong Hu ⋅ Liwen Hu ⋅ Lei Ma ⋅ Tiejun Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 448
SparseOIT: Improving Order-Independent Transparency 3DGS via Active Set Method
Wentao Yang ⋅ FanZhen KONG ⋅ Zejian Kang ⋅ Xiangru Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 449
ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction
Jie Liang ⋅ Jiahao Wu ⋅ Chao Wang ⋅ Jiayu Yang ⋅ Xiaoyun Zheng ⋅ Kaiqiang Xiong ⋅ Zhanke Wang ⋅ Jinbo Yan ⋅ Feng Gao ⋅ Ronggang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 450
Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping
Junmyeong Lee ⋅ Hoseung Choi ⋅ Minsu Cho
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 451
MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes
Wonjoon Lee ⋅ Sungmin Woo ⋅ Donghyeong Kim ⋅ Jungho Lee ⋅ Sangheon Park ⋅ Sangyoun Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 452
BEA-GS: BEyond RAdiance Supervision in 3DGS for Precise Object Extraction
Alessio Mazzucchelli ⋅ María Naranjo Almeida ⋅ Jorge Bustos Sanchez ⋅ Mariella Dimiccoli ⋅ Francesc Moreno-Noguer ⋅ Jordi Sanchez-Riera ⋅ Adrian Penate-Sanchez
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 453
EDGS: Eliminating Densification for Efficient Convergence of 3DGS
Dmytro Kotovenko ⋅ Olga Grebenkova ⋅ Björn Ommer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 454
ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
Sicheng Feng ⋅ Song Wang ⋅ Shuyi Ouyang ⋅ Lingdong Kong ⋅ Zikai Song ⋅ Jianke Zhu ⋅ Huan Wang ⋅ Xinchao Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 455
Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence
Kun Ouyang ⋅ Yuanxin Liu ⋅ Linli Yao ⋅ Yishuo Cai ⋅ Hao Zhou ⋅ Fandong Meng ⋅ Jie Zhou ⋅ Xu Sun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 456
DialogueVPR: Towards Conversational Visual Place Recognition
yukun Song ⋅ Changwei Wang ⋅ Xingtian Pei ⋅ Shibiao Xu ⋅ Wenhao Xu ⋅ Shunpeng Chen ⋅ Yu Zhang ⋅ Ke Zhang ⋅ Rongtao Xu ⋅ Xuxiang Feng ⋅ Pengyang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 457
Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning
Chi Zhang ⋅ Haibo Qiu ⋅ Qiming Zhang ⋅ Yufei Xu ⋅ Zhixiong Zeng ⋅ Siqi Yang ⋅ Peng Shi ⋅ Lin Ma ⋅ Jing Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 458
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
Jingqi Tong ⋅ Yurong Mou ⋅ Hangcheng Li ⋅ Mingzhe Li ⋅ Yongzhuo Yang ⋅ Ming Zhang ⋅ Qiguang Chen ⋅ Tianyi Liang ⋅ Xiaomeng Hu ⋅ Yining Zheng ⋅ Xinchi Chen ⋅ Jun Zhao ⋅ Xuanjing Huang ⋅ Xipeng Qiu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 459
VinQA: Visual Elements Interleaved Long-form Answer Generation for Real-World Multimodal Document QA
Young Rok Jang ⋅ Hyesoo Kong ⋅ Kyunghwan An ⋅ Jae Sub Huh ⋅ Gyeonghun KIM ⋅ Stanley Jungkyu Choi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 460
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
Hao Yan ⋅ Yuliang Liu ⋅ Xingchen Liu ⋅ Yuyi Zhang ⋅ Minghui Liao ⋅ Jihao Wu ⋅ Wei Chen ⋅ Xiang Bai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 461
Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress
Yuelin Zhang ⋅ Sijie Cheng ⋅ Chen Li ⋅ Zongzhao Li ⋅ Yuxin Huang ⋅ Yang Liu ⋅ Wenbing Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 462
VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Weitai Kang ⋅ Jason Kuen ⋅ Mengwei Ren ⋅ Zijun Wei ⋅ Yan Yan ⋅ Kangning Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 463
Grounding Everything in Tokens for Multimodal Large Language Models
Xiangxuan Ren ⋅ Zhongdao Wang ⋅ Liping Hou ⋅ Pin Tang ⋅ Guoqing Wang ⋅ Chao Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 464
Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
Ce Zhang ⋅ Jinxi He ⋅ Junyi He ⋅ Katia Sycara ⋅ Yaqi Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 465
ChartR: Evaluating Reasoning Accuracy and Robustness in Chart Question Answering
Xiaojun Chen ⋅ Sixiao Luo ⋅ Ziqi Liu ⋅ Min Yang ⋅ Qin Zhang ⋅ Liang-Jie Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 466
Think Visually, Reason Textually: Vision-Language Synergy in Abstract Reasoning
Beichen Zhang ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Yuhang Cao ⋅ Haodong Duan ⋅ Dahua Lin ⋅ Jiaqi Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 467
VKG-QA: Visual Knowledge Graph-based Question Answer for Large Multimodal Models
Yuntao Du ⋅ Yiming Wang ⋅ Renshuo Yuan ⋅ Jincheng Yue ⋅ Yijing Chen ⋅ Yue Fan ⋅ Bo Zhang ⋅ Qian Li ⋅ Lizhen Cui
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 468
Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning
Haozhen Gong ⋅ Xiaozhong Ji ⋅ Yuansen Liu ⋅ Wenbin Wu ⋅ Xiaoxiao Yan ⋅ jingjing liu ⋅ Kai WU ⋅ Jiazhen Pan ⋅ Bailiang Jian ⋅ Jiangning Zhang ⋅ Xiaobin Hu ⋅ Hongwei Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 469
Human-like Abstract Visual Reasoning via Understanding and Solving Reasoning Loop
Xinwang Chen ⋅ Xiuxing Li ⋅ Qing Li ⋅ Ziyue Zhuang ⋅ Yutong Wu ⋅ Ziyu Li ⋅ Zhuo Wang ⋅ Kai Li ⋅ Jianye Hao ⋅ Xia Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 470
VITAL: Vision-Encoder-centered Pre-training for LMMs in Visual Quality Assessment
Ziheng Jia ⋅ Linhan Cao ⋅ Jinliang Han ⋅ Zicheng Zhang ⋅ Jiaying Qian ⋅ Wang Jiarui ⋅ Zijian Chen ⋅ Guangtao Zhai ⋅ Xiongkuo Min
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 471
Generative Video Compression with One-Dimensional Latent Representation
Zihan Zheng ⋅ Zhaoyang Jia ⋅ Naifu Xue ⋅ Jiahao Li ⋅ Bin Li ⋅ Zongyu Guo ⋅ Xiaoyi Zhang ⋅ Zhenghao Chen ⋅ Houqiang Li ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 472
Markovian Scale Prediction: A New Era of Visual Autoregressive Generation
Yu Zhang ⋅ Jingyi Liu ⋅ Yiwei Shi ⋅ Qi Zhang ⋅ Duoqian Miao ⋅ Changwei Wang ⋅ Longbing Cao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 473
Learned Image Compression via Sparse Attention and Adaptive Frequency
Huidong Ma ⋅ Xinyan Shi ⋅ Hui Sun ⋅ Xiaofei Yue ⋅ xiaoguang Liu ⋅ Gang Wang ⋅ Wentong Cai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 474
UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders
Matthew Walmer ⋅ Saksham Suri ⋅ Anirud Aggarwal ⋅ Abhinav Shrivastava
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 475
VecAttention: Vector-wise Sparse Attention for Accelerating Long Context Inference
Anmin Liu ⋅ Ruixuan Yang ⋅ Huiqiang Jiang ⋅ Bin Lin ⋅ Minmin Sun ⋅ Yong Li ⋅ CHEN ZHANG ⋅ Tao Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 476
Ultra-Fast Neural Video Compression
Jiahao Li ⋅ Wenxuan Xie ⋅ Zhaoyang Jia ⋅ Bin Li ⋅ Zongyu Guo ⋅ Xiaoyi Zhang ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 477
Parallax to Align Them All: An OmniParallax Attention Mechanism for Distributed Multi-View Image Compression
Haotian Zhang ⋅ Feiyue Long ⋅ Yixin Yu ⋅ Jian Xue ⋅ Haocheng Tang ⋅ Tongda Xu ⋅ Zhenning Shi ⋅ Yan Wang ⋅ Siwei Ma ⋅ Jiaqi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 478
Scaling Parallel Sequence Models to Vision Foundation Models
Yitong Jiang ⋅ Collin McCarthy ⋅ Hongjun Wang ⋅ Hanrong Ye ⋅ Qi Dou ⋅ Tianfan Xue ⋅ Jinwei Gu ⋅ Jan Kautz ⋅ Danny Yin ⋅ Pavlo Molchanov ⋅ Sifei Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 479
Revisiting Model Stitching In the Foundation Model Era
Zheda Mai ⋅ Ke Zhang ⋅ Fu-En Wang ⋅ Zixiao Ken Wang ⋅ Albert Chen ⋅ Lu Xia ⋅ Min Sun ⋅ Wei-Lun Chao ⋅ Cheng-Hao Kuo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 480
GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics
Modi Jin ⋅ Yiming Zhang ⋅ Bo-Yuan Sun ⋅ Dingwen Zhang ⋅ Mingming Cheng ⋅ Qibin Hou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 481
VLM-Loc: Localization in Point Cloud Maps via Vision-Language Models
Shuhao Kang ⋅ Youqi Liao ⋅ Peijie Wang ⋅ Wenlong Liao ⋅ Qilin Zhang ⋅ Benjamin Busam ⋅ Xieyuanli Chen ⋅ Yun Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 482
HOLO: Homography-Guided Pose Estimator Network for Fine-Grained Visual Localization on SD Maps
Xuchang Zhong ⋅ Xu Cao ⋅ Jinke Feng ⋅ Hao Fang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 483
TriLite: Efficient Weakly Supervised Object Localization with Universal Visual Features and Tri-Region Disentanglement
Arian Sabaghi ⋅ Jose Oramas
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 484
GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings
Angel Daruna ⋅ Nicholas Meegan ⋅ Han-Pang Chiu ⋅ Supun Samarasekera ⋅ Rakesh “Teddy” Kumar
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 485
Towards Visual Query Localization in the 3D World
liang peng ⋅ Bohan Tan ⋅ Zhipeng Zhang ⋅ Haobo Li ⋅ Yifan Jiao ⋅ Xingping Dong ⋅ Libo Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 486
OVOD-Agent: A Markov–Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection
Chujie Wang ⋅ Jianyu Lu ⋅ Zhiyuan Luo ⋅ Xi Chen ⋅ Chu He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 487
Pixel2Phys: Distilling Governing Laws from Visual Dynamics
Ruikun Li ⋅ Jun Yao ⋅ Yingfan Hua ⋅ SHIXIANG TANG ⋅ Biqing Qi ⋅ Bin Liu ⋅ Wanli Ouyang ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 488
Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection
Zhanhe Lei ⋅ Zhongyuan Wang ⋅ Jikang Cheng ⋅ Baojin Huang ⋅ Yuhong Yang ⋅ Zhen Han ⋅ Chao Liang ⋅ Dengpan Ye
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 489
Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding
Junhan Chen ⋅ Zilu Zhou ⋅ Yujun Tong ⋅ Dongliang Chang ⋅ Yitao Luo ⋅ Zhanyu Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 490
Dynamic Important Example Mining for Reinforcement Finetuning
Haoru Tan ⋅ WU Sitong ⋅ Yanfeng Chen ⋅ Shizhen Zhao ⋅ Yangtian Sun ⋅ Tianjia Liu ⋅ Chirui Chang ⋅ Shaofeng Zhang ⋅ Xingwu Sun ⋅ Xiuzhe Wu ⋅ Ruobing Xie ⋅ Xiaojuan Qi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 491
Specificity-aware reinforcement learning for fine-grained open-world classification
Samuele Angheben ⋅ Davide Berasi ⋅ Alessandro Conti ⋅ Elisa Ricci ⋅ Yiming Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 492
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
Jitesh Jain ⋅ Jialuo Li ⋅ Zixian Ma ⋅ Jieyu Zhang ⋅ Chris Dongjoo Kim ⋅ Sangho Lee ⋅ Rohun Tripathi ⋅ Tanmay Gupta ⋅ Christopher Clark ⋅ Humphrey Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 493
Uncertainty-Aware Modality Fusion for Unaligned RGB-T Salient Object Detection
Mianzhao Wang ⋅ Fan Shi ⋅ Xu Cheng ⋅ Chen Jia ⋅ Shengyong Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 494
Fusion in Your Way: Aligning Image Fusion with Heterogeneous Demands via Direct Preference Optimization
Weijian Su ⋅ Songqian Zhang ⋅ Yuqi Han ⋅ Jian Zhuang ⋅ Yongdong Huang ⋅ Qiang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 495
More Than Meets the Eye: A Unified Image Fusion Framework via Semantic-Pixel Entropy Trade-off for Zero-Shot Generalization
Xiaowen Liu ⋅ Jing Li ⋅ Hongtao Huo ⋅ Haozhe Cao ⋅ Renhua Wang ⋅ Xu Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 496
Beyond Sequential Tools: A Unified VLM Agent System for Photographic Post-Processing via Dynamic Multi-Expert Fusion
Honglin Xiong ⋅ Chenjie Zhu ⋅ Jianbiao Ding ⋅ Zixuan Ni ⋅ Wei Li ⋅ Zhenpeng Mi ⋅ Qian Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 497
Multi-modal Frequency Decomposition Network for Semantic Scene Completion
Die Zuo ⋅ Lubo Wang ⋅ Ruonan Liu ⋅ Qing Guo ⋅ Chong Wang ⋅ Dongdong Wu ⋅ Wei Feng ⋅ Kairui Yang ⋅ Di Lin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 498
BiEvLight: Bi-level Learning of Task-Aware Event Refinement for Low-Light Image Enhancement
Zishu Yao ⋅ Xiang-Xiang Su ⋅ Shengning Zhou ⋅ Guang-Yong Chen ⋅ Guodong Fan ⋅ Xing Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 499
FusionRegister: Every Infrared and Visible Image Fusion Deserves Registration
Congcong Bian ⋅ HaoLong Ma ⋅ Hui Li ⋅ Zhongwei Shen ⋅ Xiaoqing Luo ⋅ Xiaoning Song ⋅ Xiao-Jun Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 500
OmniFood8K: Single-Image Nutrition Estimation via Hierarchical Frequency-Aligned Fusion
Dongjian Yu ⋅ Weiqing Min ⋅ Qian Jiang ⋅ Xing Lin ⋅ Xin Jin ⋅ Shuqiang Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 501
Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning
Yingkai Zhang ⋅ Tao Zhang ⋅ Jing Nie ⋅ Ying Fu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 502
LRHDR: Learning Representation-enhanced HDR Video Reconstruction
Chenzhuo Liao ⋅ Xin Chen ⋅ Bingchen Li ⋅ Yu Meng ⋅ Tao Yue ⋅ Xuemei Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 503
Cross-Domain Few-Shot Segmentation via Multi-view Progressive Adaptation
Jiahao Nie ⋅ Guanqiao Fu ⋅ Wenbin An ⋅ Yap-Peng Tan ⋅ Alex C. Kot ⋅ Shijian Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 504
Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment
Yaze Zhao ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 505
PP-Brep: Few-Shot B-rep Classification with Hybrid Graph Representation
Jiacheng Hao ⋅ Chunying Liu ⋅ Hao Guo ⋅ Ruohan Wang ⋅ Hongping Gan ⋅ Yilei Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 506
AgentDet: A Shared-Blackboard Multi-Agent Framework for Zero-/Few-Shot Object Detection
Haolin Li ⋅ Yaohua Wang ⋅ Ze Yan ⋅ Lijie Wen ⋅ Biqing Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 507
SFR-Net: Steering-Fusion-Refining Network in Multi-label Zero-Shot Sewer Defect Detection
Zhao-Min Chen ⋅ Xinjian Huang ⋅ Yisu Ge ⋅ Yu Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 508
Noise-Aware Few-Shot Learning through Bi-directional Multi-View Prompt Alignment
Lu Niu ⋅ Cheng Xue
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 509
Learnability-Guided Diffusion for Dataset Distillation
Jeffrey A. Chan-Santiago ⋅ Mubarak Shah
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 510
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Xiangyu Fan ⋅ Zesong Qiu ⋅ Zhuguanyu Wu ⋅ Fanzhou Wang ⋅ Zhiqian Lin ⋅ Tianxiang Ren ⋅ Dahua Lin ⋅ RUIHAO GONG ⋅ Lei Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 511
Progressive Mask Distillation for Self-supervised Video Representation
Kewei Wu ⋅ Chong Liang ⋅ Zhao Xie ⋅ Dan Guo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 512
HierAmp: Coarse-to-Fine Autoregressive Amplification for Generative Dataset Distillation
Lin Zhao ⋅ Xinru Jiang ⋅ Xi Xiao ⋅ Qihui Fan ⋅ Lei Lu ⋅ Yanzhi Wang ⋅ Xue Lin ⋅ OCTAVIA CAMPS ⋅ Pu Zhao ⋅ Jianyang Gu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 513
SpiderCam: Low-Power Snapshot Depth from Differential Defocus
Marcos A. Ferreira ⋅ Tianao Li ⋅ John Mamish ⋅ Josiah Hester ⋅ Yaman Sangar ⋅ Qi Guo ⋅ Emma Alexander
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 514
Computational Speckle Pattern Interferometry
Shengxi Wu ⋅ Sophia Yang ⋅ Dorian Chan ⋅ Matthew O’Toole
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 515
DetectSCI: Toward Object-Guided ROI Reconstruction for High-Resolution Video Snapshot Compressive Imaging
Xingjian Jiang ⋅ Lishun Wang ⋅ Ping Wang ⋅ Xin Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 516
Solving a Nonlinear Blind Inverse Problem for Tagged MRI with Physics and Deep Generative Priors
Zhangxing Bian ⋅ Shuwen Wei ⋅ Samuel W. Remedios ⋅ Junyu Chen ⋅ Aaron Carass ⋅ Blake E. Dewey ⋅ Jerry L Prince
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 517
Nonlinear Color Transfer via Learnable Bezier Flows
Junhyoung Lee ⋅ Seongwoon Jo ⋅ JeongHun Park ⋅ Yeonji Ryou ⋅ Jeongha Yang ⋅ Jangho Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 518
VT-Intrinsic: Physics-Based Decomposition of Reflectance and Shading using a Single Visible-Thermal Image Pair
Zeqing Yuan ⋅ Mani Ramanagopal ⋅ Aswin C. Sankaranarayanan ⋅ Srinivasa G. Narasimhan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 519
GH-NAF: Grid-Adaptive Hash-Level–Attended Neural Attenuation Fields for Discrepancy-Aware CBCT
Seong Je Oh ⋅ Ju Hwan Lee ⋅ Chae Yeon Lim ⋅ Donghwan Lee ⋅ Myung Jin Ching ⋅ Kyungsu Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 520
Computer Vision with a Superpixelation Camera
Sasidharan Mahalingam ⋅ Rachel Brown ⋅ Atul Ingle
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 521
Color-Encoded Illumination for High-Speed Volumetric Scene Reconstruction
David Novikov ⋅ Eilon Vaknin ⋅ Narek Tumanyan ⋅ Mark Sheinin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 522
Multi-Scale Gradient-Guided Unrolling Architecture with Adaptive Mamba for Compressive Sensing
Le Yang ⋅ Hongping Gan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 523
Deciphering Genotype-Phenotype Mechanisms from High-Content Profiling via Knowledge-Guided Multi-modal Graph Learning
Hanjing Lin ⋅ Jiahua Rao ⋅ Youhan Sun ⋅ Jiancong Xie ⋅ Yuedong Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 524
Bulk RNA-seq Guided Multi-modal Detection of Anomalous Regions in Human Cancer via Spatial Transcriptomics
Hang Shi ⋅ Ruocheng Yang ⋅ Wenjie You ⋅ Zhilin Huang ⋅ Daoqiang Zhang ⋅ WEI SHAO
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 525
Intervention-Aware Multiscale Representation Learning from Imaging Phenomics and Perturbation Transcriptomics
Jiayuan Chen ⋅ Ruoqi Liu ⋅ Zishan Gu ⋅ Ping Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 526
ParaUni: Enhance Generation in Unified Multimodal Model with Reinforcement-driven Hierarchical Parallel Information Interaction
Jiangtong Tan ⋅ Lin Liu ⋅ Jie Huang ⋅ Xiaopeng Zhang ⋅ Qi Tian ⋅ Feng Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 527
PhysVid: Physics Aware Local Conditioning for Generative Video Models
Saurabh Pathak ⋅ Elahe Arani ⋅ Mykola Pechenizkiy ⋅ Bahram Zonooz
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 528
PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment
Suhyeon Lee ⋅ Jong Chul
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 529
EvoID: Reinforced Evolution for Identity-Preserving Video Generation
Yiheng Zhang ⋅ Zhaofan Qiu ⋅ Zunxu Liu ⋅ Yingwei Pan ⋅ Ting Yao ⋅ Tao Mei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 530
Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning
Yuxuan Gu ⋅ Weimin Bai ⋅ Yifei Wang ⋅ Weijian Luo ⋅ He Sun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 531
PhyCo: Learning Controllable Physical Priors for Generative Motion
Sriram Narayanan ⋅ Ziyu Jiang ⋅ Srinivasa G. Narasimhan ⋅ Manmohan Chandraker
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 532
Unified Multimodal Models as Auto-Encoders
Zhiyuan Yan ⋅ Kaiqing Lin ⋅ Hao Li ⋅ Junyan Ye ⋅ Hui Han ⋅ Haochen Wang ⋅ Zhendong Wang ⋅ Bin Lin ⋅ Li Hao ⋅ Xinyan Xiao ⋅ Jingdong Wang ⋅ Haifeng Wang ⋅ Li Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 533
Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models
Shiran Ge ⋅ Chenyi Huang ⋅ Yuang Ai ⋅ Qihang Fan ⋅ Huaibo Huang ⋅ Ran He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 534
ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference
Ali Hojjat ⋅ Janek Haberer ⋅ Sören Pirk ⋅ Olaf Landsiedel
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 535
Drainage: A Unifying Framework for Addressing Class Uncertainty
Yasser Taha ⋅ Grégoire Montavon ⋅ Nils Körber
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 536
Neural Differentiation in Deep Networks: A Theoretical Framework for Expressivity and Representational Diversity
Boyuan Wang ⋅ Richard Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 537
DuetMerging: Synergizing Dynamic and Static Strategies for Mitigating Task Interference in Model Merging
Yan Li ⋅ Guiping Cao ⋅ Yaguang Song ⋅ Ming Tao ⋅ Haoran Gong ⋅ Junhui Liu ⋅ Yaowei Wang ⋅ Dongmei Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 538
SASNet: Spatially-Adaptive Sinusoidal Networks for INRs
Haoan Feng ⋅ Diana Aldana ⋅ Tiago Novello ⋅ Leila De Floriani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 539
Generative Modeling of Weights: Generalization or Memorization?
Boya Zeng ⋅ Yida Yin ⋅ Zhiqiu Xu ⋅ Zhuang Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 540
Vision-Oriented Lightweight Neural Architecture Search with Budget-Adaptive Evaluation
Yi Fan ⋅ Yu-Bin Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 541
Improving Sparse Autoencoder with Dynamic Attention
Dongsheng Wang ⋅ Jinsen Zhang ⋅ Dawei Su ⋅ Hui Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 542
Stepwise Credit Assignment for GRPO on Flow-Matching Models
Yash Savani ⋅ Branislav Kveton ⋅ Yuchen Liu ⋅ Yilin Wang ⋅ Jing Shi ⋅ Subhojyoti Mukherjee ⋅ Nikos Vlassis ⋅ Krishna Kumar Singh
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 543
FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models
Yucheng Xie ⋅ Fu Feng ⋅ Ruixiao Shi ⋅ Jianlu Shen ⋅ Jing Wang ⋅ Yong Rui ⋅ Xin Geng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 544
Hyperbolic Busemann Neural Networks
Ziheng Chen ⋅ Bernhard Schölkopf ⋅ Nicu Sebe
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 545
FlowDIS: Language-Guided Dichotomous Image Segmentation with Flow Matching
Andranik Sargsyan ⋅ Shant Navasardyan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 546
Image-to-Point Cloud Feature Back-Projection for Multimodal Training of 3D Semantic Segmentation
Jiawei Han ⋅ Matteo Poggi ⋅ HUAN LI ⋅ Changshuo Wang ⋅ Kaiqi Liu ⋅ Wei Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 547
NG-GS: NeRF-guided 3D Gaussian Splatting Segmentation
Yi He ⋅ Tao Wang ⋅ Yi Jin ⋅ Congyan Lang ⋅ Yidong Li ⋅ Haibin Ling
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 548
Teaching DINOv3 About Partial 3D Geometry: A Self-Supervised Geometry-Aware Approach
Viktoria Ehm ⋅ Dongliang Cao ⋅ Riccardo Marin ⋅ Daniel Scholz ⋅ Weikang Wang ⋅ Florian Bernard ⋅ Daniel Cremers
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 549
SemLayer: Semantic-aware Generative Segmentation and Layer Construction for Abstract Icons
Haiyang Xu ⋅ Ronghuan Wu ⋅ Li-Yi Wei ⋅ Nanxuan Zhao ⋅ Chenxi Liu ⋅ Cuong Nguyen ⋅ Zhuowen Tu ⋅ Zhaowen Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 550
MatchED: Crisp Edge Detection Using End-to-End, Matching-based Supervision
bedrettin cetinkaya ⋅ Sinan Kalkan ⋅ Emre Akbas
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 551
SegGBC: Justifiable Coarse-to-Fine Granular-Ball Computing for Enhancing Clustering Image Segmentation
Qianpeng Chong ⋅ Wenyi Zeng ⋅ Xiuxuan Shen ⋅ Jiajie Li ⋅ Qian Yin ⋅ Xin Zheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 552
Seeing Beyond: Extrapolative Domain Adaptive Panoramic Segmentation
Yuanfan Zheng ⋅ Kunyu Peng ⋅ Xu Zheng ⋅ Kailun Yang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 553
MatchMask: Mask-Centric Generative Data Augmentation for Label-Scarce Semantic Segmentation
Yuqi Lin ⋅ Hao Zhang ⋅ Wenqi Shao ⋅ Shiqu Liu ⋅ Zhihong Gu ⋅ Wenxiao Wang ⋅ Xiaofei He ⋅ Kaipeng Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 554
Boundary-Responsive Differentiable Gating for Superpixel-Based Segmentation
Fatmaelzahraa Ali Ahmed ⋅ Zhihe Lu ⋅ Gianni Di ⋅ Diram Tabaa ⋅ Mohamed Hamdy ⋅ Muraam Abdel-Ghani ⋅ Abdulaziz Al-Ali ⋅ Muhammad Arsalan ⋅ Shidin Balakrishnan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 555
Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation
Yunkai Yang ⋅ Yudong Zhang ⋅ Kunquan Zhang ⋅ Jinxiao Zhang ⋅ Xinying Chen ⋅ Haohuan Fu ⋅ Runmin Dong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 556
FUSAR-GPT: A Spatiotemporal Feature-Embedded and Two-Stage Decoupled Visual Language Model for SAR Imagery
Xiaokun Zhang ⋅ Yi Yang ⋅ Ziqi Ye ⋅ Baiyun Baiyun ⋅ Xiaorong Guo ⋅ Qingchen Fang ⋅ Ry Zhang ⋅ Xinpeng Zhou ⋅ Haipeng Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 557
UniChange: Unifying Change Detection with Multimodal Large Language Model
Xu Zhang ⋅ Danyang Li ⋅ Xiaohang Dong ⋅ Tianhao Wu ⋅ Hualong Yu ⋅ Jianye Wang ⋅ Qicheng Li ⋅ Xiang Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 558
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Jeremy A. Irvin ⋅ Jiaqi Han ⋅ Zikui Wang ⋅ Abdulaziz Alharbi ⋅ Yufei Zhao ⋅ Nomin-Erdene Bayarsaikhan ⋅ Daniele Visioni ⋅ Andrew Y. Ng ⋅ Duncan Watson-Parris
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 559
See What We Cannot See: A Geo-guided Reasoning Benchmark for Object Counting under Adverse Earth Observation Conditions
Jiayi Wang ⋅ Zhihong Tan ⋅ Hongchen Wei ⋅ Daiqing Yang ⋅ Zhenzhong Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 560
MM-OVSeg: Multimodal Optical–SAR Fusion for Open-Vocabulary Segmentation in Remote Sensing
YIMIN WEI ⋅ Aoran Xiao ⋅ Hongruixuan Chen ⋅ Junshi Xia ⋅ Naoto Yokoya
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 561
RECS4R: Bridging Semantics and Geometry for Referring Remote Sensing Interpretation
Jinming Chai ⋅ Lingling Li ⋅ Licheng Jiao ⋅ Xiaoqiang Lu ⋅ Long Sun ⋅ Xu Liu ⋅ Wenping Ma ⋅ Weibin Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 562
Fourier Angle Alignment for Oriented Object Detection in Remote Sensing
Changyu Gu ⋅ Linwei Chen ⋅ Lin Gu ⋅ Ying Fu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 563
Learning to Infer Parameterized Representations of Plants from 3D Scans
Samara Ghrer ⋅ Christophe Godin ⋅ Stefanie Wuhrer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 564
Good Can Sometimes be Bad: A Unified Attack against 3D Point Cloud Classifier by a Flexible Isotropic Resampling
linkun fan ⋅ Jiahao Zhang ⋅ JunTao Zhang ⋅ Lei Zhang ⋅ Fazhi He ⋅ Daojun Han
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 565
V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs
Sen Nie ⋅ Jie Zhang ⋅ Jianxin Yan ⋅ Shiguang Shan ⋅ Xilin Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 566
FeatureFool: Zero-Query Fooling of Video Models via Feature Map
Duoxun Tang ⋅ Xi Xiao ⋅ Guangwu Hu ⋅ Kangkang Sun ⋅ Xiao Yang ⋅ Dongyang Chen ⋅ Qing Li ⋅ Yongjie Yin ⋅ Jiyao Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 567
RankOOD - Class Ranking-based Out-of-Distribution Detection
Dishanika Denipitiyage ⋅ Naveen Karunanayake ⋅ Suranga Seneviratne ⋅ Sanjay Chawla
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 568
AdvFM: Lookahead Flow-Matching Velocity-Field Attacks for Imperceptible and Transferable Adversarial Examples
Runze Liu ⋅ Zeyue Wang ⋅ Fanghui Sun ⋅ Rui Liu ⋅ Yihan Yan ⋅ Shen Wang ⋅ Zhaoyang Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 569
The Power of Decaying Steps: Enhancing Attack Stability and Transferability for Sign-based Optimizers
Wei Tao ⋅ Yang Dai ⋅ Jincai Huang ⋅ Qing Tao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 570
Your Classifier Can Do More: Towards Balancing the Gaps in Classification, Robustness, and Generation
kaichao jiang ⋅ He Wang ⋅ Xiaoshuai Hao ⋅ Xiulong Yang ⋅ Ajian Liu ⋅ Qi Chu ⋅ Yunfeng Diao ⋅ Richang Hong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 571
Learning Mutual View Information Graph for Adaptive Adversarial Collaborative Perception
Yihang Tao ⋅ Senkang Hu ⋅ Haonan An ⋅ Zhengru Fang ⋅ Hangcheng Cao ⋅ Yuguang Fang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 572
Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning
Hao Zhou ⋅ Tiru Wu ⋅ yan jiang ⋅ Wanqi Zhou ⋅ Junxing Hu ⋅ Ai Han
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 573
Omni-Attack: Adversarial Attacks on Open-Ended VQA in Black-Box Multimodal LLMs
Kai Hu ⋅ Weichen Yu ⋅ Li Zhang ⋅ Alexander Robey ⋅ Andy Zou ⋅ Haoqi Hu ⋅ Chengming Xu ⋅ Matt Fredrikson
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 574
CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning
Jiange Yang ⋅ tom tomlinson ⋅ Haoyi Zhu ⋅ Mingyu Liu ⋅ Kaijing Ma ⋅ Yating Wang ⋅ Gangshan Wu ⋅ Tong He ⋅ Limin Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 575
Δynamics: Language-Based Representation for Inferring Rigid-Body Dynamics From Videos
Chia-Hsiang Kao ⋅ Cong Phuoc Huynh ⋅ Chien-Yi Wang ⋅ Noranart Vesdapunt ⋅ Stefan Stojanov ⋅ Bharath Hariharan ⋅ Oleksandr Obiednikov ⋅ Ning Zhou
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 576
PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations
Mingqi Yuan ⋅ Tao Yu ⋅ Haolin Song ⋅ Bo Li ⋅ Xin Jin ⋅ Hua Chen ⋅ Wenjun Zeng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 577
Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols
Xianchao Zeng ⋅ Xinyu Zhou ⋅ Youcheng Li ⋅ Jiayou Shi ⋅ Tianle Li ⋅ Liangming Chen ⋅ Lei Ren ⋅ Yonglu Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 578
RealVLG-R1: A Large-Scale Real-World Visual-Language Grounding Benchmark for Robotic Perception and Manipulation
Linfei Li ⋅ Lin Zhang ⋅ Ying Shen
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 579
GeCo-SRT: Geometry-aware Continual Adaptation for Cross-Task Sim-to-Real Transfer
Wenbo Yu ⋅ Wenke Xia ⋅ Weitao Zhang ⋅ Di Hu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 580
ActiveGrasp: Information-Guided Active Grasping with Calibrated Energy-based Model
Boshu Lei ⋅ Wen Jiang ⋅ Kostas Daniilidis
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 581
BiPreManip: Learning Affordance-Based Bimanual Pre-Manipulation through Anticipatory Collaboration
Yan Shen ⋅ Feng Jiang ⋅ Zichen He ⋅ Xiaoqi Li ⋅ Yuchen Liu ⋅ Zhiyu Li ⋅ Ruihai Wu ⋅ Hao Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 582
Learning Surgical Robotic Manipulation with 3D Spatial Priors
Yu Sheng ⋅ Lidian Wang ⋅ Xiaomeng Chu ⋅ Jiajun Deng ⋅ Min Cheng ⋅ Yanyong Zhang ⋅ Bei Hua ⋅ Houqiang Li ⋅ Jianmin Ji
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 583
SimRecon: SimReady Compositional Scene Reconstruction from Real Videos
Chong Xia ⋅ Kai Zhu ⋅ Zizhuo Wang ⋅ Fangfu Liu ⋅ Zhizheng Zhang ⋅ Yueqi Duan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 584
STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation
Hao Ren ⋅ Zetong Bi ⋅ Yiming Zeng ⋅ Zhaoliang Wan ⋅ Lu Qi ⋅ Hui Cheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 585
RaUF: Learning the Spatial Uncertainty Field of Radar
Shengpeng Wang ⋅ Kuangyu Wang ⋅ Wei Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 586
SIR: Structured Image Representations for Explainable Robot Learning
Paul Mattes ⋅ Jan Schwab ⋅ Jens Bosch ⋅ Maximilian Li ⋅ Nils Blank ⋅ Minh-Trung Tang ⋅ Moritz Haberland ⋅ Rudolf Lioutikov
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 587
Instance-level Visual Active Tracking with Occlusion-Aware Planning
Haowei Sun ⋅ Kai Zhou ⋅ Hao Gao ⋅ Shiteng Zhang ⋅ Jinwu Hu ⋅ Xutao Wen ⋅ Qixiang Ye ⋅ Mingkui Tan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 588
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
Yi Yang ⋅ Xueqi Li ⋅ Yiyang Chen ⋅ Jin Song ⋅ Yihan Wang ⋅ Zipeng Xiao ⋅ Jiadi Su ⋅ You Qiaoben ⋅ Pengfei Liu ⋅ Zhijie Deng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 589
AnthroTAP: Learning Point Tracking with Real-World Motion
Inès Hyeonsu Kim ⋅ Seokju Cho ⋅ Jahyeok Koo ⋅ Junghyun Park ⋅ Gabriel Huang ⋅ Honglak Lee ⋅ Joon-Young Lee ⋅ Seungryong Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 590
Tracking by Predicting 3-D Gaussians Over Time
Tanish Baranwal ⋅ Himanshu Singh Singh ⋅ Jathushan Rajasegaran ⋅ Jitendra Malik
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 591
Toward Low-Cost yet Effective Temporal Learning for UAV Tracking
chaocan xue ⋅ Qihua Liang ⋅ Bineng Zhong ⋅ Yanting Zu ⋅ Yuanliang Xue ⋅ Haiying Xia ⋅ Shuxiang Song
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 592
Rethinking Two-Stage Referring-by-Tracking in Referring Multi-Object Tracking: Make it Strong Again
Weize Li ⋅ Yunhao Du ⋅ Qixiang Yin ⋅ Zhicheng Zhao ⋅ Fei Su
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 593
Occlusion-Aware SORT: Observing Occlusion for Robust Multi-Object Tracking
Chunjiang Li ⋅ Jianbo Ma ⋅ Li Shen ⋅ Yanru Chen ⋅ Liangyin Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 594
CoWTracker: Tracking by Warping instead of Correlation
Zihang Lai ⋅ Eldar Insafutdinov ⋅ Edgar Sucar ⋅ Andrea Vedaldi
[ Slides
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 595
Learning Long-term Motion Embeddings for Efficient Kinematics Generation
Nick Stracke ⋅ Kolja Bauer ⋅ Stefan Andreas Baumann ⋅ Miguel Ángel Bautista ⋅ Joshua Susskind ⋅ Björn Ommer
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 596
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
Jiahao Wang ⋅ Yufeng Yuan ⋅ Rujie Zheng ⋅ Youtian Lin ⋅ Jian Gao ⋅ Lin-Zhuo Chen ⋅ Yajie Bao ⋅ Chang Zeng ⋅ Yanxi Zhou ⋅ Xiaoxiao Long ⋅ Hao Zhu ⋅ Zhaoxiang Zhang ⋅ Xun Cao ⋅ Yao Yao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 597
Beyond Explicit Language: Plug-and-Play Visual-to-Linguistic Modeling Toward General Object Tracking
Kaiyang Lan ⋅ Ying Cui ⋅ Chenchen Jing ⋅ Jianwei Zheng ⋅ Dongyan Guo
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 598
FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants
Mahesh Bhosale ⋅ Abdul Wasi Lone ⋅ Shantam Srivastava ⋅ Shifa Latif ⋅ Tianyu Luan ⋅ Mingchen Gao ⋅ David Doermann ⋅ Xuan Gong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 599
InvCoSS: Inversion-driven Continual Self-supervised Learning in Medical Multi-modal Image Pre-training
Zihao Luo ⋅ Shaohao Rui ⋅ Zhenyu Tang ⋅ Guotai Wang ⋅ Xiaosong Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 600
PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting
Danyal Maqbool ⋅ Changhee Lee ⋅ Zachary Huemann ⋅ Samuel D. Church ⋅ Matthew E. Larson ⋅ Scott B. Perlman ⋅ Tomas A. Romero ⋅ Joshua D. Warner ⋅ Meghan Lubner ⋅ Xin Tie ⋅ Jameson Merkow ⋅ Junjie Hu ⋅ Steve Y. Cho ⋅ Tyler J. Bradshaw
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 601
From Panel to Pixel: Zoom-In Vision–Language Pretraining from Biomedical Scientific Literature
Kun yuan ⋅ Min Woo ⋅ Zhen Chen ⋅ Alejandro Lozano ⋅ Xiangteng He ⋅ Shi Li ⋅ Nassir Navab ⋅ Xiaoxiao Sun ⋅ Nicolas Padoy ⋅ Serena Yeung
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 602
LEMON: A Large Endoscopic MONocular Dataset and Foundation Model for Perception in Surgical Settings
chengan che ⋅ Chao Wang ⋅ Tom Vercauteren ⋅ Sophia Tsoka ⋅ Luis Carlos Garcia Peraza Herrera
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 603
D2T2 - Multimodal Automated Planning for Brachytherapy
Lance C. Moore ⋅ Aranyo Mitra ⋅ Ryan Truong ⋅ Karoline Kallis ⋅ Kelly Kisling ⋅ Sandra M. Meyers ⋅ Nuno Vasconcelos
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 604
TopoCL: Topological Contrastive Learning for Medical Imaging
Guangyu Meng ⋅ Pengfei Gu ⋅ Peixian Liang ⋅ John P. Lalor ⋅ Erin Wolf Chambers ⋅ Danny Z. Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 605
Diffusion with a Linguistic Compass: Steering the Generation of Clinically Plausible Future sMRI Representations for Early MCI Conversion Prediction
Zhihao Tang ⋅ Chaozhuo Li ⋅ Litian Zhang ⋅ Xi Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 606
Personalized Longitudinal Medical Report Generation via Temporally-Aware Federated Adaptation
He Zhu ⋅ Ren Togo ⋅ Takahiro Ogawa ⋅ Kenji Hirata ⋅ Minghui Tang ⋅ Takaaki Yoshimura ⋅ Hiroyuki Sugimori ⋅ Noriko Nishioka ⋅ Yukie Shimizu ⋅ Kohsuke Kudo ⋅ Miki Haseyama
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 607
Decoding 3D Perception via BrainSSD: Synergistic Fusion of EEG Representations from Static and Dynamic Visual Streams
Yincheng Yao ⋅ Enze Shi ⋅ Shu Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 608
Duala: Dual-Level Alignment of Subjects and Stimuli for Cross-Subject fMRI Decoding
Shumeng Li ⋅ Jintao Guo ⋅ Jian Zhang ⋅ Yulin Zhou ⋅ Luyang Cao ⋅ Yinghuan Shi
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 609
OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks
Zhihao Peng ⋅ Cheng Wang ⋅ Shengyuan Liu ⋅ Zhiying Liang ⋅ Zanting Ye ⋅ Min Jie Ju ⋅ Peter YM Woo ⋅ Yixuan Yuan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 610
Beyond Pixel Simulation: Pathology Image Generation via Diagnostic Semantic Tokens and Prototype Control
Minghao Han ⋅ Yichen Liu ⋅ Yizhou Liu ⋅ Zizhi Chen ⋅ Jingqun Tang ⋅ Xuecheng Wu ⋅ Dingkang Yang ⋅ Lihua Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 611
MedFG-VQA: Low-Frequency Memory and Graph Attention for Lightweight Medical VQA
haowen gu ⋅ Gensheng Pei ⋅ Zeren Sun ⋅ Mingwu Ren ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Fumin Shen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 612
FISHuman: Fine-grained Single-image 3D Human Reconstruction via Multi-view 4D Remeshing
Hanxi Liu ⋅ Yifang Men ⋅ Zhouhui Lian
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 613
DuoMo: Dual Motion Diffusion for World-Space Human Reconstruction
Yufu Wang ⋅ Evonne Ng ⋅ Soyong Shin ⋅ Rawal Khirodkar ⋅ Yuan Dong ⋅ Zhaoen Su ⋅ Jinhyung Park ⋅ Kris Kitani ⋅ Alexander Richard ⋅ Fabian Prada ⋅ Michael Zollhoefer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 614
RAM: Recover Any 3D Human Motion in-the-Wild
Sen Jia ⋅ Ning Zhu ⋅ Jinqin Zhong ⋅ Jiale Zhou ⋅ Huaping Zhang ⋅ Jenq-Neng Hwang ⋅ Lei Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 615
From 2D Alignment to 3D Plausibility: Unifying Heterogeneous 2D Priors and Penetration-Free Diffusion for Occlusion-Robust Two-Hand Reconstruction
Gaoge Han ⋅ Yongkang Cheng ⋅ Zhe Chen ⋅ Shaoli Huang ⋅ Tongliang Liu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 616
MV-Fashion: Towards Enabling Virtual Try-On and Size Estimation with Multi-View Paired Data
Hunor Laczko ⋅ Libang Jia ⋅ Loc-Phat Truong ⋅ Diego Hernández ⋅ Sergio Escalera ⋅ Jordi Gonzàlez ⋅ Meysam Madadi
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 617
Forecasting 3D Scanpaths in Egocentric Video
Fiona Ryan ⋅ Ishwarya Ananthabhotla ⋅ Yijun Qian ⋅ Judy Hoffman ⋅ James M. ⋅ Vamsi Krishna Ithapu ⋅ Calvin Murdock
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 618
M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
Fan Junqiao ⋅ Yunjiao Zhou ⋅ Yizhuo Yang ⋅ Xinyuan Cui ⋅ Jiarui Zhang ⋅ Lihua Xie ⋅ Jianfei Yang ⋅ Chris Xiaoxuan Lu ⋅ Fangqiang Ding
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 619
ReGenHOI: Unifying Reconstruction and Generation for 3D Human–Object Interaction Understanding
miao xu ⋅ Xiangyu Zhu ⋅ Zidu Wang ⋅ XUSHENG LIANG ⋅ Bao Li ⋅ Jinlin Wu ⋅ Zelin Zang ⋅ Zhen Lei
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 620
Through the Frequency Lens: Cross-Domain Generalisable Gaze Estimation with Adaptive Modulation
Yang Xu ⋅ Yiwei Bao ⋅ Feng Lu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 621
Mocap-2-to-3: Multi-view Lifting for Monocular Motion Recovery with 2D Pretraining
Zhumei Wang ⋅ Zechen Hu ⋅ Ruoxi Guo ⋅ Huaijin Pi ⋅ Ziyong Feng ⋅ Liang Zhang ⋅ Mingtao Pei ⋅ Siyuan Huang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 622
SHands: A Multi-View Dataset and Benchmark for Surgical Hand-Gesture and Error Recognition Toward Medical Training
Le Ma ⋅ Thiago Freitas dos Santos ⋅ Nadia Magnenat-Thalmann ⋅ Katarzyna Wac
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 623
Beyond Static Frames: Temporal Aggregate-and-Restore Vision Transformer for Human Pose Estimation
Hongwei Fang ⋅ Jiahang Cai ⋅ Xun Wang ⋅ Wenwu Yang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 624
IMU-HOI: A Symbiotic Framework for Coherent Human-Object Interaction and Motion Capture via Contact-Conscious Inertial Fusion
Lizhou Lin ⋅ Songpengcheng Xia ⋅ Zengyuan Lai ⋅ Lan Sun ⋅ Jiarui Yang ⋅ Ling Pei
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 625
Learning Forgery-Aware Lip Representations Without Forgery Priors
Bofan Chen ⋅ Hongyu Zhu ⋅ Yi He ⋅ Sichu Liang ⋅ Shilin Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 626
Beyond [CLS] Token: Query-Driven Token-Level Forgery Purification for Generalizable Deepfake Detection
Wang Changshuo ⋅ Jiangming Wang ⋅ Ke-Yue Zhang ⋅ Taiping Yao ⋅ Shouhong Ding ⋅ Shunli Wang ⋅ Ran Yi ⋅ Lizhuang Ma
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 627
GEM-TFL: Bridging Weak and Full Supervision for Forgery Localization through EM-Guided Decomposition and Temporal Refinement
Xiaodong Zhu ⋅ Yuanming Zheng ⋅ Suting Wang ⋅ Junqi Yang ⋅ Yuhong Yang ⋅ Weiping Tu ⋅ Zhongyuan Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 628
TokenTrace: Multi-Concept Attribution through Watermarked Token Recovery
Li Zhang ⋅ Shruti Agarwal ⋅ John Collomosse ⋅ Pengtao Xie ⋅ Vishal Asnani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 629
Unleashing Vision-Language Semantics for Deepfake Video Detection
Jiawen Zhu ⋅ Yunqi Miao ⋅ Xueyi Zhang ⋅ Jiankang Deng ⋅ Guansong Pang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 630
A Difference-in-Difference Approach to Detecting AI-Generated Images
Xinyi Qi ⋅ Kai Ye ⋅ Chengchun Shi ⋅ Ying Yang ⋅ Jin Zhu ⋅ Hongyi Zhou
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 631
RDFace: A Benchmark Dataset for Rare Disease Facial Image Analysis under Extreme Data Scarcity and Phenotype-Aware Synthetic Generation
Ganlin Feng ⋅ Yuxi Long ⋅ Hafsa Moontari Ali ⋅ Erin Lou ⋅ Fahad Butt ⋅ Qian Liu ⋅ Yang Wang ⋅ Pingzhao Hu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 632
ActivityForensics: A Comprehensive Benchmark for Localizing Manipulated Activity in Videos
Peijun Bao ⋅ Anwei Luo ⋅ Gang Pan ⋅ Alex C. Kot ⋅ Xudong Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 633
Zero-shot Detection of AI-Generated Image via RAW-RGB Alignment
Haiwei Wu ⋅ Fengpeng Li ⋅ Zhilin Tu ⋅ Yuanman Li ⋅ Xiong Li ⋅ Jiantao Zhou
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 634
Scaling Up AI-Generated Image Detection with Generator-Aware Prototypes
Ziheng Qin ⋅ Yuheng Ji ⋅ Renshuai Tao ⋅ Yuxuan Tian ⋅ Yuyang Liu ⋅ Yipu Wang ⋅ Xiaolong Zheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 635
Investigating Self-Supervised Representations for Audio-Visual Deepfake Detection
Dragos-Alexandru Boldisor ⋅ Stefan Smeu ⋅ Dan Oneata ⋅ Elisabeta Oneata
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 636
TIACam: Text-Anchored Invariant Feature Learning with Auto-Augmentation for Camera-Robust Zero-Watermarking
Abdullah All Tanvir ⋅ Agnibh Dasgupta ⋅ Xin Zhong
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 637
FastRef: Fast Prototype Refinement for Few-shot Industrial Anomaly Detection
Yufei Li ⋅ Long Tian ⋅ Yuyang Dai ⋅ Wenchao Chen ⋅ Liang Bao ⋅ Xiyang Liu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 638
RC-NF: Robot-Conditioned Normalizing Flow for Real-Time Anomaly Detection in Robotic Manipulation
Shijie Zhou ⋅ Bin Zhu ⋅ Jiarui Yang ⋅ Xiangyu Zhao ⋅ Jingjing Chen ⋅ Yu-Gang Jiang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 639
Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision
yizhou jin ⋅ Yuezhu Feng ⋅ Jinjin Zhang ⋅ Peng Wang ⋅ Qingjie Liu ⋅ Yunhong Wang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 640
MMR-AD: A Large-Scale Multimodal Dataset for Benchmarking General Anomaly Detection with Multimodal Large Language Models
Xincheng Yao ⋅ Zefeng Qian ⋅ Chao Shi ⋅ Jiayang Song ⋅ Chongyang Zhang
[ Slides [ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 641
Wavelet-Driven 3D Anomaly Detection under Pose-Agnostic and Sparse-View
Mingwen Shao ⋅ Qiao Zhang ⋅ Xinyuan Chen ⋅ Xiang Lv ⋅ Lingzhuang Meng ⋅ Chang Liu ⋅ Qinglin Zhan ⋅ Ling Jian
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 642
Hunting Normality from Query Sample via Residual Learning for Generalist Anomaly Detection
Xiaolei Wang ⋅ Yuexin Wang ⋅ Tianhong Dai ⋅ Huihui Bai ⋅ Yao Zhao ⋅ Jimin Xiao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 643
GPFlow: Gaussian Prototype Probability Flow for Unsupervised Multi-Modal Anomaly Detection
YITING LI ⋅ Xulei Yang ⋅ Jingyi Liao ⋅ Jing Zhang ⋅ Fayao Liu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 644
HP-Edit: A Human-Preference Post-Training Framework for Image Editing
Fan Li ⋅ Chonghuinan Wang ⋅ Lina Lei ⋅ Yuping Qiu ⋅ Jiaqi Xu ⋅ Jiaxiu Jiang ⋅ Xinran Qin ⋅ Zhikai Chen ⋅ Fenglong Song ⋅ Zhixin Wang ⋅ Renjing Pei ⋅ Wangmeng Zuo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 645
It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models
Anne Harrington ⋅ A. Koepke ⋅ Shyamgopal Karthik ⋅ Trevor Darrell ⋅ Alexei A. Efros
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 646
RebRL: Reinforcing Discrete Visual Diffusion Models with Rebalanced Timestep Credits
Mu Zhang ⋅ Tianren Ma ⋅ Yunfan Liu ⋅ Kun Hu ⋅ Qixiang Ye
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 647
Ego-InBetween: Generating Object State Transitions in Ego-Centric Videos
Mengmeng Ge ⋅ Takashi Isobe ⋅ Xu Jia ⋅ Yanan Sun ⋅ Zetong Yang ⋅ Weinong Wang ⋅ Dong Zhou ⋅ Dong Li ⋅ Huchuan Lu ⋅ Emad Barsoum
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 648
Towards Fine-Grained Attribution: Instance-Aware Preference Optimization for Aligning Diffusion Models
Jiayang Sun ⋅ Pin Wang ⋅ Hongbo Wang ⋅ Xinyue Liu ⋅ Huaibo Huang ⋅ Ran He
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 649
SketchRevive: Fine-Grained Pixel-to-Vector Sketch Completion with Diffusion-Prior-Guided Multimodal LLMs
Ran Zuo ⋅ Haoxiang Hu ⋅ Chenxi Pei ⋅ Yanxuan Liu ⋅ Wenwen Qiang ⋅ Fang Liu ⋅ Xiaoming Deng ⋅ Cuixia Ma ⋅ Yong-Jin Liu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 650
UniPercept: A Unified Diffusion Model for Generalizable Visual Perception
Zuyan Zhao ⋅ Zhenliang He ⋅ Meina Kan ⋅ Shiguang Shan ⋅ Xilin Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 651
Visual Diffusion Models are Geometric Solvers
Nir Goren ⋅ Shai Yehezkel ⋅ Omer Dahary ⋅ Andrey Voynov ⋅ Or Patashnik ⋅ Daniel Cohen-Or
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 652
You Only Erase Once: Erasing Anything without Bringing Unexpected Content
Yixing Zhu ⋅ Qing Zhang ⋅ Wenju Xu ⋅ Wei-Shi Zheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 653
Smoothing the Score Function to Enhance Generalization in Diffusion Models
Xinyu Zhou ⋅ Jiawei Zhang ⋅ Stephen J. Wright
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 654
NS-Diff: Fluid Navier–Stokes Guided Video Diffusion via Reinforcement Learning
Zijun Deng ⋅ Yuxin Peng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 655
PropFly: Learning to Propagate via On-the-Fly Supervision from Pre-trained Video Diffusion Models
Wonyong Seo ⋅ Jaeho Moon ⋅ Jaehyup Lee ⋅ Soo Ye Kim ⋅ Munchurl Kim
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 656
Generative Neural Video Compression via Video Diffusion Prior
Qi Mao ⋅ Hao Cheng ⋅ Tinghan Yang ⋅ Libiao Jin ⋅ Siwei Ma
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 657
AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation
Haoyue Tan ⋅ Shengnan Wang ⋅ Yulin Qiao ⋅ juncheng zhang ⋅ Youhui Bai ⋅ Ping Gong ⋅ Zewen Jin ⋅ Cheng Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 658
Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
Johannes Schusterbauer ⋅ Ming Gui ⋅ Yusong Li ⋅ Pingchuan Ma ⋅ Felix Krause ⋅ Björn Ommer
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 659
Image Diffusion Preview with Consistency Solver
Fu-Yun Wang ⋅ Hao Zhou ⋅ Liangzhe Yuan ⋅ Sanghyun Woo ⋅ Boqing Gong ⋅ Bohyung Han ⋅ Ming-Hsuan Yang ⋅ Han Zhang ⋅ Yukun Zhu ⋅ Ting Liu ⋅ Long Zhao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 660
The Drift Kernel: Why Diffusion Models Change Even When Told Not To
Gokul Srinath Seetha Ram ⋅ Rashmi Elavazhagan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 661
Interpretable Prompts made Edit-Friendly: Token-to-Token Similarity Reduction in dLLMs for Edit-Friendly Hard Prompt Inversion
Naresh Kumar Devulapally ⋅ Shruti Agarwal ⋅ Vishal Asnani ⋅ Vishnu Suresh Lokhande
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 662
LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration
Peiliang Cai ⋅ Jiacheng Liu ⋅ Haowen Xu ⋅ Xinyu Wang ⋅ Chang Zou ⋅ Linfeng Zhang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 663
Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
Tianci Bi ⋅ Xiaoyi Zhang ⋅ Yan Lu ⋅ Nanning Zheng
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 664
Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration
Jiaqi Han ⋅ Juntong Shi ⋅ Puheng Li ⋅ Haotian Ye ⋅ Qiushan Guo ⋅ Stefano Ermon
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 665
Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation
Yi Wu ⋅ Shengju Qian ⋅ Lingting Zhu ⋅ Lei Liu ⋅ Wandi Qiao ⋅ Ziqiang Li ⋅ Lequan Yu ⋅ Bin Li
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 666
EasyOmnimatte: Taming Pretrained Inpainting Diffusion Models for End-to-End Video Layered Decomposition
Yihan Hu ⋅ Xuelin Chen ⋅ Xiaodong Cun
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 667
Hierarchical Codec Diffusion for Video-to-Speech Generation
Jiaxin Ye ⋅ Gaoxiang Cong ⋅ Chenhui Wang ⋅ Xin-Cheng Wen ⋅ Zhaoyang Li ⋅ Boyuan Cao ⋅ Hongming Shan
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 668
Semantic Alignment for Pose-Invariant Identity Preserving Diffusion
Jiwon Kim ⋅ SeonHwa Kim ⋅ Soobin Park ⋅ Eunju Cha ⋅ Kyong Hwan Jin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 669
Causality in Video Diffusers is Separable from Denoising
Xingjian Bai ⋅ Guande He ⋅ Zhengqi Li ⋅ Eli Shechtman ⋅ Xun Huang ⋅ Zongze Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 670
2ndMatch: Finetuning Pruned Diffusion Models via Second-Order Jacobian Matching
Caleb Zheng ⋅ Eli Shlizerman
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 671
Hear What You See: Video-to-Audio Generation with Diffusion Transformer and Semantic-Temporal Alignment-Ranked Direct Preference Optimization
Kai Wang ⋅ Tao Zhou ⋅ jiayi lei ⋅ Jing Wang ⋅ Jinman Zhao ⋅ Weiguo Pian ⋅ Yuan Cheng ⋅ Yapeng Tian ⋅ Peng Gao ⋅ Bin Fu ⋅ Yihao Liu ⋅ Dimitrios Hatzinakos ⋅ Yuewen Cao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 672
MacTok: Robust Continuous Tokenization for Image Generation
Hengyu Zeng ⋅ Xin Gao ⋅ Guanghao Li ⋅ Yuxiang Yan ⋅ Jiaoyang Ruan ⋅ Ma Junpeng ⋅ Haoyu Albert Wang ⋅ Jian Pu
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 673
Group Editing: Edit Multiple Images in One Go
Yue Ma ⋅ Xinyu Wang ⋅ Qianli Ma ⋅ Qinghe Wang ⋅ Mingzhe Zheng ⋅ xiangpeng yang ⋅ Hao Li ⋅ Chongbo Zhao ⋅ Jixuan Ying ⋅ Harry Yang ⋅ Hongyu Liu ⋅ Qifeng Chen
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 674
Adaptive Video Distillation: Mitigating Oversaturation and Temporal Collapse in Few-Step Generation
Yuyang You ⋅ Yongzhi Li ⋅ Jiahui Li ⋅ Yadong Mu ⋅ Quan Chen ⋅ Peng Jiang
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 675
Beyond the Golden Data: Resolving the Motion-Vision Quality Dilemma via Timestep Selective Training
Xiangyang Luo ⋅ Qingyu Li ⋅ Yuming Li ⋅ Guanbo Huang ⋅ Yongjie Zhu ⋅ Wenyu Qin ⋅ Meng Wang ⋅ Pengfei Wan ⋅ Shao-Lun Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 676
Toward Diffusible High-Dimensional Latent Spaces: A Frequency Perspective
Bolin Lai ⋅ XuDong Wang ⋅ Saketh Rambhatla ⋅ James M. ⋅ Zsolt Kira ⋅ Rohit Girdhar ⋅ Ishan Misra
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 677
Elucidating the SNR-t Bias of Diffusion Probabilistic Models
Meng Yu ⋅ Lei Sun ⋅ Jianhao Zeng ⋅ Xiangxiang Chu ⋅ Kun Zhan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 678
What Is It Like to Be a Noise? An Entropy-based Gaussian Noise Regularization for Diffusion Models
Pascal Chang ⋅ Kai Lascheit ⋅ Jingwei Tang ⋅ Markus Gross ⋅ Vinicius Azevedo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 679
FlashVSR: Towards Real-time Diffusion-Based Streaming Video Super Resolution
Junhao Zhuang ⋅ Shi Guo ⋅ Xin Cai ⋅ Xiaohui Li ⋅ Yihao Liu ⋅ Chun Yuan ⋅ Tianfan Xue
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 680
DiffusionHarmonizer: Bridging Neural Reconstruction and Photorealistic Simulation with Online Diffusion Enhancer
Yuxuan Zhang ⋅ Katarina Tothova ⋅ Zian Wang ⋅ Kangxue Yin ⋅ Haithem Turki ⋅ Riccardo de Lutio ⋅ Yen-Yu Chang ⋅ Or Litany ⋅ Sanja Fidler ⋅ Žan Gojčič
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 681
GDRO: Group-level Reward Post-training Suitable for Diffusion Models
Yiyang Wang ⋅ Xi Chen ⋅ Xiaogang Xu ⋅ Yu Liu ⋅ Hengshuang Zhao
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 682
RFDM: Residual Flow Diffusion Models for Video Editing
Mohammadreza Salehi ⋅ Mehdi Noroozi ⋅ Luca Morreale ⋅ Ruchika Chavhan ⋅ Malcolm Chadwick ⋅ Alberto Gil Couto Pimentel Ramos ⋅ Abhinav Mehrotra
[ Poster
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 683
FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing
Yucheng Liao ⋅ Jiajun Liang ⋅ Kaiqian Cui ⋅ Baoquan Zhao ⋅ Haoran Xie ⋅ Wei Liu ⋅ Qing Li ⋅ Xudong Mao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 684
Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models
Ning Han ⋅ Zhenyu Ge ⋅ Feng Han ⋅ Yuhua Sun ⋅ Chengqing Li ⋅ Jingjing Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 685
HierEdit: Region-Aware Hierarchical Diffusion for Efficient High-Resolution Editing
Yuyao Zhang ⋅ Alexander Huang-Menders ⋅ Yu-Wing Tai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 686
CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration
Xiefan Guo ⋅ Xinzhu Ma ⋅ Haiyu Zhang ⋅ Di Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 687
Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers
Yiqing Shi ⋅ Yiren Song ⋅ Mike Zheng Shou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 688
DeltaQuant: 4-bit Video Diffusion Models with Spatiotemporal Delta Smoothing
Xingyang Li ⋅ Samuel Tesfai ⋅ Zhekai Zhang ⋅ Haocheng Xi ⋅ Shuo Yang ⋅ Lvmin Zhang ⋅ Yufei Sun ⋅ Kelly Peng ⋅ Maneesh Agrawala ⋅ Ion Stoica ⋅ Kurt Keutzer ⋅ Jun-Yan Zhu ⋅ Song Han ⋅ Yujun Lin ⋅ Muyang Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 689
D2Cache: Second-Order Delta Caching for Higher Video Diffusion Acceleration
Enhuai Liu ⋅ Yunke Wang ⋅ Changming Sun ⋅ Chang Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 690
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
Zehong Ma ⋅ Longhui Wei ⋅ Shuai Wang ⋅ Shiliang Zhang ⋅ Qi Tian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 691
Test-Time Alignment of Text-to-Image Diffusion Models via Null-Text Embedding Optimisation
Taehoon Kim ⋅ Henry Gouk ⋅ Timothy Hospedales
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 692
Accelerating Diffusion Model Training under Minimal Budgets: A Condensation-Based Perspective
Rui Huang ⋅ Shitong Shao ⋅ zikai zhou ⋅ Pukun Zhao ⋅ Hangyu Guo ⋅ Tian Ye ⋅ Lichen Bai ⋅ Shuo Yang ⋅ Zeke Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 693
Denoising as Path Planning: Training-Free Acceleration of Diffusion Models with DPCache
Bowen Cui ⋅ Yuanbin Wang ⋅ Huajiang Xu ⋅ Biaolong Chen ⋅ Aixi Zhang ⋅ Hao Jiang ⋅ Zhengzheng Jin ⋅ Xu Liu ⋅ Pipei Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 694
Taming Sampling Perturbations with Variance Expansion Loss for Latent Diffusion Models
Qifan Li ⋅ Xingyu Zhou ⋅ Jinhua Zhang ⋅ Weiyi You ⋅ Shuhang Gu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 695
Guiding Diffusion Models with Semantically Degraded Conditions
shilong han ⋅ Yuming Zhang ⋅ Hongxia Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 696
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion
Yueming Pan ⋅ Ruoyu Feng ⋅ Qi Dai ⋅ Yuqi Wang ⋅ Wenfeng LIN ⋅ MINGYU GUO ⋅ Chong Luo ⋅ Nanning Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 697
Reviving ConvNeXt for Efficient Convolutional Diffusion Models
Taesung Kwon ⋅ Lorenzo Bianchi ⋅ Lennart Wittke ⋅ Felix Watine ⋅ Fabio Carrara ⋅ Jong Chul ⋅ Romann Weber ⋅ Vinicius Azevedo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 698
Coupled Diffusion Sampling for Training-Free Multi-View Image Editing
Hadi Alzayer ⋅ Yunzhi Zhang ⋅ Chen Geng ⋅ Jia-Bin Huang ⋅ Jiajun Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 699
Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance
Liangyu Yuan ⋅ Yufei Huang ⋅ Mingkun Lei ⋅ Tong Zhao ⋅ Ruoyu Wang ⋅ Chi Changxi ⋅ Yiwei Wang ⋅ Chi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 700
Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation
Kwanyoung Lee ⋅ SeungJu Cha ⋅ Yebin Ahn ⋅ Hyunwoo Oh ⋅ Sungho Koh ⋅ Dong-Jin Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 701
SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models
Jiaji Zhang ⋅ Ruichao Sun ⋅ Hailiang Zhao ⋅ Jiaju Wu ⋅ Peng Chen ⋅ Hao Li ⋅ Yuying Liu ⋅ Kingsum Chow ⋅ GANG XIONG ⋅ Shuiguang Deng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 702
BAgger: Backwards Aggregation for Mitigating Drift in Autoregressive Video Diffusion Models
Ryan Po ⋅ Eric Ryan Chan ⋅ Changan Chen ⋅ Gordon Wetzstein
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 703
Accelerating Autoregressive Video Diffusion via History-Guided Cache and Residual Correction
Kepan Nan ⋅ Wangbo Zhao ⋅ Penghao Zhou ⋅ Jun Li ⋅ Zhenheng Yang ⋅ Jian Yang ⋅ Ying Tai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 704
MusicInfuser: Making Video Diffusion Listen and Dance
Susung Hong ⋅ Ira Kemelmacher-Shlizerman ⋅ Brian Curless ⋅ Steve M. Seitz