Skip to yearly menu bar Skip to main content


(657 events)   Timezone:  
Show all
The 2026 schedule is still incomplete
Toggle Poster Visibility
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 1
MAMMA: Markerless Accurate Multi-person Motion Acquisition
Hanz Cuevas Velasquez ⋅ Anastasios Yiannakidis ⋅ Soyong Shin ⋅ Giorgio Becherini ⋅ Markus Höschle ⋅ Joachim Tesch ⋅ Taylor Obersat ⋅ Tsvetelina Alexiadis ⋅ Eni Halilaj ⋅ Michael J. Black
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 2
Natural Human Motion Recovery by Aligning High-Order Temporal Dynamics from Monocular Videos
Dingkun Wei ⋅ Zehong Shen ⋅ Yan Xia ⋅ Yujun Shen ⋅ Georgios Pavlakos ⋅ Xiaowei Zhou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 3
PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning
Jianqi Chen ⋅ Biao Zhang ⋅ Xiangjun Tang ⋅ Peter Wonka
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 4
SAM 3D Body: Robust Full-Body Human Mesh Recovery
Xitong Yang ⋅ Devansh Kukreja ⋅ Don Pinkus ⋅ Taosha Fan ⋅ Jinhyung Park ⋅ Soyong Shin ⋅ Jinkun Cao ⋅ Jia-Wei Liu ⋅ Nicolás Ugrinovic ⋅ Anushka Sagar ⋅ Jitendra Malik ⋅ Matt Feiszli ⋅ Piotr Dollár ⋅ Kris Kitani
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 5
SAM 3D: 3Dfy Anything in Images
Xingyu Chen ⋅ Fu-Jen Chu ⋅ Pierre Gleize ⋅ Kevin J Liang ⋅ Alexander Sax ⋅ Hao Tang ⋅ Weiyao Wang ⋅ Michelle Guo ⋅ Thibaut Hardin ⋅ Xiang Li ⋅ Aohan Lin ⋅ Jia-Wei Liu ⋅ Ziqi Ma ⋅ Anushka Sagar ⋅ Bowen Song ⋅ Xiaodong Wang ⋅ Jianing "Jed" Yang ⋅ Bowen Zhang ⋅ Piotr Dollár ⋅ Georgia Gkioxari ⋅ Matt Feiszli ⋅ Jitendra Malik
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 6
SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge
Yumeng He ⋅ Ying Jiang ⋅ Jiayin Lu ⋅ Yin Yang ⋅ Chenfanfu Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 7
3DReflecNet: A Large-Scale Dataset for 3D Reconstruction of Reflective, Transparent, and Low-Texture Objects
Zhicheng Liang ⋅ Haoyi Yu ⋅ Boyan Li ⋅ Dayou Zhang ⋅ Zijian Cao ⋅ Tianyi Gong ⋅ Junhua Liu ⋅ Shuguang Cui ⋅ Fangxin Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 8
GLINT: Modeling Scene-Scale Transparency via Gaussian Radiance Transport
Youngju Na ⋅ Jaeseong Yun ⋅ Soohyun Ryu ⋅ Hyunsu Kim ⋅ Sung-Eui Yoon ⋅ Suyong Yeon
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 9
Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy
Shuo Chen ⋅ Yijin Li ⋅ Xi Zheng ⋅ Guofeng Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 10
PhyGaP: Physically-Grounded Gaussians with Polarization Cues
Jiale Wu ⋅ Xiaoyang Bai ⋅ Zongqi He ⋅ Weiwei Xu ⋅ YIFAN PENG
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 11
PPISP: Physically-Plausible Compensation and Control of Photometric Variations in Radiance Field Reconstruction
Isaac Deutsch ⋅ Nicolas Moënne-Loccoz ⋅ Gavriel State ⋅ Žan Gojčič
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 12
SeeGroup: Multi-Layer Depth Estimation of Transparent Surfaces via Self-Determined Grouping
Hongyu Wen ⋅ Jia Deng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 13
Energy-GS: Image Energy-guided Pose Alignment Gaussian Splatting with redesigned pose gradient flow
Yu Gao ⋅ Lutong Su ⋅ Ruixiang Huang ⋅ Tianji Jiang ⋅ Jiadong Tang ⋅ Yufeng Yue ⋅ Yi Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 14
MeshSplatting: Differentiable Rendering with Opaque Meshes
Jan Held ⋅ Sanghyun Son ⋅ Renaud Vandeghen ⋅ Daniel Rebain ⋅ Matheus Gadelha ⋅ Yi Zhou ⋅ Anthony Cioppa ⋅ Ming C. Lin ⋅ Marc Van Droogenbroeck ⋅ Andrea Tagliasacchi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 15
Proxy-GS: Unified Occlusion Priors for Training and Inference in Structured 3D Gaussian Splatting
Yuanyuan Gao ⋅ YUNING GONG ⋅ Yifei Liu ⋅ Jingfeng Li ⋅ Dan Xu ⋅ Yanci Zhang ⋅ Dingwen Zhang ⋅ Xiao Sun ⋅ Zhihang Zhong
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 16
RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting
Xuezhen Wang ⋅ Li Ma ⋅ Yulin Shen ⋅ Zeyu Wang ⋅ Pedro V. Sander
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 17
Selfi: Self-improving Reconstruction Engine via 3D Geometric Feature Alignment
Youming Deng ⋅ Songyou Peng ⋅ Junyi Zhang ⋅ Kathryn Heal ⋅ Tiancheng Sun ⋅ John Flynn ⋅ Steve Marschner ⋅ Lucy Chai
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 18
Z-Order Transformer for Feed-Forward Gaussian Splatting
Can Wang ⋅ Lei Liu ⋅ Wei Jiang ⋅ Dong Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 19
4D Primitive-Mâché: Glueing Primitives for Persistent 4D Scene Reconstruction
Kirill Mazur ⋅ Marwan Taher ⋅ Andrew J. Davison
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 20
Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
Chuhan Zhang ⋅ Guillaume Le Moing ⋅ Skanda Koppula ⋅ Ignacio Rocco ⋅ Liliane Momeni ⋅ Junyu Xie ⋅ Shuyang Sun ⋅ Rahul Sukthankar ⋅ Joëlle K. Barral ⋅ Raia Hadsell ⋅ Zoubin Ghahramani ⋅ Andrew Zisserman ⋅ Junlin Zhang ⋅ Mehdi S. M. Sajjadi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 21
FUSER: Feed-Forward Multiview 3D Registration Transformer and SE(3)^N Diffusion Refinement
Haobo Jiang ⋅ Jin Xie ⋅ Jian Yang ⋅ Liang Yu ⋅ Jianmin Zheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 22
Residual Primitive Fitting of 3D Shapes with SuperFrusta
Aditya Ganeshan ⋅ Matheus Gadelha ⋅ Thibault Groueix ⋅ Zhiqin Chen ⋅ Siddhartha Chaudhuri ⋅ Vladimir G. Kim ⋅ Wang Yifan ⋅ Daniel Ritchie
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 23
SmokeSVD: Smoke Reconstruction from A Single View via Progressive Novel View Synthesis and Refinement with Diffusion Models
Chen Li ⋅ Shanshan Dong ⋅ Sheng Qiu ⋅ Jianmin Han ⋅ Yibo Zhao ⋅ Zan Gao ⋅ Taku Komura ⋅ Kemeng Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 24
SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model
Jiayuan Du ⋅ Yiming Zhao ⋅ Zhenglong Guo ⋅ Yong Pan ⋅ Wenbo Hou ⋅ Zhihui Hao ⋅ Kun Zhan ⋅ Qijun Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 25
Affostruction: 3D Affordance Grounding with Generative Reconstruction
Chunghyun Park ⋅ Seunghyeon Lee ⋅ Minsu Cho
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 26
MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction
JongMin Lee ⋅ Seungyeop Kang ⋅ Sungjoo Yoo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 27
Unified Primitive Proxies for Structured Shape Completion
Zhaiyu Chen ⋅ Yuqing Wang ⋅ Xiao Xiang Zhu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 28
ART: Articulated Reconstruction Transformer
Zizhang Li ⋅ Cheng Zhang ⋅ Zhengqin Li ⋅ Henry Howard-Jenkins ⋅ Zhaoyang Lv ⋅ Chen Geng ⋅ Jiajun Wu ⋅ Richard Newcombe ⋅ Jakob Engel ⋅ Zhao Dong
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 29
SCE-SLAM: Scale-Consistent Monocular SLAM via Scene Coordinate Embeddings
Yuchen Wu ⋅ Jiahe Li ⋅ Xiaohan Yu ⋅ Lina Yu ⋅ Jin Zheng ⋅ Xiao Bai
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 30
S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs
Yuzhou Ji ⋅ Qijian Tian ⋅ He Zhu ⋅ Xiaoqi Jiang ⋅ Guangzhi Cao ⋅ Lizhuang Ma ⋅ Yuan Xie ⋅ Xin Tan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 31
Pip-Stereo: Progressive Iterations Pruner for Iterative Optimization based Stereo Matching
Jintu Zheng ⋅ Qizhe Liu ⋅ Huangxin Xu ⋅ zhuojie Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 32
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
Bowen Wen ⋅ Shaurya Dewan ⋅ Stan Birchfield
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 33
E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Qitao Zhao ⋅ Hao Tan ⋅ Qianqian Wang ⋅ Sai Bi ⋅ Kai Zhang ⋅ Kalyan Sunkavalli ⋅ Shubham Tulsiani ⋅ Hanwen Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 34
QVGGT: Post-Training Quantized Visual Geometry Grounded Transformer
Zhizhen Pan ⋅ Hesong Wang ⋅ Huan Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 35
SRGCD: Stability-Driven Region Growth Framework for 3D Change Detection
Yue Wu ⋅ Tao Peng ⋅ Yongzhe Yuan ⋅ Kaiyuan Feng ⋅ Hao Li ⋅ Maoguo Gong ⋅ Qiguang Miao ⋅ Wenping Ma
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 36
D-Prism: Differentiable Primitives for Structured Dynamic Modeling
Xingyuan Yu ⋅ Yijin Li ⋅ Chong Zeng ⋅ Yuhang Ming ⋅ Hujun Bao ⋅ Guofeng Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 37
STAC: Plug-and-Play Spatio-Temporal Aware Cache Compression for Streaming 3D Reconstruction
Runze Wang ⋅ Yuxuan Song ⋅ Youcheng Cai ⋅ Ligang Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 38
Stabilizing Streaming Video Geometry via Dynamic Feature Normalization
Xiaoyang Lyu ⋅ Muxin Liu ⋅ Xiaoshan Wu ⋅ Ruicheng Wang ⋅ Yihua Huang ⋅ Yangtian Sun ⋅ Shaoshuai Shi ⋅ Xiaojuan Qi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 39
LaS-Comp: Zero-shot 3D Completion with Latent–Spatial Consistency
Weilong Yan ⋅ Li Haipeng ⋅ Hao Xu ⋅ Nianjin Ye ⋅ Yihao Ai ⋅ Shuaicheng Liu ⋅ Jingyu Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 40
Pano360: Perspective to Panoramic Vision with Geometric Consistency
Zhengdong Zhu ⋅ Weiyi Xue ⋅ Zuyuan Yang ⋅ Wenlve Zhou ⋅ Zhiheng Zhou
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 41
EfficientMonoHair: Fast Strand-Level Reconstruction from Monocular Video via Multi-View Direction Fusion
Da Li ⋅ Dominik Engel ⋅ Deng Luo ⋅ Ivan Viola
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 42
OSPO: Object-Centric Self-Improving Preference Optimization for Text-to-Image Generation
Yoonjin Oh ⋅ Yongjin Kim ⋅ Hyomin Kim ⋅ Donghwan Chi ⋅ Sungwoong Kim
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 43
MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis
Xiangyu Bai ⋅ He Liang ⋅ Bishoy Galoaa ⋅ Utsav Nandi ⋅ Shayda Moezzi ⋅ Yuhang He ⋅ Sarah Ostadabbas
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 44
StyleTextGen: Style-Conditioned Multilingual Scene Text Generation
Zeyu Chen ⋅ Fangmin Zhao ⋅ Yan Shu ⋅ Yichao Liu ⋅ Liu Yu ⋅ Yu ZHOU
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 45
CRAFT-LoRA: Content-Style Personalization via Rank-Constrained Adaptation and Training-Free Fusion
Yu Li ⋅ Yujun Cai ⋅ Chi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 46
OneHOI: Unifying Human-Object Interaction Generation and Editing
Jiun Tian Hoe ⋅ Weipeng Hu ⋅ Xudong Jiang ⋅ Yap-Peng Tan ⋅ Chee Seng Chan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 47
GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering
Xincheng Shuai ⋅ Ziye Li ⋅ Henghui Ding ⋅ Dacheng Tao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 48
Self-Paced and Self-Corrective Masked Prediction for Movie Trailer Generation
Sidan Zhu ⋅ Hongteng Xu ⋅ Dixin Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 49
TV2TV: A Unified Framework for Interleaved Language and Video Generation
Xiaofeng Zhang ⋅ Youssef Emad ⋅ Melissa Hall ⋅ John Nguyen ⋅ Karthik Padthe ⋅ Liam Robbins ⋅ Amir Bar ⋅ Delong Chen ⋅ Michal Drozdzal ⋅ Maha Elbayad ⋅ Yushi Hu ⋅ Shang-Wen Li ⋅ Jakob Verbeek ⋅ XuDong Wang ⋅ Marjan Ghazvininejad ⋅ Luke Zettlemoyer ⋅ Emily Dinan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 50
Narrative Weaver: Towards Controllable Long-Range Visual Consistency with Multi-Modal Conditioning
Zhengjian Yao ⋅ Yongzhi Li ⋅ Xinyuan Gao ⋅ Quan Chen ⋅ Peng Jiang ⋅ Yanye Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 51
Ref4D-VideoBench: Four-Dimensional Reference-Based Evaluation of Text-to-Video Generative Models
Jiajia Wei ⋅ YuJia He ⋅ Yuhan Hou ⋅ Hang Qi ⋅ Sihua Wang ⋅ Jincheng Shi ⋅ Kwok Fung Li ⋅ Zibin Zheng ⋅ Weibin Wu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 52
PureCC: Pure Learning for Text-to-Image Concept Customization
Zhichao Liao ⋅ Xiaole Xian ⋅ Qingyu Li ⋅ Wenyu Qin ⋅ Meng Wang ⋅ Weicheng Xie ⋅ Siyang Song ⋅ Pingfa Feng ⋅ Long ZENG ⋅ Liang Pan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 53
Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation
Shuang Li ⋅ Chao Deng ⋅ Hang Chen ⋅ Liqun Liu ⋅ zhenyu hu ⋅ Te Cao ⋅ Mengge Xue ⋅ Yuan Chen ⋅ Peng Shu ⋅ Huan Yu ⋅ Jie Jiang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 54
Yume1.5: A Text-Controlled Interactive World Generation Model
Xiaofeng Mao ⋅ Zhen Li ⋅ Chuanhao Li ⋅ Xiaojie Xu ⋅ Kaining Ying ⋅ Kaipeng Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 55
PosterReward: Unlocking Accurate Evaluation for High-Quality Graphic Design Generation
Jianyu LAI ⋅ Sixiang Chen ⋅ Jialin Gao ⋅ Hengyu Shi ⋅ Zhongying Liu ⋅ Fuxiang Zhai ⋅ Junfeng Luo ⋅ Xiaoming Wei ⋅ Lujia Wang ⋅ Lei Zhu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 56
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling
Yuran Wang ⋅ Bohan Zeng ⋅ Chengzhuo Tong ⋅ Wenxuan Liu ⋅ Yang Shi ⋅ Xiaochen Ma ⋅ Hao Liang ⋅ Yuanxing Zhang ⋅ Wentao Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 57
SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation
Ryosuke Matsuda ⋅ Keito Kudo ⋅ Haruto Yoshida ⋅ Nobuyuki Shimizu ⋅ Jun Suzuki
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 58
PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and VLM-Guided Optimization
Mingzhe Li ⋅ Renhao 'Norman' Zhang ⋅ Zhiyang Wen ⋅ Siqi Pan ⋅ Bruno da Silva ⋅ Juan Zhai ⋅ Shiqing Ma
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 59
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Guangzhao Li ⋅ Yanming Yang ⋅ Chenxi Song ⋅ Xiaohong Liu ⋅ Chi Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 60
Self-Evaluation Unlocks Any-Step Text-to-Image Generation
Xin Yu ⋅ Xiaojuan Qi ⋅ Zhengqi Li ⋅ Kai Zhang ⋅ Richard Zhang ⋅ Zhe Lin ⋅ Eli Shechtman ⋅ Tianyu Wang ⋅ Yotam Nitzan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 61
Say Cheese! Detail-Preserving Portrait Collection Generation via Natural Language Edits
Zelong Sun ⋅ Jiahui Wu ⋅ Ying Ba ⋅ Dong Jing ⋅ Zhiwu Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 62
LVLM-Aided Alignment of Task-Specific Vision Models
Alexander Koebler ⋅ Lukas Kuhn ⋅ Ingo Thon ⋅ Florian Buettner
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 63
DeepAlign: Mitigating Modality Conflict through Modality-Specific Alignment
Shuo Li ⋅ Bingchen Miao ⋅ Wendong Bu ⋅ Juncheng Li ⋅ Hanwang Zhang ⋅ Fei Wu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 64
PG-VTON: Single-Pass Training-Free Virtual Try-On via Patch-Guided Reference Alignment
Guohao Zhao ⋅ Yuxin Peng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 65
Linguistic Priors for Visual Decoupling: Towards Symmetric Vision-Brain Alignment
Dongjun Liu ⋅ Weichen Dai ⋅ Jingsheng Qian ⋅ Honggang Liu ⋅ Hangjie Yi ⋅ Wanzeng Kong
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 66
Scaling Spatial Intelligence with Multimodal Foundation Models
Zhongang Cai ⋅ Wang Ruisi ⋅ Chenyang Gu ⋅ Fanyi Pu ⋅ Junxiang Xu ⋅ YUBO WANG ⋅ Wanqi Yin ⋅ Zhitao Yang ⋅ Chen Wei ⋅ Tongxi Zhou ⋅ Qingping SUN ⋅ Hui En Pang ⋅ Jiaqi Li ⋅ Oscar Qian ⋅ Zhiqian Lin ⋅ Xuanke Shi ⋅ Kewang Deng ⋅ Xiaoyang Han ⋅ Zukai Chen ⋅ Xiangyu Fan ⋅ Hanming Deng ⋅ Lewei Lu ⋅ Liang Pan ⋅ Bo Li ⋅ Ziwei Liu ⋅ Quan Wang ⋅ Dahua Lin ⋅ Lei Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 67
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Qi Yang ⋅ Bolin Ni ⋅ Shiming Xiang ⋅ Houwen Peng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 68
SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
Xuankun Rong ⋅ Wenke Huang ⋅ Tingfeng Wang ⋅ Daiguo Zhou ⋅ Bo Du ⋅ Mang Ye
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 69
AVATAR: Reinforcement Learning to See, Hear, and Reason Over Video
Yogesh Kulkarni ⋅ Pooyan Fazli
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 70
CogniVerse: Revolutionizing Multi-Modal Retrieval-Augmented Generation with Cognitive Reflection and Geometric Reasoning
Xiang Fang ⋅ Wanlong Fang ⋅ Changshuo Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 71
FOZO: Forward-Only Zeroth-Order Prompt Optimization for Test-Time Adaptation
Xingyu Wang ⋅ Tao Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 72
Language Does Matter for Cross-Domain Few-Shot Visual Feature Enhancement
Fei Zhou ⋅ Xiwen Zhang ⋅ Qingqing Qiu ⋅ Lei Zhang ⋅ Wei Wei ⋅ Chen Ding ⋅ Yi Zhang ⋅ Liang Li ⋅ Xiangyu Yue ⋅ Yanning Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 73
Back to Source: Open-Set Continual Test-Time Adaptation via Domain Compensation
Yingkai Yang ⋅ Chaoqi Chen ⋅ Hui Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 74
Bridging Domain Expertise and Generalization for Performance Estimation
Shuxuan Li ⋅ Zhilin Zhao ⋅ Quyu Kong ⋅ Wei-Shi Zheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 75
Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition
Minxue Tang ⋅ Yangyang Yu ⋅ Aolin Ding ⋅ MAZIYAR BARAN POUYAN ⋅ Taha Belkhouja ⋅ Yujia Bao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 76
Bridging Domains through Subspace-Aware Model Merging
Levy Chaves ⋅ Chao Zhou ⋅ Rebekka Burkholz ⋅ Eduardo Valle ⋅ Sandra Avila
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 77
DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection
Haochen Li ⋅ Rui Zhang ⋅ Hantao Yao ⋅ Xin Zhang ⋅ Yifan Hao ⋅ Shaohui Peng ⋅ Yongwei Zhao ⋅ Ling Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 78
Scaling Dense Event-Stream Pretraining from Visual Foundation Models
Zhiwen Chen ⋅ Junhui Hou ⋅ Zhiyu Zhu ⋅ Jinjian Wu ⋅ Guangming Shi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 79
Event Stream Filtering via Probability Flux Estimation
Jinze Chen ⋅ Wei Zhai ⋅ Yang Cao ⋅ Bin Li ⋅ Zheng-Jun Zha
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 80
AIMDepth: Asymmetric Image-Event Mamba for Monocular Depth Estimation
Luoxi Jing ⋅ Dianxi Shi ⋅ YuShe Cao ⋅ Yuanze Wang ⋅ Junze Zhang ⋅ Yuning Cui ⋅ Mengzhu Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 81
Time-Specialized Event-Image Alignment for Blur-to-Video Decomposition
Zhijing Sun ⋅ Senyan Xu ⋅ Ruixuan Jiang ⋅ Kean Liu ⋅ Runze Tian ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 82
eRetinexGS: Retinex Modeling for Low-Light Scene Enhancement via Event Streams and 3D Gaussian Splatting
Haojie Yan ⋅ Zehao Chen ⋅ Yan Liu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 83
Unsupervised 3d Motion Estimation Using Event Camera
Han Han ⋅ Wei Zhai ⋅ Tiesong Zhao ⋅ Bin Li ⋅ Yang Cao ⋅ Zheng-Jun Zha
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 84
Goal-Driven Reward by Video Diffusion Models for Reinforcement Learning
Qi Wang ⋅ Mian Wu ⋅ Yuyang Zhang ⋅ Mingqi Yuan ⋅ Wenyao Zhang ⋅ Haoxiang You ⋅ Yunbo Wang ⋅ Xin Jin ⋅ Xiaokang Yang ⋅ Wenjun Zeng
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 85
ModularAgent: A Task-Aware Modular Framework for Joint Optimization of Multimodal Large Language Models and World Models
Yu-Wei Zhan ⋅ Xin Wang ⋅ Pengzhe Mao ⋅ Tongtong Feng ⋅ Ren Wang ⋅ Wenwu Zhu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 86
AstraNav-Memory: Contexts Compression for Long Memory
Junjun Hu ⋅ Xinda Xue ⋅ Botao Ren ⋅ Minghua Luo ⋅ Jintao Chen ⋅ Haochen Bai ⋅ Liangliang You ⋅ Mu Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 87
Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models
Zehua Zang ⋅ Xi Wang ⋅ Fuchun Sun ⋅ Xiao Xu ⋅ Lixiang Liu ⋅ Jiahuan Zhou ⋅ Jiangmeng Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 88
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
Tatiana Zemskova ⋅ Aleksei Staroverov ⋅ Dmitry Yudin ⋅ Aleksandr Panov
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 89
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands
Siyuan Hu ⋅ Kevin Qinghong Lin ⋅ Mike Zheng Shou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 90
ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation
Zhenyang Liu ⋅ Yongchong Gu ⋅ Yikai Wang ⋅ Xiangyang Xue ⋅ Yanwei Fu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 91
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models
Linqing Zhong ⋅ Yi Liu ⋅ Yifei Wei ⋅ Ziyu Xiong ⋅ Si Liu ⋅ Guangrui Ren
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 92
BridgeEQA: Virtual Embodied Agents for Real Bridge Inspections
Subin Varghese ⋅ Joshua Gao ⋅ Asad Ur Rahman ⋅ Vedhus Hoskere
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 93
SyncMos: Scalable Motion Synchronisation for Multi-Agent Scene Interaction
Lingxiao Li ⋅ Dongwon Kim ⋅ Lingyan Ruan ⋅ Taesoo Kwon ⋅ Bin Chen ⋅ Taehyun Rhee
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 94
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model
Dongwon Kim ⋅ Gawon Seo ⋅ Jinsung Lee ⋅ Minsu Cho ⋅ Suha Kwak
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 95
Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization
Tsai-Shien Chen ⋅ Aliaksandr Siarohin ⋅ Gordon Guocheng Qian ⋅ Kuan-Chieh Jackson Wang ⋅ Egor Nemchinov ⋅ Moayed Haji Ali ⋅ Riza Alp Guler ⋅ Willi Menapace ⋅ Ivan Skorokhodov ⋅ Anil Kag ⋅ Jun-Yan Zhu ⋅ Sergey Tulyakov
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 96
IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting
Tao Zhang ⋅ Yuyang Hong ⋅ Yang Xia ⋅ Kun Ding ⋅ Zeyu Zhang ⋅ Ying Wang ⋅ Shiming Xiang ⋅ Chunhong Pan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 97
InstantRetouch: Efficient and High-Fidelity Instruction-Guided Image Retouching with Bilateral Space
Jiarui Wu ⋅ Yujin Wang ⋅ Ruikang Li ⋅ Fan Zhang ⋅ Mingde Yao ⋅ Tianfan Xue
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 98
MICON-Bench: Benchmarking and Enhancing Multi-Image Context Image Generation in Unified Multimodal Models
Mingrui Wu ⋅ Hang Liu ⋅ Jiayi Ji ⋅ Xiaoshuai Sun ⋅ Rongrong Ji
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 99
The Devil is in Attention Sharing: Improving Complex Non-rigid Image Editing Faithfulness via Attention Synergy
Zhuo Chen ⋅ Fanyue Wei ⋅ Runze Xu ⋅ Jingjing Li ⋅ Lixin Duan ⋅ Angela Yao ⋅ Wen Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 100
ShreddingNet: Coarse-to-Fine Restoration for Multi-Source Shredded Manuscripts
Haoyang Cui ⋅ Hao Jiang ⋅ Yadong Mu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 101
Image Guides Images: Consistent Video Amodal Completion with Rectified In-Context Exemplar Guidance
Xiaoyu Kong ⋅ Ketong Ren ⋅ Dongyu She ⋅ Weiming Dong ⋅ Miao Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 102
Radiance Meshes for Volumetric Reconstruction
Alexander Mai ⋅ Trevor Hedstrom ⋅ George Kopanas ⋅ Janne Kontkanen ⋅ Falko Kuester ⋅ Jonathan T. Barron
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 103
Aesthetic Camera Viewpoint Suggestion with 3D Aesthetic Field
Sheyang Tang ⋅ Armin Shafiee Sarvestani ⋅ Jialu Xu ⋅ Xiaoyu Xu ⋅ Zhou Wang
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 104
CoRoGS: Contextual Gaussian Splatting for Robust Large-Deviation View Synthesis
Xin Ma ⋅ Peng Lu ⋅ Yisong Chen ⋅ Chengwei Pan ⋅ Sheng Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 105
ChronoGS: Disentangling Invariants and Changes in Multi-Period Scenes
Zhongtao Wang ⋅ Jiaqi Dai ⋅ Qingtian Zhu ⋅ Yilong Li ⋅ Mai Su ⋅ Fei Zhu ⋅ Meng GAI ⋅ Shaorong Wang ⋅ Chengwei Pan ⋅ Yisong Chen ⋅ Guoping Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 106
Real-Time Dynamic Scene Rendering with Controlled Compressibility and Contact Awareness
Boya Shi ⋅ Naiyang Guan ⋅ Xiaodong Yi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 107
Splatent: Splatting Diffusion Latents for Novel View Synthesis
Or Hirschorn ⋅ Omer Sela ⋅ Inbar Huberman-Spiegelglas ⋅ Netalee Efrat Sela ⋅ Eli Alshan ⋅ Ianir Ideses ⋅ Frederic Devernay ⋅ Yochai Zvik ⋅ Lior Fritz
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 108
ParticleGS: Learning Neural Gaussian Particle Dynamics from Videos for Prior-free Physical Motion Extrapolation
Jinsheng Quan ⋅ Qiaowei Miao ⋅ Yichao Xu ⋅ Zizhuo Lin ⋅ Ying Li ⋅ Wei Yang ⋅ Zhihui Li ⋅ Yawei Luo
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 109
Dynamic-Static Decomposition for Novel View Synthesis of Dynamic Scenes with Spiking Neurons
Lingyun Dai ⋅ Zehao Chen ⋅ Yan Liu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 110
DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification
Kenji Tojo ⋅ Bernd Bickel ⋅ Nobuyuki Umetani
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 111
Gyro-based Deep Video Deblurring
Jaesung Rim ⋅ Woohyeok Kim ⋅ Haeyun Lee ⋅ Heemin Yang ⋅ Ke Wang ⋅ Sunghyun Cho
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 112
Residual Diffusion Bridge Model for Image Restoration
Hebaixu Wang ⋅ Jing Zhang ⋅ Haoyang Chen ⋅ Haonan Guo ⋅ Di Wang ⋅ Jiayi Ma ⋅ Bo Du
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 113
MMDIR: Multimodal Instruction-Driven Framework for Mixed-Degradation Document Image Restoration
Heng Li ⋅ Xingyuan Wang ⋅ Yang Fan ⋅ Yunan Zhang ⋅ Xiangping Wu ⋅ Qingcai Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 114
Rectifying Latent Space for Generative Single-Image Reflection Removal
Mingjia Li ⋅ Jin Hu ⋅ Hainuo Wang ⋅ Qiming Hu ⋅ Jiarui Wang ⋅ Xiaojie Guo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 115
Towards Generalized Multimodal Homography Estimation
Jinkun You ⋅ Jiaxin Cheng ⋅ Jie Zhang ⋅ Yicong Zhou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 116
Edit-aware RAW reconstruction
Abhijith Punnappurath ⋅ Luxi Zhao ⋅ Ke Zhao ⋅ Hue Nguyen ⋅ Radek Grzeszczuk ⋅ Michael S. Brown
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 117
Face2Scene: Using Facial Degradation as an Oracle for Diffusion-Based Scene Restoration
Amirhossein Kazerouni ⋅ Maitreya Suin ⋅ Tristan T Aumentado-Armstrong ⋅ Sina Honari ⋅ Amanpreet Walia ⋅ Iqbal Mohomed ⋅ Kosta Derpanis ⋅ Babak TAATI ⋅ Alex Levinshtein
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 118
HG-Lane: High-Fidelity Generation of Lane Scenes under Adverse Weather and Lighting Conditions without Re-annotation
Daichao Zhao ⋅ Qiupu Chen ⋅ Feng He ⋅ Xin Ning ⋅ Qiankun Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 119
NanoSD: Edge Efficient Foundation Model for Real Time Image Restoration
Subhajit Sanyal ⋅ Srinivas Soumitri Miriyala ⋅ Akshay Janardan Bankar ⋅ Manjunath Arveti ⋅ Sowmya Vajrala ⋅ Shreyas Pandith ⋅ Sravanth Kodavanti ⋅ Abhishek Ameta ⋅ Harshit Harshit ⋅ Amit Unde
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 120
MR. Illuminate: Zero-Shot Low-Light Image Enhancement with Diffusion Prior
Joshua Cho ⋅ Sara Aghajanzadeh ⋅ Zhen Zhu ⋅ David Forsyth
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 121
FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model
Xiang Chen ⋅ Jinshan Pan ⋅ Jiangxin Dong ⋅ Jian Yang ⋅ Jinhui Tang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 122
SPEGC: Continual Test-Time Adaptation via Semantic-Prompt-Enhanced Graph Clustering for Medical Image Segmentation
Xiaogang Du ⋅ Jiawei Zhang ⋅ Tongfei Liu ⋅ Tao Lei ⋅ Yingbo Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 123
BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation
Rachit Saluja ⋅ Asli Cihangir ⋅ Ruining Deng ⋅ Johannes C. Paetzold ⋅ Fengbei Liu ⋅ Mert Sabuncu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 124
Divide, Conquer, and Aggregate: Asymmetric Experts for Class-Imbalanced Semi-Supervised Medical Image Segmentation
Yajun Liu
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 125
CROWn: A Unified Framework for Anti‑Aliased Downsampling and Phase‑Calibrated Fusion in 3D Medical Segmentation
Xingru Huang ⋅ Shuanghua Ye ⋅ Zhao Huang ⋅ Wenwen Tang ⋅ Huiyu Zhou ⋅ Zhiwen Zheng ⋅ Jin Liu ⋅ Xiaoshuai Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 126
Rethinking Box Supervision: Bias-Free Weakly Supervised Medical Segmentation
Jun Wei ⋅ Hui Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 127
Semi-supervised Echocardiography Video Segmentation via Anchor Semantic Awareness and Continuous Pseudo-label Reforging
Yunpeng Fang ⋅ Yimu Sun ⋅ Jingxing Guo ⋅ Huisi Wu ⋅ Jing Qin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 128
TANGO: Learning Distribution-wise Foundation Prior Consistency and Instance-wise Style Calibration for Medical Image Generalization
Chuang Liu ⋅ Yichao Cao ⋅ Xiu Su ⋅ Haogang Zhu
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 129
MambaLiteUNet: Cross-Gated Adaptive Feature Fusion for Robust Skin Lesion Segmentation
Md Maklachur Rahman ⋅ Soon Ki Jung ⋅ Tracy Hammond
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 130
Breaking Multimodal LLM Safety via Video-Driven Prompting
Dong Wang ⋅ XIANGYU HE ⋅ Xinqi Lyu ⋅ Bin Xiao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 131
When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters
Liangwei Lyu ⋅ Jiaqi Xu ⋅ Jianwei Ding ⋅ Qiyao Deng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 132
RecoverMark: Robust Watermarking for Localization and Recovery of Manipulated Faces
Haonan An ⋅ Xiaohui Ye ⋅ Guang Hua ⋅ Yihang Tao ⋅ Hangcheng Cao ⋅ Xiangyu Yu ⋅ Yuguang Fang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 133
A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models
Mujtaba Hussain Mirza ⋅ Antonio D’Orazio ⋅ Odelia Melamed ⋅ Iacopo Masi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 134
FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction
Runqi Lin ⋅ Alasdair Paren ⋅ Suqin Yuan ⋅ Muyang Li ⋅ Philip H.S. Torr ⋅ Adel Bibi ⋅ Tongliang Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 135
PureProof: Diffusion-Resistant Black-box Targeted Attack on Large Vision-Language Models
Yiming CAO ⋅ Dong Wang ⋅ Xinqi Lyu ⋅ Bin Xiao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 136
UniDef: Universal Defense Against Unauthorized Image Manipulation
Mingwen Shao ⋅ Lingzhuang Meng ⋅ Xiang Lv ⋅ Mengyao Wu ⋅ Xinyuan Chen ⋅ Qiao Zhang ⋅ Chang Liu ⋅ Yuanjian Qiao ⋅ Chao Dong
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 137
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
Tianyi Xiong ⋅ Yi Ge ⋅ Ming Li ⋅ Zuolong Zhang ⋅ Pranav Kulkarni ⋅ Kaishen Wang ⋅ Qi He ⋅ Zeying Zhu ⋅ Chenxi Liu ⋅ Ruibo Chen ⋅ Tong Zheng ⋅ Yanshuo Chen ⋅ Xiyao Wang ⋅ Ray Zhang ⋅ Wenhu Chen ⋅ Heng Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 138
MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals
Junyu Shen ⋅ Zhendong She ⋅ Chenghanyu Zhang ⋅ Yuchuang Sun ⋅ Luqing Luo ⋅ Dingwei Tan ⋅ Zonghao Guo ⋅ Bo Guo ⋅ Zehua Han ⋅ Wupeng Xie ⋅ Yaxin Mu ⋅ Peng Zhang ⋅ Peipei Li ⋅ Fengxiang Wang ⋅ Yangang Sun ⋅ Maosong Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 139
Rethinking Cross-Modal Anchor Alignment for Mitigating Error Accumulation
Bin Liu ⋅ Wei Sun ⋅ Qianqian Wang ⋅ Wei Feng ⋅ Yijie Chen ⋅ Haixi Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 140
SOUPLE: Enhancing Audio-Visual Localization and Segmentation with Learnable Prompt Contexts
Khanh Binh Nguyen ⋅ Chae Jung Park
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 141
Omni-MMSI: Toward Identity-attributed Social Interaction Understanding
Xinpeng Li ⋅ Bolin Lai ⋅ Hardy Chen ⋅ Shijian Deng ⋅ Cihang Xie ⋅ Yuyin Zhou ⋅ James M. ⋅ Yapeng Tian
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 142
Inconsistency-aware Multimodal Schrödinger Bridge for Deepfake Localization
Jiayu Xiong ⋅ Jing Wang ⋅ Qi Zhang ⋅ Wanlong Wang ⋅ Jun Xue
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 143
MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models
lulu hu ⋅ Xiao Wenhu ⋅ Chen Xin ⋅ Xinhua Xu ⋅ Bowen Xu ⋅ Kun Li ⋅ Yongliang Tao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 144
Seeing Through Touch: Tactile-Driven Visual Localization of Material Regions
Seongyu Kim ⋅ Seungwoo Lee ⋅ Hyeonggon Ryu ⋅ Joon Chung ⋅ Arda Senocak
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 145
Seeing What Matters: A Training-Free Self-Guided Framework for Multimodal Detail Perception and Reasoning
Mingjie Ma ⋅ yichao ma ⋅ Zhong Yang ⋅ Guohui Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 146
Illuminating Visual Identity in Universal Multimodal Embeddings
Jiawei Cao ⋅ Junyi Feng ⋅ Jiashen Hua ⋅ Ziheng Huang ⋅ Bing Deng ⋅ Kaijie Wu ⋅ Chaochen Gu ⋅ Jieping Ye
[ Slides
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 147
Anti-Degradation Lifelong Multi-View Clustering
Xingfeng Li ⋅ Hao Pan ⋅ Honglin Yuan ⋅ Yuan Sun ⋅ Xujian Zhao ⋅ Jiaqi Lin ⋅ Zhenwen Ren
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 148
The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts
Yuchen Zhang ⋅ Yaxiong Wang ⋅ Yujiao Wu ⋅ Lianwei Wu ⋅ Li Zhu ⋅ Zhedong Zheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 149
Efficient and High-Fidelity Omni Modality Retrieval
Chuong Huynh ⋅ Manh Luong ⋅ Abhinav Shrivastava
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 150
Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs
Angela van Sprang ⋅ Laurens Samson ⋅ Ana Lucic ⋅ Erman Acar ⋅ Sennay Ghebreab ⋅ Yuki M Asano
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 151
Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis
Chunlei Meng ⋅ Jiabin Luo ⋅ Zhenglin Yan ⋅ Zhenyu Yu ⋅ Rong Fu ⋅ Zhongxue Gan ⋅ Chun Ouyang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 152
HAVE-Bench: Hierarchical Audio-Visual Evaluation from Perception to Interaction
Zhong Muyan ⋅ Erfei Cui ⋅ Sen Xing ⋅ Weiyun Wang ⋅ Wen Wu ⋅ Yuchen Hu ⋅ Yanting Zhang ⋅ Xiaowei Hu ⋅ Wenhai Wang ⋅ Chao Zhang ⋅ Jifeng Dai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 153
Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models
Enguang Wang ⋅ Qiang Wang ⋅ Yuanchen Wu ⋅ Ke Yan ⋅ Xinbin Yuan ⋅ Shouhong Ding ⋅ Xialei Liu ⋅ Mingming Cheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 154
THE MORE, THE MERRIER: CONTRASTIVE FUSION FOR HIGHER-ORDER MULTIMODAL ALIGNMENT
Stefanos Koutoupis ⋅ Michaela Areti Zervou ⋅ Konstantinos Kontras ⋅ Maarten De Vos ⋅ Panagiotis Tsakalides ⋅ Grigorios Tsagkatakis
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 155
CineSRD: Leveraging Visual, Acoustic, and Linguistic Cues for Open-World Visual Media Speaker Diarization
Liangbin Huang ⋅ Xiaohua Liao ⋅ Chaoqun Cui ⋅ Shijing Wang ⋅ Zhaolong Huang ⋅ Yanlong Du ⋅ Wenji Mao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 156
HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance
Green Rosh ⋅ Prateek Kukreja ⋅ Vishakha SR ⋅ Pawan Prasad B H
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 157
UST-Hand: An Uncertainty-aware Spatiotemporal Point Cloud Interaction Network for 3D Self-supervised Hand Pose Estimation
Tianhao Han ⋅ HaoYang ZHANG ⋅ Liang Xie ⋅ Haochen Chang ⋅ Kun Gao ⋅ Yuan Cheng ⋅ Pengfei Ren ⋅ Erwei Yin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 158
ForeHOI: Feed-forward 3D Object Reconstruction from Daily Hand-Object Interaction Videos
Yuantao Chen ⋅ Jiahao Chang ⋅ Chongjie Ye ⋅ Chaoran Zhang ⋅ Zhaojie Fang ⋅ Chenghong Li ⋅ Xiaoguang Han
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 159
Hoi! - A Multimodal Dataset for Force-Grounded, Cross-View Articulated Manipulation
Tim Engelbracht ⋅ René Zurbrügg ⋅ Matteo Wohlrapp ⋅ Martin Büchner ⋅ Abhinav Valada ⋅ Marc Pollefeys ⋅ Hermann Blum ⋅ Zuria Bauer
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 160
Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands Modulator
Gyeongsik Moon
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 161
TouchDream: 3D Object Completion through Imagined Touch
Yuanbo Wang ⋅ Xinning Wang ⋅ Zhaoxuan Zhang ⋅ Changlong Wang ⋅ qianchen xia ⋅ Xiaopeng Wei ⋅ Xin Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 162
ForceVLA2: Unleashing Hybrid Force-Position Control with Force Awareness for Contact-Rich Manipulation
Yang Li ⋅ Zhaxizhuoma ⋅ Hongru Jiang ⋅ Junjie Xia ⋅ Hongquan Zhang ⋅ Jinda Du ⋅ Yunsong Zhou ⋅ Jia Zeng ⋅ Ce Hao ⋅ Jieji Ren ⋅ Qiaojun Yu ⋅ Cewu Lu ⋅ Yu Qiao ⋅ Jiangmiao Pang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 163
TokenHand: Discrete Token Representation for Efficient Hand Mesh Reconstruction
Xinguo He ⋅ Yixin Shen ⋅ Rahul Chaudhari
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 164
Artiverse: A Diverse and Physically Grounded Dataset for Articulated Objects
Denys Iliash ⋅ Jiayi Liu ⋅ Egor Fokin ⋅ Qirui Wu ⋅ Ali Mahdavi Amiri ⋅ Manolis Savva ⋅ Angel Xuan Chang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 165
MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis
Di Luo ⋅ Shuhui Yang ⋅ Mingxin Yang ⋅ Jiawei Lu ⋅ Yixuan Tang ⋅ Xintong Han ⋅ Zhuo Chen ⋅ Beibei Wang ⋅ Chunchao Guo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 166
LogCD: Local-to-global Consistency Distillation for Few-step Image Generation
Qingsong Xie ⋅ Zhenyi Liao ⋅ Chen Chen ⋅ Zhijie Deng ⋅ Haonan Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 167
EditCtrl: Disentangled Local and Global Control for Real-Time Generative Video Editing
Yehonathan Litman ⋅ Shikun Liu ⋅ Dario Seyb ⋅ Nicholas Milef ⋅ Yang Zhou ⋅ Carl Marshall ⋅ Shubham Tulsiani ⋅ Caleb Leak
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 168
Anchoring and Rescaling Attention for Semantically Coherent Inbetweening
Tae Eun Choi ⋅ Sumin Shim ⋅ Junhyeok Kim ⋅ Seong Jae Hwang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 169
FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance
Quanhao Li ⋅ Zhen Xing ⋅ Rui Wang ⋅ Haidong Cao ⋅ Qi Dai ⋅ Daoguo Dong ⋅ Zuxuan Wu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 170
LightMover: Generative Light Movement with Color and Intensity Controls
Gengze Zhou ⋅ Tianyu Wang ⋅ Soo Ye Kim ⋅ ZHIXIN SHU ⋅ Xin Yu ⋅ Yannick Hold-Geoffroy ⋅ Sumit Chaturvedi ⋅ Qi Wu ⋅ Zhe Lin ⋅ Scott Cohen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 171
Parallel Jacobi Decoding for Fast Autoregressive Image Generation
Boya Liao ⋅ Ying Li ⋅ Siyong Jian ⋅ Huan Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 172
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing
Yucheng Wang ⋅ Zedong Wang ⋅ Yuetong Wu ⋅ Yue Ma ⋅ Dan Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 173
CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions
Chonghuinan Wang ⋅ Zihan Chen ⋅ Yuxiang Wei ⋅ Tianyi Jiang ⋅ Xiaohe Wu ⋅ Fan Li ⋅ Wangmeng Zuo ⋅ Hongxun Yao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 174
EchoVDiff: Cardiac-Cycle Echocardiography Video Generation from Arbitrary Frame
Jiansong Zhang ⋅ Xiaying Yang ⋅ Xiaoling Luo ⋅ Linlin Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 175
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing
Runze He ⋅ YIJI CHENG ⋅ Tiankai Hang ⋅ Zhimin Li ⋅ Yu Xu ⋅ Zijin Yin ⋅ Shiyi Zhang ⋅ Wenxun Dai ⋅ Penghui Du ⋅ Ao Ma ⋅ Chunyu Wang ⋅ qinglin lu ⋅ Jizhong Han ⋅ Jiao Dai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 176
ChimeraLoRA: Multi-Head LoRA-Guided Synthetic Datasets
Hoyoung Kim ⋅ Minwoo Jang ⋅ Jabin Koo ⋅ Sangdoo Yun ⋅ Jungseul Ok
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 177
Frequency-Aware Flow Matching for High-Quality Image Generation
Sucheng Ren ⋅ Qihang Yu ⋅ Ju He ⋅ Xiaohui Shen ⋅ Liang-Chieh Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 178
STARFlow-V: End-to-End Video Generative Modeling with Autoregressive Normalizing Flows
Jiatao Gu ⋅ Ying Shen ⋅ Tianrong Chen ⋅ Laurent Dinh ⋅ Yuyang Wang ⋅ Miguel Ángel Bautista ⋅ David Berthelot ⋅ Joshua Susskind ⋅ Shuangfei Zhai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 179
MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture
Hui Li ⋅ Jiayue Lyu ⋅ Fu-Yun Wang ⋅ Kaihui Cheng ⋅ Siyu Zhu ⋅ Jingdong Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 180
Improving Controllable Generation: Faster Training and Better Performance via x0-Supervision
Amadou S. SANGARE ⋅ Adrien Maglo ⋅ Mohamed Chaouch ⋅ Bertrand Luvison
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 181
Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models
Zixuan Ye ⋅ Quande Liu ⋅ Cong Wei ⋅ Yuanxing Zhang ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Kun Gai ⋅ Wenhan Luo
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 182
OrionEdit: Bridging Reference and Source Images for Generalized Cross-Image Editing
Zeyu Jiang ⋅ Lai-Man Po ⋅ XUYUAN XU ⋅ Yexin Wang ⋅ Guoping Gong ⋅ Haoxuan Wu ⋅ Chenbo Yan ⋅ Kun Li ⋅ Yuyang Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 183
PositionIC: Unified Position and Identity Consistency for Image Customization
Junjie Hu ⋅ Tianyang Han ⋅ Kai Ma ⋅ Jialin Gao ⋅ Yang Song ⋅ Xianhua He ⋅ Junfeng Luo ⋅ Xiaoming Wei ⋅ Wenqiang Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 184
P-Flow: Prompting Visual Effects Generation
Rui Zhao ⋅ Mike Zheng Shou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 185
Clair Obscur: an Illumination-Aware Method for Real-World Image Vectorization
Xingyue Lin ⋅ Shuai Peng ⋅ Xiangyu Xie ⋅ Jianhua Zhu ⋅ Yuxuan Zhou ⋅ Liangcai Gao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 186
SURF: Signature-Retained Fast Video Generation
Kaixin Ding ⋅ Xi Chen ⋅ Sihui Ji ⋅ Yuan Gao ⋅ Liang Hou ⋅ Xin Tao ⋅ Hengshuang Zhao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 187
The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection
Qingdong He ⋅ Xueqin Chen ⋅ Yanjie Pan ⋅ Peng Tang ⋅ Pengcheng Xu ⋅ Zhenye Gan ⋅ Chengjie Wang ⋅ Xiaobin Hu ⋅ Jiangning Zhang ⋅ Yabiao Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 188
Lynx: Towards High-Fidelity Personalized Video Generation
Shen Sang ⋅ Tiancheng Zhi ⋅ Tianpei Gu ⋅ Jing Liu ⋅ Linjie Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 189
VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis
Meng Chu ⋅ Senqiao Yang ⋅ Haoxuan Che ⋅ Suiyun Zhang ⋅ Xichen Zhang ⋅ Shaozuo Yu ⋅ Haokun GUI ⋅ Zhefan Rao ⋅ Dandan Tu ⋅ Rui Liu ⋅ Jiaya Jia
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 190
ClusterMark: Towards Robust Watermarking for Autoregressive Image Generators with Visual Token Clustering
Denis Lukovnikov ⋅ Andreas Müller ⋅ Erwin Quiring ⋅ Asja Fischer
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 191
Stable Mean Flow: Lyapunov-Inspired One-Step Flow Matching
Guangxun Zhang ⋅ Mason Haberle ⋅ Davi Geiger
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 192
OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation
Sanghyeon Lee ⋅ Minwoo Lee ⋅ Euijin Shin ⋅ Kangyeol Kim ⋅ Seunghwan Choi ⋅ Jaegul Choo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 193
First Frame Is the Place to Go for Video Content Customization
Jingxi Chen ⋅ Zongxia Li ⋅ Zhichao Liu ⋅ Guangyao Shi ⋅ Xiyang Wu ⋅ Fuxiao Liu ⋅ Cornelia Fermuller ⋅ Brandon Y. Feng ⋅ Yiannis Aloimonos
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 194
Scaling Zero-Shot Reference-to-Video Generation
Zijian Zhou ⋅ Shikun Liu ⋅ Haozhe Liu ⋅ Haonan Qiu ⋅ Zhaochong An ⋅ Weiming Ren ⋅ Zhiheng Liu ⋅ Xiaoke Huang ⋅ Kam-Woh Ng ⋅ Tian Xie ⋅ Xiao Han ⋅ Yuren Cong ⋅ Hang Li ⋅ Chuyan Zhu ⋅ Aditya Patel ⋅ Tao Xiang ⋅ Sen He
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 195
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
Yixin Wan ⋅ Lei Ke ⋅ Wenhao Yu ⋅ Kai-Wei Chang ⋅ Dong Yu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 196
VDOT: Efficient Unified Video Creation via Optimal Transport Distillation
Yutong Wang ⋅ Haiyu Zhang ⋅ Tianfan Xue ⋅ Yu Qiao ⋅ Yaohui Wang ⋅ Chang Xu ⋅ Xinyuan Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 197
Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs
Sicheng Xu ⋅ Yu Deng ⋅ Shoukang Hu ⋅ Yichuan Wang ⋅ Yizhong Zhang ⋅ Zhan Chen ⋅ Jiaolong Yang ⋅ Baining Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 198
RunawayEvil: Jailbreaking the Image-to-Video Generative Models
yueming lyu ⋅ Rufan Qian ⋅ Yueming Lyu ⋅ Qinglong Liu ⋅ Linzhuang Zou ⋅ Jie Qin ⋅ Songhua Liu ⋅ Caifeng Shan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 199
MultiAnimate: Pose-Guided Image Animation Made Extensible
Yingcheng Hu ⋅ Haowen Gong ⋅ Chuanguang Yang ⋅ Zhulin An ⋅ Yongjun Xu ⋅ Songhua Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 200
Translating Signals to Languages for sEMG-Based Activity Recognition
Ming Wang ⋅ Haoxuan Qu ⋅ Qiuhong Ke ⋅ Wei Zhou ⋅ Hossein Rahmani ⋅ Jun Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 201
Open the Motion Door: Atomic Motion Decomposition and Recomposition for Open-Vocabulary Motion Generation
Ke Fan ⋅ Jiangning Zhang ⋅ Ran Yi ⋅ Jingyu Gong ⋅ Yabiao Wang ⋅ yating wang ⋅ Xin Tan ⋅ Chengjie Wang ⋅ Lizhuang Ma
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 202
Multi-level Causal LLM-based Text-to-Motion Generation with Human Alignment
Chen Xiaodong ⋅ Qian Bao ⋅ Xudong Liu ⋅ Jianping Fang ⋅ Jintao Fang ⋅ Yongdong Zhang ⋅ Tao Mei ⋅ Wu Liu
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 203
MotionHiFlow: Text-to-Motion via Hierarchical Flow Matching
Heng Li ⋅ Xiaotong Lin ⋅ Ling-An Zeng ⋅ Yulei Kang ⋅ Shuai Li ⋅ Jian-Fang Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 204
LaMoGen: Language to Motion Generation Through LLM-Guided Symbolic Inference
Junkun JIANG ⋅ Ho Yin Au ⋅ Jingyu Xiang ⋅ Jie Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 205
Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling
Euisoo Jung ⋅ Byunghyun Kim ⋅ Hyunjin Kim ⋅ Seonghye Cho ⋅ Jae-Gil Lee
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 206
GVIS: Generative Vector Image Steganography
ZiHao Xu ⋅ Dawei xu ⋅ Zihan Li ⋅ Xixi Zheng ⋅ Chuan Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 207
MaxMark: High-Capacity Diffusion-Native Watermarking via Robust and Invertible Latent Embedding
Xuanhang Chang ⋅ Zhonghao Yang ⋅ Cheng Zhuo ⋅ YU LI
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 208
GeoRK2: Geometry-Guided Runge–Kutta Integration for Diffusion Transformer Acceleration
Chaoqun Sun ⋅ Zongjing Fu ⋅ Powei Chang ⋅ Jinpeng Zhang ⋅ JianXiang Xiang ⋅ Yukang Gao ⋅ Chenyu Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 209
Test-time Sparsity for Extreme Fast Action Diffusion
Kangye Ji ⋅ Yuan Meng ⋅ Jianbo Zhou ⋅ Ye Li ⋅ Chen Tang ⋅ Zhi Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 210
Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Yifan Zhou ⋅ Zeqi Xiao ⋅ Tianyi Wei ⋅ Shuai Yang ⋅ Xingang Pan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 211
A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation
Wentao Qu ⋅ Guofeng Mei ⋅ Yang Wu ⋅ Yongshun Gong ⋅ Xiaoshui Huang ⋅ Liang Xiao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 212
When Local Rules Create Global Order: Self-Organized Representation Learning for Latent Diffusion Models
Junrong Lian ⋅ Weijian Deng ⋅ Pengxu Wei ⋅ Yaqin Chen ⋅ Qixiang Ye ⋅ Liang Lin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 213
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization
Cailin Zhuang ⋅ Ailin Huang ⋅ Hu Yaoqi ⋅ Jingwei Wu ⋅ Wei Cheng ⋅ Jiaqi Liao ⋅ Hongyuan Wang ⋅ Xinyao Liao ⋅ Weiwei Cai ⋅ Hengyuan Xu ⋅ Xuanyang Zhang ⋅ Xianfang Zeng ⋅ Zhewei Huang ⋅ Gang Yu ⋅ Chi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 214
R4-CGQA: Retrieval-based Vision Language Models for Computer Graphics Image Quality Assessment
Zhuangzi Li ⋅ Jian Jin ⋅ Shilv Cai ⋅ Weisi Lin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 215
A³: Towards Advertising Aesthetic Assessment
Kaiyuan Ji ⋅ Yixuan Gao ⋅ Lu Sun ⋅ Yushuo Zheng ⋅ Zijian Chen ⋅ Jianbo Zhang ⋅ Xiangyang Zhu ⋅ Yuan Tian ⋅ Zicheng Zhang ⋅ Guangtao Zhai
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 216
GraphVLM: Benchmarking Vision Language Models for Multimodal Graph Learning
Jiajin Liu ⋅ Dongzhe Fan ⋅ Chuanhao Ji ⋅ Daochen Zha ⋅ Qiaoyu Tan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 217
Phrase-Grounding-Aware Supervised Fine-Tuning for Chart Recognition via Side-Masked Attention
Koichiro Ito
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 218
VL-RouterBench: A Benchmark for Vision–Language Model Routing
Zhehao Huang ⋅ Baijiong Lin ⋅ Jingyuan Zhang ⋅ Jingying Wang ⋅ Yuhang Liu ⋅ Ning Lu ⋅ Tao Li ⋅ Xiaolin Huang
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 219
CLIP Is Shortsighted: Paying Attention Beyond the First Sentence
Marc-Antoine Lavoie ⋅ Anas Mahmoud ⋅ Aldo Zaimi ⋅ Arsene Fansi Tchango ⋅ Steven L. Waslander
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 220
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
Wenbo hu ⋅ JINGLI LIN ⋅ Yilin Long ⋅ Yunlong Ran ⋅ Lihan Jiang ⋅ Yifan Wang ⋅ Chenming Zhu ⋅ Runsen Xu ⋅ Tai Wang ⋅ Jiangmiao Pang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 221
UZ3DVG: Unaided Zero-Shot 3D Visual Grounding with Generated Language Conditions
Wenbin Tan ⋅ Jiawen Lin ⋅ Yuan Xie ⋅ Yachao Zhang ⋅ Yanyun Qu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 222
LangField4D: Learning Identity-Adaptive and Spatio-Temporal Continuous 4D Language Fields for Dynamic Scenes
Yichao Xu ⋅ Qiaowei Miao ⋅ Jinsheng Quan ⋅ Wei Yang ⋅ Zhihui Li ⋅ Yawei Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 223
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning
Yuhong Liu ⋅ Beichen Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Long Xing ⋅ Xiaoyi Dong ⋅ Haodong Duan ⋅ Dahua Lin ⋅ Jiaqi Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 224
CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation
Mainak Singha ⋅ Sarthak Mehrotra ⋅ Paolo Casari ⋅ Subhasis Chaudhuri ⋅ Elisa Ricci ⋅ Biplab Banerjee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 225
GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning
Jiayin Sun ⋅ Caixia Sun ⋅ Boyu Yang ⋅ hailin li ⋅ Xiao Chen ⋅ Yi Zhang ⋅ Errui Ding ⋅ Liang Li ⋅ Chao Deng ⋅ Junlan Feng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 226
Keep it SymPL: Symbolic Projective Layout for Allocentric Spatial Reasoning in Vision-Language Models
Jaeyun Jang ⋅ Seunghui Shin ⋅ Taeho Park ⋅ Hyoseok Hwang
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 227
Geometry-Guided 3D Visual Token Pruning for Video-Language Models
Han Li ⋅ Zehao Huang ⋅ Jiahui Fu ⋅ Naiyan Wang ⋅ Si Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 228
Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3D Spatial Reasoning for Instance Navigation
Won Shik Jang ⋅ Ue-Hwan Kim
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 229
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Shengchao Zhou ⋅ Yuxin Chen ⋅ Yuying Ge ⋅ Wei Huang ⋅ Jiehong Lin ⋅ Ying Shan ⋅ Xiaojuan Qi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 230
PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning
Zekai Lin ⋅ Xu Zheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 231
Hilbert-Geo: Solving Solid Geometric Problems by Neural-Symbolic Reasoning
Ruoran Xu ⋅ Haoyu Cheng ⋅ Bin Dong ⋅ Qiufeng Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 232
Direction-aware 3D Large Multimodal Models
QUAN LIU ⋅ Weihao Xuan ⋅ Junjue Wang ⋅ Naoto Yokoya ⋅ Ling Shao ⋅ Shijian Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 233
CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space
Sohwi Lim ⋅ Lee Hyoseok ⋅ Jungjoon Park ⋅ Tae-Hyun Oh
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 234
Tackling Alignment Ambiguity in Person Retrieval through Conversational Attribute Mining
Hao Zou ⋅ Runqing Zhang ⋅ Jin Ding ⋅ xue zhou ⋅ Jianxiao Zou ⋅ Mingzhu Cai
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 235
Beyond Global Similarity: Multi-Conditional Retrieval for Fine-Grained Cross-Modal Understanding
Xuan Lu ⋅ Kangle Li ⋅ Haohang Huang ⋅ Rui Meng ⋅ Wenjun Zeng ⋅ Xiaoyu Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 236
Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval
Jun Li ⋅ Xuhang Lou ⋅ Jinpeng Wang ⋅ Yuting Wang ⋅ Yaowei Wang ⋅ Shu-Tao Xia ⋅ Bin Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 237
What Is the Optimal Ranking Score Between Precision and Recall? We Can Always Find It and It Is Rarely F1
Sébastien Piérard ⋅ Adrien Deliege ⋅ Marc Van Droogenbroeck
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 238
Robust Remote Sensing Image–Text Retrieval with Noisy Correspondence
qiya song ⋅ Yiqiang Xie ⋅ Yuan Sun ⋅ Renwei Dian ⋅ Xudong Kang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 239
PinPoint: Evaluation of Composed Image Retrieval with Explicit Negatives, Multi-Image Queries, and Paraphrase Testing
Rohan Mahadev ⋅ Joyce Yuan ⋅ Patrick Poirson ⋅ David Xue ⋅ Hao-Yu Wu ⋅ Dmitry Kislyuk
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 240
Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance
Naifu Xue ⋅ Zhaoyang Jia ⋅ Jiahao Li ⋅ Bin Li ⋅ Zihan Zheng ⋅ Yuan Zhang ⋅ Yan Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 241
Memory Matters: Boosting Training-Free Zero-Shot Temporal Action Localization with a Learnable Lookup Table
Han Jiang ⋅ Haoyu Tang ⋅ Xiaoxuan Mu ⋅ Chen Li ⋅ Jihua Zhu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 242
TVHighlights: LLM-Guided Human-Free Collaborative Training for Video Highlight Detection in Movies and TV Dramas
Qi Qiu ⋅ Xuan Wu ⋅ Jiawei Peng ⋅ Yuan Miao ⋅ Xu Yang ⋅ Yanlong Du
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 243
Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing
Weitong Cai ⋅ Hang Zhang ⋅ Yukai Huang ⋅ Shitong Sun ⋅ Jiankang Deng ⋅ Songcen Xu ⋅ Jifei Song ⋅ Zhensong Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 244
Reinforcing Structured Chain-of-Thought for Video Understanding
Peiyao Wang ⋅ Haotian Xu ⋅ Noranart Vesdapunt ⋅ Rui Hou ⋅ Jingyi Zhang ⋅ Haibin Ling ⋅ Oleksandr Obiednikov ⋅ Ning Zhou ⋅ Kah Fu Fu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 245
FlexiVideo: Variation-Aware Temporal Dynamics Modeling for Efficient Video Understanding
Da Peng ⋅ Xuesong Yang ⋅ Zonghao Guo ⋅ Yichen Zhang ⋅ Chi Chen ⋅ Yidan Zhang ⋅ Yuan Yao ⋅ Fang Wan ⋅ Wei Ke ⋅ Maosong Sun
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 246
MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos
Arkaprava Sinha ⋅ Monish Soundar Raj ⋅ Pu Wang ⋅ Ahmed Helmy ⋅ Hieu Le ⋅ Srijan Das
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 247
Learning Effective Sign Features without Text for Gloss-free Sign Language Translation
Shiwei Gan ⋅ Xiao Liu ⋅ Yafeng Yin ⋅ Nan Liu ⋅ Kuizhuang Liu ⋅ Desibieer Tuerdaken ⋅ Zhiwei Jiang ⋅ Lei Xie ⋅ Sanglu Lu ⋅ Hongkai Wen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 248
META: Meta Evolution of Tool Trajectory Adaptation for Long-Video Understanding
Jing Huang ⋅ Luyuan Chen ⋅ Zhijie Xu ⋅ Yadong Li ⋅ Xingzhong Xu ⋅ Siye Chen ⋅ Jie Liu ⋅ Ming Kong ⋅ Qiang Zhu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 249
GT-SVJ: Generative-Transformer-Based Self-Supervised Video Judge For Efficient Video Reward Modeling
Shivanshu Shekhar ⋅ Uttaran Bhattacharya ⋅ Raghavendra Addanki ⋅ Mehrab Tanjim ⋅ Somdeb Sarkhel ⋅ Tong Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 250
Local Motion Matters: A Deconstruct–Recompose Paradigm for Reinforcement Learning Pre-training from Videos
Jinwen Wang ⋅ Youfang Lin ⋅ Xiaobo Hu ⋅ Shuo Wang ⋅ Kai Lv
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 251
Align Once to Explain: Feature Alignment for Scalable B-cosification of Foundational Vision Transformers
Raphael Maser ⋅ Siddhartha Gairola ⋅ Sukrut Rao ⋅ Bernt Schiele
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 252
Rounded or Streamlined Head? Bridging Concept Bottleneck Models and Attribute-Described Object Parts
Yang Liu ⋅ Jiajin Zhang ⋅ Yaojun Hu ⋅ Bingguang Hao ⋅ Xin Cao ⋅ Yingda Xia ⋅ Danyang Tu ⋅ Shi Gu ⋅ Ling Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 253
CIGMA: Causal Information-Gain Mechanistic Attribution of Attention Heads in Vision Transformers
Maisha Maliha ⋅ Dean F. Hougen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 254
Rethinking Concept Bottleneck Models: From Pitfalls to Solutions
Merve Tapli ⋅ Quentin Bouniot ⋅ Wolfgang Stammer ⋅ Zeynep Akata ⋅ Emre Akbas
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 255
Make it SING: Analyzing Semantic Invariants in Classifiers
Harel Yadid ⋅ Meir Yossef Levi ⋅ Roy Betser ⋅ Guy Gilboa
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 256
Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations
Chao Wang ⋅ chengan che ⋅ Xinyue Chen ⋅ Sophia Tsoka ⋅ Luis Carlos Garcia Peraza Herrera
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 257
LEADER: Learning Reliable Local-to-Global Correspondences for LiDAR Relocalization
Jianshi Wu ⋅ Minghang Zhu ⋅ dq Liu ⋅ Wen Li ⋅ Sheng Ao ⋅ Siqi Shen ⋅ Chenglu Wen ⋅ Cheng Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 258
UniCorrn: Unified Correspondence Transformer Across 2D and 3D
Prajnan Goswami ⋅ Tianye Ding ⋅ Feng Liu ⋅ Huaizu Jiang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 259
Probabilistic Discrepancy Learning for Roadside LiDAR Scene Completion
Xiaogang Wu ⋅ Jinchao Hu ⋅ Zixian Wang ⋅ Dun Liu ⋅ BoXiang Cheng ⋅ Yiqiang Wu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 260
TACO: Task-Aware Contrastive Learning for Joint LiDAR Localization and 3D Object Detection
Leyuan Xing ⋅ huanjia zhang ⋅ Dongyu Pan ⋅ Hai Wu ⋅ Qiming Xia ⋅ Kezheng Xiong ⋅ Wen Li ⋅ Chenglu Wen ⋅ Cheng Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 261
Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning
Xingyu Zhu ⋅ Yi Liang ⋅ Shuo Wang ⋅ Wenbo Zhu ⋅ Yongliang Wu ⋅ Beier Zhu ⋅ Hanwang Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 262
Learning Coordinate-based Convolutional Kernels for Continuous SE(3) Equivariant and Efficient Point Cloud Analysis
Jaein Kim ⋅ Hee Bin Yoo ⋅ Dong-Sig Han ⋅ Byoung-Tak Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 263
R3-PCQA: Ray-Reprojection-Reinforcement for No-Reference 3D Point Cloud Quality Assessment
Junhyuk Seo ⋅ Sanghyuk SEO ⋅ Dawoon Kim ⋅ Heeseok Oh
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 264
Geometric-Aware Hypergraph Reasoning for Novel Class Discovery in Point Cloud Segmentation
Zihao Zhang ⋅ Aming Wu ⋅ Li Yang ⋅ Yahong Han ⋅ Jialie Shen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 265
PointCSP: Cross-Sample Semantic Propagation and Stability Preservation in Self-Supervised Point Cloud Learning
Xinxing Yu ⋅ Ajian Liu ⋅ Sunyuan Qiang ⋅ Hui Ma ⋅ Liying Yang ⋅ Yuzhong Wang ⋅ Zhi Rao ⋅ Yanyan Liang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 266
U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences
Xiang Xu ⋅ Ao Liang ⋅ Youquan Liu ⋅ Linfeng Li ⋅ Lingdong Kong ⋅ Ziwei Liu ⋅ Qingshan Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 267
TerraSeg: Self-Supervised Ground Segmentation for Any LiDAR
Ted Lentsch ⋅ Santiago Montiel-Marín ⋅ Holger Caesar ⋅ Dariu M. Gavrila
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 268
Where Does Vision Meet Language? Understanding and Refining Visual Fusion in MLLMs via Contrastive Attention
Shezheng Song ⋅ Shasha Li ⋅ Shan Zhao ⋅ Xiaopeng Li ⋅ Qian Wan ⋅ Chengyu Wang ⋅ Tianwei Yan ⋅ Ma Jun ⋅ Jie Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 269
UniRefiner: Teaching Pre-trained ViTs to Self-Dispose Dross via Contrastive Register
Congpei Qiu ⋅ Zhaoyu Hu ⋅ Wei Ke ⋅ Zhuotao Tian ⋅ Yanhao Wu ⋅ Tong Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 270
SigLino: Efficient Multi-Teacher Distillation for Agglomerative Vision Foundation Models
Sofian Chaybouti ⋅ Sanath Narayan ⋅ Yasser Dahou ⋅ Phúc H. Lê Khắc ⋅ Ankit Singh ⋅ Ngoc Dung Huynh ⋅ Wamiq Reyaz Para ⋅ Hilde Kuehne ⋅ Hakim Hacid
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 271
Heuristic-inspired Reasoning Priors Facilitate Data-Efficient Referring Object Detection
Xu Zhang ⋅ Zhe Chen ⋅ Jing Zhang ⋅ Dacheng Tao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 272
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
Zebin You ⋅ Shen Nie ⋅ Xiaolu Zhang ⋅ JUN ZHOU ⋅ Zhiwu Lu ⋅ Ji-Rong Wen ⋅ Chongxuan Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 273
AVION: Aerial Vision–Language Instruction from Offline Teacher to Prompt-Tuned Network
Yu Hu ⋅ Jianyang Gu ⋅ Hao Liu ⋅ Yue Cao ⋅ Jozsef Hamari ⋅ Zheng Liu ⋅ Mohsen Zardadi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 274
CrossVL: Complexity-Aware Feature Routing and Paired Curriculum for Cross-View Vision-Language Detection
Zhipeng Liu ⋅ Chunbo Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 275
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models
Byung-Kwan Lee ⋅ Yu-Chiang Frank Wang ⋅ Ryo Hachiuma
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 276
Role-SynthCLIP: A Role-Play Driven Diverse Synthetic Data Approach
Yuanxiang Huangfu ⋅ Chaochao wang ⋅ weilei wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 277
BiMotion: B-spline Motion for Text-guided Dynamic 3D Character Generation
Miaowei Wang ⋅ Qingxuan Yan ⋅ Zhi Cao ⋅ Yayuan Li ⋅ Oisin Mac Aodha ⋅ Jason J. Corso ⋅ Amir Vaxman
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 278
PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow
Xincheng Shuai ⋅ Song Tang ⋅ Yutong Huang ⋅ Henghui Ding ⋅ Dacheng Tao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 279
CADFS: A Big CAD Program Dataset and Framework for Computer-Aided Design with Large Language Models
Vladislav Pyatov ⋅ Gleb Bobrovskikh ⋅ Saveliy Galochkin ⋅ Nikita Boldyrev ⋅ Oleg Voynov ⋅ Alexander Filippov ⋅ Gonzalo Ferrer ⋅ Peter Wonka ⋅ Evgeny Burnaev
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 280
MapRoute:Precise-Concept Erasing Mappers via Semantic Routing
Sihao Li ⋅ Baixi Baixi ⋅ Shuohong Xia ⋅ Yunyun Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 281
PhotoFramer: Multi-modal Image Composition Instruction
Zhiyuan You ⋅ Ke Wang ⋅ He Zhang ⋅ Xin Cai ⋅ Jinjin Gu ⋅ Tianfan Xue ⋅ Chao Dong ⋅ Zhoutong Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 282
Can We Build Scene Graphs, Not Classify Them? FlowSG: Progressive Image-Conditioned Scene Graph Generation with Flow Matching
Xin Hu ⋅ Ke Qin ⋅ Wen Yin ⋅ Yuan-Fang Li ⋅ Ming Li ⋅ Tao He
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 283
DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance
Peiying Zhang ⋅ Nanxuan Zhao ⋅ Matthew Fisher ⋅ Yiran Xu ⋅ Jing Liao ⋅ Difan Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 284
Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post‑hoc Debiasing in Vision-Language Models
Dachuan Zhao ⋅ Weiyue Li ⋅ Zhenda Shen ⋅ Yushu Qiu ⋅ Bowen Xu ⋅ Haoyu Chen ⋅ Yongchao Chen
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 285
Frequency-domain Manipulation for Face Obfuscation
Jintae Kim ⋅ Keunsoo Ko ⋅ Chang-Su Kim
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 286
Towards Reasoning-Preserving Unlearning in Multimodal Large Language Models
Hongji Li ⋅ Manjiang Yu ⋅ Junchi Yao ⋅ PRIYANKA SINGH ⋅ Xue Li ⋅ Di Wang ⋅ Lijie Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 287
Erasing Thousands of Concepts: Towards Scalable and Practical Concept Erasure for Text-to-Image Diffusion Models
Hoigi Seo ⋅ Byung Hyun Lee ⋅ Jaehyun Cho ⋅ Sungjin Lim ⋅ Se Young Chun
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 288
POUR: A Provably Optimal Method for Unlearning Representation via Neural Collapse
Anjie Le ⋅ Can Peng ⋅ Yuyuan Liu ⋅ Alison Noble
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 289
Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks
Ngoc-Bao Nguyen ⋅ Sy-Tuyen Ho ⋅ Koh Jun Hao ⋅ Ngai-Man Cheung
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 290
Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure
Ziling Wang ⋅ Shuya Yang ⋅ Jialin Lu ⋅ Ka-Ho Chow
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 291
SPDMark: Selective Parameter Displacement for Robust Video Watermarking
Samar Fares ⋅ Nurbek Tastan ⋅ Karthik Nandakumar
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 292
Enhancing Visual Representation with Textual Semantics: Textual Semantics-Powered Prototypes for Heterogeneous Federated Learning
Xinghao Wu ⋅ Jianwei Niu ⋅ Xuefeng Liu ⋅ Guogang Zhu ⋅ Jiayuan Zhang ⋅ Shaojie Tang ⋅ Wei Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 293
FedHarmony: Harmonizing Heterogeneous Label Correlations in Federated Multi-Label Learning
Zhiqiang Kou ⋅ Junxiang Wu ⋅ Wenke Huang ⋅ Wenwen He ⋅ Ming-Kun Xie ⋅ Changwei Wang ⋅ Yuheng Jia ⋅ Di Jiang ⋅ Yang Liu ⋅ Xin Geng ⋅ Qiang Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 294
FedSST: Rethinking Fair Federated Graph Learning under Structural Shift
Dingyi Zhao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 295
GDFA: Geometry-Driven Federated Unlearning with Directional Task Vector Alignment
Xiuting Weng ⋅ Ruizhi Pu ⋅ Yuanhang Yao ⋅ Kun Yue ⋅ Zhiwen Tang ⋅ Lixing Yu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 296
FedARA: Resource-adaptive Low-rank Personalized Federated Learning via Anchor-driven Representation Alignment on Heterogeneous Edge Devices
Ruonan Zhao ⋅ Zheng Wang ⋅ Debin Liu ⋅ shijie lv ⋅ Laurence Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 297
InterRVOS: Interaction-Aware Referring Video Object Segmentation
Woojeong Jin ⋅ Seongchan Kim ⋅ Jaeho Lee ⋅ Seungryong Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 298
RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding
Hanqing Liu ⋅ Mingjie Liu ⋅ Luoping Cui ⋅ Endian Lin ⋅ Donghong Jiang ⋅ Chuang Zhu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 299
RegFormer: Transferable Relational Grounding for Efficient Weakly-Supervised Human-Object Interaction Detection
Jihwan Park ⋅ Chanhyeong Yang ⋅ Jinyoung Park ⋅ Taehoon Song ⋅ Hyunwoo J. Kim
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 300
Learning to Refuse: Refusal-Aware Reinforcement Fine-Tuning for Hard-Irrelevant Queries in Video Temporal Grounding
Jin-Seop Lee ⋅ Sungjoon Lee ⋅ SeongJun Jung ⋅ Boyang Li ⋅ Jee-Hyong Lee
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 301
GroundVTS: Visual Token Sampling in Multimodal Large Language Models for Video Temporal Grounding
Rong Fan ⋅ Kaiyan Xiao ⋅ Minghao Zhu ⋅ Liuyi Wang ⋅ KAI DAI ⋅ Zhao Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 302
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Jun Zhang ⋅ Teng Wang ⋅ Yuying Ge ⋅ Yixiao Ge ⋅ Xinhao Li ⋅ Limin Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 303
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
Sizhong Qin ⋅ Ramon Elias Weber ⋅ Xinzheng Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 304
MeToM: Metadata-Guided Token Merging for Efficient Video LLMs
Zhuojie Wu ⋅ Shijie Wang ⋅ Xin Yu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 305
Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models
Jinlong Li ⋅ Liyuan Jiang ⋅ Haonan Zhang ⋅ Nicu Sebe
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 306
VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Kyle Sargent ⋅ Ruiqi Gao ⋅ Philipp Henzler ⋅ Charles Herrmann ⋅ Aleksander Holynski ⋅ Li Fei-Fei ⋅ Jiajun Wu ⋅ Jason Y. Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 307
Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models
Sijie Li ⋅ Biao Qian ⋅ Jungong Han
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 308
Attention-aware Inference Optimizations for Large Vision-Language Models with Memory-efficient Decoding
Fatih Ilhan ⋅ Gaowen Liu ⋅ Ramana Kompella ⋅ Selim Tekin ⋅ Tiansheng Huang ⋅ Zachary Yahn ⋅ Yichang Xu ⋅ Ling Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 309
CoIn: Coverage and Informativeness-Guided Token Reduction for Efficient Large Multimodal Models
Chenxi Du ⋅ Yongheng Deng ⋅ Jiani Liu ⋅ Yujia Zhang ⋅ Xi Chen ⋅ Ju Ren
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 310
TAMER: A Tri-Modal Contrastive Alignment and Multi-Scale Embedding Refinement Framework for Zero-Shot ECG Diagnosis
Xuewei Zhou ⋅ Yajie Meng ⋅ Pan Zeng ⋅ Xianfang Tang ⋅ Feifei Cui ⋅ Qiangguo Jin ⋅ Jialiang Yang ⋅ Junlin Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 311
Your Dissimilarities Define You: Complementary Learning Exploiting Class Diversities
Dimitrios Katsikas ⋅ Nikolaos Passalis ⋅ Anastasios Tefas
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 312
CGU-Bayes: Causal Graph Uncertainty-Guided Bayesian Inference for Domain Generalization
Naiyu Yin ⋅ Hanjing Wang ⋅ Yue Yu ⋅ Tian Gao ⋅ Amit Dhurandhar ⋅ Chung-Hao Lee ⋅ Qiang Ji
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 313
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
Shashanka Venkataramanan ⋅ Valentinos Pariza ⋅ Mohammadreza Salehi ⋅ Lukas Knobel ⋅ Elias Ramzi ⋅ Spyros Gidaris ⋅ Andrei Bursuc ⋅ Yuki M Asano
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 314
Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video
Yuting Tan ⋅ Xilong Cheng ⋅ Yunxiao Qin ⋅ Zhengnan Li ⋅ Jingjing Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 315
LRDUN: A Low-Rank Deep Unfolding Network for Efficient Spectral Compressive Imaging
HE HUANG ⋅ Yujun Guo ⋅ Wei He
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 316
Neural Collapse in Test-Time Adaptation
Xiao Chen ⋅ Zhongjing Du ⋅ Jiazhen Huang ⋅ Jiang Xu ⋅ Li Lu ⋅ Jingyan Jiang ⋅ Zhi Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 317
CLEX: Complementary Label Exchange Learning for Noisy Facial Expression Recognition
Lin Wang ⋅ Fang Liu ⋅ Xiaofen Xing ⋅ Kailing Guo ⋅ Xiangmin Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 318
TruckDrive: Long-Range Autonomous Highway Driving Dataset
Filippo Ghilotti ⋅ Edoardo Palladin ⋅ Samuel Brucker ⋅ Adam Sigal ⋅ Mario Bijelic ⋅ Felix Heide
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 319
Neuro-Cognitive Reward Modeling for Human-Centered Autonomous Vehicle Control
Zhuoli Zhuang ⋅ Yu-Cheng Chang ⋅ Yu-Kai Wang ⋅ Thomas Do ⋅ Chin-teng Lin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 320
E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
Yihong Tang ⋅ Haicheng Liao ⋅ Tong Nie ⋅ Junlin He ⋅ Ao Qu ⋅ Kehua Chen ⋅ Wei Ma ⋅ Zhenning Li ⋅ Lijun Sun ⋅ Chengzhong Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 321
The Blind Spot of Adaptation: Quantifying and Mitigating Forgetting in Fine-tuned Driving Models
Runhao Mao ⋅ Hanshi Wang ⋅ Yixiang Yang ⋅ Qianli Ma ⋅ Jingmeng Zhou ⋅ Zhipeng Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 322
Den-TP: A Density-Balanced Data Curation and Evaluation Framework for Trajectory Prediction
Ruining Yang ⋅ Yi Xu ⋅ Yun Fu ⋅ Lili Su
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 323
Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving
Jianhua Han ⋅ Meng Tian ⋅ Jiangtong Zhu ⋅ Fan He ⋅ Huixin Zhang ⋅ Sitong Guo ⋅ Dechang Zhu ⋅ Hao Tang ⋅ Pei Xu ⋅ Yuze Guo ⋅ Minzhe Niu ⋅ Haojie Zhu ⋅ Qichao Dong ⋅ Xuechao Yan ⋅ Siyuan Dong ⋅ Lu Hou ⋅ Qingqiu Huang ⋅ Xiaosong Jia ⋅ Hang Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 324
GaussianDWM: 3D Gaussian Driving World Model for Unified Scene Understanding and Multi-Modal Generation
Tianchen Deng ⋅ Xuefeng Chen ⋅ Yi Chen ⋅ Qu Chen ⋅ Yuyao Xu ⋅ Lijin Yang ⋅ Le Xu ⋅ Yu Zhang ⋅ Bo Zhang ⋅ Wuxiong Huang ⋅ Hesheng Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 325
Mind the Hitch: Dynamic Calibration and Articulated Perception for Autonomous Trucks
morui zhu ⋅ Yongqi Zhu ⋅ Song Fu ⋅ Qing Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 326
DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving
Zhenjie Yang ⋅ Yilin Chai ⋅ Xiaosong Jia ⋅ Qifeng Li ⋅ Yuqian Shao ⋅ Xuekai Zhu ⋅ Haisheng Su ⋅ Junchi Yan
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 327
Beyond Rule-Based Agents: Active Markov Games for Realistic Multi-Agent Interaction in Autonomous Driving
Yuan Gui ⋅ Hongchen Luo ⋅ Jiao Wang ⋅ Qu Liqi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 328
Test-Time Multi-Prompt Adaptation for Open-Vocabulary Remote Sensing Image Segmentation
Ting Yang ⋅ Qilong Wang ⋅ Qibin Hou ⋅ Qinghua Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 329
ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes
Emily Steiner ⋅ Jianhao Zheng ⋅ Henry Howard-Jenkins ⋅ Chris Xie ⋅ Iro Armeni
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 330
CrackSSM: Reviving SSMs for Crack Segmentation via Dynamic Scanning
Yubin Gu ⋅ Boyang Hou ⋅ Yuan Meng ⋅ Wenting Luo ⋅ Jiayi Ji ⋅ Xiaoshuai Sun
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 331
BiPA: Bilevel Prompt Adaptation for Underwater Instance Segmentation
Long Ma ⋅ Haoze Zheng ⋅ Yuhang Mao ⋅ Jinyuan Liu ⋅ Chengpei Xu ⋅ Xinwei Xue ⋅ Yi Wang ⋅ Xiangjian He ⋅ Weimin Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 332
RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation
Kai Zhu ⋅ Zhenyu Cui ⋅ Zehua Zang ⋅ Jiahuan Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 333
Scene-Centric Unsupervised Video Panoptic Segmentation
Christoph Reich ⋅ Oliver Hahn ⋅ Nikita Araslanov ⋅ Laura Leal-Taixe ⋅ Christian Rupprecht ⋅ Daniel Cremers ⋅ Stefan Roth
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 334
Bootstrapping Video Semantic Segmentation Model via Distillation-assisted Test-Time Adaptation
Jihun Kim ⋅ Hoyong Kwon ⋅ Hyeokjun Kweon ⋅ Kuk-Jin Yoon
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 335
GeoFree-CoSeg: Unsupervised Point Cloud-Image Cross-Modal Co-Segmentation Without Geometric Alignment
Xin Duan ⋅ Xiabi Liu ⋅ Liyuan Pan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 336
Parameter-efficient Continual Learning for Enhancing Plasticity without Forgetting under Limited Model Capacity
Yitian Chen ⋅ Shigeng Zhang ⋅ Xuan Liu ⋅ Mingming Lu ⋅ Kai Chen ⋅ Hongye Zhu ⋅ Xinning Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 337
Dual-Estimator: Decoupling Global and Local Semantic Shift for Drift Compensation in Class-Incremental Learning
Fankang Xu ⋅ Lu Jin ⋅ Yanpeng Sun ⋅ Shiyu Xuan ⋅ Zechao Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 338
Continual Distillation of Teachers from Different Domains
Nicolas Michel ⋅ Maorong Wang ⋅ Jiangpeng He ⋅ Toshihiko Yamasaki
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 339
Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance
Songze Li ⋅ Mingyu Gao ⋅ Tonghua Su ⋅ Xu-Yao Zhang ⋅ Zhongjie Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 340
Learning from Itself: Mining Internal Knowledge from Vision Language Models for Continual Learning
Yizheng Gong ⋅ Siyue Yu ⋅ Waleed Al-Nuaimy ⋅ Jimin Xiao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 341
AdaPrior: Bayesian-Inspired Adaptive Prior Correction for Long-Tailed Continual Learning
S Divakar Bhat ⋅ Amit Popat More ⋅ Mudit Soni ⋅ Bhuvan Aggarwal
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 342
An Optimal Transport-driven Approach for Cultivating Latent Space in Online Incremental Learning
Quyen Tran ⋅ Hai Nguyen ⋅ Minh Quan Dao ⋅ Hoang Phan ⋅ Linh Ngo Van ⋅ Khoat Than ⋅ Dinh Phung ⋅ Dimitris Metaxas ⋅ Trung Le
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 343
HAD: Heterogeneity-Aware Distillation for Lifelong Heterogeneous Learning
Xuerui Zhang ⋅ Xuehao Wang ⋅ Zhan Zhuang ⋅ Linglan Zhao ⋅ Ziyue Li ⋅ Xinmin Zhang ⋅ Zhihuan Song ⋅ Yu Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 344
U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation
xiang deng ⋅ Feng Gao ⋅ Yong Zhang ⋅ Youxin Pang ⋅ Xu Xiaoming ⋅ Zhuoliang Kang ⋅ Xiaoming Wei ⋅ Yebin Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 345
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
Zhiyao Sun ⋅ Ziqiao Peng ⋅ Yifeng Ma ⋅ Yi Chen ⋅ zhengguang zhou ⋅ Zixiang Zhou ⋅ Guozhen Zhang ⋅ Youliang Zhang ⋅ Yuan Zhou ⋅ qinglin lu ⋅ Yong-Jin Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 346
FlashLips: 100-FPS Mask-Free Latent Lip-Sync using Reconstruction Instead of Diffusion or GANs
Andreas Zinonos ⋅ Michał Stypułkowski ⋅ Antoni Bigata Casademunt ⋅ Stavros Petridis ⋅ Maja Pantic ⋅ Nikita Drobyshev
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 347
WildCap: Facial Albedo Capture in the Wild via Hybrid Inverse Rendering
Yuxuan Han ⋅ Xin Ming ⋅ Tianxiao Li ⋅ Zhuofan Shen ⋅ Qixuan Zhang ⋅ Lan Xu ⋅ Feng Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 348
EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization
Haolan Xu ⋅ Keli Cheng ⋅ Lei Wang ⋅ Ning Bi ⋅ Xiaoming Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 349
DyaDiT: A Multi-Modal Diffusion Transformer for Socially Favorable Dyadic Gesture Generation
YICHEN PENG ⋅ Jyun-Ting Song ⋅ Siyeol Jung ⋅ RUOFAN LIU ⋅ Haiyang Liu ⋅ Xuangeng Chu ⋅ Ruicong Liu ⋅ Erwin Wu ⋅ Hideki Koike ⋅ Kris Kitani
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 350
TRM-VLA: Temporal-Aware Chain-of-Thought Reasoning and Memorization for Vision-Language-Action Models
LI XIANG ⋅ Yali Li ⋅ Yuan Wang ⋅ Shengjin Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 351
VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving
Jie Wang ⋅ Guang Li ⋅ Zhijian Huang ⋅ Chenxu Dang ⋅ Hangjun Ye ⋅ Yahong Han ⋅ Long Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 352
NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning
Ishaan Rawal ⋅ Shubh Gupta ⋅ Yihan Hu ⋅ Wei Zhan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 353
HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation
Chengjie Fan ⋅ Cong Pan ⋅ Zijian Liu ⋅ Ningzhong Liu ⋅ Jie Qin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 354
CycleBEV: Regularizing View Transformation Networks via View Cycle Consistency for Bird’s-Eye-View Semantic Segmentation
Jeongbin Hong ⋅ Dooseop Choi ⋅ Taeg-Hyun An ⋅ KYOUNG AN AN ⋅ Kyoung-Wook Min
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 355
STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction
Jiankuo Zhao ⋅ Xiangyu Zhu ⋅ Zidu Wang ⋅ Zhen Lei
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 356
CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image
Yizheng Song ⋅ Yiyu Zhuang ⋅ Qipeng Xu ⋅ Haixiang Wang ⋅ Jiahe Zhu ⋅ Jing Tian ⋅ Siyu Zhu ⋅ Hao Zhu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 357
OMG-Avatar: One-shot Multi-LOD Gaussian Head Avatar
Jianqiang Ren ⋅ Lin Liu ⋅ Steven Hoi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 358
Globally Optimal Pose from Orthographic Silhouettes
Agniva Sengupta ⋅ Dilara Kus ⋅ Jianning Li ⋅ Stefan Zachow
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 359
AvatarPointillist: AutoRegressive 4D Gaussian Avatarization
Hongyu Liu ⋅ Xuan Wang ⋅ Zijian Wu ⋅ yating wang ⋅ Ziyu Wan ⋅ Yue Ma ⋅ Runtao Liu ⋅ Boyao Zhou ⋅ Yujun Shen ⋅ Qifeng Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 360
COPO: Causal-Oriented Policy Optimization for Hallucinations of MLLMs
Peizheng Guo ⋅ Jingyao Wang ⋅ Wenwen Qiang ⋅ Jiahuan Zhou ⋅ Changwen Zheng ⋅ Gang Hua
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 361
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding
Zhongxing Xu ⋅ Zhonghua Wang ⋅ Zhe Qian ⋅ Dachuan Shi ⋅ feilong tang ⋅ Ming Hu ⋅ Shiyan Su ⋅ Xiaocheng Zou ⋅ Wei Feng ⋅ Dwarikanath Mahapatra ⋅ Yifan Peng ⋅ Minquan Lin ⋅ Zongyuan Ge
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 362
AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM
Lian Zhong ⋅ Ziqiang He ⋅ Jibin Zheng ⋅ Jin Li ⋅ Z. Wang ⋅ xiangui Kang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 363
HulluEdit: Single-Pass Evidence-Consistent Subspace Editing for Mitigating Hallucinations in Large Vision-Language Models
Yangguang Lin ⋅ Quan Fang ⋅ Yufei Li ⋅ Jiachen Sun ⋅ Junyu Gao ⋅ Jitao Sang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 364
SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
Chang-Hsun Wu ⋅ Kai-Po Chang ⋅ Yu-Yang Sheng ⋅ Hung-Kai Chung ⋅ Kuei-Chun Wang ⋅ Yu-Chiang Frank Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 365
One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination
Zhan Fa ⋅ Yue Duan ⋅ Jian Zhang ⋅ Lei Qi ⋅ Yinghuan Shi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 366
EgoX: Egocentric Video Generation from a Single Exocentric Video
Taewoong Kang ⋅ Kinam Kim ⋅ Dohyeon Kim ⋅ Minho Park ⋅ Junha Hyung ⋅ Jaegul Choo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 367
SymphoMotion: Joint Control of Camera Motion and Object Dynamics for Coherent Video Generation
Guiyu Zhang ⋅ Yabo Chen ⋅ Xunzhi Xiang ⋅ Junchao Huang ⋅ Zhongyu Wang ⋅ Li Jiang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 368
Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion
Ting-Hsuan Chen ⋅ Ying-Huan Chen ⋅ Tao Tu ⋅ Jie-Ying Lee ⋅ Cho-Ying Wu ⋅ Fangzhou Lin ⋅ Hengyuan Zhang ⋅ David Paz ⋅ Xinyu Huang ⋅ Yuliang Guo ⋅ Yu-Lun Liu ⋅ Yue Wang ⋅ Liu Ren
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 369
SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation
Yu Yuan ⋅ Tharindu Wickremasinghe ⋅ Zeeshan Nadir ⋅ Xijun Wang ⋅ Yiheng Chi ⋅ Stanley H. Chan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 370
ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding
Byeongjun Park ⋅ Byung-Hoon Kim ⋅ Hyungjin Chung ⋅ Jong Chul
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 371
Scaling4D: Pushing the Frontier of Video Novel View Synthesis through Large-Scale Monocular Videos
Hongrui Cai ⋅ Junjie Luo ⋅ Zhihong Fu ⋅ Shengnan Zhu ⋅ Jiawei Wen ⋅ Wanquan Feng ⋅ Songtao Zhao ⋅ Qian HE
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 372
PHANTOM: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics
Ying Shen ⋅ Jerry Xiong ⋅ Tianjiao Yu ⋅ Ismini Lourentzou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 373
WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling
Shaoheng Fang ⋅ Hanwen Jiang ⋅ Yunpeng Bai ⋅ Niloy J. Mitra ⋅ Qixing Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 374
Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer
Li Yuze ⋅ Dong Gong ⋅ Xiao Cao ⋅ Junchao Yuan ⋅ Dongsheng Li ⋅ Lei Zhou ⋅ Yun Sing Koh ⋅ Cheng Yan ⋅ Xinyu Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 375
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
Zhening Huang ⋅ Hyeonho Jeong ⋅ Xuelin Chen ⋅ Yulia Gryaditskaya ⋅ Tuanfeng Wang ⋅ Joan Lasenby ⋅ Chun-Hao Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 376
D2FANet: Enhancing Video Object Detection with Dual-Domain Feature Aggregation Network
Qiang Qi ⋅ Wenqi Shang ⋅ Meifang Wang ⋅ Xiao Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 377
HierUQ: Hierarchical Uncertainty Quantification with Adaptive Granularity Reconciliation for Degraded Image Classification
YANG CHU ⋅ Xiaomeng Yang ⋅ Keli Deng ⋅ Yuntao Qian
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 378
ID-Sim: An Identity-Focused Similarity Metric
Julia Chae ⋅ Nick Kolkin ⋅ Jui-Hsien Wang ⋅ Richard Zhang ⋅ Sara Beery ⋅ Cusuh Ham
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 379
Hier-COS: Making Deep Features Hierarchy-aware via Composition of Orthogonal Subspaces
Depanshu Sani ⋅ Saket Anand
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 380
Towards Cross-Modal Preservation, Consistency and Alignment for Privacy-Preserving Visible-Infrared Person Re-Identification
Yudi Xie ⋅ Zhongao Zhou ⋅ Bin Yang ⋅ Zhenghan Chen ⋅ Mang Ye
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 381
Enhancing Mixture-of-Experts Specialization via Cluster-Aware Upcycling
Sanghyeok Chu ⋅ Pyunghwan Ahn ⋅ Gwangmo Song ⋅ Seung Hwan Kim ⋅ Honglak Lee ⋅ Bohyung Han
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 382
COPE: Consistent Occlusion and Prompt Enhancement Network for Occluded Person Re-identification
Sun Siyi ⋅ Jinliang Lin ⋅ Juanjuan Weng ⋅ Zhihui Liu ⋅ Shaozi Li ⋅ Zhiming Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 383
Assignment-Driven Hash Learning in a Hyper-Semantic Space for On-the-Fly Category Discovery
Kaibing Yang ⋅ Yucheng Wang ⋅ Tingzhang Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 384
DyFCLT: Dynamic Frequency-Decoupled Cross-Modal Learning Transformer for Multimodal Tiny Object Detection
Chaolang Li ⋅ Pengwen Dai ⋅ Jingyu Li ⋅ Siyuan Yao ⋅ Yuchen Jiang ⋅ Zhuoran Zheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 385
EW-DETR: Evolving World Object Detection via Incremental Low-Rank DEtection TRansformer
Munish Monga ⋅ Vishal Chudasama ⋅ Pankaj Wasnik ⋅ C.V. Jawahar
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 386
Building a Precise Video Language with Human–AI Oversight
Zhiqiu Lin ⋅ Siyuan Cen ⋅ Chancharik Mitra ⋅ Isaac Li ⋅ Yuhan Huang ⋅ Yu Tong Tiffany Ling ⋅ Hewei Wang ⋅ Irene Pi ⋅ Shihang Zhu ⋅ Yili Han ⋅ Yilun Du ⋅ Deva Ramanan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 387
CoCoVideo: The High-Quality Commercial-Model-Based Contrastive Benchmark for AI-Generated Video Detection
Huidong Feng ⋅ Wentao Chen ⋅ Jie Chen ⋅ Xinqi Cai ⋅ Ruolong Ma ⋅ Yinglin Zheng ⋅ Yuxin Lin ⋅ Ming Zeng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 388
Towards Sparse Video Understanding and Reasoning
Chenwei Xu ⋅ Zhen Ye ⋅ Shang Wu ⋅ Weijian Li ⋅ Zihan Wang ⋅ Zhuofan Xia ⋅ Lie Lu ⋅ Pranav Maneriker ⋅ Fan Du ⋅ Manling Li ⋅ Han Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 389
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
Jialuo Li ⋅ Bin Li ⋅ Jiahao Li ⋅ Yan Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 390
MuKV: Multi-Grained KV Cache Compression for Long Streaming Video Question-Answering
Junbin Xiao ⋅ Jiajun Chen ⋅ Tianxiang Sun ⋅ Xun Yang ⋅ Angela Yao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 391
ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding
Quan Kong ⋅ Yuhao Shen ⋅ Yicheng Ji ⋅ Huan Li ⋅ Cong Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 392
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generation
Harold Haodong Chen ⋅ Disen Lan ⋅ Wen-Jie Shu ⋅ Qingyang Liu ⋅ Zihan Wang ⋅ Sirui CHEN ⋅ Wenkai Cheng ⋅ Kanghao Chen ⋅ Hongfei (Faye) Zhang ⋅ Zixin Zhang ⋅ Rongjin Guo ⋅ Yu Cheng ⋅ Ying-Cong Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 393
What Are You Doing? A Closer Look at Controllable Human Video Generation
Emanuele Bugliarello ⋅ Anurag Arnab ⋅ Roni Paiss ⋅ Christy Koh ⋅ Pieter-Jan Kindermans ⋅ Cordelia Schmid
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 394
Score2Instruct: Scaling Up Video Quality-Centric Instructions via Automated Dimension Scoring
Qizhi Xie ⋅ Kun Yuan ⋅ Yunpeng Qu ⋅ Jiachao Gong ⋅ Mingda Wu ⋅ Ming Sun ⋅ Chao Zhou ⋅ Jihong Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 395
CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance
Hanyang Wang ⋅ Yiyang Liu ⋅ Jiawei Chi ⋅ Fangfu Liu ⋅ Ran Xue ⋅ Yueqi Duan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 396
Towards Holistic Modeling for Video Frame Interpolation with Auto-regressive Diffusion Transformers
Xinyu Peng ⋅ Han Li ⋅ Yuyang Huang ⋅ Ziyang Zheng ⋅ Yaoming Wang ⋅ Xin Chen ⋅ Wenrui Dai ⋅ Chenglin Li ⋅ Junni Zou ⋅ Hongkai Xiong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 397
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers
Dahye Kim ⋅ Deepti Ghadiyaram ⋅ Raghudeep Gadde
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 398
Towards High-resolution and Disentangled Reference-based Sketch Colorization
Dingkun Yan ⋅ Xinrui Wang ⋅ Ru Wang ⋅ Zhuoru Li ⋅ Jinze Yu ⋅ Yusuke Iwasawa ⋅ Yutaka Matsuo ⋅ Jiaxian Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 399
MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation
Yiren Song ⋅ Cheng Liu ⋅ Mike Zheng Shou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 400
Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers
Ruidong Chen ⋅ Yancheng Bai ⋅ Xuanpu Zhang ⋅ Jianhao Zeng ⋅ Lanjun Wang ⋅ Dan Song ⋅ Lei Sun ⋅ Xiangxiang Chu ⋅ An-An Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 401
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping
Sunghyun Park ⋅ Jeongho Kim ⋅ Hyoungwoo Park ⋅ Debasmit Das ⋅ Sungrack Yun ⋅ Munawar Hayat ⋅ Jaegul Choo ⋅ Fatih Porikli ⋅ Seokeon Choi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 402
COT-FM: Cluster-wise Optimal Transport Flow Matching
Chiensheng Chiang ⋅ Kuan-Hsun Tu ⋅ Jia-Wei Liao ⋅ Cheng-Fu Chou ⋅ Tsung-Wei Ke
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 403
Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers
Youngjun Jun ⋅ seil kang ⋅ Woojung Han ⋅ Seong Jae Hwang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 404
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
Xingyu Zhou ⋅ Qifan Li ⋅ Xiaobin Hu ⋅ Hai Chen ⋅ Shuhang Gu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 405
CoopDiff: A Diffusion-Guided Approach for Cooperation under Corruptions
Gong Chen ⋅ Chaokun Zhang ⋅ Pengcheng Lv
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 406
RARE: Learn to RAnk and REtrieve for Monocular 3D Object Detection
Hyeonjeong Park ⋅ Peixi Xiong ⋅ Xiaoqian Ruan ⋅ Dian Jia ⋅ Pei Yu ⋅ Wei Tang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 407
COG: Confidence-aware Optimal Geometric Correspondence for Unsupervised Single-reference Novel Object Pose Estimation
Yuchen Che ⋅ JINGTU WU ⋅ Hao ZHENG ⋅ Asako Kanezaki
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 408
Learnability-Driven Submodular Optimization for Active Roadside 3D Detection
Ruiyu Mao ⋅ Baoming Zhang ⋅ Nicholas Ruozzi ⋅ Yunhui Guo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 409
Look Before You Fuse: 2D-Guided Cross-Modal Alignment for Robust 3D Detection
Xiang Li ⋅ Zhangchi Hu ⋅ Xu Xiao ⋅ Bin Kong
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 410
Long-SCOPE: Fully Sparse Long-Range Cooperative 3D Perception
Jiahao Wang ⋅ Zikun Xu ⋅ Yuner Zhang ⋅ Zhongwei Jiang ⋅ Chenyang Lu ⋅ Shuocheng Yang ⋅ Yuxuan Wang ⋅ Jiaru Zhong ⋅ Chuang Zhang ⋅ Shaobing Xu ⋅ Jianqiang Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 411
Dynamics-Aware Preference Optimization for Vision-Language Models
jusheng zhang ⋅ Kaitong Cai ⋅ Jing Yang ⋅ Jian Wang ⋅ Keze Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 412
Selection-as-Nonlinearity: Bridging Attention and Activation via a Joint Game–Decision Lens for Interpretable, Discriminative Visual Representations
Sudong Cai ⋅ Shuai Yuan ⋅ Bingzhi Chen ⋅ Rui Mao ⋅ Bing Wang
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 413
Learning What Helps: Task-Aligned Context Selection for Vision Tasks
Jingyu Guo ⋅ Emir Konuk ⋅ Fredrik Strand ⋅ Christos Matsoukas ⋅ Kevin Smith
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 414
Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR
Yulong Zhang ⋅ Tianyi Liang ⋅ Erfei Cui ⋅ Guoqing Wang ⋅ Xu Guo ⋅ Chenhui Li ⋅ Gongshen Liu
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 415
NeuroRule: Bridging Vision and Logic with Differentiable Rule Induction
Muhammad Zarar ⋅ Mingzheng Zhang ⋅ Xiaowang Zhang ⋅ Zhiyong Feng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 416
Beyond Graph Model: Reliable VLM Fine-Tuning via Random Graph Adapter
Bo Jiang ⋅ Xueyang Ze ⋅ Beibei Wang ⋅ Xixi Wang ⋅ Xixi Wan ⋅ Bin Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 417
Ego: Embedding-Guided Personalization of Vision-Language Models
Soroush Seifi ⋅ Simon Gardier ⋅ Vaggelis Dorovatas ⋅ Daniel Olmeda Reino ⋅ Rahaf Aljundi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 418
JoPPO: Hierarchical Photography Assessment via Contrastive Joint Conditional Probabilistic Reinforcement Learning
Yifan Yang ⋅ Juntuo Wang ⋅ Yuming Qiao ⋅ Xudong Zhang ⋅ Chunyang Yu ⋅ Yan Li ⋅ Xiao Lin ⋅ Liang Luo ⋅ Dan Meng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 419
AeroAgent: A Vision–Physics–Decision Framework for Aerodynamic Vehicle Design
Ye Liu ⋅ Shouyi Li ⋅ Huiyu Yang ⋅ Jianghang gu ⋅ Wenhao Fan ⋅ Zhongxin Yang ⋅ Ding Wang ⋅ Simeng Chen ⋅ Zirun Jiang ⋅ Yuanwei Bin ⋅ Shiyi Chen ⋅ Yuntian Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 420
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
Tianyu Yu ⋅ Zefan Wang ⋅ Chongyi Wang ⋅ Fuwei Huang ⋅ Wenshuo Ma ⋅ Zhihui He ⋅ Tianchi Cai ⋅ Weize Chen ⋅ Yuxiang Huang ⋅ Ranchi Zhao ⋅ Bokai Xu ⋅ Junbo Cui ⋅ Yingjing Xu ⋅ Liqing Ruan ⋅ Luoyuan Zhang ⋅ Hanyu Liu ⋅ Jingkun Tang ⋅ Hongyuan Liu ⋅ Qining Guo ⋅ Wenhao Hu ⋅ Bingxiang He ⋅ Jie Zhou ⋅ Jie Cai ⋅ Ji Qi ⋅ Zonghao Guo ⋅ Chi Chen ⋅ Guoyang Zeng ⋅ Yuxuan Li ⋅ Ganqu Cui ⋅ Ning Ding ⋅ Xu Han ⋅ Yuan Yao ⋅ Zhiyuan Liu ⋅ Maosong Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 421
Prune Wisely, Reconstruct Sharply: Compact 3D Gaussian Splatting via Adaptive Pruning and Difference-of-Gaussian Primitives
Haoran Wang ⋅ Guoxi Huang ⋅ Fan Zhang ⋅ David Bull ⋅ Nantheera Anantrasirichai
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 422
MSCD-GS: Motion-Separated Cooperative Deblurring Dynamic Reconstruction via Gaussian Splatting
yongjian liao ⋅ Xu Zou ⋅ Wenjun Chen ⋅ Huixuan Li ⋅ Xiaoen Xie ⋅ Chunxi Li ⋅ Shixiang Huang ⋅ Gang Zhang ⋅ Jiahuan Zhou ⋅ Sheng Zhong ⋅ Luxin Yan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 423
P2GS: Physical Prior-guided Gaussian Splatting for Photometrically Consistent Urban Reconstruction
Kota Shimomura ⋅ Hidehisa Arai ⋅ Tsubasa Takahashi ⋅ Takayoshi Yamashita ⋅ Hironobu Fujiyioshi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 424
iSplat: Iterative Learning for Fine-Grained Gaussian Splatting
Haifeng Wu ⋅ Wei Long ⋅ Shuhang Gu ⋅ Lixin Duan ⋅ Wen Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 425
Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Arthur Moreau ⋅ Richard Shaw ⋅ Michal Nazarczuk ⋅ Jisu Shin ⋅ Thomas Tanay ⋅ Zhensong Zhang ⋅ Songcen Xu ⋅ Eduardo Pérez-Pellitero
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 426
MAPo: Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction
Han Jiao ⋅ Jiakai Sun ⋅ Yexing Xu ⋅ Lei Zhao ⋅ Wei Xing ⋅ Huaizhong Lin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 427
FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario
Hang Dai ⋅ Hongwei Fan ⋅ Han Zhang ⋅ Duojin Wu ⋅ Jiyao Zhang ⋅ Hao Dong
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 428
HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views
Jiashu Li ⋅ Xumeng Han ⋅ Zhaoyang Wei ⋅ Zipeng Wang ⋅ Kuiran Wang ⋅ Guorong Li ⋅ Zhenjun Han ⋅ Jianbin Jiao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 429
SharpTimeGS: Sharp and Stable Dynamic Gaussian Splatting via Lifespan Modulation
Zhanfeng Liao ⋅ Jiajun Zhang ⋅ Hanzhang Tu ⋅ Zhixi Wang ⋅ Yunqi Gao ⋅ Hongwen Zhang ⋅ Yebin Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 430
Physically Inspired Gaussian Splatting for HDR Novel View Synthesis
Huimin Zeng ⋅ Yue Bai ⋅ hailing wang ⋅ Yun Fu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 431
PhysIR-Splat: Physically Consistent Thermal Infrared Radiative Transfer in 3D Gaussian Splatting
Jingyuan Gao ⋅ Yumeng Hu ⋅ Fei Gao ⋅ Mingjin Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 432
4C4D: 4 Camera 4D Gaussian Splatting
Junsheng Zhou ⋅ Zhifan Yang ⋅ Liang Han ⋅ Wenyuan Zhang ⋅ Kanle Shi ⋅ Shenkun Xu ⋅ Yu-Shen Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 433
SplatSuRe: Selective Super-Resolution for Multi-view Consistent 3D Gaussian Splatting
Pranav Asthana ⋅ Alex Hanson ⋅ Allen Tu ⋅ Tom Goldstein ⋅ Matthias Zwicker ⋅ Amitabh Varshney
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 434
GaussianZoom: Progressive Zoom-in Generative 3D Gaussian Splatting with Geometric and Semantic Guidance
Jiale Shi ⋅ Jiarui Hu ⋅ Zesong Yang ⋅ Kaixuan Luan ⋅ Hujun Bao ⋅ Zhaopeng Cui
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 435
MotionScale: Reconstructing Appearance, Geometry, and Motion of Dynamic Scenes with Scalable 4D Gaussian Splatting
Haoran Zhou ⋅ Gim Hee Lee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 436
PRIMU: Uncertainty Estimation for Novel Views in Gaussian Splatting from Primitive-Based Representations of Error and Coverage
Thomas Gottwald ⋅ Edgar Heinert ⋅ Peter Stehr ⋅ Chamuditha Jayanga Galappaththige ⋅ Matthias Rottmann
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 437
TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion
Rui Qian ⋅ Haozhi Cao ⋅ Tianchen Deng ⋅ TIANXIN HU ⋅ Weixiang Guo ⋅ Shenghai Yuan ⋅ Lihua Xie
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 438
Disco-GS: Gaussian Splatting in Dynamic Color Lighting
Ashish Kumar ⋅ A. N. Rajagopalan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 439
ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
Alberto Compagnoni ⋅ Marco Morini ⋅ Sara Sarto ⋅ Federico Cocchi ⋅ Davide Caffagni ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Rita Cucchiara
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 440
GuardTrace-VL: Detecting Unsafe Multimodel Reasoning via Iterative Safety Supervision
Yuxiao Xiang ⋅ Junchi Chen ⋅ Zhenchao Jin ⋅ Changtao Miao ⋅ Haojie Yuan ⋅ Qi Chu ⋅ Tao Gong ⋅ Nenghai Yu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 441
AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
Zichuan Lin ⋅ Yicheng Liu ⋅ Yang Yang ⋅ Lvfang Tao ⋅ Deheng Ye
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 442
See It, Say It, Sorted: An Iterative Training-Free Framework for Visually-Grounded Multimodal Reasoning in LVLMs
Yongchang Zhang ⋅ Xianzheng Ma ⋅ Tianyi Liu ⋅ Guangquan Zhou ⋅ Yang Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 443
Will Multimodal Models Be Dazzled by Multi-Image Visual Puzzles?
zhi zhu ⋅ YaoQi Fan ⋅ Zhe Chen ⋅ Yue Cao ⋅ Yangzhou Liu ⋅ Tong Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 444
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking
Yufei Zhan ⋅ Ziheng Wu ⋅ Yousong Zhu ⋅ Rongkun Xue ⋅ Guanghao Zhou ⋅ Ruipu Luo ⋅ Zhenghao Chen ⋅ Can Zhang ⋅ Yifan Li ⋅ Zhentao he ⋅ Zheming Yang ⋅ Ming Tang ⋅ Minghui Qiu ⋅ Jinqiao Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 445
Visual Grounding for Object Questions
Martin Nicolas Everaert ⋅ Xiruo Liu ⋅ Hiroyuki Takeda ⋅ Raja Bala ⋅ Vivek Yadav ⋅ Vidya Narayanan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 446
CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal Reasoning
Yongxin Wang ⋅ Zhicheng Yang ⋅ Meng Cao ⋅ Mingfei Han ⋅ Haokun Lin ⋅ Yingying Zhu ⋅ Xiaojun Chang ⋅ Xiaodan Liang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 447
What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models
Yingqi Fan ⋅ Junlong Tong ⋅ Anhao Zhao ⋅ Xiaoyu Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 448
Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models
Jialiang Zhang ⋅ Junlong Tong ⋅ Junyan Lin ⋅ Hao Wu ⋅ Yirong Sun ⋅ Yunpu Ma ⋅ Xiaoyu Shen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 449
Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Rui Liu ⋅ Dian Yu ⋅ Lei Ke ⋅ Haolin Liu ⋅ Yujun Zhou ⋅ Zhenwen Liang ⋅ Haitao Mi ⋅ Pratap Tokekar ⋅ Dong Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 450
Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization
Yifan Du ⋅ Kun Zhou ⋅ Yingqian Min ⋅ Yue Ling ⋅ Wayne Xin Zhao ⋅ Youbin Wu ⋅ Ji-Rong Wen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 451
Monet: Reasoning in Latent Visual Space Beyond Image and Language
Qixun Wang ⋅ Yang Shi ⋅ Yifei Wang ⋅ Yuanxing Zhang ⋅ Pengfei Wan ⋅ Kun Gai ⋅ Xianghua Ying ⋅ Yisen Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 452
STAR-R1: Multi-View Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs
Zongzhao Li ⋅ Zongyang Ma ⋅ Mingze Li ⋅ Songyou Li ⋅ Yu Rong ⋅ Tingyang Xu ⋅ Ziqi Zhang ⋅ Deli Zhao ⋅ Wenbing Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 453
From Where Things Are to What They Are For: Benchmarking Spatial–Functional Intelligence in Multimodal LLMs
Le Zhang ⋅ Jihan Yang ⋅ Soundarya Krishnan ⋅ Jimit Majmudar ⋅ Xiou Ge ⋅ Prasoon Puri ⋅ Prathamesh Saraf ⋅ Shruti Bhargava ⋅ Dhivya Piraviperumal ⋅ Yinan Ling ⋅ Cindy Pan ⋅ Hong Yu ⋅ Aishwarya Agrawal ⋅ Bo-Hsiang Tseng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 454
Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models
Ruiying Peng ⋅ Xueyu Wu ⋅ Jing Lei ⋅ Lu Hou ⋅ Yuanzheng Ma ⋅ Xiaohui Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 455
S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations
Arnav Chavan ⋅ Nahush Lele ⋅ Udbhav Bamba ⋅ Sankalp Dayal ⋅ Aditi Raghunathan ⋅ Deepak Gupta
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 456
OneSparse: A Unified Framework for Sparse Activation Layers in Vision Models
Xingkui Zhu ⋅ Dingkang Liang ⋅ Cheng Chen ⋅ Daoxin Zhang ⋅ lv hanxiang ⋅ Zhe Xu ⋅ Yao Hu ⋅ Xiang Bai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 457
What Matters in Practical Learned Image Compression
Kedar Tatwawadi ⋅ Parisa Rahimzadeh ⋅ Zhanghao Sun ⋅ Zhiqi Chen ⋅ Ziyun Yang ⋅ Sanjay Nair ⋅ Divija Hasteer ⋅ Oren Rippel
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 458
BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
Chaodong XIAO ⋅ Zhengqiang ZHANG ⋅ Lei Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 459
Ultra-Low Bitrate Perceptual Image Compression with Shallow Encoder
Tianyu Zhang ⋅ Dong Liu ⋅ Chang Wen Chen
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 460
LazyVAR: Accelerating Visual Autoregressive Models via Scale-wise Token Pruning and Parallel Group Decoding
Rongge Mao ⋅ Chengqi Dong ⋅ S Kevin Zhou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 461
Spk2VidNet: A Hierarchical Recurrent Architecture for High-Fidelity Video Reconstruction from Long Spike-Camera Streams
Yuanlin Wang ⋅ Ruiqin Xiong ⋅ Jiyu Xie ⋅ Zhenkun Zhu ⋅ Zhaofei Yu ⋅ Xiaopeng Fan ⋅ Tiejun Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 462
Adaptive Learned Image Compression with Graph Neural Networks
Yunuo Chen ⋅ Bing He ⋅ Zezheng Lyu ⋅ Hongwei Hu ⋅ Qunshan Gu ⋅ Yuan Tian ⋅ Guo Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 463
SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation
Zixuan Pan ⋅ Kaiyuan Tang ⋅ Jun Xia ⋅ Yifan Qin ⋅ Lin Gu ⋅ Chaoli Wang ⋅ Jianxu Chen ⋅ Yiyu Shi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 464
VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping
Haotian Dong ⋅ Ye Li ⋅ Rongwei Lu ⋅ Chen Tang ⋅ Shu-Tao Xia ⋅ Zhi Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 465
HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition
Suhan Woo ⋅ Seongwon Lee ⋅ jinwoo jang ⋅ Euntai Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 466
LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment
Shuaibang Peng ⋅ Juelin Zhu ⋅ Xia Li ⋅ Kun Yang ⋅ Yu Liu ⋅ Maojun Zhang ⋅ Shen Yan
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 467
CoLoR: The Devil is in Scene Coordinate Regression for Large-Scale Visual Localization
Xindong Mao ⋅ Hang Li ⋅ Yuchen Wu ⋅ Jiahe Li ⋅ Xiao Bai ⋅ Jin Zheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 468
Affine Perspective-Three-Point Problem
Gaku Nakano
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 469
Sky2Ground: A Benchmark for Site Modeling under Varying Altitude
Zengyan Wang ⋅ Sirshapan Mitra ⋅ Rajat Modi ⋅ Hui Xian Grace Lim ⋅ Yogesh Rawat
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 470
SemanticVLA: Towards Semantic Reasoning over Action Memorization via Synergistic Explicit Trace and Latent Action Planning
Fei Ni ⋅ Zhuo Chen ⋅ Yifu Yuan ⋅ Zibin Dong ⋅ Xianze Yao ⋅ Shan Luo ⋅ Jianye Hao ⋅ Jiankang Deng ⋅ Stefanos Zafeiriou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 471
WebGym: Scaling Training Environments for Long-Horizon Visual Web Agents with Realistic Tasks
Hao Bai ⋅ Alexey Taymanov ⋅ Tong Zhang ⋅ Aviral Kumar ⋅ Spencer Whitehead
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 472
Beyond Perceptual Shortcuts: Causal-Inspired Debiasing Optimization for Generalizable Video Reasoning in Lightweight MLLMs
Jingze Wu ⋅ Quan Zhang ⋅ Hongfei Suo ⋅ Zeqiang Cai ⋅ Hongbo Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 473
APPO: Attention-guided Perception Policy Optimization for Video Reasoning
Henghui Du ⋅ Chang Zhou ⋅ Xi Chen ⋅ Di Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 474
RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward
Qiucheng Wu ⋅ Jing Shi ⋅ Simon Jenni ⋅ Kushal Kafle ⋅ Tianyu Wang ⋅ Shiyu Chang ⋅ Handong Zhao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 475
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
Yaolun Zhang ⋅ Ruohui Wang ⋅ Jiahao Wang ⋅ Yepeng Tang ⋅ Xuanyu Zheng ⋅ Haonan Duan ⋅ Hao Lu ⋅ Hanming Deng ⋅ Lewei Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 476
Visual Document Understanding and Reasoning: A Multi-Agent Collaboration Framework with Agent-Wise Adaptive Test-Time Scaling
Xinlei Yu ⋅ Chengming Xu ⋅ Zhangquan Chen ⋅ Yudong Zhang ⋅ Shilin Lu ⋅ Cheng Yang ⋅ Jiangning Zhang ⋅ Shuicheng Yan ⋅ Xiaobin Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 477
GazeOnce360: Fisheye-Based 360° Multi-Person Gaze Estimation with Global–Local Feature Fusion
Zhuojiang Cai ⋅ Zhenghui Sun ⋅ Feng Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 478
Bridging Human Evaluation to Infrared and Visible Image Fusion
Jinyuan Liu ⋅ Xingyuan Li ⋅ Qingyun Mei ⋅ HaoYuan Xu ⋅ Zhiying Jiang ⋅ Long Ma ⋅ Risheng Liu ⋅ Xin Fan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 479
Beyond Strict Pairing: Arbitrarily Paired Training for High-Performance Infrared and Visible Image Fusion
Yanglin Deng ⋅ Tianyang Xu ⋅ Chunyang Cheng ⋅ Hui Li ⋅ Xiao-Jun Wu ⋅ Josef Kittler
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 480
Semantic-Adaptive Diffusion for Dynamic Spatiotemporal Fusion
Jinsong Zhang ⋅ Ying Qu ⋅ Yuan Liao ⋅ Hairong Qi ⋅ Zhenzhou Shao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 481
Bayesian Decomposition and Semantic Completion for Few-shot Semantic Segmentation
Guangchen Shi ⋅ Yirui Wu ⋅ Wei Zhu ⋅ Tao Wang ⋅ Hao Zhang ⋅ Bo Li ⋅ Tong Lu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 482
From Few-way to Many-way: Rethinking Few-shot Fine-grained Image Classification
Li-Jun Zhao ⋅ Zhen-Duo Chen ⋅ Xin Luo ⋅ Xin-Shun Xu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 483
STiTch: Semantic Transition and Transportation in Collaboration for Training-Free Zero-Shot Composed Image Retrieval
Miaoge Li ⋅ Dongsheng Wang ⋅ Zening Sun ⋅ Jinsen Zhang ⋅ Wenhan Luo ⋅ Jingcai Guo
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 484
Selective, Regularized, and Calibrated: Harnessing Vision Foundation Models for Cross-Domain Few-Shot Semantic Segmentation
junyuan ma ⋅ Xunzhi Xiang ⋅ Wenbin Li ⋅ Qi Fan ⋅ Yang Gao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 485
FlowComposer: Composable Flows for Compositional Zero-Shot Learning
Zhenqi He ⋅ Lin Li ⋅ Long Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 486
ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation
Ayush Roy ⋅ Wei-Yang Alex Lee ⋅ Rudrasis Chakraborty ⋅ Vishnu Suresh Lokhande
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 487
DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models
Qichao Wang ⋅ Yunhong Lu ⋅ Hengyuan Cao ⋅ Junyi Zhang ⋅ Min Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 488
UniRain: Unified Image Deraining with RAG-based Dataset Distillation and Multi-objective Reweighted Optimization
Qianfeng Yang ⋅ Qiyuan Guan ⋅ Xiang Chen ⋅ Jiyu Jin ⋅ Guiyue Jin ⋅ Jiangxin Dong
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 489
Leveraging Multispectral Sensors for Color Correction in Mobile Cameras
Luca Cogo ⋅ Marco Buzzelli ⋅ Simone Bianco ⋅ Javier Vazquez-Corral ⋅ Raimondo Schettini
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 490
Differentiable Adaptive 4D Structured Illumination for Joint Capture of Shape and Reflectance
Huakeng Ding ⋅ Yaowen Chen ⋅ Kun Zhou ⋅ Hongzhi Wu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 491
Optical Diffraction-based Convolution for Semiconductor Lithography
Young-Han Son ⋅ Dong-Hee Shin ⋅ Deok-Joong Lee ⋅ Hyun Jung Lee ⋅ Tae-Eui Kam
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 492
GSNR: Graph Smooth Null-Space Representation for Inverse Problems
Romario Gualdrón-Hurtado ⋅ Roman Jacome ⋅ Rafael S. Suárez ⋅ Henry Arguello
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 493
MatE: Material Extraction from Single-Image via Geometric Prior
Zeyu Zhang ⋅ Wei Zhai ⋅ Jian Yang ⋅ Yang Cao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 494
αMatte4K & µMatting: Dataset and Model for Ultra-Micro Precision Alpha Video Matting
Xinyi Chen ⋅ Hang Dong ⋅ Baowei Jiang ⋅ Shenkun Xu ⋅ Youqi Guan ⋅ Kanle Shi ⋅ Kun Gai ⋅ Haichuan Song
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 495
Revisiting Optimal Coding for I-ToF under Practical Sensor Constraints
WENBIN LUO ⋅ Takafumi Iwaguchi ⋅ Ryusuke Sagawa ⋅ Hiroshi Kawasaki
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 496
Dynamic Black-hole Emission Tomography with Physics-informed Neural Fields
Berthy T. Feng ⋅ Andrew A. Chael ⋅ David Bromley ⋅ Aviad Levis ⋅ William Freeman ⋅ Katherine L. Bouman
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 497
Exploring Spatiotemporal Feature Propagation for Video-Level Compressive Spectral Reconstruction: Dataset, Model and Benchmark
Lijing Cai ⋅ Zhan Shi ⋅ Chenglong Huang ⋅ Jinyao Wu ⋅ Qiping Li ⋅ Zikang Huo ⋅ Linsen Chen ⋅ Chongde Zi ⋅ Xun Cao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 498
Generalizable Radio-Frequency Radiance Fields for Spatial Spectrum Synthesis
Kang Yang ⋅ Yuning Chen ⋅ Wan Du
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 499
SAR2Net: Learning Spatially Anchored Representations for Retrieval-Guided Cross-Stain Alignment
Tianle Shen ⋅ Fang Yan ⋅ Xiaofan Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 500
Advancing Cancer Prognosis with Hierarchical Fusion of Genomic, Proteomic and Pathology Imaging Data from a Systems Biology Perspective
Junjie Zhou ⋅ Bao Xue ⋅ Meiling Wang ⋅ WEI SHAO ⋅ Daoqiang Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 501
PromptStereo: Zero-Shot Stereo Matching via Structure and Motion Prompts
Xianqi Wang ⋅ Hao Yang ⋅ Hangtian Wang ⋅ JunDa Cheng ⋅ Gangwei Xu ⋅ Min Lin ⋅ Xin Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 502
Any Resolution Any Geometry: From Multi-View To Multi-Patch
Wenqing Cui ⋅ Zhenyu Li ⋅ Mykola Lavreniuk ⋅ Jian Shi ⋅ Ramzi Idoughi ⋅ Xiangjun Tang ⋅ Peter Wonka
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 503
Paparazzo: Active Mapping of Moving 3D Objects
Davide Allegro ⋅ Shiyao Li ⋅ Stefano Ghidoni ⋅ Vincent Lepetit
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 504
DepthFocus: Controllable Depth Estimation for See-Through Scenes
junhong min ⋅ Jimin Kim ⋅ Minwook Kim ⋅ Cheol-Hui Min ⋅ YOUNGPIL JEON ⋅ Minyong Choi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 505
OVI-MAP: Open-Vocabulary Instance-Semantic Mapping
Zilong Deng ⋅ Federico Tombari ⋅ Marc Pollefeys ⋅ Johanna Wald ⋅ Daniel Barath
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 506
PTC-Depth: Pose-Refined Monocular Depth Estimation with Temporal Consistency
Leezy Han ⋅ Seunggyu Kim ⋅ Dongseok Shim ⋅ Hyeonbeom Lee
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 507
SceneScribe-1M: A Large-Scale Video Dataset with Comprehensive Geometric and Semantic Annotations
Yunnan Wang ⋅ Kecheng Zheng ⋅ Jianyuan Wang ⋅ Minghao Chen ⋅ David Novotny ⋅ Christian Rupprecht ⋅ Yinghao Xu ⋅ Xing Zhu ⋅ Wenjun Zeng ⋅ Xin Jin ⋅ Yujun Shen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 508
Omni-3DEdit: Generalized Versatile 3D Editing in One-Pass
Liyi Chen ⋅ Pengfei Wang ⋅ Guowen Zhang ⋅ Zhiyuan Ma ⋅ Lei Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 509
Ani3DHuman: Photorealistic 3D Human Animation with Self-guided Stochastic Sampling
Qi Sun ⋅ Can Wang ⋅ Jiaxiang Shang ⋅ Yingchun Liu ⋅ Jing Liao
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 510
Variational Graph-based Normal Integration
Lixiong Chen ⋅ Bohan Yu ⋅ Victor Adrian Prisacariu ⋅ Imari Sato
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 511
Vinedresser3D: Towards Agentic Text-guided 3D Editing
Yankuan Chi ⋅ Xiang Li ⋅ Zixuan Huang ⋅ James M.
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 512
MV2UV: Generating High-quality UV Texture Maps with Multiview Prompts
Zheng Zhang ⋅ Qinchuan Zhang ⋅ Yuteng Ye ⋅ Zhi Chen ⋅ Penglei Ji ⋅ Mengfei Li ⋅ Wenxiao ZHANG ⋅ Yuan Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 513
Learning Hierarchical Hyperbolic Mixture Model for Part-aware 3D Generation
Qitong Yang ⋅ Mingtao Feng ⋅ Zijie Wu ⋅ Huixin Zhu ⋅ Weisheng Dong ⋅ Yaonan Wang ⋅ Ajmal Mian
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 514
MeshRipple: Structured Autoregressive Generation of Artist-Meshes
JunKai Lin ⋅ Hang Long ⋅ Huipeng Guo ⋅ Jielei Zhang ⋅ JiaYi Yang ⋅ Tianle Guo ⋅ Yang Yang ⋅ Jianwen Li ⋅ Wenxiao ZHANG ⋅ Matthias Nießner ⋅ Wei Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 515
FACE: A Face-based Autoregressive Representation for High-Fidelity and Efficient Mesh Generation
Hanxiao Wang ⋅ Yuanchen Guo ⋅ Ying-Tian Liu ⋅ Zi-Xin Zou ⋅ Biao Zhang ⋅ Weize Quan ⋅ Ding Liang ⋅ Yan-Pei Cao ⋅ Dong-Ming Yan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 516
Easy3E: Feed-Forward 3D Asset Editing via Rectified Voxel Flow
Shimin Hu ⋅ Yuanyi Wei ⋅ Fei Zha ⋅ Yudong Guo ⋅ Juyong Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 517
CUPID: Generative 3D Reconstruction via Joint Object and Pose Modeling
Binbin Huang ⋅ Haobin Duan ⋅ Yiqun Zhao ⋅ Zibo Zhao ⋅ Yi Ma ⋅ Shenghua Gao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 518
3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image
Ze-Xin Yin ⋅ Liu Liu ⋅ Xinjie wang ⋅ Wei Sui ⋅ Zhizhong Su ⋅ Jian Yang ⋅ Jin Xie
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 519
DRM: Diffusion-based Reward Model With Step-wise Guidance
Jaxon Zhang ⋅ Binxin Yang ⋅ Hubery Yin ⋅ Chen Li ⋅ Jing LYU
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 520
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning
Chubin Chen ⋅ Sujie Hu ⋅ Jiashu Zhu ⋅ Meiqi Wu ⋅ Jintao Chen ⋅ Yanxun Li ⋅ Nisha Huang ⋅ Chengyu Fang ⋅ Jiahong Wu ⋅ Xiangxiang Chu ⋅ Xiu Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 521
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation
Xinyao Liao ⋅ QIYUAN HE ⋅ Kai Xu ⋅ Xiaoye Qu ⋅ Yicong Li ⋅ Wei Wei ⋅ Angela Yao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 522
SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models
Jiesong Lian ⋅ Ruizhe Zhong ⋅ Zixiang Zhou ⋅ Xiaoyue Mi ⋅ Long Hu ⋅ Yuan Zhou ⋅ qinglin lu ⋅ yixue Hao ⋅ Junchi Yan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 523
AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
Jiahao Wang ⋅ Hualian Sheng ⋅ Sijia Cai ⋅ Yuxiao Yang ⋅ Weizhan Zhang ⋅ Caixia Yan ⋅ Bing Deng ⋅ Jieping Ye
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 524
Style-GRPO: Semantic-Aware Preference Optimization for Image Style Transfer Guided by Reward Modeling
Jianbin Zhao ⋅ Chaoran Feng ⋅ Miao Yu ⋅ Yingtao Li ⋅ Zhenyu Tang ⋅ Wangbo Yu ⋅ Yian Zhao ⋅ Xiaomin Li ⋅ Li Yuan ⋅ Yonghong Tian
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 525
LAMP: Language-Assisted Motion Planning for Controllable Video Generation
Muhammed Burak Kizil ⋅ Enes Şanlı ⋅ Niloy J. Mitra ⋅ Erkut Erdem ⋅ Aykut Erdem ⋅ Duygu Ceylan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 526
Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization
Tahira Kazimi ⋅ Connor Dunlop ⋅ Pinar Yanardag
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 527
Spectral Scalpel: Amplifying Adjacent Action Discrepancy via Frequency-Selective Filtering for Skeleton-Based Action Segmentation
Haoyu Ji ⋅ Bowen Chen ⋅ Zhihao Yang ⋅ Wenze Huang ⋅ Yu Gao ⋅ Xueting Liu ⋅ Weihong Ren ⋅ Zhiyong Wang ⋅ Honghai LIU
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 528
DETACH : Decomposed Spatio-Temporal Alignment for Exocentric Video and Ambient Sensors with Staged Learning
Junho Yoon ⋅ Jaemo Jeong ⋅ Hyunju Kim ⋅ Dongman Lee
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 529
Learning a Unified Latent Action Space from Videos with Action-centric Cycle Consistency
Guangyan Chen ⋅ Qi Shao ⋅ Te Cui ⋅ Zichen Zhou ⋅ Weixin Mao ⋅ Luojie Yang ⋅ Meiling Wang ⋅ Yi Yang ⋅ Hua Chen ⋅ Yufeng Yue
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 530
VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition
Tanush Yadav ⋅ Reza Salehi ⋅ Jae Sung Park ⋅ Vivek Ramanujan ⋅ Hannaneh Hajishirzi ⋅ Yejin Choi ⋅ Ali Farhadi ⋅ Rohun Tripathi ⋅ Ranjay Krishna
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 531
BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning
Yuhan Xie ⋅ Chen Lyu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 532
Dynamic Momentum Recalibration in Online Gradient Learning
Zhipeng Yao ⋅ Rui Yu ⋅ Guisong Chang ⋅ Ying Li ⋅ Yu Zhang ⋅ Dazhou Li
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 533
Spherical Leech Quantization for Visual Tokenization and Generation
Yue Zhao ⋅ Hanwen Jiang ⋅ Zhenlin Xu ⋅ Chutong Yang ⋅ Ehsan Adeli ⋅ Philipp Krähenbühl
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 534
MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention
Pedro M. P. Curvo ⋅ Jan-Willem van de Meent ⋅ Maksim Zhdanov
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 535
GR-Gauge: Cost-efficient Training Configuration By Gauging the Gradient Redundancy
Guanjie Wang ⋅ Chen Chen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 536
E^2-SCI: Elastic Edge–Cloud Speculative Decoding via Credit Inertia
Senyao Li ⋅ Haozhao Wang ⋅ Zhaobai Jiang ⋅ Zhanbo Jin ⋅ Hao Fan ⋅ Ruixuan Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 537
HyperNAS: Enhancing Architecture Representation for NAS Predictor via Hypernetwork
Jindi Lv ⋅ Yuhao Zhou ⋅ Yuxin Tian ⋅ Qing Ye ⋅ Wentao Feng ⋅ Jiancheng Lv
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 538
NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity
Weijian Mai ⋅ Mu Nan ⋅ Yu Zhu ⋅ Jiahang Cao ⋅ Rui Zhang ⋅ Yuqin Dai ⋅ Chunfeng Song ⋅ Andrew F. Luo ⋅ Jiamin Wu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 539
Spectral Conformal Risk Control: Distribution-Free Tail Guarantees via Bayesian Quadrature
Mohammad Mahdi Kazemi Esfeh ⋅ Qi Yan ⋅ Yongxing Zhang ⋅ Zahra Gholami ⋅ Renjie Liao ⋅ Purang Abolmaesumi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 540
Edge-RecViT: Efficient Vision Transformer via Semantic-Refined Dynamic Recursion
YiZhou Li ⋅ Jinyi Xu ⋅ Mingyu Yin ⋅ Xianyi Zhao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 541
ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization
Anzhe Cheng ⋅ Shukai Duan ⋅ Shixuan Li ⋅ Chenzhong Yin ⋅ Mingxi Cheng ⋅ Heng Ping ⋅ Tamoghna Chattopadhyay ⋅ Sophia Thomopoulos ⋅ Shahin Nazarian ⋅ Paul Thompson ⋅ Paul Bogdan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 542
GUI-SAGE: Enhancing GUI Automation with Self-Explanatory Learning
Fei Tang ⋅ Zhangxuan Gu ⋅ Zhengxi Lu ⋅ Shangzhan Zhang ⋅ Zhengwen Zeng ⋅ Shuheng Shen ⋅ Changhua Meng ⋅ Yuchen Yan ⋅ Wenqi Zhang ⋅ Yongliang Shen ⋅ Weiming Lu ⋅ Yueting Zhuang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 543
GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks
Saelyne Yang ⋅ Jaesang Yu ⋅ Yi-Hao Peng ⋅ Kevin Qinghong Lin ⋅ Jae Won Cho ⋅ Yale Song ⋅ Juho Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 544
HiconAgent: History Context-aware Policy Optimization for GUI Agents
Xurui Zhou ⋅ Gongwei Chen ⋅ Yuquan Xie ⋅ Zaijing Li ⋅ Kaiwen Zhou ⋅ Shuai Wang ⋅ Shuo Yang ⋅ Zhuotao Tian ⋅ Rui Shao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 545
PET-DINO: Unifying Visual Cues into Grounding DINO with Prompt-Enriched Training
Weifu Fu ⋅ Jinyang Li ⋅ Bin-Bin Gao ⋅ Jialin Li ⋅ Yuhuan Lin ⋅ Hanqiu Deng ⋅ Wenbing Tao ⋅ Yong Liu ⋅ Chengjie Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 546
SDDF: Specificity-Driven Dynamic Focusing for Open-Vocabulary Camouflaged Object Detection
Jiaming Liang ⋅ Yifeng Zhan ⋅ Chunlin Liu ⋅ Weihua Zheng ⋅ bingye Peng ⋅ Qiwei Liang ⋅ Boyang Cai ⋅ Xiaochun Mai ⋅ Qiang Nie
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 547
Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset
Tsai-Ching Ni ⋅ ZhenQi Chen ⋅ YuanFu Yang
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 548
Common Inpainted Objects In-N-Out of Context
Tianze Yang ⋅ Tyson Jordan ⋅ Ruitong Sun ⋅ Ninghao Liu ⋅ Jin Sun
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 549
Prompt-Free Universal Region Proposal Network
Qihong Tang ⋅ Changhan Liu ⋅ Shaofeng Zhang ⋅ Wenbin Li ⋅ Qi Fan ⋅ Yang Gao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 550
Rewis3d: Reconstruction Improves Weakly-Supervised Semantic Segmentation
Jonas Ernst ⋅ Wolfgang Boettcher ⋅ Lukas Hoyer ⋅ Jan Lenssen ⋅ Bernt Schiele
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 551
PaNDaS: Learnable Shape Interpolation Modeling with Localized Control
Thomas Besnier ⋅ Emery Pierson ⋅ Sylvain Arguillere ⋅ Maks Ovsjanikov ⋅ Mohamed Daoudi
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 552
Hilbert Curve-Based Attention Enabling Topology-Preserving Image Tensor Representation for Semantic Segmentation Network
Linkang Xu ⋅ Gang Li ⋅ Yue Song ⋅ Xiangxin Ji
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 553
Towards High-Quality Image Segmentation: Improving Topology Accuracy by Penalizing Neighbor Pixels
J. Miguel Valverde ⋅ Dim P. Papadopoulos ⋅ Rasmus Larsen ⋅ Anders Bjorholm Dahl
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 554
SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains
Qingmei Li ⋅ Yang Zhang ⋅ peifeng zhang ⋅ Haohuan Fu ⋅ Juepeng Zheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 555
Better than Average: Spatially-Aware Aggregation of Segmentation Uncertainty Improves Downstream Performance
Vanessa Emanuela Guarino ⋅ Claudia Winklmayr ⋅ Jannik Franzen ⋅ Josef Rumberger ⋅ Manuel Pfeuffer ⋅ Sonja Greven ⋅ Klaus Maier-Hein ⋅ Dagmar Kainmueller ⋅ Christoph Karg ⋅ Carsten T. Lüth
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 556
Universal 3D Shape Matching via Coarse-to-Fine Language Guidance
Qinfeng Xiao ⋅ Guofeng Mei ⋅ Bo Yang ⋅ Zhang Liying ⋅ Liying Zhang ⋅ Kit-lun Yick
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 557
Direct Segmentation without Logits Optimization for Training-Free Open-Vocabulary Semantic Segmentation
Jiahao Li ⋅ Yang Lu ⋅ Yachao Zhang ⋅ Fangyong Wang ⋅ Yuan Xie ⋅ Yanyun Qu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 558
CDICS: Delving Into Fine-Grained Attribute for In-Context Segmentation via Compositional Prompts and Phased Decoupling
Zhiyu Li ⋅ Dianmo Sheng ⋅ Qi Chu ⋅ Shilong Chen ⋅ Tao Gong ⋅ Zhou Wei ⋅ Nenghai Yu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 559
Discriminative Perception via Anchored Description for Reasoning Segmentation
Tao Yang ⋅ Qing Zhou ⋅ Yanliang Li ⋅ Qi Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 560
SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images
Zepeng Xin ⋅ Kaiyu Li ⋅ Luodi Chen ⋅ Wanchen Li ⋅ Xiao Yuchen ⋅ Hui Qiao ⋅ Weizhan Zhang ⋅ Deyu Meng ⋅ Xiangyong Cao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 561
Cross-Scale Pansharpening via ScaleFormer and the PanScale Benchmark
Ke Cao ⋅ Xuanhua He ⋅ Xueheng Li ⋅ Lingting Zhu ⋅ Yingying Wang ⋅ Ao Ma ⋅ Zhanjie Zhang ⋅ Man Zhou ⋅ Chengjun Xie ⋅ Jie Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 562
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
Shilei Cao ⋅ Ziyang Gong ⋅ Hehai Lin ⋅ Yang Liu ⋅ Jiashun Cheng ⋅ Xiaoxing Hu ⋅ Haoyuan Liang ⋅ Guowen Li ⋅ Chengwei Qin ⋅ Hong Cheng ⋅ Xue Yang ⋅ Juepeng Zheng ⋅ Haohuan Fu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 563
Multigrain-aware Semantic Prototype Scanning and Tri-Token Prompt Learning Embraced High-Order RWKV for Pan-Sharpening
Junfeng Li ⋅ Wenyang Zhou ⋅ Xueheng Li ⋅ Xuanhua He ⋅ Jianhou Gan ⋅ Wenqi Ren
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 564
ACPV-Net: All-Class Polygonal Vectorization for Seamless Vector Map Generation from Aerial Imagery
Weiqin Jiao ⋅ Hao Cheng ⋅ George Vosselman ⋅ Claudio Persello
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 565
Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction
wenfei guan ⋅ Jilin Mei ⋅ Tong Shen ⋅ Xumin Wu ⋅ Shuo Wang ⋅ Chen Min ⋅ Yu Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 566
Rotation Invariant and Symmetry Aware Pixel Difference Network for Remote Sensing Object Detection
Jialei Zhan ⋅ Li Liu ⋅ Jiehua Zhang ⋅ Yuhang Xie ⋅ Yongxiang Liu ⋅ Jiangming Chen ⋅ Mingming Cheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 567
F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation
Hengzhi Chen ⋅ Liqian Feng ⋅ Wenhua Wu ⋅ Xiaogang Zhu ⋅ Qiuxia Wu ⋅ Lianlei Shan ⋅ Kun Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 568
RoadGIE: Towards A Global-Scale Aerial Benchmark for Generalizable Interactive Road Extraction
Chenxu Peng ⋅ Chenxu Wang ⋅ Yimian Dai ⋅ Yongxiang Liu ⋅ Mingming Cheng ⋅ Xiang Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 569
PGA: Prior-free Generative Attack for Practical No-box Scenario
hongyu peng ⋅ Xiang Yuan ⋅ Gong Cheng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 570
Lipschitz Optimization for Formal Verification of Homographies
Jean-Guillaume Durand ⋅ Panagiotis Kouvaros ⋅ Maxime Gariel ⋅ Alessio Lomuscio
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 571
Batman: Benign Knowledge Alignment Through Malicious Null Space in Federated Backdoor Attack
Wenwen He ⋅ Wenke Huang ⋅ Yiyang Fang ⋅ Wenjie Qu ⋅ Jiaheng Zhang ⋅ Mang Ye
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 572
Out of Sight, Out of Track: Adversarial Attacks on Propagation-based Multi-Object Trackers via Query State Manipulation
Halima Bouzidi ⋅ Haoyu Liu ⋅ Yonatan Achamyeleh ⋅ Praneetsai Iddamsetty ⋅ Mohammad Al Faruque
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 573
Eliminate Distance Differences Induced by Backdoor Attacks: Layer-Selective Training and Clipping to Mask Backdoor Models
Xuzeng Li ⋅ Tao Zhang ⋅ Xiangyun Tang ⋅ JIACHENG WANG ⋅ Jian Wang ⋅ Jiawen Kang ⋅ Jiqiang Liu ⋅ Zhen Han ⋅ Dusit Niyato ⋅ Dong In Kim
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 574
Mitigating Error Amplification in Fast Adversarial Training
Mengnan Zhao ⋅ Lihe Zhang ⋅ Bo Wang ⋅ Tianhang Zheng ⋅ Hong Zhong ⋅ Geyong Min
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 575
Physical Adversarial Clothing Evades Visible-Thermal Detectors via Non-Overlapping RGB-T Pattern
Xiaopei Zhu ⋅ Guanning Zeng ⋅ Zhanhao Hu ⋅ Jun Zhu ⋅ Xiaolin Hu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 576
What Your Features Reveal: Data-Efficient Black-Box Feature Inversion Attack for Split DNNs
Zhihan Ren ⋅ Lijun He ⋅ Jiaxi Liang ⋅ Xinzhu Fu ⋅ Haixia Bi ⋅ Fan Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 577
Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures
Zeyao Liu ⋅ Zhendong Zhao ⋅ Xiaojun Chen ⋅ Xin Zhao ⋅ Yuexin Xuan ⋅ XIAOSHUANG JI
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 578
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation
Yongjie Bai ⋅ Zhouxia Wang ⋅ Yang Liu ⋅ Kaijun Luo ⋅ Yifan Wen ⋅ Mingtong Dai ⋅ weixing chen ⋅ Ziliang Chen ⋅ Lingbo Liu ⋅ Guanbin Li ⋅ Liang Lin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 579
Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment
Tao Lin ⋅ Yilei Zhong ⋅ Yuxin Du ⋅ Jingjing Zhang ⋅ Jiting Liu ⋅ Yinxinyu Chen ⋅ Encheng Gu ⋅ Ziyan Liu ⋅ Hongyi Cai ⋅ Yanwen Zou ⋅ Lixing Zou ⋅ Zhaoye Zhou ⋅ Gen Li ⋅ Bo Zhao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 580
FM-Steer: Enhance Generalist Policies with Value-Guided Cascaded Denoising
Haoming Song ⋅ Delin Qu ⋅ Yuanqi Yao ⋅ Qizhi Chen ⋅ Jiarui Li ⋅ Qi Lv ⋅ Yiwen Tang ⋅ Li Kang ⋅ Heng Zhou ⋅ Xianqiang Gao ⋅ Yuhang Tang ⋅ Xiaofan Li ⋅ Modi Shi ⋅ Guangrui Ren ⋅ Maoqing Yao ⋅ Bin Zhao ⋅ Dong Wang ⋅ Xuelong Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 581
Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning
Qiwei Liang ⋅ Boyang Cai ⋅ Minghao Lai ⋅ Sitong Zhuang ⋅ Tao Lin ⋅ Yan Qin ⋅ Yixuan Ye ⋅ Jiaming Liang ⋅ Renjing Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 582
Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation
Tairan He ⋅ Zi Wang ⋅ Haoru Xue ⋅ Qingwei Ben ⋅ Zhengyi Luo ⋅ Wenli Xiao ⋅ Ye Yuan ⋅ Xingye Da ⋅ Fernando Castañeda ⋅ Shankar Sastry ⋅ Changliu Liu ⋅ Guanya Shi ⋅ Jim Fan ⋅ Yuke Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 583
Contact-Aware Neural Dynamics
Changwei Jing ⋅ Jai Krishna Bandi ⋅ Jianglong Ye ⋅ Yan Duan ⋅ Pieter Abbeel ⋅ Xiaolong Wang ⋅ Sha Yi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 584
AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
Lei Xiao ⋅ Jifeng Li ⋅ Juntao Gao ⋅ Feiyang Ye ⋅ Yan Jin ⋅ Jingjing Qian ⋅ Jing Zhang ⋅ Yong Wu ⋅ Xiaoyuan Yu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 585
UAST: Unified Active Search and Tracking for Arbitrary Targets with UAVs
Liang Qin ⋅ Min Wang ⋅ Xingyu Lu ⋅ Aowen Qiu ⋅ Wengang Zhou ⋅ Houqiang Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 586
SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
Chaojun Ni ⋅ Chen Cheng ⋅ Xiaofeng Wang ⋅ Zheng Zhu ⋅ Wenzhao Zheng ⋅ Boyuan Wang ⋅ Tianrun Chen ⋅ Guosheng Zhao ⋅ Haoyun Li ⋅ Zhehao Dong ⋅ Qiang Zhang ⋅ Yun Ye ⋅ Yang Wang ⋅ Guan Huang ⋅ Wenjun Mei
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 587
Visual-RRT: Finding Paths toward Visual-Goals via Differentiable Rendering
Sebin Lee ⋅ Jumin Lee ⋅ Taeyeon Kim ⋅ Youngju Na ⋅ Woobin Im ⋅ Sung-Eui Yoon
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 588
Cross-Hand Latent Representation for Vision-Language-Action Models
Guangqi Jiang ⋅ Yutong Liang ⋅ Jianglong Ye ⋅ Jia-Yang Huang ⋅ Changwei Jing ⋅ Yan Duan ⋅ Pieter Abbeel ⋅ Xiaolong Wang ⋅ Xueyan Zou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 589
Beyond Success: Refining Elegant Robot Manipulation from Mixed-Quality Data via Just-in-Time Intervention
Yanbo Mao ⋅ Jianlong Fu ⋅ Ruoxuan Zhang ⋅ Hongxia Xie ⋅ Meibao Yao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 590
Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
Jiude Wei ⋅ Yuxuan Li ⋅ Cewu Lu ⋅ Jianhua Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 591
GeoPredict: Leveraging Predictive Kinematics and 3D Gaussian Geometry for Precise VLA Manipulation
Jingjing Qian ⋅ Boyao Han ⋅ Chen Shi ⋅ Lei Xiao ⋅ Long Yang ⋅ Shaoshuai Shi ⋅ Li Jiang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 592
From Manuals to Actions: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation
Chenyang Gu ⋅ Jiaming Liu ⋅ Hao Chen ⋅ Runzhong Huang ⋅ Qingpo Wuwu ⋅ Xiaoqi Li ⋅ Zhuoyang Liu ⋅ Ying Li ⋅ Ray Zhang ⋅ Peng Jia ⋅ Pheng-Ann Heng ⋅ Shanghang Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 593
Real-World Point Tracking with Verifier-Guided Pseudo-Labeling
Görkay Aydemir ⋅ Fatma Güney ⋅ Weidi Xie
[ Slides [ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 594
Rethinking Occlusion Modeling for UAV Tracking
Jian Zhang ⋅ Xincheng Yu ⋅ Yi Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 595
Adaptive Capacity Autoregressive Visual Tracking
Tong Lin ⋅ Yifan Bai ⋅ Shiyi Liang ⋅ Ruigang Niu ⋅ Xing Wei
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 596
Spatio-Temporal Conditional Denoising Transformer for Modality-Missing RGBT Tracking
Andong Lu ⋅ Ziyi Zha ⋅ Jiandong Jin ⋅ Shihao Li ⋅ Chenglong Li ⋅ Jin Tang ⋅ Bin Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 597
Breaking Smooth-Motion Assumptions: A UAV Benchmark for Multi-Object Tracking in Complex and Adverse Conditions
Jingtao Ye ⋅ Kexin Zhang ⋅ Xunchi Ma ⋅ Johann Li ⋅ Guangming Zhu ⋅ Peiyi Shen ⋅ Linhua Jiang ⋅ Xiangdong Zhang ⋅ Liang Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 598
TrackMAE: Video Representation Learning via Track Mask and Predict
Renaud Vandeghen ⋅ Fida Mohammad Thoker ⋅ Marc Van Droogenbroeck ⋅ Bernard Ghanem
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 599
Dual-branch Distilled Transformer for Efficient Asymmetric UAV Tracking
Hongtao Yang ⋅ Bineng Zhong ⋅ Qihua Liang ⋅ Yaozong Zheng ⋅ Xiantao Hu ⋅ Yuanliang Xue ⋅ Shuxiang Song
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 600
Multi-view Crowd Tracking Transformer with View-Ground Interactions Under Large Real-World Scenes
Qi Zhang ⋅ Jixuan Chen ⋅ Zhang Kaiyi ⋅ Xinquan Yu ⋅ Antoni B. Chan ⋅ Hui Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 601
Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers
Cris Claessens ⋅ Christiaan Viviers ⋅ Giacomo D'Amicantonio ⋅ Egor Bondarev ⋅ Fons van der Sommen
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 602
MuViT: Multi-Resolution Vision Transformers for Learning Across Scales in Microscopy
Albert Dominguez Mantes ⋅ Gioele La Manno ⋅ Martin Weigert
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 603
SemVideo: Reconstructs What You Watch from Brain Activity via Hierarchical Semantic Guidance
Minghan Yang ⋅ LAN YANG ⋅ Ke Li ⋅ Honggang Zhang ⋅ Kaiyue Pang ⋅ Yi-Zhe Song
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 604
Multimodal Causality-Driven Representation Learning for Generalizable Medical Image Segmentation
XUSHENG LIANG ⋅ Lihua Zhou ⋅ Nianxin Li ⋅ miao xu ⋅ Ziyang Song ⋅ Dong Yi ⋅ Jinlin Wu ⋅ Jiawei Ma ⋅ Hongbin Liu ⋅ Zhen Lei ⋅ Jiebo Luo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 605
Simple Agents Outperform Experts in Biomedical Imaging Workflow Optimization
Xuefei Wang ⋅ Kai A. Horstmann ⋅ Ethan Lin ⋅ Jonathan Chen ⋅ Alexander Farhang ⋅ Sophia Stiles ⋅ Atharva Sehgal ⋅ Jonathan Light ⋅ David Valen ⋅ Yisong Yue ⋅ Jennifer J. Sun
[ Slides
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 606
TopoSlide: Topologically-Informed Histopathology Whole Slide Image Representation Learning
Shahira Abousamra ⋅ Asmita Sood ⋅ Sylvia Plevritis
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 607
Beyond the Static-World: Lifelong Learning for All-in-One Medical Image Restoration
Shihao Shan ⋅ Hongying Liu ⋅ Fanhua Shang ⋅ Liang Wan ⋅ Jingjing Deng
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 608
Hyperbolic Relational Prompts for Intersectional Fairness in Medical VLMs
Jiayu Qian ⋅ Zongxian Yang ⋅ Guanxing Chen ⋅ Pengwei Hu ⋅ KC Tan ⋅ Yan Wang ⋅ Yu-An Huang ⋅ Zhi-An Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 609
RNED: Rotary Number Encoding and Decoding for Quantitative Medical VLM Analysis
Fengbei Liu ⋅ Sunwoo Kwak ⋅ Nusrat Binta Nizam ⋅ Ilan Richter ⋅ Ashley Beecy ⋅ Jayant Raikhelkar ⋅ Deborah Estrin ⋅ Mert Sabuncu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 610
MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding
Basit Alawode ⋅ Arif Mahmood ⋅ Muaz Radi ⋅ Shahad Albastaki ⋅ Asim Khan ⋅ Muhammad Bilal ⋅ Moshira Ali Abdalla ⋅ Mohammed Bennamoun ⋅ Sajid Javed
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 611
Learning Generalizable 3D Medical Image Representations from Mask-Guided Self-Supervision
Yunhe Gao ⋅ Yabin Zhang ⋅ Chong Wang ⋅ Jiaming Liu ⋅ Maya Varma ⋅ Jean-Benoit Delbrouck ⋅ Akshay Chaudhari ⋅ Curtis Langlotz
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 612
BiOTPrompt: Bidirectional Optimal Transport Guided Prompting for Disease Evolution-aware Radiology Report Generation
Tengfei Liu ⋅ Yijian Fan ⋅ Boyue Wang ⋅ Yongli Hu ⋅ Mingjie Li ⋅ Jinghua Li ⋅ Junbin Gao ⋅ Xiaojun Chang ⋅ Zhihui Li ⋅ Baocai Yin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 613
Learning to See Through a Baby’s Eyes: Early Visual Diets Enable Robust Visual Intelligence in Humans and Machines
Yusen Cai ⋅ Qing Lin ⋅ BHARGAVA SATYA NUNNA ⋅ Mengmi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 614
UDAPose: Unsupervised Domain Adaptation for Low-Light Human Pose Estimation
Haopeng Chen ⋅ Yihao Ai ⋅ Kabeen Kim ⋅ Robby T. Tan ⋅ Yixin Chen ⋅ Bo Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 615
Enhancing Accuracy of Uncertainty Estimation in Appearance-based Gaze Tracking with Probabilistic Evaluation and Calibration
Qiaojie Zheng ⋅ Jiucai Zhang ⋅ Amy Zhang ⋅ Xiaoli Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 616
SCAPO: Self-Supervised Category-Level Articulated Pose Estimation from a Single 3D Observation
Can Zhang ⋅ Gim Hee Lee
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 617
Composite-Attribute Person Re-Identification via Pose-Guided Disentanglement
Kartik Patwari ⋅ Noranart Vesdapunt ⋅ Chien-Yi Wang ⋅ Dawei Li ⋅ Cong Phuoc Huynh ⋅ Ning Zhou ⋅ Chen-Nee Chuah ⋅ Kah Fu Fu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 618
Representing 3D Faces with Learnable B-Spline Volumes
Prashanth Chandran ⋅ Daoye Wang ⋅ Timo Bolkart
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 619
RHINO: Reconstructing Human Interactions with Novel Objects from Monocular Videos
Lixin Xue ⋅ Chengwei Zheng ⋅ Georgios Paschalidis ⋅ Chen Guo ⋅ Manuel Kaufmann ⋅ Juan Zarate ⋅ Dimitrios Tzionas
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 620
HumanBA: Human-Aware Bundle Adjustment via Global Human-Camera Decoupling
Tanuj Sur ⋅ Tanuj Sur ⋅ Tze Ho Elden Tse ⋅ Angela Yao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 621
HamiPose: Hamiltonian Optimization for Unsupervised Domain Adaptive Pose Estimation
Jiawen Li ⋅ Fei Jiang ⋅ Dandan Zhu ⋅ Aimin Zhou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 622
KASALv2: Fully Automatic 3D Rotational Symmetry Classification and Axis Localization
Mengxin Zhang ⋅ Yulin Wang ⋅ Chen LUO ⋅ Yongzhe Li ⋅ Yijun Zhou
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 623
AnyLift: Scaling Motion Reconstruction from Internet Videos via 2D Diffusion
Hongjie Li ⋅ Heng Yu ⋅ Jiaman Li ⋅ Hong-Xing Yu ⋅ Ehsan Adeli ⋅ C. Karen Liu ⋅ Jiajun Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 624
Active Inference for Micro-Gesture Recognition: EFE-Guided Temporal Sampling and Adaptive Learning
Weijia Feng ⋅ Jingyu Yang ⋅ Ruojia Zhang ⋅ Fengtao Sun ⋅ Qian Gao ⋅ Chenyang Wang ⋅ tongtong Su ⋅ Jia Guo ⋅ Xiaobai Li ⋅ Minglai Shao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 625
ArtPro: Self-Supervised Articulated Object Reconstruction with Adaptive Integration of Mobility Proposals
Xuelu Li ⋅ Zhaonan Wang ⋅ Xiaogang Wang ⋅ Lei Wu ⋅ Manyi Li ⋅ Changhe Tu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 626
Similarity-Consistent Likelihood Diffusion enables Hidden Person Detection from Wall Reflections
Zhiwen Zheng ⋅ Hao Zhou ⋅ Huiyu Qi ⋅ Zhao Huang ⋅ Guangyuan Zhang ⋅ Shaowei Jiang ⋅ Wenwen Tang ⋅ Bin Yang ⋅ Jin Liu ⋅ Xiaoshuai Zhang ⋅ Xingru Huang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 627
VLM-Guided Group Preference Alignment for Diffusion-based Human Mesh Recovery
Wenhao Shen ⋅ Hao Wang ⋅ Wanqi Yin ⋅ Fayao Liu ⋅ Xulei Yang ⋅ Chao Liang ⋅ Zhongang Cai ⋅ Guosheng Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 628
Occluded Human Body Capture with Frequency Domain Denoising Prior
Buzhen Huang ⋅ Chongyang Xu ⋅ Wentao Tang ⋅ Yuan Shu ⋅ Jingyi Ju ⋅ Binghui Zuo ⋅ Yangang Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 629
ResiHMR: Residual-Limb Aware Single-Image 3D Human Mesh Recovery for Individuals with Limb Loss
Jiaying Ying ⋅ Heming Du ⋅ Kaihao Zhang ⋅ Sean M. Tweedy ⋅ Xin Yu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 630
OnlineHMR: Video-based Online World-Grounded Human Mesh Recovery
Yiwen Zhao ⋅ Ce Zheng ⋅ Yufu Wang ⋅ Hsueh-Han Daniel Yang ⋅ Liting Wen ⋅ László A. Jeni
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 631
MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer
Zenghao Chai ⋅ Chen Tang ⋅ Yongkang Wong ⋅ Xulei Yang ⋅ Mohan Kankanhalli
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 632
Exploring Adaptive Masked Reconstruction for Self-Supervised Skeleton-Based Action Recognition
Shengkai Sun ⋅ Zhiyong Cheng ⋅ Zefan Zhang ⋅ Jianfeng Dong ⋅ Zhihui Li ⋅ Meng Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 633
DFD-HR: Generalizable Deepfake Detection via Hierarchical Routing Learning
JIAMU SUN ⋅ Zhiyuan Yan ⋅ Ke-Yue Zhang ⋅ Taiping Yao ⋅ Shouhong Ding
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 634
MGDHand: Multi-Granularity Prior-to-Inertial Distillation Framework for Sequential 3D Hand Pose Estimation from Sparse IMUs
Xinyi Wang ⋅ Pengfei Ren ⋅ HaoYang ZHANG ⋅ Hanling Zhan ⋅ Yingxi Li ⋅ Liang Xie ⋅ Yue Gao ⋅ Erwei Yin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 635
CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction
Xianghui Xie ⋅ Bowen Wen ⋅ Yan Chang ⋅ Hesam Rabeti ⋅ Jiefeng Li ⋅ Ye Yuan ⋅ Gerard Pons-Moll ⋅ Stan Birchfield
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 636
E-3DPSM: A State Machine for Event-based Egocentric 3D Human Pose Estimation
Mayur Deshmukh ⋅ Hiroyasu Akada ⋅ Helge Rhodin ⋅ Christian Theobalt ⋅ Vladislav Golyanik
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 637
Bézier Degradation Modeling for LiDAR-based Human Motion Capture
Xiaoqi An ⋅ Lin Zhao ⋅ Jun Li ⋅ Chen Gong ⋅ Jian Yang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 638
UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass
Mengfei Li ⋅ Peng Li ⋅ Zheng Zhang ⋅ Jiahao Lu ⋅ Chengfeng Zhao ⋅ Wei Xue ⋅ Qifeng Liu ⋅ Sida Peng ⋅ Wenxiao ZHANG ⋅ Wenhan Luo ⋅ Yuan Liu ⋅ Yike Guo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 639
Illumination-Consistent Human-Scene Reconstruction from Monocular Video
Rongbin Zheng ⋅ Wensheng Li ⋅ Lingzhe Zeng ⋅ Dong Wang ⋅ Chengying Gao
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 640
Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution
Hongsong Wang ⋅ Renxi Cheng ⋅ Chaolei Han ⋅ Jie Gui
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 641
Agent4FaceForgery: Multi-Agent LLM Framework for Realistic Face Forgery Detection
Yingxin Lai ⋅ Zitong YU ⋅ Jun Wang ⋅ Linlin Shen ⋅ Yong Xu ⋅ Xiaochun Cao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 642
Enabling Supervised Learning of Generative Signatures for Generalized Synthetic Image Detection
Jianwei Fei ⋅ Yunshu Dai ⋅ Xiaoyu Zhou ⋅ Zhihua Xia ⋅ Alessandro Piva
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 643
DiffusionFF: A Diffusion-based Framework for Joint Face Forgery Detection and Fine-Grained Artifact Localization
Siran Peng ⋅ Haoyuan Zhang ⋅ Li Gao ⋅ Tianshuo Zhang ⋅ Xiangyu Zhu ⋅ Bao Li ⋅ Weisong Zhao ⋅ Zhen Lei
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 644
All in One: Unifying Deepfake Detection, Tampering Localization, and Source Tracing with a Robust Landmark-Identity Watermark
Junjiang Wu ⋅ Liejun Wang ⋅ Zhiqing Guo
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 645
Towards an Incremental Unified Multimodal Anomaly Detection: Augmenting Multimodal Denoising From an Information Bottleneck Perspective
Kaifang Long ⋅ Lianbo Ma ⋅ Jiaqi Liu ⋅ liming liu ⋅ Guoyang Xie
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 646
AG-VAS: Anchor-Guided Zero-Shot Visual Anomaly Segmentation with Large Multimodal Models
Zhen Qu ⋅ Xian Tao ⋅ Xiaoyi Bao ⋅ Dingrong Wang ⋅ ShiChen Qu ⋅ Zhengtao Zhang ⋅ Xingang Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 647
Dual-Prototype-Guided Multi-task Learning for Unsupervised Anomaly Detection and Classification
Qianhao Luo ⋅ Jiajia Mi ⋅ Mingtao Yan ⋅ JingSheng Liu ⋅ ShuYang Pang ⋅ Weiling Li
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 648
The Road Less Seen: Segment Exploration for Weakly Supervised Video Anomaly Detection
Anusha Achaya ⋅ Hitesh Sapkota ⋅ Qi Yu ⋅ Xumin Liu
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 649
Omni-AD: A Large-scale and Versatile Benchmark for Industrial Anomaly Detection
Dahu Shi ⋅ Chengshen He ⋅ Shaochen Zhang ⋅ Bo Qian ⋅ Xiaochen Quan ⋅ Wencong Zhang ⋅ Xing Wei
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 650
Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
Kaiqiang Li ⋅ Gang Li ⋅ Mingle Zhou ⋅ Min Li ⋅ Delong Han ⋅ Jin Wan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 651
Complementary Prototype Mapping for Efficient Multimodal Anomaly Detection
Yuan Zhao ⋅ Zhang xiaoqin to Xiaoqin Zhang ⋅ Huchuan Lu ⋅ Lihe Zhang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 652
LiDAS: Lighting-driven Dynamic Active Sensing for Nighttime Perception
Simon de Moreau ⋅ Andrei Bursuc ⋅ Hafid EL IDRISSI ⋅ Fabien Moutarde
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 653
Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction
Chengxin Lv ⋅ Yihui Li ⋅ Hongyu Yang ⋅ Yunhong Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 654
OpenVO: Open-World Visual Odometry with Temporal Dynamics Awareness
Phuc Nguyen ⋅ Anh N Nhu ⋅ Ming C. Lin
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 655
An Instance-Centric Panoptic Occupancy Prediction Benchmark for Autonomous Driving
Yi Feng ⋅ Junwu E ⋅ Zizhan Guo ⋅ Yu Ma ⋅ Hanli Wang ⋅ Rui Fan
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 656
OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera
Hao Shi ⋅ Ze Wang ⋅ Shangwei Guo ⋅ Mengfei Duan ⋅ Song Wang ⋅ Teng Chen ⋅ Kailun Yang ⋅ Lin Wang ⋅ Kaiwei Wang
[ Poster
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 657
ProOOD: Prototype-Guided Out-of-Distribution 3D Occupancy Prediction
Yuheng Zhang ⋅ Mengfei Duan ⋅ Kunyu Peng ⋅ Yuhang Wang ⋅ Di Wen ⋅ Danda Paudel ⋅ Luc Van Gool ⋅ Kailun Yang
[ Poster