Skip to yearly menu bar Skip to main content


(666 events)   Timezone:  
Show all
The 2026 schedule is still incomplete
Toggle Poster Visibility
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 1
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space
Huijie Liu ⋅ Shuhao Cui ⋅ Haoxiang Cao ⋅ Shuai Ma ⋅ Kai Wu ⋅ Guoliang Kang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 2
Adversarial Style Optimization: Enhancing VLM Jailbreaks by GRPO-based Stylistic Triggers Optimization
Bingjun Luo ⋅ Jialin Guo ⋅ Yue Yao ⋅ Xinpeng Ding
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 3
ANTS: Adaptive Negative Textual Space Shaping for OOD Detection via Test-Time MLLM Understanding and Reasoning
Wenjie Zhu ⋅ Yabin Zhang ⋅ Xin Jin ⋅ Wenjun Zeng ⋅ Lei Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 4
ARGUS: Defending Against Multimodal Indirect Prompt Injection via Steering Instruction-Following Behavior
Weikai Lu ⋅ Ziqian Zeng ⋅ Kehua Zhang ⋅ Haoran Li ⋅ Huiping Zhuang ⋅ Ruidong Wang ⋅ Cen Chen ⋅ Hao Peng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 5
TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models
Jiaming He ⋅ Guanyu Hou ⋅ Hongwei Li ⋅ Zhicong Huang ⋅ Kangjie Chen ⋅ Yi Yu ⋅ Wenbo Jiang ⋅ Guowen Xu ⋅ Tianwei Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 6
ViT^3: Unlocking Test-Time Training in Vision
Dongchen Han ⋅ Yining Li ⋅ Tianyu Li ⋅ Zixuan Cao ⋅ Ziming Wang ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Gao Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 7
Black-box Membership Inference Attacks on the Pre-training Data of Image-generation Models
Tao Qi ⋅ Huili Wang ⋅ Yuanhong Huang ⋅ Wendan Wang ⋅ Lianchao Zhao ⋅ Jinrui Wang ⋅ Zichen Qin ⋅ Shangguang Wang ⋅ Yongfeng Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 8
Data Leakage Detection and De-duplication in Large Scale Geospatial Image Datasets
Yeshwanth Kumar Adimoolam ⋅ Charalambos Poullis ⋅ Melinos Averkiou
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 9
RAVEN: Erasing Invisible Watermarks via Novel View Synthesis
Fahad Shamshad ⋅ Nils Lukas ⋅ Karthik Nandakumar
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 10
LDP-Slicing: Local Differential Privacy for Images via Randomized Bit-Plane Slicing
Yuanming Cao ⋅ Chengqi Li ⋅ Wenbo He
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 11
NOWA: Null-space Optical Watermark for Invisible Capture Fingerprinting and Tamper Localization
Edwin Vargas ⋅ Jhon Lopez ⋅ Henry Arguello ⋅ Ashok Veeraraghavan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 12
Revisiting Geometric Obfuscation with Dual Convergent Lines for Privacy-Preserving Image Queries in Visual Localization
Jeonggon Kim ⋅ Heejoon Moon ⋅ Je Hyeong Hong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 13
Advancing Image Classification with Discrete Diffusion Classification Modeling
Omer Belhasin ⋅ Shelly Golan ⋅ Ran El-Yaniv ⋅ Michael Elad
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 14
Does YOLO Really Need to See Every Training Image in Every Epoch?
Xingxing Xie ⋅ Jiahua Dong ⋅ Junwei Han ⋅ Gong Cheng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 15
Fine-grained Image Aesthetic Assessment: Learning Discriminative Scores from Relative Ranks
Zhichao Yang ⋅ Jianjie Wang ⋅ Zhixianhe Zhang ⋅ Pangu Xie ⋅ Xiangfei Sheng ⋅ Pengfei Chen ⋅ Leida Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 16
NuWa: Deriving Lightweight Class-Specific Vision Transformers for Edge Devices
Ziteng Wei ⋅ Qiang He ⋅ Bing Li ⋅ Feifei Chen ⋅ Hai Jin ⋅ Yun Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 17
Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species
Jinyu Xu ⋅ Tianqi Hu ⋅ Xiaonan Hu ⋅ Letian Zhou ⋅ Songliang Cao ⋅ Meng Zhang ⋅ Hao Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 18
Rethinking Dataset Distillation: Hard Truths about Soft Labels
Priyam Dey ⋅ Aditya Sahdev ⋅ Sunny Bhati ⋅ Konda Reddy Mopuri ⋅ R. Venkatesh Babu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 19
Customized Fusion: A Closed-Loop Dynamic Network for Adaptive Multi-Task-Aware Infrared-Visible Image Fusion
Zengyi Yang ⋅ Yu Liu ⋅ Juan Cheng ⋅ Zhiqin Zhu ⋅ Yafei Zhang ⋅ Huafeng Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 20
Dual Band Thermal Videography: Separating Time-Varying Reflection and Emission Near Ambient Conditions
Sriram Narayanan ⋅ Mani Ramanagopal ⋅ Srinivasa G. Narasimhan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 21
MetaSpectra+: A Compact Broadband Metasurface Camera for Snapshot Hyperspectral+ Imaging
Yuxuan Liu ⋅ Wei Xu ⋅ Qi Guo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 22
Spectrum from Defocus: Fast Spectral Imaging with Chromatic Focal Stack
M. Kerem Aydin ⋅ Yi-Chun Hung ⋅ Jaclyn Pytlarz ⋅ Qi Guo ⋅ Emma Alexander
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 23
Towards Photorealistic and Efficient Bokeh Rendering via Diffusion Framework
Linxiao Shi ⋅ Siming Zheng ⋅ Zerong Wang ⋅ Hao Zhang ⋅ Jinwei Chen ⋅ Bo Li ⋅ Shifeng Chen ⋅ Peng-Tao Jiang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 24
UnReflectAnything: RGB-Only Highlight Removal by Rendering Synthetic Specular Supervision
Alberto Rota ⋅ Mert Kiray ⋅ Mert Asim Karaoglu ⋅ Patrick Ruhkamp ⋅ Elena De Momi ⋅ Nassir Navab ⋅ Benjamin Busam
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 25
AVGGT: Rethinking Global Attention for Accelerating VGGT
Xianbing Sun ⋅ Zhikai Zhu ⋅ Zhengyu Lou ⋅ Bo Yang ⋅ Jinyang Tang ⋅ Liqing Zhang ⋅ He Wang ⋅ Jianfu Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 26
ManifoldNeuS: Manifold-aware View Optimizability for Pose-Free Neural Surface Reconstruction
Xinxin Liu ⋅ Xue Wang ⋅ Guoqing Zhou ⋅ Qing Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 27
LongStream: Long-Sequence Streaming Autoregressive Visual Geometry
Chong Cheng ⋅ Xianda Chen ⋅ Tao Xie ⋅ Wei Yin ⋅ Weiqiang Ren ⋅ Qian Zhang ⋅ Xiaoyang Guo ⋅ Hao Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 28
RPGFusion: 4D Radar Prior-Guided Multi-Modal Fusion for 3D Detection
Xin Qiu ⋅ Wenjie Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 29
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
Chenguo Lin ⋅ Yuchen Lin ⋅ Panwang Pan ⋅ Yifan Yu ⋅ Tao Hu ⋅ Honglei Yan ⋅ Katerina Fragkiadaki ⋅ Yadong Mu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 30
JRM: Joint Reconstruction Model for Multiple Objects without Alignment
Qirui Wu ⋅ Mohd Yawar Nihal Siddiqui ⋅ Duncan Frost ⋅ Samir Aroudj ⋅ Armen Avetisyan ⋅ Richard Newcombe ⋅ Angel Xuan Chang ⋅ Jakob Engel ⋅ Henry Howard-Jenkins
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 31
Inferring Compositional 4D Scenes without Ever Seeing One
Ahmet Berke Gökmen ⋅ Ajad Chhatkuli ⋅ Luc Van Gool ⋅ Danda Paudel
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 32
FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation
Chenhan Jiang ⋅ Yu Chen ⋅ Qingwen Zhang ⋅ Jifei Song ⋅ Songcen Xu ⋅ Dit-Yan Yeung ⋅ Jiankang Deng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 33
Complet4R: Geometric Complete 4D Reconstruction
Weibang Wang ⋅ Kenan Li ⋅ Zhuoguang Chen ⋅ Yijun Yuan ⋅ Hang Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 34
Unblur-SLAM: Dense Neural SLAM for Blurry Inputs
Qi Zhang ⋅ Denis Rozumny ⋅ Francesco Girlanda ⋅ Sezer Karaoglu ⋅ Marc Pollefeys ⋅ Theo Gevers ⋅ Martin R. Oswald
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 35
Learning Compact 3D Representations from Feed-Forward Novel View Synthesis
Honggyu An ⋅ Jaewoo Jung ⋅ Mungyeom Kim ⋅ Chaehyun Kim ⋅ Minkyeong Jeon ⋅ Jisang Han ⋅ Kazumi Fukuda ⋅ Takuya Narihira ⋅ HYUNAH KO ⋅ Junsu Kim ⋅ Sunghwan Hong ⋅ Yuki Mitsufuji ⋅ Seungryong Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 36
Fast Spatial Tracking with Visual Geometry Transformer
Chengjie Huang ⋅ GUILE WU ⋅ Dongfeng Bai ⋅ Bingbing Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 37
How Much 3D Do Video Foundation Models Encode?
Zixuan Huang ⋅ Xiang Li ⋅ Zhaoyang Lv ⋅ James M.
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 38
MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes
Kehua Chen ⋅ Tianlu Mao ⋅ Xinzhu Ma ⋅ Hao Jiang ⋅ Zehao Li ⋅ Zihan Liu ⋅ Shuqin Gao ⋅ Honglong Zhao ⋅ Feng Dai ⋅ Yucheng Zhang ⋅ Zhaoqi Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 39
RnG: A Unified Transformer for Complete 3D Modeling from Partial Observations
Mochu Xiang ⋅ Zhelun Shen ⋅ Xuesong li ⋅ Jiahui Ren ⋅ Jing Zhang ⋅ Chen Zhao ⋅ Shanshan Liu ⋅ Haocheng Feng ⋅ Jingdong Wang ⋅ Yuchao Dai
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 40
Long-Tail Internet Photo Reconstruction
Yuan Li ⋅ Yuanbo Xiangli ⋅ Hadar Averbuch-Elor ⋅ Noah Snavely ⋅ Ruojin Cai
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 41
Emergent Outlier View Rejection in Visual Geometry Grounded Transformers
Jisang Han ⋅ Sunghwan Hong ⋅ Jaewoo Jung ⋅ Wooseok Jang ⋅ Honggyu An ⋅ Qianqian Wang ⋅ Seungryong Kim ⋅ Chen Feng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 42
Flow3r: Factored Flow Prediction for Scalable Visual Geometry Learning
Zhongxiao Cong ⋅ Qitao Zhao ⋅ Minsik Jeon ⋅ Shubham Tulsiani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 43
MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation
Yuta Oshima ⋅ Daiki Miyake ⋅ Kohsei Matsutani ⋅ Yusuke Iwasawa ⋅ Masahiro Suzuki ⋅ Yutaka Matsuo ⋅ Hiroki Furuta
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 44
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
Yihao Meng ⋅ Hao Ouyang ⋅ Yue Yu ⋅ Qiuyu Wang ⋅ Wen Wang ⋅ Ka Leong Cheng ⋅ Hanlin Wang ⋅ Shuailei Ma ⋅ Yixuan LI ⋅ Chen Cheng ⋅ Yanhong Zeng ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Huamin Qu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 45
Design Your Ad: Personalized Advertising Image and Text Generation with Unified Autoregressive Models
Yexing Xu ⋅ Wei Feng ⋅ Shen Zhang ⋅ Haohan Wang ⋅ Yuxin Qin ⋅ Yaoyu Li ⋅ Ao Ma ⋅ Yuhao Luo ⋅ Lu Wang ⋅ Xudong Ren ⋅ Haoran Wang ⋅ Run Ling ⋅ Zheng Zhang ⋅ Jingjing Lv ⋅ Junjie Shen ⋅ Ching Law ⋅ Longguang Wang ⋅ Yulan Guo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 46
SketchDeco: Training-Free Latent Composition for Precise Sketch Colourisation
Chaitat Utintu ⋅ Yi-Zhe Song
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 47
ConsistCompose: Unified Multimodal Layout Control for Image Composition
Xuanke Shi ⋅ Boxuan Li ⋅ Xiaoyang Han ⋅ Zhongang Cai ⋅ Lei Yang ⋅ Quan Wang ⋅ Dahua Lin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 48
A Training-Free Style-Personalization via SVD-Based Feature Decomposition
Kyoungmin Lee ⋅ Jihun Park ⋅ Jongmin Gim ⋅ Wonhyeok Choi ⋅ Kyumin Hwang ⋅ Jaeyeul Kim ⋅ Sunghoon Im
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 49
Beyond Patches: Global-aware Autoregressive Model for Multimodal Few-Shot Font Generation
Haonan Cai ⋅ Yuxuan Luo ⋅ Zhouhui Lian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 50
ImageRAGTurbo: Towards One-step Text-to-Image Generation with Retrieval-Augmented Diffusion Models
Peijie Qiu ⋅ Hariharan Ramshankar ⋅ Arnau Ramisa ⋅ Amit C C ⋅ Rene Vidal ⋅ Vamsi Salaka ⋅ Rahul Bhagat
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 51
OmniSonic: Towards Universal and Holistic Audio Generation from Video and Text
Weiguo Pian ⋅ Saksham Singh Kushwaha ⋅ Zhimin Chen ⋅ Shijian Deng ⋅ Kai Wang ⋅ Yunhui Guo ⋅ Yapeng Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 52
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Shubhankar Borse ⋅ Phuc Pham ⋅ Farzad Farhadzadeh ⋅ Seokeon Choi ⋅ Phong Nguyen ⋅ Anh Tran ⋅ Sungrack Yun ⋅ Munawar Hayat ⋅ Fatih Porikli
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 53
Curriculum Group Policy Optimization: Adaptive Sampling for Unleashing the Potential of Text-to-Image Generation
Baoteng Li ⋅ Xianghao Zang ⋅ Xinran Wang ⋅ Xiangyu Na ⋅ Zhixiang He ⋅ Hao Sun ⋅ Chi Zhang ⋅ Zhongjiang He ⋅ Tianwei Cao ⋅ Kongming Liang ⋅ Zhanyu Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 54
SplitFlux: Learning to Decouple Content and Style from a Single Image
Yitong Yang ⋅ Yinglin Wang ⋅ Changshuo Wang ⋅ Yongjun Zhang ⋅ Ziyang Chen ⋅ Shuting He
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 55
FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation
Wuyang Luo ⋅ Chengkaitan Chengkaitan to Chengkai Tan ⋅ Chang Ge ⋅ Binye Hong ⋅ Su Yang ⋅ Yongjiu Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 56
EmoStyle: Emotion-Driven Image Stylization
Jingyuan Yang ⋅ Zihuan Bai ⋅ Hui Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 57
Text-Image Conditioned 3D Generation
Jiazhong Cen ⋅ Jiemin Fang ⋅ Sikuang Li ⋅ Guanjun Wu ⋅ Chen Yang ⋅ Taoran Yi ⋅ Zanwei Zhou ⋅ zhikuan bao ⋅ Lingxi Xie ⋅ Wei Shen ⋅ Qi Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 58
IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator–Critic Framework
Feiyu Wang ⋅ Jiayuan Yang ⋅ Zhiyuan Zhao ⋅ Da Zhang ⋅ Bingyu Li ⋅ Peng Liu ⋅ Junyu Gao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 59
AnyDoc: Enhancing Document Generation via Large-Scale HTML/CSS Data Synthesis and Height-Aware Reinforcement Optimization
Jiawei Lin ⋅ Wanrong Zhu ⋅ Vlad I Morariu ⋅ Christopher Tensmeyer
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 60
Reasoning Diffusion for Unpaired Test Time Out-of-distribution Text-Image to Video Generation
Zirui Pan ⋅ Xin Wang ⋅ Yipeng Zhang ⋅ Hong Chen ⋅ Kecheng Zheng ⋅ Wenwu Zhu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 61
SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation
Sashuai zhou ⋅ Qiang Zhou ⋅ Ma Junpeng ⋅ Yue Cao ⋅ Ruofan Hu ⋅ Ziang Zhang ⋅ Xiaoda Yang ⋅ Zhibin Wang ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Zhou Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 62
STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative
Peixuan Zhang ⋅ Zijian Jia ⋅ Kaiqi Liu ⋅ Shuchen Weng ⋅ Si Li ⋅ Boxin Shi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 63
MTA: Multimodal Task Alignment for BEV Perception and Captioning
Yunsheng Ma ⋅ Burhan Yaman ⋅ Xin Ye ⋅ Jingru Luo ⋅ Feng Tao ⋅ Abhirup Mallik ⋅ Ziran Wang ⋅ Liu Ren
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 64
β-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
Fatimah Zohra ⋅ Chen Zhao ⋅ Hani Itani ⋅ Bernard Ghanem
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 65
SafeRoPE: Risk-specific Head-wise Embedding Rotation for Safe Generation in Rectified Flow Transformers
Xiang Yang ⋅ Feifei Li ⋅ Mi Zhang ⋅ Geng Hong ⋅ Xiaoyu You ⋅ Min Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 66
FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Alignment
Myunsoo Kim ⋅ Seong-Woong Shim ⋅ Byung-Jun Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 67
Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos
Yicheng Feng ⋅ Wanpeng Zhang ⋅ Ye Wang ⋅ Hao Luo ⋅ Haoqi Yuan ⋅ Sipeng Zheng ⋅ Zongqing Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 68
Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning
Kaichen He ⋅ Zihao Wang ⋅ Muyao Li ⋅ Anji Liu ⋅ Yitao Liang
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 69
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Yurun Chen ⋅ Xueyu Hu ⋅ Yuhan Liu ⋅ Ziqi Wang ⋅ Zeyi Liao ⋅ Lin Chen ⋅ Feng Wei ⋅ Yuxi qian ⋅ Bo Zheng ⋅ Keting Yin ⋅ Shengyu Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 70
EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models
Yiyang Fang ⋅ Wenke Huang ⋅ Pei Fu ⋅ Yihao Yang ⋅ Kehua Su ⋅ Zhenbo Luo ⋅ Jian Luan ⋅ Mang Ye
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 71
EvoGraph-R1: Self-Evolving Multimodal Knowledge Hypergraphs for Agentic Retrieval
Jiashi Lin ⋅ Changhong Jiang ⋅ Xiangru Lin ⋅ Ruifei Zhang ⋅ Xinyi Zhu ⋅ Jiyao Liu ⋅ Cheng Tang ⋅ Ye Du ⋅ Shujian Gao ⋅ Junzhi Ning ⋅ Lihao Liu ⋅ Ziyan Huang ⋅ Tianbin Li ⋅ Jin Ye ⋅ Junjun He
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 72
Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning
Haonan Jia ⋅ Shichao Dong ⋅ Xin Dong ⋅ Zenghui Sun ⋅ Jin Wang ⋅ Jinsong Lan ⋅ Xiaoyong Zhu ⋅ Bo Zheng ⋅ Kaifu Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 73
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models
Mark Endo ⋅ Serena Yeung
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 74
Stabilizing Feature Geometry in Noisy Pretrained Models for Robust Downstream Tasks
Quanyu Zhang ⋅ Zhongyi Han ⋅ Hao Sun ⋅ Yongshun Gong ⋅ Xiaoyan Wang ⋅ Yilong Yin ⋅ Shuo Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 75
Black-Box Domain Adaptation for Object Detection with Retention-Driven Knowledge Compression
Yuwu Lu ⋅ Chunzhi Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 76
Decoupled and Reusable Adaptation for Efficient Cross-Modal Transfer
Yajing Liu ⋅ Yumeng Zhang ⋅ Yue Si ⋅ Baojie Fan ⋅ Jiandong Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 77
Preference-Aligned LoRA Merging: Preserving Subspace Coverage and Addressing Directional Anisotropy
Wooseong Jeong ⋅ Wonyoung Lee ⋅ Kuk-Jin Yoon
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 78
Curvature-Aware Zeroth-Order Optimization for Memory-Efficient Test-Time Adaptation
Junming Zhang ⋅ Shuyu Yin ⋅ Peilin Liu ⋅ Rendong Ying ⋅ Fei Wen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 79
Label-Free Cross-Task LoRA Merging with Null-Space Compression
Wonyoung Lee ⋅ Wooseong Jeong ⋅ Kuk-Jin Yoon
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 80
Basis-Oriented Low-rank Transfer for Few-Shot and Test-Time Adaptation
Junghwan Park ⋅ Woojin Cho ⋅ Junhyuk Heo ⋅ Darongsae Kwon ⋅ Kookjin Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 81
GeCo: Geometry-Consistent Regularization for Domain Generalized Semantic Segmentation
Qi Zang ⋅ Dong Zhao ⋅ Nan Pu ⋅ Wenjing Li ⋅ Zhun Zhong ⋅ Meng Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 82
Event-based Motion Deblurring with Unpaired Data
Hoonhee Cho ⋅ Yuhwan Jeong ⋅ Kuk-Jin Yoon
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 83
Stable Spike: Dual Consistency Optimization via Bitwise AND Operations for Spiking Neural Networks
Yongqi Ding ⋅ Kunshan Yang ⋅ Linze Li ⋅ Yiyang Zhang ⋅ Mengmeng Jing ⋅ Lin Zuo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 84
Event-based Visual Deformation Measurement
Yuliang Wu ⋅ Wei Zhai ⋅ Yuxin Cui ⋅ Tiesong Zhao ⋅ Yang Cao ⋅ Zheng-Jun Zha
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 85
Bidirectional Cross-Modal Prompting for Event-Frame Asymmetric Stereo
Ninghui Xu ⋅ Fabio Tosi ⋅ Lihui Wang ⋅ Jiawei Han ⋅ Luca Bartolomei ⋅ Zhiting Yao ⋅ Matteo Poggi ⋅ Stefano Mattoccia
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 86
SpikeTrack: High-performance and Energy-efficient Event-Based Object Tracking with Spiking Neural Network
Yang Wang ⋅ Jiqing Zhang ⋅ Chuanyu Sun ⋅ Qianhui Liu ⋅ Huilin Ge ⋅ Ziqi Wei ⋅ Xin Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 87
Event Structural Valley: A Unified Theoretical and Practical Framework for Event Camera Autofocus
Xijie Xiang ⋅ Lin Zhu ⋅ Wei Zhang ⋅ Yonghong Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 88
Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios
Zhipeng Sui ⋅ Haiqing Hao ⋅ Weihua He ⋅ Seng-Hong Lee ⋅ Wenhui Wang
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 89
Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control
Zhe Li ⋅ Cheng Chi ⋅ Yangyang Wei ⋅ Boan Zhu ⋅ Tao Huang ⋅ Zhenguo Sun ⋅ Yibo Peng ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Fangzhou Liu ⋅ Chang Xu ⋅ Shanghang Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 90
CLaD: Planning with Grounded Foresight via Cross-Modal Latent Dynamics
Andrew Jeong ⋅ Jaemin Kim ⋅ Sebin Lee ⋅ Sung-Eui Yoon
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 91
InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy
Yang Tian ⋅ Yuyin Yang ⋅ Yiman Xie ⋅ Zetao Cai ⋅ Xu Shi ⋅ Ning Gao ⋅ Hangxu Liu ⋅ Xuekun Jiang ⋅ Zherui Qiu ⋅ Feng Yuan ⋅ Yaping Li ⋅ Ping Wang ⋅ Junhao Cai ⋅ Jia Zeng ⋅ Hao Dong ⋅ Jiangmiao Pang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 92
DemoFunGrasp: Universal Dexterous Functional Grasping via Demonstration-Editing Reinforcement Learning
Chuan Mao ⋅ Haoqi Yuan ⋅ Ziye Huang ⋅ Chaoyi Xu ⋅ Kai Ma ⋅ Zongqing Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 93
GeniNav: Generative Model Driven Image-Goal Navigation via Imagination-Guided Consistency Flow Matching
Yuqi Chen ⋅ Junjie Gao ⋅ Yongzhou Pan ⋅ Siyuan Song ⋅ ZIXUAN ZHANG ⋅ Jiaping Xiao ⋅ Mir Feroskhan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 94
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
Pingrui Zhang ⋅ Yifei Su ⋅ Pengyuan Wu ⋅ Dong An ⋅ Li Zhang ⋅ Zhigang Wang ⋅ Dong Wang ⋅ Bin Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 95
DRAMA: Next-Gen Dynamic Orchestration for Resilient Multi-Agent Ecosystems in Flux
Xinkui Zhao ⋅ Yifan Zhang ⋅ Sai Liu ⋅ Naibo Wang ⋅ Guanjie Cheng ⋅ Yueshen Xu ⋅ Chang Liu ⋅ Shuiguang Deng ⋅ Jianwei Yin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 96
Arcadia: Toward a Full-Lifecycle Framework for Embodied Lifelong Learning
Minghe Gao ⋅ Juncheng Li ⋅ Yuze Lin ⋅ Xuqi Liu ⋅ Jiaming Ji ⋅ Xiaoran Pan ⋅ Zihan Xu ⋅ Xian Li ⋅ Mingjie Li ⋅ Wei Ji ⋅ Rong Wei ⋅ Rui Tang ⋅ Qizhou Wang ⋅ Kai Shen ⋅ Jun Xiao ⋅ Qi Wu ⋅ Siliang Tang ⋅ Yueting Zhuang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 97
Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI
Xinhao Liu ⋅ Jiaqi Li ⋅ Youming Deng ⋅ Ruxin Chen ⋅ Yingjia Zhang ⋅ Yifei Ma ⋅ Li Guo ⋅ Yiming Li ⋅ Jing Zhang ⋅ Chen Feng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 98
ORV: 4D Occupancy-centric Robot Video Generation
Xiuyu Yang ⋅ Bohan Li ⋅ Shaocong Xu ⋅ Nan Wang ⋅ Chongjie Ye ⋅ Zhaoxi Chen ⋅ Minghan Qin ⋅ Yikang Ding ⋅ Zheng Zhu ⋅ Xin Jin ⋅ Hang Zhao ⋅ Hao Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 99
DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning
Junha Lee ⋅ Eunha Park ⋅ Minsu Cho
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 100
Language-Free Generative Editing from One Visual Example
Omar Elezabi ⋅ Eduard Zamfir ⋅ Zongwei Wu ⋅ Radu Timofte
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 101
Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models
Yujia Yang ⋅ Yuanxiang Wang ⋅ Zhenyu Guan ⋅ Tiankun Yang ⋅ Chenxi Bao ⋅ Haopeng Jin ⋅ Jinwen Luo ⋅ Xinyu Zuo ⋅ Lisheng Duan ⋅ Haijin Liang ⋅ Jin Ma ⋅ Xinming Wang ⋅ Ruiwen Tao ⋅ Hongzhu Yi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 102
LuxRemix: Lighting Decomposition and Remixing for Indoor Scenes
Ruofan Liang ⋅ Norman Müller ⋅ Ethan Weber ⋅ Duncan Zauss ⋅ Nandita Vijaykumar ⋅ Peter Kontschieder ⋅ Christian Richardt
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 103
CompBench: Benchmarking Complex Instruction-guided Image Editing
Bohan Jia ⋅ Wenxuan Huang ⋅ Yuntian Tang ⋅ Junbo Qiao ⋅ Jincheng Liao ⋅ Shaosheng Cao ⋅ Fei Zhao ⋅ Zhaopeng Feng ⋅ Zhouhong Gu ⋅ Zhenfei Yin ⋅ Lei Bai ⋅ Wanli Ouyang ⋅ Lin Chen ⋅ Fei Zhao ⋅ Zihan Wang ⋅ Yuan Xie ⋅ Shaohui Lin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 104
Garments2Look: A Multi-Reference Dataset for High-Fidelity Outfit-Level Virtual Try-On with Clothing and Accessories
Junyao Hu ⋅ Zhongwei Cheng ⋅ Waikeung Wong ⋅ Xingxing Zou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 105
Learning Personalized Photographic Style from Pairwise User Preferences
Jinwoo Kim ⋅ Jihye Yoo ⋅ Seon Joo Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 106
CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing
Yan Li ⋅ Lin Liu ⋅ Xiaopeng Zhang ⋅ Wei Xue ⋅ Wenhan Luo ⋅ Yike Guo ⋅ Qi Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 107
Efficient Weighted Sampling via Score-based Generative Models
Heasung Kim ⋅ Taekyun Lee ⋅ Hyeji Kim ⋅ Gustavo De Veciana
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 108
MOSAIC-GS: Monocular Scene Reconstruction via Advanced Initialization for Complex Dynamic Environments
Svitlana Morkva ⋅ Vaishakh Patil ⋅ Alessio Tonioni ⋅ Michael Oechsle ⋅ Maximum Wilder-Smith ⋅ Marco Hutter
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 109
REArtGS++: Generalizable Articulation Reconstruction with Temporal Geometry Constraint via Planar Gaussian Splatting
Di Wu ⋅ Liu Liu ⋅ Anran Huang ⋅ 玉研 刘 ⋅ Qiaojun Yu ⋅ Shaofan Liu ⋅ Liangtu Song ⋅ Cewu Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 110
Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer
Dong In Lee ⋅ Hyungjun Doh ⋅ Seunggeun Chi ⋅ Runlin Duan ⋅ Sangpil Kim ⋅ Karthik Ramani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 111
FaithFusion: Harmonizing Reconstruction and Generation via Pixel-wise Information Gain
YuAn Wang ⋅ Xiaofan Li ⋅ Chi Huang ⋅ Wenhao Zhang ⋅ Hao Li ⋅ Bosheng Wang ⋅ Xun Sun ⋅ Jun Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 112
IR-HGP: Physically-Aware Gaussian Inverse Rendering for High-Illumination Scenes via Generative Priors
Qingan Zhang ⋅ Wensheng Li ⋅ Chengying Gao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 113
Seeing through boxes: Non-Line-of-Sight 3D Reconstruction from Radar Signals
Jiachen Lu ⋅ Hailan Shanbhag ⋅ Haitham Al Hassanieh
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 114
Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists
Jiaqi Liu ⋅ Zhizhong Han
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 115
DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum
Yaokun Li ⋅ Lihe Ding ⋅ Xiao Chen ⋅ Guang Tan ⋅ Tianfan Xue
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 116
WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments
Xuweiyi Chen ⋅ Wentao Zhou ⋅ Zezhou Cheng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 117
DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
Xiaoxue Chen ⋅ Ziyi Xiong ⋅ Yuantao Chen ⋅ Gen Li ⋅ Nan Wang ⋅ Hongcheng Luo ⋅ Long Chen ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Guang Chen ⋅ Hongyang Li ⋅ Ya-Qin Zhang ⋅ Hangjun Ye ⋅ Hao Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 118
Retrieve-to-Restore: Efficient All-in-One Image Restoration with a Retrieval-Based Degradation Bank
Chenxu Wang ⋅ Kai Zhang ⋅ Jian Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 119
MRI Contrast Enhancement Kinetics World Model
Jindi Kong ⋅ Yuting He ⋅ Cong Xia ⋅ Rongjun Ge ⋅ Shuo Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 120
ReflexSplit: Single Image Reflection Separation via Layer Fusion-Separation
Chia-Ming Lee ⋅ Yu-Fan Lin ⋅ Jin-Hui Jiang ⋅ Yu-Jou Hsiao ⋅ Chih-Chung Hsu ⋅ Yu-Lun Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 121
Rethinking Knowledge Transfer in Image Quality Assessment: A Perceptual Preference Structure Alignment Perspective
Aobo Li ⋅ Jinjian Wu ⋅ Yongxu Liu ⋅ Jupo Ma ⋅ Weisheng Dong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 122
ZeroIDIR: Zero-Reference Illumination Degradation Image Restoration with Perturbed Consistency Diffusion Models
Hai Jiang ⋅ Zhen Liu ⋅ Yinjie Lei ⋅ Songchen Han ⋅ Bing Zeng ⋅ Shuaicheng Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 123
White-Balance First, Adjust Later: Cross-Camera Color Constancy via Vision-Language Evaluation
Shuwei Li ⋅ Lei Tan ⋅ Robby T. Tan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 124
Unpaired Image Deraining Using Reward-Guided Self-Reinforcement Strategy
Yinghao Chen ⋅ Yeying Jin ⋅ Xiang Chen ⋅ Yanyan Wei ⋅ Ziyang Yan ⋅ Yaowen Fu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 125
LF-BVN: Blind-View Network for Self-Supervised Light Field Denoising
Longzhao Guo ⋅ shuo zhang ⋅ Chen Gao ⋅ Qian Tian ⋅ Youfang Lin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 126
rPPG-VQA: A Video Quality Assessment Framework for Unsupervised rPPG Training
Tianyang Dai ⋅ Ming Chang ⋅ Yan Chen ⋅ Yang Hu
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 127
Efficient Real-Time Raw-to-Raw Denoising for Extreme Low-Light Ultra HD Video on Mobile Devices
Charantej Reddy Pochimireddy ⋅ Subhasmita Sahoo ⋅ Apoorva Verma ⋅ Palavalli Shyam ⋅ Swapnil Malviya ⋅ Sarvesh Sarvesh ⋅ Raj Narayana Gadde
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 128
Towards Generalized Representations for Low-Light Understanding: When Signal Constancy Meets Semantic Enrichment
Yifan Li ⋅ Haofeng Huang ⋅ Wenhan Yang ⋅ Jiaying Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 129
Synergistic Bleeding Region and Point Detection in Laparoscopic Surgical Videos
Jialun Pei ⋅ Zhangjun Zhou ⋅ Diandian Guo ⋅ Zhixi Li ⋅ Jing Qin ⋅ Bo Du ⋅ Pheng-Ann Heng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 130
MedCLIPSeg: Probabilistic Vision-Language Adaptation for Data-Efficient and Generalizable Medical Image Segmentation
Taha Koleilat ⋅ Hojat Asgariandehkordi ⋅ Omid Nejatimanzari ⋅ Berardino Barile ⋅ Yiming Xiao ⋅ Hassan Rivaz
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 131
AD-GBC: Anisotropic Granular-Ball Skip-Connection Refiner for UNet-Based Medical Image Segmentation
Xiya Shen ⋅ Qinglin Zhao ⋅ Li Feng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 132
OSA: Echocardiography Video Segmentation via Orthogonalized State Update and Anatomical Prior-aware Feature Enhancement
Rui Wang ⋅ Huisi Wu ⋅ Jing Qin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 133
VesMamba: 3D Pulmonary Vessel Segmentation from CT images via Mamba with Structural Perception and Scale-aware Filtering
Zhipeng Liu ⋅ Guilian Chen ⋅ Zheng Jiang ⋅ Huisi Wu ⋅ Jing Qin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 134
SemiGDA: Generative Dual-distribution Alignment for Semi-Supervised Medical Image Segmentation
kaiwen Huang ⋅ Yi Zhou ⋅ Yizhe Zhang ⋅ Jingxiong Li ⋅ Tao Zhou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 135
Diffusion-Based Native Adversarial Synthesis for Enhanced Medical Segmentation Generalization
Hongyu Zhang ⋅ Haipeng Chen ⋅ Zhimin Xu ⋅ Chengxin Yang ⋅ Yingda Lyu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 136
CG-Reasoner: Centroid-Guided Positional Reasoning Segmentation for Medical Imaging with a Robust Visual-Text Consistency Metric
Lakshmikar R. ⋅ Ming Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 137
Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset
Geon Choi ⋅ Hangyul Yoon ⋅ Hyunju Shin ⋅ Hyunki Park ⋅ Sang Hoon Seo ⋅ Eunho Yang ⋅ Edward Choi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 138
Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction
Yuanbo Li ⋅ Tianyang Xu ⋅ Cong Hu ⋅ Tao Zhou ⋅ Xiao-Jun Wu ⋅ Josef Kittler
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 139
Towards Human-Imperceptible Backdoor Attacks on Text-to-Image Diffusion Models
Changkun Wu ⋅ Chenghao Chen ⋅ Wu kun ⋅ Chong Fu ⋅ Biru Zhu ⋅ Zhenyu Wen ⋅ Zhen Hong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 140
TTP: Test-Time Padding for Adversarial Detection and Robust Adaptation on Vision-Language Models
Zhiwei Li ⋅ Yitian Pang ⋅ Weining Wang ⋅ Zhenan Sun ⋅ Qi Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 141
DualMirage: Hunting Stealthy Multimodal LLM Agents via CAPTCHAs with Contour and Adversarial Illusions
Bei Chen ⋅ Gaolei Li ⋅ Jun Wu ⋅ Jianhua Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 142
Models as Lego Builders: Assembling Malice from Benign Blocks via Semantic Blueprints
Chenxi Li ⋅ Xianggan Liu ⋅ Dake Shen ⋅ Yaosong Du ⋅ Zhibo Yao ⋅ Hao Jiang ⋅ Linyi Jiang ⋅ Chengwei Cao ⋅ Jingzhe Zhang ⋅ RanYi Peng ⋅ Peiling Bai ⋅ Xiande Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 143
Source Models Leak What They Shouldn’t: Unlearning Zero-Shot Transfer in Domain Adaptation Through Adversarial Optimization
Arnav Devalapally ⋅ Poornima Jain ⋅ Kartik Srinivas ⋅ Vineeth Balasubramanian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 144
A Unified Perspective on Adversarial Membership Manipulation in Vision Models
RUIZE GAO ⋅ Kaiwen Zhou ⋅ Yongqiang Chen ⋅ Feng Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 145
Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack
Chenyang LI ⋅ Wenbing Tang ⋅ Yihao Huang ⋅ Simon Sinong Zhan ⋅ Ming Hu ⋅ Xiaojun Jia ⋅ Yang Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 146
OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models
tengjin Weng ⋅ Wenhao Jiang ⋅ Jingyi Wang ⋅ Ming Li ⋅ Lin Ma ⋅ Zhong Ming
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 147
Beyond What's Shared: Recovering Lost Unique Information from Intermediate Layers to Boost Multimodal Geo-Foundation Models
JangHyeon Lee ⋅ Philipe Ambrozio Dias ⋅ Yao-Yi Chiang ⋅ Dalton Lunga
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 148
WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition
Shan Ning ⋅ Longtian Qiu ⋅ Jiaxuan Sun ⋅ Xuming He
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 149
CLCR: Cross-Level Semantic Collaborative Representation for Multimodal Learning
Chunlei Meng ⋅ Guanhong Huang ⋅ Rong Fu ⋅ Runmin Jian ⋅ Zhongxue Gan ⋅ Chun Ouyang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 150
Learning Anchor in Dual Orthogonal Space for Fast Multi-view Clustering
Yalan Qin ⋅ Hanzhou Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 151
Bootstrapping Multi-view Learning for Test-time Noisy Correspondence
Changhao He ⋅ Di Xue ⋅ Shuxian Li ⋅ Yanji Hao ⋅ Xi Peng ⋅ Peng Hu
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 152
Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
Qihao Liu ⋅ Chengzhi Mao ⋅ Yaojie Liu ⋅ Alan L. Yuille ⋅ Wen-Sheng Chu
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 153
FAVE: A Structured Benchmark for Fine-Grained Audio-Visual Temporal Evaluation in Multimodal LLMs
Weiheng Lu ⋅ An Yu ⋅ Jian Li ⋅ Zhenfei Zhang ⋅ Felix X.-F. Ye ⋅ Ming-Ching Chang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 154
Omni2Sound: Towards Unified Video-Text-to-Audio Generation
yusheng dai ⋅ Zehua Chen ⋅ Yuxuan Jiang ⋅ Qiuhong Ke ⋅ Jianfei Cai ⋅ Jun Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 155
EmoThinker: Advancing Visual-Acoustic Emotion Analysis via Structural Token Selection and Chain-of-Thought Reasoning
Qinfu Xu ⋅ Liyuan Pan ⋅ Yiwei Wei ⋅ Shaozu Yuan ⋅ Jiaqi Chen ⋅ Tianyu Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 156
Enhancing Descriptive Captions with Visual Attributes for Multimodal Perception
Yanpeng Sun ⋅ JING HAO ⋅ Ke Zhu ⋅ Jiang-Jiang Liu ⋅ Xiaofan Li ⋅ Na Zhao ⋅ Zechao Li ⋅ Jingdong Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 157
DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Models
Zhou Tao ⋅ Shida Wang ⋅ YongXiang Hua ⋅ Haoyu Cao ⋅ Linli Xu
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 158
Vision-Speech Models: Teaching Speech Models to Converse about Images
Amélie Royer ⋅ Moritz Böhle ⋅ Laurent Mazaré ⋅ Neil Zeghidour ⋅ Alexandre Défossez ⋅ Patrick Pérez
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 159
EMMA: Extracting Multiple physical parameters from Multimodal Data
Farhat Shaikh ⋅ Ayan Banerjee ⋅ Sandeep Gupta
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 160
MMGait: Towards Multi-Modal Gait Recognition
Chenye Wang ⋅ Qingyuan Cai ⋅ Saihui Hou ⋅ Aoqi Li ⋅ Yongzhen Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 161
OSMO: Open-vocabulary Self-eMOtion Tracking
Mohamed Abdelfattah ⋅ Bugra Tekin ⋅ Fadime Sener ⋅ Necati Cihan Camgoz ⋅ Eric Sauser ⋅ Shugao Ma ⋅ Alex Alahi ⋅ Edoardo Remelli
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 162
MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model
Geonmo Gu ⋅ Byeongho Heo ⋅ Jaemyung Yu ⋅ Jaehui Hwang ⋅ Taekyung Kim ⋅ Sangmin Lee ⋅ HeeJae Jun ⋅ Yoohoon Kang ⋅ Sangdoo Yun ⋅ Dongyoon Han
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 163
Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video
Chanhyuk Choi ⋅ Taesoo Kim ⋅ Donggyu Lee ⋅ Siyeol Jung ⋅ Taehwan Kim
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 164
Unleashing the Intrinsic Visual Representation Capability of Multimodal Large Language Models
Hengzhuang Li ⋅ Xinsong Zhang ⋅ QIMING PENG ⋅ Bin Luo ⋅ Han Hu ⋅ Dengyang Jiang ⋅ Han-Jia Ye ⋅ Teng Zhang ⋅ Hai Jin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 165
Active Perceptual Inference: A Corticothalamic-Inspired Dynamic Nested Recurrent Network for Multimodal Sentiment Analysis with Incomplete Data
Yujuan Zhang ⋅ Qing Li ⋅ Ziyu Li ⋅ Xiuxing Li ⋅ Zhuo Wang ⋅ Mengrui Xu ⋅ Xia Wu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 166
Scalable Trajectory Generation for Whole-Body Mobile Manipulation
Yida Niu ⋅ Xinhai Chang ⋅ Xin Liu ⋅ Ziyuan Jiao ⋅ Yixin Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 167
Breaking the 3D Dataset Bottleneck: Fast Scalable Generation of Aligned 3D Assets from Scratch for Category 6D Pose Estimation and Robotic Grasping
Duret Guillaume ⋅ Danylo Mazurak ⋅ Florence Zara ⋅ Jan Peters ⋅ Liming Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 168
Real-Time Multimodal Fingertip Contact Detection via Depth and Motion Fusion for Vision-Based Human–Computer Interaction
Mukhiddin Toshpulatov ⋅ Wookey Lee ⋅ Suan Lee ⋅ Geehyuk Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 169
Glove2Hand: Synthesizing Natural Hand-Object Interaction from Multi-Modal Sensing Gloves
Xinyu Zhang ⋅ Ziyi Kou ⋅ Chuan Qin ⋅ Mia Huang ⋅ Ergys Ristani ⋅ Ankit Kumar ⋅ Lele Chen ⋅ Kun He ⋅ Abdeslam Boularias ⋅ Li Guan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 170
UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos
Gu Zhang ⋅ Qicheng Xu ⋅ Haozhe Zhang ⋅ Jianhan Ma ⋅ Long He ⋅ Yiming Bao ⋅ Zeyu Ping ⋅ Zhecheng Yuan ⋅ Chenhao Lu ⋅ Chengbo Yuan ⋅ Tianhai Liang ⋅ Xiaoyu Tian ⋅ Maanping Shao ⋅ Feihong Zhang ⋅ Mingyu Ding ⋅ Yang Gao ⋅ Hao Zhao ⋅ Hang Zhao ⋅ Huazhe Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 171
ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation
Mingyang Wu ⋅ Ashirbad Mishra ⋅ Soumik Dey ⋅ Shuo Xing ⋅ Naveen Ravipati ⋅ Hansi Wu ⋅ Binbin Li ⋅ Zhengzhong Tu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 172
DiverseGRPO: Mitigating Mode Collapse in Image Generation via Diversity-Aware GRPO
Henglin Liu ⋅ Huijuan Huang ⋅ Jing Wang ⋅ Chang Liu ⋅ Xiu Li ⋅ Xiangyang Ji
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 173
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation
Shikun Sun ⋅ Liao Qu ⋅ Huichao Zhang ⋅ Yiheng Liu ⋅ Yangyang Song ⋅ Xian Li ⋅ Yi Jiang ⋅ Xu Wang ⋅ Jia Jia ⋅ Daniel Kang Du ⋅ Xinglong Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 174
Video Generation with Stable Transparency via Shiftable RGB-A Distribution Learner
Haotian Dong ⋅ Wenjing Wang ⋅ Chen Li ⋅ Jing LYU ⋅ Di Lin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 175
MOFA-VTON: More Fashion Possibilities with Fine-Grained Adaptations in Virtual Try-On
Xiaoyu Han ⋅ Chenyang Wang ⋅ Jing Wang ⋅ Shunyuan Zheng ⋅ Quanling Meng ⋅ Shengping Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 176
Scaling Multi-Identity Consistency for Image Customization via Multi-to-Multi Matching Paradigm
Yufeng Cheng ⋅ wenxu wu ⋅ Shaojin Wu ⋅ Mengqi Huang ⋅ Fei Ding ⋅ Qian HE
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 177
NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing
Tianlin Pan ⋅ Jiayi Dai ⋅ Chenpu Yuan ⋅ Zhengyao Lv ⋅ Binxin Yang ⋅ Hubery Yin ⋅ Chen Li ⋅ Jing LYU ⋅ Caifeng Shan ⋅ Chenyang Si
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 178
Functional Mean Flow in Hilbert Space
Zhiqi Li ⋅ Yuchen Sun ⋅ Greg Turk ⋅ Bo Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 179
Benchmarking Single-Factor Physical Video-to-Audio Generation
Tingle Li ⋅ Siddharth Gururani ⋅ Kevin Shih ⋅ Gantavya Bhatt ⋅ Sang-gil Lee ⋅ Zhifeng Kong ⋅ Arushi Goel ⋅ Gopala Anumanchipalli ⋅ Ming-Yu Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 180
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
Guozhen Zhang ⋅ Zixiang Zhou ⋅ Teng Hu ⋅ Ziqiao Peng ⋅ Youliang Zhang ⋅ Yi Chen ⋅ Yuan Zhou ⋅ qinglin lu ⋅ Limin Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 181
Refaçade: Editing Object with Given Reference Texture
Youze Huang ⋅ Penghui Ruan ⋅ Bojia Zi ⋅ Xianbiao Qi ⋅ Jianan Wang ⋅ Rong Xiao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 182
Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction
Jiahao Tian ⋅ Chenxi Song ⋅ Wei Cheng ⋅ Chi Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 183
Not All Birds Look The Same: Identity-Preserving Generation For Birds
Aaron Sun ⋅ Oindrila Saha ⋅ Subhransu Maji
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 184
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images
Yi Chen Liu ⋅ Donghao Zhou ⋅ Jie Wang ⋅ Xin Gao ⋅ Guisheng Liu ⋅ Jiatong Li ⋅ Quanwei Zhang ⋅ Qiang Lyu ⋅ Lanqing Guo ⋅ Shilei Wen ⋅ Weiqiang Wang ⋅ Pheng-Ann Heng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 185
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing
YANG FU ⋅ Yike Zheng ⋅ Ziyun Dai ⋅ Henghui Ding
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 186
Clothe and Pose
Nakul Sharma ⋅ Aayush Bansal ⋅ Minh Vo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 187
FlowPortal: Residual-Corrected Flow for Training-Free Video Relighting and Background Replacement
Wenshuo Gao ⋅ Junyi Fan ⋅ Jiangyue Zeng ⋅ Shuai Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 188
The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment
Ziheng Ouyang ⋅ Yiren Song ⋅ Yaoli Liu ⋅ Shihao Zhu ⋅ Qibin Hou ⋅ Mingming Cheng ⋅ Mike Zheng Shou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 189
Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training
Peng Sun ⋅ Jun XIE ⋅ Tao Lin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 190
VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
Maitreya Patel ⋅ Jingtao Li ⋅ Weiming Zhuang ⋅ Yezhou Yang ⋅ Lingjuan Lv
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 191
Bidirectional Normalizing Flow: From Data to Noise and Back
Yiyang Lu ⋅ Qiao Sun ⋅ Xianbang Wang ⋅ Zhicheng Jiang ⋅ Hanhong Zhao ⋅ Kaiming He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 192
ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Xiaoxue Wu ⋅ Xinyuan Chen ⋅ Yaohui Wang ⋅ Yu Qiao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 193
Are Image-to-Video Models Good Zero-Shot Image Editors?
Zechuan Zhang ⋅ Zhenyuan Chen ⋅ Zongxin Yang ⋅ Yi Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 194
FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters
Shitong Shao ⋅ Yufei Gu ⋅ Zeke Xie
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 195
Unified Latent Space for Understanding and Generation via Semantic Auto-encoder
Xiaojie Li ⋅ Yang Zhao ⋅ Ming Li ⋅ Yancheng Zhang ⋅ Zonglin Lyu ⋅ Yunpeng Chen ⋅ Rui Wang ⋅ Daquan Zhou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 196
AHS: Adaptive Head Synthesis via Synthetic Data Augmentations
Taewoong Kang ⋅ Hyojin Jang ⋅ Sohyun Jeong ⋅ Seunggi Moon ⋅ Gihwi Kim ⋅ Hoon Jin Jung ⋅ Jaegul Choo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 197
CASR: A Robust Cyclic Framework for Arbitrary Large-Scale Super-Resolution with Distribution Alignment and Self-Similarity Awareness
Wenhao Guo ⋅ Zhaoran Zhao ⋅ Peng Lu ⋅ Sheng Li ⋅ Qian Qiao ⋅ RuiDe Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 198
Thermal Diffusion Matters: Infrared Spatial-Temporal Video Super-Resolution through Heat Conduction Priors
Mingxuan Zhou ⋅ Shuang Li ⋅ Yutang Zhang ⋅ Jing Geng ⋅ Yirui Shen ⋅ Jingxuan Kang ⋅ Fuzhen Zhuang ⋅ Shuigen Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 199
TextOVSR: Text-Guided Real-World Opera Video Super-Resolution
Hua Chang ⋅ Xin Xu ⋅ Wei Liu ⋅ Jiayi Wu ⋅ Kui Jiang ⋅ Fei Ma ⋅ Qi Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 200
VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution
August Leander Høeg ⋅ Sophia Bardenfleth ⋅ Hans Martin Kjer ⋅ Tim Dyrby ⋅ Vedrana Dahl ⋅ Anders Bjorholm Dahl
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 201
GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution
Qiaosi Yi ⋅ Shuai Li ⋅ Rongyuan Wu ⋅ Lingchen Sun ⋅ Zhengqiang ZHANG ⋅ Lei Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 202
Adaptive Anisotropic Gaussian Splatting for Multi-contrast MRI Arbitrary-Scale Super-Resolution with Anatomy Guidance
Qiuhai Yan ⋅ Kang Chen ⋅ Zhengjie Lu ⋅ Tingting Wang ⋅ Faming Fang ⋅ Guixu Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 203
SignPR: A Progressive Vector-Quantized Diffusion Framework for Sign Language Production
Xiao Liu ⋅ Shiwei Gan ⋅ Yafeng Yin ⋅ Bowen Guo ⋅ Zhiwei Jiang ⋅ Shunmei Meng ⋅ Lei Xie ⋅ Sanglu Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 204
LLaMo: Scaling Pretrained Language Models for Unified Motion Understanding and Generation with Continuous Autoregressive Tokens
Zekun Li ⋅ Sizhe An ⋅ Chengcheng Tang ⋅ Chuan Guo ⋅ Ivan Shugurov ⋅ Linguang Zhang ⋅ Amy Zhao ⋅ Srinath Sridhar ⋅ Lingling Tao ⋅ Abhay Mittal
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 205
FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision
Zekai Wu ⋅ Shuqi Fan ⋅ Mengyin Liu ⋅ Yuhua Luo ⋅ Xincheng Lin ⋅ Ming Yan ⋅ Junhao Wu ⋅ Xiuhong Lin ⋅ Yuexin Ma ⋅ Chenglu Wen ⋅ Lan Xu ⋅ Siqi Shen ⋅ Cheng Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 206
Geometric Neural Distance Fields for Learning Human Motion Priors
Zhengdi Yu ⋅ Simone Foti ⋅ Linguang Zhang ⋅ Amy Zhao ⋅ Cem Keskin ⋅ Stefanos Zafeiriou ⋅ Tolga Birdal
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 207
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
Zhixue Fang ⋅ Xu He ⋅ Songlin Tang ⋅ Haoxian Zhang ⋅ Qingfeng Li ⋅ Xiaoqiang Liu ⋅ Pengfei Wan ⋅ Kun Gai
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 208
Decoupled Generative Modeling for Human-Object Interaction Synthesis
Hwanhee Jung ⋅ Seunggwan Lee ⋅ Jeongyoon Yoon ⋅ SeungHyeon Kim ⋅ Giljoo Nam ⋅ Qixing Huang ⋅ Sangpil Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 209
LiveGesture: Streamable Co-Speech Gesture Generation Model
Muhammad Usama Saleem ⋅ Mayur Jagdishbhai Patel ⋅ Ekkasit Pinyoanuntapong ⋅ Zhongxing Qin ⋅ Li Yang ⋅ Hongfei Xue ⋅ Ahmed Helmy ⋅ Chen Chen ⋅ Pu Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 210
HandX: Scaling Bimanual Motion and Interaction Generation
Zimu Zhang ⋅ Yucheng Zhang ⋅ Xiyan Xu ⋅ Ziyin Wang ⋅ Sirui Xu ⋅ Kai Zhou ⋅ Bing Zhou ⋅ Chuan Guo ⋅ Jian Wang ⋅ Yu-Xiong Wang ⋅ Liang-Yan Gui
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 211
MaskAdapt: Learning Flexible Motion Adaptation via Mask-Invariant Prior for Physics-Based Characters
Soomin Park ⋅ Eunseong Lee ⋅ Kwang Bin Lee ⋅ Sung-Hee Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 212
FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
YIYI CAI ⋅ Yuhan Wu ⋅ Kunhang Li ⋅ YOU ZHOU ⋅ Bo Zheng ⋅ Haiyang Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 213
ProjFlow: Projection Sampling with Flow Matching for Zero‑Shot Exact Spatial Motion Control
Akihisa Watanabe ⋅ Qing Yu ⋅ Edgar Simo-Serra ⋅ Kent Fujiwara
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 214
Correspondence-Attention Alignment for Multi-View Diffusion Models
Minkyung Kwon ⋅ Jinhyeok Choi ⋅ Jiho Park ⋅ Seonghu Jeon ⋅ Jinhyuk Jang ⋅ Junyoung Seo ⋅ Minseop Kwak ⋅ Jin-Hwa Kim ⋅ Seungryong Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 215
GenErase: Generalizable and Semantically-Aware Concept Erasure in Diffusion Models
Korada Sri Vardhana ⋅ Soma Biswas
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 216
MatMart: Material Reconstruction of 3D Objects via Diffusion
Xiuchao Wu ⋅ Pengfei Zhu ⋅ Jiangjing Lyu ⋅ Xinguo Liu ⋅ Jie Guo ⋅ Yanwen Guo ⋅ Weiwei Xu ⋅ Chengfei Lv
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 217
Region-Adaptive Sampling for Diffusion Transformers
Ziming Liu ⋅ Yifan Yang ⋅ Chengruidong Zhang ⋅ Yiqi Zhang ⋅ Lili Qiu ⋅ Yang You ⋅ Yuqing Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 218
Diffusion Guided Chain-of-Vision for Large Autoregressive Vision Models
Xinyang Wang ⋅ Kecheng Zheng ⋅ Minfeng Zhu ⋅ Wei Wu ⋅ Fan Lu ⋅ Wei Zhai ⋅ Wei Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 219
Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation
Boyu Han ⋅ Qianqian Xu ⋅ Shilong Bao ⋅ Zhiyong Yang ⋅ Ruochen Cui ⋅ Xilin Zhao ⋅ Qingming Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 220
ConceptPrism: Concept Disentanglement in Personalized Diffusion Models via Residual Token Optimization
Minseo Kim ⋅ Minchan Kwon ⋅ Dongyeun Lee ⋅ Yunho Jeon ⋅ Junmo Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 221
Heterogeneous Decentralized Diffusion Models
Zhiying Jiang ⋅ Raihan Seraj ⋅ Marcos Villagra ⋅ Bidhan Roy
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 222
Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning
Ziyi Zhang ⋅ Li Shen ⋅ Deheng Ye ⋅ Yong Luo ⋅ Huangxuan Zhao ⋅ Meng Liu ⋅ Wei Yu ⋅ Lefei Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 223
GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation
Rang Li ⋅ Lei Li ⋅ Shuhuai Ren ⋅ Hao Tian ⋅ Shuhao Gu ⋅ Shicheng Li ⋅ Zihao Yue ⋅ Yudong Wang ⋅ Wenhan Ma ⋅ Zhe Yang ⋅ Jingyuan Ma ⋅ Zhifang Sui ⋅ Fuli Luo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 224
ENC-Bench: A Benchmark for Evaluating Multimodal Large Language Models in Electronic Navigational Chart Understanding
Ao Cheng ⋅ Xingming Li ⋅ Xuanyu Ji ⋅ Xixiang He ⋅ Qiyao Sun ⋅ Chunping Qiu ⋅ Runke Huang ⋅ Qingyong Hu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 225
Nonparametric Deep Fine-grained Clustering with Low-Rank Guided Vision-Language Model
xulun ye ⋅ Benyu Wu ⋅ Jie Hong ⋅ Kun Zhou
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 226
RealBirdID: Benchmarking Bird Species Identification in the Era of MLLMs
Logan Lawrence ⋅ Oindrila Saha ⋅ Rangel Daroya ⋅ Mustafa Chasmai ⋅ Wuao Liu ⋅ Max Hamilton ⋅ Aaron Sun ⋅ Seoyun Jeong ⋅ Fabien Delattre ⋅ Subhransu Maji ⋅ Grant Horn
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 227
Fast SceneScript: Fast and Accurate Language‑Based 3D Scene Understanding via Multi‑Token Prediction
Ruihong Yin ⋅ Xuepeng Shi ⋅ Oleksandr Bailo ⋅ Marco Manfredi ⋅ Theo Gevers
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 228
PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks
Cheng Cui ⋅ yubo zhang ⋅ Ting Sun ⋅ Xueqing Wang ⋅ Hongen Liu ⋅ Manhui Lin ⋅ Yue Zhang ⋅ Tingquan Gao ⋅ Changda Zhou ⋅ Jiaxuan Liu ⋅ Zelun Zhang ⋅ Jing Zhang ⋅ Jun Zhang ⋅ Yi Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 229
World in a Frame: Understanding Culture Mixing as a New Challenge for Vision-Language Models
Eunsu Kim ⋅ Junyeong Park ⋅ Na Min An ⋅ Junseong Kim ⋅ Hitesh Laxmichand Patel ⋅ Jiho Jin ⋅ Julia Kruk ⋅ Amit Agarwal ⋅ Srikant Panda ⋅ Fenal Ashokbhai Ilasariya ⋅ Hyunjung Shim ⋅ Alice Oh
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 230
Gastric-X: A Multimodal Multi-Phase Benchmark Dataset for Advancing Vision-Language Models in Gastric Cancer Analysis
Yuanzhe Li ⋅ Hao Chen ⋅ Rui Yin ⋅ Juyan Ba ⋅ Yu Zhang ⋅ Sheng Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 231
HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models
Huizhi Liang ⋅ Yichao Shen ⋅ Yu Deng ⋅ Sicheng Xu ⋅ ZhiYuan Feng ⋅ Tong Zhang ⋅ Yaobo Liang ⋅ Jiaolong Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 232
HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models
Khalequzzaman Chowdhury Sayem ⋅ Mubarrat Chowdhury ⋅ Yihalem Yimolal Tiruneh ⋅ Muneeb Ahmed Khan ⋅ Muhammad Salman Ali ⋅ Binod Bhattarai ⋅ Seungryul Baek
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 233
Probing and Bridging Geometry–Interaction Cues for Affordance Reasoning in Vision Foundation Models
Qing Zhang ⋅ Xuesong li ⋅ Jing Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 234
ARC Is a Vision Problem!
Keya Hu ⋅ Ali Cy ⋅ Linlu Qiu ⋅ Xiaoman Delores Ding ⋅ Runqian Wang ⋅ Yeyin Eva Zhu ⋅ Jacob Andreas ⋅ Kaiming He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 235
Geoint-R1: Formalizing Multimodal Geometric Reasoning with Dynamic Auxiliary Constructions
Jingxuan Wei ⋅ Caijun Jia ⋅ Qi Chen ⋅ Honghao He ⋅ Linzhuang Sun ⋅ Conghui He ⋅ Lijun Wu ⋅ Bihui Yu ⋅ Cheng Tan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 236
S^2-MLLM: Boosting Spatial Reasoning Capability of MLLMs for 3D Visual Grounding with Structural Guidance
Beining Xu ⋅ Siting Zhu ⋅ Zhao Jin ⋅ Junxian Li ⋅ Hesheng Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 237
Learning Multi-View Spatial Reasoning from Cross-View Relations
Suchae Jeong ⋅ Jaehwi Song ⋅ Haeone Lee ⋅ Hanna Kim ⋅ Jian Kim ⋅ Dongjun Lee ⋅ Dong Kyu Shin ⋅ Changyeon Kim ⋅ Dongyoon Hahm ⋅ Woogyeol Jin ⋅ Juheon Choi ⋅ Kimin Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 238
Exploring Spatial Intelligence from a Generative Perspective
Muzhi Zhu ⋅ Shunyao Jiang ⋅ Huanyi Zheng ⋅ Zekai Luo ⋅ Hao Zhong ⋅ Anzhou Li ⋅ Kaijun Wang ⋅ Jintao Rong ⋅ Yang Liu ⋅ Hao Chen ⋅ Tao Lin ⋅ Chunhua Shen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 239
Physical Object Understanding with a Physically Controllable World Model
Rahul Venkatesh ⋅ Klemen Kotar ⋅ Lilian Naing Chen ⋅ Wanhee Lee ⋅ Gia Ancone ⋅ Seungwoo Kim ⋅ Luca Thomas Wheeler ⋅ Jared Watrous ⋅ Honglin Chen ⋅ Daniel Bear ⋅ Stefan Stojanov ⋅ Daniel L.K. Yamins
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 240
QueryMe: Query-Driven Open-Vocabulary 3D Object Affordances Grounding from Multimodal Evidence
Weiyu Zhao ⋅ Ru Li ⋅ Jiaqi Liu ⋅ Sizhe Zhao ⋅ Qinglin Liu ⋅ Shengping Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 241
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
Zhangquan Chen ⋅ Manyuan Zhang ⋅ Xinlei Yu ⋅ Xufang Luo ⋅ Mingze Sun ⋅ Zihao Pan ⋅ Xiang An ⋅ Yan Feng ⋅ Peng Pei ⋅ Xunliang Cai ⋅ Ruqi Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 242
EG-3DVG: Expression and Geometry Aware Grounding Decoder for 3D Visual Grounding
GwangWook Park ⋅ Hyo-Jun Lee ⋅ Jong-Hyeon Baek ⋅ Hanul Kim ⋅ Yeong Jun Koh
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 243
AffordMatcher: Affordance Learning in 3D Scenes from Visual Signifiers
Nghia Vu ⋅ Tuong Do ⋅ Khang Nguyen ⋅ Baoru Huang ⋅ Nhat Le ⋅ Binh Xuan Nguyen ⋅ Erman Tjiputra ⋅ Quang D. Tran ⋅ Ravi Prakash ⋅ Te-Chuan Chiu ⋅ Anh Nguyen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 244
SpatiaLQA: A Benchmark for Evaluating Spatial Logical Reasoning in Vision-Language Models
Yuechen Xie ⋅ Xiaoyan Zhang ⋅ Yicheng Shan ⋅ Zhu Hao ⋅ Rui Tang ⋅ Rong Wei ⋅ Mingli Song ⋅ Yuanyu Wan ⋅ Jie Song
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 245
Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval
Zhiheng Fu ⋅ Yupeng Hu ⋅ Qianyun Yang ⋅ Shiqi Zhang ⋅ Zhiwei Chen ⋅ Zixu Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 246
Intra-class Distribution-guided Generative Hashing with Neighbor Refinement for Cross-modal Retrieval
Hao Sun ⋅ Yadong Huo ⋅ Qibing Qin ⋅ Wenfeng Zhang ⋅ Lei Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 247
Language-driven Fine-grained Retrieval
Shijie Wang ⋅ Xin Yu ⋅ Yadan Luo ⋅ Zijian Wang ⋅ Pengfei Zhang ⋅ Zi Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 248
MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding
Fan Yang ⋅ Xingping Dong ⋅ Xin Yu ⋅ Wenhan Luo ⋅ Wei Liu ⋅ Kaihao Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 249
RetFormer: Multimodal Retrieval for Enhancing Image Recognition
Tianrui Yu ⋅ Xiubo Liang ⋅ Hongzhi Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 250
DREAM: Document Recognition with Explicit Adaptive Memory
TIANQI ZHAO ⋅ Di Wu ⋅ Liangrui Peng ⋅ Yifan Huang ⋅ Kemeng Zhao ⋅ Shuo Li ⋅ Zhiyu Li ⋅ Yizhu Wang ⋅ Borui Jiang ⋅ Yuyang Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 251
RMIR: A Benchmark Dataset for Reasoning-Intensive Multimodal Image Retrieval
Yijiang Li ⋅ Kunal Kotian ⋅ Ali Marjaninejad ⋅ Meir Friedenberg ⋅ Kaushik Pavani ⋅ Sunny Dasgupta
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 252
POGA: Paraphrased and Oppositional Graph Alignment for Fine-Grained Cross-Modal Retrieval
Junfeng Zhang ⋅ Zhe Xue ⋅ Yuankai Qi ⋅ Junping Du ⋅ Xiangyang Kong ⋅ Yishuo Yan ⋅ Amin Beheshti ⋅ Jian Yang ⋅ Anton van den Hengel ⋅ Ming-Hsuan Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 253
Chain-of-Frames: Advancing Video Understanding in Multimodal LLMs via Frame-Aware Reasoning
SARA GHAZANFARI ⋅ Francesco Croce ⋅ Nicolas Flammarion ⋅ Prashanth Krishnamurthy ⋅ Farshad Khorrami ⋅ Siddharth Garg
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 254
TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning
Tao Wu ⋅ Li Yang ⋅ Gen Zhan ⋅ Yabin ZHANG ⋅ Yiting Liao ⋅ Junlin Li ⋅ Deliang Fu ⋅ Li zhang ⋅ Limin Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 255
RiskProp: Collision-Anchored Self-Supervised Risk Propagation For Early Accident Anticipation
Yiyang Zou ⋅ Tianhao Zhao ⋅ Peilun Xiao ⋅ Hongyu Jin ⋅ Longyu Qi ⋅ Yuxuan Li ⋅ Liyin Liang ⋅ Yifeng Qian ⋅ Chunbo Lai ⋅ Yutian Lin ⋅ Zhihui Li ⋅ Yu Wu
[ Slides
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 256
MotionEnhancer: Leveraging Video Diffusion for Motion-Enhanced Vision-Language Models
Yifan Xu ⋅ Chao Zhang ⋅ Ruifei Ma ⋅ Fei Gao ⋅ Zhifei Yang ⋅ Jiaxing Qi ⋅ Zhipeng Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 257
MedGRPO: Multi-Task Reinforcement Learning for Heterogeneous Medical Video Understanding
Yuhao Su ⋅ Anwesa Choudhuri ⋅ Zhongpai Gao ⋅ Benjamin Planche ⋅ Van Nguyen Nguyen ⋅ Meng Zheng ⋅ Yuhan Shen ⋅ Arun Innanje ⋅ Terrence Chen ⋅ Ehsan Elhamifar ⋅ Ziyan Wu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 258
Asynchronous Temporal Modeling with Two-Agent Framework for Streaming Dense Video Captioning
Yolo Yunlong Tang ⋅ Chao Huang ⋅ Susan Liang ⋅ Jing Bi ⋅ Yicheng Wang ⋅ Daiki Shimada ⋅ Chenliang Xu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 259
TRCoRSurg: Temporal-Relational Co-Reasoning for Surgical Video Triplet Recognition
Fang Li ⋅ Shihao Zou ⋅ Weixin Si ⋅ Yang Gao ⋅ Shuai Li ⋅ Aimin Hao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 260
OASIS: On-Demand Hierarchical Event Memory for Streaming Video Reasoning
Zhijia Liang ⋅ Jiaming Li ⋅ Weikai Chen ⋅ Yanhao Zhang ⋅ Haonan Lu ⋅ Guanbin Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 261
One-Shot Flow, Any-Time Frame: A Bidirectional Warping Framework for Event-Based Video Frame Interpolation
Linghui Fu ⋅ Yuhan Liu ⋅ Hao Chen ⋅ Zhen Yang ⋅ Yongjian Deng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 262
TF-CADE: Foreground-Concentrated Text-Video Alignment for Zero-Shot Temporal Action Detection
Yearang Lee ⋅ Ho-Joong Kim ⋅ Seong-Whan Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 263
PRISM: Prototype-based Reasoning with Inter-modal Semantic Mining for Interpretable Image Recognition
Anni Yu ⋅ Yu-Bin Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 264
Concept Regions Matter: Benchmarking CLIP with a New Cluster-Importance Approach
Aishwarya Agarwal ⋅ Srikrishna Karanam ⋅ Vineet Gandhi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 265
PhaseWin Search Framework Enable Efficient Object-Level Interpretation
Zihan Gu ⋅ Ruoyu Chen ⋅ Junchi Zhang ⋅ Yue Hu ⋅ Hua Zhang ⋅ Xiaochun Cao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 266
Beyond Top Activations: Efficient and Reliable Crowdsourced Evaluation of Automated Interpretability
Tuomas Oikarinen ⋅ Ge Yan ⋅ Akshay Kulkarni ⋅ Tsui-Wei Weng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 267
From Weights to Concepts: Data-Free Interpretability of CLIP via Singular Vector Decomposition
Francesco Gentile ⋅ Nicola DallAsen ⋅ Francesco Tonini ⋅ Massimiliano Mancini ⋅ Lorenzo Vaquero ⋅ Elisa Ricci
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 268
Hierarchical Concept Embedding & Pursuit for Interpretable Image Classification
Nghia Nguyen ⋅ Tianjiao Ding ⋅ Rene Vidal
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 269
Interpretable and Steerable Concept Bottleneck Sparse Autoencoders
Akshay Kulkarni ⋅ Tsui-Wei Weng ⋅ Vivek Narayanaswamy ⋅ Shusen Liu ⋅ Wesam A. Sakla ⋅ Kowshik Thopalli
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 270
C-LaV: Conditional Latent Velocity Field Denoising for Weather-Robust LiDAR Place Recognition
Xuewei Cao ⋅ Jiayue Yang ⋅ Zhiwen Zeng ⋅ Yanyong Zhang ⋅ Yan Xia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 271
Towards Foundation Models for 3D Scene Understanding: Instance-Aware Self-Supervised Learning for Point Clouds
Bin Yang ⋅ Mohamed Abdelsamad ⋅ Miao Zhang ⋅ Alexandru Paul Condurache
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 272
Generalized-CVO: Fast and Correspondence-Free Local Point Cloud Registration with Second Order Riemannian Optimization
Ray (Rui) Zhang ⋅ Carl Greiff ⋅ Thomas Lew ⋅ John Subosits
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 273
LiDeRe: A Lightweight Readout for Fast and Data-Efficient Dense Prediction
Timo Lüddecke ⋅ Jan F. Meier ⋅ Jan van Delden ⋅ Alexander S. Ecker
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 274
AnyPcc: Compressing Any Point Cloud with a Single Universal Model
Kangli Wang ⋅ Qianxi Yi ⋅ Yuqi Ye ⋅ Shihao Li ⋅ Wei Gao
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 275
CoLC: Communication-Efficient Collaborative Perception with LiDAR Completion
Yushan Han ⋅ Hui Zhang ⋅ Qiming Xia ⋅ Yi Jin ⋅ Yidong Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 276
Spectral-Geometric Neural Fields for Pose-Free LiDAR View Synthesis
Yinuo Jiang ⋅ Jun Cheng ⋅ Yiran Wang ⋅ Cheng Cheng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 277
C-GenReg: Training-Free 3D Point Cloud Registration by Multi-View-Consistent Geometry-to-Image Generation with Probabilistic Modalities Fusion
Yuval Haitman ⋅ Amit Efraim ⋅ Joseph M. Francos
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 278
PatchAlign3D: Local Feature Alignment for Dense 3D Shape Understanding
Souhail Hadgi ⋅ Bingchen Gong ⋅ Ramana Sundararaman ⋅ Emery Pierson ⋅ Lei Li ⋅ Peter Wonka ⋅ Maks Ovsjanikov
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 279
FoV-Net: Rotation-Invariant CAD B-rep Learning via Field-of-View Ray Casting
Matteo Ballegeer ⋅ Dries F. Benoit
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 280
Neural Distribution Prior for LiDAR Out-of-Distribution Detection
Zizhao Li ⋅ Zhengkang Xiang ⋅ Jiayang Ao ⋅ Feng Liu ⋅ Joseph West ⋅ Kourosh Khoshelham
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 281
DENALI: A Dataset Enabling Non-Line-of-Sight Spatial Reasoning with Low-Cost LiDARs
Nikhil Behari ⋅ Diego Rivero ⋅ Luke Apostolides ⋅ Suman Ghosh ⋅ Paul Pu Liang ⋅ Ramesh Raskar
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 282
Concept-Aware Batch Sampling Improves Language-Image Pretraining
Adhiraj Ghosh ⋅ Vishaal Udandarao ⋅ Thao Nguyen ⋅ Matteo Farina ⋅ Mehdi Cherti ⋅ Jenia Jitsev ⋅ Sewoong Oh ⋅ Elisa Ricci ⋅ Ludwig Schmidt ⋅ Matthias Bethge
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 283
HiFICL: High-Fidelity In-Context Learning for Multimodal Tasks
Xiaoyu Li ⋅ Yuhang Liu ⋅ xuanshuo kang ⋅ zheng luo ⋅ Fangqi Lou ⋅ 吴晓华 吴晓华 ⋅ Zihan Xiong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 284
InstAP: Instance-Aware Vision-Language Pre-Train for Spatial-Temporal Understanding
Ashutosh Kumar ⋅ Rajat Saini ⋅ Jingjing Pan ⋅ Mustafa Erdogan ⋅ Mingfang Zhang ⋅ Betty Le ⋅ Norimasa Kobori ⋅ Quan Kong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 285
Vocabulary Scaling Law: Tuning Open-vocabulary Predictors for Their Openness
Ziliang Chen ⋅ Yulu Li ⋅ Liangda Fang ⋅ jusheng zhang ⋅ Yongsen Zheng ⋅ Quanlong Guan ⋅ Xipeng Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 286
Render-to-Adapt: Unsupervised Personal Adaptation for Gaze Estimation
Yangshi Ge ⋅ Zheng Liu ⋅ Feng Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 287
ViTPrompt: Training-Free Prompt Refinement with Visual Tokens for Open-Vocabulary Detection
Yitong Qin ⋅ Lihua Zhou ⋅ Jiwei Wei ⋅ Ran Ran ⋅ Shiyuan He ⋅ Zeyu Ma ⋅ Shuaifeng Li ⋅ Nianxin Li ⋅ Heng Tao Shen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 288
Cluster-Aware Neural Collapse Prompt Tuning for Long-Tailed Generalization of Vision-Language Models
Boyang Guo ⋅ Liang Li ⋅ Lin Peng ⋅ Yuhan Gao ⋅ Xichun Sheng ⋅ Chenggang Yan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 289
LLMind: Bio-inspired Training-free Adaptive Visual Representations for Vision-Language Models
Soumyaratna Debnath ⋅ Bui Manh Duc ⋅ Zinan Liu ⋅ Lin Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 290
Dynamic Logits Adjustment and Exploration for Test-Time Adaptation in Vision Language Models
Haoyan Wu ⋅ Yahao Liu ⋅ Yinjie Lei ⋅ Lixin Duan ⋅ Wen Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 291
CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language Misalignment
Maoyuan Shao ⋅ Yutong Gao ⋅ Xinyang Huang ⋅ Lijuan Sun ⋅ Guoshun Nan ⋅ Chuang Zhu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 292
GenMatter: Perceiving Physical Objects with Generative Matter Models
Eric Li ⋅ Arijit Dasgupta ⋅ Yoni Friedman ⋅ Mathieu Huot ⋅ Vikash Mansinghka ⋅ Thomas O'Connell ⋅ William Freeman ⋅ Joshua B. Tenenbaum
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 293
Bidirectional Query-Driven Generation of Parametric CAD Sketch
Yang Liu ⋅ Daxuan Ren ⋅ Yijie Ding ⋅ Jianmin Zheng ⋅ Fang Deng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 294
The Missing GAP: From Solving Square Jigsaw Puzzles to Handling Real World Archaeological Fragments
Ofir Itzhak Shahar ⋅ Gur Elkin ⋅ Ohad Ben-Shahar
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 295
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation
Yiwen Tang ⋅ Ziyu Guo ⋅ Kaixin Zhu ⋅ Ray Zhang ⋅ Qizhi Chen ⋅ Dongzhi Jiang ⋅ Junli Liu ⋅ Bohan Zeng ⋅ Haoming Song ⋅ Delin Qu ⋅ Tianyi Bai ⋅ Dan Xu ⋅ Wentao Zhang ⋅ Bin Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 296
OmniDocLayout: Towards Diverse Document Layout Generation via Coarse-to-Fine LLM Learning
Hengrui Kang ⋅ Zhuangcheng Gu ⋅ Zhiyuan Zhao ⋅ Zichen Wen ⋅ Bin Wang ⋅ Weijia Li ⋅ Conghui He
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 297
Yo'City: Personalized and Boundless 3D Realistic City Scene Generation via Self-Critic Expansion
Keyang Lu ⋅ Sifan Zhou ⋅ Hongbin Xu ⋅ Gang Xu ⋅ Zhifei Yang ⋅ Yikai Wang ⋅ Zhen Xiao ⋅ Jieyi Long ⋅ Ming Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 298
Repurposing 3D Generative Model for Autoregressive Layout Generation
Haoran Feng ⋅ Yifan Niu ⋅ Zehuan Huang ⋅ Yangtian Sun ⋅ Chunchao Guo ⋅ Yuxin Peng ⋅ Lu Sheng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 299
CAD-Refiner: A Unified Framework for CAD Generation and Iterative Editing
Meng Yuan ⋅ Dawei Lin ⋅ Hongxia Xie ⋅ Tieru Wu ⋅ Rui Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 300
A Debiased Reconstruction-based Framework for Training-Free Detection of AI-Generated Images
Sungik Choi ⋅ Hankook Lee ⋅ Jaehoon Lee ⋅ Robin Kim ⋅ Stanley Jungkyu Choi ⋅ Moontae Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 301
Global Information Thresholding for Sufficient and Necessary Circuits
Jegyeong Cho
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 302
PrivateEyes: Gaze-Preserving Anonymization for Data Sharing
Surabhi Gupta ⋅ Dinesh Prabhu Muthumariappan ⋅ Biplab Ch Das ⋅ Anoop Kolar Rajagopal ⋅ Kiran Nanjunda Iyer ⋅ Donghwan Seo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 303
From Measurement to Mitigation: Quantifying and Reducing Identity Leakage in Image Representation Encoders with Linear Subspace Removal
Daniel George ⋅ Charles Yeh ⋅ Daniel Lee ⋅ Yifei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 304
Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models
Ivan Luiz De Moura Matos ⋅ Djalil Sad Saoud ⋅ Ekaterina Iakovleva ⋅ Vito Paolo ⋅ Enzo Tartaglione
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 305
pH-Strips for Selective Forgetting: A Blunt but Fast Diagnostic Baseline for Machine Unlearning
Chengyao Qian ⋅ Jing Wu ⋅ Trung Le ⋅ Dinh Phung ⋅ Mehrtash Harandi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 306
Decoupling Defense Strategies for Robust Image Watermarking
Jiahui Chen ⋅ Zehang Deng ⋅ Zeyu Zhang ⋅ Chaoyang Li ⋅ Lianchen Jia ⋅ Lifeng Sun
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 307
Unsafe2Safe: Controllable Image Anonymization for Downstream Utility
Minh Dinh ⋅ SouYoung Jin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 308
Rel-Zero: Harnessing Patch-Pair Invariance for Robust Zero-Watermarking Against AI Editing
Pengzhen Chen ⋅ Yanwei Liu ⋅ Xiaoyan Gu ⋅ Xiaojun Chen ⋅ Wu Liu ⋅ Weiping Wang
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 309
Computation and Communication Efficient Federated Unlearning via On-server Gradient Conflict Mitigation and Expression
Minh-Duong Nguyen ⋅ Senura Hansaja Wanasekara ⋅ Le-Tuan Nguyen ⋅ Ken-Tye Yong ⋅ Quoc-Viet Pham ⋅ Nguyen H. Tran ⋅ Dung D. Le
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 310
DP-FedAdamW: An Efficient Optimizer for Differentially Private Federated Large Models
Jin Liu ⋅ Ning Xi ⋅ Yinbin Miao ⋅ Junkang Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 311
Submodel Extraction for Efficient and Personalized Federated Learning via Optimal Transport
Zheng Jiang ⋅ Nan He ⋅ Yiming Chen ⋅ Lifeng Sun
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 312
FedSDR: Federated Graph Learning with Structural Noise Detection and Reconstruction
Jiaqi Liu ⋅ Zihan Tan ⋅ Guancheng Wan ⋅ Wenke Huang ⋅ He Li ⋅ Mang Ye
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 313
FedDAP: Domain-Aware Prototype Learning for Federated Learning under Domain Shift
Huy Q. Le ⋅ Loc X. Nguyen ⋅ Yu Qiao ⋅ Seong Tae Kim ⋅ Eui-Nam Huh ⋅ Choong Seon Hong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 314
FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation
Min Tan ⋅ Junchao Ma ⋅ Yinfu FENG ⋅ Jiajun Ding ⋅ Wenwen Pan ⋅ Tingting Han ⋅ Qian Zheng ⋅ Zhenzhong Kuang ⋅ Zhou Yu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 315
VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation
Jihwan Hong ⋅ Jaeyoung Do
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 316
AXG-Reasoner: Error Detection and Explanation in Long Task Videos with Vision–Language Models
Shih-Po Lee ⋅ Ehsan Elhamifar
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 317
Stay in your Lane: Role Specific Queries with Overlap Suppression Loss for Dense Video Captioning
Seung Hyup Baek ⋅ Jimin Lee ⋅ Hyeongkeun Lee ⋅ Jae Won Cho
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 318
T2SGrid: Temporal-to-Spatial Gridification for Video Temporal Grounding
Chaohong Guo ⋅ Yihan He ⋅ Yongwei Nie ⋅ Fei Ma ⋅ Xuemiao Xu ⋅ Chengjiang Long
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 319
HanDyVQA: A Video QA Benchmark for Fine-Grained Hand-Object Interaction Dynamics
Masatoshi Tateno ⋅ Gido Kato ⋅ Hirokatsu Kataoka ⋅ Yoichi Sato ⋅ Takuma Yagi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 320
SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning
Ye-Chan Kim ⋅ SeungJu Cha ⋅ Si-Woo Kim ⋅ minju Jeon ⋅ HyunGee Kim ⋅ Dong-Jin Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 321
Token Warping Helps MLLMs Look from Nearby Viewpoints
Phillip Y. Lee ⋅ Chanho Park ⋅ Mingue Park ⋅ Seungwoo Yoo ⋅ Juil Koo ⋅ Minhyuk Sung
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 322
Variation-aware Vision Token Dropping for Faster Large Vision-Language Models
Chen junjie ⋅ Xuyang Liu ⋅ Zichen Wen ⋅ Yiyu Wang ⋅ Siteng Huang ⋅ Junjie Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 323
Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients
Ziwei Xiang ⋅ Fanhu Zeng ⋅ Hongjian Fang ⋅ Rui-Qi Wang ⋅ Renxing Chen ⋅ Yanan Zhu ⋅ yi chen ⋅ Peipei Yang ⋅ Xu-Yao Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 324
Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding
Yuchen Feng ⋅ Zhenyu Zhang ⋅ Naibin Gu ⋅ Yilong Chen ⋅ Peng Fu ⋅ Zheng Lin ⋅ Shuohuan Wang ⋅ Yu Sun ⋅ Hua Wu ⋅ Weiping Wang ⋅ Haifeng Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 325
IF-Prune: Information-Flow Guided Token Pruning for Efficient Vision-Language Models
Guohao Sun ⋅ Yufei Wang ⋅ Sizhuo Ma ⋅ Yuege Xie ⋅ Yuting Cheng ⋅ ZHIQIANG TAO ⋅ Jian Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 326
EvoComp: Learning Visual Token Compression for Multimodal Large Language Models via Semantic-Guided Evolutionary Labeling
Jiafei Song ⋅ Fengwei Zhou ⋅ Jin Qu ⋅ Wenjin Jason Li ⋅ Tong Wu ⋅ Gengjian Xue ⋅ Zhikang Zhao ⋅ Daomin Wei ⋅ Yichao Lu ⋅ Bailin Na
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 327
DocPrune: Efficient Document Question Answering via Background, Question, and Comprehension-aware Token Pruning
Joonmyung Choi ⋅ Sanghyeok Lee ⋅ Jongha Kim ⋅ Sehyung Kim ⋅ Dohwan Ko ⋅ Jihyung Kil ⋅ Hyunwoo J. Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 328
QuietPrune: Query-Guided Early Token Pruning for Vision-Language Models
Tianxiao Gao ⋅ Shanwei Zhao ⋅ Shuo Fang ⋅ Shiai Zhu ⋅ Chenguang Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 329
The Devil Is in Gradient Entanglement: Energy-Aware Gradient Coordinator for Robust Generalized Category Discovery
Haiyang Zheng ⋅ Nan Pu ⋅ Yaqi Cai ⋅ Teng Long ⋅ Wenjing Li ⋅ Nicu Sebe ⋅ Zhun Zhong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 330
LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis
Ibne Farabi Shihab ⋅ Sanjeda Akter ⋅ Anuj Sharma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 331
Coordinate Denoising for Non‑Equilibrium Molecular Representation Learning
Qianwei Tang ⋅ Baile Xu ⋅ Jian Zhao ⋅ Furao Shen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 332
Plug-and-Play Incomplete Multi-View Clustering via Janus-Faced Affinity Learning with Topology Harmonization
Shengju Yu ⋅ Suyuan Liu ⋅ Wenhao SHAO ⋅ Siwei Wang ⋅ KE LIANG ⋅ Xihong Yang ⋅ Tiejun Li ⋅ Xinwang Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 333
Meta-Learning In-Context Enables Training-Free Cross Subject Brain Decoding
Mu Nan ⋅ Muquan Yu ⋅ Weijian Mai ⋅ Jacob S. Prince ⋅ Hossein Adeli ⋅ Rui Zhang ⋅ Jiahang Cao ⋅ Benjamin Becker ⋅ John S. Pyles ⋅ Margaret M. Henderson ⋅ Chunfeng Song ⋅ Nikolaus Kriegeskorte ⋅ Michael J. Tarr ⋅ Xiaoqing Hu ⋅ Andrew F. Luo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 334
Measure The Feature Universe: Topology-based Pseudo Labeling and Gravity Consistency for Source-Free Domain Adaptation
Jae Yun Lee ⋅ Hyeok Nam ⋅ Sung In Cho
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 335
Conditional Factuality Controlled LLMs with Generalization Certificates via Conformal Sampling
Kai Ye ⋅ Qingtao Pan ⋅ Shuo Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 336
Harnessing the Power of Foundation Models for Accurate Material Classification
QINGRAN LIN ⋅ Fengwei Yang ⋅ Chaolun Zhu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 337
Content-Aware Frequency Encoding for Implicit Neural Representations with Fourier-Chebyshev Features
Junbo Ke ⋅ Yangyang Xu ⋅ Chao Wang ⋅ You-Wei Wen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 338
ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving
Han Lu ⋅ Xiaosong Jia ⋅ Yichen Xie ⋅ Siyu Sun ⋅ Wenlong Liao ⋅ Xiaokang Yang ⋅ Junchi Yan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 339
TeFlow: Enabling Multi-frame Supervision for Self-Supervised Feed-forward Scene Flow Estimation
Qingwen Zhang ⋅ Chenhan Jiang ⋅ Xiaomeng Zhu ⋅ Yunqi Miao ⋅ Yushan Zhang ⋅ Olov Andersson ⋅ Patric Jensfelt
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 340
Think Before You Drive: World Model-Inspired Multimodal Grounding
Haicheng Liao ⋅ Huanming Shen ⋅ Bonan Wang ⋅ yong kang li ⋅ Yihong Tang ⋅ Chengyue Wang ⋅ Dingyi Zhuang ⋅ Kehua Chen ⋅ HAI YANG ⋅ Chengzhong Xu ⋅ Zhenning Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 341
DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Zhe Liu ⋅ Runhui Huang ⋅ Rui Yang ⋅ Siming Yan ⋅ Zining Wang ⋅ Lu Hou ⋅ Di Lin ⋅ Xiang Bai ⋅ Hengshuang Zhao
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 342
DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation
Zhechao Wang ⋅ Yiming Zeng ⋅ Lufan Ma ⋅ Zeqing Fu ⋅ Chen Bai ⋅ Dongshuo Yin ⋅ Ziyao Lin ⋅ Cheng Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 343
WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios
Runsheng Xu ⋅ Hubert Lin ⋅ Wonseok Jeon ⋅ Hao Feng ⋅ Yuliang Zou ⋅ Liting Sun ⋅ John Gorman ⋅ Kate Tolstaya ⋅ Sarah Tang ⋅ Brandyn White ⋅ Ben Sapp ⋅ Mingxing Tan ⋅ Jyh-Jing Hwang ⋅ Dragomir Anguelov
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 344
GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving
Lin Liu ⋅ Caiyan Jia ⋅ Guanyi Yu ⋅ Ziying Song ⋅ Junqiao Li ⋅ Feiyang Jia ⋅ Peiliang Wu ⋅ Xiaoshuai Hao ⋅ Yadan Luo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 345
ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving
Zhiyu Zheng ⋅ Shaoyu Chen ⋅ haoran yin ⋅ xinbang zhang ⋅ Jialv Zou ⋅ Xinggang Wang ⋅ Qian Zhang ⋅ Lefei Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 346
KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System
Zhongyu Xia ⋅ Wenhao Chen ⋅ Yongtao Wang ⋅ Ming-Hsuan Yang
[ Slides
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 347
FoSS: Modeling Long-Range Dependencies and Multimodal Uncertainty in Trajectory Prediction via Fourier–State Space Integration
Yizhou Huang ⋅ Genze Jiang ⋅ Yihua Cheng ⋅ Kezhi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 348
NexusFlow: Unifying Disparate Tasks under Partial Supervision via Invertible Flow Networks
Fangzhou Lin ⋅ Yuping Wang ⋅ Yuliang Guo ⋅ Zixun Huang ⋅ Xinyu Huang ⋅ Haichong Zhang ⋅ Kazunori Yamada ⋅ Zhengzhong Tu ⋅ Liu Ren ⋅ Ziming Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 349
Visual Prototype Conditioned Focal Region Generation for UAV-Based Object Detection
Wenhao Li ⋅ Zimeng Wu ⋅ Yu Wu ⋅ Zehua Fu ⋅ Jiaxin Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 350
Consistent Instance Field for Dynamic Scene Understanding
Junyi Wu ⋅ Van Nguyen Nguyen ⋅ Benjamin Planche ⋅ Jiachen Tao ⋅ Changchang Sun ⋅ Zhongpai Gao ⋅ Zhenghao Zhao ⋅ Anwesa Choudhuri ⋅ Gengyu Zhang ⋅ Meng Zheng ⋅ Feiran Wang ⋅ Terrence Chen ⋅ Yan Yan ⋅ Ziyan Wu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 351
CLP: A Real-World Dataset of Contaminated Lens Protectors for Robust Semantic Segmentation
Sungyong Park ⋅ Sooyoung Choi ⋅ Hyunseo Koh ⋅ Youngjae Choi ⋅ Heewon Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 352
ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images
Muhammad Naseer Subhani
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 353
Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions
Shiqin Wang ⋅ Haoyang Chen ⋅ Huaizhou Huang ⋅ Yinkan He ⋅ Dongfang Sun ⋅ Xiaoqing Chen ⋅ Xingyu Liu ⋅ Zheng Wang ⋅ Kaiyan Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 354
SAM2Text: Towards Prompt-Free and Multi-Resolution Video Scene Text Segmentation
Jing-Yao Zhang ⋅ Heng Zhang ⋅ Mingsen Zhang ⋅ Binbin Yang ⋅ Fei Yin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 355
Reinforcing Video Reasoning Segmentation to Think Before It Segments
Sitong Gong ⋅ Yunzhi Zhuge ⋅ Lu Zhang ⋅ Jiazuo Yu ⋅ Pingping Zhang ⋅ Xu Jia ⋅ Huchuan Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 356
VideoMaMa: Mask-Guided Video Matting via Generative Prior
Sangbeom Lim ⋅ Seoung Wug Oh ⋅ Gabriel Huang ⋅ Heeji Yoon ⋅ Seungryong Kim ⋅ Joon-Young Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 357
Quantized Residuals to Continuous Prompts for Few-Shot Class Incremental Learning in Vision-Language Models
Abhishek Kumar Sinha ⋅ Nitant Dube ⋅ Soma Biswas
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 358
The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation
Guannan Lai ⋅ Da-Wei Zhou ⋅ Zhenguo Li ⋅ Han-Jia Ye
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 359
SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning
Yongkang Hu ⋅ Yu Cheng ⋅ YuShuo Zhang ⋅ Yuan Xie ⋅ Zhaoxia Yin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 360
Is Parameter Isolation Better for Prompt-Based Continual Learning?
Jiangyang Li ⋅ Chenhao Ding ⋅ SongLin Dong ⋅ Qiang Wang ⋅ Jianchao Zhao ⋅ Yuhang He ⋅ Yihong Gong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 361
Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models
Yuehao Liu ⋅ Shanyan Guan ⋅ Weijia Zhang ⋅ Xuanming Shang ⋅ Yanhao Ge ⋅ Wei Li ⋅ Chao Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 362
Affordance-First Decomposition for Continual Learning in Video–Language Understanding
Mengzhu xu ⋅ Hanzhi Liu ⋅ Ningkang Peng ⋅ qianyu Chen ⋅ Canran Xiao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 363
Quantum-Gated Task-interaction Knowledge Distillation for Pre-trained Model-based Class-Incremental Learning
Linjie Li ⋅ HUIYU XIAO ⋅ Jiarui Cao ⋅ Zhenyu Wu ⋅ Yang Ji
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 364
Elastic Weight Consolidation Done Right for Continual Learning
Xuan Liu ⋅ Xiaobin Chang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 365
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models
Chongyang Zhao ⋅ Mingsong Li ⋅ Haodong Lu ⋅ Dong Gong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 366
Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
Jiangning Zhang ⋅ junwei zhu ⋅ Zhenye Gan ⋅ Donghao Luo ⋅ Chuming Lin ⋅ FeiFan Xu ⋅ Xu Peng ⋅ Jianlong Hu ⋅ Yuansen Liu ⋅ Yijia Hong ⋅ Weijian Cao ⋅ Han Feng ⋅ Xu Chen ⋅ Chencan Fu ⋅ Keke He ⋅ Xiaobin Hu ⋅ Chengjie Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 367
Talking Together: Synthesizing Co-Located 3D Conversations from Audio
Mengyi Shan ⋅ Shouchieh Chang ⋅ Ziqian Bai ⋅ Shichen Liu ⋅ Yinda Zhang ⋅ Luchuan Song ⋅ Rohit Pandey ⋅ Sean Fanello ⋅ Zeng Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 368
InfinityHuman: Towards Long-Term Audio-Driven Human Animation
Xiaodi Li ⋅ Pan Xie ⋅ Yi Ren ⋅ Qijun Gan ⋅ Chen Zhang ⋅ Fangyuan Kong ⋅ Xiang Yin ⋅ Zehuan Yuan ⋅ BINGYUE PENG
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 369
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision
Hyunsoo Cha ⋅ Wonjung Woo ⋅ Byungjun Kim ⋅ Hanbyul Joo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 370
AudioAvatar: Personalized Audio-driven Whole-body Talking Avatars
Seungeun Lee ⋅ SeungJun Moon ⋅ Hah Min Lew ⋅ Ji-Su Kang ⋅ Gyeong-Moon Park
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 371
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
Shijun Shi ⋅ Jing Xu ⋅ Zhihang Li ⋅ Chunli Peng ⋅ Xiaoda Yang ⋅ Lijing Lu ⋅ Kai Hu ⋅ Jiangning Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 372
Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning
Zhenghao Peng ⋅ Wenhao Ding ⋅ Yurong You ⋅ Yuxiao Chen ⋅ Wenjie Luo ⋅ Thomas Tian ⋅ Yulong Cao ⋅ Apoorva Sharma ⋅ Danfei Xu ⋅ Boris Ivanovic ⋅ Boyi Li ⋅ Yan Wang ⋅ Marco Pavone
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 373
SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
jingyu li ⋅ Junjie Wu ⋅ Dongnan Hu ⋅ Xiangkai Huang ⋅ Bin Sun ⋅ Zhihui Hao ⋅ XianPeng Lang ⋅ Xiatian Zhu ⋅ Li Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 374
CapNav: Benchmarking Vision Language Models on Capability-conditioned Indoor Navigation
Xia Su ⋅ Ruiqi Chen ⋅ Benlin Liu ⋅ Jingwei Ma ⋅ Zonglin Di ⋅ Ranjay Krishna ⋅ Jon Froehlich
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 375
AutoTraces: Autoregressive Trajectory Forecasting via Multimodal Large Language Models
Teng Wang ⋅ Yanting Lu ⋅ Ruize Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 376
AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
Wenxuan Guo ⋅ Xiuwei Xu ⋅ Yichen Liu ⋅ Xiangyu Li ⋅ Hang Yin ⋅ Huangxing Chen ⋅ Wenzhao Zheng ⋅ Jianjiang Feng ⋅ Jie Zhou ⋅ Jiwen Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 377
Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation
Shuo Wang ⋅ Yucheng Wang ⋅ Guoxin Lian ⋅ Yongcai Wang ⋅ Maiyue Chen ⋅ Kaihui Wang ⋅ Bo Zhang ⋅ Zhizhong Su ⋅ Yutian Zhou ⋅ Wanting Li ⋅ Deying Li ⋅ Zhaoxin Fan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 378
Tavatar: Topology-Aware Gaussian Attribute Derivation for Animatable Human Avatars
Hailin Luo ⋅ Yifan Yang ⋅ Jiazhi Shu ⋅ Zixiong Huang ⋅ Qi Chen ⋅ Qing Du ⋅ Mingkui Tan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 379
PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing
Antonio Oroz ⋅ Matthias Nießner ⋅ Tobias Kirschstein
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 380
PhysHead: Simulation-Ready Gaussian Head Avatars
Berna Kabadayi ⋅ Vanessa Sklyarova ⋅ Wojciech Zielonka ⋅ Justus Thies ⋅ Gerard Pons-Moll
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 381
ReWeaver: Towards Simulation-Ready and Topology-Accurate Garment Reconstruction
Ming Li ⋅ Hui Shan ⋅ Kai Zheng ⋅ Chentao Shen ⋅ Siyu Liu ⋅ Yanwei Fu ⋅ Zhen Chen ⋅ Xiangru Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 382
FHAvatar: Fast and High-Fidelity Reconstruction of Face-and-Hair Composable 3D Head Avatar from Few Casual Captures
Yujie Sun ⋅ Zhuoqiang CAI ⋅ Chaoyue Niu ⋅ Jianchuan Chen ⋅ Zhiwen Chen ⋅ Chengfei Lv ⋅ Fan Wu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 383
Feed-Forward One-Shot Animatable Textured Mesh Avatar Reconstruction
Yisheng He
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 384
Reallocating Attention Across Layers to Reduce Multimodal Hallucination
Haolang Lu ⋅ Bolun Chu ⋅ WeiYe Fu ⋅ Guoshun Nan ⋅ Junning Liu ⋅ Minghui Pan ⋅ Qiankun Li ⋅ Yi Yu ⋅ Hua Wang ⋅ Kun Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 385
VES-RFT: Rewarding Visual Evidence Sensitivity to Mitigate Hallucinations in Large Vision–Language Models
XUEGE HOU ⋅ Wenshuo Li ⋅ Yali Li ⋅ Han Shu ⋅ Yuan Wang ⋅ Xinghao Chen ⋅ Shengjin Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 386
Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression
Hamidreza Dastmalchi ⋅ Aijun An ⋅ Ali Cheraghian ⋅ Hamed Barzamini
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 387
Unstitching the Chimera: Frame-Level Risk and Train-Free Mitigation for Video Hallucination
Songyuan Yang ⋅ Guijian Tang ⋅ Kun Hu ⋅ Haotian Wang ⋅ Shixuan Liu ⋅ Wenjing Yang ⋅ Long Lan ⋅ Huibin Tan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 388
CausalLens: Sensitivity-Guided Multi-Head Causal Intervention for Hallucination Mitigation in Large Vision-Language Models
Junyang Ji ⋅ Qifan Liu ⋅ Wenming Yang ⋅ Zhihai He
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 389
Breaking the Illusion: When Positive Meets Negative in Multimodal Decoding
Yubo Jiang ⋅ Yitong An ⋅ Xin Yang ⋅ Abudukelimu Wuerkaixi ⋅ Xuxin Cheng ⋅ Fengying Xie ⋅ Zhiguo Jiang ⋅ Cao Liu ⋅ Ke Zeng ⋅ Haopeng Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 390
FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control
Zhiyuan Zhang ⋅ Can Wang ⋅ Dongdong Chen ⋅ Jing Liao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 391
Diff4Splat: Repurposing Video Diffusion Models for Dynamic Scene Generation
Panwang Pan ⋅ Chenguo Lin ⋅ Chenxin Li ⋅ Jingjing Zhao ⋅ Yuchen Lin ⋅ Haopeng Li ⋅ yunlong lin ⋅ Kairun Wen ⋅ Yixuan Yuan ⋅ Yadong Mu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 392
Spatia: Video Generation with Updatable Spatial Memory
Jinjing Zhao ⋅ Fangyun Wei ⋅ Zhening Liu ⋅ Hongyang Zhang ⋅ Chang Xu ⋅ Yan Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 393
Geometry-as-context: Modulating Explicit 3D in Scene-consistent Video Generation to Geometry Context
JiaKui Hu ⋅ Jialun Liu ⋅ Liying Yang ⋅ Xinliang Zhang ⋅ Kaiwen Li ⋅ Shuang Zeng ⋅ Yuanwei Li ⋅ Haibin Huang ⋅ Chi Zhang ⋅ Yanye Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 394
EgoControl: Controllable Egocentric Video Generation via 3D Full-Body Poses
Enrico Pallotta ⋅ Sina Mokhtarzadeh Azar ⋅ Lars Doorenbos ⋅ Serdar Ozsoy ⋅ Umar Iqbal ⋅ Jürgen Gall
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 395
CustomTex: High-fidelity Indoor Scene Texturing via Multi-Reference Customization
Weilin Chen ⋅ Jiahao Rao ⋅ Wenhao Wang ⋅ Xinyang Li ⋅ Xuan Cheng ⋅ Liujuan Cao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 396
FoleyDesigner: Immersive Stereo Foley Generation with Precise Spatio-Temporal Alignment for Film Clips
Mengtian Li ⋅ Kunyan Dai ⋅ Yi Ding ⋅ Ruobing Ni ⋅ Ying Zhang ⋅ Wenwu Wang ⋅ Zhifeng Xie
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 397
Physical Simulator In-the-Loop Video Generation
Lin Geng Foo ⋅ Mark He Huang ⋅ Alexandros Lattas ⋅ Stylianos Moschoglou ⋅ Thabo Beeler ⋅ Christian Theobalt
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 398
Refracting Reality: Generating Images with Realistic Transparent Objects
Yue Yin ⋅ Enze Tao ⋅ Dylan Campbell
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 399
Generating Humanless Environment Walkthroughs from Egocentric Walking Tour Videos
Yujin Ham ⋅ Junho Kim ⋅ Vivek Boominathan ⋅ Guha Balakrishnan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 400
EgoFlow: Gradient-Guided Flow Matching for Egocentric 6DoF Object Motion Generation
Abhishek Saroha ⋅ Huajian Zeng ⋅ Xingxing Zuo ⋅ Daniel Cremers ⋅ Xi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 401
Spatial-Frequency Collaborative Learning for Occluded Visible-Infrared Person Re-Identification
JIan Yu ⋅ Yujian Feng ⋅ Shuai You ⋅ Zhongkai Zhou ⋅ Fei Wu ⋅ Zhengjun Jing ⋅ Yimu Ji
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 402
Mind the Gap: Transferring Labels to Align Object Detection Datasets
Mikhail Kennerley ⋅ Angelica I Aviles-Rivero ⋅ Carola-Bibiane Schönlieb ⋅ Robby T. Tan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 403
SSM-Aware Token-Efficient VMamba via Adaptive Patch Pruning and Merging for Person Re-Identification
Huiyuan Huang ⋅ SANG MIN YOON
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 404
Tri-Modal Fusion Transformers for UAV-based Object Detection
Craig Iaboni ⋅ Pramod Abichandani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 405
View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification
Quan Zhang ⋅ Zeqiang Cai ⋅ Peiming Zhao ⋅ Jingze Wu ⋅ Cailun Wu ⋅ Hongbo Chen ⋅ Jianhuang Lai
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 406
RHCNet: Residual-Guided Hierarchical Calibration Network for Robust Underwater Object Detection
Yueying Wang ⋅ Yiteng Guo ⋅ Weidong Zhang ⋅ Jie Wen ⋅ Liquan Shen ⋅ Huaicheng Yan ⋅ Xin Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 407
X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection
Youngseo Kim ⋅ Kwan Yun ⋅ Seokhyeon Hong ⋅ Sihun Cha ⋅ Colette Suhjung Koo ⋅ Junyong Noh
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 408
Beyond Duality: A Hybrid Framework of Leveraging Shared and Private Features for RGB-Event Object Detection
Keyao Wang ⋅ Shuai Liu ⋅ Hengda Shi ⋅ Lukui Shi ⋅ Haiyong Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 409
FVBench: Benchmarking Deepfake Video Detection Capability of Large Multimodal Models
Wang Jiarui ⋅ Huiyu Duan ⋅ Juntong Wang ⋅ Xiongkuo Min
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 410
AKCMamba-YOLO: Selective State Space Models For Real-Time Object Detection
Long Chen ⋅ Hui Wang ⋅ Man Xu ⋅ Zexuan Li ⋅ Zizhu Fan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 411
When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse
Yihuan Huang ⋅ Jun Xue ⋅ Liu Jiajun ⋅ Daixian Li ⋅ Tong Zhang ⋅ Zhuolin Yi ⋅ Yanzhen Ren ⋅ Kai Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 412
Your One-Stop Solution for AI-Generated Video Detection
Long Ma ⋅ Zihao Xue ⋅ Yan Wang ⋅ Zhiyuan Yan ⋅ Jin Xu ⋅ Xiaorui Jiang ⋅ Haiyang Yu ⋅ Yong Liao ⋅ Zhen Bi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 413
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
Jiehui Huang ⋅ Yuechen Zhang ⋅ Xu He ⋅ Yuan Gao ⋅ Zhi Cen ⋅ Bin Xia ⋅ Yan Zhou ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Jiaya Jia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 414
Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning
Yifei Li ⋅ Wenzhao Zheng ⋅ Yanran Zhang ⋅ Runze Sun ⋅ Yu Zheng ⋅ Lei Chen ⋅ Jie Zhou ⋅ Jiwen Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 415
HumanVBench: Probing Human-Centric Video Understanding in MLLMs with Automatically Synthesized Benchmarks
Ting Zhou ⋅ Daoyuan Chen ⋅ Qirui Jiao ⋅ Bolin Ding ⋅ Yaliang Li ⋅ Ying Shen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 416
HERBench: A Benchmark for Multi-Evidence Integration in Video Question Answering
Dan Ben Ami ⋅ Gabriele Serussi ⋅ Kobi Cohen ⋅ Chaim Baskin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 417
Seeing the Scene Matters: Revealing Forgetting in Video Understanding Models with a Scene-Aware Long-Video Benchmark
Seng Nam Chen ⋅ Hao Chen ⋅ Chenglam Ho ⋅ Xinyu Mao ⋅ Jinping Wang ⋅ Yu Zhang ⋅ Chao Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 418
Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model
Yuan Wang ⋅ Borui Liao ⋅ Huijuan Huang ⋅ Jinda Lu ⋅ Ouxiang Li ⋅ Kuien Liu ⋅ Meng Wang ⋅ Xiang Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 419
MovieRecapsQA: A Multimodal Open-Ended Video Question-Answering Benchmark
Shaden Shaar ⋅ Bradon Thymes ⋅ Sirawut Chaixanien ⋅ Claire Cardie ⋅ Bharath Hariharan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 420
Training-free, Perceptually Consistent Low-Resolution Previews with High-Resolution Image for Efficient Workflows of Diffusion Models
Wongi Jeong ⋅ Hoigi Seo ⋅ Se Young Chun
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 421
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers
Moayed Haji Ali ⋅ Willi Menapace ⋅ Ivan Skorokhodov ⋅ Dogyun Park ⋅ Anil Kag ⋅ Michael Vasilkovsky ⋅ Sergey Tulyakov ⋅ Vicente Ordonez ⋅ Aliaksandr Siarohin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 422
Reflection Separation from a Single Image via Joint Latent Diffusion
Zheng-Hui Huang ⋅ Zhixiang Wang ⋅ Yu-Lun Liu ⋅ Yung-Yu Chuang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 423
MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation
Bharath Krishnamurthy ⋅ Ajita Rattani
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 424
DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
Chang Zou ⋅ Changlin Li ⋅ Songtao Liu ⋅ Zhao Zhong ⋅ Kailin Huang ⋅ Linfeng Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 425
MatLat: Material Latent Space for PBR Texture Generation
Kyeongmin Yeo ⋅ Yunhong Min ⋅ Jaihoon Kim ⋅ Minhyuk Sung
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 426
VMonarch: Efficient Video Diffusion Transformers with Structured Attention
Cheng Liang ⋅ Haoxian Chen ⋅ Liang Hou ⋅ Qi Fan ⋅ Gangshan Wu ⋅ Xin Tao ⋅ Limin Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 427
DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers
Zitong Wang ⋅ Hang Zhao ⋅ Qianyu Zhou ⋅ Xuequan Lu ⋅ Xiangtai Li ⋅ Hao Yang ⋅ Bo Yang ⋅ Yiren Song
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 428
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration
Danil Tokhchukov ⋅ Aysel Mirzoeva ⋅ Andrey Kuznetsov ⋅ Konstantin Sobolev
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 429
Transition Matching Distillation for Fast Video Generation
Weili Nie ⋅ Julius Berner ⋅ Nanye Ma ⋅ Chao Liu ⋅ Saining Xie ⋅ Arash Vahdat
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 430
Diffusion-Based Makeup Transfer with Facial Region-Aware Makeup Features
Zheng Gao ⋅ Debin Meng ⋅ Yunqi Miao ⋅ Zhensong Zhang ⋅ Songcen Xu ⋅ Ioannis Patras ⋅ Jifei Song
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 431
UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair
Chuanrui Zhang ⋅ Yingshuang Zou ⋅ ZhengXian Wu ⋅ Yonggen Ling ⋅ Yuxiao Yang ⋅ Ziwei Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 432
Query2Uncertainty: Robust Uncertainty Quantification and Calibration for 3D Object Detection under Distribution Shift
Till Beemelmanns ⋅ Alexey Nekrasov ⋅ Stefan Vilceanu ⋅ Jonas Steinhaus ⋅ Timo Woopen ⋅ Bastian Leibe ⋅ Lutz Eckstein
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 433
DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces
Li Zhang ⋅ Mingyu Mei ⋅ Ailing Wang ⋅ Xianhui Meng ⋅ Yan Zhong ⋅ Xinyuan Song ⋅ Liu Liu ⋅ Rujing Wang ⋅ Zaixing He ⋅ Cewu Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 434
PoseGaussian: 6D Pose Estimation for Unseen Objects via Sparse-View Object-Level 3D Gaussian Splatting
Wubin Shi ⋅ Shaoyan Gai ⋅ Feipeng Da
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 435
VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection
Yang Cao ⋅ Feize Wu ⋅ Dave Chen ⋅ Yingji Zhong ⋅ Lanqing Hong ⋅ Dan Xu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 436
MonoSAOD: Monocular 3D Object Detection with Sparsely Annotated Label
Junyoung Jung ⋅ Seokwon Kim ⋅ Jung Uk Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 437
V2U4Real: A Real-world Large-scale Dataset for Vehicle-to-UAV Cooperative Perception
Weijia Li ⋅ Haoen Xiang ⋅ Tianxu Wang ⋅ Shuaibing Wu ⋅ Qiming Xia ⋅ Cheng Wang ⋅ Chenglu Wen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 438
SketchVL: Policy Optimization via Fine-Grained Credit Assignment for Chart Understanding and More
Muye Huang ⋅ Lingling Zhang ⋅ Yifei Li ⋅ Yaqiang Wu ⋅ Jun Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 439
A Causal Marriage between VLM and IRM from Understanding to Reasoning
Ziliang Chen ⋅ Tianang Xiao ⋅ jusheng zhang ⋅ Yongsen Zheng ⋅ Yang Liu ⋅ Zhao-Rong Lai ⋅ Liang Lin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 440
Why Does RL Generalize Better Than SFT? A Data-Centric Perspective on VLM Post-Training
Aojun Lu ⋅ Tao Feng ⋅ Hangjie Yuan ⋅ Wei Li ⋅ Yanan Sun
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 441
SoC: Semantic Orthogonal Calibration for Test-Time Prompt Tuning
Leo Fillioux ⋅ Omprakash Chakraborty ⋅ Ismail Ben Ayed ⋅ Paul-Henry Cournède ⋅ Stergios Christodoulidis ⋅ Maria Vakalopoulou ⋅ Jose Dolz
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 442
Learning to Select Visual Tools from Experience
Zeyi Huang ⋅ Yuyang Ji ⋅ Anirudh Sundara Rajan ⋅ Zefan Cai ⋅ Wen Xiao ⋅ Haohan Wang ⋅ Junjie Hu ⋅ Yong Jae Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 443
Agile Deliberation: Concept Deliberation for Subjective Visual Classification
Leijie Wang ⋅ Otilia Stretcu ⋅ Wei Qiao ⋅ Thomas Denby ⋅ Krishnamurthy Viswanathan ⋅ Enming Luo ⋅ Chun-Ta Lu ⋅ Tushar Dogra ⋅ Ranjay Krishna ⋅ Ariel Fuxman
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 444
Tea-Adapter: Teacher Adapter for Efficient Conditional Generation
Yinhan Zhang ⋅ Yue Ma ⋅ Fangqiu Yi ⋅ Chenyang Qi ⋅ Chi Zhang ⋅ Kunyu Feng ⋅ Zeyu Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 445
From Failure to Feedback: Group Revision Unlocks Hard Cases in Object-Level Grounding
Yuyuan Liu ⋅ Yiping Ji ⋅ Anjie Le ⋅ Jiayuan Zhu ⋅ Jiazhen Pan ⋅ Can Peng ⋅ Jiajun Deng ⋅ Fengbei Liu ⋅ Junde Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 446
Perception Characteristics Distance: Measuring Stability and Robustness of Perception System in Dynamic Conditions under a Certain Decision Rule
Boyu Jiang ⋅ Liang Shi ⋅ Zhengzhi Lin ⋅ Lanxin Xiang ⋅ Loren Stowe ⋅ Feng Guo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 447
FinPercep-RM: A Fine-grained Reward Model and Co-evolutionary Curriculum for RL-based Real-world Super-Resolution
Yidi Liu ⋅ Zihao Fan ⋅ Jie Huang ⋅ Jie Xiao ⋅ Dong Li ⋅ Wenlong Zhang ⋅ Lei Bai ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 448
Twin-T & TwintVQA: A Reliable Structure–Detail Separating VLM and a Comprehensive Benchmark for Chart and Table Tasks
Jiahua Bao ⋅ Siyao Cheng ⋅ Jiaxing Du ⋅ Qingtao Xia ⋅ Changjiang He ⋅ Zeming Lang ⋅ Jie Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 449
SDGS: Spatial Difference Guided Gaussian Splatting for Simultaneous Localization and 3D Reconstruction
Yijian Tian ⋅ Mingtao Ou ⋅ Pan Zijian ⋅ Xinglong Ji
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 450
RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting
Ji Shi ⋅ Xianghua Ying ⋅ Bowei Xing ⋅ Ruohao Guo ⋅ Wenzhen Yue
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 451
Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors
Chuanqing Zhuang ⋅ Xin Lu ⋅ Zehui Deng ⋅ Zhengda Lu ⋅ Yiqun Wang ⋅ Junqi Diao ⋅ Jun Xiao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 452
Distilling Unsigned Distance Function for Surface Reconstruction from 3D Gaussian Splatting
Qian Li ⋅ Rao Fu ⋅ Jiangtao Li ⋅ Fan Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 453
Exact-GS: Mathematically Rigorous and Accurate 3D Gaussian Splatting for 3D X-ray Reconstruction
Guangpu Yang ⋅ Steffen Kieß ⋅ Hanxiang Luo ⋅ Xingyu Liu ⋅ Sven Simon
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 454
DualSplat: Robust 3D Gaussian Splatting via Pseudo-Mask Bootstrapping from Reconstruction Failures
Xu Wang ⋅ Zhiru Wang ⋅ Shiyun Xie ⋅ Chengwei Pan ⋅ Yisong Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 455
E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction
Yunsoo Kim ⋅ Changki Sung ⋅ Dasol Hong ⋅ Hyun Myung
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 456
Neural Gabor Splatting: Enhanced Gaussian Splatting with Neural Gabor for High-frequency Surface Reconstruction
Haato Watanabe ⋅ Nobuyuki Umetani
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 457
DirectFisheye-GS: Enabling Native Fisheye Input in Gaussian Splatting with Cross-View Joint Optimization
Zhengxian Yang ⋅ Fei Xie ⋅ Xutao Xue ⋅ Rui Zhang ⋅ Taicheng Huang ⋅ Yang Liu ⋅ Mengqi Ji ⋅ Tao Yu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 458
VAD-GS: Visibility-Aware Densification for 3D Gaussian Splatting in Dynamic Urban Scenes
Yikang Zhang ⋅ Rui Fan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 459
GauMVC: Generative Decoupled Gaussian Representation for Human-centric Multi-view Video Compression
Ruoke Yan ⋅ Mingjia Yang ⋅ Xinfeng Zhang ⋅ Haocheng Tang ⋅ Qian Yin ⋅ Zhipin Deng ⋅ Kai Zhang ⋅ Li zhang ⋅ Siwei Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 460
A Geometric Algebra-Informed 3DGS Framework for Wireless Channel Prediction
Jingzhou Shen ⋅ Tianya Zhao ⋅ Xuyu Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 461
RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cue for 3D Object Detection
Xiaokai Bai ⋅ Chenxu Zhou ⋅ Lianqing Zheng ⋅ Jianan Liu ⋅ Siyuan Cao ⋅ Xiaohan Zhang ⋅ Yiming Li ⋅ Zhengzhuang Zhang ⋅ Hui-Liang Shen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 462
Cross-Instance Gaussian Splatting Registration via Geometry-Aware Feature-Guided Alignment
Roy Amoyal ⋅ Oren Freifeld ⋅ Chaim Baskin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 463
ActivePolicy: Active Gaussian Reconstruction and Optimization Strategy Based on Global-Local Information Gain
Yingzhao Li ⋅ Yanjie Liu ⋅ lijun zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 464
Uncertainty-driven 3D Gaussian Splatting Active Mapping via Anisotropic Visibility Field
Shangjie Xue ⋅ Jesse Dill ⋅ Dhruv Ahuja ⋅ Frank Dellaert ⋅ Panagiotis Tsiotras ⋅ Danfei Xu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 465
SV-GS: Sparse View 4D Reconstruction with Skeleton-Driven Gaussian Splatting
Jun-Jee Chao ⋅ Volkan Isler
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 466
NimbusGS: Unified 3D Scene Reconstruction under Hybrid Weather
Yanying Li ⋅ Jinyang Li ⋅ Shengfeng He ⋅ Yangyang Xu ⋅ Junyu Dong ⋅ Yong Du
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 467
SparseSplat: Towards Applicable Feed-Forward 3D Gaussian Splatting with Pixel-Unaligned Prediction
Zicheng Zhang ⋅ Xiangting Meng ⋅ Ke Wu ⋅ Wenchao Ding
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 468
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
Jiaze Li ⋅ Hao Yin ⋅ Wenhui Tan ⋅ Jingyang Chen ⋅ Boshen Xu ⋅ Yuxun Qu ⋅ Yijing Chen ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Jian Luan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 469
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
Chi-Pin Huang ⋅ Yunze Man ⋅ Zhiding Yu ⋅ Min-Hung Chen ⋅ Jan Kautz ⋅ Yu-Chiang Frank Wang ⋅ Fu-En Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 470
Unlocking Token Rewards via Training-Free Reward Attribution
WU Sitong ⋅ Haoru Tan ⋅ Bin Xia ⋅ Xichen Zhang ⋅ Jingyao Li ⋅ Shaofeng Zhang ⋅ Xiaojuan Qi ⋅ Bei Yu ⋅ Jiaya Jia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 471
MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images
Ankan Deria ⋅ Komal Kumar ⋅ Adinath Madhavrao Dukre ⋅ Eran Segal ⋅ Salman Khan ⋅ Imran Razzak
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 472
When to Think and When to Look: Uncertainty-Guided Lookback
Jing Bi ⋅ Filippos Bellos ⋅ JunJia Guo ⋅ Yayuan Li ⋅ Chao Huang ⋅ Yolo Yunlong Tang ⋅ Luchuan Song ⋅ Susan Liang ⋅ Zhongfei Zhang ⋅ Jason J. Corso ⋅ Chenliang Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 473
StaR-KVQA: Structured Reasoning Traces for Implicit-Knowledge Visual Question Answering
Zhihao Wen ⋅ Wenkang Wei ⋅ Yuan Fang ⋅ Xingtong Yu ⋅ hui zhang ⋅ Weicheng Zhu ⋅ Xin Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 474
Understanding Counting Mechanisms in Large Language and Vision-Language Models
Hosein Hasani ⋅ Amirmohammad Izadi ⋅ Fatemeh Askari ⋅ Mobin Bagherian ⋅ Sadegh Mohammadian ⋅ Mohammad Izadi ⋅ Mahdieh Baghshah
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 475
CLiViS: Unleashing Cognitive Map through Linguistic-Visual Synergy for Embodied Visual Reasoning
Kailing Li ⋅ Qi'ao Xu ⋅ Tianwen Qian ⋅ Yuqian Fu ⋅ Yang Jiao ⋅ Xiaoling Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 476
Proof-of-Perception: Certified Tool-Using Multimodal Reasoning with Compositional Conformal Guarantees
Arya Fayyazi ⋅ Haleh Akrami
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 477
Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models
Keuntae Kim ⋅ Mingyu Kang ⋅ Yong Suk Choi
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 478
Don’t Show Pixels, Show Cues: Unlocking Visual Tool Reasoning in Language Models via Perception Programs
Muhammad Kamran Janjua ⋅ Hugo Silva ⋅ Di Niu ⋅ Bahador Rashidi
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 479
Hugging Visual Prompt and Segmentation Tokens: Consistency Learning for Fine-Grained Visual Understanding in MLLMs
jing yang ⋅ Sen Yang ⋅ Boqiang Duan ⋅ Ming Dai ⋅ Wei Zhang ⋅ Xiao Tan ⋅ Kunbin Chen ⋅ Wei He ⋅ Jingdong Wang ⋅ Hanli Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 480
VisionLeaf: Entropy-Guided Leaf-First Reasoning for Efficient and Accurate Think-with-Image
Haokun GUI ⋅ Senqiao Yang ⋅ Mingkang Zhu ⋅ Meng Chu ⋅ WU Sitong ⋅ Changsheng Lu ⋅ Zihao Wang ⋅ Zhuotao Tian ⋅ Jiaya Jia
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 481
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
Jingxuan Wei ⋅ Caijun Jia ⋅ Xi Bai ⋅ Xinglong Xu ⋅ Siyuan Li ⋅ Linzhuang Sun ⋅ Bihui Yu ⋅ Conghui He ⋅ Lijun Wu ⋅ Cheng Tan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 482
Beyond Depth: Evaluating the Width-centric Reasoning Capability of MLLMs
Mingrui Chen ⋅ Hexiong Yang ⋅ Haogeng Liu ⋅ Huaibo Huang ⋅ Ran He
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 483
GenSplat: Bridging the Generalization Gap in 3DGS Language Comprehension
Fang Liu ⋅ Yuhao Liu ⋅ Ke Xu ⋅ Gerhard Hancke ⋅ Rynson W.H. Lau
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 484
CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering
Yuyang Hong ⋅ Jiaqi Gu ⋅ Yujing Lou ⋅ Lubin Fan ⋅ Qi Yang ⋅ Ying Wang ⋅ Kun Ding ⋅ Yue Wu ⋅ Shiming Xiang ⋅ Jieping Ye
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 485
LoPrune: Efficient Data Pruning for LoRA-Based Fine-Tuning of Vision Transformer
Qiang He ⋅ Yaozong Yang ⋅ KAIBIN WANG ⋅ Ziteng Wei ⋅ Feifei Chen ⋅ Caslon Chua ⋅ Yun Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 486
Multi-Scale Local Speculative Decoding for Image Generation
Elia Peruzzo ⋅ Guillaume Sautiere ⋅ Amir Habibian
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 487
Globscope: Toward a Global View of the Loss Landscape
Mashiat Mustaq ⋅ Xavier M.
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 488
RADAR: VQ-VAE Decoder of VAR is a Good Student for Restoring Against Degradation by Acceleration
Ziyang Wang ⋅ Yue Zhang ⋅ Mingdao Wang ⋅ Yasen Zhang ⋅ Teer Song ⋅ Yu Tian ⋅ Xueming LI
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 489
Beyond Single Solution: Multi-Hypothesis Deep Unfolding Network for Image Compressive Sensing
Wenxue Cui ⋅ Hualin Li ⋅ Yuhang Qin ⋅ Yifu Xu ⋅ Xiaopeng Fan ⋅ Debin Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 490
FlashDecoder: Real-Time Latent-to-Pixel Streaming Decoder with Transformers
Minguk Kang ⋅ Suha Kwak
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 491
MambaSIC: Mamba-based Stereo Image Compression with Bi-directional Multi-reference Entropy Model
Shiyu Qin ⋅ XINJIE ZHANG ⋅ Zhening Liu ⋅ Jinpeng Wang ⋅ Bin Chen ⋅ Jiawei Li ⋅ Yifan Ren ⋅ Shu-Tao Xia ⋅ Jun Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 492
Neural Dynamic GI: Random-Access Neural Compression for Temporal Lightmaps in Dynamic Lighting Environments
Jianhui Wu ⋅ Jian Zhou ⋅ Zhi Zhou ⋅ Zhangjin Huang ⋅ Chao Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 493
Discovering Adaptive Task Dependencies for Efficient Multi-Task Representation Compression
Zhimeng Huang ⋅ Rongao Yuan ⋅ Junlong Gao ⋅ Qi Mao ⋅ Siwei Ma ⋅ Wen Gao ⋅ Chuanmin Jia
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 494
OmniZip: Learning a Unified and Lightweight Lossless Compressor for Multi-Modal Data
Yan Zhao ⋅ Zhengxue Cheng ⋅ Junxuan Zhang ⋅ Dajiang Zhou ⋅ Qunshan Gu ⋅ Qi Wang ⋅ Li Song
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 495
Perceptual Neural Video Compression with Color Separation and Rank Chain
xiongzhuang liang ⋅ Chuanbo Tang ⋅ Zhuoyuan Li ⋅ Li Li ⋅ Dong Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 496
Beyond Matching to Tiles: Bridging Unaligned Aerial and Satellite Views for Vision-Only UAV Navigation
Liu Kejia ⋅ Haoyang Zhou ⋅ Ruoyu Xu ⋅ Peicheng Wang ⋅ Mingli Song ⋅ Haofei Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 497
GeoFlow: Real-Time Fine-Grained Cross-View Geolocalization via Iterative Flow Prediction
Ayesh Abu Lehyeh ⋅ Xiaohan Zhang ⋅ Ahmad Arrabi ⋅ Waqas Sultani ⋅ Chen Chen ⋅ Safwan Wshah
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 498
PiLoT: Neural Pixel-to-3D Registration for UAV-based Ego and Target Geo-localization
Xiaoya Cheng ⋅ Long Wang ⋅ Yan Liu ⋅ Xinyi Liu ⋅ Hanlin Tan ⋅ Yu Liu ⋅ Maojun Zhang ⋅ Shen Yan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 499
PAUL: Uncertainty-Guided Partition and Augmentation for Robust Cross-View Geo-Localization under Noisy Correspondence
Zheng Li ⋅ Xueyi Zhang ⋅ Yanming Guo ⋅ Yuxiang Xie ⋅ Ding Zhaoyun ⋅ Siqi Cai ⋅ Haizhou Li ⋅ Mingrui Lao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 500
UniGeoRS: A Unified Benchmark for Tri-view Geo-Localization
Xiao Liang ⋅ Huaizhi Tang ⋅ Feiyang Zhang ⋅ Shiji Yuan ⋅ Chun Hu ⋅ Dezhi Zheng ⋅ Kang Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 501
VGA: Empowering Aerial-Ground Localization by Visual Geometry Alignment
Tao Jun Lin ⋅ Yujiao Shi ⋅ Hongdong Li
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 502
Watch and Learn: Learning to Use Computers from Online Videos
Chan Hee Song ⋅ Yiwen Song ⋅ Palash Goyal ⋅ Yu Su ⋅ Oriana Riva ⋅ Hamid Palangi ⋅ Tomas Pfister
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 503
OneThinker: All-in-one Reasoning Model for Image and Video
Kaituo Feng ⋅ Manyuan Zhang ⋅ Hongyu Li ⋅ Kaixuan Fan ⋅ shuang chen ⋅ Yilei Jiang ⋅ Dian Zheng ⋅ Peiwen Sun ⋅ Yiyuan Zhang ⋅ Haoze Sun ⋅ Yan Feng ⋅ Peng Pei ⋅ Xunliang Cai ⋅ Xiangyu Yue
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 504
Incentivizing Versatile Video Reasoning in MLLMs via Data-Efficient Reinforcement Learning
Xiaodong Wang ⋅ Zhirong Wu ⋅ Langling Huang ⋅ Yuxi Zheng ⋅ Peixi Peng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 505
Act2See: Emergent Active Visual Perception for Video Reasoning
Martin Q. Ma ⋅ Yuxiao Qu ⋅ Aditya Agrawal ⋅ Willis Guo ⋅ Paul Pu Liang ⋅ Ruslan Salakhutdinov ⋅ Louis-Philippe Morency
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 506
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking
Jingyang Lin ⋅ Jialian Wu ⋅ Jiang Liu ⋅ Ximeng Sun ⋅ Ze Wang ⋅ Xiaodong Yu ⋅ Jiebo Luo ⋅ Zicheng Liu ⋅ Emad Barsoum
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 507
ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Weihao Bo ⋅ Shan Zhang ⋅ Yanpeng Sun ⋅ Jingjing Wu ⋅ Qunyi Xie ⋅ Xiao Tan ⋅ Kunbin Chen ⋅ Wei He ⋅ Xiaofan Li ⋅ Na Zhao ⋅ Jingdong Wang ⋅ Zechao Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 508
ReMoT: Reinforcement Learning with Motion Contrast Triplets
Cong Wan ⋅ Zeyu Guo ⋅ Jiangyang Li ⋅ SongLin Dong ⋅ Yifan Bai ⋅ Lin Peng ⋅ Zhiheng Ma ⋅ Yihong Gong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 509
Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues
Wenjin Hou ⋅ Xiaoxiao Sun ⋅ Hehe Fan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 510
Semantic-Guided Global-Local Collaborative Prompt Learning for Few-Shot Class Incremental Learning
yongxin yan ⋅ Weisen Chen ⋅ Xingye Chen ⋅ Yuanjie Shao ⋅ Zhengrong Zuo ⋅ Wenming Tan ⋅ Wenqi Ren ⋅ Changxin Gao ⋅ Nong Sang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 511
Beyond Heuristic Prompting: A Concept-Guided Bayesian Framework for Zero-Shot Image Recognition
Hui Liu ⋅ Kecheng Chen ⋅ Jialiang Wang ⋅ Xianming Liu ⋅ Wenya Wang ⋅ Haoliang Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 512
One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework
Lorenzo Bianchi ⋅ Giacomo Pacini ⋅ Fabio Carrara ⋅ Nicola Messina ⋅ Giuseppe Amato ⋅ Fabrizio Falchi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 513
Data-Centric Meta-Learning for Robust Few-Shot Generalization
Jongmin Lim ⋅ Soobin CHA ⋅ Jaehun Park ⋅ Inho Oh ⋅ Minho Park ⋅ Kwangsu Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 514
Bridging the Modality Gap in Compositional Zero-Shot Learning via Sparse Alignment and Unimodal Memory Bank
Yang Zhang ⋅ Zhixiang Chi ⋅ Xudong Yan ⋅ Yang Wang ⋅ Songhe Feng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 515
LIFT and PLACE: A Simple, Stable, and Effective Knowledge Distillation Framework for Lightweight Diffusion Models
Hyunsoo Han ⋅ Sangyeop Yeo ⋅ Jaejun Yoo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 516
WaDi: Weight Direction-aware Distillation for One-step Image Synthesis
Lei Wang ⋅ Yang Cheng ⋅ Senmao Li ⋅ Ge Wu ⋅ Yaxing Wang ⋅ Jian Yang
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 517
Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models
Jingchen Sun ⋅ Shaobo Han ⋅ Deep Patel ⋅ Wataru Kohno ⋅ Can Jin ⋅ Changyou Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 518
Beyond Soft Label: Dataset Distillation via Orthogonal Gradient Matching
Deyu Bo ⋅ Xinchao Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 519
BHCast: Unlocking Black Hole Plasma Dynamics from a Single Blurry Image with Long-Term Forecasting
Renbo Tu ⋅ Ali SaraerToosi ⋅ Nicholas S. Conroy ⋅ Gennady Pekhimenko ⋅ Aviad Levis
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 520
RawMetaDiff: Unlocking Extreme Darkness from Dual-Exposure RAW with Meta-Guided Diffusion
Panjun Liu ⋅ Jiyuan Xia ⋅ YUANSHEN GUAN ⋅ Yong Li ⋅ Zhiqiang Lang ⋅ Ruikang Xu ⋅ Chang Chen ⋅ Dehua Song ⋅ Fenglong Song ⋅ Zhiwei Xiong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 521
Prospective Dynamic 3D MRI Reconstruction via Latent-Space Motion Tracking from Single Measurement
Lixuan Chen ⋅ Zhongnan Liu ⋅ Jesse Hamilton ⋅ James M. Balter ⋅ Jeong Joon Park ⋅ Liyue Shen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 522
Lens Component Deletion based on Differentiable Ray Tracing
Wenguan Zhang ⋅ Qirun Zhang ⋅ Tuo Sun ⋅ Jiajian He ⋅ Jiahui Xu ⋅ Huajun Feng ⋅ Qi Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 523
X-band Radar Non-Line-of-Sight Imaging
Dongyu Du ⋅ Mingkun Zhao ⋅ Yutong Yang ⋅ Dominik Scheuble ⋅ Xiaolong Huang ⋅ Zijian Shao ⋅ Mario Bijelic ⋅ Kaushik Sengupta ⋅ Felix Heide
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 524
3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion
Minchong Chen ⋅ Xiaoyun Yuan ⋅ Junzhe Wan ⋅ Jianing Zhang ⋅ Jun Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 525
UAVLight: A Benchmark for Illumination-Robust 3D Reconstruction in Unmanned Aerial Vehicle (UAV) Scenes
Kang DU ⋅ Xue Liao ⋅ Junpeng Xia ⋅ Chaozheng Guo ⋅ Yi Gu ⋅ Yirui Guan ⋅ Duotun Wang ⋅ Sheng Huang ⋅ Zeyu Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 526
Polarization State Tracing for Reflection Removal and Color-Consistent Reconstruction
Dongyue Wang ⋅ Yang Lu ⋅ Jiandong Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 527
GFRRN: Explore the Gaps in Single Image Reflection Removal
Yu Chen ⋅ Zewei He ⋅ Xingyu Liu ⋅ Zixuan Chen ⋅ Zhe-Ming Lu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 528
Efficient All-Pairs Correlation Volume Sampling for Optical Flow Estimation
Karlis Martins Briedis ⋅ Markus Gross ⋅ Christopher Schroers
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 529
Cross-Slice Knowledge Transfer via Masked Multi-Modal Heterogeneous Graph Contrastive Learning for Spatial Gene Expression Inference
Zhiceng Shi ⋅ Changmiao Wang ⋅ Jun Wan ⋅ Wenwen Min
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 530
Adapting a Pre-trained Single-Cell Foundation Model to Spatial Gene Expression Generation from Histology Images
Donghai Fang ⋅ Yongheng Li ⋅ Zhen WANG ⋅ Yuansong Zeng ⋅ Wenwen Min
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 531
HyperST: Hierarchical Hyperbolic Learning for Spatial Transcriptomics Prediction
Chen Zhang ⋅ Yilu An ⋅ Ying Chen ⋅ Hao Li ⋅ Xitong Ling ⋅ Lihao Liu ⋅ Junjun He ⋅ Yuxiang Lin ⋅ Zihui Wang ⋅ Rongshan Yu
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 532
SO(3)-Equivariant ViT-Adapter for Data-Efficient Zero-Shot Sim-to-Real Indoor Panoramic Depth Estimation
Ziyan He ⋅ Qiudan Zhang ⋅ Lin Ma ⋅ Xu Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 533
Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion
Yu Xue ⋅ Longjun Gao ⋅ Yuanqi Su ⋅ HaoAng Lu ⋅ Xiaoning Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 534
XPaintNet: An eXtreme Lightweight Framework for Stereoscopic Conversion without Inpainting Network
Kihwan Yoon ⋅ Juyeon Shin ⋅ Jeongheum Kang ⋅ Sijung Kim ⋅ Minyong Jeon
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 535
MD2E: Modeling Depth-to-Edge Cues for Monocular Metric Depth Estimation
Chao Ning ⋅ Minghe Shen ⋅ Naoto Yokoya
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 536
LiteSense: Lifting Lightweight ToF with RGB for High-Resolution Metric Depth Estimation
Yusheng Li ⋅ Lizhi LOU ⋅ Yan Tang ⋅ Zekai Miao ⋅ shaoming zhang ⋅ Jianmei Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 537
3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding
Xiaoye Wang ⋅ Chen Tang ⋅ Xiangyu Yue ⋅ Wei-Hong Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 538
The Midas Touch for Metric Depth
Yu Ma ⋅ Zizhan Guo ⋅ Zuyi Xiong ⋅ Haoran Zhang ⋅ Yi Feng ⋅ Hongbo Zhao ⋅ Hanli Wang ⋅ Rui Fan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 539
Lifting Unlabeled Internet-level Data for 3D Scene Understanding
Yixin Chen ⋅ Yaowei Zhang ⋅ Huangyue Yu ⋅ Junchao He ⋅ Yan Wang ⋅ Jiangyong Huang ⋅ Hongyu Shen ⋅ Junfeng Ni ⋅ Shaofei Wang ⋅ Baoxiong Jia ⋅ Song-Chun Zhu ⋅ Siyuan Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 540
ObjectMorpher: 3D-Aware Image Editing via Deformable 3DGS
Yuhuan Xie ⋅ Aoxuan Pan ⋅ Yihua Huang ⋅ Chirui Chang ⋅ Peng Dai ⋅ Xin Yu ⋅ Xiaojuan Qi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 541
PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image
Ziang Cao ⋅ Fangzhou Hong ⋅ Zhaoxi Chen ⋅ Liang Pan ⋅ Ziwei Liu
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 542
MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer
Weiyu Li ⋅ Antoine Toisoul ⋅ Tom Monnier ⋅ Roman Shapovalov ⋅ Rakesh Ranjan ⋅ Ping Tan ⋅ Andrea Vedaldi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 543
WonderZoom: Multi-Scale 3D World Generation
Jin Cao ⋅ Hong-Xing Yu ⋅ Jiajun Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 544
SceneTok: A Compressed, Diffusable Token Space for 3D Scenes
Mohammad Asim ⋅ Christopher Wewer ⋅ Jan Lenssen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 545
PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction
Xiang Zhang ⋅ Sohyun Yoo ⋅ Hongrui Wu ⋅ Chuan Li ⋅ Jianwen Xie ⋅ Zhuowen Tu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 546
Extend3D: Town-Scale 3D Generation
Seungwoo Yoon ⋅ Jinmo Kim ⋅ Jaesik Park
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 547
Pano3DComposer: Feed-Forward Compositional 3D Scene Generation from Single Panoramic Image
Zidian Qiu ⋅ Ancong Wu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 548
MeshWeaver: Sparse-Voxel-Guided Surface Weaving for Autoregressive Mesh Generation
Jiale Xu ⋅ Wang Zhao ⋅ Ying Shan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 549
CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation
Chenyu Liu ⋅ Hongze CHEN ⋅ Jingzhi Bao ⋅ Lingting Zhu ⋅ Runze Zhang ⋅ Weikai Chen ⋅ Zeyu HU ⋅ Yingda Yin ⋅ Keyang Luo ⋅ Xin Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 550
CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion
James Jincheng Hu ⋅ Yuxiao Wu ⋅ Youcheng Cai ⋅ Ligang Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 551
LoG3D: Ultra-High-Resolution 3D Shape Modeling via Local-to-Global Partitioning
Xinran Yang ⋅ Shuichang Lai ⋅ Jiangjing Lyu ⋅ Hongjie Li ⋅ Bowen Pan ⋅ Yuanqi Li ⋅ Jie Guo ⋅ Zhengkang Zhou ⋅ Yanwen Guo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 552
MaskFocus: Focusing Policy Optimization on Critical Steps for Masked Image Generation
Guohui Zhang ⋅ Hu Yu ⋅ Xiaoxiao Ma ⋅ Yaning Pan ⋅ Hang Xu ⋅ Jie Huang ⋅ Feng Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 553
Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning
Changlin Li ⋅ Jiawei Zhang ⋅ Shuhao Liu ⋅ Sihao Lin ⋅ Zeyi Shi ⋅ Zhihui Li ⋅ Xiaojun Chang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 554
PosterOmni: Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback
Sixiang Chen ⋅ Jianyu LAI ⋅ Jialin Gao ⋅ Hengyu Shi ⋅ Zhongying Liu ⋅ Tian Ye ⋅ Junfeng Luo ⋅ Xiaoming Wei ⋅ Lei Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 555
GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping
Jing Wang ⋅ Jiajun Liang ⋅ Jie Liu ⋅ Henglin Liu ⋅ Gongye Liu ⋅ Jun Zheng ⋅ Wanyuan Pang ⋅ Ao Ma ⋅ Zhenyu Xie ⋅ Xintao Wang ⋅ Meng Wang ⋅ Pengfei Wan ⋅ Xiaodan Liang
[ Slides
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 556
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation
Weijia Mao ⋅ Hao Chen ⋅ Zhenheng Yang ⋅ Mike Zheng Shou
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 557
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning
Guanjie Chen ⋅ Shirui Huang ⋅ Yifu Sun ⋅ Kai Liu ⋅ Jianchen Zhu ⋅ Xiaoye Qu ⋅ Yu Cheng ⋅ Peng Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 558
VISTA: A Test-Time Self-Improving Video Generation Agent
Do Xuan Long ⋅ Xingchen Wan ⋅ Hootan Nakhost ⋅ Chen-Yu Lee ⋅ Tomas Pfister ⋅ Sercan O Arik
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 559
Neighbor GRPO: Contrastive ODE Policy Optimization Aligns Flow Models
Dailan He ⋅ Guanlin Feng ⋅ Xingtong Ge ⋅ Yazhe Niu ⋅ Yi Zhang ⋅ Bingqi Ma ⋅ Guanglu Song ⋅ Yu Liu ⋅ Hongsheng Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 560
SMV-EAR: Bring Spatiotemporal Multi-View Representation Learning into Efficient Event-Based Action Recognition
Rui Fan ⋅ Weidong Hao ⋅ Juntao Guan ⋅ Lai Rui ⋅ Tong Wu ⋅ Fanhong Zeng ⋅ Lin Gu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 561
Hierarchical Action Learning for Weakly-Supervised Action Segmentation
Junxian Huang ⋅ Ruichu Cai ⋅ Juntao Fang ⋅ Hao Zhu ⋅ Boyan Xu ⋅ Weilin Chen ⋅ Zijian Li ⋅ Shenghua Gao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 562
Gamba: Mamba-based graph convolutional network with dynamic graph topology learning for action recognition
Rouyi Zhou ⋅ 漾之 吴 ⋅ Jiajun Wen ⋅ Can Gao ⋅ Feng Liu ⋅ Zhihui Lai ⋅ Linlin Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 563
Beyond Binary Contrast: Modeling Continuous Skeleton Action Spaces with Transitional Anchors
Yingjie Feng ⋅ Yi Wang ⋅ Jiaze Wang ⋅ Anfeng Liu ⋅ Zhuotao Tian
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 564
PRISM: Learning a Shared Primitive Space for Transferable Skeleton Action Representation
Di Yang ⋅ Yaohui Wang ⋅ Shuai Shao ⋅ Francois Bremond ⋅ Jiangtao Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 565
TWEO: Transformers Without Extreme Outliers Enables FP8 Training And Quantization For Dummies
Guang Liang ⋅ Jie Shao ⋅ Ningyuan Tang ⋅ Xinyao Liu ⋅ Jianxin Wu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 566
Unified Spherical Frontend: Learning Rotation-Equivariant Representations of Spherical Images from Any Camera
Mukai Yu ⋅ Mosam Dabhi ⋅ Liuyue Xie ⋅ Sebastian Scherer ⋅ László A. Jeni
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 567
The Surprising Effectiveness of Noise Pretraining for Implicit Neural Representations
Kushal Vyas ⋅ Alper Kayabasi ⋅ Daniel Kim ⋅ Vishwanath Saragadam ⋅ Ashok Veeraraghavan ⋅ Guha Balakrishnan
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 568
DABO: Difficulty-Aware Bayesian Optimization with Diffusion-Learned Priors
Mengyang Li ⋅ Pinlong Zhao
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 569
Towards Knowledge-augmented Bayesian Deep Learning For Computer Vision
Wang Ma ⋅ Hanjing Wang ⋅ Yufei Zhang ⋅ Darsha Udayanga ⋅ Qiang Ji
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 570
NESTOR: A Nested MOE-based Neural Operator for Large-Scale PDE Pre-Training
Dengdi Sun ⋅ Xiaoya Zhou ⋅ Xiao Wang ⋅ Hao Si ⋅ Wanli Lyu ⋅ Jin Tang ⋅ Bin Luo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 571
Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation
Yongchan Chun ⋅ Chanhee Park ⋅ Jeongho Yoon ⋅ Jaehyung Seo ⋅ Heuiseok Lim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 572
Beyond Euclidean Gossip: KL-Barycentric Consensus on Heterogeneous and Imbalanced Images
Lu Xu ⋅ Guosheng Yin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 573
Prime Once, then Reprogram Locally: An Efficient Alternative to Black-Box Service Model Adaptation
Yunbei Zhang ⋅ Chengyi Cai ⋅ Feng Liu ⋅ Jihun Hamm
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 574
Batch Loss Score for Dynamic Data Pruning
Qing Zhou ⋅ Bingxuan Zhao ⋅ Tao Yang ⋅ Hongyuan Zhang ⋅ Junyu Gao ⋅ Qi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 575
Teacher-Guided Routing for Sparse Vision Mixture-of-Experts
Masahiro Kada ⋅ Ryota Yoshihashi ⋅ Satoshi Ikehata ⋅ Rei Kawakami ⋅ Ikuro Sato
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 576
WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces
Sicheng Fan ⋅ Rui Wan ⋅ Yifei Leng ⋅ Gaoning Liang ⋅ LI LING ⋅ Yanyi Shang ⋅ Dehan Kong
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 577
MangoBench: A Benchmark for Multi-Agent Goal-Conditioned Offline Reinforcement Learning
Yi Wang ⋅ Ningze Zhong ⋅ Zhiheng Fu ⋅ Longguang Wang ⋅ Ye Zhang ⋅ Yulan Guo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 578
iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive Perception
Sarthak Mehrotra ⋅ Sairam Rebbapragada ⋅ Mani Bonthu ⋅ Vineeth Balasubramanian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 579
MMBench-GUI: A Unified Hierarchical Evaluation Framework for Multi-Platform GUI Agents
Xuehui Wang ⋅ Zhenyu Wu ⋅ JingJing Xie ⋅ Zichen Ding ⋅ Bowen Yang ⋅ Zehao Li ⋅ Zhaoyang Liu ⋅ Qingyun Li ⋅ Xuan Dong ⋅ Zhe Chen ⋅ Weiyun Wang ⋅ Xiangyu Zhao ⋅ Jixuan Chen ⋅ Haodong Duan ⋅ Tianbao Xie ⋅ Chenyu Yang ⋅ Shiqian Su ⋅ Yue Yu ⋅ Yanting Zhang ⋅ Xiangyu Yue ⋅ Weijie Su ⋅ Xizhou Zhu ⋅ Wei Shen ⋅ Jifeng Dai ⋅ Wenhai Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 580
Boosting Vision-Language Models Towards Cross-Domain Incremental Object Detection
Xu Wang ⋅ Zihan Lin ⋅ Yixin Zhang ⋅ Zilei Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 581
UniSpector: Towards Universal Open-set Defect Recognition via Spectral-Contrastive Visual Prompting
Geonuk Kim ⋅ Minhoi Kim ⋅ Kangil Lee ⋅ Minsu Kim ⋅ Hyeonseong Jeon ⋅ JEONGHOON HAN ⋅ Hyoungjoon Lim ⋅ Junho Yim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 582
Unlearning without Forgetting: Securely Removing Targeted Concepts from Large-Scale Vision-Language Open-Vocabulary Detectors
Zhongze Wu ⋅ Xiu Su ⋅ Feng Yang ⋅ Dan Niu ⋅ Shan You ⋅ Yueyi Luo ⋅ Jun Long
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 583
UNI-OOD: Unified Object- and Image-level Out-of-Distribution Detection via Cross-Context Attentive Vision-Language Modeling
Yuchuan Li ⋅ Azadeh Motamedi ⋅ Hyock Ju Kwon ⋅ Chul B Park ⋅ Il-Min Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 584
S2C2Seg: Semantic-Spatial Consistency and Category Optimization for Open-Vocabulary Segmentation
Yuhao Qing ⋅ Yueying Wang ⋅ Chaoyang Chen ⋅ Weidong Zhang ⋅ Jie Wen ⋅ Xin Xu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 585
NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection
Yupeng Zhang ⋅ Ruize Han ⋅ Zhiwei Chen ⋅ Wei Feng ⋅ Liang Wan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 586
The Missing Point in Vision Transformers for Universal Image Segmentation
Sajjad Shahabodini ⋅ Mobina Mansoori ⋅ Farnoush Bayatmakou ⋅ Jamshid Abouei ⋅ Konstantinos N. Plataniotis ⋅ Arash Mohammadi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 587
PromptMoE: A Segmentation Refinement Framework Leveraging Mixture of Experts for Improved Prompting
Stephen Price ⋅ Danielle L. Cote ⋅ Elke A. Rundensteiner
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 588
The Power of Prior: Training-Free Open-Vocabulary Semantic Segmentation with LLaVA
Bingfeng Zhang ⋅ Siyue Yu ⋅ Hui Li ⋅ Jiahua Lin ⋅ Wenwu Wang ⋅ Jimin Xiao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 589
Beyond Text: Visual Description Assembly by Probabilistic Model for CLIP-based Weakly Supervised Semantic Segmentation
Xianglin Qiu ⋅ Jian Wang ⋅ Xiaolei Wang ⋅ Zhen Zhang ⋅ Jimin Xiao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 590
High-Precision Dichotomous Image Segmentation via Depth Integrity-Prior and Fine-Grained Patch Strategy
Xianjie Liu ⋅ Keren Fu ⋅ Qijun Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 591
GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation
Ken Deng ⋅ Yunhan Yang ⋅ Jingxiang Sun ⋅ Xihui Liu ⋅ Yebin Liu ⋅ Ding Liang ⋅ Yan-Pei Cao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 592
Material Magic Wand: Material-Aware Grouping of 3D Parts in Untextured Meshes
Umangi Jain ⋅ Vladimir G. Kim ⋅ Matheus Gadelha ⋅ Igor Gilitschenski ⋅ Zhiqin Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 593
Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding
Weikai Huang ⋅ Jieyu Zhang ⋅ Taoyang jia ⋅ Chenhao Zheng ⋅ Ziqi Gao ⋅ Jae Sung Park ⋅ Ranjay Krishna
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 594
Unlocking 3D Affordance Segmentation with 2D Semantic Knowledge
Yu Huang ⋅ Zelin Peng ⋅ Changsong Wen ⋅ Xiaokang Yang ⋅ Wei Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 595
HySeg: Learning Generative Priors for Structure-Aware Remote Sensing Segmentation
Jie Qiu ⋅ XIN LI ⋅ Fan Yang ⋅ Yan Wang ⋅ Dong Yu ⋅ Changying Wang ⋅ Linwei Dai ⋅ Yongxiang Chen ⋅ Youqin Chen ⋅ Jianzhang Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 596
Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization
Inha Kang ⋅ Eunki Kim ⋅ Wonjeong Ryu ⋅ Jaeyo Shin ⋅ Seungjun Yu ⋅ Yoon-Hee Kang ⋅ Seongeun Jeong ⋅ Eunhye Kim ⋅ Soontae Kim ⋅ Hyunjung Shim
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 597
MMVIP: A Visible-infrared Paired Dataset for Multi-weather Marine Vision
Yunpeng Yin ⋅ Lihan Wang ⋅ Zhaoshen He ⋅ Xinqiang He ⋅ Xingming Liao ⋅ Zhuowei Wang ⋅ Lianglun Cheng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 598
Beyond Tie Points: Satellite Image Block Adjustment based on Dense Feature Consistency
Yi Liu ⋅ Yi Wan ⋅ Lei Yu ⋅ Panwang Xia ⋅ Qiong Wu ⋅ Yingying Pei ⋅ Xuejun Huang ⋅ Junjian Zhang ⋅ Xiangyuan Cai ⋅ Hongwei Hu ⋅ Yongjun Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 599
Spectrally Distilled Representations Aligned with Instruction-Augmented LLMs for Satellite Imagery
Minh Do ⋅ Wei Xiang ⋅ Kang Han ⋅ Di Wu ⋅ Khoa T. Phan ⋅ Yi-Ping Phoebe Chen ⋅ Gaowen Liu ⋅ Ramana Kompella
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 600
Global Underwater Geolocation from Time-Lapse Polarization Imagery
Sara Aghajanzadeh ⋅ Xiaoyang Bai ⋅ Zhongmin Zhu ⋅ David Forsyth ⋅ Viktor Gruev
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 601
Olbedo: An Albedo and Shading Aerial Dataset for Large-Scale Outdoor Environments
Shuang Song ⋅ Debao Huang ⋅ Deyan Deng ⋅ Haolin Xiong ⋅ Yang Tang ⋅ Yajie Zhao ⋅ Rongjun Qin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 602
PRUE: A Practical Recipe for Field Boundary Segmentation at Scale
Gedeon Muhawenayo ⋅ Caleb Robinson ⋅ Subash Khanal ⋅ Zhanpei Fang ⋅ Isaac Corley ⋅ Alexander Wollam ⋅ Tianyi Gao ⋅ Leonard Strnad ⋅ Ryan Avery ⋅ Lyndon Estes ⋅ Ana Tárano ⋅ Nathan Jacobs ⋅ Hannah Kerner
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 603
SARMAE: Masked Autoencoder for SAR Representation Learning
Danxu Liu ⋅ Di Wang ⋅ Hebaixu Wang ⋅ Haoyang Chen ⋅ Wentao Jiang ⋅ Yilin Cheng ⋅ Haonan Guo ⋅ Wei Cui ⋅ Jing Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 604
LNEM: Lunar Neural Elevation Model
Suwan Lee ⋅ Jo Ryeong Yim ⋅ Kibaek Park ⋅ Dong-Gyu Kim ⋅ Eunhyeuk Kim ⋅ Minsup Jeong ⋅ Chae Kyung Sim ⋅ Seokju Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 605
A Polarized Reflection and Material Dataset of Real World Objects
Jing Yang ⋅ Krithika Dharanikota ⋅ Emily Jia ⋅ Haiwei Chen ⋅ Yajie Zhao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 606
LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
Zihe Yan ⋅ Zhuosheng Zhang ⋅ Jiaping Gui ⋅ Gongshen Liu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 607
RaPA: Enhancing Transferable Targeted Attacks via Random Parameter Pruning
Tongrui Su ⋅ Qingbin Li ⋅ Shengyu Zhu ⋅ Wei Chen ⋅ Xueqi Cheng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 608
All Vehicles Can Lie: Efficient Adversarial Defense in Fully Untrusted-Vehicle Collaborative Perception via Pseudo-Random Bayesian Inference
Yi Yu ⋅ Libing Wu ⋅ Zhuangzhuang Zhang ⋅ Jing Qiu ⋅ Lijuan Huo ⋅ Jiaqi Feng
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 609
A Combination of Noise and Bilateral Filters Achieve Supralinear and Scalable Adversarial Robustness in CNNs
Nicolas Stalder ⋅ Benjamin F Grewe ⋅ Matteo Saponati ⋅ Pau Vilimelis Aceituno
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 610
DeepProtect: Proactive Face-Swapping Defense using Identity Blending and Attribute Distortion
Eungi Lee ⋅ Seung-hyeok Back ⋅ Hyung-Il Kim ⋅ Seok Bong Yoo
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 611
Write Where It Matters: Policy-Guided Watermarks for 3D Gaussian Splatting
Nan Li ⋅ Yike Zeng ⋅ Qian Zhang ⋅ Qi Zhang ⋅ Zhiyi Pan ⋅ Wei Feng ⋅ Liang Wan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 612
Attack for Defense: Adversarial Agents for Point Prompt Optimization Empowering Segment Anything Model
Xueyu Liu ⋅ Xiaoyi Zhang ⋅ Meilin Liu ⋅ Guangze Shi ⋅ Jia Shen ⋅ Yujie Wang ⋅ Cai Zhao ⋅ Ziyuan He ⋅ Yongfei Wu ⋅ Mingqiang Wei ⋅ Yongle Chen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 613
RevINN: An End-to-End Invertible Neural Network for Reversible Adversarial Examples Generation
Jielun Huang ⋅ Chi-Man Pun ⋅ Guoheng Huang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 614
CamPI: Physical Adversarial Examples through Camera Power Signal Injection
yanze ren ⋅ Mingyuan Lv ⋅ Qinhong Jiang ⋅ Yan Jiang ⋅ Chen Yan ⋅ Xiaoyu Ji ⋅ Wenyuan Xu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 615
Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs
Lianyu Wang ⋅ Meng Wang ⋅ Huazhu Fu ⋅ Daoqiang Zhang
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 616
GraspALL: Adaptive Structural Compensation from Illumination Variation for Robotic Garment Grasping in Any Low-Light Conditions
Haifeng Zhong ⋅ Wenshuo Han ⋅ Zhouyu Wang ⋅ Runyang Feng ⋅ Fan Tang ⋅ Tong-yee Lee ⋅ zipei fan ⋅ Ruihai Wu ⋅ Yuran Wang ⋅ Hao Dong ⋅ Hechang Chen ⋅ Hyung Jin Chang ⋅ Yixing Gao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 617
Opening the Sim-to-Real Door for Humanoid Pixel-to-Action Policy Transfer
Haoru Xue ⋅ Tairan He ⋅ Zi Wang ⋅ Qingwei Ben ⋅ Wenli Xiao ⋅ Zhengyi Luo ⋅ Xingye Da ⋅ Fernando Castañeda ⋅ Guanya Shi ⋅ Shankar Sastry ⋅ Jim Fan ⋅ Yuke Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 618
Learning Cross-View Object Correspondence via Cycle-Consistent Mask Prediction
Shannan Yan ⋅ Leqi Zheng ⋅ Keyu Lv ⋅ Jingchen Ni ⋅ Hongyang Wei ⋅ Jiajun Zhang ⋅ Guangting Wang ⋅ Jing LYU ⋅ Chun Yuan ⋅ Fengyun Rao
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 619
RoboWheel: A Data Engine from Real-World Human Demonstrations for Cross-Embodiment Robotic Learning
Yuhong Zhang ⋅ Zihan Gao ⋅ Shengpeng Li ⋅ Ling-Hao Chen ⋅ Kaisheng Liu ⋅ Runqing Cheng ⋅ Xiao Lin ⋅ Junjia Liu ⋅ Zhuoheng Li ⋅ Jingyi Feng ⋅ Ziyan He ⋅ Jintian Lin ⋅ Zheyan Huang ⋅ Zhifang Liu ⋅ Haoqian Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 620
Chain of World: World Model Thinking in Latent Motion
Fuxiang Yang ⋅ Donglin Di ⋅ Lulu Tang ⋅ Xuancheng Zhang ⋅ Lei Fan ⋅ Hao Li ⋅ Wei Chen ⋅ Tonghua Su ⋅ Baorui Ma
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 621
Scalable Feature Matching via State Space Modeling and Sparse Correlation
Choo Sin Wai ⋅ Bo Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 622
Video2Robo: 3DGS-based Synthetic Data from One Video Enables Scalable Robot Learning
Yinan Deng ⋅ Kejia Hu ⋅ Ye Chen ⋅ Jianyu Dou ⋅ Jiahui Wang ⋅ Jingyu Zhao ⋅ Haojia Ao ⋅ Yi Yang ⋅ Yufeng Yue
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 623
ConsisVLA-4D: Advancing Spatiotemporal Consistency in Efficient 3D-Perception and 4D-Reasoning for Robotic Manipulation
Wei Li ⋅ Jizhihui Liu ⋅ Yixing Li ⋅ Junwen Tong ⋅ Rui Shao ⋅ Liqiang Nie
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 624
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models
Senyu Fei ⋅ Siyin Wang ⋅ Li Ji ⋅ Ao Li ⋅ Shiduo Zhang ⋅ Liming Liu ⋅ Jinlong Hou ⋅ Jingjing Gong ⋅ Xianzhong Zhao ⋅ Xipeng Qiu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 625
GeoDexGrasp: Geometry-aware Generation for Data-efficient and Physics-plausible Dexterous Grasping
Bing Han ⋅ Weiyuan Liu ⋅ changlong Zhang ⋅ Chenxi Wang ⋅ Zhibin Zhao ⋅ Zhi Zhai
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 626
Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment
Yu Fanqi ⋅ Matteo Tiezzi ⋅ Tommaso Apicella ⋅ Cigdem Beyan ⋅ Vittorio Murino
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 627
From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings
Jiajie Zhang ⋅ Sören Schwertfeger ⋅ Alexander Kleiner
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 628
AGiLe: Learning Robust Long-Horizon Manipulation via Affordance-Grounded Bidirectional Latent Planning
Zixuan Chen ⋅ Xiangrong Feng ⋅ Jieqi Shi ⋅ Lin Shao ⋅ Jing Huo ⋅ Yang Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 629
Language-Grounded Decoupled Action Representation for Robotic Manipulation
WuDing Weng ⋅ Tongshu Wu ⋅ Liucheng Chen ⋅ Siyu xie ⋅ Zheng Wang ⋅ Xing Xu ⋅ Jingkuan Song ⋅ Heng Tao Shen
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 630
Learning to Act Robustly with View-Invariant Latent Actions
Youngjoon Jeong ⋅ Junha Chun ⋅ Taesup Kim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 631
ORBIT: Benchmarking SfM in the Wild with 360° Video
Sara Sabour ⋅ Richard Tucker ⋅ Marcus Brubaker ⋅ Saurabh Saxena ⋅ Junhwa Hur ⋅ Andrea Tagliasacchi ⋅ Deqing Sun ⋅ David J. Fleet ⋅ Richard Szeliski ⋅ Noah Snavely
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 632
SpikeTrack: A Spike-driven Framework for Efficient Visual Tracking
Qiuyang Zhang ⋅ Jiujun Cheng ⋅ Qichao Mao ⋅ Cong Liu ⋅ Yu Fang ⋅ Yuhong Li ⋅ Mengying Ge ⋅ Shangce Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 633
Time Without Time: Pseudo-Temporal Representation for Space-Time Super-Resolution
Hee Min Choi ⋅ Hyoa Kang ⋅ Suji Kim ⋅ Dokwan Oh ⋅ Nam Ik Cho
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 634
Envisioning the Future, One Step at a Time
Stefan Andreas Baumann ⋅ Jannik Wiese ⋅ Tommaso Martorella ⋅ M. Kalayeh ⋅ Björn Ommer
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 635
FlowFM: Advancing Dark Optical Flow Estimation with Flow Matching
Fengyuan Zuo ⋅ Haiyan Jin ⋅ Yuanlin Zhang ⋅ Zhaolin Xiao ⋅ Bin Wang ⋅ Yuerong Mu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 636
Drift-Resilient Temporal Priors for Visual Tracking
Yuqing Huang ⋅ Liting Lin ⋅ Weijun Zhuang ⋅ Zhenyu He ⋅ Xin Li
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 637
An Efficient Token Compression Framework for Visual Object Tracking
Weijing Wu ⋅ Qihua Liang ⋅ Bineng Zhong ⋅ Haiying Xia ⋅ Zhiyi Mo ⋅ Shuxiang Song
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 638
No Labels, No Look-Ahead: Unsupervised Online Video Stabilization with Classical Priors
Kan Ren ⋅ Gang Wan ⋅ TAO LIU
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 639
From Detection to Association: Learning Discriminative Object Embeddings for Multi-Object Tracking
Yuqing Shao ⋅ Yuchen Yang ⋅ Rui Yu ⋅ Weilong Li ⋅ Xu Guo ⋅ Huaicheng Yan ⋅ Wei Wang ⋅ Xiao Sun
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 640
Momentum Memory for Knowledge Distillation in Computational Pathology
yongxin guo ⋅ Hao Lu ⋅ Onur C. ⋅ Zhengjie Zhu ⋅ Muhammet F. ⋅ Metin N.
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 641
Modeling the Brain’s Grammar: ROI-Guided fMRI Pretraining for Transferable and Interpretable Vision Decoding
Yulong Liu ⋅ Hua Xu ⋅ Yiyang Cai ⋅ Chunyang Jiang ⋅ Sirui Han ⋅ Yike Guo
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 642
Joint Spectral Image Reconstruction and Semantic Segmentation with Cooperative Unfolding
Zijun He ⋅ Ping Wang ⋅ Xiaodong Wang ⋅ Chang Chen ⋅ Xin Yuan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 643
X-WIN: Building Chest Radiograph World Model via Predictive Sensing
Zefan Yang ⋅ Ge Wang ⋅ James Hendler ⋅ Mannudeep K. Kalra ⋅ Pingkun Yan
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 644
fMRI-LM: Towards a Universal Foundation Model for Language-Aligned fMRI Understanding
Yuxiang Wei ⋅ Yanteng Zhang ⋅ Xi Xiao ⋅ Chengxuan Qian ⋅ Tianyang Wang ⋅ Vince D. Calhoun
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 645
Tell2Adapt: A Unified Framework for Source Free Unsupervised Domain Adaptation via Vision Foundation Model
Yulong Shi ⋅ Shijie Li ⋅ Ziyi Li ⋅ Lin Qi
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 646
TIM: Temporal Decoupling with Iterative Mutual-Refinement Model for Longitudinal Radiology Report Generation
Yiheng Dong ⋅ Yi Lin ⋅ Shilong Huang ⋅ Xiyan Yang ⋅ Xin Yang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 647
Ultrasound-CLIP: Semantic-Aware Contrastive Pre-training for Ultrasound Image-Text Understanding
Jiayun Jin ⋅ Haolong Chai ⋅ Xueying Huang ⋅ Xiaoqing Guo ⋅ Zengwei Zheng ⋅ Zhan Zhou ⋅ Junmei Wang ⋅ Xinyu Wang ⋅ Jie Liu ⋅ Binbin Zhou
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 648
Act Like a Pathologist: Tissue-Aware Whole Slide Image Reasoning
Wentao Huang ⋅ Weimin Lyu ⋅ Peiliang Lou ⋅ Qingqiao Hu ⋅ Xiaoling Hu ⋅ Shahira Abousamra ⋅ Wenchao Han ⋅ Ruifeng Guo ⋅ Jiawei Zhou ⋅ Chao Chen ⋅ Chen Wang
[ Slides [ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 649
BiGMINT: Biologically-guided Hierarchical Multimodal Integration for Modeling Multiple Compound Activities in Drug Discovery
Pushpak Pati ⋅ Bo Li ⋅ Abbas Rayabat Khan ⋅ Tomé Albuquerque ⋅ Steffen Jaensch ⋅ Amina Mollaysa ⋅ Walid Hassan ⋅ Samantha J. Allen ⋅ Joke Reumers ⋅ Helai P. Mohammad ⋅ Scott Oloff ⋅ Tommaso Mansi ⋅ Rui Liao ⋅ Dmytro S. Lituiev ⋅ Zhoubing Xu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 650
Modeling Spatiotemporal Neural Frames for High Resolution Brain Dynamic
Wanying Qu ⋅ Jianxiong Gao ⋅ Wei Wang ⋅ Yanwei Fu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 651
CMR-RD: Long-Tailed Adaptive VLM for Explainable CMR Diagnosis
Yansong Li ⋅ Zhongxi Qiu ⋅ Yun Tian ⋅ Zheng jinyu ⋅ Shuo Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 652
Clinically-Grounded Counterfactual Reasoning for Medical Video Diagnosis
Jianzhe Gao ⋅ Churan Wang ⋅ Weiyi Zhang ⋅ Jianghua Li ⋅ Lian Li ⋅ Wenguan Wang ⋅ Yixin Zhu ⋅ Yizhou Wang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 653
FBTA: Enabling Single-GPU End-to-End Gigapixel WSI Classification with Feature Bridging and Translation Alignment
Jiuyang Dong ⋅ Jiahan Li ⋅ Junjun Jiang ⋅ Yongbing Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 654
Ultra Diffusion Poser: Diffusion-Based Human Motion Tracking from Sparse Inertial Sensors and Ranging-based Between-sensor Distances
Dominik Hollidt ⋅ Tommaso Bendinelli ⋅ Christian Holz
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 655
Egocentric Visibility-Aware Human Pose Estimation
Peng Dai ⋅ Yu Zhang ⋅ Feng Yiqiang ⋅ ZhenFan Fan ⋅ Yang Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 656
Shoe Style-Invariant and Ground-Aware Learning for Dense Foot Contact Estimation
Daniel Jung ⋅ Kyoung Mu Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 657
OMG-Bench: A New Challenging Benchmark for Skeleton-based Online Micro Hand Gesture Recognition
Haochen Chang ⋅ Pengfei Ren ⋅ Buyuan Zhang ⋅ Da Li ⋅ Tianhao Han ⋅ HaoYang ZHANG ⋅ Liang Xie ⋅ Hongbo Chen ⋅ Erwei Yin
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 658
Recovering Physically Plausible Human-Object Interactions from Monocular Videos
Dingbang Huang ⋅ Etienne Vouga ⋅ Qixing Huang ⋅ Georgios Pavlakos
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 659
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos
Kehong Gong ⋅ Zhengyu Wen ⋅ Xiaoyu He ⋅ Mingxi Xu ⋅ Qi WANG ⋅ ning Zhang ⋅ Zhengyu Li ⋅ Dongze Lian ⋅ Wei Zhao ⋅ He Xiaoyu ⋅ Mingyuan Zhang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 660
TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures
Hyeongjin Nam ⋅ Daniel Jung ⋅ Kyoung Mu Lee
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 661
SHOW3D: Capturing Scenes of 3D Hands and Objects in the Wild
Patrick Rim ⋅ Kevin Harris ⋅ Braden Copple ⋅ Shangchen Han ⋅ Xu Xie ⋅ Ivan Shugurov ⋅ Sizhe An ⋅ He Wen ⋅ Alex Wong ⋅ Tomas Hodan ⋅ Kun He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 662
CrossHOI: Learning Cross-View Representations for Monocular 3D Human-Object Interaction Reconstruction
Pei Geng ⋅ Shanshan Zhang ⋅ Jian Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 663
Gaussian-Mixture Latent Flow for Stochastic 3D Human Motion Prediction
Yue Ma ⋅ Frederick W. B. Li ⋅ Xiaohui Liang
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 664
SGSoft: Learning Fused Semantic-Geometric Features for 3D Shape Correspondence via Template-Guided Soft Signals
Soyeon Yoon ⋅ Chang Wook Seo ⋅ Hyunjung Shim
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 665
Beyond Single-View Sufficiency: CVBench for Cross-View Human Understanding
Tianchen Guo ⋅ Chen Liu ⋅ Xin Yu
[ Poster
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 666
Breaking Spurious Correlations: Uncertainty-Driven Causal Transformers for AU Detection
Yuru Wang ⋅ Yue Zhou
[ Poster