The 2026 schedule is still incomplete
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 1
Breaking Semantic Boundaries: Distribution-Guided Semantic Exploration for Creative Generation
Fu Feng ⋅ Yucheng Xie ⋅ Ruixiao Shi ⋅ Xu Yang ⋅ Jing Wang ⋅ Xin Geng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 2
Guiding a Diffusion Model by Swapping Its Tokens
Weijia Zhang ⋅ Yuehao Liu ⋅ Shanyan Guan ⋅ Wu Ran ⋅ Yanhao Ge ⋅ Wei Li ⋅ Chao Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 3
PixelDiT: Pixel Diffusion Transformers for Image Generation
Yongsheng Yu ⋅ Wei Xiong ⋅ Weili Nie ⋅ Yichen Sheng ⋅ Shiqiu Liu ⋅ Jiebo Luo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 4
SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models
Jiwoo Chung ⋅ Sangeek Hyun ⋅ MinKyu Lee ⋅ Byeongju Han ⋅ Geonho Cha ⋅ Dongyoon Wee ⋅ Youngjun Hong ⋅ Jae-Pil Heo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 5
SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
Yasaman Haghighi ⋅ Alex Alahi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 6
Streaming Diffusion Model for Fast Infrared and Visible Video Fusion
Jinyuan Liu ⋅ Ludan Sun ⋅ Tengyu Ma ⋅ Chunyan Yang ⋅ Zhiying Jiang ⋅ Long Ma ⋅ Risheng Liu ⋅ Xin Fan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 7
ComPose: A Unified Completion-Pose Framework for Robust Category-Level Object Pose Estimation
Huan Ren ⋅ Yihan Chen ⋅ Chuxin Wang ⋅ Nailong Liu ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 8
CoSMo3D: Open-World Promptable 3D Semantic Segmentation through LLM-Guided Canonical Spatial Modeling
Li Jin ⋅ Weikai Chen ⋅ Yujie Wang ⋅ Yingda Yin ⋅ Zeyu HU ⋅ Runze Zhang ⋅ Keyang Luo ⋅ Shengju Qian ⋅ Xin Wang ⋅ Xueying Qin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 9
GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding
Peirong Zhang ⋅ Yidan Zhang ⋅ Luxiao Xu ⋅ Jinliang Lin ⋅ Zonghao Guo ⋅ Fengxiang Wang ⋅ Xue Yang ⋅ Kaiwen Wei ⋅ Lei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 10
RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
Haiyang Mei ⋅ Qiming Huang ⋅ Hai Ci ⋅ Mike Zheng Shou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 11
S^2AM3D: Scale-controllable Part Segmentation of 3D Point Clouds
Han Su ⋅ Tianyu Huang ⋅ Zichen Wan ⋅ Xiaohe Wu ⋅ Wangmeng Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 12
Scalable Multi-View Subspace Clustering with Tensorized Anchor Guidance
Miao Jia ⋅ Xingchen Hu ⋅ Jiyuan Liu ⋅ Siwei Wang ⋅ Min Wang ⋅ Zijian Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 13
3D-LATTE: Latent Space 3D Editing from Textual Instructions
Maria Parelli ⋅ Michael Oechsle ⋅ Michael Niemeyer ⋅ Federico Tombari ⋅ Andreas Geiger
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 14
AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows
Fan Ma ⋅ Fan Ma ⋅ Chengzhuo Gui ⋅ Xiaobo Xia ⋅ Hehe Fan ⋅ Yi Yang ⋅ Tat-seng Chua
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 15
ChordEdit: One-Step Low-Energy Transport for Image Editing
Liangsi Lu ⋅ Xuhang Chen ⋅ Minzhe Guo ⋅ Shichu Li ⋅ Jingchao Wang ⋅ Yang Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 16
Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Yihao Luo ⋅ Xianglong He ⋅ Chuanyu Pan ⋅ Yiwen Chen ⋅ Jiaqi Wu ⋅ Yangguang Li ⋅ Wanli Ouyang ⋅ Yuanming Hu ⋅ Guang Yang ⋅ Choon Hwai Yap
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 17
Native and Compact Structured Latents for 3D Generation
Jianfeng XIANG ⋅ Xiaoxue Chen ⋅ Sicheng Xu ⋅ Ruicheng Wang ⋅ Zelong Lv ⋅ Yu Deng ⋅ Hongyuan Zhu ⋅ Yue Dong ⋅ Hao Zhao ⋅ Nicholas Jing Yuan ⋅ Jiaolong Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 18
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
Arman Zarei ⋅ Samyadeep Basu ⋅ Mobina Pournemat ⋅ Sayan Nag ⋅ Ryan A. Rossi ⋅ Soheil Feizi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 19
Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression
SHIYIN JIANG ⋅ Wei Long ⋅ Minghao Han ⋅ Zhenghao Chen ⋅ Ce Zhu ⋅ Shuhang Gu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 20
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
Rui Xiao ⋅ Sanghwan Kim ⋅ Yongqin Xian ⋅ Zeynep Akata ⋅ Stephan Alaniz
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 21
MDCS-MoAME: Multi-directional Composite Scanning with Mixture of Attention and Mamba Experts for Cancer Survival Prediction
Linjie Qu ⋅ Jin Xiao ⋅ Xiangrong Liu ⋅ Changming Sun ⋅ Hui Cui ⋅ Yuqi Fang ⋅ Ran Su ⋅ Qiangguo Jin ⋅ leyi wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 22
PAS: A Training-Free Stabilizer for Temporal Encoding in Video LLMs
Bowen Sun ⋅ Yujun Cai ⋅ Ming-Hsuan Yang ⋅ Hang Wu ⋅ Yiwei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 23
PAVAS: Physics-Aware Video-to-Audio Synthesis
Oh Hyun-Bin ⋅ Yuhta Takida ⋅ Toshimitsu Uesaka ⋅ Tae-Hyun Oh ⋅ Yuki Mitsufuji
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 24
ProPhy: Progressive Physical Alignment for Dynamic World Simulation
Zijun Wang ⋅ Panwen Hu ⋅ Jing Wang ⋅ Terry Jingchen Zhang ⋅ Yuhao Cheng ⋅ Long Chen ⋅ Yiqiang Yan ⋅ Zutao Jiang ⋅ Hanhui Li ⋅ Xiaodan Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 25
V-DPM: 4D Video Reconstruction with Dynamic Point Maps
Edgar Sucar ⋅ Eldar Insafutdinov ⋅ Zihang Lai ⋅ Andrea Vedaldi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 26
Registration-Free Learnable Multi-View Capture of Faces in Dense Semantic Correspondence
Panagiotis P. Filntisis ⋅ George Retsinas ⋅ Radek Daněček ⋅ Vanessa Sklyarova ⋅ Petros Maragos ⋅ Timo Bolkart
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 27
Mesh4D: 4D Mesh Reconstruction and Tracking from Monocular Video
Zeren Jiang ⋅ Chuanxia Zheng ⋅ Iro Laina ⋅ Diane Larlus ⋅ Andrea Vedaldi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 28
SPE-MVS: Spatial Position Encoding Enhanced Multi-View Stereo with Monocular Depth Priors
Shaoqian Wang ⋅ Jiadai Sun ⋅ Bosen Hou ⋅ Qiang Wang ⋅ Bin Fan ⋅ Bo Li ⋅ Bin Lu ⋅ Yuchao Dai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 29
Block-Sparse Global Attention for Efficient Multi-View Geometry Transformers
Chung-Shien Brian Wang ⋅ Christian Schmidt ⋅ Jens Piekenbrinck ⋅ Bastian Leibe
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 30
SMVRT: Implicit Human 3D Modeling Using Sparse Multi-View Volumetric Reconstruction with Transformer Fusion
Chuanmao Fan ⋅ Chenxi Zhao ⋅ Ye Duan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 31
LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving
Qihao Sun ⋅ Jiarun Liu ⋅ Ziqian Ni ⋅ Jianyun Xu ⋅ Sheng Yang ⋅ Tao Xie ⋅ lijun zhao ⋅ Ruifeng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 32
Any4D: Unified Feed-Forward Metric 4D Reconstruction
Jay Karhade ⋅ Nikhil Keetha ⋅ Yuchen Zhang ⋅ Tanisha Gupta ⋅ Akash Sharma ⋅ Sebastian Scherer ⋅ Deva Ramanan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 33
Co-Me: Confidence Guided Token Merging for Visual Geometric Transformers
Yutian Chen ⋅ Yuheng Qiu ⋅ Ruogu Li ⋅ Jay Patrikar ⋅ Sebastian Scherer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 34
Point4Cast: Streaming Dynamic Scene Reconstruction and Forecasting
Xinhang Liu ⋅ Pedro Miraldo ⋅ Suhas Lohit ⋅ Huaizu Jiang ⋅ Naoko Sawada ⋅ Yu-Wing Tai ⋅ Chi-Keung Tang ⋅ Moitreya Chatterjee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 35
AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend
Hengyi Wang ⋅ Lourdes Agapito
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 36
AlignPose: Generalizable 6D Pose Estimation via Multi-view Feature-metric Alignment
Anna Šárová Mikeštíková ⋅ Médéric Fourmy ⋅ Martin Cífka ⋅ Josef Sivic ⋅ Vladimir Petrik
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 37
Parallelised Differentiable Straightest Geodesics for 3D Meshes
Hippolyte Verninas ⋅ Caner Korkmaz ⋅ Stefanos Zafeiriou ⋅ Tolga Birdal ⋅ Simone Foti
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 38
Geometry-Aligned and Anomaly-Aware Reconstruction for 3D Anomaly Detection
linchun wu ⋅ Qin Zou ⋅ Yuanhao Yue ⋅ Zhongyuan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 39
DVGT: Driving Visual Geometry Transformer
Sicheng Zuo ⋅ Zixun Xie ⋅ Wenzhao Zheng ⋅ Shaoqing Xu ⋅ Fang Li ⋅ Shengyin Jiang ⋅ Long Chen ⋅ Zhi-xin Yang ⋅ Jiwen Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 40
FMPose3D: monocular 3D pose estimation via flow matching
Ti Wang ⋅ Xiaohang Yu ⋅ Mackenzie Weygandt Mathis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 41
MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts
Jingnan Gao ⋅ Zhe Wang ⋅ Xianze Fang ⋅ Xingyu Ren ⋅ Zhuo Chen ⋅ Shengqi Liu ⋅ Yuhao Cheng ⋅ Jiangjing Lyu ⋅ Xiaokang Yang ⋅ Yichao Yan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 42
Foundation Encoders Are All You Need for Preference-Aware Personalization
Hyungjin Kim ⋅ Seokho Ahn ⋅ Young-Duk Seo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 43
Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
Chuancheng Shi ⋅ Shangze Li ⋅ Shiming Guo ⋅ Simiao Xie ⋅ Wenhua Wu ⋅ Jingtong Dou ⋅ Chao Wu ⋅ Canran Xiao ⋅ Cong Wang ⋅ Zifeng Cheng ⋅ Fei Shen ⋅ Tat-seng Chua
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 44
ThinkGen: Generalized Thinking for Visual Generation
Siyu Jiao ⋅ Yiheng Lin ⋅ Yujie Zhong ⋅ Qi She ⋅ Wei zhou ⋅ Xiaohan Lan ⋅ Zilong Huang ⋅ Fei Yu ⋅ Yingchen Yu ⋅ Yunqing Zhao ⋅ Yao Zhao ⋅ Yunchao Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 45
CoLoGen: Progressive Learning of Concept–Localization Duality for Unified Image Generation
YuXin Song ⋅ Yu Lu ⋅ Haoyuan Sun ⋅ Huanjin Yao ⋅ Fanglong Liu ⋅ Yifan Sun ⋅ Haocheng Feng ⋅ Hang Zhou ⋅ Jingdong Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 46
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes
Jing Tan ⋅ Zhaoyang Zhang ⋅ Yantao Shen ⋅ Jiarui Cai ⋅ Shuo Yang ⋅ Jiajun Wu ⋅ Wei Xia ⋅ Zhuowen Tu ⋅ Stefano Soatto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 47
When Safety Collides: Resolving Multi-Category Harmful Conflicts in Text-to-Image Diffusion via Adaptive Safety Guidance
Yongli Xiang ⋅ Ziming Hong ⋅ Zhaoqing Wang ⋅ Xiangyu Zhao ⋅ Bo Han ⋅ Tongliang Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 48
PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards
Shulei Wang ⋅ Longhui Wei ⋅ XIN HE ⋅ Jianbo Ouyang ⋅ Hui Lu ⋅ Zhou Zhao ⋅ Qi Tian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 49
HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation
Xiang Wang ⋅ Zhifei Zhang ⋅ He Zhang ⋅ Zhe Lin ⋅ Yuqian Zhou ⋅ Qing Liu ⋅ Shiwei Zhang ⋅ Yijun Li ⋅ Shaoteng Liu ⋅ Haitian Zheng ⋅ Jason Kuen ⋅ Yuehuan Wang ⋅ Changxin Gao ⋅ Nong Sang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 50
Multimodal Semantic Bias Mitigation for Diverse Text-To-3D Generation
Yukuan Min ⋅ Muli Yang ⋅ Jinhao Zhang ⋅ Yuxuan Wang ⋅ Yihang Zhu ⋅ Jiexi Yan ⋅ Cheng Deng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 51
Visual Personalization Turing Test
Rameen Abdal ⋅ James Burgess ⋅ Sergey Tulyakov ⋅ Kuan-Chieh Jackson Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 52
Composing Concepts from Images and Videos via Concept-prompt Binding
Xianghao Kong ⋅ Zeyu Zhang ⋅ Yuwei Guo ⋅ Zhuoran ZHAO ⋅ Songchun Zhang ⋅ Anyi Rao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 53
Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation
Shihan Cheng ⋅ Nilesh Kulkarni ⋅ David Hyde ⋅ Dmitriy Smirnov
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 54
Semantic Derivative Flow: Graph-Guided Diffusion for Controllable Instance Interactions
Shibin Mei ⋅ Hang Wang ⋅ Bingbing Ni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 55
Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards
Seungwook Kim ⋅ Minsu Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 56
Hierarchical Enhancement of Semantic Priors for Disentangled Text-Driven Motion Generation
Wenhan Lv ⋅ Shaopan Wang ⋅ Xiangyu Wu ⋅ Tianchu Hang ⋅ Zhongquan Jian ⋅ Qingqiang Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 57
Simpleposter: A Simple Baseline For Product Poster Generation
Benlei Cui ⋅ Fangao Zeng ⋅ Weitao Jiang ⋅ Yuwen Zhai ⋅ Haiwen Hong ⋅ Longtao Huang ⋅ Hui Xue ⋅ Wenxiang Shang ⋅ Pipei Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 58
Prompt Yourself: Awakening Textual Semantics in 1D Visual Tokenizers
hualiang wang ⋅ Siming Fu ⋅ Weinan Jia ⋅ Yuning Lu ⋅ Mu Liu ⋅ Jidong Jiang ⋅ Xiaomeng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 59
SkyReels-Text: Fine-Grained Font-Controllable Text Editing for Poster Design
Yunjie Yu ⋅ Jingchen Wu ⋅ Junchen Zhu ⋅ Chunze Lin ⋅ Guibin Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 60
Image Generation from Contextually-Contradictory Prompts
Saar Huberman ⋅ Or Patashnik ⋅ Omer Dahary ⋅ Ron Mokady ⋅ Daniel Cohen-Or
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 61
PromptEnhancer: Taming Your Rewriter for Text-to-Image Generation via Fine-Grained Reward
Linqing Wang ⋅ zhiyong xu ⋅ XiMing Xing ⋅ YIJI CHENG ⋅ Zhiyuan Zhao ⋅ Donghao Li ⋅ Tiankai Hang ⋅ Zhenxi Li ⋅ Jiale Tao ⋅ Qixun Wang ⋅ Ruihuang Li ⋅ Comi Chen ⋅ Xin LI ⋅ Mingrui Wu ⋅ Xinchi Deng ⋅ Shuyang Gu ⋅ Chunyu Wang ⋅ qinglin lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 62
Aligning Text, Images and 3D Structure Token-by-Token
Aadarsh Sahoo ⋅ Vansh Tibrewal ⋅ Georgia Gkioxari
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 63
RefTon: Reference person shot assist virtual Try-on
Liuzhuozheng Li ⋅ Yue Gong ⋅ Shanyuan Liu ⋅ Zanyi Wang ⋅ Dengyang Jiang ⋅ Liebucha Wu ⋅ Bo Cheng ⋅ Yuhang Ma ⋅ Dawei Leng ⋅ Yuhui Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 64
GaussianVision: Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting
Yasmine Omri ⋅ Connor Ding ⋅ Tsachy Weissman ⋅ Thierry Tambe
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 65
Copy-Transform-Paste: Zero-Shot Object-Object Alignment Guided by Vision-Language and Geometric Constraints
Rotem Gatenyo ⋅ Ohad Fried
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 66
Gravitation-Driven Semantic Alignment for Text Video Retrieval
Yi YANG ⋅ Zheng Wang ⋅ Xing Xu ⋅ Jingkuan Song ⋅ Heng Tao Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 67
MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models
Dohwan Ko ⋅ Jinyoung Park ⋅ Seoung Choi ⋅ Sanghyeok Lee ⋅ Seohyun Lee ⋅ Hyunwoo J. Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 68
M^3KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation
Hyeongcheol Park ⋅ Jiyoung Seo ⋅ Jaewon Mun ⋅ Hogun Park ⋅ Wonmin Byeon ⋅ Sung June Kim ⋅ Hyeonsoo Im ⋅ JeungSub Lee ⋅ Sangpil Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 69
Evolutionary Multimodal Reasoning via Hierarchical Semantic Representation for Intent Recognition
Qianrui Zhou ⋅ Hua Xu ⋅ Yunjin Gu ⋅ Yifan Wang ⋅ Songze Li ⋅ Hanlei Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 70
ReFAct: Empowering Multimodal Web Agents with Visual and Context Focusing
Rui Wu ⋅ Shuo Zhang ⋅ Xiaoxuan Tang ⋅ Ruirui Zhang ⋅ Yi Liu ⋅ Tao Jiang ⋅ Wenhao Xu ⋅ Yong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 71
PersonaVLM: Long-Term Personalized Multimodal LLMs
Chang Nie ⋅ Chaoyou Fu ⋅ Yi-Fan Zhang ⋅ Haihua Yang ⋅ Caifeng Shan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 72
MR-RAG: Multimodal Relevance-Aware Retrieval-Augmented Generation for Medical Visual Question Answering
Xuze Li ⋅ Haozhao Wang ⋅ Zhenyu Huang ⋅ Zhongxu Wang ⋅ Zhang Jinghua ⋅ Ruixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 73
Decoupling Stability and Plasticity for Multi-Modal Test-Time Adaptation
Yongbo He ⋅ Zirun Guo ⋅ Tao Jin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 74
CUE: Concept-Aware Multi-Label Expansion to Mitigate Concept Confusion in Long-Tailed Learning
Ruichi Zhang ⋅ Chikai Shang ⋅ jiacheng yang ⋅ Mengke Li ⋅ Yang Zhou ⋅ Junlong Gao ⋅ Yang Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 75
Energy Waveify and Redistribution for Test-Time Adaptation: A Control System Perspective
Zhenbin Wang ⋅ Lei Zhang ⋅ Lituan Wang ⋅ Zhenwei Zhang ⋅ Guangwu Qian ⋅ Yan Wang ⋅ Wei Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 76
CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection
Youngjun Song ⋅ Hyeongyu Kim ⋅ Dosik Hwang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 77
CoFiDA-M: Concept-Aware Feature Modulation for Cross-Domain Adaptation with Image-Only Inference
Nurjahan Sultana ⋅ Moi Hoon Yap ⋅ Xinqi Fan ⋅ Wenqi Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 78
Towards Multimodal Domain Generalization with Few Labels
Hongzhao Li ⋅ Hao Dong ⋅ Hualei Wan ⋅ Shupan Li ⋅ Mingliang Xu ⋅ Muhammad Haris Khan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 79
Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning
ZHENYU ZHANG ⋅ Guangyao Chen ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 80
Event6D: Event-based Novel Object 6D Pose Tracking
Jae-Young Kang ⋅ Hoonhee Cho ⋅ Taeyeop Lee ⋅ Minjun Kang ⋅ Bowen Wen ⋅ Youngho Kim ⋅ Kuk-Jin Yoon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 81
EV-CGNet: Co-visible Focused 3D-guided 2D Event Keypoint Detection Network
Yuan Gao ⋅ Tianle Ding ⋅ Yuqing Zhu ⋅ Tianzhu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 82
AE2VID: Event-based Video Reconstruction via Aperture Modulation
Chenxu Bai ⋅ Boyu Li ⋅ Peiqi Duan ⋅ xinyu zhou ⋅ Hanyue Lou ⋅ Boxin Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 83
From Contrast to Consistency: Rethinking Event-based Continuous-Time Optical Flow Estimation
rui hu ⋅ Song Wu ⋅ Wen Yang ⋅ Jinjian Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 84
Spike-driven Discrete Aggregation for Event-based Object Detection
Huaning Li ⋅ Ziming Wang ⋅ Runhao Jiang ⋅ Yan Rui ⋅ Huajin Tang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 85
x^2-Fusion: Cross-Modality and Cross-Dimension Flow Estimation in Event Edge Space
Ruishan Guo ⋅ Ciyu Ruan ⋅ Haoyang Wang ⋅ Zihang GONG ⋅ Jingao Xu ⋅ Xinlei Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 86
FloVerse: Floor Plan-Guided Multi-Modal Navigation
weiqi Huang ⋅ Shuangyi Dong ⋅ Jiaxin Li ⋅ Yifei Guo ⋅ Zan Wang ⋅ Wei Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 87
TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation
Yiyao Wang ⋅ Sixian Zhang ⋅ Keming Zhang ⋅ Xinhang Song ⋅ Songjie Du ⋅ Shuqiang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 88
History to Future: Evolving Agent with Experience and Thought for Zero-shot Vision-and-Language Navigation
Guangzhao Dai ⋅ Shuo Wang ⋅ Zihan Wang ⋅ Guo-Sen Xie ⋅ Yang Yang ⋅ Jinshan Pan ⋅ Qianru Sun ⋅ Xiangbo Shu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 89
DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration
Jinzhou Tang ⋅ Fan Feng ⋅ Minghao Fu ⋅ Wenjun Lin ⋅ Jing Yang ⋅ Biwei Huang ⋅ Keze Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 90
Beyond Scanpaths: Graph-Based Gaze Simulation in Dynamic Scenes
Luke Palmer ⋅ Petar Palasek ⋅ Hazem Abdelkawy
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 91
CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning
Zhenquan Yao ⋅ Zitong Huang ⋅ yihan zeng ⋅ Jianhua Han ⋅ Hang Xu ⋅ Chun-Mei Feng ⋅ Jianwei Ma ⋅ Wangmeng Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 92
Rethinking Visual Rearrangement from A Diffusion Perspective
Tianliang Qi ⋅ Xinhang Song ⋅ Yuyi Liu ⋅ Shuqiang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 93
APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation
Daoxuan Zhang ⋅ Ping Chen ⋅ Xiaobo Xia ⋅ Xiu Su ⋅ Ruichen Zhen ⋅ Jianqiang Xiao ⋅ Shuo Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 94
Bridging the 2D-3D Gap: A Hierarchical Semantic-Geometric Map for Vision Language Navigation
Kailing Li ⋅ Tianwen Qian ⋅ Lijin Yang ⋅ Yuqian Fu ⋅ Jingyu Gong ⋅ Xiaoling Wang ⋅ Liang He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 95
InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs
Bin Li ⋅ Ruichi Zhang ⋅ Han Liang ⋅ Jingyan Zhang ⋅ Juze Zhang ⋅ Xin Chen ⋅ Lan Xu ⋅ Jingyi Yu ⋅ Jingya Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 96
When Robots Should Say "I Don’t Know": Benchmarking Abstention in Embodied Question Answering
Tao Wu ⋅ Chuhao Zhou ⋅ Guangyu Zhao ⋅ Haozhi Cao ⋅ Yewen Pu ⋅ Jianfei Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 97
RoboAgent: Chaining Basic Capabilities for Embodied Task Planning
Peiran Xu ⋅ Jiaqi Zheng ⋅ Yadong Mu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 98
Towards Training-free Scene Text Editing
Yubo Li ⋅ Xugong Qin ⋅ peng zhang ⋅ Hailun Lin ⋅ Gangyan Zeng ⋅ Kexin Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 99
VINS-120K: Ultra High-Resolution Image Editing with A Large-Scale Dataset
Zhizhou Chen ⋅ Shanyan Guan ⋅ Zhanxin Gao ⋅ En Ci ⋅ Yanhao Ge ⋅ Wei Li ⋅ Zhenyu Zhang ⋅ Jian Yang ⋅ Ying Tai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 100
ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding
Shuo Cao ⋅ Nan Ma ⋅ Jiayang Li ⋅ Xiaohui Li ⋅ Lihao Shao ⋅ Kaiwen Zhu ⋅ Yu Zhou ⋅ Yuandong Pu ⋅ Jiarui Wu ⋅ Jiaquan Wang ⋅ Bo Qu ⋅ Wenhai Wang ⋅ Yu Qiao ⋅ Dajuin Yao ⋅ Yihao Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 101
Charge: A Comprehensive Novel View Synthesis Benchmark and Dataset to Bind Them All
Michal Nazarczuk ⋅ Thomas Tanay ⋅ Arthur Moreau ⋅ Zhensong Zhang ⋅ Eduardo Pérez-Pellitero
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 102
Region-Wise Correspondence Prediction between Manga Line Art Images
Yingxuan Li ⋅ Jiafeng Mao ⋅ Qianru Qiu ⋅ Yusuke Matsui
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 103
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Wei Chow ⋅ Jiachun Pan ⋅ Yongyuan Liang ⋅ Mingze Zhou ⋅ Xue Song ⋅ Liyu Jia ⋅ Saining Zhang ⋅ Siliang Tang ⋅ Juncheng Li ⋅ Fengda Zhang ⋅ Weijia Wu ⋅ Hanwang Zhang ⋅ Tat-seng Chua
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 104
I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models
Juntong Wang ⋅ Wang Jiarui ⋅ Huiyu Duan ⋅ Jiaxiang Kang ⋅ Guangtao Zhai ⋅ Xiongkuo Min
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 105
TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens
Jiawei Ren ⋅ Michal Tyszkiewicz ⋅ Jiahui Huang ⋅ Žan Gojčič
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 106
Hermite Radial Basis Function for Surface Reconstruction via Differentiable Rendering
Hugo Blanc ⋅ Jean-Emmanuel Deschaud ⋅ Alexis Paljic
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 107
RF4D: Neural Radar Fields for Novel View Synthesis in Outdoor Dynamic Scenes
Jiarui Zhang ⋅ Zhihao Li ⋅ Chong Wang ⋅ Bihan Wen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 108
Voxify3D: Pixel Art Meets Volumetric Rendering
Yi-Chuan Huang ⋅ Jiewen Chan ⋅ Hao-Jen Chien ⋅ Yu-Lun Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 109
Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs
Hiran Sarkar ⋅ Liming Kuang ⋅ Yordanka Velikova ⋅ Benjamin Busam
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 110
FluidGaussian: Propagating Simulation-Based Uncertainty Toward Functionally-Intelligent 3D Reconstruction
Yuqiu Liu ⋅ Jialin Song ⋅ Marissa Ramirez de Chanlatte ⋅ Rochishnu Chowdhury ⋅ Rushil Paresh Desai ⋅ Wuyang Chen ⋅ Daniel Martin ⋅ Michael Mahoney
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 111
GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
Liyuan Zhu ⋅ Manjunath Narayana ⋅ Michal Stary ⋅ Will Hutchcroft ⋅ Gordon Wetzstein ⋅ Iro Armeni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 112
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis
Stanislaw Szymanowicz ⋅ Minghao Chen ⋅ Jianyuan Wang ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 113
Turbo-GS: Accelerating 3D Gaussian Fitting for High-Resolution Radiance Fields
Ankit Dhiman ⋅ Tao Lu ⋅ Srinath Ravi ⋅ Emre Arslan ⋅ Angela Xing ⋅ Yuanbo Xiangli ⋅ R. Venkatesh Babu ⋅ Srinath Sridhar
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 114
BiProLoRA: Bilevel Prompt LoRA for Real Scene Recovery
Nan An ⋅ Long Ma ⋅ Tengyu Ma ⋅ Zhu Liu ⋅ Yingchi Liu ⋅ Risheng Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 115
Degradation-Consistent Test-Time Adaptation for All-in-One Image Restoration
Ni Tang ⋅ Shenghao nie ⋅ Xiaotong Luo ⋅ Yuan Xie ⋅ Yanyun Qu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 116
CanonCGT: Reference-Based Color Grading via Canonical Pivot Representation
JINWON KO ⋅ Keunsoo Ko ⋅ Chang-Su Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 117
2-Shots in the Dark: Low-Light Denoising with Minimal Data Acquisition
Liying Lu ⋅ Raphael Achddou ⋅ Sabine Süsstrunk
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 118
Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration
I-Hsiang (Aaron) Chen ⋅ Isma Hadji ⋅ Enrique Sanchez ⋅ Adrian Bulat ⋅ Sy-Yen Kuo ⋅ Radu Timofte ⋅ Georgios Tzimiropoulos ⋅ Brais Martinez
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 119
It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal
lishen qu ⋅ Shihao Zhou ⋅ Jie Liang ⋅ Hui Zeng ⋅ Lei Zhang ⋅ Jufeng Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 120
Scan Clusters, Not Pixels: A Cluster-Centric Paradigm for Efficient Ultra-high-definition Image Restoration
Chen Wu ⋅ Ling Wang ⋅ Zhuoran Zheng ⋅ Yuning Cui ⋅ Zhixiong Yang ⋅ Xiangyu Chen ⋅ Yue Zhang ⋅ Weidong Jiang ⋅ Jingyuan Xia
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 121
Seeing Beyond 8bits: Subjective and Objective Quality Assessment of HDR-UGC Videos
SHRESHTH SAINI ⋅ Bowen Chen ⋅ Yilin Wang ⋅ Neil Birkbeck ⋅ Balu Adsumilli ⋅ Alan C.
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 122
Dynamic Exposure Burst Image Restoration
Woohyeok Kim ⋅ Jaesung Rim ⋅ Daeyeon Kim ⋅ Sunghyun Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 123
FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration
Jingren Liu ⋅ Shuning Xu ⋅ Qirui Yang ⋅ Yun wang ⋅ Xiangyu Chen ⋅ Zhong Ji
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 124
ColorFLUX: A Structure-Color Decoupling Framework for Old Photo Colorization
Bingchen Li ⋅ Zhixin Wang ⋅ Fan Li ⋅ Jiaqi Xu ⋅ Jiaming Guo ⋅ Renjing Pei ⋅ Xin Li ⋅ Zhibo Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 125
VEMamba: Efficient Isotropic Reconstruction of Volume Electron Microscopy with Axial-Lateral Consistent Mamba
Longmi Gao ⋅ Pan Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 126
Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models
Karim Kadry ⋅ Abdalla Abdelwahed ⋅ Ajay Manicka ⋅ Naravich Chutisilp ⋅ Farhad R. Nezami ⋅ Elazer R Edelman
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 127
EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy
Yumeng He ⋅ Zanwei Zhou ⋅ Yekun Zheng ⋅ Chen Liang ⋅ Yunbo Wang ⋅ Xiaokang Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 128
Underground Plant Exploration: Non-Destructive 3D Root Assessment with GPR Based on Point Graph Neural Network
Yuwei Zhou ⋅ Guoyu Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 129
Uni-Encoder Meets Multi-Encoders: Representation Before Fusion for Brain Tumor Segmentation with Missing Modalities
Peibo Song ⋅ Xiaotian Xue ⋅ Jinshuo Zhang ⋅ zihao wang ⋅ Jinhua liu ⋅ Shujun Fu ⋅ Fangxun Bao ⋅ Si Yong Yeo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 130
MicroFM: Physics-guided Flow Matching for Isotropic Microscopy Reconstruction
Xingzu Zhan ⋅ Runmin Jiang ⋅ Vatsal Gupta ⋅ Tanush Swaminathan ⋅ Yanwen Wang ⋅ Genpei Zhang ⋅ Haili Wang ⋅ Min Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 131
Dynamic Stream Network for Combinatorial Explosion Problem in Deformable Medical Image Registration
Shaochen Bi ⋅ Yuting He ⋅ Weiming Wang ⋅ Hao Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 132
PMRNet: Physics-informed Multi-scale Refinement Network for Medical Image Segmentation
Boce Kang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 133
Towards Robust Vision Transformers: Path Dependency Analysis and a Simple Two-Stage Adversarial Training
Seongmin Kim ⋅ Byung Cheol Song
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 134
PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention
Hefei Mei ⋅ Zirui Wang ⋅ Chang Xu ⋅ Jianyuan Guo ⋅ Minjing Dong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 135
When CLIP Sees More, It Fights Back Harder: Multi-View Guided Adaptive Counterattacks for Test-Time Adversarial Robustness
Sunoh Kim ⋅ Daeho Um
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 136
Hidden Dangers of Compositional Generation: Diagnosing Semantic Safety Failures in Text-to-Image Models
Haoming Yang ⋅ Ke Ma ⋅ ligonf zhang ⋅ Xiaojun Jia ⋅ Yingfei Sun ⋅ Qianqian Xu ⋅ Qingming Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 137
VisiLock: Authorizing Instruction-based Image editing with Dual Score Distillation
Van Thanh ⋅ Yun Fu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 138
JANUS: A Lightweight Framework for Jailbreaking Text-to-Image Models via Distribution Optimization
Haolun Zheng ⋅ Yu He ⋅ Tailun Chen ⋅ Shuo Shao ⋅ Zhixuan Chu ⋅ Hongbin zhou ⋅ Lan Tao ⋅ Zhan Qin ⋅ Kui Ren
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 139
GenBreak: Red Teaming Text-to-Image Generation Using Large Language Models
Zilong Wang ⋅ Xiang Zheng ⋅ Xiaosen Wang ⋅ Bo Wang ⋅ Xingjun Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 140
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Zhiheng Liu ⋅ Weiming Ren ⋅ Haozhe Liu ⋅ Zijian Zhou ⋅ Shoufa Chen ⋅ Haonan Qiu ⋅ Xiaoke Huang ⋅ Zhaochong An ⋅ Fanny Yang ⋅ Aditya Patel ⋅ Viktar Atliha ⋅ Tony Ng ⋅ Xiao Han ⋅ Chuyan Zhu ⋅ Chenyang Zhang ⋅ Ding Liu ⋅ Juan-Manuel Pérez-Rúa ⋅ Sen He ⋅ Jürgen Schmidhuber ⋅ Wenhu Chen ⋅ Ping Luo ⋅ Wei Liu ⋅ Tao Xiang ⋅ Jonas Schult ⋅ Yuren Cong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 141
Generate, Analyze, and Refine: Training-Free Sound Source Localization via MLLM Meta-Reasoning
Subin Park ⋅ Jung Uk Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 142
MMCP-GEN: A Modality-Extensible Diffusion Language Model for Conditional Protein Sequence Generation
Zeyu An ⋅ Wanyu Lin ⋅ Feng Tan ⋅ Shujun Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 143
Few-shot Acoustic Synthesis with Multimodal Flow Matching
Amandine Brunetto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 144
CLIP-like Model as a Foundational Density Ratio Estimator
Fumiya Uchiyama ⋅ Rintaro Yanagi ⋅ Shohei Taniguchi ⋅ Shota Takashiro ⋅ Masahiro Suzuki ⋅ Hirokatsu Kataoka ⋅ Yusuke Iwasawa ⋅ Yutaka Matsuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 145
Learning What Matters: Prioritized Concept Learning via Relative Error-driven Sample Selection
Qian Yang ⋅ Shivam Chandhok ⋅ Oscar Mañas ⋅ Kanishk Jain ⋅ Aishwarya Agrawal ⋅ Leonid Sigal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 146
EgoAVU: Egocentric Audio-Visual Understanding
Ashish Seth ⋅ Xinhao Mei ⋅ Changsheng Zhao ⋅ Varun Nagaraja ⋅ Ernie Chang ⋅ Gregory P. Meyer ⋅ Gael Le Lan ⋅ Yunyang Xiong ⋅ Vikas Chandra ⋅ Yangyang Shi ⋅ Dinesh Manocha ⋅ zhipeng cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 147
Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs
Jinqi Luo ⋅ Jinyu Yang ⋅ Tal Neiman ⋅ Lei Fan ⋅ Bing Yin ⋅ Son Dinh Tran ⋅ Mubarak Shah ⋅ Rene Vidal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 148
Multimodal Protein Language Models for Enzyme Kinetic Parameters: From Substrate Recognition to Conformational Adaptation
Fei Wang ⋅ Xinye Zheng ⋅ Kun Li ⋅ Yanyan Wei ⋅ Yuxin Liu ⋅ Ganpeng Hu ⋅ Tong Bao ⋅ Jingwen Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 149
Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models
Christian Simon ⋅ Masato Ishii ⋅ Wei-Yao Wang ⋅ Koichi Saito ⋅ Akio Hayakawa ⋅ Dongseok Shim ⋅ Zhi Zhong ⋅ Shuyang Cui ⋅ Takashi Shibuya ⋅ Shusuke Takahashi ⋅ Yuki Mitsufuji
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 150
Adaptive Confidence Regularization for Multimodal Failure Detection
Moru Liu ⋅ Hao Dong ⋅ Olga Fink ⋅ Mario Trapp
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 151
Factorize, Reconstruct, Enhance: A Unified Framework for Multimodal Sentiment Analysis
Zhilu Yang ⋅ Mingcheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 152
PhenoYieldNet: Learning Crop-Aware Phenological Responses for Multi-Crop Yield Prediction
Yu Luo ⋅ Xiaogang Zhu ⋅ Shan Zeng ⋅ Wei Xiang ⋅ Thomas Francis Bishop ⋅ Zhiyong Wang ⋅ Kun Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 153
Conflict-Aware Adaptive Cross-Reconstruction for Multimodal Sentiment Analysis
Yan Wang ⋅ Fuyuan Cao ⋅ Xingwang Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 154
EduDiag: A Benchmark for Educational Diagnostic Reasoning with Error Tracing and Correction on Large Multimodal Models
Jiali Chen ⋅ Yuqi Xue ⋅ Xusen Hei ⋅ DingBa Fu ⋅ wei yuancheng ⋅ Jiayuan Xie ⋅ Yi Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 155
UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark
Yanlin Li ⋅ Minghui Guo ⋅ Kaiwen Zhang ⋅ Shize Zhang ⋅ Yiran Zhao ⋅ Haodong Li ⋅ Congyue Zhou ⋅ Weijie Zheng ⋅ Yushen Yan ⋅ Shengqiong Wu ⋅ Wei Ji ⋅ Lei Cui ⋅ Furu Wei ⋅ Hao Fei ⋅ Mong-Li Lee ⋅ Wynne Hsu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 156
Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement
Chunlei Zhang ⋅ Jiahao Xia ⋅ Yun Xiao ⋅ Bo Jiang ⋅ Liying Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 157
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
Jovana Kondic ⋅ Pengyuan Li ⋅ Dhiraj Joshi ⋅ Isaac Sanchez ⋅ Ben wiesel ⋅ Shafiq Abedin ⋅ Amit Alfassy ⋅ Eli Schwartz ⋅ Daniel Caraballo ⋅ Yagmur Gizem Cinar ⋅ Florian Scheidegger ⋅ Steven I. Ross ⋅ Daniel Karl I. Weidele ⋅ Hang Hua ⋅ Ekaterina Arutyunova ⋅ Roei Herzig ⋅ Zihan Wang ⋅ Xinyue Yu ⋅ Yunfei Zhao ⋅ Sicong Jiang ⋅ Minghao Liu ⋅ Qunshu Lin ⋅ Aude Oliva ⋅ Rogerio Feris
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 158
Cross-Modal Guided Visual Synthesis for Data-Efficient Multimodal Depression Recognition
Shanliang Yang ⋅ Xiaoxiao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 159
AffordGrasp: Cross-Modal Diffusion for Affordance-Aware Grasp Synthesis
Xiaofei Wu ⋅ Yi Zhang ⋅ Yumeng Liu ⋅ Yuexin Ma ⋅ Yujiao Shi ⋅ Xuming He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 160
PAM: A Pose–Appearance–Motion Engine for Sim-to-Real HOI Video Generation
Mingju Gao ⋅ Kaisen Yang ⋅ Huan-ang Gao ⋅ Bohan Li ⋅ Ao Ding ⋅ Wenyi Li ⋅ Yangcheng Yu ⋅ Jinkun Liu ⋅ Shaocong Xu ⋅ Yike Niu ⋅ Haohan Chi ⋅ Hao Chen ⋅ Hao Tang ⋅ Yu Zhang ⋅ Li Yi ⋅ Hao Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 161
AffordGen: Generating Diverse Demonstrations for Generalizable Object Manipulation with Affordance Correspondence
Jiawei Zhang ⋅ Kaizhe Hu ⋅ Yingqian Huang ⋅ Yuanchen Ju ⋅ Zhengrong Xue ⋅ Huazhe Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 162
HandWorld: Hand-Centric Unified Video Action Generation
Zhihao Sun ⋅ Zhiying Du ⋅ Xitong Yang ⋅ Zuxuan Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 163
HVG-3D: Bridging Real and Simulation Domains for 3D-Conditional Hand-Object Interaction Video Synthesis
Mingjin Chen ⋅ Junhao Chen ⋅ Zhaoxin Fan ⋅ Yujian Lee ⋅ Zichen Dang ⋅ Lili Wang ⋅ Yawen Cui ⋅ Lap-Pui Chau ⋅ Yi Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 164
ArtHOI: Taming Foundation Models for Monocular 4D Reconstruction of Hand-Articulated-Object Interactions
Zikai Wang ⋅ Zhilu Zhang ⋅ Yiqing Wang ⋅ Hui Li ⋅ Wangmeng Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 165
LAM: Language Articulated Object Modelers
Yipeng Gao ⋅ Yunhao Ge ⋅ Peilin Cai ⋅ Daniel Seita ⋅ Laurent Itti
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 166
Haptic Neural Fields: Bringing Tactile Interactions to 3D Rendered Scenes
Antonio Luigi Stefani ⋅ Niccolò Bisagno ⋅ Nicola Conci ⋅ Eckehard Steinbach ⋅ Francesco De Natale
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 167
Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation
Haodong Yan ⋅ Hang Yu ⋅ Zhide Zhong ⋅ Weilin Yuan ⋅ Xin Gong ⋅ Zehang Luo ⋅ Chengxi Heyu ⋅ Junfeng Li ⋅ Wenxuan Song ⋅ Shunbo Zhou ⋅ Haoang Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 168
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing
Runjia Li ⋅ Moayed Haji Ali ⋅ Ashkan Mirzaei ⋅ Chaoyang Wang ⋅ Arpit Sahni ⋅ Ivan Skorokhodov ⋅ Aliaksandr Siarohin ⋅ Tomas Jakab ⋅ Junlin Han ⋅ Sergey Tulyakov ⋅ Philip H.S. Torr ⋅ Willi Menapace
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 169
From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition
Jingxi Chen ⋅ Yixiao Zhang ⋅ Xiaoye qian ⋅ Zongxia Li ⋅ Cornelia Fermuller ⋅ Caren Chen ⋅ Yiannis Aloimonos
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 170
Temporal Equilibrium MeanFlow: Bridging the Scale Gap for One-Step Generation
Yuanpeng Tu ⋅ Yunpeng Chen ⋅ Xinyu Zhang ⋅ Chao Liao ⋅ Hengshuang Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 171
PROMO: Promptable Outfitting for Efficient High-Fidelity Virtual Try-On
Haohua Chen ⋅ Tianze Zhou ⋅ Wei Zhu ⋅ Runqi Wang ⋅ Yandong Guan ⋅ Dejia Song ⋅ Yibo Chen ⋅ Xu Tang ⋅ Yao Hu ⋅ Lu Sheng ⋅ Zhiyong Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 172
Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy
Teng Hu ⋅ Zhentao Yu ⋅ Guozhen Zhang ⋅ Zihan Su ⋅ zhengguang zhou ⋅ Youliang Zhang ⋅ Yuan Zhou ⋅ qinglin lu ⋅ Ran Yi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 173
UniSER: A Foundation Model for Unified Soft Effects Removal
Jingdong Zhang ⋅ Lingzhi Zhang ⋅ Qing Liu ⋅ Mang Tik Chiu ⋅ Connelly Barnes ⋅ Yizhou Wang ⋅ Haoran You ⋅ Xiaoyang Liu ⋅ Yuqian Zhou ⋅ Zhe Lin ⋅ Eli Shechtman ⋅ Sohrab Amirghodsi ⋅ Xin Li ⋅ Wenping Wang ⋅ Xiaohang Zhan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 174
EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
Shiyuan Yang ⋅ Ruihuang Li ⋅ Jiale Tao ⋅ Shuai Shao ⋅ qinglin lu ⋅ Jing Liao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 175
Inference-time Physics Alignment of Video Generative Models with Latent World Models
Jianhao Yuan ⋅ Zhang Xiaofeng ⋅ Felix Friedrich ⋅ Nicolas Beltran-Velez ⋅ Melissa Hall ⋅ Reyhane Askari ⋅ Xiaofeng Zhang ⋅ Nicolas Ballas ⋅ Michal Drozdzal ⋅ Adriana Romero-Soriano
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 176
SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
Xuancheng Xu ⋅ Li Yaning ⋅ Sisi You ⋅ Bing-Kun Bao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 177
Plenoptic Video Generation
Xiao Fu ⋅ Shitao Tang ⋅ Min Shi ⋅ Xian Liu ⋅ Jinwei Gu ⋅ Ming-Yu Liu ⋅ Dahua Lin ⋅ Chen-Hsuan Lin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 178
PyramidalWan: On Making Pretrained Video Model Pyramidal for Efficient Inference
Denis Korzhenkov ⋅ Adil Karjauv ⋅ Animesh Karnewar ⋅ Mohsen Ghafoorian ⋅ Amir Habibian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 179
AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
Yan Li ⋅ Changyao TIAN ⋅ Renqiu Xia ⋅ Ning Liao ⋅ Weiwei Guo ⋅ Hongsheng Li ⋅ Jifeng Dai ⋅ Hao Li ⋅ Xue Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 180
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Zhaochong An ⋅ Menglin Jia ⋅ Haonan Qiu ⋅ Zijian Zhou ⋅ Xiaoke Huang ⋅ Zhiheng Liu ⋅ Weiming Ren ⋅ Kumara Kahatapitiya ⋅ Ding Liu ⋅ Sen He ⋅ Chenyang Zhang ⋅ Tao Xiang ⋅ Fanny Yang ⋅ Serge Belongie ⋅ Tian Xie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 181
Flowception: Temporally Expansive Flow Matching for Video Generation
Tariq Berrada Ifriqi ⋅ John Nguyen ⋅ Karteek Alahari ⋅ Jakob Verbeek ⋅ Ricky T. Q. Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 182
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
Shengming Yin ⋅ Zekai Zhang ⋅ Zecheng Tang ⋅ Kaiyuan Gao ⋅ Xiao Xu ⋅ Kun Yan ⋅ Jiahao Li ⋅ Yilei chen ⋅ Yuxiang Chen ⋅ Heung-Yeung Shum ⋅ Lionel M. Ni ⋅ Junyang Lin ⋅ Chenfei Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 183
Linear Image Generation by Synthesizing Exposure Brackets
Yuekun Dai ⋅ Zhoutong Zhang ⋅ Shangchen Zhou ⋅ Nanxuan Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 184
Low-Resolution Editing is All You Need for High-Resolution Editing
Junsung Lee ⋅ Hyunsoo Lee ⋅ Yong Jae Lee ⋅ Bohyung Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 185
UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
Yanran Zhang ⋅ Wenzhao Zheng ⋅ Yifei Li ⋅ Bingyao Yu ⋅ Yu Zheng ⋅ Lei Chen ⋅ Jiwen Lu ⋅ Jie Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 186
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
ZHOUJIE FU ⋅ Xianfang Zeng ⋅ jinghong lan ⋅ Xinyao Liao ⋅ Chen Cheng ⋅ Junyi Chen ⋅ Jiacheng Wei ⋅ Wei Cheng ⋅ Shiyu Liu ⋅ Yunuo Chen ⋅ Gang Yu ⋅ Guosheng Lin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 187
VENI: Variational Encoder for Natural Illumination
Paul Walker ⋅ James A. D. Gardner ⋅ Andreea Ardelean ⋅ William A. P. Smith ⋅ Bernhard Egger
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 188
SketchAssist: A Practical Assistant for Semantic Edits and Precise Local Redrawing
Han Zou ⋅ Yan Zhang ⋅ Ruiqi Yu ⋅ Cong Xie ⋅ Jie Huang ⋅ Zhan Zhenpeng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 189
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Qinghe Wang ⋅ Xiaoyu Shi ⋅ Baolu Li ⋅ Weikang Bian ⋅ Quande Liu ⋅ Huchuan Lu ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Kun Gai ⋅ Xu Jia
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 190
MoCha: End-to-End Video Character Replacement without Structural Guidance
Zhengbo Xu ⋅ Jie Ma ⋅ Ziheng Wang ⋅ Zhan Peng ⋅ Jun Liang ⋅ Jing Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 191
Negative Binomial Variational Autoencoders for Overdispersed Latent Modeling
Yixuan Zhang ⋅ Jinhao Sheng ⋅ Wenxin Zhang ⋅ Quyu Kong ⋅ Feng Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 192
Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods
Omer Ben Hayun ⋅ Roy Betser ⋅ Meir Yossef Levi ⋅ Levi Kassel ⋅ Guy Gilboa
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 193
VOSR: A Vision-Only Generative Model for Image Super-Resolution
Rongyuan Wu ⋅ Lingchen Sun ⋅ Zhengqiang ZHANG ⋅ Xiangtao Kong ⋅ Jixin Zhao ⋅ Shihao Wang ⋅ Lei Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 194
Dual Graph Regularized Deep Unfolding Network for Guided Depth Map Super-resolution
Zhiwei Zhong ⋅ Peilin CHEN ⋅ Qiangqiang Shen ⋅ Bo Li ⋅ Shiqi Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 195
DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
Zhengyao Lv ⋅ Menghan Xia ⋅ Xintao Wang ⋅ Kwan-Yee K. Wong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 196
VSRELL: A Simple Baseline for Video Super-Resolution and Enhancement in Low-Light Environment
Yanming hui ⋅ Fanhua Shang ⋅ Hongying Liu ⋅ Ben Wang ⋅ Zhenwei Zhang ⋅ Liang Wan ⋅ Wei Feng ⋅ Tong Xue ⋅ Bingqin Lv
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 197
Gradient Knows Best: Mixed-Precision Quantization via Gradient-Guided Bit Allocation for Super-Resolution
Jun Young Kim ⋅ Joo Jeon ⋅ Sangyeon Ahn ⋅ Yoonseo Park ⋅ Yong Oh ⋅ Bogyeong Kim ⋅ Sung In Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 198
Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset
Yang Zou ⋅ Jun Ma ⋅ Zhidong Jiao ⋅ Xingyuan Li ⋅ Zhiying Jiang ⋅ Jinyuan Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 199
Next-Scale Autoregressive Models for Text-to-Motion Generation
Zhiwei Zheng ⋅ Shibo Jin ⋅ Lingjie Liu ⋅ Mingmin Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 200
Push-and-Step: From RL-Based Balance Recovery to Physical Simulation of Dense Crowds
Alexis Jensen ⋅ Pei Xu ⋅ Ioannis Karamouzas ⋅ Charles Pontonnier ⋅ Julien Pettré
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 201
Iterative Closed-Loop Motion Synthesis for Scaling the Capabilities of Humanoid Control
Weisheng Xu ⋅ Qiwei Wu ⋅ Jiaxi Zhang ⋅ Jing Tan ⋅ Yangfan Li ⋅ Yuetong Fang ⋅ Jiaqi Xiong ⋅ Kai Wu ⋅ Rong OU ⋅ Renjing Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 202
RoMo: A Large-Scale, Richly Organized Dataset and Semantic Taxonomy for Human Motion Generation
Jiahao Zhang ⋅ Joseph Liu ⋅ Young-Yoon Lee ⋅ Seonghyeon Moon ⋅ Victor Zordan ⋅ Guy Tevet ⋅ C. Karen Liu ⋅ Stephen Gould ⋅ Oren Jacob ⋅ Haomiao Jiang ⋅ Mubbasir Kapadia ⋅ Yizhak Ben-Shabat
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 203
FrankenMotion: Part-level Human Motion Generation and Composition
Chuqiao Li ⋅ Xianghui Xie ⋅ Yong Cao ⋅ Andreas Geiger ⋅ Gerard Pons-Moll
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 204
HSI-GPT2: A Dual-Granularity Large Motion Reasoning Model with Diffusion Refinement for Human–Scene Interaction
Yuan Wang ⋅ LI XIANG ⋅ Yali Li ⋅ XUEGE HOU ⋅ Shengjin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 205
SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens
Anindita Ghosh ⋅ Vladislav Golyanik ⋅ Taku Komura ⋅ Philipp Slusallek ⋅ Christian Theobalt ⋅ Rishabh Dabral
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 206
Progressive Guessing to Fixed Point: Rethinking Human Motion Prediction with Deep Equilibrium Models
Dong Wei ⋅ Huaijiang Sun ⋅ Fan Liu ⋅ Yuhui Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 207
Archon: A Unified Multimodal Model for Holistic Digital Human Generation
Chong Bao ⋅ Shichen Liu ⋅ Lijun Yu ⋅ David Futschik ⋅ Stylianos Moschoglou ⋅ Shefali Srivastava ⋅ Ziqian Bai ⋅ Feitong Tan ⋅ Guofeng Zhang ⋅ Zhaopeng Cui ⋅ Sean Fanello ⋅ Yinda Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 208
ReMoGen: Real-time Human Interaction-to-Reaction Generation via Modular Learning from Diverse Data
Yaoqin Ye ⋅ Yiteng Xu ⋅ Qin Sun ⋅ Xinge Zhu ⋅ YUJING SUN ⋅ Yuexin Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 209
Towards Motion Turing Test: Evaluating Human-Likeness in Humanoid Robots
Mingzhe Li ⋅ Mengyin Liu ⋅ Zekai Wu ⋅ Xincheng Lin ⋅ Junsheng Zhang ⋅ Ming Yan ⋅ Zengye Xie ⋅ Changwang Zhang ⋅ Chenglu Wen ⋅ Lan Xu ⋅ Siqi Shen ⋅ Cheng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 210
PatchScene: Patch-based Voxel Diffusion Model for Large-Scale Scene Completion
Qingdong Xu ⋅ Jiajun Zhu ⋅ Shilin Zhu ⋅ Xinjing He ⋅ Chao Lu ⋅ Huanran Wang ⋅ Jiyao Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 211
Prototype-Guided Concept Erasure in Diffusion Models
Yuze Cai ⋅ Jiahao Lu ⋅ Hongxiang Shi ⋅ Yichao Zhou ⋅ Hong Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 212
Any2Any 3D Diffusion Models with Knowledge Transfer: A Radiotherapy Planning Study
Yuhan Wang ⋅ Zihan Li ⋅ Han Liu ⋅ Simon Arberet ⋅ Martin F. Kraus ⋅ Yuyin Zhou ⋅ Florin-Cristian Ghesu ⋅ Dorin Comaniciu ⋅ Ali Kamen ⋅ Riqiang Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 213
CARD: Correlation Aware Restoration with Diffusion
Niki Nezakati ⋅ Arnab Ghosh ⋅ Amit K. Roy-Chowdhury ⋅ Vishwanath Saragadam
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 214
DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis
Xinglong Luo ⋅ Ao Luo ⋅ Zhengning Wang ⋅ Yueqi Yang ⋅ Chaoyu Feng ⋅ Lei Lei ⋅ Bing Zeng ⋅ Shuaicheng Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 215
DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease
Runsheng Bai ⋅ Chengyu Zhang ⋅ Yangdong Deng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 216
Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?
Renye Yan ⋅ Jikang Cheng ⋅ Shikun Sun ⋅ Yi Sun ⋅ You Wu ⋅ Wei Peng ⋅ Zongwei Wang ⋅ Ling Liang ⋅ Junliang Xing ⋅ Yimao Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 217
CSF: Black-box Fingerprinting via Compositional Semantics for Text-to-Image Models
Junhoo Lee ⋅ Mijin Koo ⋅ Nojun Kwak
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 218
InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior
Weimin Bai ⋅ Suzhe Xu ⋅ Yiwei Ren ⋅ Jinhua Hao ⋅ Ming Sun ⋅ Wenzheng Chen ⋅ He Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 219
MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition–Perception–Reasoning Guided Text-Image Machine Translation
Gengluo Li ⋅ Chengquan Zhang ⋅ Yupu Liang ⋅ Huawen Shen ⋅ Yaping Zhang ⋅ Pengyuan Lyu ⋅ Weinong Wang ⋅ Xingyu Wan ⋅ Gangyan Zeng ⋅ Han Hu ⋅ Can Ma ⋅ Yu ZHOU
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 220
M3DocDep: Multi-modal, Multi-page, Multi-document Dependency Chunking with Large Vision-Language Models
Joongmin Shin ⋅ Jeongbae Park ⋅ Jaehyung Seo ⋅ Heuiseok Lim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 221
Towards Policy-Adaptive Image Guardrail: Benchmark and Method
Caiyong Piao ⋅ Zhiyuan Yan ⋅ Haoming Xu ⋅ Yunzhen Zhao ⋅ Kaiqing Lin ⋅ Feiyang Xu ⋅ Shuigeng Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 222
Flat-Pack Bench: Evaluating Spatio-Temporal Understanding in Large Vision-Language Models through Furniture Assembly
Aditya Chetan ⋅ Eric Cai ⋅ Peeyush Kushwaha ⋅ Bharath Raj Nagoor Kani ⋅ Utkarsh Mall ⋅ Qianqian Wang ⋅ Noah Snavely ⋅ Bharath Hariharan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 223
TextFM: Robust Semi-dense Feature Matching with Language Guidance
Zhihao Zheng ⋅ Jinglun Feng ⋅ Nirav Savaliya ⋅ Zheng-Hang Yeh ⋅ Bo Lang ⋅ Mooi Choo Chuah
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 224
What’s Wrong with Synthetic Data for Scene Text Recognition? A Strong Synthetic Engine with Diverse Simulations and Self-Evolution
Xingsong Ye ⋅ Yongkun Du ⋅ Jiaxin Zhang ⋅ Chen Li ⋅ Jing LYU ⋅ Zhineng Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 225
Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Cheng Cui ⋅ Ting Sun ⋅ Suyin Liang ⋅ Tingquan Gao ⋅ Zelun Zhang ⋅ Jiaxuan Liu ⋅ Xueqing Wang ⋅ Changda Zhou ⋅ Hongen Liu ⋅ Manhui Lin ⋅ Yue Zhang ⋅ yubo zhang ⋅ Jing Zhang ⋅ Jun Zhang ⋅ Xing Wei ⋅ Yi Liu ⋅ Dianhai Yu ⋅ Yanjun Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 226
SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation
Jialiang Kang ⋅ Han Shu ⋅ Wenshuo Li ⋅ Yingjie Zhai ⋅ Xinghao Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 227
Point Cloud as a Foreign Language for Multi-modal Large Language Model
Sneha Paul ⋅ Zachary Patterson ⋅ Nizar Bouguila
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 228
Grounded 3D-Aware Spatial Vision-Language Modeling
An-Chieh Cheng ⋅ Yang Fu ⋅ Yatai Ji ⋅ Ligeng Zhu ⋅ Guanqi Zhan ⋅ Zhuoyang Zhang ⋅ Zhaojing Yang ⋅ Song Han ⋅ Yao Lu ⋅ Pavlo Molchanov ⋅ Vidya Nariyambut Murali ⋅ Jan Kautz ⋅ Xiaolong Wang ⋅ Danny Yin ⋅ Sifei Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 229
SpatialTree: How Spatial Intelligence Branches Out in MLLMs
Yuxi Xiao ⋅ longfei li ⋅ Shen Yan ⋅ Xinhang Liu ⋅ Sida Peng ⋅ Yunchao Wei ⋅ Xiaowei Zhou ⋅ Bingyi Kang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 230
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation
Yan Shu ⋅ Bin Ren ⋅ Zhitong Xiong ⋅ Xiao Xiang Zhu ⋅ Begüm Demir ⋅ Nicu Sebe ⋅ Paolo Rota
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 231
Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
Chun-Hsiao Yeh ⋅ Shengyi Qian ⋅ Manchen Wang ⋅ Yi Ma ⋅ Joseph Tighe ⋅ Fanyi Xiao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 232
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
Sheng-Yu Huang ⋅ Jaesung Choe ⋅ Yu-Chiang Frank Wang ⋅ Cheng Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 233
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
Vineet Bhat ⋅ Sungsu Kim ⋅ Valts Blukis ⋅ Greg Heinrich ⋅ Prashanth Krishnamurthy ⋅ Ramesh Karri ⋅ Stan Birchfield ⋅ Farshad Khorrami ⋅ Jonathan Tremblay
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 234
Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models
Shengli Zhou ⋅ Minghang Zheng ⋅ Feng Zheng ⋅ Yang Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 235
Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
Hao Zhong ⋅ Muzhi Zhu ⋅ Shenyan Zeng ⋅ Anzhou Li ⋅ Cong Chen ⋅ Hua Geng ⋅ Duochao Shi ⋅ Wentao Ye ⋅ Tao Lin ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 236
REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting
Changyue Shi ⋅ Minghao Chen ⋅ Yiping Mao ⋅ Chuxiao Yang ⋅ Xinyuan Hu ⋅ Jiajun Ding ⋅ Zhou Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 237
From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs
Mingrui Wu ⋅ Zhaozhi Wang ⋅ Fangjinhua Wang ⋅ Jiaolong Yang ⋅ Marc Pollefeys ⋅ Tong Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 238
MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
Changli Wu ⋅ Haodong Wang ⋅ Jiayi Ji ⋅ Yutian Yao ⋅ Chunsai Du ⋅ Jihua Kang ⋅ Yanwei Fu ⋅ Liujuan Cao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 239
SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
Ruosen Zhao ⋅ Zhikang Zhang ⋅ Jialei Xu ⋅ Jiahao Chang ⋅ Dong Chen ⋅ Lingyun Li ⋅ Weijian Sun ⋅ Zizhuang Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 240
ReMatch: Boosting Representation through Matching for Multimodal Retrieval
Qianying Liu ⋅ Xiao Liang ⋅ Zhiqiang Zhang ⋅ Yibo Chen ⋅ Xu Tang ⋅ Zhongfei Qing ⋅ Fengfan Zhou ⋅ Yao Hu ⋅ Paul Henderson
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 241
RI-Mamba: Rotation-Invariant Mamba for Robust Text-to-Shape Retrieval
Khanh Nguyen ⋅ Dasith de Silva Edirimuni ⋅ Ghulam Mubashar Hassan ⋅ Ajmal Mian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 242
Revisiting F-measure Optimization in Multi-Label Classification: A Sampling-based Approach
Zixun Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 243
Thinking Beyond Labels: Vocabulary-Free Fine-Grained Recognition using Reasoning-Augmented LMMs
Dmitry Demidov ⋅ Muhammad Zaigham Zaheer ⋅ Zongyan Han ⋅ Omkar Thawakar ⋅ Rao Anwer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 244
WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval
Tianyue Wang ⋅ Leigang Qu ⋅ tianyu yang ⋅ xiangzhao hao ⋅ Yifan Xu ⋅ Haiyun Guo ⋅ Jinqiao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 245
Modeling the Visual Ambiguity of Human Sketches
Yang Zhou ⋅ Ping Ni ⋅ Jin Wang ⋅ Senyun Jia ⋅ Jingdan Yan ⋅ Kaixiang Huang ⋅ Guodong Lu ⋅ Jingru Yang ⋅ Shengfeng He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 246
SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval
Qunjie Huang ⋅ Weina Zhu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 247
ConeSep: Cone-based Robust Noise-Unlearning Compositional Network for Composed Image Retrieval
Zixu Li ⋅ Yupeng Hu ⋅ Zhiwei Chen ⋅ Mingyu Zhang ⋅ Zhiheng Fu ⋅ Liqiang Nie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 248
V^2-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence
Jiancheng Pan ⋅ Runze Wang ⋅ Tianwen Qian ⋅ Mohammad Mahdi ⋅ Yanwei Fu ⋅ Xiangyang Xue ⋅ Xiaomeng Huang ⋅ Luc Van Gool ⋅ Danda Paudel ⋅ Yuqian Fu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 249
WeaveTime: Streaming from Earlier Frames into Emergent Memory in VideoLLMs
Yulin Zhang ⋅ Cheng Shi ⋅ Sibei Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 250
Streaming Video Crime Anticipation with Spatio-Temporal Causal Reasoning
Yusong Wang ⋅ Zheyuan Gu ⋅ Keyu Mao ⋅ Minghao Shao ⋅ Mingkun Xu ⋅ Prayag Tiwari ⋅ Jiawei Shao ⋅ qingsong zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 251
Efficient Frame Selection for Long Video Understanding via Reinforcement Learning
Yaxuan Qin ⋅ Hefei Li ⋅ Wenqi Mu ⋅ Yancheng He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 252
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
Joungbin An ⋅ Kristen Grauman
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 253
InternVideo-Next: Towards World-Understanding Video Models
Chenting Wang ⋅ Yuhan Zhu ⋅ Yicheng Xu ⋅ Jiange Yang ⋅ ziang yan ⋅ Yali Wang ⋅ Yi Wang ⋅ Limin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 254
Condensed Test-Time Adaptation of VLMs for Action Recognition
Wenxuan Ge ⋅ Qu Hongyu ⋅ Rui Yan ⋅ Guo-Sen Xie ⋅ Yazhou Yao ⋅ Xiangbo Shu ⋅ Jinhui Tang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 255
Test-time Ego-Exo-centric Adaptation for Action Anticipation via Multi-Label Prototype Growing and Dual-Clue Consistency
Zhaofeng Shi ⋅ Heqian Qiu ⋅ Lanxiao Wang ⋅ Qingbo Wu ⋅ Fanman Meng ⋅ Lili Pan ⋅ Hongliang Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 256
A Stitch in Time: Learning Procedural Workflow via Self-Supervised Plackett–Luce Ranking
chengan che ⋅ Chao Wang ⋅ Xinyue Chen ⋅ Sophia Tsoka ⋅ Luis Carlos Garcia Peraza Herrera
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 257
SurgCoT: Advancing Spatiotemporal Reasoning in Surgical Videos through a Chain-of-Thought Benchmark
Gui Wang ⋅ YongSong Zhou ⋅ Kaijun Deng ⋅ Wooi Ping Cheah ⋅ Rong Qu ⋅ Jianfeng Ren ⋅ Linlin Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 258
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing
Baifeng Shi ⋅ Stephanie Fu ⋅ Long Lian ⋅ Hanrong Ye ⋅ David Eigen ⋅ Aaron Reite ⋅ Jan Kautz ⋅ Boyi Li ⋅ David Chan ⋅ Trevor Darrell ⋅ Pavlo Molchanov ⋅ Danny Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 259
Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness
Yehonatan Elisha ⋅ Oren Barkan ⋅ Noam Koenigstein
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 260
Explaining Object Detectors via Collective Contribution of Pixels
Toshinori Yamauchi ⋅ Hiroshi Kera ⋅ Kazuhiko Kawamoto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 261
Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation
Ruoyu Chen ⋅ Xiaoqing Guo ⋅ Kangwei Liu ⋅ Siyuan Liang ⋅ Shiming Liu ⋅ Qunli Zhang ⋅ Laiyuan Wang ⋅ Hua Zhang ⋅ Xiaochun Cao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 262
H-Sets: Hessian-Guided Discovery of Set-Level Feature Interactions in Image Classifiers
Ayushi Mehrotra ⋅ Dipkamal Bhusal ⋅ Michael Clifford ⋅ Nidhi Rastogi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 263
Evaluating Generative Models via One-Dimensional Code Distributions
Zexi Jia ⋅ Pengcheng Luo ⋅ Yijia Zhong ⋅ Jinchao Zhang ⋅ Jie Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 264
TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection
Jian-Yu Jiang-Lin ⋅ Kang-Yang Huang ⋅ Ling Zou ⋅ Ling Lo ⋅ Sheng-Ping Yang ⋅ Yu-Wen Tseng ⋅ Kun-Hsiang Lin ⋅ Chia-Ling Chen ⋅ Yu-Ting Ta ⋅ Yan-Tsung Wang ⋅ Po-Ching Chen ⋅ Hongxia Xie ⋅ Hong-Han Shuai ⋅ Wen-Huang Cheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 265
BuildAnyPoint: 3D Building Structured Abstraction from Diverse Point Clouds
Tongyan Hua ⋅ Haoran Gong ⋅ Yuan Liu ⋅ Di Wang ⋅ Ying-Cong Chen ⋅ Wufan Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 266
LiDAR-to-4DRadar Diffusion Bridge via Cross-Modal Alignment and Translation in Latent Space
Dazhong Shen ⋅ Jingjing Gu ⋅ Qiang Zhou ⋅ Meng Zhao ⋅ Ying Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 267
Edges Compete for Trust: Group Relative Edge Optimization for Building Reconstruction from Point Clouds
Yujun Liu ⋅ Ruisheng Wang ⋅ Xiang Ao ⋅ Haoyuan Shen ⋅ Kuihao Wang ⋅ Kun Zhou ⋅ Qingquan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 268
Unsupervised Monocular 3D Keypoint Discovery from Multi-View Diffusion Priors
Subin Jeon ⋅ In Cho ⋅ Junyoung Hong ⋅ Woong Oh ⋅ Seon Joo Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 269
QD-PCQA: Quality-Aware Domain Adaptation for Point Cloud Quality Assessment
Guohua Zhang ⋅ Jian Jin ⋅ Meiqin Liu ⋅ Chao Yao ⋅ Weisi Lin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 270
L3DR: 3D-aware LiDAR Diffusion and Rectification
QUAN LIU ⋅ Xiaoqin Zhang ⋅ Ling Shao ⋅ Shijian Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 271
Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal
Kazuma Ikeda ⋅ Ryosei Hara ⋅ Rokuto Nagata ⋅ Ozora Sako ⋅ Zihao Ding ⋅ Takahiro Kado ⋅ Ibuki Fujioka ⋅ Taro Beppu ⋅ Mariko Isogawa ⋅ Kentaro Yoshioka
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 272
Ghosts in the Point Clouds: De-glaring LiDAR in the Transient Domain
Avery Gump ⋅ Connor Henley ⋅ Sungjin Cheong ⋅ Akarsh Prabhakara ⋅ Mohit Gupta
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 273
MS^2Gait: A Multi-Scale Spatio-Temporal Fusion Network for LiDAR-based Gait Recognition
Shenyin Xu ⋅ Yishan Wang ⋅ Xinyu Li ⋅ Rui Liu ⋅ Zhongyuan Wang ⋅ Xin Tian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 274
Foundry: Distilling 3D Foundation Models for the Edge
Guillaume Letellier ⋅ Siddharth Srivastava ⋅ Frederic Jurie ⋅ Gaurav Sharma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 275
Learning to Identify Out-of-Distribution Objects for 3D LiDAR Anomaly Segmentation
Simone Mosco ⋅ Daniel Fusaro ⋅ Alberto Pretto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 276
Dual-Level Confidence based Implicit Self-Refinement for Medical Visual Question Answering
Meihong Pan ⋅ Yefeng Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 277
FedMPT: Federated Multi-Label Prompt Tuning of Vision-Language Models
Xucong Wang ⋅ Pengkun Wang ⋅ Zhe Zhao ⋅ Liheng Yu ⋅ Shuang Wang ⋅ Yang Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 278
Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance
Muyang Li ⋅ Yucheng Liu ⋅ Jianbo Ma ⋅ Elliot Osborne ⋅ Bo Han ⋅ Tongliang Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 279
NTK-Guided Implicit Neural Teaching
Chen Zhang ⋅ Wei Zuo ⋅ Bingyang Cheng ⋅ Yikun Wang ⋅ Wei-Bin Kou ⋅ Yik-Chung WU ⋅ Ngai Wong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 280
SynthRGB-T: Language-Vision Guided Image Translation for Diversity Synthesis
Jiangang Ding ⋅ Yiquan Du ⋅ Pengxiang Li ⋅ Lili Pei ⋅ Yuanlin Zhao ⋅ Wei Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 281
Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
Shojiro Yamabe ⋅ Futa Waseda ⋅ Daiki Shiono ⋅ Tsubasa Takahashi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 282
Harmonious Parameter Adaptation in Continual Visual Instruction Tuning for Safety-Aligned MLLMs
Ziqi Wang ⋅ Chang Che ⋅ Qi Wang ⋅ Hui Ma ⋅ Zenglin Shi ⋅ Cees G. M. Snoek ⋅ Meng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 283
StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues
Zanxi Ruan ⋅ Songqun Gao ⋅ Qiuyu Kong ⋅ Yiming Wang ⋅ Marco Cristani
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 284
Same or Not? Enhancing Visual Perception in Vision-Language Models
Damiano Marsili ⋅ Aditya Mehta ⋅ Ryan Y. ⋅ Georgia Gkioxari
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 285
Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure
Jooyeol Yun ⋅ Jaegul Choo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 286
AssemblyBench: Physics-Aware Assembly of Complex Industrial Objects
Danrui Li ⋅ Jiahao Zhang ⋅ Bernhard Egger ⋅ Moitreya Chatterjee ⋅ Suhas Lohit ⋅ Tim Marks ⋅ Anoop Cherian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 287
Animator-Centric Skeleton Generation on Objects with Fine-Grained Details
Mingze Sun ⋅ Cheng Zeng ⋅ Pei Jiansong ⋅ Junhao Chen ⋅ Chaoyue Song ⋅ Shaohui Wang ⋅ Tianyuan Chang ⋅ Bin Huang ⋅ Zijiao Zeng ⋅ Ruqi Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 288
Synthesizing Visual Concepts as Vision-Language Programs
Antonia Wüst ⋅ Wolfgang Stammer ⋅ Hikaru Shindo ⋅ Lukas Helff ⋅ Devendra Singh Dhami ⋅ Kristian Kersting
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 289
Self-Consistency for LLM-Based Motion Trajectory Generation and Verification
Jiaju Ma ⋅ R. Kenny Jones ⋅ Jiajun Wu ⋅ Maneesh Agrawala
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 290
Semantic Scale Space: A Framework for Controllable Image Abstraction
Kazu Mishiba
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 291
Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection
Dacheng Qi ⋅ Chenyu Wang ⋅ Jingwei Xu ⋅ Tianzhe Chu ⋅ Zibo Zhao ⋅ Wen Liu ⋅ Wenrui Ding ⋅ Yi Ma ⋅ Shenghua Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 292
DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime
Julian Lorenz ⋅ Vladyslav Kovganko ⋅ Elias Kohout ⋅ Mrunmai Phatak ⋅ Daniel Kienzle ⋅ Rainer Lienhart
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 293
SIF: Semantically In-Distribution Fingerprints for Large Vision-Language Models
Yifei Zhao ⋅ Qian Lou ⋅ Mengxin Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 294
Designing to Forget: Deep Semi-parametric Models for Unlearning
Amber Yija Zheng ⋅ YU-SHAN TAI ⋅ Raymond A. Yeh
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 295
Meta-FC: Meta-Learning with Feature Consistency for Robust and Generalizable Watermarking
Yuheng Li ⋅ Weitong Chen ⋅ Chengcheng Zhu ⋅ Jiale Zhang ⋅ Chunpeng Ge ⋅ Di Wu ⋅ Guodong Long
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 296
PrivSynth: Alternating and Control-Based Optimization for Privacy and Utility in Synthetic Data
Xinyuan Zhao ⋅ Hanlin Gu ⋅ Guibao Song ⋅ Gongxi Zhu ⋅ Yifei Zou ⋅ Lixin Fan ⋅ Yuxing Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 297
Neighbor-Aware Localized Concept Erasure in Text-to-Image Diffusion Models
Zhuan Shi ⋅ Alireza Dehghanpour Farashah ⋅ Rik de Vries ⋅ Golnoosh Farnadi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 298
EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment
Ruoxi Cheng ⋅ Hao-Xuan Ma ⋅ Teng Ma ⋅ Hongyi Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 299
Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models
Yabin Zhang ⋅ Maya Varma ⋅ Yunhe Gao ⋅ Jean-Benoit Delbrouck ⋅ Jiaming Liu ⋅ Chong Wang ⋅ Curtis Langlotz
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 300
A Polynomial Chaos Framework for Causal Discovery in Nonlinear Uncertain Systems
Liang Cao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 301
Domain-Skewed Federated Learning with Feature Decoupling and Calibration
Huan Wang ⋅ Jun Shen ⋅ Jun Yan ⋅ Guansong Pang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 302
From Selection to Scheduling: Federated Geometry-Aware Correction Makes Exemplar Replay Work Better under Continual Dynamic Heterogeneity
Zhuang Qi ⋅ Yingpeng Tang ⋅ Lei Meng ⋅ Guoqing Chao ⋅ Lei Wu ⋅ Han Yu ⋅ Xiangxu Meng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 303
Fine-Tuning Impairs the Balancedness of Foundation Models in Long-tailed Personalized Federated Learning
Shihao Hou ⋅ Chikai Shang ⋅ Zhiheng Yang ⋅ Jiacheng Yang ⋅ Xinyi Shang ⋅ Junlong Gao ⋅ Yiqun Zhang ⋅ Yang Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 304
Few-for-Many Personalized Federated Learning
Ping Guo ⋅ ZHANG Tiantian ⋅ Xi Lin ⋅ Xiang Li ⋅ Zhi-Ri Tang ⋅ Qingfu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 305
ProxyFL: A Proxy-Guided Framework for Federated Semi-Supervised Learning
Duowen Chen ⋅ Yan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 306
Domain Sensitive Federated Learning with Fisher-Informed Pruning
Chenchen Lin ⋅ Wenhao Yuan ⋅ Zhengji Xu ⋅ Xuehe Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 307
SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs
Mohamad Alansari ⋅ Naufal Suryanto ⋅ Divya Velayudhan ⋅ Sajid Javed ⋅ Naoufel Werghi ⋅ Muzammal Naseer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 308
Bridging Facial Understanding and Animation via Language Models
Luchuan Song ⋅ Pinxin Liu ⋅ Haiyang Liu ⋅ Zhenchao Jin ⋅ Yolo Yunlong Tang ⋅ Zichong Xu ⋅ Susan Liang ⋅ Jing Bi ⋅ Jason J. Corso ⋅ Chenliang Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 309
AR²-4FV: Anchored Referring and Re-identification for Long-Term Grounding in Fixed-View Videos
Teng Yan ⋅ Yihan Liu ⋅ Jiongxu Chen ⋅ Teng Wang ⋅ Jiaqi LI ⋅ Bingzhuo Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 310
CVA: Context-aware Video-text Alignment for Video Temporal Grounding
Sungho Moon ⋅ Seunghun Lee ⋅ Jiwan Seo ⋅ Sunghoon Im
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 311
OmniGround: A Comprehensive Spatio-Temporal Grounding Benchmark for Real-World Complex Scenarios
Hong Gao ⋅ Jingyu Wu ⋅ Xiangkai Xu ⋅ Kangni Xie ⋅ Yunchen Zhang ⋅ Bin Zhong ⋅ Xurui Gao ⋅ Min-Ling Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 312
ST4R-Splat: Spatio-Temporal Referring Segmentation in 4D Gaussian Splatting
Yuming Meng ⋅ Dong Wu ⋅ Hongbin Zha
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 313
WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
Jian Yang ⋅ Dacheng Yin ⋅ Xiaoxuan He ⋅ Yong Li ⋅ Fengyun Rao ⋅ Jing LYU ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 314
Rejection Mixing: Fast Semantic Propagation of Mask Tokens for Efficient DLLM Inference
Yushi Ye ⋅ Feng Hong ⋅ Huangjie Zheng ⋅ Xu Chen ⋅ Zhiyong Chen ⋅ Yanfeng Wang ⋅ Jiangchao Yao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 315
Towards Unified Human Perception and Machine Understanding: Token Flow Guided Compression Framework
Li Xu ⋅ YingFu Zhang ⋅ Kepeng Xu ⋅ Gang He ⋅ Yunsong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 316
A More Word-like Image Tokenization for MLLMs
Hyun Lee ⋅ Hyemin Jeong ⋅ Yejin Kim ⋅ Hyungwook Choi ⋅ Hyunsoo Cho ⋅ Soo Kyung Kim ⋅ Joonseok Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 317
DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
Aditya Kumar Singh ⋅ Hitesh Kandala ⋅ Pratik Prabhanjan Brahma ⋅ Zicheng Liu ⋅ Emad Barsoum
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 318
Unified Spatiotemporal Token Compression for Video-LLMs at Ultra-Low Retention
Junhao Du ⋅ XUE JIALONG ⋅ Anqi Li ⋅ Jincheng Dai ⋅ Guo Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 319
One Layer’s Trash is Another Layer’s Treasure: Adaptive Layer-wise Visual Token Selection in LVLMs
Yongru Chen ⋅ Kai Zhang ⋅ Zeliang Zong ⋅ Yuchen Lu ⋅ Wenming Tan ⋅ Ye Ren ⋅ Jilin Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 320
OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
Keda Tao ⋅ Kele Shao ⋅ Bohan Yu ⋅ Weiqiang Wang ⋅ Jian Liu ⋅ Huan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 321
Tunable Soft Equivariance with Guarantees
Md Ashiqur Rahman ⋅ Lim Jun Hao ⋅ Jeremiah Jiang ⋅ Teck-Yian Lim ⋅ Raymond A. Yeh
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 322
Semi-Supervised Conformal Prediction With Unlabeled Nonconformity Score
Xuanning Zhou ⋅ Zihao Shi ⋅ Hao Zeng ⋅ Xiaobo Xia ⋅ Bingyi Jing ⋅ Hongxin Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 323
Cluster-aware Anchor Learning for Multi-View Clustering
Zhe Chen ⋅ Fanhui Meng ⋅ Tianyang Xu ⋅ Xiao-Jun Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 324
Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning
Chongjie Si ⋅ Yidan Cui ⋅ Fuchao Yang ⋅ Wei Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 325
Weight Space Representation Learning via Neural Field Adaptation
Zhuoqian Yang ⋅ Mathieu Salzmann ⋅ Sabine Süsstrunk
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 326
Recurrent Video Masked Autoencoders
Daniel Zoran ⋅ Nikhil Parthasarathy ⋅ Yi Yang ⋅ Drew A Hudson ⋅ Joao Carreira ⋅ Andrew Zisserman
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 327
Revisiting Unknowns: Towards Effective and Efficient Open-Set Active Learning
Chen-Chen Zong ⋅ Yu-Qi Chi ⋅ Xie-Yang Wang ⋅ Yan Cui ⋅ Shengjun Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 328
Seeing Through the Shift: Causality-Inspired Robust Generalized Category Discovery
Wei Feng ⋅ Yiwen Jiang ⋅ Sijin Zhou ⋅ Zhuang Qi ⋅ Zhongxing Xu ⋅ Zhonghua Wang ⋅ Feilong Tang ⋅ Zongyuan Ge
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 329
From Exploration to Exploitation: A Two-Stage Entropy RLVR Approach for Noise-Tolerant MLLM Training
Donglai Xu ⋅ Hongzheng Yang ⋅ Yuzhi Zhao ⋅ Pingping Zhang ⋅ Jinpeng Chen ⋅ Wenao Ma ⋅ Zhijian Hou ⋅ Mengyang Wu ⋅ Xiaolei Li ⋅ Senkang Hu ⋅ Ziyi Guan ⋅ Jason Chun Lok Li ⋅ Lai-Man Po
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 330
Spatial Retrieval Augmented Autonomous Driving
Xiaosong Jia ⋅ Chenhe Zhang ⋅ Yule Jiang ⋅ Songbur Wong ⋅ Zhiyuan Zhang ⋅ Chen Chen ⋅ Shaofeng Zhang ⋅ Xuanhe Zhou ⋅ Xue Yang ⋅ Junchi Yan ⋅ Yu-Gang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 331
Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems
Tolga Dimlioglu ⋅ Nadine Chang ⋅ Maying Shen ⋅ Rafid Mahmood ⋅ Jose M. Alvarez
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 332
ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving
Qihang Peng ⋅ Xuesong Chen ⋅ Chenye Yang ⋅ Shaoshuai Shi ⋅ Hongsheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 333
CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography
Gasser Elazab ⋅ Frank Neuhaus ⋅ Tilman Koß ⋅ Malte Splietker ⋅ Aditya Date ⋅ Michael Unterreiner ⋅ Maximilian Jansen ⋅ Olaf Hellwich
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 334
MindDriver: Introducing Progressive Multimodal Reasoning for Autonomous Driving
Lingjun Zhang ⋅ Yujian Yuan ⋅ Changjie Wu ⋅ Xinyuan Chang ⋅ Xin Cai ⋅ Shuang Zeng ⋅ Linzhe Shi ⋅ Sijin Wang ⋅ Hang Zhang ⋅ Mu Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 335
WPT: World-to-Policy Transfer via Online World Model Distillation
Guangfeng Jiang ⋅ Yueru Luo ⋅ Jun Liu ⋅ Yi Huang ⋅ Yiyao Zhu ⋅ Zhan Qu ⋅ Dave Chen ⋅ Bingbing Liu ⋅ Xu Yan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 336
ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data
Yuxing Liu ⋅ Zheng Li ⋅ Huanhuan Liang ⋅ Ji Zhang ⋅ Zeyu Sun ⋅ Yong Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 337
Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction
Hao Zhou ⋅ Lu Qi ⋅ Xiangtai Li ⋅ Jie Zhang ⋅ Yi Liu ⋅ Xu Yang ⋅ Mingyu Fan ⋅ Fei Luo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 338
URScenes: A Multi-scenario Dataset for Unstructured Road Environments
Runsen Liu ⋅ Aizemaitijiang Baoerhan ⋅ Zhangyu Wang ⋅ Jie Wang ⋅ Jinghao Cui ⋅ Guizhen Yu ⋅ Songyue Yang ⋅ WanCheng Sun ⋅ Mingjun Tang ⋅ Zhanbo Hua ⋅ Wenwen Luo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 339
MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Driving
Junli Wang ⋅ Yinan Zheng ⋅ Xueyi Liu ⋅ Zebin Xing ⋅ Pengfei Li ⋅ Kun Ma ⋅ Hangjun Ye ⋅ Guang Chen ⋅ Guang Li ⋅ Long Chen ⋅ Zhongpu Xia ⋅ Qichao Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 340
SAMosaic3D: Modular Scene Assembly for Real-Time 3D Segment Anything
Peng Wang ⋅ Yongcai Wang ⋅ Wang Chen ⋅ Hualong Cao ⋅ Kang Yang ⋅ Chunxu Li ⋅ Jie Wen ⋅ Deying Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 341
Mitigating Objectness Bias and Region-to-Text Misalignment for Open-Vocabulary Panoptic Segmentation
Nikolay Kormushev ⋅ Josip Šarić ⋅ Matej Kristan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 342
MV3DIS: Multi-View Mask Matching via 3D Guides for Zero-Shot 3D Instance Segmentation
Yibo Zhao ⋅ Yigong Zhang ⋅ Jin Xie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 343
PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation
Gensheng Pei ⋅ Xiruo Jiang ⋅ Xinhao Cai ⋅ Tao Chen ⋅ Yazhou Yao ⋅ Byeungwoo Jeon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 344
RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation
Anuvab Sen ⋅ Mir Sayeed ⋅ Saibal Mukhopadhyay
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 345
SAMIX: Reinforcing SAM2 with Semantic Adapter and Reference Selecting Policy for Mix-Supervised Segmentation
Qiang Hu ⋅ Jiajie Wei ⋅ Zhenyu Yi ⋅ Zhifen Yan ⋅ Yingjie Guo ⋅ Hongkuan Shi ⋅ Ge-Peng Ji ⋅ Qiang Li ⋅ Zhiwei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 346
MARSS: Radar Semantic Segmentation via Modular Attention and State Space Models
Fengyu Chen ⋅ Tiao Tan ⋅ Teng Li ⋅ Yuantian Quan ⋅ Qingmin Liao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 347
MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention
Zilong Zhao ⋅ Zhengming Ding ⋅ Pei Niu ⋅ Wenhao Sun ⋅ Feng Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 348
Exemplar-Free Class Incremental Learning via Preserving Class-Discriminative Structure
Xin Zhang ⋅ Liang Bai ⋅ Guanchao Wang ⋅ Xian Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 349
Critical Patch-Aware Sparse Prompting with Decoupled Training for Continual Learning on the Edge
Wonseon Lim ⋅ Jaesung Lee ⋅ Dae-Won Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 350
PACT: Phase-Like Transition Constraints in Adapter-Based Continual Learning of Vision-Language Models
Xuan Wang ⋅ Guiguang Ding ⋅ Jungong Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 351
Representation-Steered Incremental Adapter-Tuning for Class-Incremental Learning with Pre-Trained Models
Jiarui Zhao ⋅ Libo Huang ⋅ Xiangqi Li ⋅ Zhulin An ⋅ Chuanguang Yang ⋅ Yu Wang ⋅ Boyu Diao ⋅ Yongjun Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 352
Re-evaluating Continual VQA: Toward Fair and Robust Evaluation for Multimodal Continual Learning
Zijian Gao ⋅ Zicheng Sun ⋅ Xingxing Zhang ⋅ Kele Xu ⋅ Huaimin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 353
Distilling Balanced Knowledge from a Biased Teacher
Seonghak Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 354
Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting
Hyeonseo Jang ⋅ Hyuk Kwon ⋅ Kibok Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 355
Beyond Myopic Alignment: Lookahead Optimization for Online Class-Incremental Learning
Song Lai ⋅ Zhe Zhao ⋅ Fei Zhu ⋅ Ji Cheng ⋅ Xi Lin ⋅ Qingfu Zhang ⋅ Gaofeng Meng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 356
EmoDiffTalk: Emotion-aware Diffusion for Editable 3D Gaussian Talking Head
Chang Liu ⋅ Tianjiao Jing ⋅ Chengcheng Ma ⋅ Xuanqi Zhou ⋅ Zhengxuan Lian ⋅ Qin Jin ⋅ Hongliang Yuan ⋅ Shi-Sheng Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 357
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
Taekyung Ki ⋅ Sangwon Jang ⋅ Jaehyeong Jo ⋅ Jaehong Yoon ⋅ Sung Ju
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 358
D^3FER: Dual Channel and Dual Branch Network for Robust Facial Expression Recognition under Dual Challenges
Hui Tang ⋅ Yifan He ⋅ Zhong Jin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 359
HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image
Hezhen Hu ⋅ Wangbo Zhao ⋅ Lanqing Guo ⋅ Hanwen Jiang ⋅ Jonathan C. Liu ⋅ Zhiwen Fan ⋅ Kai Wang ⋅ Zhangyang Wang ⋅ Georgios Pavlakos
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 360
ExpPortrait: Expressive Portrait Generation via Personalized Representation
Junyi Wang ⋅ Yudong Guo ⋅ Boyang Guo ⋅ Shengming Yang ⋅ Juyong Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 361
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Zhiyuan Li ⋅ Chi-Man Pun ⋅ Chen Fang ⋅ Jue Wang ⋅ Xiaodong Cun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 362
ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation
Wei Xue ⋅ Mingcheng Li ⋅ Xuecheng Wu ⋅ Jingqun Tang ⋅ Dingkang Yang ⋅ Lihua Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 363
OptiMVMap: Offline Vectorized Map Construction via Optimal Multi-vehicle Perspectives
Zedong Dan ⋅ Zijie Wang ⋅ Wei Zhang ⋅ Xiangru Lin ⋅ Weiming Zhang ⋅ Xiao Tan ⋅ Jingdong Wang ⋅ Liang Lin ⋅ Guanbin Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 364
CogDriver: Integrating Cognitive Inertia for Temporally Coherent Planning in Autonomous Driving
Pei Liu ⋅ Qingtian Ning ⋅ Xinyan Lu ⋅ Haipeng LIU ⋅ Weiliang Ma ⋅ Dangen She ⋅ XianPeng Lang ⋅ Jun Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 365
TopoHR: Hierarchical Centerline Representation for Cyclic Topology Reasoning in Driving Scenes with Point-to-Instance Relations
Yifeng Bai ⋅ Zhirong Chen ⋅ Bo Song ⋅ Erkang Cheng ⋅ Haibin Ling
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 366
AURA: Multi-modal Shared Autonomy for Urban Navigation
Yukai Ma ⋅ Honglin He ⋅ Selina Song ⋅ Wayne Wu ⋅ Bolei Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 367
Zero-Shot Reconstruction of Animatable 3D Avatars with Cloth Dynamics from a Single Image
Joohyun Kwon ⋅ Geonhee Sim ⋅ Gyeongsik Moon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 368
FlexAvatar: Learning Complete 3D Head Avatars with Partial Supervision
Tobias Kirschstein ⋅ Simon Giebenhain ⋅ Matthias Nießner
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 369
Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining
Junxuan Li ⋅ Rawal Khirodkar ⋅ Egor Zakharov ⋅ Jihyun Lee ⋅ Zhaoen Su ⋅ Yuan Dong ⋅ Julieta Martinez ⋅ Kai Li ⋅ Qingyang Tan ⋅ Takaaki Shiratori ⋅ Matthew Hu ⋅ Peihong Guo ⋅ Xuhua Huang ⋅ Zhongshi Jiang ⋅ LINGCHEN YANG ⋅ Ariyan Zarei ⋅ Marco Pesavento ⋅ Yichen Xu ⋅ Chengan He ⋅ He Wen ⋅ Giljoo Nam ⋅ Teng Deng ⋅ Wyatt Borsos ⋅ Anjali Thakrar ⋅ Jean-Charles Bazin ⋅ Rinat Abdrashitov ⋅ Carsten Stoll ⋅ Ginés Hidalgo ⋅ James Booth ⋅ Lucy Wang ⋅ Xiaowen Ma ⋅ Yu Rong ⋅ Sairanjith Thalanki ⋅ Chen Cao ⋅ Christian Häne ⋅ Abhishek Kar ⋅ Sofien Bouaziz ⋅ Jason Saragih ⋅ Yaser Sheikh ⋅ Shunsuke Saito
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 370
UIKA: Fast Universal Head Avatar from Pose-Free Images
Zijian Wu ⋅ Boyao Zhou ⋅ Liangxiao Hu ⋅ Hongyu Liu ⋅ Yuan Sun ⋅ Xuan Wang ⋅ Xun Cao ⋅ Yujun Shen ⋅ Hao Zhu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 371
FlexAvatar: Flexible Large Reconstruction Model for Animatable Gaussian Head Avatars with Detailed Deformation
Cheng Peng ⋅ Zhuo Su ⋅ Liao Wang ⋅ Chen Guo ⋅ Zhaohu Li ⋅ Chengjiang Long ⋅ Zheng Lv ⋅ Jingxiang Sun ⋅ Chenyangguang Zhang ⋅ Yebin Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 372
First Logit Boosting: Visual Grounding Method to Mitigate Object Hallucination in Large Vision-Language Models
Jiwoo Ha ⋅ Jongwoo Baek ⋅ Jinhyun So
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 373
Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation
Tiantian Dang ⋅ Chao Bi ⋅ Shufan Shen ⋅ Jinzhe Liu ⋅ Qingming Huang ⋅ Shuhui Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 374
Envision, Attend, Then Respond: Counterfactual Hallucination Mitigation in Large Vision-Language Models
Yuxuan Liang ⋅ Fan Shi ⋅ Rui Zhu ⋅ Xu Li ⋅ Xiaolei Chen ⋅ Zhe Liu ⋅ Bin Li ⋅ Xiangyang Xue
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 375
PAS: Prelim Attention Score for Detecting Object Hallucinations in Large Vision-Language Models
Nhat Hoang ⋅ Minh Vu ⋅ My T. Thai ⋅ Manish Bhattarai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 376
MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization
Ashutosh Chaubey ⋅ Jiacheng Pang ⋅ Mohammad Soleymani
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 377
Fine-Grained Multi Image Object Hallucination Benchmark
Joonki Min ⋅ Chaeyun Kim ⋅ Hyungwook Choi ⋅ Yejin Kim ⋅ Kihyun Kim ⋅ Yohan Jo ⋅ Joonseok Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 378
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee ⋅ Zhoutong Zhang ⋅ Gabriel Huang ⋅ Jui-Hsien Wang ⋅ Joon-Young Lee ⋅ Jia-Bin Huang ⋅ Eli Shechtman ⋅ Zhengqi Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 379
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
Yiming Wang ⋅ Qihang Zhang ⋅ Shengqu Cai ⋅ Tong Wu ⋅ Jan Ackermann ⋅ Zhengfei Kuang ⋅ Yang Zheng ⋅ Frano Rajič ⋅ Siyu Tang ⋅ Gordon Wetzstein
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 380
Learning to Generate Highly Dynamic Videos using Synthetic Motion Data
Wonjoon Jin ⋅ Jiyun Won ⋅ Janghyeok Han ⋅ Qi Dai ⋅ Chong Luo ⋅ Seung-Hwan Baek ⋅ Sunghyun Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 381
Stereo World Model: Camera-Guided Stereo Video Generation
Yangtian Sun ⋅ Zehuan Huang ⋅ Yifan Niu ⋅ Lin Ma ⋅ Yan-Pei Cao ⋅ Yuewen Ma ⋅ Xiaojuan Qi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 382
CG-Floor: Centroid-Guided Diffusion for Large-Scale Floorplan Generation
Hongjin Lian ⋅ Jian Ma ⋅ Hongjie Chen ⋅ Jia Li ⋅ Ruizhen Hu ⋅ Yu-Kun Lai ⋅ Kun Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 383
MAD: Motion Appearance Decoupling for efficient Driving World Models
Ahmad Rahimi ⋅ Valentin Gerard ⋅ Éloi Zablocki ⋅ Matthieu Cord ⋅ Alex Alahi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 384
VDFE: Difference-Aware 3D Scene Editing with Non-Intrusive Video Diffusion Priors for Multi-View Consistency and Efficiency
Chao Zhang ⋅ Fang Liu ⋅ Shuo Li ⋅ Yang Liu ⋅ Jiahao Wang ⋅ Xinyan Huang ⋅ Lingling Li ⋅ Puhua Chen ⋅ Xu Liu ⋅ Wenping Ma ⋅ Siqi Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 385
Endless World: Real-Time 3D-Aware Long Video Generation
Ke Zhang ⋅ Jiacong Xu ⋅ Yiqun Mei ⋅ Vishal M. Patel
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 386
SpatialDiff: 3D-Aware Object Movement via Implicit Spatial Modeling
Zheng Liu ⋅ Zijian He ⋅ Huiguo He ⋅ Weizhi Zhong ⋅ Yejun Tang ⋅ Huan Yang ⋅ Kun Gai ⋅ Guanbin Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 387
Towards Realistic and Consistent Orbital Video Generation via 3D Foundation Priors
Rong Wang ⋅ Ruyi Zha ⋅ Ziang Cheng ⋅ Jiayu Yang ⋅ Pulak Purkait ⋅ Hongdong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 388
YOLO-ULM: Ultra-Lightweight Models for Real-Time Object Detection
Shasha Han ⋅ Chong Li ⋅ Xinning Wang ⋅ Xuebo Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 389
CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild
Alex Hoi Hang Chan ⋅ Neha Singhal ⋅ Onur Kocahan ⋅ Andrea Meltzer ⋅ Saverio Lubrano ⋅ Miya Warrington ⋅ Michael Griesser ⋅ Fumihiro Kano ⋅ Hemal Naik
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 390
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection
Xu Lin ⋅ Jinlong Peng ⋅ Zhenye Gan ⋅ Jiawen Zhu ⋅ Jun Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 391
VLM4RSDet: Collaborative Optimization with Vision-Language Model for Enhancing Remote Sensing Object Detection
Shuohao Shi ⋅ Qiang Fang ⋅ Xin Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 392
WiTTA-Bench: Benchmarking Test-Time Adaptation for WiFi Sensing
Bing Li ⋅ Qiang Wang ⋅ JUNDA LU ⋅ Le Zhang ⋅ Yun Liu ⋅ Ce Zhu ⋅ Wei Cui
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 393
MFEN: Multi-Frequency Expert Network for Visible-Infrared Person Re-ID
Xulin Li ⋅ Yan Lu ⋅ Bin Liu ⋅ Qinhong Yang ⋅ Qi Chu ⋅ Tao Gong ⋅ Nenghai Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 394
Object-Generalized Re-Identification: A Step Towards Universal Instance Perception
Shuoyi Chen ⋅ Yurui Wu ⋅ Mang Ye
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 395
When Transformers Meet Mamba: A Hybrid Transformer-Mamba Network for Video Object Detection
Qiang Qi ⋅ Xiao Wang ⋅ Zongyuan Du ⋅ Yu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 396
Prompt-Anchored Vision–Text Distillation for Lifelong Person Re-identification
Wen Wen ⋅ Hao CHEN ⋅ Shiliang Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 397
HyperGait: Unleashing the Power of Parsing for Gait Recognition in the Wild via Hypergraph
Jinkai Zheng ⋅ Jiaqing Wei ⋅ Xinxiang Jin ⋅ Yaoqi Sun ⋅ Xichun Sheng ⋅ Ming Li ⋅ Liangqiong Qu ⋅ Xinchen Liu ⋅ Wu Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 398
Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
Yiyu Wang ⋅ Xuyang Liu ⋅ Xiyan Gui ⋅ Xinying Lin ⋅ Boxue Yang ⋅ Chenfei Liao ⋅ Tailai Chen ⋅ Linfeng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 399
Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering
Yura Choi ⋅ Roy Miles ⋅ Rolandos Alexandros Potamias ⋅ Ismail Elezi ⋅ Jiankang Deng ⋅ Stefanos Zafeiriou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 400
Beyond Caption-Based Queries in Video Moment Retrieval
David Pujol-Perich ⋅ Albert Clapés ⋅ Dima Damen ⋅ Sergio Escalera ⋅ Michael Wray
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 401
Neural-Centric Video Processing Pipeline for Unified Multi-Task Inference
Seyeon Lee ⋅ Juncheol Ye ⋅ Jaehong Kim ⋅ Dongsu Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 402
VideoRealBench: A Chain-of-Thought Realism Evaluation Benchmark for Generated Human-Centric Videos
Min Yang ⋅ Xinwen Zhang ⋅ Jialei Tang ⋅ Xin Zhou ⋅ Kehan Li ⋅ Zeyi Huang ⋅ Limin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 403
VAST: Video Ability-Stratified Taxonomy for Data-Efficient Video Reasoning
Zhongan Wang ⋅ Xiaoyu Wen ⋅ Lingxiao Du ⋅ Kun Li ⋅ Zhiliang Wu ⋅ Xingcheng Xu ⋅ Qiaosheng Zhang ⋅ Chaochao Lu ⋅ Hehe Fan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 404
An Empirical Study on How Video-LLMs Answer Video Questions
Chenhui Gou ⋅ Ziyu Ma ⋅ Zicheng Duan ⋅ Haoyu He ⋅ Feng Chen ⋅ Liyang Liu ⋅ Bohan Zhuang ⋅ Jianfei Cai ⋅ Hamid Rezatofighi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 405
FPSBench: A Benchmark for Video Understanding at High Frame Rates
Rohan Choudhury ⋅ Jean Dandurand ⋅ Kai Qiu ⋅ Kshitij Madhav Bhat ⋅ Kartik Sharma ⋅ Liza Dahiya ⋅ Yizhou Zhao ⋅ Souraja Kundu ⋅ Chun-Hsien Lin ⋅ Kris Kitani ⋅ László A. Jeni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 406
UniComp: Rethinking Video Compression Through Informational Uniqueness
Chao Yuan ⋅ Shimin Chen ⋅ Minliang Lin ⋅ Limeng Qiao ⋅ Guanglu Wan ⋅ Lin Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 407
NaTex: Seamless Texture Generation as Latent Color Diffusion
Zeqiang Lai ⋅ Yunfei Zhao ⋅ Zibo Zhao ⋅ Xin Yang ⋅ Xin Huang ⋅ Jingwei Huang ⋅ Xiangyu Yue ⋅ Chunchao Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 408
Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models
Rowan Bradbury ⋅ Dazhi Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 409
Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
jian ma ⋅ Qirong Peng ⋅ Xujie Zhu ⋅ Peixing Xie ⋅ Chen Chen ⋅ Haonan Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 410
Attribute-Preserving Pseudo-Labeling for Diffusion-Based Face Swapping
Jiwon Kang ⋅ Yeji Choi ⋅ JoungBin Lee ⋅ Wooseok Jang ⋅ Jinhyeok Choi ⋅ Taekeun Kang ⋅ Yongjae Park ⋅ Myungin Kim ⋅ Seungryong Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 411
Delta Rectified Flow Sampling for Text-to-Image Editing
Gaspard Beaudouin ⋅ Minghan LI ⋅ Jaeyeon Kim ⋅ Sung-Hoon Yoon ⋅ Mengyu Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 412
Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers
Wongi Jeong ⋅ Kyungryeol Lee ⋅ Hoigi Seo ⋅ Se Young Chun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 413
SpotEdit: Selective Region Editing in Diffusion Transformers
ZHIBIN QIN ⋅ Zhenxiong Tan ⋅ Zeqing Wang ⋅ Songhua Liu ⋅ Xinchao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 414
All-in-One Slider for Attribute Manipulation in Diffusion Models
Weixin Ye ⋅ Hongguang Zhu ⋅ Wei Wang ⋅ Yahui Liu ⋅ Mengyu Wang ⋅ Xuecheng Nie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 415
DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment
Xin Cai ⋅ Zhiyuan You ⋅ Zhoutong Zhang ⋅ Tianfan Xue
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 416
From Sketch to Fresco: Efficient Diffusion Transformer with Progressive Resolution
Shikang Zheng ⋅ Guantao Chen ⋅ Landis He ⋅ Jiacheng Liu ⋅ Yuqi Lin ⋅ Chang Zou ⋅ Linfeng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 417
CATNet: Collaborative Alignment and Transformation Network for Cooperative Perception
Gong Chen ⋅ Chaokun Zhang ⋅ Tao Tang ⋅ Pengcheng Lv ⋅ Feng Li ⋅ Xin Xie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 418
Scene Reconstruction as Mapping Priors for 3D Detection
Yang Fu ⋅ Yuliang Zou ⋅ Hao Xiang ⋅ Xin Huang ⋅ Yijing Bai ⋅ Chen Song ⋅ Weijing Shi ⋅ Govind Thattai ⋅ Dragomir Anguelov ⋅ Mingxing Tan ⋅ Yingwei Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 419
CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection
Yuchen Wu ⋅ Kun Wang ⋅ Yining Pan ⋅ Na Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 420
Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection
Zhihao Zhang ⋅ Abhinav Kumar ⋅ Girish Chandar ⋅ Xiaoming Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 421
R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection
Zhongyu Xia ⋅ Yousen Tang ⋅ Yongtao Wang ⋅ Zhifeng Wang ⋅ Weijun Qin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 422
Revisiting Token Compression for Accelerating ViT-based Sparse Multi-View 3D Object Detectors
Mingqian Ji ⋅ Shanshan Zhang ⋅ Jian Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 423
Few-Shot Incremental 3D Object Detection in Dynamic Indoor Environments
Yun Zhu ⋅ Jianjun Qian ⋅ Jian Yang ⋅ Jin Xie ⋅ Na Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 424
Learning from Synthetic Data via Provenance-Based Input Gradient Guidance
Koshiro Nagano ⋅ Ryo Fujii ⋅ Ryo Hachiuma ⋅ Fumiaki Sato ⋅ Taiki Sekii ⋅ HIDEO SAITO
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 425
Seeing Clearly, Reasoning Confidently: Plug-and-Play Remedies for Vision Language Model Blindness
Xin Hu ⋅ Haomiao Ni ⋅ Yunbei Zhang ⋅ Jihun Hamm ⋅ Zechen Li ⋅ Zhengming Ding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 426
Draft and Refine with Visual Experts
SungHeon Jeong ⋅ Ryozo Masukawa ⋅ Jihong Park ⋅ Sanggeon Yun ⋅ Wenjun Huang ⋅ Hanning Chen ⋅ Mahdi Imani ⋅ Mohsen Imani
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 427
R2G: A Multi-View Circuit Graph Benchmark Suite from RTL to GDSII
ZEWEI ZHOU ⋅ Jiajun Zou ⋅ Jiajia Zhang ⋅ Ao Yang ⋅ Ruichao He ⋅ Haozheng Zhou ⋅ Ao Liu ⋅ Jiawei Liu ⋅ Leilei Jin ⋅ Shan Shen ⋅ Daying Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 428
VQ-VA World: Towards High-Quality Visual Question-Visual Answering
Chenhui Gou ⋅ Zilong Chen ⋅ Zeyu Wang ⋅ Feng Li ⋅ Deyao Zhu ⋅ Zicheng Duan ⋅ Kunchang Li ⋅ Chaorui Deng ⋅ Hongyi Yuan ⋅ Haoqi Fan ⋅ Cihang Xie ⋅ Jianfei Cai ⋅ Hamid Rezatofighi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 429
Cross-Domain Demo-to-Code via Neurosymbolic Counterfactual Reasoning
Jooyoung Kim ⋅ Wonje Choi ⋅ Younguk Song ⋅ Honguk Woo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 430
Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT
Yesheng Liu ⋅ Hao Li ⋅ Haiyu Xu ⋅ Baoqi Pei ⋅ Jiahao Wang ⋅ Mingxuan Zhao ⋅ Jing-Shu Zheng ⋅ Zheqi He ⋅ JG Yao ⋅ Xi Yang ⋅ Bowen Qin ⋅ Jiajun Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 431
See Further, Think Deeper: Advancing VLM's Reasoning Ability with Low-level Visual Cues and Reflection
Zhiheng Wu ⋅ Tong Wang ⋅ Shuning Wang ⋅ Naiming Liu ⋅ Yumeng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 432
PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning
Hee Suk Yoon ⋅ Eunseop Yoon ⋅ Ji Woo Hong ⋅ SooHwan Eom ⋅ Gwanhyeong Koo ⋅ Mark Hasegawa-Johnson ⋅ Qi Dai ⋅ Chong Luo ⋅ Chang D. Yoo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 433
μVLM: A Vision Language Model for μNPUs
Zijie Chen ⋅ Guiyun Fan ⋅ Zhaoxing Yang ⋅ Rong Ding ⋅ Haiming Jin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 434
Gaussian Mapping for Evolving Scenes
Vladimir Yugay ⋅ Thies Kersten ⋅ Luca Carlone ⋅ Theo Gevers ⋅ Martin R. Oswald ⋅ Lukas Schmid
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 435
Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting
Tianjiao Yu ⋅ Vedant Shah ⋅ Muntasir Wahed ⋅ Ying Shen ⋅ Kiet A. Nguyen ⋅ Ismini Lourentzou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 436
AnchorSplat: Feed-Forward 3D Gaussian Splatting With 3D Geometric Priors
Xiaoxue Zhang ⋅ Xiaoxu Zheng ⋅ Yixuan Yin ⋅ Tiao Zhao ⋅ Kaihua Tang ⋅ Michael Bi Mi ⋅ Zhan Xu ⋅ Dave Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 437
SGAD-SLAM: Splatting Gaussians at Adjusted Depth for Better Radiance Fields in RGBD SLAM
Pengchong Hu ⋅ Zhizhong Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 438
Faster-GS: Analyzing and Improving Gaussian Splatting Optimization
Florian Hahlbohm ⋅ Linus Franke ⋅ Martin Eisemann ⋅ Marcus Magnor
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 439
Layered 4D-Rotor Gaussian Splatting: A Compressed Representation for Long Dynamic Scenes
Hanjie Xu ⋅ Yuanxing Duan ⋅ Qiyu Dai ⋅ Ge Li ⋅ Baoquan Chen ⋅ He Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 440
GaussianGrow: Geometry-aware Gaussian Growing from 3D Point Clouds with Text Guidance
Weiqi Zhang ⋅ Junsheng Zhou ⋅ Haotian Geng ⋅ Kanle Shi ⋅ Shenkun Xu ⋅ Yi Fang ⋅ Yu-Shen Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 441
PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation
Samarth Chopra ⋅ Jing Liang ⋅ Gershom Seneviratne ⋅ Dinesh Manocha
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 442
3D Gaussian Splatting at Arbitrary Resolutions with Compact Proxy Anchors
Mingyun Jeong ⋅ Seongro Yoon ⋅ Francois Bremond ⋅ Donghyeon Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 443
Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting
Peiyu Xu ⋅ Shuang Zhao ⋅ Xin Sun ⋅ Krishna Mullia ⋅ Raymond Fei ⋅ Iliyan Georgiev
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 444
AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction
Hanyang Liu ⋅ Rongjun Qin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 445
GaussianPile: A Unified Sparse Gaussian Splatting Framework for Slice-based Volumetric Reconstruction
Di Kong ⋅ Yikai Wang ⋅ Wenjie Guo ⋅ Yifan Bu ⋅ Boya Zhang ⋅ Yuexin Duan ⋅ Xiawei Yue ⋅ Wenbiao Du ⋅ Yiman Zhong ⋅ Yuwen Chen ⋅ Cheng Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 446
More Natural, More Real: Object-aware Gaussian Splatting for 3D Visual Decoding from Human Brain
Haodong Jing ⋅ Dongyao Jiang ⋅ Jixin Wang ⋅ Junhao Jia ⋅ Yanshu Li ⋅ Yongqiang Ma ⋅ Nanning Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 447
Eulerian Gaussian Splatting using Hashed Probability Pyramids
Mia Gaia Polansky ⋅ George Kopanas ⋅ Stephan Garbin ⋅ Todd Zickler ⋅ Dor Verbin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 448
Confidence-Guided Multi-Scale Aggregation for Sparse-View High-Resolution 3D Gaussian Splatting
Qinzheng Zhou ⋅ Zaychik Liu ⋅ Lijing Lu ⋅ Zhihang Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 449
ULF-Loc: Unbiased Landmark Feature for Robust Visual Localization with 3D Gaussian Splatting
Yingdong Gu ⋅ Shaocheng Yan ⋅ Zhenjun Zhao ⋅ Yuan Kou ⋅ Jianxin Luo ⋅ Pengcheng Shi ⋅ Jiayuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 450
Robust3DGSW: Toward Robust Watermarking for Quantization-Aware 3D Gaussian Splatting
Boyu Wang ⋅ Jun Xia ⋅ Mingsong Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 451
ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking
Xiaobao Wei ⋅ Zhangjie Ye ⋅ Yuxiang Gu ⋅ Zunjie Zhu ⋅ Yunfei Guo ⋅ Yingying Shen ⋅ Shan Zhao ⋅ Ming Lu ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Guang Chen ⋅ Rongfeng Lu ⋅ Hangjun Ye
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 452
L^2DGS: Low-Light Dynamic Gaussian Splatting
Ashish Kumar ⋅ A. N. Rajagopalan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 453
Probabilistic Concept Graph Reasoning for Multimodal Misinformation Detection
Ruichao Yang ⋅ Wei Gao ⋅ Xiaobin Zhu ⋅ Jing Ma ⋅ Hongzhan Lin ⋅ Ziyang Luo ⋅ Bo-Wen Zhang ⋅ Xu-Cheng Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 454
POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs
Haicheng Wang ⋅ Yuan Liu ⋅ Yikun Liu ⋅ Zhemeng Yu ⋅ Zhongyin Zhao ⋅ Yangxiu You ⋅ Zilin Yu ⋅ Le Tian ⋅ Zhou Xiao ⋅ Jie Zhou ⋅ Weidi Xie ⋅ Yanfeng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 455
SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation
Zhenyu Lu ⋅ Liupeng Li ⋅ Jinpeng Wang ⋅ Haoqian Kang ⋅ Yan Feng ⋅ Ke Chen ⋅ Yaowei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 456
CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning
Junyoung Sung ⋅ Seungwoo Lyu ⋅ Minjun Kim ⋅ Sumin An ⋅ Arsha Nagrani ⋅ Paul Hongsuck Seo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 457
DeepScan: A Training-Free Framework for Visually Grounded Reasoning in Large Vision-Language Models
Yangfu Li ⋅ Hongjian Zhan ⋅ Jiawei Chen ⋅ YUNING GONG ⋅ Qi Liu ⋅ Yue Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 458
Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images
Yikun Ji ⋅ Yan Hong ⋅ Bowen Deng ⋅ Jun Lan ⋅ Huijia Zhu ⋅ Weiqiang Wang ⋅ Liqing Zhang ⋅ Jianfu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 459
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
Jiajun Zhang ⋅ Shijia Luo ⋅ Ruikang Zhang ⋅ Qi Su
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 460
CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning
Qi Song ⋅ Honglin Li ⋅ Yingchen Yu ⋅ Haoyi Zhou ⋅ Lin Yang ⋅ Song Bai ⋅ Qi She ⋅ Zilong Huang ⋅ Yunqing Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 461
Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token
Anqi Zhang ⋅ Xiaokang Ji ⋅ Guangyu Gao ⋅ Jianbo Jiao ⋅ Chi Harold Liu ⋅ Yunchao Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 462
Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models
SIQI LIU ⋅ Xinyang Li ⋅ Bochao Zou ⋅ Junbao Zhuo ⋅ Huimin Ma ⋅ Jiansheng Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 463
Mario: Multimodal Graph Reasoning with Large Language Models
Yuanfu Sun ⋅ Kang Li ⋅ Pengkang Guo ⋅ Jiajin Liu ⋅ Qiaoyu Tan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 464
Boosting Reasoning in Large Multimodal Models via Activation Replay
Yun Xing ⋅ Xiaobin Hu ⋅ Qingdong He ⋅ Jiangning Zhang ⋅ Shuicheng Yan ⋅ Shijian Lu ⋅ Yu-Gang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 465
Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought
Shin'ya Yamaguchi ⋅ Kosuke Nishida ⋅ Daiki Chijiwa
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 466
Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding
Jianghao Yin ⋅ Qingbin Li ⋅ KUN SUN ⋅ Cheng Ding ⋅ Jie Wang ⋅ Qin Chen ⋅ Jie Zhou ⋅ Nan Wang ⋅ Changqing Li ⋅ Pei Wu ⋅ Jian Xu ⋅ Zheming Yang ⋅ Liang He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 467
ROSE: Rotate Your Large Language Model to See
Tongtian Yue ⋅ Xuange Gao ⋅ Longteng Guo ⋅ Zijia Zhao ⋅ Zikang Liu ⋅ Jie Jiang ⋅ Hua Huang ⋅ Jing Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 468
OpenMMReasoner: Pushing the Frontiers in Multimodal Reasoning with an Open and General Recipe
Kaichen Zhang ⋅ Keming Wu ⋅ Zuhao Yang ⋅ Bo Li ⋅ Kairui Hu ⋅ Bin Wang ⋅ Xingxuan Li ⋅ Lidong Bing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 469
SelecTKD: Selective Token-Weighted Knowledge Distillation for LLMs
Haiduo Huang ⋅ Jiangcheng Song ⋅ Yadong Zhang ⋅ Pengju Ren
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 470
Sparsity as a Key: Unlocking New Insights from Latent Structures for Out-of-Distribution Detection
Ahyoung Oh ⋅ Wonseok Shin ⋅ Songkuk Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 471
SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration
Zekun Li ⋅ wang ning ⋅ Tongxin Bai ⋅ Changwang Mei ⋅ Ning Wang ⋅ Shuang Qiu ⋅ Jian Cheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 472
Suppressing Non-Semantic Noise in Masked Image Modeling Representations
Martine Hjelkrem-Tan ⋅ Marius Aasan ⋅ Rwiddhi Chakraborty ⋅ Gabriel Y. Arteaga ⋅ Changkyu Choi ⋅ Adín Ramírez Rivera
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 473
Block-based Learned Image Compression without Blocking Artifacts
Jong Wook Kim ⋅ Suyong Bahk ⋅ TaeHwa Lee ⋅ HyunDong CHO ⋅ Donghyun Kim ⋅ Sung-Chang Lim ⋅ Jin Soo Choi ⋅ Hui Yong Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 474
DeDelayed: Deleting Remote Inference Delay via On-Device Correction
Dan Jacobellis ⋅ Mateen Ulhaq ⋅ Fabien Racapé ⋅ Hyomin Choi ⋅ Neeraja Yadwadkar
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 475
AdaRadar: Rate Adaptive Spectral Compression for Radar-based Perception
Jinho Park ⋅ Se Young Chun ⋅ Mingoo Seok
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 476
Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery
Yiming Zeng ⋅ Xile Zhao ⋅ Wei-Hao Wu ⋅ Teng-Yu Ji ⋅ Chao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 477
Precise Object and Effect Removal with Adaptive Target-Aware Attention
Jixin Zhao ⋅ Zhouxia Wang ⋅ Peiqing Yang ⋅ Shangchen Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 478
Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression
Nazia Tasnim ⋅ Shrimai Prabhumoye ⋅ Bryan A. Plummer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 479
FreqSIC: Frequency-aware Stereo Image Compression with Bi-directional Checkerboard Context Model
Shiyu Qin ⋅ Yongkang Lu ⋅ Yimin Zhou ⋅ Jiawei Li ⋅ Yifan Ren ⋅ Yuerong Xue ⋅ Shu-Tao Xia ⋅ Bin Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 480
SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
CHEN Yang ⋅ Xieyuanli Chen ⋅ Junxiang Li ⋅ Jie Tang ⋅ Tao Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 481
Fusion of Depth and Semantics for Probabilistic Floorplan Localization
Kecheng Ye ⋅ Mao Chen ⋅ Xiangkai Zhang ⋅ Xu Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 482
A2GC: Asymmetric Aggregation with Geometric Constraints for Locally Aggregated Descriptors
Zhenyu Li ⋅ Tianyi Shang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 483
Geo2: Geometry-Guided Cross-view Geo-Localization and Image Synthesis
Yancheng Zhang ⋅ Xiaohan Zhang ⋅ Guangyu Sun ⋅ Zonglin Lyu ⋅ Safwan Wshah ⋅ Chen Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 484
Coverage Optimization for Camera View Selection
Timothy Chen ⋅ Adam Dai ⋅ Maximilian Adang ⋅ Grace Gao ⋅ Mac Schwager
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 485
Resolving Evidence Sparsity: Agentic Context Engineering for Long-Document Understanding
Keliang Liu ⋅ Zizhi Chen ⋅ Mingcheng Li ⋅ Jingqun Tang ⋅ Dingkang Yang ⋅ Lihua Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 486
Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs
Rujiao Long ⋅ Yang Li ⋅ Xingyao Zhang ⋅ Weixun Wang ⋅ Tianqianjin Lin ⋅ Xi Zhao ⋅ Yuchi Xu ⋅ Wenbo Su ⋅ Junchi Yan ⋅ Bo Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 487
ORCA: Orchestrated Reasoning with Collaborative Agents for Document Visual Question Answering
Aymen Lassoued ⋅ Mohamed Ali Souibgui ⋅ Yousri Kessentini
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 488
MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding
Wenhui Tan ⋅ Xiaoyi Yu ⋅ Jiaze Li ⋅ Yijing Chen ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Ruihua Song ⋅ Jian Luan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 489
A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning
Yichang Xu ⋅ Gaowen Liu ⋅ Ramana Kompella ⋅ Tiansheng Huang ⋅ Sihao Hu ⋅ Fatih Ilhan ⋅ Selim Tekin ⋅ Zachary Yahn ⋅ Ling Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 490
Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning
Jingbo Sun ⋅ Qichao Zhang ⋅ Songjun Tu ⋅ Xing Fang ⋅ Yupeng Zheng ⋅ Haoran Li ⋅ Ke Chen ⋅ Dongbin Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 491
LensWalk: Agentic Video Understanding by Planning How You See in Videos
Keliang Li ⋅ Yansong Li ⋅ Hongze Shen ⋅ Mengdi Liu ⋅ Hong Chang ⋅ Shiguang Shan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 492
DPGF-Net: Dual-Prior Guided Fusion Network for Joint Assessment of Perceptual Quality and Semantic Consistency in AI-Generated Images
Tao Li ⋅ Xingran LIAO ⋅ Mingliang Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 493
RegionFuse: Region-Adaptive Pixel Distribution Learning for Infrared and Visible Image Fusion
Jianghan Xia ⋅ Hong Song ⋅ Jinfu Li ⋅ Yucong Lin ⋅ Shihan Ma ⋅ Jingfan Fan ⋅ Danni Ai ⋅ Tianyu Fu ⋅ Deqiang Xiao ⋅ Jian Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 494
Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared
Yafei Zhang ⋅ Meng Ma ⋅ Huafeng Li ⋅ Yu Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 495
VideoFusion: A Spatio-Temporal Collaborative Network for Multi-modal Video Fusion
Linfeng Tang ⋅ Yeda Wang ⋅ Meiqi Gong ⋅ Zizhuo Li ⋅ Yuxin Deng ⋅ Xunpeng Yi ⋅ Chunyu Li ⋅ Han Xu ⋅ HAO ZHANG ⋅ Jiayi Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 496
TAPE: Task-Adaptive Prototype Evolution in Audio-Language Models for Fully Few-shot Class-incremental Audio Classification
Yunlong Gao ⋅ Wenxin Liang ⋅ Guanglu Wang ⋅ Senqi Guan ⋅ Linlin Zong ⋅ Dongyu Zhang ⋅ Xinyue Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 497
Remedying Target-Domain Astigmatism for Cross-Domain Few-Shot Object Detection
Yongwei Jiang ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 498
DDSF: Robust Few-Shot Learning via Disentangled Subspaces with Determinantal Point Process
xulun ye ⋅ Yifan Mei ⋅ Kun Zhou ⋅ Zelei Wu ⋅ Jieyu Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 499
Hyperbolic Defect Feature Synthesis for Few-Shot Defect Classification
Huimin Li ⋅ Boxuan Hu ⋅ Yulin Zhang ⋅ Xiuzhuang Zhou ⋅ Junlin Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 500
Training-Only Heterogeneous Image-Patch-Text Graph Supervision for Advancing Few-Shot Learning Adapters
Mohammed Rahman Sherif Khan Mohammad ⋅ Ardhendu Behera ⋅ Sandip Pradhan ⋅ Swagat Kumar ⋅ Amr Ahmed
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 501
Learning to Learn Weight Generation via Local Consistency Diffusion
Yunchuan Guan ⋅ Yu Liu ⋅ Ke Zhou ⋅ Zhiqi Shen ⋅ Jenq-Neng Hwang ⋅ Lei Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 502
Balanced Dataset Distillation via Modeling Multiple Visual Pattern Distribution
Guanghui Shi ⋅ Xuefeng Liang ⋅ Qixiang Wen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 503
Grid Distillation: Compositional Image Distillation via Structured Generative Grids
Biplab Ch Das ⋅ Shouvik Das ⋅ Viswanath Gopalakrishnan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 504
Dataset Distillation by Influence Matching
Haoru Tan ⋅ Wang Wang ⋅ WU Sitong ⋅ Xiuzhe Wu ⋅ Yangtian Sun ⋅ Chirui Chang ⋅ Shaofeng Zhang ⋅ Xiaojuan Qi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 505
StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning
Giuseppe Vecchio
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 506
Seeing Through Blur: Tackling Defocus in Spike-Based Imaging
Xiantao Ma ⋅ Siwei Dong ⋅ Lin Zhu ⋅ Lizhi Wang ⋅ Hua Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 507
Distilling Quasi-Conformal Mapping: A Generalizable and Efficient Solution for Wide-Angle Correction
Chengyang Liu ⋅ Zixuan Lin ⋅ Miaolin Han ⋅ Michael K. Ng ⋅ huibin Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 508
Lighting in Motion: Spatiotemporal HDR Lighting Estimation
Christophe Bolduc ⋅ Julien Philip ⋅ Li Ma ⋅ Mingming He ⋅ Paul Debevec ⋅ Jean-François Lalonde
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 509
LightRR: A Lightweight Network for Single Image Reflection Removal
Wenbin Yin ⋅ Junkang Zhang ⋅ Sunzhe Yang ⋅ Faming Fang ⋅ Guixu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 510
HFR and HDR Video from Multi-Attenuated Spikes Using a Rapidly Rotating SpokeND Filter
Yakun Chang ⋅ Zhaojun Huang ⋅ Siqi Yang ⋅ Yeliduosi Xiaokaiti ⋅ Shikui Wei ⋅ Yao Zhao ⋅ Tiejun Huang ⋅ Boxin Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 511
Coded-E2LF: Coded Aperture Light Field Imaging from Events
Tomoya Tsuchida ⋅ Keita Takahashi ⋅ Chihiro Tsutake ⋅ Toshiaki Fujii ⋅ Hajime Nagahara
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 512
TokenLight: Precise Lighting Control in Images using Attribute Tokens
Sumit Chaturvedi ⋅ Yannick Hold-Geoffroy ⋅ Mengwei Ren ⋅ Jingyuan Liu ⋅ He Zhang ⋅ Yiqun Mei ⋅ Julie Dorsey ⋅ ZHIXIN SHU
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 513
Kaleidoscopic Scintillation Event Imaging
Alex Bocchieri ⋅ John Mamish ⋅ David Appleyard ⋅ Andreas Velten
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 514
gQIR: Generative Quanta Image Reconstruction
Aryan Garg ⋅ Sizhuo Ma ⋅ Mohit Gupta
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 515
Solving Minimal Problems Without Matrix Inversion Using FFT-Based Interpolation
Haidong Wu ⋅ Snehal Bhayani ⋅ Janne Heikkilä
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 516
Predicting Spatial Transcriptomics from Histology Images via High-Order Multi-Cell Interaction Modeling
Youhan Sun ⋅ Jiahua Rao ⋅ Kangrui Du ⋅ Jiancong Xie ⋅ Yuedong Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 517
From Spots to Pixels: Dense Spatial Gene Expression Prediction from Histology Images
Ruikun Zhang ⋅ Yan Yang ⋅ Liyuan Pan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 518
Cell-Type Prototype-Informed Neural Network for Gene Expression Estimation from Pathology Images
Kazuya Nishimura ⋅ Ryoma Bise ⋅ Shinnosuke Matsuo ⋅ Haruka Hirose ⋅ Yasuhiro Kojima
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 519
LightSplat: Fast and Memory-Efficient Open-Vocabulary 3D Scene Understanding in Five Seconds
Jaehun Bang ⋅ Jinhyeok Kim ⋅ Minji Kim ⋅ Seungheon Jeong ⋅ Kyungdon Joo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 520
Guardians of the Hair: Rescuing Soft Boundaries in Depth, Stereo, and Novel Views
Xiang Zhang ⋅ Yang Zhang ⋅ Lukas Mehl ⋅ Markus Gross ⋅ Christopher Schroers
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 521
Zero-Shot Depth Completion with Vision-Language Model
Zhiqiang Yan ⋅ Yuan Wu ⋅ Gim Hee Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 522
FE2E: From Editor to Dense Geometry Estimator
jiyuan WANG ⋅ Chunyu Lin ⋅ Lei Sun ⋅ Rongying Liu ⋅ Lang Nie ⋅ Mingxing Li ⋅ Kang Liao ⋅ Xiangxiang Chu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 523
Ego-1K – A Large-Scale Multiview Video Dataset for Egocentric Vision
Jae Yong Lee ⋅ Daniel Scharstein ⋅ Akash Bapat ⋅ Hao Hu ⋅ Andrew Fu ⋅ Haoru Zhao ⋅ Paul Sammut ⋅ Xiang Li ⋅ Stephen Jeapes ⋅ Anik Gupta ⋅ Lior David ⋅ Saketh Madhuvarasu ⋅ Jay Girish Joshi ⋅ Jason Wither
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 524
Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing
SeongRae Noh ⋅ SeungWon Seo ⋅ Gyeong-Moon Park ⋅ HyeongYeop Kang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 525
VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation
Jiayi Yuan ⋅ Haobo Jiang ⋅ De Wen Soh ⋅ Na Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 526
NI-Tex: Non-isometric Image-based Garment Texture Generation
Hui Shan ⋅ Ming Li ⋅ Haitao Yang ⋅ Kai Zheng ⋅ Sizhe Zheng ⋅ Yanwei Fu ⋅ Xiangru Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 527
Velox: Learning Representations of 4D Geometry and Appearance
Anagh Malik ⋅ Dorian Chan ⋅ Xiaoming Zhao ⋅ David B. Lindell ⋅ Oncel Tuzel ⋅ Rick Chang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 528
UniPixie: Unified and Probabilistic 3D Physics Learning via Flow Matching
Qilin Huang ⋅ Quynh Anh Huynh ⋅ Long Le ⋅ Chen Wang ⋅ Chuhao Chen ⋅ Ryan Lucas ⋅ Eric Eaton ⋅ Lingjie Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 529
UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
Yixun Liang ⋅ Kunming Luo ⋅ Xiao Chen ⋅ Rui Chen ⋅ Jiawei Zhou ⋅ Weiyu Li ⋅ Jiarui Liu ⋅ Fei-Peng Tian ⋅ Ping Tan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 530
Points-to-3D: Structure-Aware 3D Generation with Point Cloud Priors
Jiatong Xia ⋅ Zicheng Duan ⋅ Anton van den Hengel ⋅ Lingqiao Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 531
PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion
Yichen Yang ⋅ Hong Li ⋅ Haodong Zhu ⋅ linin ⋅ guojun lei ⋅ Sheng Xu ⋅ Baochang Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 532
LoST: Level of Semantics Tokenization for 3D Shapes
Niladri Shekhar Dutt ⋅ Zifan Shi ⋅ Paul Guerrero ⋅ Chun-Hao Huang ⋅ Duygu Ceylan ⋅ Niloy J. Mitra ⋅ Xuelin Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 533
Lafite: A Generative Latent Field for 3D Native Texturing
Chia-Hao Chen ⋅ Yuanchen Guo ⋅ Zi-Xin Zou ⋅ Ze Yuan ⋅ Guan Luo ⋅ Xiaojuan Qi ⋅ Ding Liang ⋅ Yan-Pei Cao ⋅ Song-Hai Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 534
Image-Guided Geometric Stylization of 3D Meshes
Changwoon Choi ⋅ Hyunsoo Lee ⋅ Clément Jambon ⋅ Yael Vinker ⋅ Young Min Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 535
LATTICE: Democratize High-Fidelity 3D Generation at Scale
Zeqiang Lai ⋅ Yunfei Zhao ⋅ Zibo Zhao ⋅ Haolin Liu ⋅ Qingxiang Lin ⋅ Jingwei Huang ⋅ Chunchao Guo ⋅ Xiangyu Yue
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 536
Dehallu3D: Hallucination-Mitigated 3D Generation from a Single Image via Cyclic View Consistency Refinement
Xiwen Wang ⋅ Shichao Zhang ⋅ Ruowei Wang ⋅ mao li ⋅ Chenyu Zhou ⋅ Ji-Zhe Zhou ⋅ Qijun Zhao ⋅ Hailun Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 537
MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly
Rui Xu ⋅ Tianyang Xue ⋅ Qiujie Dong ⋅ Le Wan ⋅ Zhe Zhu ⋅ Peng Li ⋅ Zhiyang Dou ⋅ Cheng Lin ⋅ Shiqing Xin ⋅ Yuan Liu ⋅ Wenping Wang ⋅ Taku Komura
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 538
TacSIm: A Dataset and Benchmark for Football Tactical Style Imitation
Peng Wen ⋅ Yuting Wang ⋅ Qiurui Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 539
DynamicsBoost: Dynamic Plausible Video Generation via Annotation-Free Continuation Preference Optimization
Jiaxing Li ⋅ Jiepeng Wang ⋅ Junyao Gao ⋅ Yang Liu ⋅ Eric Li ⋅ Bo An ⋅ Hao-Xiang Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 540
Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition
Xuemei Jia ⋅ Jiawei Du ⋅ Hui Wei ⋅ Jun Chen ⋅ Joey Tianyi Zhou ⋅ Zheng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 541
Fine-Grained GRPO for Precise Preference Alignment in Flow Models
Yujie Zhou ⋅ Pengyang Ling ⋅ Jiazi Bu ⋅ Yibin Wang ⋅ Yuhang Zang ⋅ Jiaqi Wang ⋅ Li Niu ⋅ Guangtao Zhai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 542
Lighting-grounded Video Generation with Renderer-based Agent Reasoning
Ziqi Cai ⋅ Taoyu Yang ⋅ Zheng Chang ⋅ Si Li ⋅ Han Jiang ⋅ Shuchen Weng ⋅ Boxin Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 543
RewardFlow: Generate Images by Optimizing What You Reward
Onkar Susladkar ⋅ Dong-Hwan Jang ⋅ Tushar Prakash ⋅ Adheesh Juvekar ⋅ Vedant Shah ⋅ Ayush Barik ⋅ Nabeel Bashir ⋅ Muntasir Wahed ⋅ Ritish Shrirao ⋅ Ismini Lourentzou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 544
Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals
Nate Gillman ⋅ Yinghua Zhou ⋅ Zitian Tang ⋅ Evan Luo ⋅ Arjan Chakravarthy ⋅ Daksh Aggarwal ⋅ Michael Freeman ⋅ Chen Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 545
Self-Corrected Image Generation with Explainable Latent Rewards
Yinyi Luo ⋅ Hrishikesh Gokhale ⋅ Marios Savvides ⋅ Jindong Wang ⋅ Shengfeng He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 546
Polyphony: Diffusion-based Dual-Hand Action Segmentation with Alternating Vision Transformer and Semantic Conditioning
Hao Zheng ⋅ Hu Wang ⋅ Tiantian Zheng ⋅ Prajjwal Bhattarai ⋅ Tuka Alhanai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 547
Reading Your Actions: Learning Generalizable Action Representations via Pre-training AEMG
Zhenghao Huang ⋅ Kaikai Wang ⋅ HUILIN YAO ⋅ Lin Shu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 548
MA-Bench: Towards Fine-grained Micro-Action Understanding
Kun Li ⋅ Jihao Gu ⋅ Fei Wang ⋅ zhiliang wu ⋅ Hehe Fan ⋅ Dan Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 549
OpenMarcie: Dataset for Multimodal Action Recognition in Industrial Environments
Hymalai Bello ⋅ Lala Ray ⋅ Joanna Sorysz ⋅ Sungho Suh ⋅ Paul Lukowicz
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 550
Action Motifs: Self-Supervised Hierarchical Representation of Human Body Movements
Genki Kinoshita ⋅ Shu Nakamura ⋅ Ryo Kawahara ⋅ Shohei Nobuhara ⋅ Yasutomo Kawanishi ⋅ Ko Nishino
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 551
DarkShake-DVS: Event-based Human Action Recognition under Low-light and Shaking Camera Conditions
Jiaqi Chen ⋅ Qinfu Xu ⋅ Liyuan Pan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 552
Protect to Adapt: Subspace-Constrained Adaptation with Ranked Negative Prompt Feedback for Few-Shot Action Recognition
Hantao Qi ⋅ Yan Yan ⋅ Junlong Gao ⋅ Hanzi Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 553
SkeletonContext: Skeleton-side Context Prompt Learning for Zero-Shot Skeleton-based Action Recognition
Ning Wang ⋅ Tieyue Wu ⋅ Naeha Sharif ⋅ Farid Boussaid ⋅ Guangming Zhu ⋅ Lin Mei ⋅ Mohammed Bennamoun ⋅ Liang Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 554
InTrain: Intrinsic Trainability for Zero-Cost Neural Architecture Search
Qinqin Zhou ⋅ Fuhai Chen ⋅ Jipeng Wu ⋅ Zhiwei Chen ⋅ Zhikai Hu ⋅ Weiwei Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 555
S^2FT: Parameter-Efficient Fine-Tuning in Sparse Spectrum Domain
Baoquan Zhang ⋅ Zhehao Yu ⋅ Lisai Zhang ⋅ Kenghong Lin ⋅ Tianran Chen ⋅ Yuxi Sun ⋅ Yunming Ye ⋅ Yao He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 556
Rethinking SNN Online Training and Deployment: Gradient-Coherent Learning via Hybrid-Driven LIF Model
Zecheng Hao ⋅ Yifan Huang ⋅ Zijie Xu ⋅ Wenxuan Liu ⋅ Yuanhong Tang ⋅ Zhaofei Yu ⋅ Tiejun Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 557
Gated KalmaNet: A Fading Memory Layer through Test-time Ridge Regression
Liangzu Peng ⋅ Aditya Chattopadhyay ⋅ Luca Zancato ⋅ Elvis Nunez ⋅ Wei Xia ⋅ Stefano Soatto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 558
Towards Efficient Medical Reasoning with Minimal Fine-Tuning Data
Xinlin Zhuang ⋅ feilong tang ⋅ Haolin Yang ⋅ Xiwei Liu ⋅ Ming Hu ⋅ Huifa Li ⋅ Haochen Xue ⋅ Junjun He ⋅ Zongyuan Ge ⋅ Yichen Li ⋅ Ying Qian ⋅ Imran Razzak
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 559
AdaBet: Gradient-free Layer Selection for Efficient Training of Deep Neural Networks
Irene Tenison ⋅ Soumyajit Chatterjee ⋅ Fahim Kawsar ⋅ Mohammad Malekzadeh
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 560
TAS-LoRA: Transformer Architecture Search with Mixture-of-LoRA Experts
Jeimin Jeon ⋅ Hyunju Lee ⋅ Bumsub Ham
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 561
QuCNet: Quantum Deep Learning Driven Multi-Circuit Network for Remote Sensing Image Classification
Komal Komal ⋅ Mukul Gupta ⋅ Saumya Singh ⋅ SANTOSH VIPPARTHI ⋅ Chakradhar Reddy Chandupatla ⋅ Subrahmanyam Murala
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 562
Learning to Solve PDEs on Neural Shape Representations
Lilian Welschinger ⋅ Yilin Liu ⋅ Zican Wang ⋅ Niloy J. Mitra
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 563
Frequency Switching Mechanism for Parameter-Efficient Multi-Task Learning
Shih-Wen Liu ⋅ Yen-Chang Chen ⋅ Wei-Ta Chu ⋅ Fu-En Yang ⋅ Yu-Chiang Frank Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 564
Reconstructing Spiking Neural Networks Using a Single Neuron with Autapses
Wuque Cai ⋅ Hongze Sun ⋅ Quan Tang ⋅ Shifeng Mao ⋅ Zhenxing Wang ⋅ Jiayi He ⋅ Duo Chen ⋅ Dezhong Yao ⋅ Daqing Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 565
Widget2Code: From Visual Widgets to UI Code via Multimodal LLMs
Houston H. Zhang ⋅ TAO ZHANG ⋅ Baoze Lin ⋅ Yuanqi Xue ⋅ Yincheng Zhu ⋅ Huan Liu ⋅ Li Gu ⋅ Linfeng Ye ⋅ Ziqiang Wang ⋅ Xinxin Zuo ⋅ Yang Wang ⋅ YUANHAO YU ⋅ Zhixiang Chi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 566
GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents
Yang Li ⋅ Yuchen Liu ⋅ Haoyu Lu ⋅ Zhiqiang Xia ⋅ Hongzhen Wang ⋅ Kaiyang Han ⋅ Changpeng Yang ⋅ Jinyang Wu ⋅ Jiaming Xu ⋅ Runyu Shi ⋅ Ying Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 567
FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
Mingyu Ouyang ⋅ Kevin Qinghong Lin ⋅ Mike Zheng Shou ⋅ Hwee Tou Ng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 568
Streamlined Open-Vocabulary Human-Object Interaction Detection
Chang Sun ⋅ Dongliang Liao ⋅ Changxing Ding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 569
Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection
SA ZHU ⋅ Wanqian Zhang ⋅ Lin Wang ⋅ Xiaohua Chen ⋅ Chenxu Cui ⋅ Jinchao Zhang ⋅ Bo Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 570
Mitigating Simplicity Bias in OOD Detection through Object Co-occurrence Analysis
Boyang Dai ⋅ Chaoqi Chen ⋅ Yizhou Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 571
Boosting Quantitive and Spatial Awareness for Zero-Shot Object Counting
Da Zhang ⋅ Bingyu Li ⋅ Feiyu Wang ⋅ Zhiyuan Zhao ⋅ Junyu Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 572
Parameter-Efficient Semantic Augmentation for Enhancing Open-Vocabulary Object Detection
Weihao Cao ⋅ Runqi Wang ⋅ Xiaoyue Duan ⋅ Jinchao Zhang ⋅ Ang Yang ⋅ Liping Jing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 573
WeDetect: Fast Open-Vocabulary Object Detection as Retrieval
Shenghao Fu ⋅ Yukun Su ⋅ Fengyun Rao ⋅ Jing LYU ⋅ Xiaohua Xie ⋅ Wei-Shi Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 574
Open-Vocabulary Domain Generalization in Urban-Scene Segmentation
Dong Zhao ⋅ Qi Zang ⋅ Nan Pu ⋅ Wenjing Li ⋅ Nicu Sebe ⋅ Zhun Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 575
OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery
Qi Guo ⋅ Jue Wang ⋅ Yinhe Liu ⋅ Yanfei Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 576
Annotation-Efficient Coreset Selection for Context-dependent Segmentation
jin zhang ⋅ Zhe Cao ⋅ Biwen Yang ⋅ Ruiheng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 577
ALLNet: Multi-task Dense Prediction for Degraded Images
Weiran Wang ⋅ Jialing Wu ⋅ Yaqi Chang ⋅ Gang He ⋅ Li Xu ⋅ Chang Wu ⋅ Yunsong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 578
Geometry-Aware Cross-Modal Graph Alignment for Referring Segmentation in 3D Gaussian Splatting
Yuwen Tao ⋅ Kanglei Zhou ⋅ Chang Li ⋅ Liyuan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 579
Volumetric Functional Maps
Filippo Maggioli ⋅ Simone Melzi ⋅ Marco Livesu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 580
GenMask: Adapting DiT for Segmentation via Direct Mask Generation
Yang yuhuan ⋅ Xianwei Zhuang ⋅ Yuxuan Cai ⋅ Chaofan Ma ⋅ Shuai Bai ⋅ Jiangchao Yao ⋅ Ya Zhang ⋅ Junyang Lin ⋅ Yanfeng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 581
Frequency-Aware Affinity for Weakly Supervised Semantic Segmentation
Ziqian Yang ⋅ Xianglin Qiu ⋅ Xinqiao Zhao ⋅ Xiaolei Wang ⋅ Quan Zhang ⋅ Jimin Xiao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 582
Learning and Aligning Click-Aware Shape Prior for Interactive Amodal Instance Segmentation
Junjie Chen ⋅ Junwei Lin ⋅ Ren Hong ⋅ Shengjie Liu ⋅ Yuming Fang ⋅ Feng Qian ⋅ Yifan Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 583
Beyond Reassembly: Fractured Object Recovery with Missing Parts
Qun-Ce Xu ⋅ Jiahui Li ⋅ Yan-Pei Cao ⋅ Weihao Cheng ⋅ Tai-Jiang Mu ⋅ Ying Shan ⋅ Chuan Li ⋅ Da Chen ⋅ Yong-Liang Yang ⋅ Shi-Min Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 584
Best Segmentation Buddies for Image-Shape Correspondence
Itai Lang ⋅ Dongwei Lyu ⋅ Dale Decatur ⋅ Rana Hanocka
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 585
RMAE-ProGRess: Advancing Semantic Segmentation in Unstructured Environments
Manish Bhurtel ⋅ Danda B. Rawat
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 586
Local Precise Refinement: A Dual-Gated Mixture-of-Experts for Enhancing Foundation Model Generalization against Spectral Shifts
Xi Chen ⋅ Maojun Zhang ⋅ Yu Liu ⋅ Shen Yan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 587
Orthogonal Spatial-Aware Multi-View Anchor Graph Clustering for Incomplete Remote Sensing Data
Yongshan Zhang ⋅ Xiaohuan Lin ⋅ Lefei Zhang ⋅ Zhihua Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 588
SIGMA: A Physics-Based Benchmark for Gas Chimney Understanding in Seismic Images
Bao Truong ⋅ Quang Nguyen ⋅ Baoru Huang ⋅ Jinpei Han ⋅ Van Nguyen ⋅ Ngan Le ⋅ Minh-Tan Pham ⋅ Doan Huy Hien ⋅ Anh Nguyen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 589
SkySense-VITA: Towards Universal In-context Segmentation of Multi-modal Remote Sensing Imagery
Kang Wu ⋅ Lei Yu ⋅ Junwei Luo ⋅ Bo Dang ⋅ Junjian Zhang ⋅ Xiangyuan Cai ⋅ Hongwei Hu ⋅ Jingdong Chen ⋅ Yansheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 590
ProM3E: Probabilistic Masked MultiModal Embedding Model for Ecology
Srikumar Sastry ⋅ Subash Khanal ⋅ Aayush Dhakal ⋅ Jiayu Lin ⋅ Daniel Cher ⋅ Phoenix Jarosz ⋅ Nathan Jacobs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 591
GeoCoT: Towards Reliable Remote Sensing Reasoning with Manifold Perspective
Daixun Li ⋅ Zirui Li ⋅ Sibo He ⋅ Jiayun Tian ⋅ Mingxiang Cao ⋅ Weiying Xie ⋅ Yunke Wang ⋅ Xin Zhang ⋅ Yusi Zhang ⋅ Yunsong Li ⋅ Chang Xu ⋅ Leyuan Fang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 592
STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting
Hao Chen ⋅ Tao Han ⋅ Jie ZHANG ⋅ Song Guo ⋅ Lei Bai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 593
NeighborMAE: Exploiting Spatial Dependencies between Neighboring Earth Observation Images in Masked Autoencoders Pretraining
Liang Zeng ⋅ Valerio Marsocci ⋅ Wufan Zhao ⋅ Andrea Nascetti ⋅ Maarten Vergauwen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 594
GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding
Jiaqi Liu ⋅ Ronghao Fu ⋅ Haoran Liu ⋅ Lang Sun ⋅ Qipeng Wang ⋅ Bo Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 595
Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images
Jingzhou Chen ⋅ Dexin Chen ⋅ Fengchao Xiong ⋅ Yuntao Qian ⋅ Liang Xiao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 596
Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting
Alabi Mehzabin Anisha ⋅ Guangjing Wang ⋅ Sriram Chellappan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 597
Improving Adversarial Transferability with Local Perturbation Augmentation
Jian-Xun Mi ⋅ Xuanhui Zhong ⋅ Weisheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 598
Echoes of Ownership: Adversarial-Guided Dual Injection for Copyright Protection in MLLMs
Chengwei Xia ⋅ Fan Ma ⋅ Ruijie Quan ⋅ Yunqiu Xu ⋅ Kun Zhan ⋅ Yi Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 599
Stealing Split Learning Bottom Models by Recovering Embedding Geometry
Qinbo Zhang ⋅ Yanhang Shi ⋅ Ziyi Zhang ⋅ Hao Wang ⋅ Sai Qian Zhang ⋅ Jian Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 600
PoInit-of-View: Poisoning Initialization of Views Transfers Across Multiple 3D Reconstruction Systems
Weijie Wang ⋅ Songlong Xing ⋅ Zhengyu Zhao ⋅ Nicu Sebe ⋅ Bruno Lepri
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 601
No Way To Steal My Face: Proactive Defense Against Identity-Preserving Personalized Generation
Lizhi Xiong ⋅ Jun Li ⋅ Ziqiang Li ⋅ Weiwei Jiang ⋅ Zhangjie Fu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 602
Towards Reliable Evaluation of Adversarial Robustness for Spiking Neural Networks
Jihang Wang ⋅ Dongcheng Zhao ⋅ Ruolin Chen ⋅ Qian Zhang ⋅ Yi Zeng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 603
Where, What, Why: Toward Explainable 3D-GS Watermarking
Mingshu Cai ⋅ Jiajun Li ⋅ Osamu Yoshie ⋅ Yuya Ieiri ⋅ Yixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 604
Robust Spiking Neural Networks by Temporal Mutual Information
Mengting Xu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 605
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
Seungjae Lee ⋅ Yoonkyo Jung ⋅ Inkook Chun ⋅ Yao-Chih Lee ⋅ Zikui Cai ⋅ Hongjia Huang ⋅ Aayush Talreja ⋅ Tan Dao ⋅ Yongyuan Liang ⋅ Jia-Bin Huang ⋅ Furong Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 606
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
Minghui Lin ⋅ Pengxiang Ding ⋅ Shu Wang ⋅ Zifeng Zhuang ⋅ Yang Liu ⋅ Xinyang Tong ⋅ Wenxuan Song ⋅ Shangke Lyu ⋅ Siteng Huang ⋅ Donglin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 607
AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots
Likui Zhang ⋅ Tao Tang ⋅ Zhihao Zhan ⋅ xiuwei chen ⋅ Zisheng Chen ⋅ Jianhua Han ⋅ Jiangtong Zhu ⋅ Pei Xu ⋅ Hang Xu ⋅ Hefeng Wu ⋅ Liang Lin ⋅ Xiaodan Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 608
Obstruction Reasoning for Robotic Grasping
Runyu Jiao ⋅ Matteo Bortolon ⋅ Francesco Giuliari ⋅ Alice Fasoli ⋅ Sergio Povoli ⋅ Guofeng Mei ⋅ Yiming Wang ⋅ Fabio Poiesi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 609
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Wenlong Huang ⋅ Yu-Wei Chao ⋅ Arsalan Mousavian ⋅ Ming-Yu Liu ⋅ Dieter Fox ⋅ Kaichun Mo ⋅ Li Fei-Fei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 610
CycleManip: Enabling Cycle-based Manipulation via Effective History Perception and Understanding
Yi-Lin Wei ⋅ Haoran Liao ⋅ Yuhao Lin ⋅ Pengyue Wang ⋅ Zhizhao Liang ⋅ Guiliang Liu ⋅ Wei-Shi Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 611
SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models
Haowen Liu ⋅ Shaoxiong Yao ⋅ Haonan Chen ⋅ Jiawei Gao ⋅ Jiayuan Mao ⋅ Jia-Bin Huang ⋅ Yilun Du
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 612
Adaptive Action Chunking at Inference-time for Vision-Language-Action Models
Yuanchang Liang ⋅ Xiaobo Wang ⋅ Kai Wang ⋅ Shuo Wang ⋅ Xiaojiang Peng ⋅ Haoyu Chen ⋅ David Kim Huat Chua ⋅ Prahlad Vadakkepat
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 613
Localizing, Structuring, and Rendering: Bridging 3D and 2D Vision-Language-Action Models for Robotic Manipulation
Yunlong Zhao ⋅ Xiaoheng Deng ⋅ Yichao Cao ⋅ Yi Chen ⋅ Xiangjian He ⋅ Shan You ⋅ Shuo Yang ⋅ Lei Fan ⋅ Fei Wang ⋅ Xiu Su
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 614
NIL: No-data Imitation Learning
Mert Albaba ⋅ Chenhao Li ⋅ Markos Diomataris ⋅ Omid Taheri ⋅ Andreas Krause ⋅ Michael J. Black
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 615
Humanoid Generative Pre-Training for Zero-Shot Motion Tracking
Zekun Qi ⋅ Xuchuan Chen ⋅ Jilong Wang ⋅ Chenghuai Lin ⋅ Yunrui Lian ⋅ Wenyao Zhang ⋅ XinQiang Yu ⋅ He Wang ⋅ Li Yi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 616
EnergyAction: Unimanual to Bimanual Composition with Energy-Based Models
Mingchen Song ⋅ Xiang Deng ⋅ Jie Wei ⋅ Dongmei Jiang ⋅ Liqiang Nie ⋅ Weili Guan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 617
CUBic: Coordinated Unified Bimanual Perception and Control Framework
Xingyu Wang ⋅ Pengxiang Ding ⋅ Jingkai Xu ⋅ Donglin Wang ⋅ Zhaoxin Fan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 618
RehearseVLA: Simulated Post-Training for VLAs with Physically-Consistent World Model
Junjin Xiao ⋅ Yandan Yang ⋅ Xinyuan Chang ⋅ Ronghan Chen ⋅ Feng Xiong ⋅ Mu Xu ⋅ Wei-Shi Zheng ⋅ Qing Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 619
GraspGen-X: Cross-Embodiment 6-DOF Diffusion-based Grasping
Beining Han ⋅ Yu-Wei Chao ⋅ Erwin Coumans ⋅ Clemens Eppner ⋅ Jia Deng ⋅ Stan Birchfield ⋅ Adithya Murali
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 620
UETrack: A Unified and Efficient Framework for Single Object Tracking
Ben Kang ⋅ Jie Zhao ⋅ Xin Chen ⋅ Wanting Geng ⋅ Bin Zhang ⋅ Lu Zhang ⋅ Dong Wang ⋅ Huchuan Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 621
ProgTrack: A Multi-Object Tracking Algorithm with Progressive Matching Strategy
Chenhui Zhang ⋅ Guoqing Dong ⋅ Weijie Peng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 622
Efficient Video Object Segmentation and Tracking with Recurrent Dynamic Submodel
Weidong Tang ⋅ Zhiyuan Liang ⋅ Xinyan Wan ⋅ Chen Zhu ⋅ Zhaopan Xu ⋅ Pengfei Zhou ⋅ Yan Song ⋅ Yang You ⋅ Wangbo Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 623
Learning to Track Instance from Single Nature Language Description
Yaozong Zheng ⋅ Bineng Zhong ⋅ Qihua Liang ⋅ Shuimu Zeng ⋅ Haiying Xia ⋅ Shuxiang Song
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 624
MV-TAP: Tracking Any Point in Multi-View Videos
Jahyeok Koo ⋅ Inès Hyeonsu Kim ⋅ Mungyeom Kim ⋅ Junghyun Park ⋅ Seohyeon Park ⋅ Jaeyeong Kim ⋅ Jung Yi ⋅ Seokju Cho ⋅ Seungryong Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 625
Adaptive Depth Lightweight RGB-T Tracking with Holistic Token Routing
Tian Ding ⋅ Hongtao Yang ⋅ Liangtao Shi ⋅ Jun Li ⋅ Xiantao Hu ⋅ Jian Yang ⋅ Ying Tai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 626
Content-Adaptive Hierarchical Hyperprior for Neural Video Coding
Junqi Liao ⋅ Yaojun Wu ⋅ Chaoyi Lin ⋅ Zhipin Deng ⋅ Li Li ⋅ Dong Liu ⋅ Xiaoyan Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 627
UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking
Hao Wu ⋅ Xudong Wang ⋅ Jialiang Zhang ⋅ Junlong Tong ⋅ Xinghao Chen ⋅ Junyan Lin ⋅ Yunpu Ma ⋅ Xiaoyu Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 628
Similarity-as-Evidence: Calibrating Overconfident VLMs for Interpretable and Label-Efficient Medical Active Learning
Zhuofan Xie ⋅ Zishan Lin ⋅ Jinliang Lin ⋅ Jie Qi ⋅ Shaohua Hong ⋅ Shuo Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 629
From Infusion to Assimilation Distillation for Medical Image Segmentation
Jiankang Hong ⋅ Ye Luo ⋅ Yinan Liu ⋅ Junsong Yuan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 630
IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation
Yankai Jiang ⋅ Qiaoru Li ⋅ BinLu Xu ⋅ Haoran Sun ⋅ Chao Ding ⋅ Junting Dong ⋅ Yuxiang Cai ⋅ Xuhong Zhang ⋅ Jianwei Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 631
Unlocking Positive Transfer in Incrementally Learning Surgical Instruments: A Self-reflection Hierarchical Prompt Framework
Yu ZHU ⋅ Kang LI ⋅ Zheng Li ⋅ Pheng-Ann Heng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 632
Keep It Frozen: Domain-Routed Conditional Residual Modulation for Multi-Domain Vision Transformers
Ufaq Khan ⋅ Umair Nawaz ⋅ Massimo Caputo ⋅ Muhammad Bilal ⋅ Junaid Qadir ⋅ Muhammad Haris Khan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 633
Virtual Full-stack Scanning of Brain MRI via Imputing Any Quantised Code
Yicheng Wu ⋅ Tao Song ⋅ Zhonghua Wu ⋅ Jin Ye ⋅ Zongyuan Ge ⋅ Wenjia Bai ⋅ Zhaolin Chen ⋅ Jianfei Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 634
MedLoc-R1: Performance-Aware Curriculum Reward Scheduling for GRPO-Based Medical Visual Grounding
Yang Guangjing ⋅ Ziyuan Qin ⋅ Chaoran Zhang ⋅ Chenlin Du ⋅ Jinglin Wang ⋅ Wanran Sun ⋅ Zhenyu Zhang ⋅ Bing Ji ⋅ Qicheng Lao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 635
Turning Pre-Trained Vision Transformers into End-to-End Histopathology Whole Slide Image Models for Survival Prediction
Jiawen Li ⋅ Jiali Hu ⋅ Xitong Ling ⋅ Renao Yan ⋅ Yuxuan Chen ⋅ Tian Guan ⋅ Yonghong He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 636
A Supervised Multi-task Framework for Joint cryo-ET Restoration Enabled by Generative Physical Simulation
Xinsheng Wang ⋅ Zhidong Yang ⋅ Xiaohua Wan ⋅ Renmin Han ⋅ Shuai Tang ⋅ Hao Dong ⋅ Fa Zhang ⋅ Bin Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 637
KAMP: Knowledge-Anchored Multimodal Pretraining Framework for Medical Image Representation
Feiyu Huang ⋅ Jia Li ⋅ Zhao CHEN ⋅ Yang WU ⋅ Caleb Chen Cao ⋅ Lei Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 638
CARE: A Molecular-Guided Foundation Model with Adaptive Region Modeling for Whole Slide Image Analysis
Di Zhang ⋅ Zhangpeng Gong ⋅ Xiaobo Pang ⋅ Jiashuai Liu ⋅ Junbo Lu ⋅ Hao Cui ⋅ Jiusong Ge ⋅ Zhi Zeng ⋅ Kai Yi ⋅ Yinghua Li ⋅ Si Liu ⋅ Tingsong Yu ⋅ Haoran Wang ⋅ Mireia Crispin-Ortuzar ⋅ Weimiao Yu ⋅ Chen Li ⋅ Zeyu Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 639
Contrastive Cross-Bag Augmentation for Multiple Instance Learning-based Whole Slide Image Classification
Bo Zhang ⋅ Xu Xinan ⋅ Shuo Yan ⋅ Yu Bai ⋅ Zheng Zhang ⋅ Wufan Wang ⋅ Hui Gao ⋅ Wendong Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 640
OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging
meilin liu ⋅ Jiaying Wang ⋅ Jing Shan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 641
Learning complete and explainable visual representations from itemized text supervision
Yiwei Lyu ⋅ Chenhui Zhao ⋅ Soumyanil Banerjee ⋅ Shixuan Liu ⋅ Akshay Rao ⋅ Akhil Kondepudi ⋅ Honglak Lee ⋅ Todd C. Hollon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 642
EgoPoseFormer v2: Accurate Egocentric Human Motion Estimation for AR/VR
Zhenyu Li ⋅ Sai Kumar Dwivedi ⋅ Filip Maric ⋅ Carlos Chacón ⋅ Nadine Bertsch ⋅ Filippo Arcadu ⋅ Tomas Hodan ⋅ Michael Ramamonjisoa ⋅ Peter Wonka ⋅ Amy Zhao ⋅ Robin Kips ⋅ Cem Keskin ⋅ Anastasia Tkach ⋅ Chenhongyi Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 643
MetricHMSR: Metric Human Mesh and Scene Recovery from Monocular Images
Chentao Song ⋅ He Zhang ⋅ Yuan Haolei ⋅ Haozhe Lin ⋅ Jianhua Tao ⋅ Hongwen Zhang ⋅ Tao Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 644
Differentially Private 2D Human Pose Estimation
Kaushik Bhargav Sivangi ⋅ Paul Henderson ⋅ Fani Deligianni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 645
TROPHIES: Temporal Reconstruction of Places, Humans, and Cameras from Multi-view Videos
Jinpeng Liu ⋅ Yukang Xu ⋅ Yutong Li ⋅ Xingyu Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 646
PoseD-Flow: Versatile and Guided Flow Matching Model of Human Pose
Jebastin Nadar ⋅ Simone Foti ⋅ Tolga Birdal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 647
SIMSPINE: A Biomechanics-Aware Simulation Framework for 3D Spine Motion Annotation and Benchmarking
Muhammad Saif Ullah Khan ⋅ Didier Stricker
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 648
HUMAPS-4D: A Multimodal Dataset for HUman Motion Analysis with Physiological and Semantic informations
Matthieu Dabrowski ⋅ Ouala Ben Jemaa ⋅ Benjamin Allaert
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 649
PHASE-Net: Physics-Grounded Harmonic Attention System for Efficient Remote Photoplethysmography Measurement
bo zhao ⋅ Dan Guo ⋅ Junzhe Cao ⋅ Yong Xu ⋅ Bochao Zou ⋅ Tao Tan ⋅ Yue Sun ⋅ Zitong YU
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 650
LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
Nan Yang ⋅ Julian Straub ⋅ Fan Zhang ⋅ Richard Newcombe ⋅ Jakob Engel ⋅ Lingni Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 651
Expanding mmWave Datasets for Human Pose Estimation with Unlabeled Data and LiDAR Datasets
Zhuoxuan Peng ⋅ Boan Zhu ⋅ Xingjian Zhang ⋅ Wenying Li ⋅ Gary Chan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 652
Towards Balanced Multi-Modal Learning in 3D Human Pose Estimation
Mengshi Qi ⋅ Jiaxuan Peng ⋅ Xianlin Zhang ⋅ Huadong Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 653
OMGTex: One-stage Multi-style Facial Texture Reconstruction without Geometry Guidance
Xiao Zitong ⋅ Yuda Qiu ⋅ Zisheng Ye ⋅ Xiaoguang Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 654
Human Interaction-Aware 3D Reconstruction from a Single Image
Gwanghyun Kim ⋅ Junghun James Kim ⋅ Suh Yoon Jeon ⋅ Jason Park ⋅ Se Young Chun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 655
Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning
Yiheng Li ⋅ Zichang Tan ⋅ Guoqing Xu ⋅ Zhen Lei ⋅ Xu Zhou ⋅ Yang Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 656
SAGA: Source Attribution of Generative AI Videos
Rohit Kundu ⋅ Vishal Mohanty ⋅ Hao Xiong ⋅ Shan Jia ⋅ Athula Balachandran ⋅ Amit K. Roy-Chowdhury
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 657
VMD-FACT: A New Video Dataset and MLLM-based method for Detecting Realistic AI-Generated Video Misinformation
Yongkang Zhang ⋅ Dongyu She ⋅ Baiyu Ji ⋅ Qichuan Geng ⋅ Zhong Zhou ⋅ Yan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 658
ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation
Qing Huang ⋅ Zhipei Xu ⋅ Xuanyu Zhang ⋅ Xiangyu Yu ⋅ Jian Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 659
A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World
Jikang Cheng ⋅ Renye Yan ⋅ Zhiyuan Yan ⋅ Yaozhong Gan ⋅ Xueyi Zhang ⋅ Wei Peng ⋅ Zhongyuan Wang ⋅ Ling Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 660
PPM-CLIP: Probabilistic Prompt Modeling for Generalizable AI-Generated Image Detection
WANG XINYUAN ⋅ Yingxin Lai ⋅ Zhiming Luo ⋅ Zhihui Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 661
Learning from Noisy Supervision: A Denoising–Debiasing Framework for Weakly Supervised Video Anomaly Detection
Yaxin Zhao ⋅ Yang Wang ⋅ Wenya Guo ⋅ Sihan Xu ⋅ Xiangrui Cai ⋅ Xi Lin ⋅ Ying Zhang ⋅ Xiaojie Yuan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 662
Anomaly as Non-Conformity via Training-Free Graph Laplacian Energy Minimization
Jungwook Seo ⋅ Minjeong Kim ⋅ Younkwan Lee ⋅ Seungho Shin ⋅ Sungyong Baik
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 663
VisualAD: Language-Free Zero-Shot Anomaly Detection via Vision Transformer
Yanning Hou ⋅ Peiyuan Li ⋅ Zirui Liu ⋅ Yitong Wang ⋅ Yanran Ruan ⋅ Jianfeng Qiu ⋅ Ke Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 664
CHAL: Causal-guided Hierarchical Anomaly-aware Learning for Moving Infrared Small Target Detection
Weiwei Duan ⋅ Luping Ji ⋅ Shipeng Lei ⋅ Sicheng Zhu ⋅ Jianghong Huang ⋅ Mao Ye
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 665
RAID: Retrieval-Augmented Anomaly Detection
Mingxiu Cai ⋅ Zhe Zhang ⋅ Gaochang Wu ⋅ Tianyou Chai ⋅ Xiatian Zhu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 666
ADSeeker: A Knowledge-Grounded Reasoning Framework for Industry Anomaly Detection and Reasoning
Kai Zhang ⋅ Zekai Zhang ⋅ Xihe Sun ⋅ Anpeng Wang ⋅ Jingmeng Nie ⋅ Qinghui Chen ⋅ Han Hao ⋅ Jianyuan Guo ⋅ jinglin zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 667
InvAD: Inversion-based Reconstruction-Free Anomaly Detection with Diffusion Models
Shunsuke Sakai ⋅ Xiangteng He ⋅ Chunzhi Gu ⋅ Leonid Sigal ⋅ Tatsuhito Hasegawa
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 668
QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy
Adam Lilja ⋅ Ji Lan ⋅ Junsheng Fu ⋅ Lars Hammarstrand
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 669
GSV2X: Geometry-Aware Uncertainty Modeling and Orthogonal Fusion for Robust Roadside Perception
jianqiang xu ⋅ Gensheng Pei ⋅ 刘华峰 Liu ⋅ Yazhou Yao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 670
Grounded Latents for Entity-Centric 4D Scene Generation
Jinhyung Park ⋅ Navyata Sanghvi ⋅ Erica Weng ⋅ Shawn Hunt ⋅ Shinya Tanaka ⋅ Hironobu Fujiyoshi ⋅ Kris Kitani