Skip to yearly menu bar Skip to main content


(670 events)   Timezone:  
Show all
The 2026 schedule is still incomplete
Toggle Poster Visibility
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 1
Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding
Yue Li ⋅ Qi Ma ⋅ Runyi Yang ⋅ Mengjiao Ma ⋅ Bin Ren ⋅ Nikola Popovic ⋅ Nicu Sebe ⋅ Theo Gevers ⋅ Luc Van Gool ⋅ Danda Paudel ⋅ Martin R. Oswald
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 2
Featurising Pixels from Dynamic 3D Scenes with Linear In-Context Learners
Nikita Araslanov ⋅ Martin Sundermeyer ⋅ Hidenobu Matsuki ⋅ David Joseph Tan ⋅ Federico Tombari
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 3
From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection
yepeng liu ⋅ Hao Li ⋅ Liwen Yang ⋅ Fangzhen Li ⋅ Xudi Ge ⋅ Yuliang Gu ⋅ kuang Gao ⋅ Bing Wang ⋅ Guang Chen ⋅ Hangjun Ye ⋅ Yongchao Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 4
Linear Fundamental Matrix Estimation from 7 or 5 Points
Taci Ata Kucukpinar ⋅ Juan Mogollon ⋅ Joshua Fraser ⋅ Timothy Duff ⋅ Kannappan Palaniappan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 5
OccuFly: A 3D Vision Benchmark for Semantic Scene Completion from the Aerial Perspective
Markus Gross ⋅ Sai B. Matha ⋅ Aya Fahmy ⋅ Rui Song ⋅ Daniel Cremers ⋅ Henri Meeß
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 6
VGGT-Ω
Jianyuan Wang ⋅ Minghao Chen ⋅ Shangzhan Zhang ⋅ Nikita Karaev ⋅ Johannes Schönberger ⋅ Patrick Labatut ⋅ Piotr Bojanowski ⋅ David Novotny ⋅ Andrea Vedaldi ⋅ Christian Rupprecht
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 7
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization
Xinhai Hou ⋅ Shaoyuan Xu ⋅ Manan Biyani ⋅ Moyan Li ⋅ Jia Liu ⋅ Todd C. Hollon ⋅ Bryan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 8
NitroGen: An Open Foundation Model for Generalist Gaming Agents
Loïc Magne ⋅ Anas Awadalla ⋅ Guanzhi Wang ⋅ Yinzhen Xu ⋅ Joshua Belofsky ⋅ Fengyuan Hu ⋅ Joohwan Kim ⋅ Ludwig Schmidt ⋅ Georgia Gkioxari ⋅ Jan Kautz ⋅ Yisong Yue ⋅ Yejin Choi ⋅ Yuke Zhu ⋅ Jim Fan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 9
PAI-Bench: A Comprehensive Benchmark For Physical AI
Fengzhe Zhou ⋅ Jiannan Huang ⋅ Jialuo Li ⋅ Deva Ramanan ⋅ Humphrey Shi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 10
RefAV: Towards Planning-Centric Scenario Mining
Cainan Davidson ⋅ Deva Ramanan ⋅ Neehar Peri
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 11
SoccerMaster: A Vision Foundation Model for Soccer Understanding
Haolin Yang ⋅ Jiayuan Rao ⋅ Haoning Wu ⋅ Weidi Xie
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 12
VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments
Zelai Xu ⋅ Zhexuan Xu ⋅ Xiangmin Yi ⋅ Huining Yuan ⋅ Mo Guang ⋅ Kaiwen Long ⋅ Xinlei Chen ⋅ Yi Wu ⋅ Chao Yu ⋅ Yu Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 13
Breaking the Scalability Limit of Multi-Projector Calibration with Embedded Cameras
Takumi Kawano ⋅ Kohei Miura ⋅ Daisuke Iwai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 14
GaussianFluent: Gaussian Simulation for Dynamic Scenes with Mixed Materials
Bei Huang ⋅ Yixin Chen ⋅ Ruijie Lu ⋅ Gang Zeng ⋅ Hongbin Zha ⋅ Yuru Pei ⋅ Siyuan Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 15
InfiniBench: Infinite Benchmarking for Visual Spatial Reasoning with Customizable Scene Complexity
Haoming Wang ⋅ Qiyao Xue ⋅ Wei Gao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 16
MAGICIAN: Efficient Long-Term Planning with Imagined Gaussians for Active Mapping
Shiyao Li ⋅ Antoine Guédon ⋅ Shizhe Chen ⋅ Vincent Lepetit
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 17
Memory-Augmented Scene Understanding and Exploration for Open-World Aerial Object-Goal Navigation
Jiacong Zhou ⋅ Jiaxu Miao ⋅ Yourun Lin ⋅ Xianyun Wang ⋅ Jun Xiao ⋅ Jun Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 18
Monocular Open Vocabulary Occupancy Prediction for Indoor Scenes
Changqing Zhou ⋅ Yueru Luo ⋅ Han Zhang ⋅ Zeyu Jiang ⋅ Changhao Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 19
INSID3: Training-Free In-Context Segmentation with DINOv3
Claudia Cuttano ⋅ Gabriele Trivigno ⋅ Christoph Reich ⋅ Daniel Cremers ⋅ Carlo Masone ⋅ Stefan Roth
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 20
MARCO: Navigating the Unseen Space of Semantic Correspondence
Claudia Cuttano ⋅ Gabriele Trivigno ⋅ Carlo Masone ⋅ Stefan Roth
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 21
PR-MaGIC: Prompt Refinement Via Mask Decoder Gradient Flow For In-Context Segmentation
Minjae Lee ⋅ Sungwoo Hur ⋅ Soojin Hwang ⋅ Won Hwa Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 22
R^2-Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection
Shuaike Shen ⋅ Ke Liu ⋅ Jiaqing Xie ⋅ Shangde Gao ⋅ Chunhua Shen ⋅ Ge Liu ⋅ Mireia Crispin-Ortuzar ⋅ Shangqi Gao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 23
The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification
Dante Wasmuht ⋅ Otto Brookes ⋅ Maximilian Schall ⋅ Pablo Palencia ⋅ Christopher Beirne ⋅ Tilo Burghardt ⋅ Majid Mirmehdi ⋅ Hjalmar Kühl ⋅ Mimi Arandjelovic ⋅ Sam Pottie ⋅ Peter Bermant ⋅ Brandon Asheim ⋅ Yi Jin Toh ⋅ Adam Elzinga ⋅ Jason Allan Holmberg ⋅ Andrew Whitworth ⋅ Eleanor Flatt ⋅ Laura Gustafson ⋅ Chaitanya Ryali ⋅ Yuan-Ting Hu ⋅ Baishan Guo ⋅ Andrew Westbury ⋅ Kate Saenko ⋅ Dídac Surís
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 24
VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation
Yulu Gao ⋅ Bohao Zhang ⋅ Zongheng Tang ⋅ Jitong Liao ⋅ wenjun wu ⋅ Si Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 25
DAGE: Dual-Stream Architecture for Efficient and Fine-Grained Geometry Estimation
Tuan Duc Ngo ⋅ Gabriel Huang ⋅ Seoung Wug Oh ⋅ Kevin Blackburn-Matzen ⋅ Evangelos Kalogerakis ⋅ Chuang Gan ⋅ Joon-Young Lee
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 26
Wave-Former: Through-Occlusion 3D Reconstruction via Wireless Shape Completion
Laura Dodds ⋅ Maisy Lam ⋅ Waleed Akbar ⋅ Yibo Cheng ⋅ Fadel Adib
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 27
Lite Any Stereo: Efficient Zero-Shot Stereo Matching
Junpeng Jing ⋅ Weixun Luo ⋅ Ye Mao ⋅ Krystian Mikolajczyk
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 28
MuM: Multi-View Masked Image Modeling for 3D Vision
David Nordström ⋅ Johan Edstedt ⋅ Fredrik Kahl ⋅ Georg Bökman
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 29
ZipMap: Linear-Time Stateful 3D Reconstruction via Test-Time Training
Haian Jin ⋅ Rundi Wu ⋅ Tianyuan Zhang ⋅ Ruiqi Gao ⋅ Jonathan T. Barron ⋅ Noah Snavely ⋅ Aleksander Holynski
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 30
Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction
Tao Xie ⋅ Peishan Yang ⋅ Yudong Jin ⋅ Yingfeng Cai ⋅ Wei Yin ⋅ Weiqiang Ren ⋅ Qian Zhang ⋅ Wei Hua ⋅ Sida Peng ⋅ Xiaoyang Guo ⋅ Xiaowei Zhou
[ Slides
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 31
LaRP: Efficient Multi-View Inpainting with Latent Reprojection Priors
Gaoyang Zhang ⋅ Xinguo Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 32
TopoMA: Topology-Guided Multi-Agent Dense RGB 3D Reconstruction via Distributed Inference
Xuanxuan Zhang ⋅ ShuHui Shi ⋅ Tianxiang Zhang ⋅ Zhetao Guo ⋅ Zixuan Huang ⋅ You Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 33
Sparse–View Localization via Online Neural 3D Regression
Ludvig Dillén ⋅ Magnus Oskarsson ⋅ Viktor Larsson
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 34
Dynamic Visual SLAM using a General 3D Prior
Xingguang Zhong ⋅ Liren Jin ⋅ Marija Popovic ⋅ Jens Behley ⋅ Cyrill Stachniss
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 35
Learning Scene Coordinate Reconstruction from Unposed Images via Pose Graph Optimization
Tze Ho Elden Tse ⋅ Jizong Peng ⋅ Angela Yao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 36
FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention
Zipeng Wang ⋅ Dan Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 37
No Calibration, No Depth, No Problem: Cross-Sensor View Synthesis with 3D Consistency
Cho-Ying Wu ⋅ Zixun Huang ⋅ Xinyu Huang ⋅ Liu Ren
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 38
UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling
Kaiyuan Tan ⋅ Yingying Shen ⋅ Ziyue Zhu ⋅ Mingfei Tu ⋅ HAOHUI ZHU ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Guang Chen ⋅ Hangjun Ye
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 39
Reliev3R: Relieving Feed-forward 3D Reconstruction from Multi-View Geometric Annotations
Youyu Chen ⋅ Junjun Jiang ⋅ Yueru Luo ⋅ Kui Jiang ⋅ Xianming Liu ⋅ Xu Yan ⋅ Dave Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 40
TALO: Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction
Fengyi Zhang ⋅ Tianjun Zhang ⋅ Kasra Khosoussi ⋅ Zheng Zhang ⋅ Zi Huang ⋅ Yadan Luo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 41
Global Structure-from-Motion Meets Feedforward Reconstruction
Linfei Pan ⋅ Johannes Schönberger ⋅ Marc Pollefeys
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 42
POCA: Pareto-Optimal Curriculum Alignment for Visual Text Generation
Yaohou Fan ⋅ Qingzhong Wang ⋅ Yongsong Huang ⋅ Junyi Liu ⋅ Tomo Miyazaki ⋅ Shinichiro Omachi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 43
DuoGen: Towards Autonomous Interleaved Multimodal Generation
Min Shi ⋅ Xiaohui Zeng ⋅ Jiannan Huang ⋅ Yin Cui ⋅ Francesco Ferroni ⋅ Jialuo Li ⋅ Max Li ⋅ Yogesh Balaji ⋅ Haoxiang Wang ⋅ Tsung-Yi Lin ⋅ Xiao Fu ⋅ Yue Zhao ⋅ Chieh-Yun Chen ⋅ Ming-Yu Liu ⋅ Humphrey Shi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 44
Vibe Spaces for Creatively Connecting and Expressing Visual Concepts
Huzheng Yang ⋅ Katherine Xu ⋅ Andrew Lu ⋅ Michael D. Grossberg ⋅ Yutong Bai ⋅ Jianbo Shi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 45
StoryTailor:A Zero-Shot Pipeline for Action-Rich Multi-Subject Visual Narratives
Jinghao Hu ⋅ Yuhe Zhang ⋅ GuoHua Geng ⋅ Kang Li ⋅ Han Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 46
CREward: A Type-Specific Creativity Reward Model
Jiyeon Han ⋅ Ali Mahdavi Amiri ⋅ Hao Zhang ⋅ Haedong Jeong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 47
LumiX: Structured and Coherent Text-to-Intrinsic Generation
Xu Han ⋅ Biao Zhang ⋅ Xiangjun Tang ⋅ Xianzhi Li ⋅ Peter Wonka
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 48
Synthetic Curriculum Reinforces Compositional Text-to-Image Generation
Shijian Wang ⋅ Runhao Fu ⋅ Siyi Zhao ⋅ Qingqin Zhan ⋅ Xingjian Wang ⋅ Jiarui Jin ⋅ Yuan Lu ⋅ Hanqian Wu ⋅ Cunjian Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 49
OmniGen2: Towards Instruction-Aligned Multimodal Generation
Chenyuan Wu ⋅ Jiahao Wang ⋅ PengFei Zheng ⋅ Ruiran Yan ⋅ Shitao Xiao ⋅ Xin Luo ⋅ Yueze Wang ⋅ Wanli Li ⋅ Xiyan Jiang ⋅ Yexin Liu ⋅ Junjie Zhou ⋅ Ziyi Xia ⋅ Ze Liu ⋅ Chaofan Li ⋅ Haoge Deng ⋅ Kun Luo ⋅ Bo Zhang ⋅ Jiajun Zhang ⋅ Dong Liu ⋅ Defu Lian ⋅ Xinlong Wang ⋅ Zhongyuan Wang ⋅ Tiejun Huang ⋅ Zheng Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 50
Selectively Extracting and Injecting Visual Attributes into Text-to-Image Models
Seunghwan Choi ⋅ Jooyeol Yun ⋅ Youngdo Lee ⋅ Jaegul Choo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 51
LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models
Yiming Hao ⋅ Mutian Xu ⋅ Chongjie Ye ⋅ Jie Qin ⋅ Shunlin Lu ⋅ Yipeng Qin ⋅ Xiaoguang Han
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 52
UniVerse: Empower Unified Generation with Reasoning and Knowledge
Kaiyue Sun ⋅ Weiyang Jin ⋅ Chengqi Duan ⋅ Rongyao Fang ⋅ Xian Liu ⋅ Yuwei Niu ⋅ Chunwei Wang ⋅ Aoxue Li ⋅ Xihui Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 53
UniVerse: A Unified Modulation Framework for Segmentation-Free, Disentangled Multi-Concept Personalization
Quynh Phung ⋅ Sandesh Ghimire ⋅ Minsi Hu ⋅ Charles Tsai ⋅ Jia-Bin Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 54
Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering
Dongxing Mao ⋅ Jinpeng Wang ⋅ Jiahao Tang ⋅ Kevin Qinghong Lin ⋅ Linjie Li ⋅ Zhengyuan Yang ⋅ Lijuan Wang ⋅ Min Li ⋅ Jingru Tan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 55
TGT: Text-Grounded Trajectories for Locally Controlled Video Generation
Guofeng Zhang ⋅ Angtian Wang ⋅ Jacob Fang Fang ⋅ Liming Jiang ⋅ Haotian Yang ⋅ Bo Liu ⋅ Yiding Yang ⋅ Guang Chen ⋅ Longyin Wen ⋅ Alan L. Yuille ⋅ Chongyang Ma
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 56
RAISE: Requirement-Adaptive Evolutionary Refinement for Training-Free Text-to-Image Alignment
Liyao Jiang ⋅ Ruichen Chen ⋅ Chao Gao ⋅ Di Niu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 57
FlowFixer: Towards Detail-Preserving Subject-Driven Generation
Jinyoung Jun ⋅ Wondong Jang ⋅ Wenbin Ouyang ⋅ Raghudeep Gadde ⋅ Jungbeom Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 58
TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering
Hanshen Zhu ⋅ Yuliang Liu ⋅ Xuecheng Wu ⋅ An-Lan Wang ⋅ Chao Feng ⋅ Dingkang Yang ⋅ ChaoFeng ChaoFeng ⋅ Can Huang ⋅ Jingqun Tang ⋅ Xiang Bai
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 59
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
Tian Ye ⋅ Song Fei ⋅ Lei Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 60
FEAT: Fashion Editing and Try-On from Any Design
Soye Kwon ⋅ Keonyoung Lee ⋅ Dahuin Jung ⋅ Jaekoo Lee
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 61
Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation
Subin Kim ⋅ Sangwoo Mo ⋅ Mamshad Nayeem Rizve ⋅ Yiran Xu ⋅ Difan Liu ⋅ Jinwoo Shin ⋅ Tobias Hinz
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 62
PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models
Yuanhao Su ⋅ Shaofeng Zhang ⋅ Xiaosong Jia ⋅ Qi Fan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 63
PowerCLIP: Powerset Alignment for Contrastive Pre-Training
Masaki Kawamura ⋅ Nakamasa Inoue ⋅ Rintaro Yanagi ⋅ Hirokatsu Kataoka ⋅ Rio Yokota
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 64
MoBind: Motion Binding for Fine-Grained IMU–Video Pose Alignment
Duy Nguyen ⋅ Tat-Jun Chin ⋅ Minh Nguyen Nguyen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 65
The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models
Shivang Chopra ⋅ Shaunak Halbe ⋅ Chengyue Huang ⋅ Brisa Maneechotesuwan ⋅ Zsolt Kira
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 66
Tackling Model Bias via Game-theoretic Multi-agent Collaboration Framework for Hateful Meme Classification
Yiwei Wei ⋅ Zhengliang Guo ⋅ Shaozu Yuan ⋅ Chengyin Hu ⋅ Zhiyang Jia ⋅ Jiujiang Guo ⋅ Meng Chen ⋅ Peiying Wang ⋅ Longbiao Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 67
CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning
Zhijiang Tang ⋅ Linhua Wang ⋅ JIAXIN QI ⋅ Weihao Jiang ⋅ Peng Hou ⋅ Anxiang Zeng ⋅ Jianqiang Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 68
MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction
Zitian Tang ⋅ Xu Zhang ⋅ Jianbo Yuan ⋅ Yang Zou ⋅ Varad Gunjal ⋅ Songyao Jiang ⋅ Davide Modolo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 69
Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models
Jiadong Pan ⋅ Liang Li ⋅ Yuxin Peng ⋅ Yu-Ming Tang ⋅ Shuohuan Wang ⋅ Yu Sun ⋅ Hua Wu ⋅ Qingming Huang ⋅ Haifeng Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 70
Hierarchical Process Reward Models are Symbolic Vision Learners
Shan Zhang ⋅ Aotian Chen ⋅ Kai Zou ⋅ Jindong Gu ⋅ Yuan Xue ⋅ Anton van den Hengel
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 71
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
Shengyuan Ding ⋅ Xinyu Fang ⋅ Ziyu Liu ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Xiangyu Zhao ⋅ Haodong Duan ⋅ Xiaoyi Dong ⋅ Jianze Liang ⋅ Bin Wang ⋅ Conghui He ⋅ Dahua Lin ⋅ Jiaqi Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 72
SG-LoRA: Semantic-guided LoRA Parameters Generation
Miaoge Li ⋅ Yang Chen ⋅ Zhijie Rao ⋅ Can Jiang ⋅ Kang Wei ⋅ Jingcai Guo
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 73
AcTTA: Rethinking Test-Time Adaptation via Dynamic Activation
Hyeongyu Kim ⋅ GeonHui Han ⋅ Dosik Hwang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 74
Reframing Long-Tailed Learning via Loss Landscape Geometry
shenghan chen ⋅ Yiming Liu ⋅ Yanzhen Wang ⋅ Yujia Wang ⋅ Xiankai Lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 75
Cleaning the Pool: Progressive Filtering of Unlabeled Pools in Deep Active Learning
Denis Huseljic ⋅ Marek Herde ⋅ Lukas Rauch ⋅ Paul Hahn ⋅ Bernhard Sick
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 76
DC-Merge: Improving Model Merging with Directional Consistency
Han-Chen Zhang ⋅ Zi-Hao Zhou ⋅ Mao-Lin Luo ⋅ Shimin Di ⋅ Min-Ling Zhang ⋅ Tong Wei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 77
TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery
Yanan Wu ⋅ Yuhan Yan ⋅ Tailai Chen ⋅ Zhixiang Chi ⋅ ZiZhang Wu ⋅ Yi Jin ⋅ Yang Wang ⋅ Zhenbo Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 78
Event-Illumination Collaborative Low-light Image Enhancement with a High-resolution Real-world Dataset
Senyan Xu ⋅ Zhijing Sun ⋅ Kean Liu ⋅ Xin Lu ⋅ Ruixuan Jiang ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 79
NEC-Diff: Noise-Robust Event-RAW Complementary Diffusion for Seeing Motion in Extreme Darkness
Haoyue Liu ⋅ Jinghan Xu ⋅ Luxin Feng ⋅ Hanyu Zhou ⋅ Haozhi Zhao ⋅ Yi Chang ⋅ Luxin Yan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 80
Towards Persistence: Learning Topological Constraints for Event-based Small Object Detection
Shiman He ⋅ Nuo Chen ⋅ Xinyi Ying ⋅ Yihang Luo ⋅ Yangsi Shi ⋅ Zaiping Lin ⋅ Miao Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 81
Geometric-Photometric Event-based 3D Gaussian Ray Tracing
Kai Kohyama ⋅ Yoshimitsu Aoki ⋅ Guillermo Gallego ⋅ Shintaro Shiba
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 82
EventDrive: Event Cameras for Vision-Language Driving Intelligence
Dongyue Lu ⋅ Rong Li ⋅ Ao Liang ⋅ Lingdong Kong ⋅ Wei Yin ⋅ Lai Xing Ng ⋅ Benoit R. Cottereau ⋅ Camille Simon Chane ⋅ Wei Tsang Ooi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 83
EventGait: Towards Robust Gait Recognition with Event Streams
Senyan Xu ⋅ Shuai Chen ⋅ Chuanfu Shen ⋅ Kean Liu ⋅ Zhijing Sun ⋅ Chengzhi Cao ⋅ Xueyang Fu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 84
MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent
Yuxia Fu ⋅ Zhizhen Zhang ⋅ Yuqi Zhang ⋅ Zijian Wang ⋅ Zi Huang ⋅ Yadan Luo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 85
Resolving the Stability-Plasticity Dilemma in Reinforcement Learning via Complementary Continual Critics
Bo Sun ⋅ Peixi Peng ⋅ Guang Tan ⋅ Haoran Xu ⋅ Yaokun Li ⋅ Yiqian Chang ⋅ Shuaixian Wang ⋅ Luntong Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 86
SAGE: Scalable Agentic 3D Scene Generation for Embodied AI
Hongchi Xia ⋅ Xuan Li ⋅ Max Li ⋅ Qianli Ma ⋅ Jiashu Xu ⋅ Ming-Yu Liu ⋅ Yin Cui ⋅ Tsung-Yi Lin ⋅ Wei-Chiu Ma ⋅ Shenlong Wang ⋅ Shuran Song ⋅ Fangyin Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 87
Semantic Audio-Visual Navigation in Continuous Environments
Yichen Zeng ⋅ Hebaixu Wang ⋅ Meng Liu ⋅ Yu ZHOU ⋅ Chen Gao ⋅ Kehan Chen ⋅ Gongping Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 88
Unifying Perception and Action: A Hybrid-Modality Pipeline with Implicit Visual Chain-of-Thought for Robotic Action Generation
Xiangkai Ma ⋅ Lekai Xing ⋅ Han Zhang ⋅ Wenzhong Li ⋅ Sanglu Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 89
FLARE: A Failure-Aware Framework for Autonomous Correction and Recovery in Visual-Language Robotic Manipulation
Ganlong Zhao ⋅ Zijia Tang ⋅ Xingping Chen ⋅ Zhanghui Kuang ⋅ Ye Tian ⋅ Guanbin Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 90
Learning to Adapt: Self-Improving Web Agent via Cognitive-Aware Exploration
Weile Chen ⋅ Bingchen Miao ⋅ Qifan Yu ⋅ Wendong Bu ⋅ Guoming Wang ⋅ Wenqiao Zhang ⋅ Shengyu Zhang ⋅ Juncheng Li ⋅ Siliang Tang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 91
General Process Reward Modeling for Robotic Reinforcement Learning
Huajie Tan ⋅ Sixiang Chen ⋅ Yijie Xu ⋅ Zixiao Wang ⋅ Cheng Chi ⋅ Yuheng Ji ⋅ Yaoxu Lyu ⋅ Zhongxia Zhao ⋅ Xiansheng Chen ⋅ Peterson Co ⋅ Shaoxuan Xie ⋅ Guocai Yao ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 92
DynBridge: Bridging Imagination and Control through Interaction Dynamics for Robot Manipulation
Alex Wang ⋅ Zhiwei Dong ⋅ Qicheng Bai ⋅ Chenshi Zhang ⋅ Yujie Yi ⋅ Guang Dai ⋅ Yong Liu ⋅ Mengmeng Wang
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 93
Action-Sketcher: From Reasoning to Action via Visual Sketches for Robotic Manipulation
Huajie Tan ⋅ Peterson Co ⋅ Yijie Xu ⋅ Shanyu Rong ⋅ Yuheng Ji ⋅ Cheng Chi ⋅ Xiansheng Chen ⋅ Zhongxia Zhao ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 94
Thinking in 360°: Humanoid Visual Search in the Wild
Heyang Yu ⋅ Yinan Han ⋅ Xiangyu Zhang ⋅ Baiqiao Yin ⋅ Bowen Chang ⋅ Xiangyu Han ⋅ Xinhao Liu ⋅ Jing Zhang ⋅ Marco Pavone ⋅ Chen Feng ⋅ Saining Xie ⋅ Yiming Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 95
Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation
Imanol G. Estepa ⋅ Jesús M Rodríguez-de-Vera ⋅ Bhalaji Nagarajan ⋅ Petia Radeva
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 96
MagicQuill V2: Precise and Interactive Image Editing with Layered Visual Cues
Zichen Liu ⋅ Yue Yu ⋅ Hao Ouyang ⋅ Qiuyu Wang ⋅ Shuailei Ma ⋅ Ka Leong Cheng ⋅ Wen Wang ⋅ Qingyan Bai ⋅ Yuxuan Zhang ⋅ Yanhong Zeng ⋅ Yixuan LI ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Qifeng Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 97
Cycle-Consistent Tuning for Layered Image Decomposition
Zheng Gu ⋅ Min Lu ⋅ Zhida Sun ⋅ Dani Lischinski ⋅ Daniel Cohen-Or ⋅ Hui Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 98
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
Yang Shi ⋅ Yuhao Dong ⋅ Yue Ding ⋅ Yuran Wang ⋅ Xuanyu Zhu ⋅ Sheng Zhou ⋅ Wenting Liu ⋅ Haochen Tian ⋅ rundong wang ⋅ Huanqian Wang ⋅ Zuyan Liu ⋅ Bohan Zeng ⋅ Ruizhe Chen ⋅ Qixun Wang ⋅ Zhuoran Zhang ⋅ Xinlong Chen ⋅ Chengzhuo Tong ⋅ bozhou li ⋅ Qiang Liu ⋅ Haotian Wang ⋅ Wenjing Yang ⋅ Yuanxing Zhang ⋅ Pengfei Wan ⋅ Yi-Fan Zhang ⋅ Ziwei Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 99
Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification
William Yang ⋅ Xindi Wu ⋅ Zhiwei Deng ⋅ Esin Tureci ⋅ Olga Russakovsky
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 100
NEAF: Natural Image Editing with Attention Fusion for Generalizable Test-time Optimization in Text-Guided Image Editing
Jisoo Kim ⋅ Heeseok Oh
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 101
OntoAug: Rethinking Generative Data Augmentation via Ontology Guidance
Shuo Wang ⋅ Zhichuan Wang ⋅ Jun Luo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 102
Spherical Voronoi: Directional Appearance as a Differentiable Partition of the Sphere
Francesco Di Sario ⋅ Daniel Rebain ⋅ Dor Verbin ⋅ Marco Grangetto ⋅ Andrea Tagliasacchi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 103
4DSurf: High-Fidelity Dynamic Scene Surface Reconstruction
Renjie Wu ⋅ Hongdong Li ⋅ Jose M. Alvarez ⋅ Miaomiao Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 104
Learning 3D Representations for Spatial Intelligence from Unposed Multi-View Images
bo zhou ⋅ Qiuxia Lai ⋅ Zeren Sun ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Wenguan Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 105
Depth Peeling for High-Fidelity Gaussian-Enhanced Surfel Rendering
Keyang Ye ⋅ Hongzhi Wu ⋅ Kun Zhou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 106
Intrinsic Image Fusion for Multi-View 3D Material Reconstruction
Peter Kocsis ⋅ Lukas Höllein ⋅ Matthias Nießner
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 107
PackUV: Packed Gaussian UV Maps for 4D Volumetric Video
Aashish Rai ⋅ Angela Xing ⋅ Anushka Agarwal ⋅ Xiaoyan Cong ⋅ Zekun Li ⋅ Tao Lu ⋅ Aayush Prakash ⋅ Srinath Sridhar
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 108
Opti-NeuS: Neural Reconstruction for Dual-Layered Transparent and Opaque Objects
Yi Yang ⋅ Gaoyang Zhang ⋅ Jun Tan ⋅ Xinguo Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 109
PhysGaia: A Physics-aware Benchmark with Multi-Body Interactions for Dynamic Novel View Synthesis
Mijeong Kim ⋅ Gunhee Kim ⋅ Jungyoon Choi ⋅ WonJae Roh ⋅ Bohyung Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 110
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry
Philipp Langsteiner ⋅ Jan-Niklas Dihlmann ⋅ Hendrik Lensch
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 111
OMoBlur: An Object Motion Blur Dataset and Benchmark for Real-World Local Motion Deblurring
Dingchuan Yu ⋅ Jiatong Li ⋅ Jingwen Zhou ⋅ Zhengyue Zhuge ⋅ Yueting Chen ⋅ Qi Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 112
Hybrid Agents for Image Restoration
Bingchen Li ⋅ Xin Li ⋅ Yiting Lu ⋅ Zhibo Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 113
Zero-Shot Image Denoising via Hybrid Prior-Guided Pseudo Sample Generation
Xiaole Zhao ⋅ Qingsong Pang ⋅ Xiaobo Zhang ⋅ Xun Xu ⋅ Xun Gong ⋅ Yan Yang ⋅ Tianrui Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 114
Self-supervised Dynamic Heterogeneous Degradation Modeling for Unified Zero-Shot Image Restoration
Xiaowan Hu ⋅ Jing Yang ⋅ Henan Liu ⋅ HuaQiu Li ⋅ Mai Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 115
Next-Scale Prediction: A Self-Supervised Approach for Real-World Image Denoising
Yiwen Shan ⋅ Haiyu Zhao ⋅ Peng Hu ⋅ Xi Peng ⋅ Yuanbiao Gou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 116
PhaSR: Generalized Image Shadow Removal with Physically Aligned Priors
Chia-Ming Lee ⋅ Yu-Fan Lin ⋅ Yu-Jou Hsiao ⋅ Jin-Hui Jiang ⋅ Yu-Lun Liu ⋅ Chih-Chung Hsu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 117
UARE: A Unified Vision-Language Model for Image Quality Assessment, Restoration, and Enhancement
Weiqi Li ⋅ Xuanyu Zhang ⋅ Bin Chen ⋅ Jingfen Xie ⋅ Yan Wang ⋅ Kexin Zhang ⋅ Junlin Li ⋅ Li zhang ⋅ Jian Zhang ⋅ Shijie Zhao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 118
FastGaMer: Efficient GainMap Learning for Practical Inverse Tone Mapping
YUANSHEN GUAN ⋅ Ruikang Xu ⋅ Chang Chen ⋅ Yinuo Liao ⋅ Dehua Song ⋅ Fenglong Song ⋅ Zhiwei Xiong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 119
MDS-VQA: Model-Informed Data Selection for Video Quality Assessment
Jian Zou ⋅ Xiaoyu Xu ⋅ Zhihua Wang ⋅ Yilin Wang ⋅ Balu Adsumilli ⋅ Kede Ma
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 120
Seeing through Light and Darkness: Sensor-Physics Grounded Deblurring HDR NeRF from Single-Exposure Images and Events
Yunshan Qi ⋅ Lin Zhu ⋅ Nan Bao ⋅ Yifan Zhao ⋅ Jia Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 121
Disentanglement-wise Image Dehazing through Cross-Domain Manifold Consensus
Tianyi Lyu ⋅ Mingye Ju ⋅ Kai-Kuang Ma
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 122
Unsupervised Multi-Scale Segmentation of 3D Subcellular World with Stable Diffusion Foundation Model
Mostofa Uddin Uddin ⋅ HM Shadman Tabib ⋅ Thanh-Huy Nguyen ⋅ Kashish Gandhi ⋅ Min Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 123
EchoPOSE: 6D Pose Estimation of Sparse Echocardiograms for Left-Ventricular 3D Shape Reconstruction
Lucas Iijima ⋅ Yihao Luo ⋅ Dario Sesia ⋅ Amit Kaura ⋅ Jamil Mayet ⋅ Choon Hwai Yap
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 124
Spatial-SAM: Spatially Consistent 3D Electron Microscopy Segmentation with SDF Memory and Semi-Supervised Learning
Yikai Huang ⋅ Renmin Han ⋅ Yuxuan Wang ⋅ Youcheng Cai ⋅ Ligang Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 125
LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding
XUANZHAO DONG ⋅ Wenhui Zhu ⋅ Xiwen Chen ⋅ Zhipeng Wang ⋅ Peijie Qiu ⋅ Shao Tang ⋅ Xin Li ⋅ Yalin Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 126
TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning
Yunbi Liu ⋅ Enqi Tang ⋅ Shiyu Li ⋅ hui shuai ⋅ Lei Ma ⋅ Juncheng Li ⋅ Kuai Yu ⋅ Shu Lou ⋅ Yongchu Pan ⋅ Qingshan Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 127
Harmonized Feature Conditioning and Frequency-Prompt Personalization for Multi-Rater Medical Segmentation
Sanaz Karimijafarbigloo ⋅ Armin Khosravi ⋅ Alireza Kheyrkhah ⋅ Reza Azad ⋅ Mauricio Reyes ⋅ Dorit Merhof
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 128
Masked-Diffusion Autoencoders for 3D Medical Vision Representation Learning
Jiachen Tu ⋅ Guanghui Qin ⋅ Theodore Zhengde Zhao ⋅ Jeya Maria Jose Valanarasu ⋅ Sheng Zhang ⋅ Tristan Naumann ⋅ Fan Lam ⋅ Sheng Wang ⋅ Hoifung Poon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 129
PGR-Net: Prior-Guided ROI Reasoning Network for Brain Tumor MRI Segmentation
Jiacheng Lu ⋅ Hui Ding ⋅ Shiyu Zhang ⋅ Guoping Huo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 130
Test-Time Attention Purification for Backdoored Large Vision Language Models
Zhifang Zhang ⋅ Yang Bojun ⋅ Shuo He ⋅ Weitong Chen ⋅ Wei Emma Zhang ⋅ Olaf Maennel ⋅ Lei Feng ⋅ Miao Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 131
AGFT: Alignment-Guided Fine-Tuning for Zero-Shot Adversarial Robustness of Vision-Language Models
Yubo Cui ⋅ Xianchao Guan ⋅ Zijun Xiong ⋅ Zheng Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 132
Towards Robust Multimodal Large Language Models Against Jailbreak Attacks
ZIYI YIN ⋅ Yuanpu Cao ⋅ Han Liu ⋅ Ting Wang ⋅ Jinghui Chen ⋅ Fenglong Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 133
R^2TUA: Reconstruction-residual Based Targeted and Untargeted Attack Against Text-Image Person Re-Identification
Yubo Wang ⋅ Yan Lu ⋅ Bin Liu ⋅ Xulin Li ⋅ Jixiang Niu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 134
When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models
Hui Lu ⋅ Yi Yu ⋅ Yiming Yang ⋅ Chenyu Yi ⋅ Qixin Zhang ⋅ Bingquan Shen ⋅ Alex C. Kot ⋅ Xudong Jiang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 135
FlowHijack: A Dynamics-Aware Backdoor Attack on Flow-Matching Vision-Language-Action Models
Xinyuan An ⋅ Tao Luo ⋅ gengyun peng ⋅ Yaobing Wang ⋅ Kui Ren ⋅ Dongxia Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 136
Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models
Xingyu Zhu ⋅ Beier Zhu ⋅ Shuo Wang ⋅ Junfeng Fang ⋅ Kesen Zhao ⋅ Hanwang Zhang ⋅ Xiangnan He
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 137
Enhancing Part-Level Point Grounding for Any Open-Source MLLMs
Jin-Cheng Jhang ⋅ Fu-En Wang ⋅ Xin Yang ⋅ Nan Qiao ⋅ Lu Xia ⋅ Min Sun ⋅ Cheng-Hao Kuo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 138
MeteorPred: A Meteorological Multimodal Large Model and Dataset for Severe Weather Event Prediction
Shuo Tang ⋅ Jian Xu ⋅ Jiadong Zhang ⋅ yi chen ⋅ Qizhao Jin ⋅ Lingdong Shen ⋅ Chenglin Liu ⋅ Shiming Xiang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 139
YieldSAT: A Multimodal Benchmark Dataset for High-Resolution Crop Yield Prediction
Miro Miranda ⋅ Deepak Pathak ⋅ Patrick Helber ⋅ Benjamin Bischke ⋅ Hiba Najjar ⋅ Francisco Mena ⋅ Cristhian Sanchez ⋅ Akshay Pai ⋅ Diego Arenas ⋅ Matias Valdenegro ⋅ Marcela Charfuelan ⋅ Marlon Nuske ⋅ Andreas Dengel
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 140
How Far Can We Go With Synthetic Data for Audio-Visual Sound Source Localization?
Arda Senocak ⋅ Sooyoung Park ⋅ Tae-Hyun Oh ⋅ Joon Chung
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 141
Modeling Cross-vision Synergy for Unified Large Vision Model
Shengqiong Wu ⋅ Lanhu Wu ⋅ Mingyang Bao ⋅ Wenhao Xu ⋅ Hanwang Zhang ⋅ Shuicheng Yan ⋅ Hao Fei ⋅ Tat-seng Chua
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 142
Beyond Missing Modalities: Hypergraph Conditioned Diffusion for Uncertainty-Aware Multimodal Emotion Recognition
Xihang Qiu ⋅ Yuhao Fang ⋅ Qing Zhou ⋅ Bin Zhai ⋅ Jialong Hong ⋅ Wanpeng Zhang ⋅ Yao Lu ⋅ Ye Zhang ⋅ Chun Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 143
Rosetta Stone For Unified MLLMs: A Unified Tokenizer to Decipher Understanding and Generation
Wenyu Sun ⋅ Hufei Li ⋅ Ruijin Jin ⋅ Xiangheng Kong ⋅ Yuning Jiang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 144
MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding
Zhanheng Nie ⋅ Chenghan Fu ⋅ Daoze Zhang ⋅ Junxian Wu ⋅ Wanxian Guan ⋅ Pengjie Wang ⋅ Jian Xu ⋅ Bo Zheng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 145
Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy
Jiahao Huang ⋅ Fengyan Lin ⋅ Xuechao Yang ⋅ Chen Feng ⋅ Kexin Zhu ⋅ Xu Yang ⋅ Zhide chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 146
AMusE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding
Sanjoy Chowdhury ⋅ Karren Dai Yang ⋅ Xudong Liu ⋅ Fartash Faghri ⋅ Pavan Kumar Anasosalu Vasu ⋅ Oncel Tuzel ⋅ Dinesh Manocha ⋅ Chun-Liang Li ⋅ Raviteja Vemulapalli
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 147
Prototype-as-Prompt: Multimodal Sentiment Prototypes Endowing Large Language Models the Capability to Perform Multimodal Sentiment Analysis
Xianbing Zhao ⋅ Lan Luo ⋅ Hengyang Lu ⋅ Buzhou Tang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 148
CF-IPT: Cross-Modal Fusion Interactive Prompt Tuning of Vision-Language Pre-Trained Model for Multisource Remote Sensing Data Classification
Jinheng Ji ⋅ Jiahui Qu ⋅ Wenqian Dong ⋅ Yunsong Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 149
EMAD: Evidence-Centric Grounded Multimodal Diagnosis for Alzheimer’s Disease
Qiuhui Chen ⋅ Xuancheng Yao ⋅ Zhenglei Zhou ⋅ Xinyue Hu ⋅ Yi Hong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 150
Multimodal Learning on Low-Quality Data with Conformal Predictive Self-Calibration
Xun Jiang ⋅ Yufan Gu ⋅ Disen Hu ⋅ Yuqing Hou ⋅ Yazhou Yao ⋅ Fumin Shen ⋅ Heng Tao Shen ⋅ Xing Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 151
Cross-View Distillation and Adaptive Masking for Incomplete Multi-View Multi-Label Classification
Yadong Liu ⋅ Qiaoqi Li ⋅ Yueying Wang ⋅ Lunke Fei ⋅ Jie Wen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 152
Bootstrap Your Own AV-Proxies: Adaptive Contrastive and Prototype Learning for Audio-Visual Segmentation
Junbo Zhang ⋅ Hang Su ⋅ Zhaofan Li ⋅ Hang Dong ⋅ Chao Sun
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 153
Multimodal Distribution Matching for Vision-Language Dataset Distillation
Jongoh Jeong ⋅ Hoyong Kwon ⋅ Minseok Kim ⋅ Kuk-Jin Yoon
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 154
M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG
David Anugraha ⋅ Patrick Irawan ⋅ Anshul Singh ⋅ En-Shiun Annie Lee ⋅ Genta Indra Winata
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 155
Text-Driven 3D Hand Motion Generation from Sign Language Data
Léore Bensabath ⋅ Mathis Petrovich ⋅ Gul Varol
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 156
Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface
Yujie Zhao ⋅ Hongwei Fan ⋅ Di Chen ⋅ Shengcong Chen ⋅ Liliang Chen ⋅ Xiaoqi Li ⋅ Guangrui Ren ⋅ Hao Dong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 157
GenHOI: Towards Object-Consistent Hand–Object Interaction with Temporally Balanced and Spatially Selective Object Injection
Xuan Huang ⋅ Mochu Xiang ⋅ Zhelun Shen ⋅ Jinbo Wu ⋅ Chenming Wu ⋅ Chen Zhao ⋅ Kaisiyuan Wang ⋅ Hang Zhou ⋅ Shanshan Liu ⋅ Haocheng Feng ⋅ Wei He ⋅ Jingdong Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 158
Clay-to-Stone: Phase-wise 3D Gaussian Splatting for Monocular Articulated Hand-Object Manipulation Modeling
Xingyu Liu ⋅ Pengfei Ren ⋅ Qi Qi ⋅ Haifeng Sun ⋅ Zirui Zhuang ⋅ Jianxin Liao ⋅ Jingyu Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 159
Training-free Motion Factorization for Compositional Video Generation
Zixuan Wang ⋅ Ziqin Zhou ⋅ Feng Chen ⋅ DUO PENG ⋅ Yixin Hu ⋅ Changsheng Li ⋅ Yinjie Lei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 160
Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner
Haojie Zheng ⋅ Shuchen Weng ⋅ Jingqi Liu ⋅ Siqi Yang ⋅ Boxin Shi ⋅ Xinlong Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 161
CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization
Yitong Chen ⋅ Zuxuan Wu ⋅ Xipeng Qiu ⋅ Yu-Gang Jiang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 162
FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
Xijie Huang ⋅ Chengming Xu ⋅ Donghao Luo ⋅ Xiaobin Hu ⋅ Peng Tang ⋅ Xu Peng ⋅ Jiangning Zhang ⋅ Chengjie Wang ⋅ Yanwei Fu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 163
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties
Ye Fang ⋅ Tong Wu ⋅ Valentin Deschaintre ⋅ Duygu Ceylan ⋅ Iliyan Georgiev ⋅ Chun-Hao Huang ⋅ Yiwei Hu ⋅ Xuelin Chen ⋅ Tuanfeng Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 164
PoseAnything: General Pose-guided Video Generation with Part-aware Temporal Coherence
Ruiyan Wang ⋅ Teng Hu ⋅ Kaihui Huang ⋅ Zihan Su ⋅ Ran Yi ⋅ Lizhuang Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 165
FastHybrid: Accelerating Hybrid Autoregressive Image Generation with Lookahead and Guided Decoding
j zg ⋅ Fang Zhang ⋅ YongXiang Hua ⋅ Bocheng Li ⋅ Wentao Zhang ⋅ Linli Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 166
DPAR: Dynamic Patchification for Efficient Autoregressive Visual Generation
Divyansh Srivastava ⋅ Akshay Mehra ⋅ Pranav Maneriker ⋅ Debopam Sanyal ⋅ Vishnu Raj ⋅ Vijay Kamarshi ⋅ Fan Du ⋅ Joshua Kimball
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 167
AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation
Sharath Girish ⋅ Viacheslav Ivanov ⋅ Tsai-Shien Chen ⋅ Hao Chen ⋅ Aliaksandr Siarohin ⋅ Sergey Tulyakov
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 168
LeapAlign: Post-training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories
Zhanhao Liang ⋅ Tao Yang ⋅ Jie Wu ⋅ Chengjian Feng ⋅ Liang Zheng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 169
EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation
Tianwei Xiong ⋅ Jun Hao Liew ⋅ Zilong Huang ⋅ Zhijie Lin ⋅ Jiashi Feng ⋅ Xihui Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 170
Flow Matching for Multimodal Distributions
Gaoxiang Luo ⋅ Frank Cole ⋅ Sihang Zhang ⋅ Yuxiang Wan ⋅ Yulong Lu ⋅ Ju Sun
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 171
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
Xiangyan Qu ⋅ Zhenlong Yuan ⋅ Jing Tang ⋅ Rui Chen ⋅ Datao Tang ⋅ Meng Yu ⋅ Lei Sun ⋅ Yancheng Bai ⋅ Xiangxiang Chu ⋅ Gaopeng Gou ⋅ Gang Xiong ⋅ Yujun Cai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 172
ReasonEdit: Towards Reasoning-Enhanced Image Editing Models
Fukun Yin ⋅ Shiyu Liu ⋅ Yucheng Han ⋅ Zhibo Wang ⋅ Peng Xing ⋅ Rui Wang ⋅ Wei Cheng ⋅ Yingming Wang ⋅ Aojie Li ⋅ Zixin Yin ⋅ Pengtao Chen ⋅ Xianfang Zeng ⋅ Gang Yu ⋅ Daxin Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 173
Cross-Subject EEG-to-Video Reconstruction and Beyond
Runduo Han ⋅ Hongchen Tan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 174
Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot Video Generation
Binyuan Huang ⋅ Yuning Lu ⋅ Weinan Jia ⋅ hualiang wang ⋅ Mu Liu ⋅ Daiqing Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 175
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation
Bowen Xue ⋅ Zheng-Peng Duan ⋅ Qixin Yan ⋅ Wenjing Wang ⋅ Hao Liu ⋅ Chunle Guo ⋅ Chongyi Li ⋅ Chen Li ⋅ Jing LYU
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 176
BiFM: Bidirectional Flow Matching for Few-Step Image Editing and Generation
Yasong Dai ⋅ Zeeshan Hayder ⋅ David Ahmedt-Aristizabal ⋅ Hongdong Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 177
DTG-Restore: Training-Free Diffusion Refinement for Generative Video Super-Resolution
Hidir Yesiltepe ⋅ Koutilya PNVR ⋅ Gaurav Suresh Pathak ⋅ Navaneeth Bodla ⋅ Bharat Singh ⋅ Pinar Yanardag ⋅ Jinrong Xie
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 178
VABench: A Comprehensive Benchmark for Audio-Video Generation
Daili Hua ⋅ Xizhi Wang ⋅ Bohan Zeng ⋅ Xinyi Huang ⋅ Hao Liang ⋅ Junbo Niu ⋅ Xinlong Chen ⋅ Quanqing Xu ⋅ Wentao Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 179
Relightful Video Portrait Harmonization
Jun Myeong Choi ⋅ Jae Shin Yoon ⋅ Luchao Qi ⋅ Roni Sengupta ⋅ Joon-Young Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 180
DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training
Haoran Feng ⋅ Dizhe Zhang ⋅ Xiangtai Li ⋅ Bo Du ⋅ Lu Qi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 181
DVAR: Dynamic Visual Autoregressive Modeling for Image Super-Resolution
Yu Zheng ⋅ Kai Zhang ⋅ Wei Zhu ⋅ Qingguo Liu ⋅ Xiantao Hu ⋅ Jun Li ⋅ Jian Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 182
Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers
Yuhe Liu ⋅ Zhenxiong Tan ⋅ Yujia Hu ⋅ Songhua Liu ⋅ Xinchao Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 183
LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation
yushi Huang ⋅ Xingtong Ge ⋅ RUIHAO GONG ⋅ Chengtao Lv ⋅ Jun Zhang
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 184
UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution
Thien Tan Cao ⋅ Phan Thi Thu Trang ⋅ Nghiem Duc ⋅ Ho Ngoc Anh ⋅ Nguyen Duc Dung ⋅ Duc Dung Nguyen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 185
EMR-Diff: Edge-aware Multimodal Residual Diffusion Model for Hyperspectral Image Super-resolution
Tao Zhang ⋅ Shengtao Yao ⋅ Rong Zeng ⋅ Zunjie Zhu ⋅ Bolun Zheng ⋅ Yaoqi Sun ⋅ Ying Fu ⋅ Chenggang Yan
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 186
RAW-Domain Degradation Models for Realistic Smartphone Super-Resolution
Ali Mosleh ⋅ Faraz Ali ⋅ Fengjia Zhang ⋅ Stavros Tsogkas ⋅ Junyong Lee ⋅ Michael S. Brown ⋅ Alex Levinshtein
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 187
One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution
Yushun Fang ⋅ Yuxiang Chen ⋅ Shibo Yin ⋅ Qiang Hu ⋅ Jiangchao Yao ⋅ Ya Zhang ⋅ Xiaoyun Zhang ⋅ Yanfeng Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 188
FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution
Seungho Choi ⋅ Jeahun Sung ⋅ Jihyong Oh
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 189
HDW-SR: High-Frequency Guided Diffusion Model based on Wavelet Decomposition for Image Super-Resolution
Chao Yang ⋅ Boqian Zhang ⋅ Jinghao Xu ⋅ Guang Jiang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 190
Unifying Precise Keyframes and Semantic Control via Multi-level Diffusion
Linjun Wu ⋅ Jiejia Yu ⋅ Leyang Jin ⋅ He Wang ⋅ Bowen Zheng ⋅ Xu Yang ⋅ Hao Jiang ⋅ Fei Xia ⋅ Fei Ling ⋅ Jun Deng ⋅ Xiaogang Jin
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 191
CIGPose: Causal Intervention Graph Neural Network for Whole-Body Pose Estimation
Bohao Li ⋅ Zhicheng Cao ⋅ Huixian Li ⋅ Yangming Guo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 192
Pressure2Motion: Hierarchical Human Motion Reconstruction from Ground Pressure with Text Guidance
Zhengxuan Li ⋅ Qinhui Yang ⋅ Yiyu Zhuang ⋅ Chuan Guo ⋅ Xinxin Zuo ⋅ Xiaoxiao Long ⋅ Yao Yao ⋅ Xun Cao ⋅ Qiu Shen ⋅ Hao Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 193
From 3D Pose to Prose: Biomechanics-Grounded Vision–Language Coaching
Yuyang Ji ⋅ Yixuan Shen ⋅ Shengjie Zhu ⋅ Yu Kong ⋅ Feng Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 194
InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions
Sirui Xu ⋅ Samuel Schulter ⋅ Morteza Ziyadi ⋅ Xialin He ⋅ Xiaohan Fei ⋅ Yu-Xiong Wang ⋅ Liang-Yan Gui
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 195
MoCoDiff: A Controllable Autoregressive Diffusion Model for Expressive Motion Generation
Wenfeng Song ⋅ Xuehan Wang ⋅ Shuai Li ⋅ Yi Chen ⋅ Yuting Guo ⋅ Zhenyu Wu ⋅ Xingliang Jin ⋅ Chenglizhao Chen ⋅ Fei Hou ⋅ Hongyu Wu ⋅ Aimin Hao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 196
W2W: Language-Model-Based Trajectory Prediction with Reinforcement Learning
Zirui Xu ⋅ Biao Yang ⋅ rongrong Ni ⋅ Zhongkai Zhou ⋅ Shaobo Shen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 197
ParTY: Part-Guidance for Expressive Text-to-Motion Synthesis
KunHo Heo ⋅ SuYeon Kim ⋅ Yonghyun Gwon ⋅ Youngbin Kim ⋅ MyeongAh Cho
[ Slides
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 198
Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models
Pablo Ruiz-Ponce ⋅ Sergio Escalera ⋅ Jose Garcia-Rodriguez ⋅ Jiankang Deng ⋅ Rolandos Alexandros Potamias
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 199
Unified Number-Free Text-to-Motion Generation Via Flow Matching
Guanhe Huang ⋅ Oya Celiktutan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 200
Generative Diffusion Priors for 3D Mapping of the Dark Universe
Brandon Zhao ⋅ Diana Scognamiglio ⋅ Olivier Doré ⋅ Katherine L. Bouman
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 201
FlowPalm: Optical Flow Driven Non-Rigid Deformation for Geometrically Diverse Palmprint Generation
yuchen zou ⋅ Huikai Shao ⋅ Lihuang Fang ⋅ Zhipeng Xiong ⋅ Dexing Zhong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 202
DiffuView: Multi-View Diffusion Pretraining for 3D Aware Robotic Manipulation
Kaizhao Zhang ⋅ Tian Niu ⋅ Tianyu Liu ⋅ Chenen Guo ⋅ Zijun Xu ⋅ Qingda Hu ⋅ Wenchao Ding
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 203
Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers
Binxu Wang ⋅ Jingxuan Fan ⋅ Xu Pan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 204
Dual Ascent Diffusion for Inverse Problems
Minseo Kim ⋅ Axel Levy ⋅ Gordon Wetzstein
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 205
Forecast the Principal, Stabilize the Residual: Subspace-Aware Feature Caching for Diffusion Transformers
Guantao Chen ⋅ Shikang Zheng ⋅ Yuqi Lin ⋅ Linfeng Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 206
Spatial-Spectral Residuals Informed Diffusion Neural Operator for Pan-sharpening
jiahan huang ⋅ Ran Ran ⋅ Junming Hou ⋅ Zihao Chen ⋅ Xiaofeng Cong ⋅ Junling Li ⋅ Liang-Jian Deng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 207
PhyOceanCast: Global Ocean Forecasting with Physics-Informed Diffusion
Qixiu Li ⋅ Xiang Zhu ⋅ Xiaoyong Li ⋅ Xiaolong Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 208
Pixel Motion Diffusion is What We Need for Robot Control
E-Ro Nguyen ⋅ Yichi Zhang ⋅ Kanchana Ranasinghe ⋅ Xiang Li ⋅ Michael Ryoo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 209
ORIC: Benchmarking Object Recognition under Contextual Incongruity in Large Vision-Language Models
Zhaoyang Li ⋅ Zhan Ling ⋅ Yuchen Zhou ⋅ Litian Gong ⋅ Erdem Biyik ⋅ Hao Su
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 210
M3Grounder: Mask-Based Multi-Span and Multi-Granular Grounding for Document QA
Venkata Kesav Venna ⋅ Sai Madhusudan Gunda ⋅ Jyothi Swaroopa Jinka ⋅ Hrithik Sagar Rachakonda ⋅ Anirudh Srinivasan ⋅ Ravi Kiran Sarvadevabhatla
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 211
BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models
Shengao Wang ⋅ Wenqi Wang ⋅ Zecheng Wang ⋅ Max Whitton ⋅ Michael Wakeham ⋅ Arjun Chandra ⋅ Joey Huang ⋅ Pengyue Zhu ⋅ Helen Chen ⋅ David Li ⋅ Jeffrey Li ⋅ Shawn Li ⋅ Andrew Zagula ⋅ Amy Zhao ⋅ Andrew Zhu ⋅ Sayaka Nakamura ⋅ Yuki Yamamoto ⋅ Jerry Yokono ⋅ Aaron Mueller ⋅ Bryan A. Plummer ⋅ Kate Saenko ⋅ Venkatesh Saligrama ⋅ Boqing Gong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 212
Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training
Gengluo Li ⋅ Pengyuan Lyu ⋅ Chengquan Zhang ⋅ Huawen Shen ⋅ Liang Wu ⋅ Xingyu Wan ⋅ Gangyan Zeng ⋅ Han Hu ⋅ Can Ma ⋅ Yu ZHOU
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 213
RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding
Xiyan Liu ⋅ Han Wang ⋅ Yuhu Wang ⋅ JUNJIE CAI ⋅ Zhe Cao ⋅ Jianzhong Yang ⋅ Zhen Lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 214
UNICBench: UNIfied Counting Benchmark for MLLM
Chenggang Rong ⋅ Tao Han ⋅ Zhiyuan Zhao ⋅ Yaowu Fan ⋅ Jia Wan ⋅ Song Guo ⋅ Yuan Yuan ⋅ Junyu Gao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 215
CaptionQA: Is Your Caption as Useful as the Image Itself?
Shijia Yang ⋅ Yunong Liu ⋅ Bohan Zhai ⋅ Ximeng Sun ⋅ Zicheng Liu ⋅ Emad Barsoum ⋅ Manling Li ⋅ Chenfeng Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 216
EgoProx: Evaluating MLLMs on Egocentric 3D Proximity Reasoning Across a Cognitive Hierarchy
Jinzhao Li ⋅ Yinuo Chen ⋅ Dongxu Piao ⋅ Panwang Pan ⋅ Yifan Yu ⋅ Dong Wang ⋅ Honglei Yan ⋅ Liang Yue ⋅ Shaofei Wang ⋅ Yixin Chen ⋅ Siyuan Huang ⋅ Miao Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 217
VULCAN: Tool-Augmented Multi Agents for Iterative 3D Object Arrangement
Zhengfei Kuang ⋅ Rui Lin ⋅ Long Zhao ⋅ Gordon Wetzstein ⋅ Saining Xie ⋅ Sanghyun Woo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 218
EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding
Seungjun Lee ⋅ Zihan Wang ⋅ Yunsong Wang ⋅ Gim Hee Lee
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 219
Efficient Encoder-Free Fourier-based 3D Large Multimodal Model
Guofeng Mei ⋅ Wei Lin ⋅ Luigi Riz ⋅ Yujiao Wu ⋅ Yiming Wang ⋅ Fabio Poiesi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 220
Socratic-Geo: Synthetic Data Generation and Cross-Modal Geometric Reasoning via Multi-Agent Interaction
Zhengbo Jiao ⋅ Zifan Zhang ⋅ Shaobo Wang ⋅ Wei Wang ⋅ Bing Zhao ⋅ hu wei ⋅ Linfeng Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 221
HAMMER: Harnessing MLLMs via Cross-Modal Integration for Intention-Driven 3D Affordance Grounding
Lei Yao ⋅ Yong Chen ⋅ YUEJIAO SU ⋅ Yi Wang ⋅ Moyun Liu ⋅ Lap-Pui Chau
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 222
Proxy3D: Efficient 3D Representations for Vision-Language Models via Semantic Clustering and Alignment
Jerry Jiang ⋅ Haowen Sun ⋅ Denis Gudovskiy ⋅ Yohei Nakata ⋅ Tomoyuki Okuno ⋅ Kurt Keutzer ⋅ Wenzhao Zheng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 223
ReLaGS: Relational Language Gaussian Splatting
Yaxu Xie ⋅ Abdalla Arafa ⋅ Alireza Javanmardi ⋅ Christen Millerdurai ⋅ Jia Cheng Hu ⋅ Shaoxiang Wang ⋅ Alain Pagani ⋅ Didier Stricker
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 224
3D-IDE: 3D Implicit Depth Emergent
Chushan Zhang ⋅ Ruihan Lu ⋅ Jinguang Tong ⋅ Yikai Wang ⋅ Hongdong Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 225
FunFact: Building Probabilistic Functional 3D Scene Graphs via Factor-Graph Reasoning
Zhengyu Fu ⋅ René Zurbrügg ⋅ Kaixian Qu ⋅ Marc Pollefeys ⋅ Marco Hutter ⋅ Hermann Blum ⋅ Zuria Bauer
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 226
Parse, Search, and Confirmation: Training-Free Aerial Vision-and-Dialog Navigation with Chain-of-Thought Reasoning and Structured Spatial Memory
Yu Qi ⋅ Hongyu Li ⋅ Shaofei Huang ⋅ Tianrui Hui ⋅ Yaxiong Wang ⋅ Lechao Cheng ⋅ Zhun Zhong ⋅ Si Liu ⋅ Meng Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 227
4DP-QA: Scalable QA for 4D Perception in Vision Language Models
Seokju Cho ⋅ Abhishek Badki ⋅ Hang Su ⋅ Jindong Jiang ⋅ Ziyao Zeng ⋅ Seungryong Kim ⋅ Sifei Liu ⋅ Orazio Gallo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 228
LASAR: Towards Spatio-temporal Reasoning with Latent Cognitive Map
Jinzhou Tang ⋅ Sidi Liu ⋅ Waikit Xiu ⋅ weixing chen ⋅ Keze Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 229
Text-Phase Synergy Network with Dual Priors for Unsupervised Cross-Domain Image Retrieval
Jing Yang ⋅ Hui Xue ⋅ Shipeng Zhu ⋅ Pengfei Fang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 230
EagleNet: Energy-Aware Fine-Grained Relationship Learning Network for Text-Video Retrieval
Yuhan Chen ⋅ Pengwen Dai ⋅ Chuan Wang ⋅ Dayan Wu ⋅ Xiaochun Cao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 231
PIX-TAB: Efficient PIXel-Precise TABle Structure Recognition Approach with Speculative Decoding and Region-Based Image Segmentation
Viktor Zaytsev ⋅ Olena Vynokurova ⋅ Pavlo Tytarchuk ⋅ Dmytro Kozii ⋅ Vitalii Pohribnyi ⋅ Olga Radyvonenko ⋅ Artem Shcherbina
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 232
CARLoS: Retrieval via Concise Assessment Representation of LoRAs at Scale
Shahar Sarfaty ⋅ Adi Haviv ⋅ Uri Y. Hacohen ⋅ Niva Elkin-Koren ⋅ Roi Livni ⋅ Amit H. Bermano
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 233
Camouflage-aware Image-Text Retrieval via Expert Collaboration
Yao Jiang ⋅ Zhongkuan Mao ⋅ xuan wu ⋅ Keren Fu ⋅ Qijun Zhao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 234
TriSim: Tri-Dimensional Similarity Modeling with Extreme Value Theory for False-Negative Mitigation in Remote Sensing Image-Text Retrieval
Chengyu Zheng ⋅ Hanzhang Lu ⋅ Jie Nie ⋅ Shan Du
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 235
TIGER: A Unified Framework for Time, Images and Geo-location Retrieval
David G. ⋅ Sirnam Swetha ⋅ Mubarak Shah
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 236
Mistake Attribution: Fine-Grained Mistake Understanding in Egocentric Videos
Yayuan Li ⋅ Aadit Jain ⋅ Filippos Bellos ⋅ Jason J. Corso
[ Slides
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 237
VidTAG: Temporally Aligned Video to GPS Geolocalization with Denoising Sequence Prediction at a Global Scale
Parth Parag Kulkarni ⋅ Rohit Gupta ⋅ Prakash Chandra Chhipa ⋅ Mubarak Shah
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 238
Stitch-a-Demo: Creating Video Demonstrations from Multistep Descriptions
Chi Hsuan Wu ⋅ Kumar Ashutosh ⋅ Kristen Grauman
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 239
Prototypical Action Reasoning Facilitated by Vision-Language Alignment for Egocentric Action Anticipation
jiang shao ⋅ Xinbo Zhao ⋅ Wenyin Tuo ⋅ XiaoChun Zou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 240
AdaSpot: Spend Resolution Where It Matters for Precise Event Spotting
Artur Xarles i Esparraguera ⋅ Sergio Escalera ⋅ Thomas B. Moeslund ⋅ Albert Clapés
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 241
Unique Lives, Shared World: Learning from Single-Life Videos
Tengda Han ⋅ Sayna Ebrahimi ⋅ Dilara Gokay ⋅ Li Yang Ku ⋅ Maks Ovsjanikov ⋅ Iva Babukova ⋅ Daniel Zoran ⋅ Viorica Patraucean ⋅ Joao Carreira ⋅ Andrew Zisserman ⋅ Dima Damen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 242
Symphony: A Cognitively-Inspired Multi-Agent System for Long-Video Understanding
海洋 闫 ⋅ Hongyun Zhou ⋅ Peng Xu ⋅ Xiaoxue Feng ⋅ Mengyi Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 243
VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding
Yufei Yin ⋅ Qianke Meng ⋅ Minghao Chen ⋅ Jiajun Ding ⋅ Zhenwei Shao ⋅ Zhou Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 244
Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding
Wang Chen ⋅ Yuhui zeng ⋅ Yongdong Luo ⋅ Tianyu Xie ⋅ Luojun Lin ⋅ Jiayi Ji ⋅ Yan Zhang ⋅ Xiawu Zheng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 245
SVAgent: Storyline-guided Long Video Understanding via Cross-Modal Multi-Agent Collaboration
zhongyu yang ⋅ Zuhao Yang ⋅ SHUO ZHAN ⋅ Tan Yue ⋅ Wei Pang ⋅ Yingfang Yuan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 246
Frame2Freq: Spectral Adapters for Fine-Grained Video Understanding
Thinesh Thiyakesan Ponbagavathi ⋅ Constantin Seibold ⋅ Alina Roitberg
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 247
Structural Graph Probing of Vision–Language Models
Haoyu He ⋅ Yue Zhuo ⋅ Yu Zheng ⋅ Qi R. Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 248
Saliency-R1: Enforcing Interpretable and Faithful Vision-language Reasoning via Saliency-map Alignment Reward
Shizhan Gong ⋅ Minda Hu ⋅ Qiyuan Zhang ⋅ Chen Ma ⋅ Qi Dou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 249
Hidden Monotonicity: Explaining Deep Neural Networks via their DC Decomposition
Jakob Paul Zimmermann ⋅ Georg Loho
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 250
MaskDiME: Adaptive Masked Diffusion for Precise and Efficient Visual Counterfactual Explanations
Changlu Guo ⋅ Anders Nymark Christensen ⋅ Anders Bjorholm Dahl ⋅ Morten Hannemose
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 251
TRANSPORTER: Transferring Visual Semantics from VLM Manifolds
Alexandros Stergiou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 252
Relational Visual Similarity
Thao Nguyen ⋅ Sicheng Mo ⋅ Krishna Kumar Singh ⋅ Yilin Wang ⋅ Jing Shi ⋅ Nick Kolkin ⋅ Eli Shechtman ⋅ Yong Jae Lee ⋅ Yuheng Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 253
PointCNN++: Performant Convolution on Native Points
Lihan Li ⋅ Haofeng Zhong ⋅ Rui Bu ⋅ Mingchao Sun ⋅ Wenzheng Chen ⋅ Baoquan Chen ⋅ Yangyan Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 254
Fast Markov Random Field Optimisation for Topologically Noisy 3D Shape Matching
Paul Roetzer ⋅ Johan Thunberg ⋅ Zorah Lähner ⋅ Florian Bernard
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 255
LitePT: Lighter Yet Stronger Point Transformer
Yuanwen Yue ⋅ Damien Robert ⋅ Jianyuan Wang ⋅ Sunghwan Hong ⋅ Jan D. Wegner ⋅ Christian Rupprecht ⋅ Konrad Schindler
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 256
SuP: Sub-cloud Driven Point Cloud Registration
Sheldon Fung ⋅ Wei Pan ⋅ Ling Cao ⋅ Fei Hou ⋅ Ling Chen ⋅ Shasha Mao ⋅ Hongdong Li ⋅ Xuequan Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 257
PQDT: Pseudo-Query Dual Transformer for Robust Point Cloud Restoration
Haoqing Wu ⋅ Alexa Nawotki ⋅ Jochen Garcke
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 258
Test-Time Training for LiDAR Semantic Segmentation under Corruption via Geometric Inlier Discrimination
Hyeonseong Kim ⋅ Hyun-Kurl Jang ⋅ Kuk-Jin Yoon
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 259
MHopReg: Efficient Hierarchical Multi-Hop Graph Search for Point Cloud Registration
Yue Wu ⋅ Feng Xiao ⋅ Yongzhe Yuan ⋅ Hao Li ⋅ Kaiyuan Feng ⋅ Maoguo Gong ⋅ Qiguang Miao ⋅ Wenping Ma
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 260
GEM: Generating LiDAR World Model via Deformable Mamba
Yang Wu ⋅ Zhaojiang Liu ⋅ Qiang Meng ⋅ Youquan Liu ⋅ renliang Weng ⋅ Jianjun Qian ⋅ Jian Yang ⋅ Jin Xie
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 261
Hybrid Robust Collaborative Perception with LiDAR-4D Radar Fusion under Adverse Weather Conditions
Yuquan Yang ⋅ hui zhang ⋅ Wenyu Lu ⋅ Ziyin Zhang ⋅ Chuanming Zhang ⋅ Xiaohua Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 262
Task-Driven Implicit Representations for Automated Design of LiDAR Systems
Nikhil Behari ⋅ Aaron Young ⋅ Tzofi Klinghoffer ⋅ Akshat Dave ⋅ Ramesh Raskar
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 263
Hierarchical Point-Patch Fusion with Adaptive Patch Codebook for 3D Shape Anomaly Detection
Xueyang Kang ⋅ Zizhao Li ⋅ Tian Lan ⋅ Dong Gong ⋅ Kourosh Khoshelham ⋅ Liangliang Nan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 264
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
Zhengyang Sun ⋅ Yu Chen ⋅ Xin Zhou ⋅ Xiaofan Li ⋅ Xiwu Chen ⋅ Dingkang Liang ⋅ Xiang Bai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 265
Beyond Layer-Wise Merging: Chain-of-Merging for Vision-Language Models
Xinyu Zhang ⋅ Yuxuan Dong ⋅ Lingling Zhang ⋅ Chengyou Jia ⋅ Zhuohang Dang ⋅ YiXing Yao ⋅ Yaqiang Wu ⋅ Basura Fernando ⋅ Jun Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 266
GazeShift: Unsupervised Gaze Estimation and Dataset for VR
Gil Shapira ⋅ Ishay Goldin ⋅ Evgeny Artyomov ⋅ Donghoon Kim ⋅ Yosi Keller ⋅ Niv Zehngut
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 267
Improving Calibration in Test-Time Prompt Tuning for Vision-Language Models via Data-Free Flatness-Aware Prompt Pretraining
Hyeonseo Jang ⋅ Jaebyeong Jeon ⋅ Joong-won Hwang ⋅ Kibok Lee
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 268
Reevaluating the Intra-Modal Misalignment Hypothesis in CLIP
Jonas Herzog ⋅ Yue Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 269
Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented Design
Haoxiang Sun ⋅ Tao Wang ⋅ Chenwei Tang ⋅ Li Yuan ⋅ Jiancheng Lv
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 270
Soft Modality-Guided Expert Specialization in MoE-VLMs
Zi-Hao Bo ⋅ Yaqian Li ⋅ Anzhou Hou ⋅ rinyoichi takezoe ⋅ Ertao Zhao ⋅ Tianxiang Pan ⋅ Jiale Yan ⋅ Mo Guang ⋅ Kaiwen Long
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 271
CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models
Nan Zhou ⋅ Huiqun Wang ⋅ Yaoyan Zheng ⋅ Di Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 272
Retrieving Counterfactuals Improves Visual In-Context Learning
Guangzhi Xiong ⋅ Sanchit Sinha ⋅ Zhenghao He ⋅ Aidong Zhang
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 273
AutoRegressive Generation with B-rep Holistic Token Sequence Representation
Jiahao Li ⋅ Yunpeng Bai ⋅ Yongkang Dai ⋅ Hao Guo ⋅ Hongping Gan ⋅ Yilei Shi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 274
VecGlypher: Unified Vector Glyph Generation with Language Models
Xiaoke Huang ⋅ Bhavul Gauri ⋅ Kam-Woh Ng ⋅ Tony Ng ⋅ Mengmeng Xu ⋅ Zhiheng Liu ⋅ Weiming Ren ⋅ Zhaochong An ⋅ Zijian Zhou ⋅ Haonan Qiu ⋅ Yuyin Zhou ⋅ Sen He ⋅ Ziheng Wang ⋅ Tao Xiang ⋅ Xiao Han
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 275
NERFIFY: A Multi-Agent Framework for Turning NeRF Papers into Code
Seemandhar Jain ⋅ Keshav Gupta ⋅ Kunal Gupta ⋅ Manmohan Chandraker
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 276
Diagram2Structure: Unlocking LLMs' Diagram Comprehension through DiagramDiff, an Offline Diagram Structuring Framework
Haoxiang Hu ⋅ Yaokun Li ⋅ Zeyuan Huang ⋅ Cangjun Gao ⋅ Qiang He ⋅ Qingkun Li ⋅ Xiaoming Deng ⋅ Cuixia Ma ⋅ Yu-Kun Lai ⋅ Yong-Jin Liu ⋅ Hongan Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 277
ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement
Zhihang Liu ⋅ Xiaoyi Bao ⋅ Pandeng Li ⋅ Junjie Zhou ⋅ Zhaohe Liao ⋅ Yefei He ⋅ Kaixun Jiang ⋅ Chenwei Xie ⋅ Yun Zheng ⋅ Hongtao Xie
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 278
GardenDesigner: Encoding Aesthetic Principles into Jiangnan Garden Construction via a Chain of Agents
Mengtian Li ⋅ Fan Yang ⋅ Ruixue Xiong ⋅ Yiyan Fan ⋅ Zhifeng Xie ⋅ Zeyu Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 279
ShadowDraw: From Any Object to Shadow-Drawing Compositional Art
Rundong Luo ⋅ Noah Snavely ⋅ Wei-Chiu Ma
[ Slides
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 280
End-to-End Hyper-Relational Information Extraction for Engineering Diagrams via Dynamically Tokenized Relation Transformer
Tianyou Bai ⋅ Yan-Ming Zhang ⋅ Zixiang Zhang ⋅ Jibin Zhou ⋅ Fei Yin ⋅ Chenglin Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 281
When Anonymity Breaks: Identifying Models Behind Text-to-Image Leaderboards
Ali Naseh ⋅ Anshuman Suri ⋅ Yuefeng Peng ⋅ Harsh Chaudhari ⋅ Alina Oprea ⋅ Amir Houmansadr
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 282
Bias at the End of the Score
Salma Abdel Magid ⋅ Grace Guo ⋅ Esin Tureci ⋅ Amaya Dharmasiri ⋅ Vikram V. Ramaswamy ⋅ Hanspeter Pfister ⋅ Olga Russakovsky
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 283
PECCVAI: Overcoming the Brittleness of AI Image Watermarking Under Visual Paraphrasing Attacks
Shreyas Dixit ⋅ Ashhar Aziz ⋅ Shashwat Bajpai ⋅ Vasu Sharma ⋅ Aman Chadha ⋅ Vinija Jain ⋅ Amitava Das
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 284
Dynamic Token Reweighting for Robust Vision-Language Models
Tanqiu Jiang ⋅ Jiacheng Liang ⋅ Rongyi Zhu ⋅ Jiawei Zhou ⋅ Fenglong Ma ⋅ Ting Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 285
COPYLENS: Towards Copyrighted Characters Infringement Detection via Copyright-Aware Prompt Learning
Yaoyu Jin ⋅ Xiaochun Yang ⋅ Hong Liu ⋅ Leixia Wang ⋅ Jian Li ⋅ Rui Ding ⋅ Bin Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 286
Closed-Form Concept Erasure via Double Projections
CHI ZHANG ⋅ Jingpu Cheng ⋅ Zhixian Wang ⋅ Ping Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 287
Adaptive Bayesian Early-Exit Networks for Efficient Non-Transferable Learning
Siyu Luan ⋅ Yan Li ⋅ Zhong Chen ⋅ Zhenyi Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 288
Stake the Points: Structure-Faithful Instance Unlearning
Kiseong Hong ⋅ JungKyoo Shin ⋅ Eunwoo Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 289
Federated Active Learning Under Extreme Non-IID and Global Class Imbalance
Chen-Chen Zong ⋅ Shengjun Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 290
FedRG: Unleashing the Representation Geometry for Federated Learning with Noisy Clients
Tian Wen ⋅ Zhiqin Yang ⋅ Yonggang Zhang ⋅ Xuefeng Jiang ⋅ Hao Peng ⋅ Yuwei Wang ⋅ Bo Han
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 291
FedCART: Tackling Long-Tailed Distributions in Federated Adversarial Training via Classifier Refinement
Yuchen Qin ⋅ Yizhi Zhou ⋅ Junxiao Wang ⋅ Xin Xie ⋅ Heng QI
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 292
Generalized and Personalized Federated Learning with Black-Box Foundation Models via Orthogonal Transformations
Eun Gyung Kong ⋅ Jewon Yeom ⋅ Yonghoon Jeon ⋅ Taesup Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 293
Fully Decentralized Certified Unlearning
Hithem Lamri ⋅ Michail Maniatakos
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 294
Fed-ADE: Adaptive Learning Rate for Federated Post-adaptation under Distribution Shift
Heewon Park ⋅ Mugon Joe ⋅ Miru Kim ⋅ Kyungjin Im ⋅ Minhae Kwon
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 295
Towards Streaming Referring Video Segmentation via Large Language Model
Wenkang Zhang ⋅ Kaicheng Yang ⋅ Xiang An ⋅ Qiang Li ⋅ Ziyong Feng ⋅ Wankou Yang ⋅ Jiankang Deng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 296
Multi-speaker Attention Alignment for Multimodal Social Interaction
LIANGYANG OUYANG ⋅ Yifei Huang ⋅ Mingfang Zhang ⋅ Caixin Kang ⋅ Ryosuke Furuta ⋅ Yoichi Sato
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 297
OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding
Minghang Zheng ⋅ Zihao Yin ⋅ Yi Yang ⋅ Yuxin Peng ⋅ Yang Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 298
SARL-STG: A Spatially Aware Reinforcement Learning Framework for Refining MLLMs in Spatio-Temporal Video Grounding
Hong Gao ⋅ Xiangkai Xu ⋅ Bin Zhong ⋅ Junjie Yin ⋅ Fangyu Kang ⋅ Yutong Xu ⋅ Xiugang Dong ⋅ Xurui Gao ⋅ Min-Ling Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 299
VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding
Shihao Wang ⋅ Guo Chen ⋅ De-An Huang ⋅ Zhiqi Li ⋅ Minghan LI ⋅ Guilin Liu ⋅ Jan Kautz ⋅ Jose M. Alvarez ⋅ Lei Zhang ⋅ Zhiding Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 300
DeRVOS: Decoupling Consistent Trajectory Generation and Multimodal Understanding for Referring Video Object Segmentation
WENXUAN CHENG ⋅ Ming Dai ⋅ Huimin Lu ⋅ Wankou Yang
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 301
UniCompress: Token Compression for Unified Vision–Language Understanding and Generation
Ziyao Wang ⋅ Chen Chen ⋅ Jingtao Li ⋅ Weiming Zhuang ⋅ Jiabo Huang ⋅ Ang Li ⋅ Lingjuan Lv
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 302
StreamingTOM: Streaming Token Compression for Efficient Video Understanding
Xueyi Chen ⋅ Keda Tao ⋅ Kele Shao ⋅ Huan Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 303
SCoRe: Salience-Coverage Reduction for Vision Token Pruning in Vision-Language Models
Tong Xu ⋅ Hailong Shi ⋅ Xingyu Gao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 304
VLM-PTQ: Efficient Post-Training Quantization for Large Vision-Language Models
Juncan Deng ⋅ Kejie Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 305
Aligning What Vision-Language Models See and Perceive with Adaptive Information Flow
Chengxin Liu ⋅ Wonseok Choi ⋅ Chenshuang Zhang ⋅ Tae-Hyun Oh
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 306
Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization
Chenwei Jia ⋅ Baoting Li ⋅ Xuchong Zhang ⋅ Mingzhuo Wei ⋅ Bochen Lin ⋅ Hongbin Sun
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 307
Rethinking Token Reduction for Large Vision-Language Models
Yi Wang ⋅ Haofei Zhang ⋅ Qihan Huang ⋅ Anda Cao ⋅ Gongfan Fang ⋅ Wei Wang ⋅ Xuan Jin ⋅ Jie Song ⋅ Mingli Song ⋅ Xinchao Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 308
Prototype-based Causal Intervention for Multi-Label Image Classification
Yanmin Li ⋅ Zhilong Mao ⋅ Mao Wang ⋅ Lihua Liu ⋅ Jibing Wu ⋅ Weidong Bao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 309
FAST: Topology-Aware Frequency-Domain Distribution Matching for Coreset Selection
Jin Cui ⋅ Boran Zhao ⋅ Jiajun Xu ⋅ Jiaqi guo ⋅ Shuo Guan ⋅ Pengju Ren
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 310
Face-Guided Sentiment Boundary Enhancement for Weakly-Supervised Temporal Sentiment Localization
Cailing Han ⋅ Zhangbin Li ⋅ Jinxing Zhou ⋅ Wei Qian ⋅ Jingjing Hu ⋅ Yanghao Zhou ⋅ Zhangling Duan ⋅ Dan Guo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 311
Evidential Deep Partial Label Learning to Quantify Disambiguation Uncertainty
Jinfu Fan ⋅ Jiangnan Li ⋅ Xiaohui Zhong ⋅ Kangrui Ren ⋅ Zhencun Jiang ⋅ 福建话 赣方言 ⋅ Tianhao Gu ⋅ Linqing Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 312
Unlocking Strong Supervision: A Data-Centric Study of General-Purpose Audio Pre-Training Methods
Xuanru Zhou ⋅ Yiwen Shao ⋅ Wei-Cheng Tseng ⋅ Dong Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 313
Revisiting Learning with Noisy Labels: Active Forgetting and Noise Suppression
Mengmeng Sheng ⋅ Zeren Sun ⋅ Tao Chen ⋅ Jinshan Pan ⋅ Yazhou Yao ⋅ Fumin Shen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 314
PAF: Perturbation-Aware Filtering for Open-Set Semi-Supervised Learning
Yinan Han ⋅ Qing-Yuan Jiang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 315
Global-Graph Guided and Local-Graph Weighted Contrastive Learning for Unified Clustering on Incomplete and Noise Multi-View Data
Hongqing He ⋅ Jie Xu ⋅ Wenyuan Yang ⋅ Yonghua Zhu ⋅ Guoqiu Wen ⋅ Xiaofeng Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 316
Enhancing Out-of-Distribution Detection with Extended Logit Normalization
Yifan Ding ⋅ Xixi Liu ⋅ Jonas Unger ⋅ Gabriel Eilertsen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 317
Unleashing VLA Potentials in Autonomous Driving via Explicit Learning from Failures
Yuechen Luo ⋅ Fang Li ⋅ Qimao Chen ⋅ Shaoqing Xu ⋅ Jiaxin Liu ⋅ Ziying Song ⋅ Zhi-xin Yang ⋅ Fuxi Wen
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 318
Unposed-to-3D: Learning Simulation-Ready Vehicles from Real-World Images
Hongyuan Liu ⋅ Bochao Zou ⋅ Qiankun Liu ⋅ Haochen Yu ⋅ Qi Mei ⋅ Jianfei Jiang ⋅ Chen Liu ⋅ Cheng Bi ⋅ Zhao Wang ⋅ Xueyang Zhang ⋅ Yifei Zhan ⋅ Jiansheng Chen ⋅ Huimin Ma
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 319
SafeDrive: Fine-Grained Safety Reasoning for End-to-End Driving in a Sparse World
Jungho Kim ⋅ Jiyong Oh ⋅ Seunghoon Yu ⋅ Hongjae Shin ⋅ Donghyuk Kwak ⋅ Jun Won Choi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 320
RAG-TP: A General Framework for Vehicle Trajectory Prediction via Retrieval-Augmented Generation
Ziyi Wang ⋅ Yang Zhang ⋅ Guijian Tang ⋅ Chao Zhang ⋅ Shibo Zhang ⋅ Xueqiong Li ⋅ Shaowu Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 321
Perceiving the Near, Reasoning the Distant: Coherent Long-Horizon Trajectory Prediction for Autonomous Driving
Hua Hu ⋅ Zikang Zhou ⋅ Qian Zhou ⋅ Zihao WEN ⋅ Junjie Hu ⋅ Xinhong Chen ⋅ Zhengmin JIANG ⋅ Yung-Hui Li ⋅ Jianping Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 322
Dual-Agent Reinforcement Learning for Adaptive and Cost-Aware Visual–Inertial Odometry
Feiyang Pan ⋅ Shenghe Zheng ⋅ Chunyan Yin ⋅ Guangbin Dou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 323
HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles
Yifan Wang ⋅ Francesco Pittaluga ⋅ Zaid Tasneem ⋅ Chenyu You ⋅ Manmohan Chandraker ⋅ Ziyu Jiang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 324
AMap: Distilling Future Priors for Ahead-Aware Online HD Map Construction
Ruikai Li ⋅ Xinrun Li ⋅ Mengwei Xie ⋅ Hao Shan ⋅ Shoumeng Qiu ⋅ Xinyuan Chang ⋅ Yizhe Fan ⋅ Feng Xiong ⋅ Han Jiang ⋅ Yilong Ren ⋅ Haiyang Yu ⋅ Mu Xu ⋅ Yang Long ⋅ Varun Ojha ⋅ Zhiyong Cui
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 325
WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving
Yifang Xu ⋅ Jiahao Cui ⋅ Zhihao Zhu ⋅ Hanlin Shang ⋅ Shan Luan ⋅ Mingwang Xu ⋅ Feipeng Cai ⋅ Neng Zhang ⋅ Yaoyi Li ⋅ Jia Cai ⋅ Siyu Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 326
PlannerRFT: Reinforcing Diffusion Planners through Closed-Loop and Sample-Efficient Fine-Tuning
Hongchen Li ⋅ Tianyu Li ⋅ Jiazhi Yang ⋅ Mingyang Shang ⋅ Gaoqiang Wu ⋅ Caojun Wang ⋅ Haochen Tian ⋅ Zengrong Lin ⋅ Zhihui Hao ⋅ XianPeng Lang ⋅ Jia Hu ⋅ Hongyang Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 327
MARIS: Marine Open-Vocabulary Instance Segmentation
Bingyu Li ⋅ Feiyu Wang ⋅ Da Zhang ⋅ Zhiyuan Zhao ⋅ Junyu Gao ⋅ Xuelong Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 328
XSeg: A Large-scale X-ray Contraband Segmentation Benchmark For Real-World Security Screening
Hongxia Gao ⋅ Yixin Chen ⋅ Jiali Wen ⋅ Litao Li ⋅ Qianyun Liu ⋅ Kaijie Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 329
Training-Free Open-Vocabulary Camouflaged Object Segmentation via Fine-Grained Object Binding and Adaptive Hybrid Prompt
Peng Ren ⋅ Cheng Jiang ⋅ Chuande Yang ⋅ Fuming Sun ⋅ Tian Bai
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 330
M⁴-SAM: Multi-Modal Mixture-of-Experts with Memory-Augmented SAM for RGB-D Video Salient Object Detection
Jiyuan Liu ⋅ jia lin ⋅ Xiaofei Zhou ⋅ Runmin Cong ⋅ Deyang Liu ⋅ Zhi Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 331
ReAttnCLIP: Training-Free Open-Vocabulary Remote Sensing Image Segmentation via Re-defined Attention in CLIP
Xin Niu ⋅ Manqi Zhao ⋅ Dongsheng Jiang ⋅ Yingying Wu ⋅ Bing Su
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 332
Mixture of Prototypes for Test-time Adaptive Segmentation
Guangrui Li ⋅ Zhengyu Zhu ⋅ Yongxin Ge
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 333
Reconstruction-Guided Slot Curriculum: Addressing Object Over-Fragmentation in Video Object-Centric Learning
WonJun Moon ⋅ Hyun Seok Seong ⋅ Jae-Pil Heo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 334
ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark
Joanne Lin ⋅ Ruirui Lin ⋅ Yini Li ⋅ David Bull ⋅ Nantheera Anantrasirichai
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 335
Decouple Your Discovery and Memory in Continual Generalized Category Discovery
Jiawei Yu ⋅ Zijian Gao ⋅ Xingxing Zhang ⋅ Xuan Liu ⋅ Huaimin Wang ⋅ Kele Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 336
Beyond the Static World: Continual Category Discovery under Visual Drift
Wei Feng ⋅ Yiwen Jiang ⋅ Sijin Zhou ⋅ Zongyuan Ge
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 337
Memory-Efficient Transfer Learning with Fading Side Networks via Masked Dual Path Distillation
Yutong Zhang ⋅ Jiaxin Chen ⋅ Honglin Chen ⋅ Kaiqi Zheng ⋅ Shengcai Liao ⋅ Hanwen Zhong ⋅ Weixin Li ⋅ Yunhong Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 338
SAME: Sparse and Anchored Model Editing for Heterogeneous Incremental Learning under Limited Data
Zixuan Duan ⋅ Zeyu Zhang ⋅ Fengyuan Lu ⋅ Shaofeng Zhang ⋅ Wenbin Li ⋅ Qi Fan ⋅ Yang Gao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 339
CHEEM: Continual Learning by Reuse, New, Adapt and Skip - A Hierarchical Exploration-Exploitation Approach
Chinmay Savadikar ⋅ Michelle Dai ⋅ Tianfu Wu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 340
Exemplar-Free Continual Learning for State Space Models
ISAAC NING LEE ⋅ Leila Mahmoodi ⋅ Trung Le ⋅ Mehrtash Harandi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 341
A Faster Path to Continual Learning
Wei Li ⋅ Hangjie Yuan ⋅ Zixiang Zhao ⋅ Borui Kang ⋅ Ziwei Liu ⋅ Tao Feng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 342
Continual Learning for fMRI-Based Brain Disorder Diagnosis via Functional Connectivity Matrices Generative Replay
qianyu Chen ⋅ Shujian Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 343
BeautyGRPO: Aesthetic Alignment for Face Retouching via Dynamic Path Guidance and Fine-Grained Preference Modeling
Jiachen Yang ⋅ Xianhui Lin ⋅ Yi Dong ⋅ Zebiao Zheng ⋅ Xing Liu ⋅ Hong Gu ⋅ Yanmei Fang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 344
SyncDreamer: Controllable and Expressive Avatar Generation Beyond the Talking Head
Fatemeh Nazarieh ⋅ Zhenhua Feng ⋅ Diptesh Kanojia ⋅ Josef Kittler ⋅ Muhammad Awais
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 345
PerformRecast: Expression and Head Pose Disentanglement for Portrait Video Editing
Jiadong Liang ⋅ Bojun Xiong ⋅ Jie Tian ⋅ Hua Li ⋅ Xiao Long ⋅ Yong Zheng ⋅ Huan Fu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 346
UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
Xuangeng Chu ⋅ Ruicong Liu ⋅ Yifei Huang ⋅ Yun Liu ⋅ YICHEN PENG ⋅ Bo Zheng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 347
PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation
baiqin wang ⋅ Xiangyu Zhu ⋅ Fan Shen ⋅ HAO XU ⋅ Zhen Lei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 348
FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction
Shuyuan Tu ⋅ Yueming Pan ⋅ Yinming Huang ⋅ Xintong Han ⋅ Zhen Xing ⋅ Qi Dai ⋅ Kai Qiu ⋅ Chong Luo ⋅ Zuxuan Wu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 349
DriveVLN: Towards Mapless Vision-and-Language Navigation in Autonomous Driving
Dongqian Guo ⋅ Haoran Wei ⋅ Wencheng Han ⋅ Runzhou Tao ⋅ Zhongying Qiu ⋅ Jianfei Yang ⋅ Jianbing Shen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 350
Towards Open Environments and Instructions: General Vision-Language Navigation via Fast-Slow Interactive Reasoning
Li Yang ⋅ Aming Wu ⋅ Zihao Zhang ⋅ Yahong Han
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 351
Unifying Language-Action Understanding and Generation for Autonomous Driving
Xinyang Wang ⋅ Qian Liu ⋅ WENJIE DING ⋅ Zhao Yang ⋅ Wei Li ⋅ Chang Liu ⋅ Bailin Li ⋅ Kun Zhan ⋅ XianPeng Lang ⋅ Wei Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 352
Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving
Zehao Wang ⋅ Huaide Jiang ⋅ Shuaiwu Dong ⋅ Yuping Wang ⋅ Hang Qiu ⋅ Jiachen Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 353
Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving
Minhao Xiong ⋅ Zichen Wen ⋅ Zhuangcheng Gu ⋅ Xuyang Liu ⋅ Rui Zhang ⋅ Hengrui Kang ⋅ Jiabing Yang ⋅ JUNYUAN ZHANG ⋅ Weijia Li ⋅ Conghui He ⋅ Linfeng Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 354
CGHair: Compact Gaussian Hair Reconstruction with Card Clustering
Haimin Luo ⋅ Srinjay Sarkar ⋅ Albert Mosella-Montoro ⋅ Francisco Vicente Carrasco ⋅ Fernando De la Torre
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 355
HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars
Gent Serifi ⋅ Marcel C.
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 356
Skullptor: High Fidelity 3D Head Reconstruction in Seconds with Multi-View Normal Prediction
Noé Artru ⋅ Rukhshanda Hussain ⋅ Emeline Got ⋅ Alexandre Messier ⋅ David B. Lindell ⋅ Abdallah Dib
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 357
RelightAnyone: A Generalized Relightable 3D Gaussian Head Model
Yingyan Xu ⋅ Pramod Rao ⋅ Sebastian Weiss ⋅ Gaspard Zoss ⋅ Markus Gross ⋅ Christian Theobalt ⋅ Marc Habermann ⋅ Derek Bradley
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 358
Feed-forward Gaussian Registration for Head Avatar Creation and Editing
Malte Prinzler ⋅ Paulo Gotardo ⋅ Siyu Tang ⋅ Timo Bolkart
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 359
Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware Residual Guidance
Xinrong Chen ⋅ Xu Chu ⋅ Yingmin Qiu ⋅ Hengyuan Zhang ⋅ Jing Xiong ⋅ Shiyu Tang ⋅ Shuai Liu ⋅ Shaokang Yang ⋅ Cheng Yang ⋅ Hayden Kwok-Hay So ⋅ Ngai Wong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 360
Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
Chengsheng Zhang ⋅ Chenghao Sun ⋅ Xinyan Jiang ⋅ Wei Li ⋅ Xinmei Tian
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 361
SVHalluc: Benchmarking Speech–Vision Hallucination in Audio-Visual Large Language Models
Chenshuang Zhang ⋅ Kyeong Seon Kim ⋅ Chengxin Liu ⋅ Tae-Hyun Oh
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 362
Same Attention, Different Truths: Put Logit-Lens over Visual Attention to Detect and Mitigate LVLM Object Hallucination
Zichuan Wang ⋅ Songlin Yang ⋅ Bo Peng ⋅ Zhenchen Tang ⋅ Yang Li ⋅ BeibeiDong BeibeiDong ⋅ Beibei Dong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 363
Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models
Gengwei Zhang ⋅ Jie Peng ⋅ Zhen Tan ⋅ Mufan Qiu ⋅ Hossein Nourkhiz Mahjoub ⋅ Vaishnav Tadiparthi ⋅ Kwonjoon Lee ⋅ Yanyong Zhang ⋅ Tianlong Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 364
Lyapunov Probes for Hallucination Detection in Large Foundation Models
Bozhi Luan ⋅ Gen Li ⋅ Yalan Qin ⋅ Jifeng Guo ⋅ Yun Zhou ⋅ Faguo Wu ⋅ Hongwei Zheng ⋅ wenjun wu ⋅ Zhaoxin Fan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 365
Captain Safari: A World Engine with Pose-Aligned 3D Memory
Yu-Cheng Chou ⋅ Xingrui Wang ⋅ Yitong Li ⋅ Jiahao Wang ⋅ Hanting Liu ⋅ Cihang Xie ⋅ Alan L. Yuille ⋅ Junfei Xiao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 366
Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction
Jiaxin Huang ⋅ Yuanbo Yang ⋅ Bangbang Yang ⋅ Lin Ma ⋅ Yuewen Ma ⋅ Yiyi Liao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 367
PerpetualWonder: Long-horizon Action-conditioned 4D Scene Generation
Jiahao Zhan ⋅ Zizhang Li ⋅ Hong-Xing Yu ⋅ Jiajun Wu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 368
CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation
Kaiyi Huang ⋅ Yukun Huang ⋅ Yu Li ⋅ Jianhong Bai ⋅ Xintao Wang ⋅ Zinan Lin ⋅ Xuefei Ning ⋅ Jiwen Yu ⋅ Yu Wang ⋅ Xihui Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 369
DreamStereo: Towards Real-Time Stereo Inpainting for HD Videos
Huang yuan ⋅ Sijie Zhao ⋅ Jing Cheng ⋅ Hao Xu ⋅ Shaohui Jiao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 370
SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation
Vaibhav Agrawal ⋅ Rishubh Parihar ⋅ Pradhaan S Bhat ⋅ Ravi Kiran Sarvadevabhatla ⋅ R. Venkatesh Babu
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 371
RecEdit-Drive: 3D Reconstruction-Guided Spatiotemporal Video Editing for Autonomous Driving Scenes
Yipeng Wu ⋅ Xin WANG ⋅ Chenghan Yang ⋅ Chong Wang ⋅ Dongdong Wu ⋅ Wanchao Su ⋅ Hengshuang Zhao ⋅ Wei Feng ⋅ Kairui Yang ⋅ Di Lin
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 372
RAYNOVA: Scale-Temporal Autoregressive World Modeling in Ray Space
Yichen Xie ⋅ Chensheng Peng ⋅ Mazen Abdelfattah ⋅ Yihan Hu ⋅ Jiezhi Yang ⋅ Eric Higgins ⋅ Ryan Brigden ⋅ Masayoshi Tomizuka ⋅ Wei Zhan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 373
RigMo: Unifying Rig and Motion Learning for Generative Animation
Hao Zhang ⋅ Jiahao Luo ⋅ Bohui Wan ⋅ Yizhou Zhao ⋅ Zongrui Li ⋅ Michael Vasilkovsky ⋅ Chaoyang Wang ⋅ Jian Wang ⋅ Narendra Ahuja ⋅ Bing Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 374
LaVR: Scene Latent Conditioned Generative Video Trajectory Re-Rendering using Large 4D Reconstruction Models
Mingyang Xie ⋅ Numair Khan ⋅ Tianfu Wang ⋅ Naina Dhingra ⋅ Seonghyeon Nam ⋅ Haitao Yang ⋅ Zhuo Hui ⋅ Christopher Metzler ⋅ Andrea Vedaldi ⋅ Hamed Pirsiavash ⋅ Lei Luo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 375
WHU-MARS: A Multispectral Aerial-Ground Benchmark Towards Any-Scenario Person Re-Identification
Yuxuan Zhao ⋅ Zhongao Zhou ⋅ Bin Yang ⋅ He Li ⋅ Jian Liang ⋅ Jun Chen ⋅ Bo Du ⋅ Mang Ye
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 376
Detect Anything via Next Point Prediction
Qing Jiang ⋅ Junan Huo ⋅ Xingyu Chen ⋅ Yuda Xiong ⋅ Zhaoyang Zeng ⋅ Yihao Chen ⋅ Tianhe Ren ⋅ Junzhi Yu ⋅ Lei Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 377
Text-guided Feature Disentanglement for Cross-modal Gait Recognition
Zhiyang Lu ⋅ Ming Cheng
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 378
Distribution-Aligned Multimodal Fusion for Robust Object Detection
XIAOHUI HAO ⋅ Yanglin Pu ⋅ Yongjun Wang ⋅ Rui She
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 379
PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection
Zhengjian Kang ⋅ Jun Zhuang ⋅ Kangtong Mo ⋅ Qi Chen ⋅ Rui Liu ⋅ Ye Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 380
Portable Active Learning for Object Detection
Rashi Sharma ⋅ Justin Timothy C. Bersamin ⋅ Karthikk Subramanian
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 381
Efficiency Follows Global-Local Decoupling
Zhenyu Yang ⋅ Gensheng Pei ⋅ Tao Chen ⋅ Yichao Zhou ⋅ Tianfei Zhou ⋅ Yazhou Yao ⋅ Fumin Shen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 382
VRCLIP: Multimodal Canonical Correlation Alignment for CLIP-Driven Vision-Radio Person Re-Identification
Rui Zhang ⋅ Yaqi Wang ⋅ Yadong Li ⋅ Ruixu Geng ⋅ Jianyang Wang ⋅ Qijun Ying ⋅ Dongheng Zhang ⋅ Yang Hu ⋅ Yan Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 383
EReCu: Pseudo-label Evolution Fusion and Refinement with Multi-Cue Learning for Unsupervised Camouflage Detection
Jiang Shuo ⋅ Gaojia Zhang ⋅ Min Tan ⋅ Yufei Yin ⋅ Gang Pan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 384
Expert-Teacher-Student Collaborative Learning for Domain Adaptive Object Detection
Yiming Cui ⋅ Liang Li ⋅ Haibing Yin ⋅ Yuhan Gao ⋅ Xichun Sheng ⋅ Chenggang Yan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 385
CI-VID: A Coherent Interleaved Text-Video Dataset
Yiming Ju ⋅ Jijin Hu ⋅ Zhengxiong Luo ⋅ Haoge Deng ⋅ Hanyu Zhao ⋅ Li Du ⋅ Wenbo Xiao ⋅ Chengwei Wu ⋅ Donglin Hao ⋅ Xinlong Wang ⋅ Tengfei Pan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 386
Generalizable Video Quality Assessment via Weak-to-Strong Learning
Linhan Cao ⋅ Wei Sun ⋅ Xiangyang Zhu ⋅ Kaiwei Zhang ⋅ Jun Jia ⋅ Yicong Peng ⋅ Dandan Zhu ⋅ Guangtao Zhai ⋅ Xiongkuo Min
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 387
EgoSound: Benchmarking Sound Understanding in Egocentric Videos
Bingwen Zhu ⋅ Yuqian Fu ⋅ Qiaole Dong ⋅ Guolei Sun ⋅ Tianwen Qian ⋅ Yuzheng Wu ⋅ Danda Paudel ⋅ Yanwei Fu ⋅ Xiangyang Xue
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 388
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
Woongyeong Yeo ⋅ Kangsan Kim ⋅ Jaehong Yoon ⋅ Sung Ju
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 389
GIFT: Global Irreplaceability Frame Targeting for Efficient Video Understanding
Ma Junpeng ⋅ Sashuai zhou ⋅ Guanghao Li ⋅ Xin Gao ⋅ Yue Cao ⋅ Hengyu Zeng ⋅ Yuxiang Yan ⋅ Zhibin Wang ⋅ Jun Song ⋅ Bo Zheng ⋅ Shanghang Zhang ⋅ Jian Pu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 390
Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning
Xuchen Li ⋅ Xuzhao Li ⋅ Shiyu Hu ⋅ Kaiqi Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 391
Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
Shoubin Yu ⋅ Lei Shu ⋅ Antoine Yang ⋅ Yao Fu ⋅ Srinivas Sunkara ⋅ Maria Wang ⋅ Jindong Chen ⋅ Mohit Bansal ⋅ Boqing Gong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 392
Compositional Transformation Reasoning for Composed Video Retrieval
Sihong Huang ⋅ Jiaxin Wu ⋅ Dongmei Jiang ⋅ Yi Cai ⋅ Yaowei Wang ⋅ Xiaoyong Wei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 393
UniVBench: Towards Unified Evaluation for Video Foundation Models
Jianhui Wei ⋅ Xiaotian Zhang ⋅ Yichen Li ⋅ Yuan Wang ⋅ Yan Zhang ⋅ Ziyi Chen ⋅ Zhihang Tang ⋅ Wei Xu ⋅ Zuozhu Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 394
NAMI: Efficient Image Generation via Bridged Progressive Rectified Flow Transformers
Yuhang Ma ⋅ Bo Cheng ⋅ Shanyuan Liu ⋅ Hongyi Zhou ⋅ Liebucha Wu ⋅ Dawei Leng ⋅ Yuhui Yin
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 395
InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting
Hong Duc Vu ⋅ Kien Nguyen ⋅ Trong-Tung Nguyen ⋅ Ngan Nguyen ⋅ Phong Nguyen ⋅ Khoi Nguyen ⋅ Cuong Pham ⋅ Anh Tran
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 396
TimeRipples: Accelerating vDiTs by Understanding the Spatio-Temporal Correlations in Latent Space
Wenxuan Miao ⋅ Yulin Sun ⋅ Aiyue Chen ⋅ Jing Lin ⋅ Yiwu Yao ⋅ Yiming Gan ⋅ Jieru Zhao ⋅ Jingwen Leng ⋅ Minyi Guo ⋅ Yu Feng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 397
ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers
Mengling Xu ⋅ Sisi You ⋅ Li Yaning ⋅ Bing-Kun Bao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 398
MeanFlow Transformers with Representation Autoencoders
Zheyuan Hu ⋅ Chieh-Hsin Lai ⋅ Ge Wu ⋅ Yuki Mitsufuji ⋅ Stefano Ermon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 399
DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression
Junqi Shi ⋅ Ming Lu ⋅ Xingchen Li ⋅ Anle Ke ⋅ Ruiqi Zhang ⋅ Zhan Ma
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 400
FARMER: Flow AutoRegressive Transformer over Pixels
GuangTing Zheng ⋅ Qinyu Zhao ⋅ Tao Yang ⋅ Fei Xiao ⋅ Zhijie Lin ⋅ Jie Wu ⋅ Jiajun Deng ⋅ Yanyong Zhang ⋅ Rui Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 401
Probabilistic Precipitation Nowcasting with Rectified Flow Transformers
Johannes Schusterbauer ⋅ Jannik Wiese ⋅ Nick Stracke ⋅ Timy Phan ⋅ Björn Ommer
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 402
FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
Yilei Jiang ⋅ Zhen Wang ⋅ Yanghao Wang ⋅ Jun Yu ⋅ Yueting Zhuang ⋅ Jun Xiao ⋅ Long Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 403
High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning
Dailan He ⋅ Xiahong Wang ⋅ Shulun Wang ⋅ Hao Shao ⋅ Bingqi Ma ⋅ Guanglu Song ⋅ Yu Liu ⋅ Hongsheng Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 404
3D-Object Perception Transformer (3PT)
Agastya Kalra ⋅ Tim Salzmann ⋅ Guy Stoppi ⋅ Dmitrii Marin ⋅ Rishav Agarwal ⋅ Vage Taamazyan ⋅ Martin Bokeloh ⋅ Stefan Hinterstoisser ⋅ Anton Boykov ⋅ Alberto Dall'Olio ⋅ Pravin Dangol ⋅ Kartik Venkataraman ⋅ Huaijin Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 405
SemLT3D: Semantic-Guided Expert Distillation for Camera-only Long-Tailed 3D Object Detection
Hao Vo ⋅ Khoa Vo ⋅ Tran Phan Phan ⋅ Ngo Xuan Cuong ⋅ Gianfranco Doretto ⋅ Hien Nguyen ⋅ Anh Nguyen ⋅ Ngan Le
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 406
Spe-BEVHead: Rethinking the Detection Head Design for Bird’s-Eye-View Object Detection
Junshu Zhang ⋅ Sicheng Zhao ⋅ Xin Zhao ⋅ Fan Yang ⋅ Ruike Chen ⋅ Jungong Han ⋅ Guiguang Ding
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 407
Unsupervised Multi-agent and Single-agent Perception from Cooperative Views
Haochen Yang ⋅ Baolu Li ⋅ Lei Li ⋅ Delin Ren ⋅ Jiacheng Guo ⋅ Minghai Qin ⋅ Tianyun Zhang ⋅ Hongkai Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 408
Zoo3D: Zero-Shot 3D Object Detection at Scene Level
Andrey Lemeshko ⋅ Bulat Gabdullin ⋅ Nikita Drozdov ⋅ Anton Konushin ⋅ Danila Rukhovich ⋅ Maksim Kolodiazhnyi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 409
Beyond Appearance: Camouflaged Object Detection via Geometric Structure
Jinyu Han ⋅ changguang wu ⋅ Fuming Sun ⋅ Jinhui Tang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 410
SABER: Spatially Consistent 3D Universal Adversarial Objects for BEV Detectors
Aixuan Li ⋅ Mochu Xiang ⋅ Bosen Hou ⋅ Zhexiong Wan ⋅ Jing Zhang ⋅ Yuchao Dai
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 411
AceTone: Bridging Words and Colors for Conditional Image Grading
Tianren Ma ⋅ Mingxiang Liao ⋅ Xijin Zhang ⋅ Qixiang Ye
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 412
Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions
Xiaoxiao Sun ⋅ Mingyang Li ⋅ Kun yuan ⋅ Min Woo ⋅ Mark Endo ⋅ Shengguang Wu ⋅ Changlin Li ⋅ Yuhui Zhang ⋅ Zeyu Wang ⋅ Serena Yeung
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 413
Pixels Don't Lie (But Your Detector Might): Bootstrapping MLLM-as-a-Judge for Trustworthy Deepfake Detection and Reasoning Supervision
Kartik Kuckreja ⋅ Parul Gupta ⋅ Muhammad Haris Khan ⋅ Abhinav Dhall
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 414
UI-Lens: Assessing General MLLMs’ Potential to Automate UI Display Quality Assurance
Wei Xiang ⋅ Yexinrui WU ⋅ Xinli Chen ⋅ Xinran Li ⋅ Shi Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 415
Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement
Junrong Guo ⋅ Shancheng Fang ⋅ Yadong Qu ⋅ Hongtao Xie
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 416
Is your VLM Sky-Ready? A Comprehensive Spatial Intelligence Benchmark for UAV Navigation
Lingfeng Zhang ⋅ Yuchen Zhang ⋅ Hongsheng Li ⋅ Haoxiang Fu ⋅ Yingbo Tang ⋅ Hangjun Ye ⋅ Long Chen ⋅ Xiaojun Liang ⋅ Xiaoshuai Hao ⋅ Wenbo Ding
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 417
Linking Perception, Confidence and Accuracy in MLLMs
Yuetian Du ⋅ Yucheng Wang ⋅ Rongyu Zhang ⋅ Zhijie Xu ⋅ BOYU YANG ⋅ Ming Kong ⋅ Jie Liu ⋅ Qiang Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 418
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
Zheda Mai ⋅ Arpita Chowdhury ⋅ Zihe Wang ⋅ Sooyoung Jeon ⋅ Lemeng Wang ⋅ Jiacheng Hou ⋅ Jihyung Kil ⋅ Wei-Lun Chao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 419
Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs
Xuanpu Zhao ⋅ Zhentao Tan ⋅ Dianmo Sheng ⋅ Tianxiang Chen ⋅ Yao Liu ⋅ Yue Wu ⋅ Tao Gong ⋅ Qi Chu ⋅ Nenghai Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 420
From Pixel to Precision: Enhancing Handwritten Mathematical Expression Recognition with Image-Level Reward
Ze Liu ⋅ Kai Zhang ⋅ Xianquan Wang ⋅ Shuochen Liu ⋅ Jiaxian Yan ⋅ Yupeng Han ⋅ Qi Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 421
Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty
ManGyu Kong ⋅ Jaewon Lee ⋅ Seongwon Lee ⋅ Euntai Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 422
Revisiting Pose Sensitivity in Splat-based Computed Tomography under Sparse-view Reconstruction
Kiseok Choi ⋅ Hyeongjun Cho ⋅ Inchul Kim ⋅ Min H. Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 423
Seele: A Unified Acceleration Framework for Real-Time Gaussian Splatting on Mobile Devices
He Zhu ⋅ Xiaotong Huang ⋅ Zihan Liu ⋅ Weikai Lin ⋅ Xiaohong Liu ⋅ Zhezhi He ⋅ Jingwen Leng ⋅ Minyi Guo ⋅ Yu Feng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 424
GHPT: Real-Time Relightable Gaussian Splatting using Hybrid Path Tracing
Jinyang Bo ⋅ Fan Dou ⋅ Wenrui Quan ⋅ Shangxun Liu ⋅ Yang Xu ⋅ Yuhe Zhang ⋅ Kang Li ⋅ GuoHua Geng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 425
PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes
Derui Shan ⋅ Qian Qiao ⋅ Hao Lu ⋅ Tao Du ⋅ Peng Lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 426
EcoSplat: Efficiency-controllable Feed-forward 3D Gaussian Splatting from Multi-view Images
Minh-Quan Viet Bui ⋅ Jongmin Park ⋅ Juan Luis Gonzalez Bello ⋅ Jaeho Moon ⋅ Jihyong Oh ⋅ Munchurl Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 427
SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering
jiahao niu ⋅ rongjia zheng ⋅ Wenju Xu ⋅ Wei-Shi Zheng ⋅ Qing Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 428
GIFSplat: Generative Prior-Guided Iterative Feed-Forward 3D Gaussian Splatting from Sparse Views
Tianyu Chen ⋅ Wei Xiang ⋅ Kang Han ⋅ Yu Lu ⋅ Di Wu ⋅ Gaowen Liu ⋅ Ramana Kompella
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 429
3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction
Takeshi Noda ⋅ Yu-Shen Liu ⋅ Zhizhong Han
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 430
FilterGS: Traversal-Free Parallel Filtering and Adaptive Shrinking for Large-Scale LoD 3D Gaussian Splatting
Yixian Wang ⋅ HaoLin Yu ⋅ Jiadong Tang ⋅ Yu Gao ⋅ Xihan Wang ⋅ Yufeng Yue ⋅ Yi Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 431
TWINGS: Thin Plate Splines Warp-aligned Initialization for Sparse-View Gaussian Splatting
Hyeseong Kim ⋅ Geonhui Son ⋅ Deukhee Lee ⋅ Dosik Hwang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 432
VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM
Anh Thuan Tran ⋅ Jana Kosecka
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 433
SpeeDe3DGS: Speedy Deformable 3D Gaussian Splatting with Temporal Pruning and Motion Grouping
Allen Tu ⋅ Haiyang Ying ⋅ Alex Hanson ⋅ Yonghan Lee ⋅ Tom Goldstein ⋅ Matthias Zwicker
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 434
FastGS: Training 3D Gaussian Splatting in 100 Seconds
Shiwei Ren ⋅ Tianci Wen ⋅ Yongchun Fang ⋅ Biao Lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 435
BrepGaussian: CAD reconstruction from Multi-View Images with Gaussian Splatting
Jiaxing Yu ⋅ Dongyang Ren ⋅ Hangyu Xu ⋅ Zhouyuxiao Yang ⋅ Yuanqi Li ⋅ Jie Guo ⋅ Zhengkang Zhou ⋅ Yanwen Guo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 436
ODGS-SLAM: Omnidirectional Gaussian Splatting SLAM
Stefan Spiss ⋅ Joey Hieronimy ⋅ Marcel Ritter ⋅ Matthias Harders
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 437
BA-GS: Bayesian Adaptive Gaussian Splatting for SFM-Free 3D Reconstruction
Zhongjie Ma ⋅ Di Lin ⋅ Xin WANG ⋅ Haotian Dong ⋅ Chong Wang ⋅ Dongdong Wu ⋅ Changqing Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 438
FSFSplatter: Geometrically Accurate Reconstruction with Free Sparse-view Images within 2 minutes
Yibin Zhao ⋅ Yihan Pan ⋅ Jun Nan ⋅ Liwei Chen ⋅ Jianjun YI
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 439
ViRC: Enhancing Visual Interleaved Mathematical CoT with Reason Chunking
Lihong Wang ⋅ Liangqi Li ⋅ Weiwei Feng ⋅ Jiamin Wu ⋅ Changtao Miao ⋅ Tieru Wu ⋅ Rui Ma ⋅ Bo Zhang ⋅ Zhe Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 440
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought
Yiyang Zhou ⋅ Haoqin Tu ⋅ Zijun Wang ⋅ Zeyu Wang ⋅ Niklas Muennighoff ⋅ Fan Nie ⋅ Chaorui Deng ⋅ Shen Yan ⋅ Haoqi Fan ⋅ Yejin Choi ⋅ James Zou ⋅ Cihang Xie ⋅ Huaxiu Yao ⋅ Qinghao Ye
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 441
PixDLM: A Dual-Path Multimodal Language Model for UAV Reasoning Segmentation
shuyan ke ⋅ Yifan Mei ⋅ Changli Wu ⋅ yonghan zheng ⋅ Jiayi Ji ⋅ Liujuan Cao ⋅ Rongrong Ji
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 442
Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection
Chuang Peng ⋅ Renshuai Tao ⋅ Zhongwei Ren ⋅ Xianglong Liu ⋅ Yunchao Wei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 443
VCU-Bridge: Hierarchical Visual Connotation Understanding via Semantic Bridging
Ming Zhong ⋅ Yuanlei Wang ⋅ Liuzhou Zhang ⋅ Ruichuan An ⋅ Ray Zhang ⋅ Hao Liang ⋅ Ming Lu ⋅ Ying Shen ⋅ Wentao Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 444
Learning to See through Illumination Extremes with Event Streaming in Multimodal Large Language Models
Baoheng Zhang ⋅ Jiahui Liu ⋅ Zhao Gui ⋅ Zhang Weizhou ⋅ YIXUAN MA ⋅ Jun Jiang ⋅ Yingxian Chen ⋅ Wilton W.T Fok ⋅ Xiaojuan Qi ⋅ Hayden Kwok-Hay So
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 445
VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation
Walid Bousselham ⋅ Hilde Kuehne ⋅ Cordelia Schmid
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 446
Cut to the Chase: Training-free Multimodal Summarization via Chain-of-Events
Xiaoxing You ⋅ Qiang Huang ⋅ Lingyu Li ⋅ Xiaojun Chang ⋅ Jun Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 447
UVU: Improving Multimodal Understanding via Vision-Language Unified Autoregressive Paradigm
Zhehan Kan ⋅ Xinghua Jiang ⋅ Yanlin Liu ⋅ Xiaochen Yang ⋅ ZHIXIANG WEI ⋅ Shifeng Liu ⋅ Yubo Zhu ⋅ Qingmin Liao ⋅ Wenming Yang ⋅ Xin Li ⋅ Yinsong Liu ⋅ Deqiang Jiang ⋅ Xing Sun
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 448
PointThinker: Point-Incentivized Parallel Thinking for Multimodal Large Language Model
Zhengdong Hu ⋅ Chao Wang ⋅ Fengyun Rao ⋅ Jing LYU ⋅ Hehe Fan ⋅ Yi Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 449
OctoMed: Data Recipes for State-of-the-Art Multimodal Medical Reasoning
Timothy Ossowski ⋅ Sheng Zhang ⋅ Qianchu Liu ⋅ Guanghui Qin ⋅ Reuben Tan ⋅ Tristan Naumann ⋅ Junjie Hu ⋅ Hoifung Poon
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 450
HoneyBee: Data Recipes for Vision-Language Reasoners
Hritik Bansal ⋅ Devendra Singh Sachan ⋅ Kai-Wei Chang ⋅ Aditya Grover ⋅ Gargi Ghosh ⋅ Wen-tau Yih ⋅ Ramakanth Pasunuru
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 451
VisPlay: Self-Evolving Vision-Language Models
Yicheng He ⋅ Chengsong Huang ⋅ Zongxia Li ⋅ Jiaxin Huang ⋅ Yonghui Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 452
Chart-FR1: Visual Focus-Driven Fine-Grained Reasoning on Dense Charts
Hongkun Pan ⋅ Yuwei Wu ⋅ Wanyi Hong ⋅ ShengHui Hu ⋅ Qitong Yan ⋅ Yi Yang ⋅ Rufei Han ⋅ Changju Zhou ⋅ Minfeng Zhu ⋅ Dongming Han ⋅ Wei Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 453
Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation
Ziyu Guo ⋅ Ray Zhang ⋅ Hongyu Li ⋅ Manyuan Zhang ⋅ Xinyan Chen ⋅ Sifan Wang ⋅ Yan Feng ⋅ Peng Pei ⋅ Pheng-Ann Heng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 454
ApET: Approximation-Error Guided Token Compression for Efficient VLMs
Qiankun Ma ⋅ Ziyao Zhang ⋅ Haofei Wang ⋅ Zhen Song ⋅ Jie Chen ⋅ Hairong Zheng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 455
Granulon: Awakening Pixel-Level Visual Encoders with Adaptive Multi-Granularity Semantics for MLLM
Junyuan Mao ⋅ Qiankun Li ⋅ Linghao Meng ⋅ Zhicheng He ⋅ Xinliang Zhou ⋅ Kun Wang ⋅ Yang Liu ⋅ Yueming Jin
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 456
Vision Transformers Need More Than Registers
Cheng Shi ⋅ Yizhou Yu ⋅ Sibei Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 457
Head-wise Adaptive Rotary Positional Encoding for Fine-Grained Image Generation
Li jiaye ⋅ Baoyou Chen ⋅ Hui Li ⋅ Zilong Dong ⋅ Jingdong Wang ⋅ Siyu Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 458
PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion
Jaehyun Choi ⋅ Jiwan Hur ⋅ Gyojin Han ⋅ Jaemyung Yu ⋅ Junmo Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 459
AdaSVD: Singular Value Decomposition with Adaptive Mechanisms for Large Multimodal Models
Zhiteng Li ⋅ Mingyuan Xia ⋅ Jingyuan Zhang ⋅ Zheng Hui ⋅ Haotong Qin ⋅ Linghe Kong ⋅ Yulun Zhang ⋅ Xiaokang Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 460
ReFTA: Breaking the Weight Reconstruction Bottleneck in Tensorized Parameter-Efficient Fine-Tuning
Jingjing Zheng ⋅ Anda Tang ⋅ Qiangqiang Mao ⋅ Zhouchen Lin ⋅ Yankai Cao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 461
HTTM: Head-wise Temporal Token Merging for Faster VGGT
Weitian Wang ⋅ Lukas Meiner ⋅ Rai Shubham ⋅ Cecilia De La Parra ⋅ Akash Kumar
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 462
Reparameterized Tensor Ring Functional Decomposition for Multi-Dimensional Data Recovery
Yangyang Xu ⋅ Junbo Ke ⋅ You-Wei Wen ⋅ Chao Wang
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 463
Self-Attention Driven Tensor Representation for High-Order Data Recovery
Zhi-Wei SHI ⋅ Yu-Bang Zheng ⋅ Heng-Chao Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 464
PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching
Hanqiao Ye ⋅ Yuzhou Liu ⋅ Yangdong Liu ⋅ Shuhan Shen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 465
MOGeo: Beyond One-to-One Cross-View Object Geo-localization
Bo Lv ⋅ Qingwang Zhang ⋅ Le Wu ⋅ Yuanyuan Li ⋅ YINGYING ZHU
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 466
Homaloidal parametrization for detecting critical two-view configurations
Rakshith Madhavan ⋅ Matteo Forlivesi ⋅ Marina Bertolini ⋅ Cristina Turrini ⋅ Federica Arrigoni ⋅ Luca Magri
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 467
AsymLoc: Towards Asymmetric Feature Matching for Efficient Visual Localization
Mohammad Omama ⋅ Gabriele Berton ⋅ Eric Foxlin ⋅ Yelin Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 468
MMLandmarks: a Cross-View Instance-Level Benchmark for Geo-Spatial Understanding
Oskar Kristoffersen ⋅ Alba Reinders Sánchez ⋅ Morten Hannemose ⋅ Anders Bjorholm Dahl ⋅ Dim P. Papadopoulos
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 469
Asking like Socrates: Socrates helps VLMs understand remote sensing images
Run Shao ⋅ Ziyu Li ⋅ Zhaoyang Zhang ⋅ Linrui Xu ⋅ Xinran He ⋅ Hongyuan Yuan ⋅ Bolei He ⋅ Yongxing Dai ⋅ Yiming Yan ⋅ Yijun Chen ⋅ Wang Guo ⋅ Haifeng Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 470
GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training
Tong Wei ⋅ Yijun Yang ⋅ Changhao Zhang ⋅ Junliang Xing ⋅ Yuanchun Shi ⋅ Zongqing Lu ⋅ Deheng Ye
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 471
Let VLMs Grade Their Own Thoughts: A Self-Quantification Approach to Reasoning-Aware Reward Modeling
Xing Xi ⋅ Yu Qiu ⋅ Ronghua Luo ⋅ Peixian Chen ⋅ peilin tong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 472
SciEducator: Scientific Video Understanding and Educating via Deming-Cycle Multi-Agent System
Zhiyu Xu ⋅ Weilong Yan ⋅ YUFEI SHI ⋅ Xin Meng ⋅ Tao He ⋅ Huiping Zhuang ⋅ Ming Li ⋅ Hehe Fan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 473
SenseSearch: Empowering Vision-Language Models with High-Resolution Agentic Search-Reasoning via Reinforcement Learning
Yong Xien Chng ⋅ Tao Hu ⋅ Wenwen Tong ⋅ Xueheng Li ⋅ Jiandong Chen ⋅ Haojia Yu ⋅ Jiefan Lu ⋅ Hewei Guo ⋅ Hanming Deng ⋅ Chengjun Xie ⋅ Gao Huang ⋅ Lewei Lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 474
Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs
Meng Lu ⋅ Ran Xu ⋅ Yi Fang ⋅ Wenxuan Zhang ⋅ Yue Yu ⋅ Gaurav Srivastava ⋅ Yuchen Zhuang ⋅ Mohamed Elhoseiny ⋅ Charles Fleming ⋅ Carl Yang ⋅ Zhengzhong Tu ⋅ Yang Xie ⋅ Guanghua Xiao ⋅ Di Jin ⋅ Wenqi Shi ⋅ Xuan Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 475
VideoSSR: Video Self-Supervised Reinforcement Learning
Zefeng He ⋅ Xiaoye Qu ⋅ Yafu Li ⋅ Siyuan Huang ⋅ Daizong Liu ⋅ Yu Cheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 476
Neurodynamics-Driven Coupled Neural P Systems for Multi-Focus Image Fusion
Bo Li ⋅ Yunkuo Lei ⋅ Tingting Bao ⋅ Hang Yan ⋅ Yaxian Wang ⋅ Weiping Fu ⋅ Lingling Zhang ⋅ Jun Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 477
MagicFuse: Single Image Fusion for Visual and Semantic Reinforcement
HAO ZHANG ⋅ Yanping Zha ⋅ Zizhuo Li ⋅ Meiqi Gong ⋅ Jiayi Ma
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 478
Bridging Pixels and Words: Mask-Aware Local Semantic Fusion for Multimodal Media Verification
Zizhao Chen ⋅ Ping Wei ⋅ Ziyang Ren ⋅ Huan Li ⋅ Xiangru Yin
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 479
Human-Centric Multi-Exposure Fusion: Benchmark and Bi-level Cognition Distillation Framework
Jingjie Shang ⋅ Tengyu Ma ⋅ Heng Zhang ⋅ Jinyuan Liu ⋅ Risheng Liu ⋅ Yuan Wang ⋅ Xiaochen Bo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 480
ConceptPose: Training-Free Zero-Shot Object Pose Estimation using Concept Vectors
Liming Kuang ⋅ Yordanka Velikova ⋅ Mahdi Saleh ⋅ Jan-Nico Zaech ⋅ Danda Paudel ⋅ Benjamin Busam
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 481
A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps
Xuanlong Yu ⋅ Youyang Sha ⋅ Longfei Liu ⋅ Xi Shen ⋅ Di Yang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 482
NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering
Loick Chambon ⋅ Paul Couairon ⋅ Éloi Zablocki ⋅ Alexandre Boulch ⋅ Nicolas THOME ⋅ Matthieu Cord
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 483
Universal-to-Specific: Dynamic Knowledge-Guided Multiple Instance Learning for Few-Shot Whole Slide Image Classification
Junjian Li ⋅ Hulin Kuang ⋅ Jin Liu ⋅ Hailin Yue ⋅ Mengshen He ⋅ Jianxin Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 484
SOTA: Self-adaptive Optimal Transport for Zero-Shot Classification with Multiple Foundation Models
Zhanxuan Hu ⋅ Qiyu Xu ⋅ Yu Duan ⋅ Yonghang Tai ⋅ Huafeng Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 485
Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation
Yara Bahram ⋅ Mélodie Desbos ⋅ Mohammadhadi Shateri ⋅ Eric Granger
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 486
Streamlined Knowledge Distillation
Hyeon-Jin Jung ⋅ Han-Jin Lee ⋅ Seok-Hwan Choi
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 487
Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation
Chonghua Lv ⋅ Dong Zhao ⋅ Shuang Wang ⋅ Dou Quan ⋅ Ning Huyan ⋅ Nicu Sebe ⋅ Zhun Zhong
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 488
IMS3: Breaking Distributional Aggregation in Diffusion-Based Dataset Distillation
Chenru Wang ⋅ Yunyi Chen ⋅ Zijun Yang ⋅ Joey Tianyi Zhou ⋅ Chi Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 489
Continuous Exposure-Time Modeling for Realistic Atmospheric Turbulence Synthesis
junwei zeng ⋅ Dong Liang ⋅ Shengjun Huang ⋅ Kun Zhan ⋅ Songcan Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 490
240FPS Stereo Vision from Monocular Mixed Spikes
Yeliduosi Xiaokaiti ⋅ Yakun Chang ⋅ Yang Bai ⋅ Zhaojun Huang ⋅ Peiqi Duan ⋅ Boxin Shi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 491
D^2-FOSA: Dual-Diffusion Guided EEG-to-Image Reconstruction with Frequency-Oriented Semantic Alignment
Yu Chenglong ⋅ Shuai Shen ⋅ Xiangsheng Li ⋅ Yang Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 492
Self-Diffusion Driven Blind Imaging
Yanlong Yang ⋅ Guanxiong Luo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 493
Differentiable Stroke Planning with Dual Parameterization for Efficient and High-Fidelity Painting Creation
Jinfan Liu ⋅ Wuze Zhang ⋅ Zhangli Hu ⋅ Zhehan Zhao ⋅ Ye Chen ⋅ Bingbing Ni
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 494
Solvability of the Viewing Graph Under the Affine Camera Model
Gabriele Pedroni ⋅ Rakshith Madhavan ⋅ Federica Arrigoni
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 495
DiffBMP: Differentiable Rendering with Bitmap Primitives
Seongmin Hong ⋅ Junghun James Kim ⋅ Daehyeop Kim ⋅ Insoo Chung ⋅ Se Young Chun
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 496
Splat-Based Metal Artifact Reduction in Cone-Beam CT via Compact Attenuation Modeling
Kiseok Choi ⋅ Jaemin Cho ⋅ Inchul Kim ⋅ Min H. Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 497
Lumosaic: Hyperspectral Video via Active Illumination and Coded-Exposure Pixels
Dhruv Verma ⋅ Andrew Qiu ⋅ Roberto Rangel ⋅ Ayandev Barman ⋅ Hao Yang ⋅ Chenjia Hu ⋅ Fengqi Zhang ⋅ Roman Genov ⋅ David B. Lindell ⋅ Kiriakos N. Kutulakos ⋅ Alex Mariakakis
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 498
Towards Universal Computational Aberration Correction in Photographic Cameras: A Comprehensive Benchmark Analysis
Xiaolong Qian ⋅ Qi Jiang ⋅ Yao Gao ⋅ Lei Sun ⋅ Zhonghua Yi ⋅ Kailun Yang ⋅ Luc Van Gool ⋅ Kaiwei Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 499
Multi-View Hierarchical Alignment Learning for Spatial Transcriptomics
Zhengzhong Zhu ⋅ Liangjin Liu ⋅ Pei Zhou ⋅ Shiquan min ⋅ Jiangping Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 500
FEAST: Fully Connected Expressive Attention for Spatial Transcriptomics
Taejin Jeong ⋅ Joohyeok Kim ⋅ Jinyeong Kim ⋅ Chanyoung Kim ⋅ Seong Jae Hwang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 501
TRIDENT: A Trimodal Cascade Generative Framework for Drug and RNA-Conditioned Cellular Morphology Synthesis
Rui Peng ⋅ Ziru Liu ⋅ Lingyuan Ye ⋅ Yuxing Lu ⋅ Boxin Shi ⋅ Jinzhuo Wang
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 502
OrienPose: Orientation-Guided Novel View Synthesis for Single-Image Unseen Object Pose Estimation
Yating Liu ⋅ Zhaoshuai Qi ⋅ Yang Zou ⋅ Yongnan Yang ⋅ Shizhou Zhang ⋅ Yanning Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 503
Illustrator’s Depth: Monocular Layer Index Prediction for Image Decomposition
Nissim Maruani ⋅ Peiying Zhang ⋅ Siddhartha Chaudhuri ⋅ Matthew Fisher ⋅ Nanxuan Zhao ⋅ Vladimir G. Kim ⋅ Pierre Alliez ⋅ Mathieu Desbrun ⋅ Wang Yifan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 504
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Xin Lin ⋅ Meixi Song ⋅ Dizhe Zhang ⋅ Wenxuan Lu ⋅ Haodong Li ⋅ Bo Du ⋅ Ming-Hsuan Yang ⋅ Truong Nguyen ⋅ Lu Qi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 505
Seeing Depth Through Frequency and Motion: A Progressive Training Paradigm for Monocular Depth Estimation
Ke Li ⋅ Bolin Song ⋅ Hongbo Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 506
GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation
Xujing Tao ⋅ Chuxin Wang ⋅ Yubo Ai ⋅ Zhixin Cheng ⋅ Zhuoyuan Li ⋅ Liangsheng Liu ⋅ Yujia Chen ⋅ Xinjun Li ⋅ Qiao Li ⋅ Wenfei Yang ⋅ Tianzhu Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 507
B^3-Seg: Camera-Free, Training-Free 3DGS Segmentation via Analytic EIG and Beta-Bernoulli Bayesian Updates
Hiromichi Kamata ⋅ Samuel Arthur Munro ⋅ Fuminori Homma
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 508
PE3R: Perception-Efficient 3D Reconstruction
Jie Hu ⋅ Shizun Wang ⋅ Xinchao Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 509
GS-ASM: 2DGS-Supervised Active Stereo Matching
Zhengling Wu ⋅ Rongfeng Lu ⋅ Quan Chen ⋅ Longjian Zeng ⋅ Ming Lu ⋅ Yaoqi Sun ⋅ Yahong Chen ⋅ Baofeng Ji ⋅ Chenggang Yan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 510
Real2Sim2Real: RetinalDepth-64K for Depth Estimation in Posterior Segment Ophthalmic Surgery
Bingwen Dong ⋅ Gan Liu ⋅ Xiaoxi Lu ⋅ Guangcheng Chen ⋅ Jialu ZHANG ⋅ Yan Hu ⋅ Xiaoqing Zhang ⋅ Jiang Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 511
Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation
Xinhao Cai ⋅ Gensheng Pei ⋅ Zeren Sun ⋅ Yazhou Yao ⋅ Fumin Shen ⋅ Wenguan Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 512
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields
Hao Yu ⋅ Haotong Lin ⋅ Jiawei Wang ⋅ Jiaxin Li ⋅ Yida Wang ⋅ Xueyang Zhang ⋅ Yue Wang ⋅ Xiaowei Zhou ⋅ Ruizhen Hu ⋅ Sida Peng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 513
AirSim360: A Panoramic Simulation Platform within Drone View
Xian Ge ⋅ Yuling Pan ⋅ Yuhang Zhang ⋅ Xiang Li ⋅ Weijun Zhang ⋅ Dizhe Zhang ⋅ Zhaoliang Wan ⋅ Xin Lin ⋅ Xiangkai Zhang ⋅ Juntao Liang ⋅ Xiangtai Li ⋅ jerett Jiang ⋅ Bo Du ⋅ Ming-Hsuan Yang ⋅ Lu Qi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 514
Radar-Guided Polynomial Fitting for Metric Depth Estimation
Patrick Rim ⋅ Hyoungseob Park ⋅ Vadim Ezhov ⋅ Jeffrey Moon ⋅ Alex Wong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 515
UniDAC: Universal Metric Depth Estimation for Any Camera
Girish Chandar ⋅ Yuliang Guo ⋅ Liu Ren ⋅ Xiaoming Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 516
SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation
Yi Zhu ⋅ Hao Xiong ⋅ Lin Xiao ⋅ Ranfeng Shi ⋅ Qinying Gu ⋅ Leilei Gu
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 517
I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners
Lu Ling ⋅ Yunhao Ge ⋅ Yichen Sheng ⋅ Aniket Bera
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 518
REVIVE 3D: Refinement via Encoded Voluminous Inflated prior for Volume Enhancement
Hankyeol Lee ⋅ WOOYEOL BAEK ⋅ Seongdo Kim ⋅ Jongyoo Kim
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 519
Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training
Hexiao Lu ⋅ Xiaokun Sun ⋅ Zeyu Cai ⋅ Hao Guo ⋅ Ying Tai ⋅ Jian Yang ⋅ Zhenyu Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 520
EI-Part: Explode for Completion and Implode for Refinement
wanhu sun ⋅ Zhongjin Luo ⋅ Heliang Zheng ⋅ Jiahao Chang ⋅ Chongjie Ye ⋅ Huiang He ⋅ Shengchu Zhao ⋅ Rongfei Jia ⋅ Xiaoguang Han
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 521
MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing
Xiaokun Sun ⋅ Zeyu Cai ⋅ Hao Tang ⋅ Ying Tai ⋅ Jian Yang ⋅ Zhenyu Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 522
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration
Mengyu Yang ⋅ Yanming Yang ⋅ Chenyi Xu ⋅ Chenxi Song ⋅ Yufan Zuo ⋅ Tong Zhao ⋅ Ruibo Li ⋅ Chi Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 523
ViLearn: Accelerating Training Convergence of Image-to-3D Generation via Visibility Learning
Rui Chen ⋅ Jianfeng Zhang ⋅ Jing Lin ⋅ Xuanyu Yi ⋅ Yixun Liang ⋅ Guan Luo ⋅ Xiu Li ⋅ Zeming Li ⋅ Ping Tan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 524
FlashMesh: Faster and Better Autoregressive Mesh Synthesis via Structured Speculation
Tingrui Shen ⋅ Yiheng Zhang ⋅ Chen Tang ⋅ Chuan Ping ⋅ Zixing Zhao ⋅ Le Wan ⋅ Yuwang Wang ⋅ Ronggang Wang ⋅ Shengfeng He
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 525
X-Part: High Fidelity And Structure Coherent Shape Decomposition And Completion
XINHAO YAN ⋅ Jiachen Xu ⋅ Yang Li ⋅ Changfeng Ma ⋅ Yunhan Yang ⋅ Chunshi Wang ⋅ Zibo Zhao ⋅ Zeqiang Lai ⋅ Yunfei Zhao ⋅ Zhuo Chen ⋅ Chunchao Guo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 526
Realiz3D: 3D Generation Made Photorealistic via Domain-Aware Learning
Ido Sobol ⋅ Kihyuk Sohn ⋅ Yoav Blum ⋅ Egor Zakharov ⋅ Max Bluvstein ⋅ Andrea Vedaldi ⋅ Or Litany
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 527
TopoMesh: High-Fidelity Mesh Autoencoding via Topological Unification
Guan Luo ⋅ Xiu Li ⋅ Rui Chen ⋅ Xuanyu Yi ⋅ Jing Lin ⋅ Chia-Hao Chen ⋅ Jiahang Liu ⋅ Song-Hai Zhang ⋅ Jianfeng Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 528
Nestwork: Conditional 3D Furnished House Layout Generation through Latent Heterogeneous Graph Diffusion
Shuhan Miao ⋅ Biru Cao ⋅ Junling Zhuang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 529
TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond
Yifei Zeng ⋅ Yajie Bao ⋅ Jiachen Qian ⋅ Shuang Wu ⋅ Youtian Lin ⋅ Hao Zhu ⋅ Buyu Li ⋅ Feihu Zhang ⋅ Xun Cao ⋅ Yao Yao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 530
Beyond Geometry: Artistic Disparity Synthesis for Immersive 2D-to-3D
Ping Chen ⋅ Zezhou Chen ⋅ Xingpeng Zhang ⋅ Yanlin Qian ⋅ Huan Hu ⋅ Xiang Liu ⋅ Zipeng Wang ⋅ Xin Wang ⋅ Zhaoxiang Liu ⋅ Kai Wang ⋅ Shiguo Lian
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 531
WorldGen: From Text to Traversable and Interactive 3D Worlds
Dilin Wang ⋅ Hyunyoung Jung ⋅ Tom Monnier ⋅ Kihyuk Sohn ⋅ Chuhang Zou ⋅ Xiaoyu Xiang ⋅ Yu-Ying Yeh ⋅ Di Liu ⋅ Zixuan Huang ⋅ Thu Nguyen-Phuoc ⋅ Yuchen Fan ⋅ Sergiu Oprea ⋅ Ziyan Wang ⋅ Roman Shapovalov ⋅ Nikolaos Sarafianos ⋅ Thibault Groueix ⋅ Antoine Toisoul ⋅ Prithviraj Dhar ⋅ Xiao Chu ⋅ Minghao Chen ⋅ Geon Yeong Park ⋅ Rakesh Ranjan ⋅ Andrea Vedaldi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 532
ExMesh: EXplicit Mesh Reconstruction with Topology Adaptation
Chuanjin Fan ⋅ Lifan Wu ⋅ Wenjie Chang ⋅ Hanzhi Chang ⋅ Wenfei Yang ⋅ Tianzhu Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 533
SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model
Yukai Shi ⋅ Weiyu Li ⋅ Zihao Wang ⋅ Hongyang Li ⋅ Xingyu Chen ⋅ Ping Tan ⋅ Lei Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 534
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
Mohd Yawar Nihal Siddiqui ⋅ Duncan Frost ⋅ Samir Aroudj ⋅ Armen Avetisyan ⋅ Henry Howard-Jenkins ⋅ Daniel DeTone ⋅ Pierre Moulon ⋅ Qirui Wu ⋅ Zhengqin Li ⋅ Julian Straub ⋅ Richard Newcombe ⋅ Jakob Engel
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 535
SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation
Phuc Pham ⋅ Uy Dieu Tran ⋅ Binh-Son Hua ⋅ Phong Nguyen
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 536
3DrawAgent: Teaching LLM to Draw in 3D with Early Contrastive Experience
Hongcan Xiao ⋅ Xinyue Xiao ⋅ Yilin Wang ⋅ Yue Zhang ⋅ Yonggang Qi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 537
Sculpt4D: Generating 4D Shapes via Sparse-Attention Diffusion Transformers
Minghao Yin ⋅ Wenbo Hu ⋅ Jiale Xu ⋅ Ying Shan ⋅ Kai Han
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 538
HiFi-BRep: High-Fidelity Latent Representation for Robust B-Rep Generation
Junhao Hou ⋅ Chenqi Luo ⋅ PuFan Wang ⋅ Jiaying Lu ⋅ Yusheng Liu ⋅ Feiwei Qin ⋅ Meie Fang ⋅ Kun Zhou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 539
PhysGen: Physically Grounded 3D Shape Generation for Industrial Design
Yingxuan You ⋅ Chen Zhao ⋅ Hantao Zhang ⋅ Ming Xu ⋅ Pascal Fua
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 540
Perceptual 3D Simulation With Physical World Modeling
Wanhee Lee ⋅ Klemen Kotar ⋅ Rahul Venkatesh ⋅ Jared Watrous ⋅ Daniel L.K. Yamins
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 541
EchoFoley: Event-Centric Hierarchical Control for Video Grounded Creative Sound Generation
Bingxuan Li ⋅ Yiming Cui ⋅ Yicheng He ⋅ Yiwei Wang ⋅ Shu Zhang ⋅ Longyin Wen ⋅ Yulei Niu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 542
Active Intelligence in Video Avatars via Closed-loop World Modeling
Xuanhua He ⋅ Tianyu Yang ⋅ Ke Cao ⋅ Rui-Qi Wu ⋅ Cheng Meng ⋅ Yong Zhang ⋅ Zhuoliang Kang ⋅ Xiaoming Wei ⋅ Qifeng Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 543
Enhancing Spatial Understanding in Image Generation via Reward Modeling
Zhenyu Tang ⋅ Chaoran Feng ⋅ Yufan Deng ⋅ Jie Wu ⋅ Xiaojie Li ⋅ Rui Wang ⋅ Yunpeng Chen ⋅ Daquan Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 544
Seeing What Matters: Visual Preference Policy Optimization for Visual Generation
Ziqi Ni ⋅ Yuanzhi Liang ⋅ Rui Li ⋅ Yi Zhou ⋅ Haibin Huang ⋅ Chi Zhang ⋅ Xuelong Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 545
TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts
Yu Xu ⋅ Hongbin Yan ⋅ Juan Cao ⋅ YIJI CHENG ⋅ Tiankai Hang ⋅ Runze He ⋅ Zijin Yin ⋅ Shiyi Zhang ⋅ Yuxin Zhang ⋅ Jintao Li ⋅ Chunyu Wang ⋅ qinglin lu ⋅ Tong-yee Lee ⋅ Fan Tang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 546
Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization
Liao Shen ⋅ Wentao Jiang ⋅ Yiran Zhu ⋅ Jiahe Li ⋅ Tiezheng Ge ⋅ Zhiguo Cao ⋅ Bo Zheng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 547
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization
yunlong lin ⋅ Linqing Wang ⋅ Kunjie Lin ⋅ Zixu Lin ⋅ Kaixiong Gong ⋅ Wenbo Li ⋅ Bin Lin ⋅ Zhenxi Li ⋅ Shiyi Zhang ⋅ Yuyang Peng ⋅ Wenxun Dai ⋅ Xinghao Ding ⋅ Chunyu Wang ⋅ qinglin lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 548
Learning Latent Proxies for Controllable Single-Image Relighting
Haoze Zheng ⋅ Zihao Wang ⋅ Xianfeng Wu ⋅ Yajing Bai ⋅ Yexin Liu ⋅ Yun LI ⋅ Xiaogang Xu ⋅ Harry Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 549
MoVie: Broaden Your Views with Human Motion for Action Detection
Di Yang ⋅ Mahmoud Ali ⋅ Xuanlong Yu ⋅ Xi Shen ⋅ Quan Kong ⋅ Gianpiero Francesca ⋅ Francois Bremond
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 550
MooCap: A Multi-View Benchmark for Cow-Object-Human Interaction and Behavior Dynamics
Ian Noronha ⋅ Heather Neave ⋅ Upinder Kaur
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 551
LAOF: Robust Latent Action Learning with Optical Flow Constraints
Xizhou Bu ⋅ Jiexi Lyu ⋅ Fulei Sun ⋅ Ruichen Yang ⋅ Zhiqiang Ma ⋅ Wei Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 552
DarkAct: A RGB-Thermal Dataset and Fusion Framework for Multimodal Low-Light Action Recognition
Yuanjun Tan ⋅ Aoran Xiao ⋅ Liqian Deng ⋅ Zhigang Tu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 553
Random Wins All: Rethinking Grouping Strategies for Vision Tokens
Qihang Fan ⋅ Yuang Ai ⋅ Huaibo Huang ⋅ Ran He
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 554
Steering Where to Diffuse: Generative Modeling of Phenotypic Response Simulation with Steered Diffusion Bridge
Rongchao Zhang ⋅ Chengxin Li ⋅ Yiwei Lou ⋅ Yuling Shi ⋅ Hanpin Wang ⋅ Yu Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 555
Deep Feature Deformation Weights
Richard Liu ⋅ Itai Lang ⋅ Rana Hanocka
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 556
Resolving Endpoint Underfitting in Diffusion Bridges via Noise Alignment
Yurong Gao ⋅ Zicheng Zhang ⋅ Congying Han ⋅ Tiande Guo ⋅ Xinmin QIu
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 557
RNN as Linear Transformer: A Closer Investigation into Representational Potentials of Visual Mamba Models
Timing Yang ⋅ Feng Wang ⋅ Guoyizhe Wei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 558
Coupling Liquid Time‑Constant Encoders with Modern Hopfield Memory
Bishal Ranjan Swain ⋅ Kyung Joo Cheoi ⋅ Jaepil Ko
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 559
Stronger Normalization-Free Transformers
Mingzhi Chen ⋅ Taiming Lu ⋅ Jiachen Zhu ⋅ Mingjie Sun ⋅ Zhuang Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 560
HCL-FF: Hierarchical and Contrastive Learning for Forward-Forward Algorithm
Jie-En Yao ⋅ Hong-En Chen ⋅ C.-C. Jay Kuo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 561
Can You Learn to See Without Images? Procedural Warm-Up for Vision Transformers
Zachary Shinnick ⋅ Liangze Jiang ⋅ Hemanth Saratchandran ⋅ Damien Teney ⋅ Anton van den Hengel
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 562
Convolutional Neural Networks Driven by Content Similarity
Ligeng Zou ⋅ Guihu Zhao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 563
MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration
Runxun Zhang ⋅ Yizhou Liu ⋅ Dongrui Li ⋅ Bo XU ⋅ Jingwei Wei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 564
HATS: Hardness-Aware Trajectory Synthesis for GUI Agents
Rui Shao ⋅ RUIZE GAO ⋅ Bin Xie ⋅ Yixing Li ⋅ Kaiwen Zhou ⋅ Shuai Wang ⋅ Weili Guan ⋅ Gongwei Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 565
MVP: Multiple View Prediction Improves GUI Grounding
Yunzhu Zhang ⋅ Zeyu Pan ⋅ Zhengwen Zeng ⋅ Shuheng Shen ⋅ Changhua Meng ⋅ Linchao Zhu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 566
Towards GUI Agents: Vision-Language Diffusion Models for GUI Grounding
Shrinidhi Kumbhar ⋅ Haofu Liao ⋅ srikar appalaraju ⋅ Kunwar Yashraj Singh
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 567
ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence On Mobile Devices
Dezhi Kong ⋅ Zhengzhao Feng ⋅ Qiliang Liang ⋅ Hao Wang ⋅ haofei Sun ⋅ Changpeng Yang ⋅ Yang Li ⋅ Peng Zhou ⋅ Shuai Nie ⋅ Hongzhen Wang ⋅ Linfeng Zhou ⋅ Hao Jia ⋅ Jiaming Xu ⋅ Runyu Shi ⋅ Ying Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 568
OS-Oracle: A Comprehensive Framework for Cross-Platform GUI Critic Models
Zhenyu Wu ⋅ JingJing Xie ⋅ Zehao Li ⋅ Bowen Yang ⋅ Qiushi Sun ⋅ Zhaoyang Liu ⋅ Zhoumianze Liu ⋅ Yu Qiao ⋅ Xiangyu Yue ⋅ Zun Wang ⋅ Zichen Ding
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 569
Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation
Zehao Deng ⋅ Tianjie Ju ⋅ Zheng Wu ⋅ Zhuosheng Zhang ⋅ Gongshen Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 570
See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles
Zongru Wu ⋅ Rui Mao ⋅ Zhiyuan Tian ⋅ Pengzhou Cheng ⋅ Tianjie Ju ⋅ Zheng Wu ⋅ Lingzhong Dong ⋅ Haiyue Sheng ⋅ Zhuosheng Zhang ⋅ Gongshen Liu
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 571
Beyond Weak Supervision: MLLMs-Guided Graded Knowledge Distillation for Unsupervised Camouflaged Object Detection
Huafeng Chen ⋅ Chenguang Zhu ⋅ Yueming Lyu ⋅ Caifeng Shan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 572
Detecting Unknown Objects via Energy-based Separation for Open World Object Detection
JunWoo Heo ⋅ Keonhee Park ⋅ Gyeong-Moon Park
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 573
Beyond Prompt Degradation: Prototype-guided Dual-pool Prompting for Incremental Object Detection
Yaoteng Zhang ⋅ Qing Zhou ⋅ Junyu Gao ⋅ Qi Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 574
SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation
Naomi Kombol ⋅ Ivan Martinović ⋅ Siniša Šegvić ⋅ Giorgos Tolias
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 575
TTL: Test-time Textual Learning for OOD Detection with Pretrained Vision-Language Models
Jinlun Ye ⋅ Jiang Liao ⋅ Runhe Lai ⋅ Xinhua Lu ⋅ Jiaxin Zhuang ⋅ Zhiyong Gan ⋅ Ruixuan Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 576
Parameterized Prompt for Incremental Object Detection
Zijia An ⋅ boyu diao ⋅ RuiQi Liu ⋅ Libo Huang ⋅ Chuanguang Yang ⋅ Fei Wang ⋅ Zhulin An ⋅ Yongjun Xu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 577
SRA-Det: Learning Omni-Grained Open-Vocabulary Detection Beyond Category Names
Li Yang ⋅ Boyu Cai ⋅ Wei Liu ⋅ Yan Wang ⋅ Chunfeng Yuan ⋅ Bing Li ⋅ Weiming Hu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 578
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?
Tilemachos Aravanis ⋅ Vladan Stojnić ⋅ Vasileios Psomas ⋅ Nikos Komodakis ⋅ Giorgos Tolias
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 579
PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation
Jianjian Yin ⋅ Tao Chen ⋅ Yi Chen ⋅ Gensheng Pei ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Fumin Shen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 580
Partial Weakly-Supervised Oriented Object Detection
Mingxin Liu ⋅ Peiyuan Zhang ⋅ Yuan Liu ⋅ Wei Zhang ⋅ Yue Zhou ⋅ Ning Liao ⋅ Ziyang Gong ⋅ Junwei Luo ⋅ Zhirui Wang ⋅ Yi Yu ⋅ Xue Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 581
Seeing Both Sides: Towards Bidirectional Semantic Alignment for Open-Vocabulary Camouflaged Object Segmentation
Guohui Zhang ⋅ Fuming Sun ⋅ Yu Zhao ⋅ Yuqiu Kong ⋅ Jing Sun ⋅ Ganggang Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 582
Towards Robust Multi-Modal Semantic Segmentation with Teacher-Student Framework and Hybrid Prototype Distillation
jiaqi tan ⋅ Xu Zheng ⋅ Yang Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 583
REL-SF4PASS: Panoramic Semantic Segmentation with REL Depth Representation and Spherical Fusion
Xuewei Li ⋅ Xinghan Bao ⋅ Zhimin Chen ⋅ Xi Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 584
Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation
ByeongCheol Lee ⋅ Hyun Seok Seong ⋅ Sangeek Hyun ⋅ Gilhan Park ⋅ WonJun Moon ⋅ Jae-Pil Heo
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 585
From Softmax to Dirichlet: Evidential Learning for Semi-supervised Semantic Segmentation
Huayu Mai ⋅ Rui Sun ⋅ Yujia Chen ⋅ Wangkai Li ⋅ Bingzhou Wang ⋅ Aibing Li ⋅ Zhangyu He ⋅ Yuan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 586
Particulate: Feed-Forward 3D Object Articulation
Ruining Li ⋅ YUXIN YAO ⋅ Chuanxia Zheng ⋅ Christian Rupprecht ⋅ Joan Lasenby ⋅ Shangzhe Wu ⋅ Andrea Vedaldi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 587
HOPS: Hierarchical Open-vocabulary Part Segmentation with Attention-Aware Filtering and Affinity-Guided Enhancement
Xinlong Li ⋅ Di Lin ⋅ Shaoyiyi Gao ⋅ Yaxuan Liu ⋅ Jixian He ⋅ Jiaxin Li ⋅ Ruonan Liu ⋅ Qing Guo ⋅ Kairui Yang ⋅ Wei Feng
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 588
Shape-of-You: Fused Gromov-Wasserstein Optimal Transport for Semantic Correspondence in-the-Wild
Jiin Im ⋅ Sisung Liu ⋅ Je Hyeong Hong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 589
MEMO: Human-like Crisp Edge Detection Using Masked Edge Prediction
Jiaxin Cheng ⋅ Yue Wu ⋅ Yicong Zhou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 590
MUFASA: A Multi-Layer Framework for Slot Attention
Sebastian Bock ⋅ Leonie Schüßler ⋅ Krishnakant Singh ⋅ Simone Schaub-Meyer ⋅ Stefan Roth
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 591
ChangeBridge: Spatiotemporal Image Generation with Multimodal Controls for Remote Senisng
Zhenghui Zhao ⋅ Chen Wu ⋅ Xiangyong Cao ⋅ Di Wang ⋅ Hongruixuan Chen ⋅ Datao Tang ⋅ Liangpei Zhang ⋅ Zhuo Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 592
MOMO: Mars Orbital MOdel Foundation Model for Mars Orbital Applications
Mirali Purohit ⋅ Bimal Gajera ⋅ Irish Mehta ⋅ Bhanu Tokas ⋅ Jacob Adler ⋅ Steven Lu ⋅ Scott Dickenshied ⋅ Serina Diniega ⋅ Brian Bue ⋅ Umaa Rebbapragada ⋅ Hannah Kerner
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 593
Seeing Through the Noise: Improving Infrared Small Target Detection and Segmentation from Noise Suppression Perspective
Maoxun Yuan ⋅ Duanni Meng ⋅ Ziteng Xi ⋅ Tianyi Zhao ⋅ Shiji Zhao ⋅ Yimian Dai ⋅ Xingxing Wei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 594
GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization
Zixuan Song ⋅ Jing Zhang ⋅ Di Wang ⋅ Zidie Zhou ⋅ Wenbin Liu ⋅ Haonan Guo ⋅ En Wang ⋅ Bo Du
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 595
GeoSANE: Learning Geospatial Representations from Models, Not Data
Joëlle Hanna ⋅ Damian Falk ⋅ Stella X. Yu ⋅ Damian Borth
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 596
Brewing Stronger Features: Dual-Teacher Distillation for Multispectral Earth Observation
Filip Wolf ⋅ Blaz Rolih ⋅ Luka Cehovin Zajc
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 597
Spectral Super-Resolution via Adversarial Unfolding and Data-Driven Spectrum Regularization: From Multispectral Satellite Data to NASA Hyperspectral Image
Si-Sheng Yang ⋅ Chia-Hsiang Lin
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 598
RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation
Nicolas Houdré ⋅ Diego Marcos ⋅ Hugo Riffaud de Turckheim ⋅ Dino Ienco ⋅ Laurent Wendling ⋅ Camille Kurtz ⋅ Sylvain Lobry
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 599
ORSATR-X: A Foundation Model based on Differential-and-Excitation Networks for Optical Remote Sensing Object Recognition
Canyu Mo ⋅ Yongxiang Liu ⋅ Jiehua Zhang ⋅ Zilong Yu ⋅ Zhen Liu ⋅ Tianpeng Liu ⋅ Li Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 600
SEBA: Sample-Efficient Black-Box Attacks on Visual Reinforcement Learning
Tairan HUANG ⋅ Yulin Jin ⋅ Junxu Liu ⋅ Qingqing Ye ⋅ Haibo Hu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 601
IAG: Input-aware Backdoor Attack on VLM-based Visual Grounding
Junxian Li ⋅ Beining Xu ⋅ Simin Chen ⋅ Jiatong LI ⋅ Jingdi Lei ⋅ Haodong Zhao ⋅ Di Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 602
DASH: A Meta-Attack Framework for Synthesizing Effective and Stealthy Adversarial Examples
Abdullah Al Nomaan Nafi ⋅ Habibur Rahaman ⋅ Zafaryab Haider ⋅ Tanzim Mahfuz ⋅ Fnu Suya ⋅ Swarup Bhunia ⋅ Prabuddha Chakraborty
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 603
AdapAction: Adaptive Target Action Backdoor Attack against GUI Agents
Baicheng Chen ⋅ Mingda Zhang ⋅ Min Zhang ⋅ Haizhou Li ⋅ Baoyuan Wu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 604
Phantom: Physical Object Interactions as Dynamic Triggers for NMS-Exploited Backdoors
Tianlin Huo ⋅ Dongchuan Ran ⋅ Ranjie Duan ⋅ Yao Zhu ⋅ Peilun Du ⋅ ningbo yao ⋅ Huanqian Yan ⋅ Xu Han ⋅ Qiang Yun ⋅ Yuzheng Tan ⋅ Yang Bao ⋅ Yuan He
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 605
Verifying Neural Network Robustness with Dual Perturbations
Hai Duong ⋅ Son Vu ⋅ Thanh Le ⋅ ThanhVu Nguyen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 606
Defending Unauthorized Model Merging via Dual-Stage Weight Protection
Wei-Jia Chen ⋅ Min-Yan Tsai ⋅ Cheng-Yi Lee ⋅ Chia-Mu Yu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 607
AntiStyler: Defending Object Detection Models Against Adversarial Patch Attacks Using Style Removal
Idan Yankelev ⋅ Edita Grolman ⋅ Yarin Yerushalmi Levi ⋅ Amit Giloni ⋅ Omer Hofman ⋅ Toshiya Shimizu ⋅ Yuval Elovici ⋅ Asaf Shabtai
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 608
On the Role of Temporal Granularity in the Robustness of Spiking Neural Networks
Mengting Xu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 609
Boosting Vision-Language-Action Finetuning with Feasible Action Neighborhood Prior
Haochen Niu ⋅ Kanyu Zhang ⋅ Shuyu Yin ⋅ Qinghai Guo ⋅ Peilin Liu ⋅ Fei Wen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 610
Exploring Conditions for Diffusion Models in Robotic Control
Heeseong Shin ⋅ Byeongho Heo ⋅ Dongyoon Han ⋅ Seungryong Kim ⋅ Taekyung Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 611
A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
Tommie Kerssies ⋅ Gabriele Berton ⋅ Ju He ⋅ Qihang Yu ⋅ Wufei Ma ⋅ Daan de Geus ⋅ Gijs Dubbelman ⋅ Liang-Chieh Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 612
Efficient Hybrid SE(3)-Equivariant Visuomotor Flow Policy via Spherical Harmonics for Robot Manipulation
Qinglun Zhang ⋅ Shen Cheng ⋅ Tian Dan ⋅ Haoqiang Fan ⋅ Guanghui Liu ⋅ Shuaicheng Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 613
TSTM: Temporal Segmentation for Task-relevant Mask in Visual Reinforcement Learning Generalization
Weicheng Du ⋅ Wenjia Meng ⋅ Zhengzhe Zhang ⋅ Yilong Yin ⋅ Xiankai Lu
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 614
Scaling Spatial and Temporal Context for Robotic Imitation Learning Policies With Scene Graphs
Jianing Qian ⋅ Qinhe Peng ⋅ Emmanuel Panov ⋅ Leonor Fermoselle ⋅ Dinesh Jayaraman ⋅ Bernadette Bucher ⋅ Tarik Kelestemur
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 615
AdaDexTrack: Dynamic Modulation for Adaptive and Generalizable Dexterous Manipulation Tracking
Jianibieke Adalibieke ⋅ Qianwei Han ⋅ Xueyi Liu ⋅ Yuzhe Qin ⋅ Li Yi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 616
GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
Enda Xiang ⋅ Haoxiang Ma ⋅ Xinzhu Ma ⋅ Zicheng Liu ⋅ Di Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 617
MoEActok: A MoE-based Action Tokenizer for Vision-Language-Action Models
Chunpu Xu ⋅ Zhixuan Liang ⋅ Tianshuo Yang ⋅ Chi-Min Chan ⋅ Yang Xiao ⋅ Jessie Wang ⋅ Xiaokang Yang ⋅ Yao Mu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 618
A Cross-view Fusion Framework for Robust 6-DoF Grasp Pose Estimation
Kangjian Zhu ⋅ Haobo Jiang ⋅ Jianjun Qian ⋅ Jin Xie
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 619
SAVA-X: Ego-to-Exo Imitation Error Detection via Scene-Adaptive View Alignment and Bidirectional Cross View Fusion
Xiang Li ⋅ Heqian Qiu ⋅ Lanxiao Wang ⋅ Benliu Qiu ⋅ Fanman Meng ⋅ Linfeng Xu ⋅ Hongliang Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 620
PromptDepth: Efficient and Promptable Geometric 3D Vision Model for Embodied Intelligence
Xianyun Wang ⋅ Jiaxu Miao ⋅ Tian Xu ⋅ Siyuan Wang ⋅ Yuehao Li ⋅ Haoyang Hu ⋅ Jun Xiao ⋅ Yonghong Tian ⋅ Jun Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 621
Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3-D Constrained Terrains
Qingwei Ben ⋅ Botian Xu ⋅ Kailin Li ⋅ Feiyu Jia ⋅ Wentao Zhang ⋅ Jingping Wang ⋅ Jingbo Wang ⋅ Dahua Lin ⋅ Jiangmiao Pang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 622
PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation
Yuanzhe Liu ⋅ Jingyuan Zhu ⋅ Yuchen Mo ⋅ Gen Li ⋅ Xu Cao ⋅ Jin Jin ⋅ Yifan Shen ⋅ Zhengyuan Li ⋅ Tianjiao Yu ⋅ Wenzhen Yuan ⋅ Fangqiang Ding ⋅ Ismini Lourentzou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 623
IGen: Scalable Data Generation for Robot Learning from Open-World Images
Chenghao Gu ⋅ Haolan Kang ⋅ Junchao Lin ⋅ Jinghe Wang ⋅ Duo Wu ⋅ Shuzhao Xie ⋅ Fanding Huang ⋅ Junchen Ge ⋅ Ziyang Gong ⋅ Letian Li ⋅ Hongying Zheng ⋅ Changwei Lv ⋅ Zhi Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 624
Hypergraph-State Collaborative Reasoning for Multi-Object Tracking
Zikai Song ⋅ Junqing Yu ⋅ Yi-Ping Phoebe Chen ⋅ Wei Yang ⋅ Xinchao Wang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 625
TGTrack: Temporal Generative Learning for Unified Single Object Tracking
Wanting Geng ⋅ Xin Chen ⋅ Chuanyu Sun ⋅ Jie Zhao ⋅ Ben Kang ⋅ Dong Wang ⋅ Huchuan Lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 626
GeoMotion: Rethinking Motion Segmentation via Latent 4D Geometry
Xiankang He ⋅ Peile Lin ⋅ Ying Cui ⋅ Dongyan Guo ⋅ Chunhua Shen ⋅ Xiaoqin Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 627
Generalizable Structure-Aware Keypoint Correspondence for Category-Unified 3D Single Object Tracking
Jie Xiao ⋅ Yinchao Ma ⋅ Yuyang Tang ⋅ Dengqing Yang ⋅ Jianpeng Yang ⋅ Xu Zhou ⋅ Qiao Li ⋅ Wenfei Yang ⋅ Tianzhu Zhang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 628
Generative Point Tracking and Forecasting
Xuanchen Lu ⋅ Ang Cao ⋅ Chao Feng ⋅ Andrew Owens
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 629
RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation
Hao Li ⋅ Yuhao Wang ⋅ Wenning Hao ⋅ Pingping Zhang ⋅ Dong Wang ⋅ Huchuan Lu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 630
Dual-level Adaptation for Multi-Object Tracking: Building Test-Time Calibration from Experience and Intuition
Wen Guo ⋅ Pengfei Zhao ⋅ Zongmeng Wang ⋅ Yufan Hu ⋅ Junyu Gao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 631
GMT: Effective Global Framework for Multi-Target Multi-Camera Tracking
Yihao Zhen ⋅ Mingyue Xu ⋅ Qiang Wang ⋅ Baojie Fan ⋅ Jiahua Dong ⋅ Tinghui Zhao ⋅ Huijie Fan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 632
Bridging Brain and Semantics: A Hierarchical Framework for Semantically Enhanced fMRI-to-Video Reconstruction
Yujie Wei ⋅ Chenglong Ma ⋅ Jianxiong Gao ⋅ Chenhui Wang ⋅ Shiwei Zhang ⋅ Biao Gong ⋅ Shuai Tan ⋅ Hangjie Yuan ⋅ Hongming Shan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 633
GraPHFormer: A Multimodal Graph Persistent Homology Transformer for the Analysis of Neuroscience Morphologies
Uzair Shah ⋅ Marco Agus ⋅ Mahmoud Gamal ⋅ Mahmood Alzubaidi ⋅ Corrado Cali ⋅ PIERRE MAGISTRETTI ⋅ Abdesselam Bouzerdoum ⋅ Mowafa Househ
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 634
DARC: Dual Adjustment Reasoning with Counterfactuals for Trustworthy Chest X-ray Classification
Zhifang Liao ⋅ Junhao Li ⋅ HaoKang Ding ⋅ Yucheng Song
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 635
Every Error has Its Magnitude: Asymmetric Mistake Severity Training for Multiclass Multiple Instance Learning
Sungrae Hong ⋅ Jiwon Jeong ⋅ Jisu Shin ⋅ Donghee Han ⋅ Sol Lee ⋅ Kyungeun Kim ⋅ Mun Yong Yi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 636
Phrase-grounded APO for Improving Chest X-ray Report Generation
Raziuddin Mahmood ⋅ Tanveer Syeda-Mahmood
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 637
Focus-to-Perceive Representation Learning: A Cognition-Inspired Hierarchical Framework for Endoscopic Video Analysis
Yuan Zhang ⋅ Sihao Dou ⋅ Kai Hu ⋅ Shuhua Deng ⋅ Chunhong Cao ⋅ Fen Xiao ⋅ Xieping Gao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 638
OraPO: Oracle-educated Reinforcement Learning for Data-efficient and Factual Radiology Report Generation
Zhuoxiao Chen ⋅ Hongyang Yu ⋅ Ying Xu ⋅ Yadan Luo ⋅ Long Duong ⋅ Yuan-Fang Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 639
FluoCLIP: Stain-Aware Focus Quality Assessment in Fluorescence Microscopy
Hyejin Park ⋅ Jiwon Yoon ⋅ Sumin Park ⋅ Suree Kim ⋅ Sinae Jang ⋅ Eunsoo Lee ⋅ Dongmin Kang ⋅ Dongbo Min
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 640
CryoKRAQEN: Kernel-Regularized Annealing for Quantized Embedding Networks in Cryo-EM Heterogeneous Reconstruction
Wenyuan Gao ⋅ Yutan Wu ⋅ Xuming He
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 641
Building Robust Vision Encoders for Cross-Dataset Evaluation in Immunofluorescent Microscopy
Umar Marikkar ⋅ Syed Sameed Husain ⋅ Muhammad Awais ⋅ Sara Atito
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 642
H2-Surv: Hierarchical Hyperbolic Multimodal Representation Learning for Survival Prediction
Jiaqi Yang ⋅ Wenting Chen ⋅ Xiangjian He ⋅ Yuanbai Li ⋅ Sen Yang ⋅ Linlin Shen ⋅ Xiaohan Xing
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 643
Dual-Level Hypergraph Generation for Addressing Feature Scarcity in Whole-Slide Image Classification
Shuilian Yao ⋅ Qi Jia ⋅ Qi Jia ⋅ Pengshuo Zhang ⋅ Lili Sun ⋅ Weimin Wang ⋅ Yanmei Zhu ⋅ Bo Zhang ⋅ Xin Fan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 644
Temporal Inversion for Learning Interval Change in Chest X-Rays
Hanbin Ko ⋅ Kyungmin Jeon ⋅ Doowoong Choi ⋅ Chang Min Park
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 645
JUMP-Hand: Learning Joint-wise Uncertainty to Gate Mixture of View Experts for Multi-View 3D Hand Reconstruction
Haohong Kuang ⋅ Yang Xiao ⋅ Changlong Jiang ⋅ Jinghong Zheng ⋅ Hang Xu ⋅ Ran Wang ⋅ Zhiguo Cao ⋅ Joey Tianyi Zhou
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 646
PAD-Hand: Physics-Aware Diffusion for Hand Motion Recovery
Elkhan Ismayilzada ⋅ Yufei Zhang ⋅ Zijun Cui
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 647
Anatomical Domain Shifts: Test-time Heterogeneous Adaptation for 3D Human Pose Prediction
Qiongjie Cui ⋅ Pan Zhou ⋅ Jingjing Chen ⋅ Na Zhao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 648
Unlocking Motion from Large Vision Models with a Semantic and Kinematic Duality for Gait Recognition
Zhanbo Huang ⋅ Dingqiang Ye ⋅ Xiaoming Liu ⋅ Yu Kong
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 649
Learning 3D Shape Fidelity Metric from Real-world Distortions
Xuelu Feng ⋅ Tianyu Luan ⋅ Zixin Zhu ⋅ Akshobhya Sharma ⋅ Phani Nuney ⋅ Junsong Yuan ⋅ Chunming Qiao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 650
BarbieGait: An Identity-Consistent Synthetic Human Dataset with Versatile Cloth-Changing for Gait Recognition
Qingyuan Cai ⋅ Saihui Hou ⋅ Xuecai Hu ⋅ Yongzhen Huang
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 651
FisherPoser: Human Motion Estimation from Sparse Observations with Hierarchical Region-Wise Fisher-Matrix Uncertainty Modeling
Songpengcheng Xia ⋅ Qingyu Zhang ⋅ Zhuo Su ⋅ Jiarui Yang ⋅ Zengyuan Lai ⋅ Qi Wu ⋅ Ling Pei
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 652
EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents
Wenjia Wang ⋅ Liang Pan ⋅ Huaijin Pi ⋅ Yuke Lou ⋅ Xuqian Ren ⋅ Yifan Wu ⋅ Zhouyingcheng Liao ⋅ Lei Yang ⋅ Rishabh Dabral ⋅ Christian Theobalt ⋅ Taku Komura
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 653
Ground Reaction Inertial Poser: Physics-based Human Motion Capture from Sparse IMUs and Insole Pressure Sensors
Ryosuke Hori ⋅ Jyun-Ting Song ⋅ Zhengyi Luo ⋅ Jinkun Cao ⋅ Soyong Shin ⋅ HIDEO SAITO ⋅ Kris Kitani
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 654
FUN REC Reconstructing Functional 3D Scenes from Egocentric Interaction Videos
Alexandros Delitzas ⋅ Chenyangguang Zhang ⋅ Alexey Gavryushin ⋅ Tommaso Di Mario ⋅ Boyang Sun ⋅ Rishabh Dabral ⋅ Leonidas Guibas ⋅ Christian Theobalt ⋅ Marc Pollefeys ⋅ Francis Engelmann ⋅ Daniel Barath
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 655
VIMCAN: Visual-Inertial 3D Human Pose Estimation with Hybrid Mamba-Cross-Attention Network
Zepeng Yang ⋅ Junxuan Bai ⋅ Hao Li ⋅ Ju Dai ⋅ Junjun Pan ⋅ Yongfeng Yin ⋅ Bin Li
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 656
Bringing Your Portrait to 3D Presence
Jiawei Zhang ⋅ Lei Chu ⋅ Jiahao Li ⋅ Zhenyu Zang ⋅ Chong Li ⋅ Xiao Li ⋅ Xun Cao ⋅ Hao Zhu ⋅ Yan Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 657
FLOW: Feature-Level Optimal Warping for Generalized Remote Physiological Measurement
bo zhao ⋅ Junzhe Cao ⋅ Dan Guo ⋅ Dongmin Huang ⋅ Wenjin Wang ⋅ Tao Tan ⋅ Yue Sun ⋅ Zitong YU
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 658
One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control
Haoxiang Rao ⋅ Zhao Wang ⋅ Chenyang Si ⋅ Yan LYU ⋅ Yuanyi Duan ⋅ Fang Zhao ⋅ Caifeng Shan
[ Slides [ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 659
UniMMAD: Unified Multi-Modal and Multi-Class Anomaly Detection via MoE-Driven Feature Decompression
Yuan Zhao ⋅ Youwei Pang ⋅ Lihe Zhang ⋅ Hanqi Liu ⋅ Jiaming Zuo ⋅ Huchuan Lu ⋅ Xiaoqi Zhao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 660
BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection
Melissa Schween ⋅ Mathis Kruse ⋅ Bodo Rosenhahn
[ Slides
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 661
Multi-Prototype Compactness and Boundary-Aware Synthesis for Unsupervised Anomaly Detection
Liao Kailun ⋅ Jianfeng Yang ⋅ Tao Tao ⋅ Wenfei Wu ⋅ Jiaming Jiang ⋅ Jinsheng Xiao
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 662
PDD: Manifold-Prior Diverse Distillation for Medical Anomaly Detection
Xijun Lu ⋅ Hongying Liu ⋅ Fanhua Shang ⋅ Yanming hui ⋅ Liang Wan
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 663
Weakly Supervised Video Anomaly Detection with Anomaly-Connected Components and Intention Reasoning
Yu Wang ⋅ Hongli Liu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 664
SubspaceAD: Training-Free Few-Shot Anomaly Detection via Subspace Modeling
Camile Lendering ⋅ Erkut Akdag ⋅ Egor Bondarev
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 665
Learning Spatial-Temporal Consistency for 3D Semantic Scene Completion
Yujie Xue ⋅ Meng Wang ⋅ Ruihui Li ⋅ F anWu ⋅ Zhizhong Liu ⋅ Zhuo Tang ⋅ Kenli Li
[ Slides
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 666
Generalizing Visual Geometry Priors to Sparse Gaussian Occupancy Prediction
Changqing Zhou ⋅ Yueru Luo ⋅ Changhao Chen
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 667
Deformable Gaussian Occupancy: Decoupling Rigid and Nonrigid Motion with Factorized Distillation
Yang Gao ⋅ Wuyang Li ⋅ Po-Chien Luan ⋅ Alex Alahi
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 668
OccAny: Generalized Unconstrained Urban 3D Occupancy
Anh Quan Cao ⋅ Vu
[ Poster
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 669
Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving
Xubo Zhu ⋅ Haoyang Zhang ⋅ Fei He ⋅ Rui Wu ⋅ Yanhu Shan ⋅ Wen Yang ⋅ Huai Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 670
ShelfOcc: Native 3D Supervision beyond LiDAR for Vision-Based Occupancy Estimation
Simon Boeder ⋅ Fabian Gigengack ⋅ Simon Roesler ⋅ Holger Caesar ⋅ Benjamin Risse
[ Poster