(4463 events)
The 2026 schedule is still incomplete
Oral
None
Adversarial Style Optimization: Enhancing VLM Jailbreaks by GRPO-based Stylistic Triggers Optimization
Registration
Tue Jun 02 01:00 PM -- 07:00 PM (PDT) @ Lobby A None
Registration / Badge Pickup
Break
Wed Jun 03 06:00 AM -- 08:00 AM (PDT) @ ExHall C None
Breakfast
Registration
Wed Jun 03 06:00 AM -- 04:00 PM (PDT) @ Lobby A None
Registration / Badge Pickup
Tutorial
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 301/302 None
The Principles of Diffusion Models: Real-Time Continuous & Discrete Diffusion
Chieh-Hsin Lai · Subham Sahoo · Dongjun Kim · Yang Song · Yuki Mitsufuji · Stefano Ermon
Tutorial
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ Mile High 2B None
Tom Builds, Tom Breaks: Hands-On Attacks and Defenses for Vision-Language Systems
Pavan Reddy
Tutorial
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ Mile High 3C None
Towards Safe Multi-Modal Learning: Evolving Threats and Safety Solutions
Xi Li · Manling Li · Muchao Ye
Tutorial
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 702 None
Edge AI in Action: Mastering On-Device Inference
Fabricio Batista Narcizo · Elizabete Munzlinger · Sai Narsi Reddy Donthi Reddy · Shan Ahmed Shaffi
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ Four Seasons 4 None
Workshop on "Bitter Lessons"
Anand Bhattad ⋅ Aditya Prakash
Workshop
Wed Jun 03 07:00 AM -- 11:30 AM (PDT) @ 111 None
Generative AI for XR and Identity-based Applications
Brendan David-John ⋅ Chris Thomas
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 506 None
GRAIL-V: Grounded Retrieval & Agentic Intelligence for Vision-Language
Amit Agarwal ⋅ Vivek Gupta
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 505 None
The 3rd Workshop on Human Motion Generation - New Perspective on Simulation, Animation, and VR applications
Chuan Guo ⋅ Yuxuan Mu
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 106 None
LatinX in Computer Vision Research Workshop
Francisco Lopez-Tiro ⋅ Dustin Carrión-Ojeda ⋅ Willams De Lima ⋅ Hernan Dario Benitez ⋅ Ana Maria Quintero ⋅ William de Lima
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ Mile High 1CD None
Multimodal Foundation Models for Biomedicine: Challenges and Opportunities
Yuhui Zhang ⋅ Xiaohan Wang
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 601 None
The 2nd Workshop on Multimodal Spatial Intelligence
Juil Koo ⋅ Phillip Y. Lee
Workshop
Wed Jun 03 07:00 AM -- 11:45 AM (PDT) @ Mile High 4EF None
On Sensor Vision Workshop
Andrew J. Davison ⋅ Shinjeong Kim
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 712 None
22nd Workshop on Perception Beyond the Visible Spectrum
Riad I. Hammoud ⋅ Yi Ding
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 109 None
The 2nd International Workshop & Challenge on Subtle Visual Computing @CVPR 2026
Zitong Yu ⋅ Xun Lin
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 705/707 None
1st Workshop on Video World Models: Interaction, Memory, and Efficiency
Jiwen Yu ⋅ Xihui Liu
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ 708 None
Women in Computer Vision
Karen Sanchez ⋅ Carla Muntean
Workshop
Wed Jun 03 07:00 AM -- 11:00 AM (PDT) @ Mile High 2A None
Workshop on World Models Meet Active Sensing and Closed-Loop Planning
Jieneng Chen ⋅ Alan Yuille
Workshop
Wed Jun 03 07:00 AM -- 11:30 AM (PDT) @ Mile High 1AB None
The 5th Explainable AI for Computer Vision (XAI4CV) Workshop
Miguel-Ángel Fernández-Torres
Workshop
Wed Jun 03 07:00 AM -- 11:30 AM (PDT) @ 108 None
PHAROS AI Factory for Medical Imaging & Healthcare
Stefanos Kollias ⋅ Xujiong Ye
Workshop
Wed Jun 03 07:00 AM -- 04:00 PM (PDT) @ Mile High 1EF None
Workshop on Agentic AI for Visual Media
Jinjin Gu ⋅ Lei Sun
Workshop
Wed Jun 03 07:00 AM -- 04:00 PM (PDT) @ 503 None
Bridging Vision, Language, and Action: What’s Missing in Actionable Visual Perception for Robotics
Jiawei Ma ⋅ Chengzhi Mao
Workshop
Wed Jun 03 07:00 AM -- 04:00 PM (PDT) @ Mile High 3A None
Autonomous Understanding Through Open-world Perception and Integrated Language models for On-road Tasks
Ali AlShami ⋅ Ryan Rabinowitz
Workshop
Wed Jun 03 07:00 AM -- 05:00 PM (PDT) @ 207 None
Foundation Models for V2X-Based Cooperative Autonomous Driving
Walter Zimmer ⋅ Rui Song
Workshop
Wed Jun 03 07:00 AM -- 04:00 PM (PDT) @ Four Seasons 1 None
From Lab Demos to Daily Tasks: Embodied Intelligence in the Wild
Huijie Wang ⋅ Hongyang Li
Workshop
Wed Jun 03 07:00 AM -- 04:00 PM (PDT) @ 504 None
13th Workshop on Fine-grained Visual Categorization
Nico Lang ⋅ Lukas Picek
Workshop
Wed Jun 03 07:00 AM -- 04:00 PM (PDT) @ 205 None
4th Workshop on Vision Based Industrial Inspection
Shancong Mou ⋅ Hao Yan
Workshop
Wed Jun 03 07:00 AM -- 04:00 PM (PDT) @ Four Seasons 2 None
The 1st Workshop on Deployment of Foundation Models for Embodied AI
Burhan Yaman ⋅ Xin Ye
Workshop
Wed Jun 03 07:15 AM -- 12:00 PM (PDT) @ 102/104 None
Workshop on Vision-based Assistants in the Real-World
Apratim Bhattacharyya ⋅ Fadime Sener
Workshop
Wed Jun 03 07:20 AM -- 11:30 AM (PDT) @ 113 None
Multimodal Alignment for a Pluralistic Society
Perampalli Shravan Nayak ⋅ Aishwarya Agrawal
Workshop
Wed Jun 03 07:25 AM -- 11:35 AM (PDT) @ 501 None
AI for Creative Visual Content Generation, Editing and Understanding
Ozgur Kara ⋅ Junho Kim
Workshop
Wed Jun 03 07:25 AM -- 12:00 PM (PDT) @ 203 None
IPA: Interactive Physical AI Workshop
Seonwook Park ⋅ Amrita Mazumdar
Workshop
Wed Jun 03 07:30 AM -- 11:30 AM (PDT) @ 610/612 None
AI for Content Creation
James Tompkin ⋅ Krishna Kumar Singh
Workshop
Wed Jun 03 07:30 AM -- 11:30 AM (PDT) @ Mile High 4AB None
The 3rd AI for Visual Arts Workshop and Challenges
Deblina Bhattacharjee ⋅ Bahar Aydemir
Workshop
Wed Jun 03 07:30 AM -- 11:30 AM (PDT) @ 710 None
The 5th DataCV Workshop and Challenge
Liang Zheng ⋅ Yue Yao
Workshop
Wed Jun 03 07:30 AM -- 10:59 AM (PDT) @ 711 None
The 5th Workshop on Federated Learning for Computer Vision
Chen Chen ⋅ Guangyu Sun
Workshop
Wed Jun 03 07:30 AM -- 11:30 AM (PDT) @ 112 None
Generative AI for Sign Language
Hezhen Hu ⋅ Yuecong Min
Workshop
Wed Jun 03 07:30 AM -- 04:00 PM (PDT) @ Mile High 2C None
Sense of Space: Multi-Sensory Modeling for Embodied Intelligence
Rao Fu ⋅ Li Guan
Workshop
Wed Jun 03 07:30 AM -- 05:00 PM (PDT) @ 703 None
Visual General Intelligence
Hirokatsu Kataoka ⋅ Yoshihiro Fukuhara
Workshop
Wed Jun 03 07:30 AM -- 11:30 AM (PDT) @ 507 None
AI4RWC: The 2nd International Workshop on Vision Intelligence for Real-world Challenges
Daqian Shi ⋅ Xiaolei Diao
Workshop
Wed Jun 03 07:45 AM -- 04:00 PM (PDT) @ Mile High 4CD None
Computational Cameras and Displays
Vishwanath Saragadam ⋅ Fei Xia
Workshop
Wed Jun 03 07:45 AM -- 04:50 PM (PDT) @ 704/706 None
Third Joint Egocentric Vision (EgoVis) Workshop
Siddhant Bansal ⋅ Tushar Nagarajan
Workshop
Wed Jun 03 07:50 AM -- 11:30 AM (PDT) @ 110 None
AERO-HPR: Human Perception and Recognition in Aerial Surveillance
Kien Nguyen Thanh ⋅ Arun Ross
Workshop
Wed Jun 03 07:50 AM -- 11:30 AM (PDT) @ 107 None
2nd Workshop on Photorealistic 3D Head Avatars
Tobias Kirschstein ⋅ Simon Giebenhain
Workshop
Wed Jun 03 07:50 AM -- 02:30 PM (PDT) @ 502 None
Efficient Deep Learning for Computer Vision
Shuai Zhang ⋅ Yung-Hsiang Lu
Tutorial
Wed Jun 03 08:00 AM -- 11:15 AM (PDT) @ 201 None
Accelerated Diffusion Models: From Theory to Interactive World Models
Julius Berner · Weili Nie · Arash Vahdat
Workshop
Wed Jun 03 08:00 AM -- 12:00 PM (PDT) @ 105 None
The 3rd Workshop on AI for Content Generation, Quality Enhancement and Streaming
Marcos V. Conde ⋅ Radu Timofte
Workshop
Wed Jun 03 08:00 AM -- 11:30 AM (PDT) @ 709 None
The 22nd Embedded Vision Workshop
Matteo Poggi ⋅ Tse-Wei Chen
Workshop
Wed Jun 03 08:00 AM -- 11:30 AM (PDT) @ 607 None
The 3rd Workshop on Foundation Models for Medical Vision
Jun Ma ⋅ Yuyin Zhou
Workshop
Wed Jun 03 08:00 AM -- 03:00 PM (PDT) @ 605 None
12th Workshop on Medical Computer Vision
Zongwei Zhou ⋅ Yucheng Tang
Workshop
Wed Jun 03 08:00 AM -- 05:00 PM (PDT) @ Mile High 3B None
Urban Scene Modeling: Structured, Semantic, and Synthetic 3D Habitats
Jack Langerman ⋅ Ruisheng Wang
Workshop
Wed Jun 03 08:15 AM -- 05:00 PM (PDT) @ 603 None
Workshop on Autonomous Driving
Vincent Casser ⋅ Jose M. Alvarez
Break
Wed Jun 03 09:00 AM -- 10:00 AM (PDT) @ ExHall A None
Coffee Break
Tutorial
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ Mile High 3C None
Principled Interpretability in Vision Models: From Mechanistic Understanding to Interpretable Models by Design
Tsui-Wei (Lily) Weng · Tuomas Oikarinen
Tutorial
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 702 None
Monte Carlo physical simulation
Rohan Sawhney · Bailey Miller · Ioannis Gkioulekas · Keenan Crane
Tutorial
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 201 None
Building GenAI based Simulation Environment for End-to-End Autonomous Driving
Henry Liu · Howie Sun · Jun Gao · Shuo Feng · Xintao Yan · Jiawei Wang
Tutorial
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 301/302 None
From Perception to Simulation: The Emergence of World Models in Multi-modal Reasoning
Yujun Cai · Jianfei Cai · Yiwei Wang · Ming-Hsuan Yang
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 507 None
GigaBrain Challenge 2026: Workshop on World Models Empowering Vision Language Action Model
Zheng Zhu ⋅ Xiaofeng Wang ⋅ Hongyang Li ⋅ Shanghang Zhang ⋅ Yao Mu ⋅ Haoqiang Fan ⋅ Zhizhong Su
Workshop
Wed Jun 03 12:00 PM -- 04:45 PM (PDT) @ 106 None
The Second CVPR Workshop on Foundation and Large Vision Models in Remote Sensing (MORSE)
Saurabh Prasad ⋅ Jocelyn Chanussot
Workshop
Wed Jun 03 12:00 PM -- 05:15 PM (PDT) @ Mile High 1CD None
The 2nd 3D-LLM/VLA Workshop: Bridging Language, Vision and Action in 3D Environments
Yining Hong ⋅ Wenbo Hu
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 505 None
10th Affective & Behavior Analysis in-the-wild
Dimitrios Kollias ⋅ Panagiotis Tzirakis
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 103 None
Authenticity & Provenance in the age of Generative AI
Shruti Agarwal ⋅ Sarah Barrington
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 711 None
Auto-Annotation with Expert-Crafted Guidelines
Shu Kong ⋅ Sara Beery
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 610/612 None
Cognitive Foundations for Multimodal Models
Aditya Chinchure ⋅ Sahithya Ravi
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 109 None
Computer Vision for the Built World
Iro Armeni ⋅ Fuxin Li
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 102/104 None
Computer Vision with Small Data: Beyond Scale -- Toward Data-Efficient Dynamically-Aware Video Intelligence
Sarah Ostadabbas ⋅ Shayda Moezzi
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 112 None
Computer Vision for Biomechanics Workshop
Ethan Goan ⋅ Akila Pemasiri
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 110 None
Sixth Workshop on Neural Architecture Search
Stephen McGough ⋅ Amir Atapour-Abarghouei
Workshop
Wed Jun 03 12:00 PM -- 05:05 PM (PDT) @ 111 None
DataMFM: Emerging Directions in Data for Multimodal Foundation Models
Pengyuan Li ⋅ Zihan Wang
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 501 None
End-to-End 3D Learning
Zhiwen Fan ⋅ Dimitris Metaxas
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 203 None
3rd Workshop on Efficient and On-Device Generation (EDGE), CVPR 2026
Felix Juefei-Xu ⋅ Tingbo Hou
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ Mile High 4EF None
1st Workshop on Multi-Agent Robotic Systems: Scaling with Compositional Intelligence
Yiran Qin ⋅ Zhenfei Yin
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ Four Seasons 4 None
The 5th Workshop on “What is Next in Multimodal Foundation Models?”
Edson Araujo ⋅ Roei Herzig
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 601 None
Workshop on Multimodal Human Motion Analysis
Olivia Nocentini ⋅ Rishabh Dabral
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 105 None
The 1st Workshop on Monitoring the World through an Imperfect Lens
Miriam Cha ⋅ Greg Angelides
Workshop
Wed Jun 03 12:00 PM -- 04:30 PM (PDT) @ 210/212 None
2nd Workshop on Multimodal Sign Language Recognition
Raffaele Mineo ⋅ Hamzah Luqman
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 709 None
The 3rd MetaFood Workshop (MTF)
Yuhao Chen ⋅ Petia Radeva
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ Mile High 1AB None
Machine Unlearning for Vision
Alessio Sampieri ⋅ Bardh Prenkaj
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 705/707 None
OpenSUN3D: 6th Workshop on Open-World 3D Scene Understanding with Foundation Models
Francis Engelmann ⋅ Anna-Maria Halacheva
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ 107 None
Synthetic & Adversarial ForEnsics
Josué Martínez-Martínez ⋅ Pooya Khorrami
Workshop
Wed Jun 03 12:00 PM -- 04:30 PM (PDT) @ 710 None
3rd Workshop on ScanNet++ Novel View Synthesis and 3D Semantic Understanding Challenge
Angela Dai ⋅ Matthias Nießner
Workshop
Wed Jun 03 12:00 PM -- 05:00 PM (PDT) @ Mile High 2A None
The 7th International Workshop and CVML Challenge on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture
Chris Padwick ⋅ Ripudaman Arora
Workshop
Wed Jun 03 12:00 PM -- 04:00 PM (PDT) @ 108 None
The 1st Workshop on Vision for Intelligent Task Assistants
Ehsan Elhamifar ⋅ Jason J. Corso
Workshop
Wed Jun 03 12:15 PM -- 05:00 PM (PDT) @ 113 None
Second Workshop on Foundation and Generative Models in Biometrics
Hatef Otroshi Shahreza ⋅ Vitomir Struc
Workshop
Wed Jun 03 12:20 PM -- 04:30 PM (PDT) @ Mile High 4AB None
Rediscovering Intelligence: Can AI Still Learn from Humans?
Xi Wang ⋅ Yen-Ling Kuo
Workshop
Wed Jun 03 12:25 PM -- 04:30 PM (PDT) @ 506 None
The 2nd Workshop on Test-time Scaling for Computer Vision
Yinpeng Dong ⋅ Yichi Zhang
Tutorial
Wed Jun 03 12:30 PM -- 03:45 PM (PDT) @ Mile High 2B None
3D Human Mesh Modeling and Recovery from RGB and LiDAR
Romain Bregier · Istvan Sarandi · Salma Galaaoui · Fabien Baradel · Nermin Samet · David Picard
Workshop
Wed Jun 03 12:30 PM -- 04:45 PM (PDT) @ 708 None
Spatial Intelligence for Cultural Heritage
Marina Paolanti ⋅ Roberto Pierdicca
Workshop
Wed Jun 03 12:45 PM -- 04:40 PM (PDT) @ 607 None
The 5th Workshop on Transformers for Vision and Multimodal AI
Gedas Bertasius ⋅ Zhiding Yu
Workshop
Wed Jun 03 01:00 PM -- 04:00 PM (PDT) @ 712 None
The 1st Workshop on AI-assisted Long Video Creation
Yudong Jiang ⋅ Lisai Zhang
Break
Wed Jun 03 02:00 PM -- 03:00 PM (PDT) @ ExHall A None
Coffee Break
Break
Thu Jun 04 06:00 AM -- 08:00 AM (PDT) @ ExHall C None
Breakfast
Registration
Thu Jun 04 06:00 AM -- 04:00 PM (PDT) @ Lobby A None
Registration / Badge Pickup
Workshop
Thu Jun 04 06:30 AM -- 11:30 AM (PDT) @ 105 None
3D Geometry Generation for Scientific Computing (2nd Edition)
Wuyang Chen ⋅ Marissa Ramirez de Chanlatte ⋅ Peter Yichen Chen ⋅ Chuhang Zou ⋅ Zhiwen Fan ⋅ Daniel Martin ⋅ Michael Mahoney
Workshop
Thu Jun 04 06:30 AM -- 11:30 AM (PDT) @ 704/706 None
2nd Workshop on Knowledge-Intensive Multimodal Reasoning
Arman Cohan ⋅ Yilun Zhao
Tutorial
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 702 None
Extending Computer Vision to Hidden Objects: A Tutorial on Millimeter-Wave Imaging and Reconstruction of Occluded Scenes
Mingmin Zhao · Laura Dodds
Tutorial
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ Mile High 2B None
The Full Stack of Physical AI: Simulation, Foundation Models, and Edge Deployment for Next-Generation Robotics Applications
Raymond Lo · Johnny Nunez · Chitoku Yato · Spencer Huang · Mitesh Patel
Tutorial
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 201 None
Recent Advances in AI for Medical Imaging: Progress, Challenges, and Future Directions
Jiaqi Wang · Peirong Liu · Can Zhao
Tutorial
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 203 None
Computer Vision at Scale: Multi-Camera Tracking, Calibration, and Event Detection for Checkout-Free Retail
Hareesh Kolluru · Motilal Agarwal · Tanmay Bangalore
Workshop
Thu Jun 04 07:00 AM -- 11:30 AM (PDT) @ 703 None
Third Workshop for Learning 3D with Multi-View Supervision
Abdullah J Hamdi ⋅ Silvio Giancola
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 610/612 None
6th Workshop on 3D Scene Understanding for Vision, Graphics, and Robotics
Yixin Chen ⋅ Shaofei Wang
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 502 None
Workshop on Any-to-any Multimodal Learning
Shengqiong Wu ⋅ Wei Dai
Workshop
Thu Jun 04 07:00 AM -- 11:30 AM (PDT) @ 102/104 None
The 3rd Workshop on New Trends in AI-Generated Media and Security
Shu Hu ⋅ Xin Wang
Workshop
Thu Jun 04 07:00 AM -- 11:30 AM (PDT) @ 106 None
2nd Workshop on Computer Vision for Children
Yifan Shen ⋅ Xu Cao
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ Four Seasons 2 None
The 5th Workshop on Computer Vision in the Wild: Towards Unified Multimodal Agents For Reasoning in the Wild
Reuben Tan ⋅ Zhengyuan Yang
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ Mile High 2C None
The Second Workshop on the Evaluation of the Generative Foundation Models
Wisdom Ikezogwo ⋅ Maria Zontak
Workshop
Thu Jun 04 07:00 AM -- 11:30 AM (PDT) @ 607 None
Geometry-Free Novel View Synthesis and Controllable Video Models
Andrea Tagliasacchi
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 710 None
Humans of Generative AI
Jaron Mink ⋅ David Forsyth
Workshop
Thu Jun 04 07:00 AM -- 11:30 AM (PDT) @ 504 None
The 1st Workshop on Low‑Level Vision Frontiers with Generative AI, Preference Optimization, and Agentic Systems
Xin Li ⋅ Yeying Jin
Workshop
Thu Jun 04 07:00 AM -- 11:10 AM (PDT) @ 711 None
6th Omnidirectional Computer Vision Workshop
Pierre Moulon ⋅ Guillaume Caron
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 712 None
Open-World Vision
Shu Kong ⋅ Neehar Peri
Workshop
Thu Jun 04 07:00 AM -- 11:20 AM (PDT) @ 113 None
From Perception to Persuasion: Challenges and Advances in Misinformation Detection in Society
Priyanka Singh ⋅ Xue Li
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ Mile High 3A None
SPAR-3D: Security, Privacy, and Adversarial Robustness in 3D Generative Vision Models
Nicole Meng ⋅ Yingjie Lao
Workshop
Thu Jun 04 07:00 AM -- 11:30 AM (PDT) @ 705/707 None
Trustworthy, Robust, Uncertainty-Aware, and Explainable Visual Intelligence and Beyond
Tsui-Wei Weng ⋅ Nghia Hoang
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ Mile High 4EF None
The 8th UG2+ Workshop and Challenge: Bridging the Gap between Computational Photography and Visual Perception
Alex Wong ⋅ Dong Lao
Workshop
Thu Jun 04 07:00 AM -- 11:30 AM (PDT) @ Mile High 4AB None
Unified Robotic Vision with Cross-Modal Sensing and Alignment
Zongwei Wu ⋅ Christos Sakaridis
Workshop
Thu Jun 04 07:00 AM -- 11:00 AM (PDT) @ 506 None
9th International Workshop on Visual Odometry and Computer Vision Applications Based on Location Clues
Guoyu Lu ⋅ Friedrich Fraundorfer
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ 112 None
11th Workshop on Computer Vision and Multimodal Microscopy Image Analysis
Steve Finkbeiner ⋅ Mei Chen
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ 107 None
The Seventh Annual Embodied Artificial Intelligence Workshop
Anthony Francis ⋅ David Hall
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ Mile High 2A None
2nd Workshop on Agents in Interaction, from Humans to Robots
Yufei Ye ⋅ Homanga Bharadhwaj
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ 505 None
Mobile AI workshop and associated challenges, 6th edition
Andrey Ignatov ⋅ Radu Timofte
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ Four Seasons 1 None
Multi-Agent Embodied Intelligent Systems Meet Agentic-AI era: Opportunities, Challenges and Futures
Xiangbo Gao ⋅ Yuheng Wu
Workshop
Thu Jun 04 07:00 AM -- 05:00 PM (PDT) @ 207 None
11th New Trends in Image Restoration and Enhancement Workshop and Challenges
Radu Timofte ⋅ Zongwei Wu
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ Mile High 3B None
Video Generative Models: Benchmarks and Evaluation
Shuo Xing ⋅ Mingyang Wu
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ Four Seasons 4 None
2nd Workshop on Video Large Language Models
Rohit Gupta ⋅ Sirnam Swetha
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ 501 None
Workshop on Visual Concepts
Joy Hsu ⋅ R. Kenny Jones
Workshop
Thu Jun 04 07:00 AM -- 04:00 PM (PDT) @ Mile High 1CD None
Sight and Sound
Andrew Owens ⋅ Jiajun Wu
Workshop
Thu Jun 04 07:10 AM -- 11:30 AM (PDT) @ Mile High 1AB None
4th Workshop on Maritime Computer Vision
Benjamin Kiefer ⋅ Jon Muhovic
Tutorial
Thu Jun 04 07:30 AM -- 04:00 PM (PDT) @ Mile High 3C None
Analytic understanding of diffusion models
Artem Lukoianov · Chenyang Yuan · Christopher Scarvelis · Mason Kamb
Workshop
Thu Jun 04 07:30 AM -- 11:30 AM (PDT) @ 108 None
6th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling
Tuan-Anh Vu ⋅ Isla Duporge
Workshop
Thu Jun 04 07:30 AM -- 11:00 AM (PDT) @ 603 None
Exploring the Next Generation of Data
Nadine Chang ⋅ Maying Shen
Workshop
Thu Jun 04 07:30 AM -- 11:30 AM (PDT) @ Mile High 4CD None
Personalization in Generative AI Workshop
Pinar Yanardag ⋅ Nupur Kumari
Workshop
Thu Jun 04 07:30 AM -- 11:30 AM (PDT) @ 110 None
PhysHuman: Physically Grounded Human Perception and Modeling
Feng Liu ⋅ Youngjoong Kwon ⋅ Cheng Zhang
Workshop
Thu Jun 04 07:30 AM -- 11:30 AM (PDT) @ 103 None
Safe Artificial Intelligence for All Domains
Oliver Wasenmüller ⋅ Markus Enzweiler
Workshop
Thu Jun 04 07:45 AM -- 11:20 AM (PDT) @ 709 None
VizWiz Grand Challenge: Interpreting Images and Videos Taken by Blind People
Danna Gurari ⋅ Neelima Prasad
Workshop
Thu Jun 04 07:45 AM -- 04:00 PM (PDT) @ 205 None
4th Workshop on Generative Models for Computer Vision
Adam Kortylewski ⋅ Fangneng Zhan
Workshop
Thu Jun 04 07:45 AM -- 05:30 PM (PDT) @ 111 None
9th Multimodal Learning and Applications Workshop
Paolo Rota ⋅ Michael Ying Yang
Workshop
Thu Jun 04 07:55 AM -- 11:30 AM (PDT) @ 601 None
Multimodal Algorithmic Reasoning Workshop
Anoop Cherian ⋅ Suhas Lohit
Tutorial
Thu Jun 04 08:00 AM -- 04:30 PM (PDT) @ 301/302 None
All You Need To Know About Self-Driving
Raquel Urtasun · Abbas Sadat · Sivabalan Manivasagam · Jingkang Wang · Ioan Andrei Barsan
Workshop
Thu Jun 04 08:00 AM -- 11:15 AM (PDT) @ 210/212 None
The Eighth Workshop on Precognition: Seeing through the Future
Khoa Luu ⋅ Nemanja Djuric
Workshop
Thu Jun 04 08:00 AM -- 04:00 PM (PDT) @ 708 None
The 6th Workshop of Adversarial Machine Learning on Computer Vision: Safety of Vision-Language Agents
Aishan Liu ⋅ Jiakai Wang
Workshop
Thu Jun 04 08:00 AM -- 04:30 PM (PDT) @ 503 None
12th IEEE International Workshop on Computer Vision in Sports
Rikke Gade ⋅ Silvio Giancola
Workshop
Thu Jun 04 08:00 AM -- 04:00 PM (PDT) @ 507 None
EarthVision: Large Scale Computer Vision for Remote Sensing Imagery
Ronny Haensch ⋅ Devis Tuia
Workshop
Thu Jun 04 08:00 AM -- 04:00 PM (PDT) @ 605 None
Embodied Reasoning in Action: Workshop and Challenge on Embodied Reasoning for Robotic Manipulation
Jiafei Duan ⋅ Jason Ren
Workshop
Thu Jun 04 08:00 AM -- 04:30 PM (PDT) @ 109 None
2nd Workshop on Human-Interactive Generation and Editing
Jinbo Xing ⋅ Xi Chen
Workshop
Thu Jun 04 08:00 AM -- 04:00 PM (PDT) @ Mile High 1EF None
How Do Vision Models Work?
Tamar Rott Shaham ⋅ Amil Dravid
Break
Thu Jun 04 09:00 AM -- 10:00 AM (PDT) @ ExHall A None
Coffee Break
Tutorial
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 201 None
The Road to Convergence: Evolution of Unified Multimodal Models
Jindong Wang · Hao Chen · Jiakui Hu · Zhaolong Su · Sharon Li
Tutorial
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ Mile High 2B None
Foundations and Frontiers of Watermarking: Algorithms, Multimodal Extensions, Benchmarks, and Authenticity Frameworks
Vishal Asnani · Shruti Agarwal · Benedetta Tondi · Pierre Fernandez · Furong Huang
Tutorial
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 702 None
From Perception to Action: Building Efficient and Deployable Robot Intelligence Pipelines with Open-Source Edge AI Toolkits
Samet Akcay · Zhuo Wu · Michael Paulitsch · Ashutosh Kumar · Tao Xiong · Adrian Boguszewski · Sameer Sheorey · Benjamin Ummenhofer
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 603 None
1st Workshop on Generative 3D Reconstruction
Daniel Barath ⋅ Fabian Manhardt
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 110 None
Medical Reasoning with Vision Language Foundation Models
Anas Zafar ⋅ Muhammad Waqas
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ Mile High 2C None
4D Digital Twins: Real-to-Sim-to-Real for Physical AI
Amrita Mazumdar ⋅ Tianye Li
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 506 None
2nd Workshop on 4D Vision: Modeling the Dynamic World
Jiahui Lei ⋅ Shangzhe Wu
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 710 None
Artificial Intelligence for Space
Daniele Gammelli ⋅ Gabriele Meoni
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 105 None
2nd Workshop on GenAI for Storytelling
Andrew Shin ⋅ Yusuke Mori
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ Four Seasons 2 None
Big Model Adaptation In Computer Vision
Yuki Asano ⋅ Anna Kukleva
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 106 None
CVPR 2026 Biometrics Workshop
Bir Bhanu ⋅ Ajay Kumar
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ Mile High 1AB None
Bridging AI and Medical Reality: Computer Vision for Real-world Clinical Translation
Yicheng Wu ⋅ Yutong Xie ⋅ Kai Wang
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 113 None
Computer Vision × Education: Building a Cross‑Community Agenda for Multimodal Vision in Classrooms
Ekta Sood ⋅ Joyces H Fonteles
Workshop
Thu Jun 04 12:00 PM -- 04:45 PM (PDT) @ 709 None
CV4Science: Using Computer Vision for the Sciences
Utkarsh Mall ⋅ Ye Zhu
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 103 None
Domain Generalization: Evolution, Breakthroughs, and Future Horizons (2nd Edition)
Muhammad Haris Khan ⋅ Rishabh Lalla
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 703 None
The 2nd CVPR Workshop on Foundation Models Meet Embodied Agents
Manling Li ⋅ Qineng Wang
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 711 None
The 7th International Workshop on Eye and Gaze in Computer Vision
Yihua Cheng ⋅ Seonwook Park ⋅ Hyung Jin Chang
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 504 None
Eighth Workshop on Image Matching: Local Features and Beyond
Dmytro Mishkin ⋅ Eduard Trulls
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ Mile High 4CD None
1st Workshop on Journey to the Awards: Generative AI for Movie-Grade Video Production (J2A), CVPR 2026
Felix Juefei-Xu ⋅ Stephane Grabl
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ Mile High 3A None
The 2nd Workshop on Multi-Modal Reasoning for Agentic Intelligence
Yijiang Li ⋅ Zhenfei Yin
Workshop
Thu Jun 04 12:00 PM -- 05:00 PM (PDT) @ 203 None
4D World Models: Bridging Generation and Reconstruction
Aayush Prakash ⋅ Aashish Rai
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 102/104 None
Third Workshop on Simulation for Autonomous Driving
Yiyi Liao ⋅ Maximilian Igl
Workshop
Thu Jun 04 12:00 PM -- 04:00 PM (PDT) @ 610/612 None
ScaleBot: The First Workshop on Scalable Robot Learning Systems
Sijin Chen ⋅ Yuxiang Lu
Workshop
Thu Jun 04 12:00 PM -- 04:30 PM (PDT) @ 607 None
The 3rd Workshop on Synthetic Data for Computer Vision
Jieyu Zhang ⋅ Zixian Ma
Workshop
Thu Jun 04 12:15 PM -- 05:00 PM (PDT) @ 705/707 None
Second Workshop on Skilled Activity Understanding, Assessment & Feedback Generation
Paritosh Parmar ⋅ Brendan Morris
Workshop
Thu Jun 04 12:30 PM -- 04:30 PM (PDT) @ 712 None
The Third Workshop on Anomaly Detection with Foundation Models
Kuan-Chuan Peng ⋅ Ying Zhao
Workshop
Thu Jun 04 12:30 PM -- 04:30 PM (PDT) @ Mile High 4AB None
Appearance Understanding and Generation
Elena Garces ⋅ Giuseppe Vecchio
Workshop
Thu Jun 04 12:30 PM -- 03:30 PM (PDT) @ 502 None
Pixel-level Video Understanding in the Wild Challenge
Henghui Ding ⋅ Nikhila Ravi
Workshop
Thu Jun 04 12:30 PM -- 05:00 PM (PDT) @ 601 None
Visual Anomaly and Novelty Detection - 4th Edition
Philipp Seeböck ⋅ Latha Pemula
Workshop
Thu Jun 04 01:00 PM -- 04:40 PM (PDT) @ 108 None
See the World in a Different Light: Physical Appearance Modeling and Relighting in the Age of Generative AI
Xilong Zhou ⋅ Marc Habermann
Workshop
Thu Jun 04 01:00 PM -- 04:30 PM (PDT) @ 704/706 None
6th International Workshop on Long-form Video Understanding, Generation and Action
Mike Zheng Shou ⋅ Gedas Bertasius
Break
Thu Jun 04 02:00 PM -- 03:00 PM (PDT) @ ExHall A None
Coffee Break
Break
Fri Jun 05 06:00 AM -- 08:00 AM (PDT) @ ExHall C None
Breakfast
Registration
Fri Jun 05 06:00 AM -- 04:00 PM (PDT) @ Lobby A None
Registration / Badge Pickup
Remarks
Fri Jun 05 07:30 AM -- 08:00 AM (PDT) @ Bluebird Ballroom None
Welcome & Awards
Poster Setup
Fri Jun 05 07:45 AM -- 08:15 AM (PDT) @ ExHall A None
Poster Setup
Break
Fri Jun 05 08:00 AM -- 08:15 AM (PDT) None
Courtesy Break
Oral
Fri Jun 05 08:15 AM -- 08:27 AM (PDT) @ Four Seasons Ballroom None
Black-box Membership Inference Attacks on the Pre-training Data of Image-generation Models
Tao Qi ⋅ Huili Wang ⋅ Yuanhong Huang ⋅ Wendan Wang ⋅ Lianchao Zhao ⋅ Jinrui Wang ⋅ Zichen Qin ⋅ Shangguang Wang ⋅ Yongfeng Huang
Oral
Fri Jun 05 08:15 AM -- 08:30 AM (PDT) @ Bluebird Ballroom None
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space
Huijie Liu ⋅ Shuhao Cui ⋅ Haoxiang Cao ⋅ Shuai Ma ⋅ Kai Wu ⋅ Guoliang Kang
Oral
Fri Jun 05 08:15 AM -- 08:27 AM (PDT) @ Mile High Ballroom 1A - 2A None
Advancing Image Classification with Discrete Diffusion Classification Modeling
Omer Belhasin ⋅ Shelly Golan ⋅ Ran El-Yaniv ⋅ Michael Elad
Oral
Fri Jun 05 08:15 AM -- 08:27 AM (PDT) @ Mile High Ballroom 3A - 4A None
Customized Fusion: A Closed-Loop Dynamic Network for Adaptive Multi-Task-Aware Infrared-Visible Image Fusion
Zengyi Yang ⋅ Yu Liu ⋅ Juan Cheng ⋅ Zhiqin Zhu ⋅ Yafei Zhang ⋅ Huafeng Li
Oral Session
Fri Jun 05 08:15 AM -- 09:30 AM (PDT) @ Mile High Ballroom 3A - 4A None
Oral Session 1D: Computational Imaging
Oral Session
Fri Jun 05 08:15 AM -- 09:30 AM (PDT) @ Mile High Ballroom 1A - 2A None
Oral Session 1C: Efficient Reasoning
Oral Session
Fri Jun 05 08:15 AM -- 09:30 AM (PDT) @ Four Seasons Ballroom None
Oral Session 1B: Visual Security
Oral Session
Fri Jun 05 08:15 AM -- 09:30 AM (PDT) @ Bluebird Ballroom None
Oral Session 1A: Multimodal Vision
Oral
Fri Jun 05 08:27 AM -- 08:40 AM (PDT) @ Four Seasons Ballroom None
Data Leakage Detection and De-duplication in Large Scale Geospatial Image Datasets
Yeshwanth Kumar Adimoolam ⋅ Charalambos Poullis ⋅ Melinos Averkiou
Oral
Fri Jun 05 08:27 AM -- 08:40 AM (PDT) @ Mile High Ballroom 3A - 4A None
Dual Band Thermal Videography: Separating Time-Varying Reflection and Emission Near Ambient Conditions
Sriram Narayanan ⋅ Mani Ramanagopal ⋅ Srinivasa G. Narasimhan
Oral
Fri Jun 05 08:27 AM -- 08:40 AM (PDT) @ Mile High Ballroom 1A - 2A None
Does YOLO Really Need to See Every Training Image in Every Epoch?
Xingxing Xie ⋅ Jiahua Dong ⋅ Junwei Han ⋅ Gong Cheng
Oral
Fri Jun 05 08:30 AM -- 08:45 AM (PDT) @ Bluebird Ballroom None
ANTS: Adaptive Negative Textual Space Shaping for OOD Detection via Test-Time MLLM Understanding and Reasoning
Wenjie Zhu ⋅ Yabin Zhang ⋅ Xin Jin ⋅ Wenjun Zeng ⋅ Lei Zhang
Oral
Fri Jun 05 08:40 AM -- 08:52 AM (PDT) @ Mile High Ballroom 3A - 4A None
MetaSpectra+: A Compact Broadband Metasurface Camera for Snapshot Hyperspectral+ Imaging
Yuxuan Liu ⋅ Wei Xu ⋅ Qi Guo
Oral
Fri Jun 05 08:40 AM -- 08:52 AM (PDT) @ Four Seasons Ballroom None
RAVEN: Erasing Invisible Watermarks via Novel View Synthesis
Fahad Shamshad ⋅ Nils Lukas ⋅ Karthik Nandakumar
Oral
Fri Jun 05 08:40 AM -- 08:52 AM (PDT) @ Mile High Ballroom 1A - 2A None
Fine-grained Image Aesthetic Assessment: Learning Discriminative Scores from Relative Ranks
Zhichao Yang ⋅ Jianjie Wang ⋅ Zhixianhe Zhang ⋅ Pangu Xie ⋅ Xiangfei Sheng ⋅ Pengfei Chen ⋅ Leida Li
Oral
Fri Jun 05 08:45 AM -- 09:00 AM (PDT) @ Bluebird Ballroom None
ARGUS: Defending Against Multimodal Indirect Prompt Injection via Steering Instruction-Following Behavior
Weikai Lu ⋅ Ziqian Zeng ⋅ Kehua Zhang ⋅ Haoran Li ⋅ Huiping Zhuang ⋅ Ruidong Wang ⋅ Cen Chen ⋅ Hao Peng
Oral
Fri Jun 05 08:52 AM -- 09:05 AM (PDT) @ Mile High Ballroom 1A - 2A None
NuWa: Deriving Lightweight Class-Specific Vision Transformers for Edge Devices
Ziteng Wei ⋅ Qiang He ⋅ Bing Li ⋅ Feifei Chen ⋅ Hai Jin ⋅ Yun Yang
Oral
Fri Jun 05 08:52 AM -- 09:05 AM (PDT) @ Four Seasons Ballroom
LDP-Slicing: Local Differential Privacy for Images via Randomized Bit-Plane Slicing
Yuanming Cao ⋅ Chengqi Li ⋅ Wenbo He
Oral
Fri Jun 05 08:52 AM -- 09:05 AM (PDT) @ Mile High Ballroom 3A - 4A
Spectrum from Defocus: Fast Spectral Imaging with Chromatic Focal Stack
M. Kerem Aydin ⋅ Yi-Chun Hung ⋅ Jaclyn Pytlarz ⋅ Qi Guo ⋅ Emma Alexander
Oral
Fri Jun 05 09:00 AM -- 09:15 AM (PDT) @ Bluebird Ballroom
TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models
Jiaming He ⋅ Guanyu Hou ⋅ Hongwei Li ⋅ Zhicong Huang ⋅ Kangjie Chen ⋅ Yi Yu ⋅ Wenbo Jiang ⋅ Guowen Xu ⋅ Tianwei Zhang
Oral
Fri Jun 05 09:05 AM -- 09:17 AM (PDT) @ Mile High Ballroom 3A - 4A
Towards Photorealistic and Efficient Bokeh Rendering via Diffusion Framework
Linxiao Shi ⋅ Siming Zheng ⋅ Zerong Wang ⋅ Hao Zhang ⋅ Jinwei Chen ⋅ Bo Li ⋅ Shifeng Chen ⋅ Peng-Tao Jiang
Oral
Fri Jun 05 09:05 AM -- 09:17 AM (PDT) @ Mile High Ballroom 1A - 2A
Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species
Jinyu Xu ⋅ Tianqi Hu ⋅ Xiaonan Hu ⋅ Letian Zhou ⋅ Songliang Cao ⋅ Meng Zhang ⋅ Hao Lu
Oral
Fri Jun 05 09:05 AM -- 09:17 AM (PDT) @ Four Seasons Ballroom
NOWA: Null-space Optical Watermark for Invisible Capture Fingerprinting and Tamper Localization
Edwin Vargas ⋅ Jhon Lopez ⋅ Henry Arguello ⋅ Ashok Veeraraghavan
Poster Setup
Fri Jun 05 09:15 AM -- 09:45 AM (PDT) @ ExHall A
Poster Setup
Oral
Fri Jun 05 09:15 AM -- 09:30 AM (PDT) @ Bluebird Ballroom
ViT^3: Unlocking Test-Time Training in Vision
Dongchen Han ⋅ Yining Li ⋅ Tianyu Li ⋅ Zixuan Cao ⋅ Ziming Wang ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Gao Huang
Oral
Fri Jun 05 09:17 AM -- 09:30 AM (PDT) @ Mile High Ballroom 1A - 2A
Rethinking Dataset Distillation: Hard Truths about Soft Labels
Priyam Dey ⋅ Aditya Sahdev ⋅ Sunny Bhati ⋅ Konda Reddy Mopuri ⋅ R. Venkatesh Babu
Oral
Fri Jun 05 09:17 AM -- 09:30 AM (PDT) @ Mile High Ballroom 3A - 4A
UnReflectAnything: RGB-Only Highlight Removal by Rendering Synthetic Specular Supervision
Alberto Rota ⋅ Mert Kiray ⋅ Mert Asim Karaoglu ⋅ Patrick Ruhkamp ⋅ Elena De Momi ⋅ Nassir Navab ⋅ Benjamin Busam
Oral
Fri Jun 05 09:17 AM -- 09:30 AM (PDT) @ Four Seasons Ballroom
Revisiting Geometric Obfuscation with Dual Convergent Lines for Privacy-Preserving Image Queries in Visual Localization
Jeonggon Kim ⋅ Heejoon Moon ⋅ Je Hyeong Hong
Break
Fri Jun 05 09:45 AM -- 10:30 AM (PDT) @ ExHall F
Coffee
Demonstration
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall F
Demos
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 1
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space
Huijie Liu ⋅ Shuhao Cui ⋅ Haoxiang Cao ⋅ Shuai Ma ⋅ Kai Wu ⋅ Guoliang Kang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 2
Adversarial Style Optimization: Enhancing VLM Jailbreaks by GRPO-based Stylistic Triggers Optimization
Bingjun Luo ⋅ Jialin Guo ⋅ Yue Yao ⋅ Xinpeng Ding
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 3
ANTS: Adaptive Negative Textual Space Shaping for OOD Detection via Test-Time MLLM Understanding and Reasoning
Wenjie Zhu ⋅ Yabin Zhang ⋅ Xin Jin ⋅ Wenjun Zeng ⋅ Lei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 4
ARGUS: Defending Against Multimodal Indirect Prompt Injection via Steering Instruction-Following Behavior
Weikai Lu ⋅ Ziqian Zeng ⋅ Kehua Zhang ⋅ Haoran Li ⋅ Huiping Zhuang ⋅ Ruidong Wang ⋅ Cen Chen ⋅ Hao Peng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 5
TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models
Jiaming He ⋅ Guanyu Hou ⋅ Hongwei Li ⋅ Zhicong Huang ⋅ Kangjie Chen ⋅ Yi Yu ⋅ Wenbo Jiang ⋅ Guowen Xu ⋅ Tianwei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 6
ViT^3: Unlocking Test-Time Training in Vision
Dongchen Han ⋅ Yining Li ⋅ Tianyu Li ⋅ Zixuan Cao ⋅ Ziming Wang ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Gao Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 7
Black-box Membership Inference Attacks on the Pre-training Data of Image-generation Models
Tao Qi ⋅ Huili Wang ⋅ Yuanhong Huang ⋅ Wendan Wang ⋅ Lianchao Zhao ⋅ Jinrui Wang ⋅ Zichen Qin ⋅ Shangguang Wang ⋅ Yongfeng Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 8
Data Leakage Detection and De-duplication in Large Scale Geospatial Image Datasets
Yeshwanth Kumar Adimoolam ⋅ Charalambos Poullis ⋅ Melinos Averkiou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 9
RAVEN: Erasing Invisible Watermarks via Novel View Synthesis
Fahad Shamshad ⋅ Nils Lukas ⋅ Karthik Nandakumar
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 10
LDP-Slicing: Local Differential Privacy for Images via Randomized Bit-Plane Slicing
Yuanming Cao ⋅ Chengqi Li ⋅ Wenbo He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 11
NOWA: Null-space Optical Watermark for Invisible Capture Fingerprinting and Tamper Localization
Edwin Vargas ⋅ Jhon Lopez ⋅ Henry Arguello ⋅ Ashok Veeraraghavan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 12
Revisiting Geometric Obfuscation with Dual Convergent Lines for Privacy-Preserving Image Queries in Visual Localization
Jeonggon Kim ⋅ Heejoon Moon ⋅ Je Hyeong Hong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 13
Advancing Image Classification with Discrete Diffusion Classification Modeling
Omer Belhasin ⋅ Shelly Golan ⋅ Ran El-Yaniv ⋅ Michael Elad
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 14
Does YOLO Really Need to See Every Training Image in Every Epoch?
Xingxing Xie ⋅ Jiahua Dong ⋅ Junwei Han ⋅ Gong Cheng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 15
Fine-grained Image Aesthetic Assessment: Learning Discriminative Scores from Relative Ranks
Zhichao Yang ⋅ Jianjie Wang ⋅ Zhixianhe Zhang ⋅ Pangu Xie ⋅ Xiangfei Sheng ⋅ Pengfei Chen ⋅ Leida Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 16
NuWa: Deriving Lightweight Class-Specific Vision Transformers for Edge Devices
Ziteng Wei ⋅ Qiang He ⋅ Bing Li ⋅ Feifei Chen ⋅ Hai Jin ⋅ Yun Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 17
Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species
Jinyu Xu ⋅ Tianqi Hu ⋅ Xiaonan Hu ⋅ Letian Zhou ⋅ Songliang Cao ⋅ Meng Zhang ⋅ Hao Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 18
Rethinking Dataset Distillation: Hard Truths about Soft Labels
Priyam Dey ⋅ Aditya Sahdev ⋅ Sunny Bhati ⋅ Konda Reddy Mopuri ⋅ R. Venkatesh Babu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 19
Customized Fusion: A Closed-Loop Dynamic Network for Adaptive Multi-Task-Aware Infrared-Visible Image Fusion
Zengyi Yang ⋅ Yu Liu ⋅ Juan Cheng ⋅ Zhiqin Zhu ⋅ Yafei Zhang ⋅ Huafeng Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 20
Dual Band Thermal Videography: Separating Time-Varying Reflection and Emission Near Ambient Conditions
Sriram Narayanan ⋅ Mani Ramanagopal ⋅ Srinivasa G. Narasimhan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 21
MetaSpectra+: A Compact Broadband Metasurface Camera for Snapshot Hyperspectral+ Imaging
Yuxuan Liu ⋅ Wei Xu ⋅ Qi Guo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 22
Spectrum from Defocus: Fast Spectral Imaging with Chromatic Focal Stack
M. Kerem Aydin ⋅ Yi-Chun Hung ⋅ Jaclyn Pytlarz ⋅ Qi Guo ⋅ Emma Alexander
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 23
Towards Photorealistic and Efficient Bokeh Rendering via Diffusion Framework
Linxiao Shi ⋅ Siming Zheng ⋅ Zerong Wang ⋅ Hao Zhang ⋅ Jinwei Chen ⋅ Bo Li ⋅ Shifeng Chen ⋅ Peng-Tao Jiang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 24
UnReflectAnything: RGB-Only Highlight Removal by Rendering Synthetic Specular Supervision
Alberto Rota ⋅ Mert Kiray ⋅ Mert Asim Karaoglu ⋅ Patrick Ruhkamp ⋅ Elena De Momi ⋅ Nassir Navab ⋅ Benjamin Busam
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 25
AVGGT: Rethinking Global Attention for Accelerating VGGT
Xianbing Sun ⋅ Zhikai Zhu ⋅ Zhengyu Lou ⋅ Bo Yang ⋅ Jinyang Tang ⋅ Liqing Zhang ⋅ He Wang ⋅ Jianfu Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 26
ManifoldNeuS: Manifold-aware View Optimizability for Pose-Free Neural Surface Reconstruction
Xinxin Liu ⋅ Xue Wang ⋅ Guoqing Zhou ⋅ Qing Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 27
LongStream: Long-Sequence Streaming Autoregressive Visual Geometry
Chong Cheng ⋅ Xianda Chen ⋅ Tao Xie ⋅ Wei Yin ⋅ Weiqiang Ren ⋅ Qian Zhang ⋅ Xiaoyang Guo ⋅ Hao Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 28
RPGFusion: 4D Radar Prior-Guided Multi-Modal Fusion for 3D Detection
Xin Qiu ⋅ Wenjie Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 29
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
Chenguo Lin ⋅ Yuchen Lin ⋅ Panwang Pan ⋅ Yifan Yu ⋅ Tao Hu ⋅ Honglei Yan ⋅ Katerina Fragkiadaki ⋅ Yadong Mu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 30
JRM: Joint Reconstruction Model for Multiple Objects without Alignment
Qirui Wu ⋅ Mohd Yawar Nihal Siddiqui ⋅ Duncan Frost ⋅ Samir Aroudj ⋅ Armen Avetisyan ⋅ Richard Newcombe ⋅ Angel Xuan Chang ⋅ Jakob Engel ⋅ Henry Howard-Jenkins
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 31
Inferring Compositional 4D Scenes without Ever Seeing One
Ahmet Berke Gökmen ⋅ Ajad Chhatkuli ⋅ Luc Van Gool ⋅ Danda Paudel
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 32
FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation
Chenhan Jiang ⋅ Yu Chen ⋅ Qingwen Zhang ⋅ Jifei Song ⋅ Songcen Xu ⋅ Dit-Yan Yeung ⋅ Jiankang Deng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 33
Complet4R: Geometric Complete 4D Reconstruction
Weibang Wang ⋅ Kenan Li ⋅ Zhuoguang Chen ⋅ Yijun Yuan ⋅ Hang Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 34
Unblur-SLAM: Dense Neural SLAM for Blurry Inputs
Qi Zhang ⋅ Denis Rozumny ⋅ Francesco Girlanda ⋅ Sezer Karaoglu ⋅ Marc Pollefeys ⋅ Theo Gevers ⋅ Martin R. Oswald
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 35
Learning Compact 3D Representations from Feed-Forward Novel View Synthesis
Honggyu An ⋅ Jaewoo Jung ⋅ Mungyeom Kim ⋅ Chaehyun Kim ⋅ Minkyeong Jeon ⋅ Jisang Han ⋅ Kazumi Fukuda ⋅ Takuya Narihira ⋅ Hyunah Ko ⋅ Junsu Kim ⋅ Sunghwan Hong ⋅ Yuki Mitsufuji ⋅ Seungryong Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 36
Fast Spatial Tracking with Visual Geometry Transformer
Chengjie Huang ⋅ Guile Wu ⋅ Dongfeng Bai ⋅ Bingbing Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 37
How Much 3D Do Video Foundation Models Encode?
Zixuan Huang ⋅ Xiang Li ⋅ Zhaoyang Lv ⋅ James M.
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 38
MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes
Kehua Chen ⋅ Tianlu Mao ⋅ Xinzhu Ma ⋅ Hao Jiang ⋅ Zehao Li ⋅ Zihan Liu ⋅ Shuqin Gao ⋅ Honglong Zhao ⋅ Feng Dai ⋅ Yucheng Zhang ⋅ Zhaoqi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 39
RnG: A Unified Transformer for Complete 3D Modeling from Partial Observations
Mochu Xiang ⋅ Zhelun Shen ⋅ Xuesong Li ⋅ Jiahui Ren ⋅ Jing Zhang ⋅ Chen Zhao ⋅ Shanshan Liu ⋅ Haocheng Feng ⋅ Jingdong Wang ⋅ Yuchao Dai
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 40
Long-Tail Internet Photo Reconstruction
Yuan Li ⋅ Yuanbo Xiangli ⋅ Hadar Averbuch-Elor ⋅ Noah Snavely ⋅ Ruojin Cai
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 41
Emergent Outlier View Rejection in Visual Geometry Grounded Transformers
Jisang Han ⋅ Sunghwan Hong ⋅ Jaewoo Jung ⋅ Wooseok Jang ⋅ Honggyu An ⋅ Qianqian Wang ⋅ Seungryong Kim ⋅ Chen Feng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 42
Flow3r: Factored Flow Prediction for Scalable Visual Geometry Learning
Zhongxiao Cong ⋅ Qitao Zhao ⋅ Minsik Jeon ⋅ Shubham Tulsiani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 43
MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation
Yuta Oshima ⋅ Daiki Miyake ⋅ Kohsei Matsutani ⋅ Yusuke Iwasawa ⋅ Masahiro Suzuki ⋅ Yutaka Matsuo ⋅ Hiroki Furuta
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 44
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
Yihao Meng ⋅ Hao Ouyang ⋅ Yue Yu ⋅ Qiuyu Wang ⋅ Wen Wang ⋅ Ka Leong Cheng ⋅ Hanlin Wang ⋅ Shuailei Ma ⋅ Yixuan Li ⋅ Chen Cheng ⋅ Yanhong Zeng ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Huamin Qu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 45
Design Your Ad: Personalized Advertising Image and Text Generation with Unified Autoregressive Models
Yexing Xu ⋅ Wei Feng ⋅ Shen Zhang ⋅ Haohan Wang ⋅ Yuxin Qin ⋅ Yaoyu Li ⋅ Ao Ma ⋅ Yuhao Luo ⋅ Lu Wang ⋅ Xudong Ren ⋅ Haoran Wang ⋅ Run Ling ⋅ Zheng Zhang ⋅ Jingjing Lv ⋅ Junjie Shen ⋅ Ching Law ⋅ Longguang Wang ⋅ Yulan Guo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 46
SketchDeco: Training-Free Latent Composition for Precise Sketch Colourisation
Chaitat Utintu ⋅ Yi-Zhe Song
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 47
ConsistCompose: Unified Multimodal Layout Control for Image Composition
Xuanke Shi ⋅ Boxuan Li ⋅ Xiaoyang Han ⋅ Zhongang Cai ⋅ Lei Yang ⋅ Quan Wang ⋅ Dahua Lin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 48
A Training-Free Style-Personalization via SVD-Based Feature Decomposition
Kyoungmin Lee ⋅ Jihun Park ⋅ Jongmin Gim ⋅ Wonhyeok Choi ⋅ Kyumin Hwang ⋅ Jaeyeul Kim ⋅ Sunghoon Im
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 49
Beyond Patches: Global-aware Autoregressive Model for Multimodal Few-Shot Font Generation
Haonan Cai ⋅ Yuxuan Luo ⋅ Zhouhui Lian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 50
ImageRAGTurbo: Towards One-step Text-to-Image Generation with Retrieval-Augmented Diffusion Models
Peijie Qiu ⋅ Hariharan Ramshankar ⋅ Arnau Ramisa ⋅ Amit C C ⋅ Rene Vidal ⋅ Vamsi Salaka ⋅ Rahul Bhagat
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 51
OmniSonic: Towards Universal and Holistic Audio Generation from Video and Text
Weiguo Pian ⋅ Saksham Singh Kushwaha ⋅ Zhimin Chen ⋅ Shijian Deng ⋅ Kai Wang ⋅ Yunhui Guo ⋅ Yapeng Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 52
Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation
Shubhankar Borse ⋅ Phuc Pham ⋅ Farzad Farhadzadeh ⋅ Seokeon Choi ⋅ Phong Nguyen ⋅ Anh Tran ⋅ Sungrack Yun ⋅ Munawar Hayat ⋅ Fatih Porikli
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 53
Curriculum Group Policy Optimization: Adaptive Sampling for Unleashing the Potential of Text-to-Image Generation
Baoteng Li ⋅ Xianghao Zang ⋅ Xinran Wang ⋅ Xiangyu Na ⋅ Zhixiang He ⋅ Hao Sun ⋅ Chi Zhang ⋅ Zhongjiang He ⋅ Tianwei Cao ⋅ Kongming Liang ⋅ Zhanyu Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 54
SplitFlux: Learning to Decouple Content and Style from a Single Image
Yitong Yang ⋅ Yinglin Wang ⋅ Changshuo Wang ⋅ Yongjun Zhang ⋅ Ziyang Chen ⋅ Shuting He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 55
FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation
Wuyang Luo ⋅ Chengkai Tan ⋅ Chang Ge ⋅ Binye Hong ⋅ Su Yang ⋅ Yongjiu Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 56
EmoStyle: Emotion-Driven Image Stylization
Jingyuan Yang ⋅ Zihuan Bai ⋅ Hui Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 57
Text-Image Conditioned 3D Generation
Jiazhong Cen ⋅ Jiemin Fang ⋅ Sikuang Li ⋅ Guanjun Wu ⋅ Chen Yang ⋅ Taoran Yi ⋅ Zanwei Zhou ⋅ Zhikuan Bao ⋅ Lingxi Xie ⋅ Wei Shen ⋅ Qi Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 58
IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator–Critic Framework
Feiyu Wang ⋅ Jiayuan Yang ⋅ Zhiyuan Zhao ⋅ Da Zhang ⋅ Bingyu Li ⋅ Peng Liu ⋅ Junyu Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 59
AnyDoc: Enhancing Document Generation via Large-Scale HTML/CSS Data Synthesis and Height-Aware Reinforcement Optimization
Jiawei Lin ⋅ Wanrong Zhu ⋅ Vlad I Morariu ⋅ Christopher Tensmeyer
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 60
Reasoning Diffusion for Unpaired Test Time Out-of-distribution Text-Image to Video Generation
Zirui Pan ⋅ Xin Wang ⋅ Yipeng Zhang ⋅ Hong Chen ⋅ Kecheng Zheng ⋅ Wenwu Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 61
SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation
Sashuai Zhou ⋅ Qiang Zhou ⋅ Ma Junpeng ⋅ Yue Cao ⋅ Ruofan Hu ⋅ Ziang Zhang ⋅ Xiaoda Yang ⋅ Zhibin Wang ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Zhou Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 62
STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative
Peixuan Zhang ⋅ Zijian Jia ⋅ Kaiqi Liu ⋅ Shuchen Weng ⋅ Si Li ⋅ Boxin Shi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 63
MTA: Multimodal Task Alignment for BEV Perception and Captioning
Yunsheng Ma ⋅ Burhan Yaman ⋅ Xin Ye ⋅ Jingru Luo ⋅ Feng Tao ⋅ Abhirup Mallik ⋅ Ziran Wang ⋅ Liu Ren
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 64
β-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
Fatimah Zohra ⋅ Chen Zhao ⋅ Hani Itani ⋅ Bernard Ghanem
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 65
SafeRoPE: Risk-specific Head-wise Embedding Rotation for Safe Generation in Rectified Flow Transformers
Xiang Yang ⋅ Feifei Li ⋅ Mi Zhang ⋅ Geng Hong ⋅ Xiaoyu You ⋅ Min Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 66
FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Alignment
Myunsoo Kim ⋅ Seong-Woong Shim ⋅ Byung-Jun Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 67
Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos
Yicheng Feng ⋅ Wanpeng Zhang ⋅ Ye Wang ⋅ Hao Luo ⋅ Haoqi Yuan ⋅ Sipeng Zheng ⋅ Zongqing Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 68
Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning
Kaichen He ⋅ Zihao Wang ⋅ Muyao Li ⋅ Anji Liu ⋅ Yitao Liang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 69
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Yurun Chen ⋅ Xueyu Hu ⋅ Yuhan Liu ⋅ Ziqi Wang ⋅ Zeyi Liao ⋅ Lin Chen ⋅ Feng Wei ⋅ Yuxi Qian ⋅ Bo Zheng ⋅ Keting Yin ⋅ Shengyu Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 70
EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models
Yiyang Fang ⋅ Wenke Huang ⋅ Pei Fu ⋅ Yihao Yang ⋅ Kehua Su ⋅ Zhenbo Luo ⋅ Jian Luan ⋅ Mang Ye
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 71
EvoGraph-R1: Self-Evolving Multimodal Knowledge Hypergraphs for Agentic Retrieval
Jiashi Lin ⋅ Changhong Jiang ⋅ Xiangru Lin ⋅ Ruifei Zhang ⋅ Xinyi Zhu ⋅ Jiyao Liu ⋅ Cheng Tang ⋅ Ye Du ⋅ Shujian Gao ⋅ Junzhi Ning ⋅ Lihao Liu ⋅ Ziyan Huang ⋅ Tianbin Li ⋅ Jin Ye ⋅ Junjun He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 72
Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning
Haonan Jia ⋅ Shichao Dong ⋅ Xin Dong ⋅ Zenghui Sun ⋅ Jin Wang ⋅ Jinsong Lan ⋅ Xiaoyong Zhu ⋅ Bo Zheng ⋅ Kaifu Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 73
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models
Mark Endo ⋅ Serena Yeung
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 74
Stabilizing Feature Geometry in Noisy Pretrained Models for Robust Downstream Tasks
Quanyu Zhang ⋅ Zhongyi Han ⋅ Hao Sun ⋅ Yongshun Gong ⋅ Xiaoyan Wang ⋅ Yilong Yin ⋅ Shuo Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 75
Black-Box Domain Adaptation for Object Detection with Retention-Driven Knowledge Compression
Yuwu Lu ⋅ Chunzhi Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 76
Decoupled and Reusable Adaptation for Efficient Cross-Modal Transfer
Yajing Liu ⋅ Yumeng Zhang ⋅ Yue Si ⋅ Baojie Fan ⋅ Jiandong Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 77
Preference-Aligned LoRA Merging: Preserving Subspace Coverage and Addressing Directional Anisotropy
Wooseong Jeong ⋅ Wonyoung Lee ⋅ Kuk-Jin Yoon
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 78
Curvature-Aware Zeroth-Order Optimization for Memory-Efficient Test-Time Adaptation
Junming Zhang ⋅ Shuyu Yin ⋅ Peilin Liu ⋅ Rendong Ying ⋅ Fei Wen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 79
Label-Free Cross-Task LoRA Merging with Null-Space Compression
Wonyoung Lee ⋅ Wooseong Jeong ⋅ Kuk-Jin Yoon
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 80
Basis-Oriented Low-rank Transfer for Few-Shot and Test-Time Adaptation
Junghwan Park ⋅ Woojin Cho ⋅ Junhyuk Heo ⋅ Darongsae Kwon ⋅ Kookjin Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 81
GeCo: Geometry-Consistent Regularization for Domain Generalized Semantic Segmentation
Qi Zang ⋅ Dong Zhao ⋅ Nan Pu ⋅ Wenjing Li ⋅ Zhun Zhong ⋅ Meng Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 82
Event-based Motion Deblurring with Unpaired Data
Hoonhee Cho ⋅ Yuhwan Jeong ⋅ Kuk-Jin Yoon
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 83
Stable Spike: Dual Consistency Optimization via Bitwise AND Operations for Spiking Neural Networks
Yongqi Ding ⋅ Kunshan Yang ⋅ Linze Li ⋅ Yiyang Zhang ⋅ Mengmeng Jing ⋅ Lin Zuo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 84
Event-based Visual Deformation Measurement
Yuliang Wu ⋅ Wei Zhai ⋅ Yuxin Cui ⋅ Tiesong Zhao ⋅ Yang Cao ⋅ Zheng-Jun Zha
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 85
Bidirectional Cross-Modal Prompting for Event-Frame Asymmetric Stereo
Ninghui Xu ⋅ Fabio Tosi ⋅ Lihui Wang ⋅ Jiawei Han ⋅ Luca Bartolomei ⋅ Zhiting Yao ⋅ Matteo Poggi ⋅ Stefano Mattoccia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 86
SpikeTrack: High-performance and Energy-efficient Event-Based Object Tracking with Spiking Neural Network
Yang Wang ⋅ Jiqing Zhang ⋅ Chuanyu Sun ⋅ Qianhui Liu ⋅ Huilin Ge ⋅ Ziqi Wei ⋅ Xin Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 87
Event Structural Valley: A Unified Theoretical and Practical Framework for Event Camera Autofocus
Xijie Xiang ⋅ Lin Zhu ⋅ Wei Zhang ⋅ Yonghong Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 88
Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios
Zhipeng Sui ⋅ Haiqing Hao ⋅ Weihua He ⋅ Seng-Hong Lee ⋅ Wenhui Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 89
Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control
Zhe Li ⋅ Cheng Chi ⋅ Yangyang Wei ⋅ Boan Zhu ⋅ Tao Huang ⋅ Zhenguo Sun ⋅ Yibo Peng ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Fangzhou Liu ⋅ Chang Xu ⋅ Shanghang Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 90
CLaD: Planning with Grounded Foresight via Cross-Modal Latent Dynamics
Andrew Jeong ⋅ Jaemin Kim ⋅ Sebin Lee ⋅ Sung-Eui Yoon
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 91
InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy
Yang Tian ⋅ Yuyin Yang ⋅ Yiman Xie ⋅ Zetao Cai ⋅ Xu Shi ⋅ Ning Gao ⋅ Hangxu Liu ⋅ Xuekun Jiang ⋅ Zherui Qiu ⋅ Feng Yuan ⋅ Yaping Li ⋅ Ping Wang ⋅ Junhao Cai ⋅ Jia Zeng ⋅ Hao Dong ⋅ Jiangmiao Pang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 92
DemoFunGrasp: Universal Dexterous Functional Grasping via Demonstration-Editing Reinforcement Learning
Chuan Mao ⋅ Haoqi Yuan ⋅ Ziye Huang ⋅ Chaoyi Xu ⋅ Kai Ma ⋅ Zongqing Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 93
GeniNav: Generative Model Driven Image-Goal Navigation via Imagination-Guided Consistency Flow Matching
Yuqi Chen ⋅ Junjie Gao ⋅ Yongzhou Pan ⋅ Siyuan Song ⋅ Zixuan Zhang ⋅ Jiaping Xiao ⋅ Mir Feroskhan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 94
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
Pingrui Zhang ⋅ Yifei Su ⋅ Pengyuan Wu ⋅ Dong An ⋅ Li Zhang ⋅ Zhigang Wang ⋅ Dong Wang ⋅ Bin Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 95
DRAMA: Next-Gen Dynamic Orchestration for Resilient Multi-Agent Ecosystems in Flux
Xinkui Zhao ⋅ Yifan Zhang ⋅ Sai Liu ⋅ Naibo Wang ⋅ Guanjie Cheng ⋅ Yueshen Xu ⋅ Chang Liu ⋅ Shuiguang Deng ⋅ Jianwei Yin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 96
Arcadia: Toward a Full-Lifecycle Framework for Embodied Lifelong Learning
Minghe Gao ⋅ Juncheng Li ⋅ Yuze Lin ⋅ Xuqi Liu ⋅ Jiaming Ji ⋅ Xiaoran Pan ⋅ Zihan Xu ⋅ Xian Li ⋅ Mingjie Li ⋅ Wei Ji ⋅ Rong Wei ⋅ Rui Tang ⋅ Qizhou Wang ⋅ Kai Shen ⋅ Jun Xiao ⋅ Qi Wu ⋅ Siliang Tang ⋅ Yueting Zhuang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 97
Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI
Xinhao Liu ⋅ Jiaqi Li ⋅ Youming Deng ⋅ Ruxin Chen ⋅ Yingjia Zhang ⋅ Yifei Ma ⋅ Li Guo ⋅ Yiming Li ⋅ Jing Zhang ⋅ Chen Feng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 98
ORV: 4D Occupancy-centric Robot Video Generation
Xiuyu Yang ⋅ Bohan Li ⋅ Shaocong Xu ⋅ Nan Wang ⋅ Chongjie Ye ⋅ Zhaoxi Chen ⋅ Minghan Qin ⋅ Yikang Ding ⋅ Zheng Zhu ⋅ Xin Jin ⋅ Hang Zhao ⋅ Hao Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 99
DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning
Junha Lee ⋅ Eunha Park ⋅ Minsu Cho
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 100
Language-Free Generative Editing from One Visual Example
Omar Elezabi ⋅ Eduard Zamfir ⋅ Zongwei Wu ⋅ Radu Timofte
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 101
Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models
Yujia Yang ⋅ Yuanxiang Wang ⋅ Zhenyu Guan ⋅ Tiankun Yang ⋅ Chenxi Bao ⋅ Haopeng Jin ⋅ Jinwen Luo ⋅ Xinyu Zuo ⋅ Lisheng Duan ⋅ Haijin Liang ⋅ Jin Ma ⋅ Xinming Wang ⋅ Ruiwen Tao ⋅ Hongzhu Yi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 102
LuxRemix: Lighting Decomposition and Remixing for Indoor Scenes
Ruofan Liang ⋅ Norman Müller ⋅ Ethan Weber ⋅ Duncan Zauss ⋅ Nandita Vijaykumar ⋅ Peter Kontschieder ⋅ Christian Richardt
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 103
CompBench: Benchmarking Complex Instruction-guided Image Editing
Bohan Jia ⋅ Wenxuan Huang ⋅ Yuntian Tang ⋅ Junbo Qiao ⋅ Jincheng Liao ⋅ Shaosheng Cao ⋅ Fei Zhao ⋅ Zhaopeng Feng ⋅ Zhouhong Gu ⋅ Zhenfei Yin ⋅ Lei Bai ⋅ Wanli Ouyang ⋅ Lin Chen ⋅ Fei Zhao ⋅ Zihan Wang ⋅ Yuan Xie ⋅ Shaohui Lin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 104
Garments2Look: A Multi-Reference Dataset for High-Fidelity Outfit-Level Virtual Try-On with Clothing and Accessories
Junyao Hu ⋅ Zhongwei Cheng ⋅ Waikeung Wong ⋅ Xingxing Zou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 105
Learning Personalized Photographic Style from Pairwise User Preferences
Jinwoo Kim ⋅ Jihye Yoo ⋅ Seon Joo Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 106
CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing
Yan Li ⋅ Lin Liu ⋅ Xiaopeng Zhang ⋅ Wei Xue ⋅ Wenhan Luo ⋅ Yike Guo ⋅ Qi Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 107
Efficient Weighted Sampling via Score-based Generative Models
Heasung Kim ⋅ Taekyun Lee ⋅ Hyeji Kim ⋅ Gustavo De Veciana
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 108
MOSAIC-GS: Monocular Scene Reconstruction via Advanced Initialization for Complex Dynamic Environments
Svitlana Morkva ⋅ Vaishakh Patil ⋅ Alessio Tonioni ⋅ Michael Oechsle ⋅ Maximum Wilder-Smith ⋅ Marco Hutter
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 109
REArtGS++: Generalizable Articulation Reconstruction with Temporal Geometry Constraint via Planar Gaussian Splatting
Di Wu ⋅ Liu Liu ⋅ Anran Huang ⋅ 玉研 刘 ⋅ Qiaojun Yu ⋅ Shaofan Liu ⋅ Liangtu Song ⋅ Cewu Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 110
Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer
Dong In Lee ⋅ Hyungjun Doh ⋅ Seunggeun Chi ⋅ Runlin Duan ⋅ Sangpil Kim ⋅ Karthik Ramani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 111
FaithFusion: Harmonizing Reconstruction and Generation via Pixel-wise Information Gain
YuAn Wang ⋅ Xiaofan Li ⋅ Chi Huang ⋅ Wenhao Zhang ⋅ Hao Li ⋅ Bosheng Wang ⋅ Xun Sun ⋅ Jun Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 112
IR-HGP: Physically-Aware Gaussian Inverse Rendering for High-Illumination Scenes via Generative Priors
Qingan Zhang ⋅ Wensheng Li ⋅ Chengying Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 113
Seeing through boxes: Non-Line-of-Sight 3D Reconstruction from Radar Signals
Jiachen Lu ⋅ Hailan Shanbhag ⋅ Haitham Al Hassanieh
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 114
Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists
Jiaqi Liu ⋅ Zhizhong Han
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 115
DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum
Yaokun Li ⋅ Lihe Ding ⋅ Xiao Chen ⋅ Guang Tan ⋅ Tianfan Xue
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 116
WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments
Xuweiyi Chen ⋅ Wentao Zhou ⋅ Zezhou Cheng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 117
DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
Xiaoxue Chen ⋅ Ziyi Xiong ⋅ Yuantao Chen ⋅ Gen Li ⋅ Nan Wang ⋅ Hongcheng Luo ⋅ Long Chen ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Guang Chen ⋅ Hongyang Li ⋅ Ya-Qin Zhang ⋅ Hangjun Ye ⋅ Hao Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 118
Retrieve-to-Restore: Efficient All-in-One Image Restoration with a Retrieval-Based Degradation Bank
Chenxu Wang ⋅ Kai Zhang ⋅ Jian Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 119
MRI Contrast Enhancement Kinetics World Model
Jindi Kong ⋅ Yuting He ⋅ Cong Xia ⋅ Rongjun Ge ⋅ Shuo Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 120
ReflexSplit: Single Image Reflection Separation via Layer Fusion-Separation
Chia-Ming Lee ⋅ Yu-Fan Lin ⋅ Jin-Hui Jiang ⋅ Yu-Jou Hsiao ⋅ Chih-Chung Hsu ⋅ Yu-Lun Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 121
Rethinking Knowledge Transfer in Image Quality Assessment: A Perceptual Preference Structure Alignment Perspective
Aobo Li ⋅ Jinjian Wu ⋅ Yongxu Liu ⋅ Jupo Ma ⋅ Weisheng Dong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 122
ZeroIDIR: Zero-Reference Illumination Degradation Image Restoration with Perturbed Consistency Diffusion Models
Hai Jiang ⋅ Zhen Liu ⋅ Yinjie Lei ⋅ Songchen Han ⋅ Bing Zeng ⋅ Shuaicheng Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 123
White-Balance First, Adjust Later: Cross-Camera Color Constancy via Vision-Language Evaluation
Shuwei Li ⋅ Lei Tan ⋅ Robby T. Tan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 124
Unpaired Image Deraining Using Reward-Guided Self-Reinforcement Strategy
Yinghao Chen ⋅ Yeying Jin ⋅ Xiang Chen ⋅ Yanyan Wei ⋅ Ziyang Yan ⋅ Yaowen Fu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 125
LF-BVN: Blind-View Network for Self-Supervised Light Field Denoising
Longzhao Guo ⋅ shuo zhang ⋅ Chen Gao ⋅ Qian Tian ⋅ Youfang Lin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 126
rPPG-VQA: A Video Quality Assessment Framework for Unsupervised rPPG Training
Tianyang Dai ⋅ Ming Chang ⋅ Yan Chen ⋅ Yang Hu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 127
Efficient Real-Time Raw-to-Raw Denoising for Extreme Low-Light Ultra HD Video on Mobile Devices
Charantej Reddy Pochimireddy ⋅ Subhasmita Sahoo ⋅ Apoorva Verma ⋅ Palavalli Shyam ⋅ Swapnil Malviya ⋅ Sarvesh Sarvesh ⋅ Raj Narayana Gadde
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 128
Towards Generalized Representations for Low-Light Understanding: When Signal Constancy Meets Semantic Enrichment
Yifan Li ⋅ Haofeng Huang ⋅ Wenhan Yang ⋅ Jiaying Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 129
Synergistic Bleeding Region and Point Detection in Laparoscopic Surgical Videos
Jialun Pei ⋅ Zhangjun Zhou ⋅ Diandian Guo ⋅ Zhixi Li ⋅ Jing Qin ⋅ Bo Du ⋅ Pheng-Ann Heng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 130
MedCLIPSeg: Probabilistic Vision-Language Adaptation for Data-Efficient and Generalizable Medical Image Segmentation
Taha Koleilat ⋅ Hojat Asgariandehkordi ⋅ Omid Nejatimanzari ⋅ Berardino Barile ⋅ Yiming Xiao ⋅ Hassan Rivaz
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 131
AD-GBC: Anisotropic Granular-Ball Skip-Connection Refiner for UNet-Based Medical Image Segmentation
Xiya Shen ⋅ Qinglin Zhao ⋅ Li Feng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 132
OSA: Echocardiography Video Segmentation via Orthogonalized State Update and Anatomical Prior-aware Feature Enhancement
Rui Wang ⋅ Huisi Wu ⋅ Jing Qin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 133
VesMamba: 3D Pulmonary Vessel Segmentation from CT images via Mamba with Structural Perception and Scale-aware Filtering
Zhipeng Liu ⋅ Guilian Chen ⋅ Zheng Jiang ⋅ Huisi Wu ⋅ Jing Qin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 134
SemiGDA: Generative Dual-distribution Alignment for Semi-Supervised Medical Image Segmentation
kaiwen Huang ⋅ Yi Zhou ⋅ Yizhe Zhang ⋅ Jingxiong Li ⋅ Tao Zhou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 135
Diffusion-Based Native Adversarial Synthesis for Enhanced Medical Segmentation Generalization
Hongyu Zhang ⋅ Haipeng Chen ⋅ Zhimin Xu ⋅ Chengxin Yang ⋅ Yingda Lyu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 136
CG-Reasoner: Centroid-Guided Positional Reasoning Segmentation for Medical Imaging with a Robust Visual-Text Consistency Metric
Lakshmikar R. ⋅ Ming Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 137
Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset
Geon Choi ⋅ Hangyul Yoon ⋅ Hyunju Shin ⋅ Hyunki Park ⋅ Sang Hoon Seo ⋅ Eunho Yang ⋅ Edward Choi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 138
Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction
Yuanbo Li ⋅ Tianyang Xu ⋅ Cong Hu ⋅ Tao Zhou ⋅ Xiao-Jun Wu ⋅ Josef Kittler
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 139
Towards Human-Imperceptible Backdoor Attacks on Text-to-Image Diffusion Models
Changkun Wu ⋅ Chenghao Chen ⋅ Wu kun ⋅ Chong Fu ⋅ Biru Zhu ⋅ Zhenyu Wen ⋅ Zhen Hong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 140
TTP: Test-Time Padding for Adversarial Detection and Robust Adaptation on Vision-Language Models
Zhiwei Li ⋅ Yitian Pang ⋅ Weining Wang ⋅ Zhenan Sun ⋅ Qi Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 141
DualMirage: Hunting Stealthy Multimodal LLM Agents via CAPTCHAs with Contour and Adversarial Illusions
Bei Chen ⋅ Gaolei Li ⋅ Jun Wu ⋅ Jianhua Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 142
Models as Lego Builders: Assembling Malice from Benign Blocks via Semantic Blueprints
Chenxi Li ⋅ Xianggan Liu ⋅ Dake Shen ⋅ Yaosong Du ⋅ Zhibo Yao ⋅ Hao Jiang ⋅ Linyi Jiang ⋅ Chengwei Cao ⋅ Jingzhe Zhang ⋅ RanYi Peng ⋅ Peiling Bai ⋅ Xiande Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 143
Source Models Leak What They Shouldn’t: Unlearning Zero-Shot Transfer in Domain Adaptation Through Adversarial Optimization
Arnav Devalapally ⋅ Poornima Jain ⋅ Kartik Srinivas ⋅ Vineeth Balasubramanian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 144
A Unified Perspective on Adversarial Membership Manipulation in Vision Models
RUIZE GAO ⋅ Kaiwen Zhou ⋅ Yongqiang Chen ⋅ Feng Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 145
Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack
Chenyang LI ⋅ Wenbing Tang ⋅ Yihao Huang ⋅ Simon Sinong Zhan ⋅ Ming Hu ⋅ Xiaojun Jia ⋅ Yang Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 146
OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models
tengjin Weng ⋅ Wenhao Jiang ⋅ Jingyi Wang ⋅ Ming Li ⋅ Lin Ma ⋅ Zhong Ming
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 147
Beyond What's Shared: Recovering Lost Unique Information from Intermediate Layers to Boost Multimodal Geo-Foundation Models
JangHyeon Lee ⋅ Philipe Ambrozio Dias ⋅ Yao-Yi Chiang ⋅ Dalton Lunga
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 148
WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition
Shan Ning ⋅ Longtian Qiu ⋅ Jiaxuan Sun ⋅ Xuming He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 149
CLCR: Cross-Level Semantic Collaborative Representation for Multimodal Learning
Chunlei Meng ⋅ Guanhong Huang ⋅ Rong Fu ⋅ Runmin Jian ⋅ Zhongxue Gan ⋅ Chun Ouyang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 150
Learning Anchor in Dual Orthogonal Space for Fast Multi-view Clustering
Yalan Qin ⋅ Hanzhou Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 151
Bootstrapping Multi-view Learning for Test-time Noisy Correspondence
Changhao He ⋅ Di Xue ⋅ Shuxian Li ⋅ Yanji Hao ⋅ Xi Peng ⋅ Peng Hu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 152
Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
Qihao Liu ⋅ Chengzhi Mao ⋅ Yaojie Liu ⋅ Alan L. Yuille ⋅ Wen-Sheng Chu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 153
FAVE: A Structured Benchmark for Fine-Grained Audio-Visual Temporal Evaluation in Multimodal LLMs
Weiheng Lu ⋅ An Yu ⋅ Jian Li ⋅ Zhenfei Zhang ⋅ Felix X.-F. Ye ⋅ Ming-Ching Chang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 154
Omni2Sound: Towards Unified Video-Text-to-Audio Generation
yusheng dai ⋅ Zehua Chen ⋅ Yuxuan Jiang ⋅ Qiuhong Ke ⋅ Jianfei Cai ⋅ Jun Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 155
EmoThinker: Advancing Visual-Acoustic Emotion Analysis via Structural Token Selection and Chain-of-Thought Reasoning
Qinfu Xu ⋅ Liyuan Pan ⋅ Yiwei Wei ⋅ Shaozu Yuan ⋅ Jiaqi Chen ⋅ Tianyu Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 156
Enhancing Descriptive Captions with Visual Attributes for Multimodal Perception
Yanpeng Sun ⋅ JING HAO ⋅ Ke Zhu ⋅ Jiang-Jiang Liu ⋅ Xiaofan Li ⋅ Na Zhao ⋅ Zechao Li ⋅ Jingdong Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 157
DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Models
Zhou Tao ⋅ Shida Wang ⋅ YongXiang Hua ⋅ Haoyu Cao ⋅ Linli Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 158
Vision-Speech Models: Teaching Speech Models to Converse about Images
Amélie Royer ⋅ Moritz Böhle ⋅ Laurent Mazaré ⋅ Neil Zeghidour ⋅ Alexandre Défossez ⋅ Patrick Pérez
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 159
EMMA: Extracting Multiple physical parameters from Multimodal Data
Farhat Shaikh ⋅ Ayan Banerjee ⋅ Sandeep Gupta
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 160
MMGait: Towards Multi-Modal Gait Recognition
Chenye Wang ⋅ Qingyuan Cai ⋅ Saihui Hou ⋅ Aoqi Li ⋅ Yongzhen Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 161
OSMO: Open-vocabulary Self-eMOtion Tracking
Mohamed Abdelfattah ⋅ Bugra Tekin ⋅ Fadime Sener ⋅ Necati Cihan Camgoz ⋅ Eric Sauser ⋅ Shugao Ma ⋅ Alex Alahi ⋅ Edoardo Remelli
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 162
MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model
Geonmo Gu ⋅ Byeongho Heo ⋅ Jaemyung Yu ⋅ Jaehui Hwang ⋅ Taekyung Kim ⋅ Sangmin Lee ⋅ HeeJae Jun ⋅ Yoohoon Kang ⋅ Sangdoo Yun ⋅ Dongyoon Han
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 163
Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video
Chanhyuk Choi ⋅ Taesoo Kim ⋅ Donggyu Lee ⋅ Siyeol Jung ⋅ Taehwan Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 164
Unleashing the Intrinsic Visual Representation Capability of Multimodal Large Language Models
Hengzhuang Li ⋅ Xinsong Zhang ⋅ QIMING PENG ⋅ Bin Luo ⋅ Han Hu ⋅ Dengyang Jiang ⋅ Han-Jia Ye ⋅ Teng Zhang ⋅ Hai Jin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 165
Active Perceptual Inference: A Corticothalamic-Inspired Dynamic Nested Recurrent Network for Multimodal Sentiment Analysis with Incomplete Data
Yujuan Zhang ⋅ Qing Li ⋅ Ziyu Li ⋅ Xiuxing Li ⋅ Zhuo Wang ⋅ Mengrui Xu ⋅ Xia Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 166
Scalable Trajectory Generation for Whole-Body Mobile Manipulation
Yida Niu ⋅ Xinhai Chang ⋅ Xin Liu ⋅ Ziyuan Jiao ⋅ Yixin Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 167
Breaking the 3D Dataset Bottleneck: Fast Scalable Generation of Aligned 3D Assets from Scratch for Category 6D Pose Estimation and Robotic Grasping
Duret Guillaume ⋅ Danylo Mazurak ⋅ Florence Zara ⋅ Jan Peters ⋅ Liming Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 168
Real-Time Multimodal Fingertip Contact Detection via Depth and Motion Fusion for Vision-Based Human–Computer Interaction
Mukhiddin Toshpulatov ⋅ Wookey Lee ⋅ Suan Lee ⋅ Geehyuk Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 169
Glove2Hand: Synthesizing Natural Hand-Object Interaction from Multi-Modal Sensing Gloves
Xinyu Zhang ⋅ Ziyi Kou ⋅ Chuan Qin ⋅ Mia Huang ⋅ Ergys Ristani ⋅ Ankit Kumar ⋅ Lele Chen ⋅ Kun He ⋅ Abdeslam Boularias ⋅ Li Guan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 170
UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos
Gu Zhang ⋅ Qicheng Xu ⋅ Haozhe Zhang ⋅ Jianhan Ma ⋅ Long He ⋅ Yiming Bao ⋅ Zeyu Ping ⋅ Zhecheng Yuan ⋅ Chenhao Lu ⋅ Chengbo Yuan ⋅ Tianhai Liang ⋅ Xiaoyu Tian ⋅ Maanping Shao ⋅ Feihong Zhang ⋅ Mingyu Ding ⋅ Yang Gao ⋅ Hao Zhao ⋅ Hang Zhao ⋅ Huazhe Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 171
ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation
Mingyang Wu ⋅ Ashirbad Mishra ⋅ Soumik Dey ⋅ Shuo Xing ⋅ Naveen Ravipati ⋅ Hansi Wu ⋅ Binbin Li ⋅ Zhengzhong Tu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 172
DiverseGRPO: Mitigating Mode Collapse in Image Generation via Diversity-Aware GRPO
Henglin Liu ⋅ Huijuan Huang ⋅ Jing Wang ⋅ Chang Liu ⋅ Xiu Li ⋅ Xiangyang Ji
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 173
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation
Shikun Sun ⋅ Liao Qu ⋅ Huichao Zhang ⋅ Yiheng Liu ⋅ Yangyang Song ⋅ Xian Li ⋅ Yi Jiang ⋅ Xu Wang ⋅ Jia Jia ⋅ Daniel Kang Du ⋅ Xinglong Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 174
Video Generation with Stable Transparency via Shiftable RGB-A Distribution Learner
Haotian Dong ⋅ Wenjing Wang ⋅ Chen Li ⋅ Jing LYU ⋅ Di Lin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 175
MOFA-VTON: More Fashion Possibilities with Fine-Grained Adaptations in Virtual Try-On
Xiaoyu Han ⋅ Chenyang Wang ⋅ Jing Wang ⋅ Shunyuan Zheng ⋅ Quanling Meng ⋅ Shengping Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 176
Scaling Multi-Identity Consistency for Image Customization via Multi-to-Multi Matching Paradigm
Yufeng Cheng ⋅ wenxu wu ⋅ Shaojin Wu ⋅ Mengqi Huang ⋅ Fei Ding ⋅ Qian HE
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 177
NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing
Tianlin Pan ⋅ Jiayi Dai ⋅ Chenpu Yuan ⋅ Zhengyao Lv ⋅ Binxin Yang ⋅ Hubery Yin ⋅ Chen Li ⋅ Jing LYU ⋅ Caifeng Shan ⋅ Chenyang Si
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 178
Functional Mean Flow in Hilbert Space
Zhiqi Li ⋅ Yuchen Sun ⋅ Greg Turk ⋅ Bo Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 179
Benchmarking Single-Factor Physical Video-to-Audio Generation
Tingle Li ⋅ Siddharth Gururani ⋅ Kevin Shih ⋅ Gantavya Bhatt ⋅ Sang-gil Lee ⋅ Zhifeng Kong ⋅ Arushi Goel ⋅ Gopala Anumanchipalli ⋅ Ming-Yu Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 180
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
Guozhen Zhang ⋅ Zixiang Zhou ⋅ Teng Hu ⋅ Ziqiao Peng ⋅ Youliang Zhang ⋅ Yi Chen ⋅ Yuan Zhou ⋅ qinglin lu ⋅ Limin Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 181
Refaçade: Editing Object with Given Reference Texture
Youze Huang ⋅ Penghui Ruan ⋅ Bojia Zi ⋅ Xianbiao Qi ⋅ Jianan Wang ⋅ Rong Xiao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 182
Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction
Jiahao Tian ⋅ Chenxi Song ⋅ Wei Cheng ⋅ Chi Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 183
Not All Birds Look The Same: Identity-Preserving Generation For Birds
Aaron Sun ⋅ Oindrila Saha ⋅ Subhransu Maji
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 184
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images
Yi Chen Liu ⋅ Donghao Zhou ⋅ Jie Wang ⋅ Xin Gao ⋅ Guisheng Liu ⋅ Jiatong Li ⋅ Quanwei Zhang ⋅ Qiang Lyu ⋅ Lanqing Guo ⋅ Shilei Wen ⋅ Weiqiang Wang ⋅ Pheng-Ann Heng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 185
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing
YANG FU ⋅ Yike Zheng ⋅ Ziyun Dai ⋅ Henghui Ding
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 186
Clothe and Pose
Nakul Sharma ⋅ Aayush Bansal ⋅ Minh Vo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 187
FlowPortal: Residual-Corrected Flow for Training-Free Video Relighting and Background Replacement
Wenshuo Gao ⋅ Junyi Fan ⋅ Jiangyue Zeng ⋅ Shuai Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 188
The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment
Ziheng Ouyang ⋅ Yiren Song ⋅ Yaoli Liu ⋅ Shihao Zhu ⋅ Qibin Hou ⋅ Mingming Cheng ⋅ Mike Zheng Shou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 189
Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training
Peng Sun ⋅ Jun XIE ⋅ Tao Lin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 190
VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
Maitreya Patel ⋅ Jingtao Li ⋅ Weiming Zhuang ⋅ Yezhou Yang ⋅ Lingjuan Lv
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 191
Bidirectional Normalizing Flow: From Data to Noise and Back
Yiyang Lu ⋅ Qiao Sun ⋅ Xianbang Wang ⋅ Zhicheng Jiang ⋅ Hanhong Zhao ⋅ Kaiming He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 192
ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Xiaoxue Wu ⋅ Xinyuan Chen ⋅ Yaohui Wang ⋅ Yu Qiao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 193
Are Image-to-Video Models Good Zero-Shot Image Editors?
Zechuan Zhang ⋅ Zhenyuan Chen ⋅ Zongxin Yang ⋅ Yi Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 194
FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters
Shitong Shao ⋅ Yufei Gu ⋅ Zeke Xie
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 195
Unified Latent Space for Understanding and Generation via Semantic Auto-encoder
Xiaojie Li ⋅ Yang Zhao ⋅ Ming Li ⋅ Yancheng Zhang ⋅ Zonglin Lyu ⋅ Yunpeng Chen ⋅ Rui Wang ⋅ Daquan Zhou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 196
AHS: Adaptive Head Synthesis via Synthetic Data Augmentations
Taewoong Kang ⋅ Hyojin Jang ⋅ Sohyun Jeong ⋅ Seunggi Moon ⋅ Gihwi Kim ⋅ Hoon Jin Jung ⋅ Jaegul Choo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 197
CASR: A Robust Cyclic Framework for Arbitrary Large-Scale Super-Resolution with Distribution Alignment and Self-Similarity Awareness
Wenhao Guo ⋅ Zhaoran Zhao ⋅ Peng Lu ⋅ Sheng Li ⋅ Qian Qiao ⋅ RuiDe Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 198
Thermal Diffusion Matters: Infrared Spatial-Temporal Video Super-Resolution through Heat Conduction Priors
Mingxuan Zhou ⋅ Shuang Li ⋅ Yutang Zhang ⋅ Jing Geng ⋅ Yirui Shen ⋅ Jingxuan Kang ⋅ Fuzhen Zhuang ⋅ Shuigen Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 199
TextOVSR: Text-Guided Real-World Opera Video Super-Resolution
Hua Chang ⋅ Xin Xu ⋅ Wei Liu ⋅ Jiayi Wu ⋅ Kui Jiang ⋅ Fei Ma ⋅ Qi Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 200
VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution
August Leander Høeg ⋅ Sophia Bardenfleth ⋅ Hans Martin Kjer ⋅ Tim Dyrby ⋅ Vedrana Dahl ⋅ Anders Bjorholm Dahl
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 201
GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution
Qiaosi Yi ⋅ Shuai Li ⋅ Rongyuan Wu ⋅ Lingchen Sun ⋅ Zhengqiang ZHANG ⋅ Lei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 202
Adaptive Anisotropic Gaussian Splatting for Multi-contrast MRI Arbitrary-Scale Super-Resolution with Anatomy Guidance
Qiuhai Yan ⋅ Kang Chen ⋅ Zhengjie Lu ⋅ Tingting Wang ⋅ Faming Fang ⋅ Guixu Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 203
SignPR: A Progressive Vector-Quantized Diffusion Framework for Sign Language Production
Xiao Liu ⋅ Shiwei Gan ⋅ Yafeng Yin ⋅ Bowen Guo ⋅ Zhiwei Jiang ⋅ Shunmei Meng ⋅ Lei Xie ⋅ Sanglu Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 204
LLaMo: Scaling Pretrained Language Models for Unified Motion Understanding and Generation with Continuous Autoregressive Tokens
Zekun Li ⋅ Sizhe An ⋅ Chengcheng Tang ⋅ Chuan Guo ⋅ Ivan Shugurov ⋅ Linguang Zhang ⋅ Amy Zhao ⋅ Srinath Sridhar ⋅ Lingling Tao ⋅ Abhay Mittal
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 205
FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision
Zekai Wu ⋅ Shuqi Fan ⋅ Mengyin Liu ⋅ Yuhua Luo ⋅ Xincheng Lin ⋅ Ming Yan ⋅ Junhao Wu ⋅ Xiuhong Lin ⋅ Yuexin Ma ⋅ Chenglu Wen ⋅ Lan Xu ⋅ Siqi Shen ⋅ Cheng Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 206
Geometric Neural Distance Fields for Learning Human Motion Priors
Zhengdi Yu ⋅ Simone Foti ⋅ Linguang Zhang ⋅ Amy Zhao ⋅ Cem Keskin ⋅ Stefanos Zafeiriou ⋅ Tolga Birdal
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 207
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
Zhixue Fang ⋅ Xu He ⋅ Songlin Tang ⋅ Haoxian Zhang ⋅ Qingfeng Li ⋅ Xiaoqiang Liu ⋅ Pengfei Wan ⋅ Kun Gai
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 208
Decoupled Generative Modeling for Human-Object Interaction Synthesis
Hwanhee Jung ⋅ Seunggwan Lee ⋅ Jeongyoon Yoon ⋅ SeungHyeon Kim ⋅ Giljoo Nam ⋅ Qixing Huang ⋅ Sangpil Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 209
LiveGesture: Streamable Co-Speech Gesture Generation Model
Muhammad Usama Saleem ⋅ Mayur Jagdishbhai Patel ⋅ Ekkasit Pinyoanuntapong ⋅ Zhongxing Qin ⋅ Li Yang ⋅ Hongfei Xue ⋅ Ahmed Helmy ⋅ Chen Chen ⋅ Pu Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 210
HandX: Scaling Bimanual Motion and Interaction Generation
Zimu Zhang ⋅ Yucheng Zhang ⋅ Xiyan Xu ⋅ Ziyin Wang ⋅ Sirui Xu ⋅ Kai Zhou ⋅ Bing Zhou ⋅ Chuan Guo ⋅ Jian Wang ⋅ Yu-Xiong Wang ⋅ Liang-Yan Gui
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 211
MaskAdapt: Learning Flexible Motion Adaptation via Mask-Invariant Prior for Physics-Based Characters
Soomin Park ⋅ Eunseong Lee ⋅ Kwang Bin Lee ⋅ Sung-Hee Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 212
FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
YIYI CAI ⋅ Yuhan Wu ⋅ Kunhang Li ⋅ YOU ZHOU ⋅ Bo Zheng ⋅ Haiyang Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 213
ProjFlow: Projection Sampling with Flow Matching for Zero-Shot Exact Spatial Motion Control
Akihisa Watanabe ⋅ Qing Yu ⋅ Edgar Simo-Serra ⋅ Kent Fujiwara
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 214
Correspondence-Attention Alignment for Multi-View Diffusion Models
Minkyung Kwon ⋅ Jinhyeok Choi ⋅ Jiho Park ⋅ Seonghu Jeon ⋅ Jinhyuk Jang ⋅ Junyoung Seo ⋅ Minseop Kwak ⋅ Jin-Hwa Kim ⋅ Seungryong Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 215
GenErase: Generalizable and Semantically-Aware Concept Erasure in Diffusion Models
Korada Sri Vardhana ⋅ Soma Biswas
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 216
MatMart: Material Reconstruction of 3D Objects via Diffusion
Xiuchao Wu ⋅ Pengfei Zhu ⋅ Jiangjing Lyu ⋅ Xinguo Liu ⋅ Jie Guo ⋅ Yanwen Guo ⋅ Weiwei Xu ⋅ Chengfei Lv
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 217
Region-Adaptive Sampling for Diffusion Transformers
Ziming Liu ⋅ Yifan Yang ⋅ Chengruidong Zhang ⋅ Yiqi Zhang ⋅ Lili Qiu ⋅ Yang You ⋅ Yuqing Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 218
Diffusion Guided Chain-of-Vision for Large Autoregressive Vision Models
Xinyang Wang ⋅ Kecheng Zheng ⋅ Minfeng Zhu ⋅ Wei Wu ⋅ Fan Lu ⋅ Wei Zhai ⋅ Wei Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 219
Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation
Boyu Han ⋅ Qianqian Xu ⋅ Shilong Bao ⋅ Zhiyong Yang ⋅ Ruochen Cui ⋅ Xilin Zhao ⋅ Qingming Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 220
ConceptPrism: Concept Disentanglement in Personalized Diffusion Models via Residual Token Optimization
Minseo Kim ⋅ Minchan Kwon ⋅ Dongyeun Lee ⋅ Yunho Jeon ⋅ Junmo Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 221
Heterogeneous Decentralized Diffusion Models
Zhiying Jiang ⋅ Raihan Seraj ⋅ Marcos Villagra ⋅ Bidhan Roy
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 222
Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning
Ziyi Zhang ⋅ Li Shen ⋅ Deheng Ye ⋅ Yong Luo ⋅ Huangxuan Zhao ⋅ Meng Liu ⋅ Wei Yu ⋅ Lefei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 223
GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation
Rang Li ⋅ Lei Li ⋅ Shuhuai Ren ⋅ Hao Tian ⋅ Shuhao Gu ⋅ Shicheng Li ⋅ Zihao Yue ⋅ Yudong Wang ⋅ Wenhan Ma ⋅ Zhe Yang ⋅ Jingyuan Ma ⋅ Zhifang Sui ⋅ Fuli Luo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 224
ENC-Bench: A Benchmark for Evaluating Multimodal Large Language Models in Electronic Navigational Chart Understanding
Ao Cheng ⋅ Xingming Li ⋅ Xuanyu Ji ⋅ Xixiang He ⋅ Qiyao Sun ⋅ Chunping Qiu ⋅ Runke Huang ⋅ Qingyong Hu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 225
Nonparametric Deep Fine-grained Clustering with Low-Rank Guided Vision-Language Model
xulun ye ⋅ Benyu Wu ⋅ Jie Hong ⋅ Kun Zhou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 226
RealBirdID: Benchmarking Bird Species Identification in the Era of MLLMs
Logan Lawrence ⋅ Oindrila Saha ⋅ Rangel Daroya ⋅ Mustafa Chasmai ⋅ Wuao Liu ⋅ Max Hamilton ⋅ Aaron Sun ⋅ Seoyun Jeong ⋅ Fabien Delattre ⋅ Subhransu Maji ⋅ Grant Horn
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 227
Fast SceneScript: Fast and Accurate Language-Based 3D Scene Understanding via Multi-Token Prediction
Ruihong Yin ⋅ Xuepeng Shi ⋅ Oleksandr Bailo ⋅ Marco Manfredi ⋅ Theo Gevers
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 228
PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks
Cheng Cui ⋅ yubo zhang ⋅ Ting Sun ⋅ Xueqing Wang ⋅ Hongen Liu ⋅ Manhui Lin ⋅ Yue Zhang ⋅ Tingquan Gao ⋅ Changda Zhou ⋅ Jiaxuan Liu ⋅ Zelun Zhang ⋅ Jing Zhang ⋅ Jun Zhang ⋅ Yi Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 229
World in a Frame: Understanding Culture Mixing as a New Challenge for Vision-Language Models
Eunsu Kim ⋅ Junyeong Park ⋅ Na Min An ⋅ Junseong Kim ⋅ Hitesh Laxmichand Patel ⋅ Jiho Jin ⋅ Julia Kruk ⋅ Amit Agarwal ⋅ Srikant Panda ⋅ Fenal Ashokbhai Ilasariya ⋅ Hyunjung Shim ⋅ Alice Oh
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 230
Gastric-X: A Multimodal Multi-Phase Benchmark Dataset for Advancing Vision-Language Models in Gastric Cancer Analysis
Yuanzhe Li ⋅ Hao Chen ⋅ Rui Yin ⋅ Juyan Ba ⋅ Yu Zhang ⋅ Sheng Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 231
HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models
Huizhi Liang ⋅ Yichao Shen ⋅ Yu Deng ⋅ Sicheng Xu ⋅ ZhiYuan Feng ⋅ Tong Zhang ⋅ Yaobo Liang ⋅ Jiaolong Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 232
HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models
Khalequzzaman Chowdhury Sayem ⋅ Mubarrat Chowdhury ⋅ Yihalem Yimolal Tiruneh ⋅ Muneeb Ahmed Khan ⋅ Muhammad Salman Ali ⋅ Binod Bhattarai ⋅ Seungryul Baek
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 233
Probing and Bridging Geometry–Interaction Cues for Affordance Reasoning in Vision Foundation Models
Qing Zhang ⋅ Xuesong li ⋅ Jing Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 234
ARC Is a Vision Problem!
Keya Hu ⋅ Ali Cy ⋅ Linlu Qiu ⋅ Xiaoman Delores Ding ⋅ Runqian Wang ⋅ Yeyin Eva Zhu ⋅ Jacob Andreas ⋅ Kaiming He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 235
Geoint-R1: Formalizing Multimodal Geometric Reasoning with Dynamic Auxiliary Constructions
Jingxuan Wei ⋅ Caijun Jia ⋅ Qi Chen ⋅ Honghao He ⋅ Linzhuang Sun ⋅ Conghui He ⋅ Lijun Wu ⋅ Bihui Yu ⋅ Cheng Tan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 236
S^2-MLLM: Boosting Spatial Reasoning Capability of MLLMs for 3D Visual Grounding with Structural Guidance
Beining Xu ⋅ Siting Zhu ⋅ Zhao Jin ⋅ Junxian Li ⋅ Hesheng Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 237
Learning Multi-View Spatial Reasoning from Cross-View Relations
Suchae Jeong ⋅ Jaehwi Song ⋅ Haeone Lee ⋅ Hanna Kim ⋅ Jian Kim ⋅ Dongjun Lee ⋅ Dong Kyu Shin ⋅ Changyeon Kim ⋅ Dongyoon Hahm ⋅ Woogyeol Jin ⋅ Juheon Choi ⋅ Kimin Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 238
Exploring Spatial Intelligence from a Generative Perspective
Muzhi Zhu ⋅ Shunyao Jiang ⋅ Huanyi Zheng ⋅ Zekai Luo ⋅ Hao Zhong ⋅ Anzhou Li ⋅ Kaijun Wang ⋅ Jintao Rong ⋅ Yang Liu ⋅ Hao Chen ⋅ Tao Lin ⋅ Chunhua Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 239
Physical Object Understanding with a Physically Controllable World Model
Rahul Venkatesh ⋅ Klemen Kotar ⋅ Lilian Naing Chen ⋅ Wanhee Lee ⋅ Gia Ancone ⋅ Seungwoo Kim ⋅ Luca Thomas Wheeler ⋅ Jared Watrous ⋅ Honglin Chen ⋅ Daniel Bear ⋅ Stefan Stojanov ⋅ Daniel L.K. Yamins
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 240
QueryMe: Query-Driven Open-Vocabulary 3D Object Affordances Grounding from Multimodal Evidence
Weiyu Zhao ⋅ Ru Li ⋅ Jiaqi Liu ⋅ Sizhe Zhao ⋅ Qinglin Liu ⋅ Shengping Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 241
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
Zhangquan Chen ⋅ Manyuan Zhang ⋅ Xinlei Yu ⋅ Xufang Luo ⋅ Mingze Sun ⋅ Zihao Pan ⋅ Xiang An ⋅ Yan Feng ⋅ Peng Pei ⋅ Xunliang Cai ⋅ Ruqi Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 242
EG-3DVG: Expression and Geometry Aware Grounding Decoder for 3D Visual Grounding
GwangWook Park ⋅ Hyo-Jun Lee ⋅ Jong-Hyeon Baek ⋅ Hanul Kim ⋅ Yeong Jun Koh
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 243
AffordMatcher: Affordance Learning in 3D Scenes from Visual Signifiers
Nghia Vu ⋅ Tuong Do ⋅ Khang Nguyen ⋅ Baoru Huang ⋅ Nhat Le ⋅ Binh Xuan Nguyen ⋅ Erman Tjiputra ⋅ Quang D. Tran ⋅ Ravi Prakash ⋅ Te-Chuan Chiu ⋅ Anh Nguyen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 244
SpatiaLQA: A Benchmark for Evaluating Spatial Logical Reasoning in Vision-Language Models
Yuechen Xie ⋅ Xiaoyan Zhang ⋅ Yicheng Shan ⋅ Zhu Hao ⋅ Rui Tang ⋅ Rong Wei ⋅ Mingli Song ⋅ Yuanyu Wan ⋅ Jie Song
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 245
Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval
Zhiheng Fu ⋅ Yupeng Hu ⋅ Qianyun Yang ⋅ Shiqi Zhang ⋅ Zhiwei Chen ⋅ Zixu Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 246
Intra-class Distribution-guided Generative Hashing with Neighbor Refinement for Cross-modal Retrieval
Hao Sun ⋅ Yadong Huo ⋅ Qibing Qin ⋅ Wenfeng Zhang ⋅ Lei Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 247
Language-driven Fine-grained Retrieval
Shijie Wang ⋅ Xin Yu ⋅ Yadan Luo ⋅ Zijian Wang ⋅ Pengfei Zhang ⋅ Zi Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 248
MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding
Fan Yang ⋅ Xingping Dong ⋅ Xin Yu ⋅ Wenhan Luo ⋅ Wei Liu ⋅ Kaihao Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 249
RetFormer: Multimodal Retrieval for Enhancing Image Recognition
Tianrui Yu ⋅ Xiubo Liang ⋅ Hongzhi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 250
DREAM: Document Recognition with Explicit Adaptive Memory
TIANQI ZHAO ⋅ Di Wu ⋅ Liangrui Peng ⋅ Yifan Huang ⋅ Kemeng Zhao ⋅ Shuo Li ⋅ Zhiyu Li ⋅ Yizhu Wang ⋅ Borui Jiang ⋅ Yuyang Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 251
RMIR: A Benchmark Dataset for Reasoning-Intensive Multimodal Image Retrieval
Yijiang Li ⋅ Kunal Kotian ⋅ Ali Marjaninejad ⋅ Meir Friedenberg ⋅ Kaushik Pavani ⋅ Sunny Dasgupta
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 252
POGA: Paraphrased and Oppositional Graph Alignment for Fine-Grained Cross-Modal Retrieval
Junfeng Zhang ⋅ Zhe Xue ⋅ Yuankai Qi ⋅ Junping Du ⋅ Xiangyang Kong ⋅ Yishuo Yan ⋅ Amin Beheshti ⋅ Jian Yang ⋅ Anton van den Hengel ⋅ Ming-Hsuan Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 253
Chain-of-Frames: Advancing Video Understanding in Multimodal LLMs via Frame-Aware Reasoning
SARA GHAZANFARI ⋅ Francesco Croce ⋅ Nicolas Flammarion ⋅ Prashanth Krishnamurthy ⋅ Farshad Khorrami ⋅ Siddharth Garg
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 254
TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning
Tao Wu ⋅ Li Yang ⋅ Gen Zhan ⋅ Yabin ZHANG ⋅ Yiting Liao ⋅ Junlin Li ⋅ Deliang Fu ⋅ Li zhang ⋅ Limin Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 255
RiskProp: Collision-Anchored Self-Supervised Risk Propagation For Early Accident Anticipation
Yiyang Zou ⋅ Tianhao Zhao ⋅ Peilun Xiao ⋅ Hongyu Jin ⋅ Longyu Qi ⋅ Yuxuan Li ⋅ Liyin Liang ⋅ Yifeng Qian ⋅ Chunbo Lai ⋅ Yutian Lin ⋅ Zhihui Li ⋅ Yu Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 256
MotionEnhancer: Leveraging Video Diffusion for Motion-Enhanced Vision-Language Models
Yifan Xu ⋅ Chao Zhang ⋅ Ruifei Ma ⋅ Fei Gao ⋅ Zhifei Yang ⋅ Jiaxing Qi ⋅ Zhipeng Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 257
MedGRPO: Multi-Task Reinforcement Learning for Heterogeneous Medical Video Understanding
Yuhao Su ⋅ Anwesa Choudhuri ⋅ Zhongpai Gao ⋅ Benjamin Planche ⋅ Van Nguyen Nguyen ⋅ Meng Zheng ⋅ Yuhan Shen ⋅ Arun Innanje ⋅ Terrence Chen ⋅ Ehsan Elhamifar ⋅ Ziyan Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 258
Asynchronous Temporal Modeling with Two-Agent Framework for Streaming Dense Video Captioning
Yolo Yunlong Tang ⋅ Chao Huang ⋅ Susan Liang ⋅ Jing Bi ⋅ Yicheng Wang ⋅ Daiki Shimada ⋅ Chenliang Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 259
TRCoRSurg: Temporal-Relational Co-Reasoning for Surgical Video Triplet Recognition
Fang Li ⋅ Shihao Zou ⋅ Weixin Si ⋅ Yang Gao ⋅ Shuai Li ⋅ Aimin Hao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 260
OASIS: On-Demand Hierarchical Event Memory for Streaming Video Reasoning
Zhijia Liang ⋅ Jiaming Li ⋅ Weikai Chen ⋅ Yanhao Zhang ⋅ Haonan Lu ⋅ Guanbin Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 261
One-Shot Flow, Any-Time Frame: A Bidirectional Warping Framework for Event-Based Video Frame Interpolation
Linghui Fu ⋅ Yuhan Liu ⋅ Hao Chen ⋅ Zhen Yang ⋅ Yongjian Deng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 262
TF-CADE: Foreground-Concentrated Text-Video Alignment for Zero-Shot Temporal Action Detection
Yearang Lee ⋅ Ho-Joong Kim ⋅ Seong-Whan Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 263
PRISM: Prototype-based Reasoning with Inter-modal Semantic Mining for Interpretable Image Recognition
Anni Yu ⋅ Yu-Bin Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 264
Concept Regions Matter: Benchmarking CLIP with a New Cluster-Importance Approach
Aishwarya Agarwal ⋅ Srikrishna Karanam ⋅ Vineet Gandhi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 265
PhaseWin Search Framework Enable Efficient Object-Level Interpretation
Zihan Gu ⋅ Ruoyu Chen ⋅ Junchi Zhang ⋅ Yue Hu ⋅ Hua Zhang ⋅ Xiaochun Cao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 266
Beyond Top Activations: Efficient and Reliable Crowdsourced Evaluation of Automated Interpretability
Tuomas Oikarinen ⋅ Ge Yan ⋅ Akshay Kulkarni ⋅ Tsui-Wei Weng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 267
From Weights to Concepts: Data-Free Interpretability of CLIP via Singular Vector Decomposition
Francesco Gentile ⋅ Nicola DallAsen ⋅ Francesco Tonini ⋅ Massimiliano Mancini ⋅ Lorenzo Vaquero ⋅ Elisa Ricci
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 268
Hierarchical Concept Embedding & Pursuit for Interpretable Image Classification
Nghia Nguyen ⋅ Tianjiao Ding ⋅ Rene Vidal
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 269
Interpretable and Steerable Concept Bottleneck Sparse Autoencoders
Akshay Kulkarni ⋅ Tsui-Wei Weng ⋅ Vivek Narayanaswamy ⋅ Shusen Liu ⋅ Wesam A. Sakla ⋅ Kowshik Thopalli
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 270
C-LaV: Conditional Latent Velocity Field Denoising for Weather-Robust LiDAR Place Recognition
Xuewei Cao ⋅ Jiayue Yang ⋅ Zhiwen Zeng ⋅ Yanyong Zhang ⋅ Yan Xia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 271
Towards Foundation Models for 3D Scene Understanding: Instance-Aware Self-Supervised Learning for Point Clouds
Bin Yang ⋅ Mohamed Abdelsamad ⋅ Miao Zhang ⋅ Alexandru Paul Condurache
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 272
Generalized-CVO: Fast and Correspondence-Free Local Point Cloud Registration with Second Order Riemannian Optimization
Ray (Rui) Zhang ⋅ Carl Greiff ⋅ Thomas Lew ⋅ John Subosits
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 273
LiDeRe: A Lightweight Readout for Fast and Data-Efficient Dense Prediction
Timo Lüddecke ⋅ Jan F. Meier ⋅ Jan van Delden ⋅ Alexander S. Ecker
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 274
AnyPcc: Compressing Any Point Cloud with a Single Universal Model
Kangli Wang ⋅ Qianxi Yi ⋅ Yuqi Ye ⋅ Shihao Li ⋅ Wei Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 275
CoLC: Communication-Efficient Collaborative Perception with LiDAR Completion
Yushan Han ⋅ Hui Zhang ⋅ Qiming Xia ⋅ Yi Jin ⋅ Yidong Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 276
Spectral-Geometric Neural Fields for Pose-Free LiDAR View Synthesis
Yinuo Jiang ⋅ Jun Cheng ⋅ Yiran Wang ⋅ Cheng Cheng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 277
C-GenReg: Training-Free 3D Point Cloud Registration by Multi-View-Consistent Geometry-to-Image Generation with Probabilistic Modalities Fusion
Yuval Haitman ⋅ Amit Efraim ⋅ Joseph M. Francos
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 278
PatchAlign3D: Local Feature Alignment for Dense 3D Shape Understanding
Souhail Hadgi ⋅ Bingchen Gong ⋅ Ramana Sundararaman ⋅ Emery Pierson ⋅ Lei Li ⋅ Peter Wonka ⋅ Maks Ovsjanikov
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 279
FoV-Net: Rotation-Invariant CAD B-rep Learning via Field-of-View Ray Casting
Matteo Ballegeer ⋅ Dries F. Benoit
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 280
Neural Distribution Prior for LiDAR Out-of-Distribution Detection
Zizhao Li ⋅ Zhengkang Xiang ⋅ Jiayang Ao ⋅ Feng Liu ⋅ Joseph West ⋅ Kourosh Khoshelham
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 281
DENALI: A Dataset Enabling Non-Line-of-Sight Spatial Reasoning with Low-Cost LiDARs
Nikhil Behari ⋅ Diego Rivero ⋅ Luke Apostolides ⋅ Suman Ghosh ⋅ Paul Pu Liang ⋅ Ramesh Raskar
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 282
Concept-Aware Batch Sampling Improves Language-Image Pretraining
Adhiraj Ghosh ⋅ Vishaal Udandarao ⋅ Thao Nguyen ⋅ Matteo Farina ⋅ Mehdi Cherti ⋅ Jenia Jitsev ⋅ Sewoong Oh ⋅ Elisa Ricci ⋅ Ludwig Schmidt ⋅ Matthias Bethge
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 283
HiFICL: High-Fidelity In-Context Learning for Multimodal Tasks
Xiaoyu Li ⋅ Yuhang Liu ⋅ xuanshuo kang ⋅ zheng luo ⋅ Fangqi Lou ⋅ 吴晓华 ⋅ Zihan Xiong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 284
InstAP: Instance-Aware Vision-Language Pre-Train for Spatial-Temporal Understanding
Ashutosh Kumar ⋅ Rajat Saini ⋅ Jingjing Pan ⋅ Mustafa Erdogan ⋅ Mingfang Zhang ⋅ Betty Le ⋅ Norimasa Kobori ⋅ Quan Kong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 285
Vocabulary Scaling Law: Tuning Open-vocabulary Predictors for Their Openness
Ziliang Chen ⋅ Yulu Li ⋅ Liangda Fang ⋅ jusheng zhang ⋅ Yongsen Zheng ⋅ Quanlong Guan ⋅ Xipeng Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 286
Render-to-Adapt: Unsupervised Personal Adaptation for Gaze Estimation
Yangshi Ge ⋅ Zheng Liu ⋅ Feng Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 287
ViTPrompt: Training-Free Prompt Refinement with Visual Tokens for Open-Vocabulary Detection
Yitong Qin ⋅ Lihua Zhou ⋅ Jiwei Wei ⋅ Ran Ran ⋅ Shiyuan He ⋅ Zeyu Ma ⋅ Shuaifeng Li ⋅ Nianxin Li ⋅ Heng Tao Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 288
Cluster-Aware Neural Collapse Prompt Tuning for Long-Tailed Generalization of Vision-Language Models
Boyang Guo ⋅ Liang Li ⋅ Lin Peng ⋅ Yuhan Gao ⋅ Xichun Sheng ⋅ Chenggang Yan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 289
LLMind: Bio-inspired Training-free Adaptive Visual Representations for Vision-Language Models
Soumyaratna Debnath ⋅ Bui Manh Duc ⋅ Zinan Liu ⋅ Lin Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 290
Dynamic Logits Adjustment and Exploration for Test-Time Adaptation in Vision Language Models
Haoyan Wu ⋅ Yahao Liu ⋅ Yinjie Lei ⋅ Lixin Duan ⋅ Wen Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 291
CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language Misalignment
Maoyuan Shao ⋅ Yutong Gao ⋅ Xinyang Huang ⋅ Lijuan Sun ⋅ Guoshun Nan ⋅ Chuang Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 292
GenMatter: Perceiving Physical Objects with Generative Matter Models
Eric Li ⋅ Arijit Dasgupta ⋅ Yoni Friedman ⋅ Mathieu Huot ⋅ Vikash Mansinghka ⋅ Thomas O'Connell ⋅ William Freeman ⋅ Joshua B. Tenenbaum
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 293
Bidirectional Query-Driven Generation of Parametric CAD Sketch
Yang Liu ⋅ Daxuan Ren ⋅ Yijie Ding ⋅ Jianmin Zheng ⋅ Fang Deng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 294
The Missing GAP: From Solving Square Jigsaw Puzzles to Handling Real World Archaeological Fragments
Ofir Itzhak Shahar ⋅ Gur Elkin ⋅ Ohad Ben-Shahar
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 295
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation
Yiwen Tang ⋅ Ziyu Guo ⋅ Kaixin Zhu ⋅ Ray Zhang ⋅ Qizhi Chen ⋅ Dongzhi Jiang ⋅ Junli Liu ⋅ Bohan Zeng ⋅ Haoming Song ⋅ Delin Qu ⋅ Tianyi Bai ⋅ Dan Xu ⋅ Wentao Zhang ⋅ Bin Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 296
OmniDocLayout: Towards Diverse Document Layout Generation via Coarse-to-Fine LLM Learning
Hengrui Kang ⋅ Zhuangcheng Gu ⋅ Zhiyuan Zhao ⋅ Zichen Wen ⋅ Bin Wang ⋅ Weijia Li ⋅ Conghui He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 297
Yo'City: Personalized and Boundless 3D Realistic City Scene Generation via Self-Critic Expansion
Keyang Lu ⋅ Sifan Zhou ⋅ Hongbin Xu ⋅ Gang Xu ⋅ Zhifei Yang ⋅ Yikai Wang ⋅ Zhen Xiao ⋅ Jieyi Long ⋅ Ming Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 298
Repurposing 3D Generative Model for Autoregressive Layout Generation
Haoran Feng ⋅ Yifan Niu ⋅ Zehuan Huang ⋅ Yangtian Sun ⋅ Chunchao Guo ⋅ Yuxin Peng ⋅ Lu Sheng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 299
CAD-Refiner: A Unified Framework for CAD Generation and Iterative Editing
Meng Yuan ⋅ Dawei Lin ⋅ Hongxia Xie ⋅ Tieru Wu ⋅ Rui Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 300
A Debiased Reconstruction-based Framework for Training-Free Detection of AI-Generated Images
Sungik Choi ⋅ Hankook Lee ⋅ Jaehoon Lee ⋅ Robin Kim ⋅ Stanley Jungkyu Choi ⋅ Moontae Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 301
Global Information Thresholding for Sufficient and Necessary Circuits
Jegyeong Cho
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 302
PrivateEyes: Gaze-Preserving Anonymization for Data Sharing
Surabhi Gupta ⋅ Dinesh Prabhu Muthumariappan ⋅ Biplab Ch Das ⋅ Anoop Kolar Rajagopal ⋅ Kiran Nanjunda Iyer ⋅ Donghwan Seo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 303
From Measurement to Mitigation: Quantifying and Reducing Identity Leakage in Image Representation Encoders with Linear Subspace Removal
Daniel George ⋅ Charles Yeh ⋅ Daniel Lee ⋅ Yifei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 304
Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models
Ivan Luiz De Moura Matos ⋅ Djalil Sad Saoud ⋅ Ekaterina Iakovleva ⋅ Vito Paolo ⋅ Enzo Tartaglione
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 305
pH-Strips for Selective Forgetting: A Blunt but Fast Diagnostic Baseline for Machine Unlearning
Chengyao Qian ⋅ Jing Wu ⋅ Trung Le ⋅ Dinh Phung ⋅ Mehrtash Harandi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 306
Decoupling Defense Strategies for Robust Image Watermarking
Jiahui Chen ⋅ Zehang Deng ⋅ Zeyu Zhang ⋅ Chaoyang Li ⋅ Lianchen Jia ⋅ Lifeng Sun
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 307
Unsafe2Safe: Controllable Image Anonymization for Downstream Utility
Minh Dinh ⋅ SouYoung Jin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 308
Rel-Zero: Harnessing Patch-Pair Invariance for Robust Zero-Watermarking Against AI Editing
Pengzhen Chen ⋅ Yanwei Liu ⋅ Xiaoyan Gu ⋅ Xiaojun Chen ⋅ Wu Liu ⋅ Weiping Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 309
Computation and Communication Efficient Federated Unlearning via On-server Gradient Conflict Mitigation and Expression
Minh-Duong Nguyen ⋅ Senura Hansaja Wanasekara ⋅ Le-Tuan Nguyen ⋅ Ken-Tye Yong ⋅ Quoc-Viet Pham ⋅ Nguyen H. Tran ⋅ Dung D. Le
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 310
DP-FedAdamW: An Efficient Optimizer for Differentially Private Federated Large Models
Jin Liu ⋅ Ning Xi ⋅ Yinbin Miao ⋅ Junkang Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 311
Submodel Extraction for Efficient and Personalized Federated Learning via Optimal Transport
Zheng Jiang ⋅ Nan He ⋅ Yiming Chen ⋅ Lifeng Sun
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 312
FedSDR: Federated Graph Learning with Structural Noise Detection and Reconstruction
Jiaqi Liu ⋅ Zihan Tan ⋅ Guancheng Wan ⋅ Wenke Huang ⋅ He Li ⋅ Mang Ye
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 313
FedDAP: Domain-Aware Prototype Learning for Federated Learning under Domain Shift
Huy Q. Le ⋅ Loc X. Nguyen ⋅ Yu Qiao ⋅ Seong Tae Kim ⋅ Eui-Nam Huh ⋅ Choong Seon Hong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 314
FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation
Min Tan ⋅ Junchao Ma ⋅ Yinfu FENG ⋅ Jiajun Ding ⋅ Wenwen Pan ⋅ Tingting Han ⋅ Qian Zheng ⋅ Zhenzhong Kuang ⋅ Zhou Yu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 315
VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation
Jihwan Hong ⋅ Jaeyoung Do
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 316
AXG-Reasoner: Error Detection and Explanation in Long Task Videos with Vision–Language Models
Shih-Po Lee ⋅ Ehsan Elhamifar
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 317
Stay in your Lane: Role Specific Queries with Overlap Suppression Loss for Dense Video Captioning
Seung Hyup Baek ⋅ Jimin Lee ⋅ Hyeongkeun Lee ⋅ Jae Won Cho
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 318
T2SGrid: Temporal-to-Spatial Gridification for Video Temporal Grounding
Chaohong Guo ⋅ Yihan He ⋅ Yongwei Nie ⋅ Fei Ma ⋅ Xuemiao Xu ⋅ Chengjiang Long
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 319
HanDyVQA: A Video QA Benchmark for Fine-Grained Hand-Object Interaction Dynamics
Masatoshi Tateno ⋅ Gido Kato ⋅ Hirokatsu Kataoka ⋅ Yoichi Sato ⋅ Takuma Yagi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 320
SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning
Ye-Chan Kim ⋅ SeungJu Cha ⋅ Si-Woo Kim ⋅ minju Jeon ⋅ HyunGee Kim ⋅ Dong-Jin Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 321
Token Warping Helps MLLMs Look from Nearby Viewpoints
Phillip Y. Lee ⋅ Chanho Park ⋅ Mingue Park ⋅ Seungwoo Yoo ⋅ Juil Koo ⋅ Minhyuk Sung
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 322
Variation-aware Vision Token Dropping for Faster Large Vision-Language Models
Chen junjie ⋅ Xuyang Liu ⋅ Zichen Wen ⋅ Yiyu Wang ⋅ Siteng Huang ⋅ Junjie Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 323
Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients
Ziwei Xiang ⋅ Fanhu Zeng ⋅ Hongjian Fang ⋅ Rui-Qi Wang ⋅ Renxing Chen ⋅ Yanan Zhu ⋅ yi chen ⋅ Peipei Yang ⋅ Xu-Yao Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 324
Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding
Yuchen Feng ⋅ Zhenyu Zhang ⋅ Naibin Gu ⋅ Yilong Chen ⋅ Peng Fu ⋅ Zheng Lin ⋅ Shuohuan Wang ⋅ Yu Sun ⋅ Hua Wu ⋅ Weiping Wang ⋅ Haifeng Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 325
IF-Prune: Information-Flow Guided Token Pruning for Efficient Vision-Language Models
Guohao Sun ⋅ Yufei Wang ⋅ Sizhuo Ma ⋅ Yuege Xie ⋅ Yuting Cheng ⋅ ZHIQIANG TAO ⋅ Jian Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 326
EvoComp: Learning Visual Token Compression for Multimodal Large Language Models via Semantic-Guided Evolutionary Labeling
Jiafei Song ⋅ Fengwei Zhou ⋅ Jin Qu ⋅ Wenjin Jason Li ⋅ Tong Wu ⋅ Gengjian Xue ⋅ Zhikang Zhao ⋅ Daomin Wei ⋅ Yichao Lu ⋅ Bailin Na
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 327
DocPrune: Efficient Document Question Answering via Background, Question, and Comprehension-aware Token Pruning
Joonmyung Choi ⋅ Sanghyeok Lee ⋅ Jongha Kim ⋅ Sehyung Kim ⋅ Dohwan Ko ⋅ Jihyung Kil ⋅ Hyunwoo J. Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 328
QuietPrune: Query-Guided Early Token Pruning for Vision-Language Models
Tianxiao Gao ⋅ Shanwei Zhao ⋅ Shuo Fang ⋅ Shiai Zhu ⋅ Chenguang Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 329
The Devil Is in Gradient Entanglement: Energy-Aware Gradient Coordinator for Robust Generalized Category Discovery
Haiyang Zheng ⋅ Nan Pu ⋅ Yaqi Cai ⋅ Teng Long ⋅ Wenjing Li ⋅ Nicu Sebe ⋅ Zhun Zhong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 330
LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis
Ibne Farabi Shihab ⋅ Sanjeda Akter ⋅ Anuj Sharma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 331
Coordinate Denoising for Non‑Equilibrium Molecular Representation Learning
Qianwei Tang ⋅ Baile Xu ⋅ Jian Zhao ⋅ Furao Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 332
Plug-and-Play Incomplete Multi-View Clustering via Janus-Faced Affinity Learning with Topology Harmonization
Shengju Yu ⋅ Suyuan Liu ⋅ Wenhao SHAO ⋅ Siwei Wang ⋅ KE LIANG ⋅ Xihong Yang ⋅ Tiejun Li ⋅ Xinwang Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 333
Meta-Learning In-Context Enables Training-Free Cross Subject Brain Decoding
Mu Nan ⋅ Muquan Yu ⋅ Weijian Mai ⋅ Jacob S. Prince ⋅ Hossein Adeli ⋅ Rui Zhang ⋅ Jiahang Cao ⋅ Benjamin Becker ⋅ John S. Pyles ⋅ Margaret M. Henderson ⋅ Chunfeng Song ⋅ Nikolaus Kriegeskorte ⋅ Michael J. Tarr ⋅ Xiaoqing Hu ⋅ Andrew F. Luo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 334
Measure The Feature Universe: Topology-based Pseudo Labeling and Gravity Consistency for Source-Free Domain Adaptation
Jae Yun Lee ⋅ Hyeok Nam ⋅ Sung In Cho
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 335
Conditional Factuality Controlled LLMs with Generalization Certificates via Conformal Sampling
Kai Ye ⋅ Qingtao Pan ⋅ Shuo Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 336
Harnessing the Power of Foundation Models for Accurate Material Classification
QINGRAN LIN ⋅ Fengwei Yang ⋅ Chaolun Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 337
Content-Aware Frequency Encoding for Implicit Neural Representations with Fourier-Chebyshev Features
Junbo Ke ⋅ Yangyang Xu ⋅ Chao Wang ⋅ You-Wei Wen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 338
ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving
Han Lu ⋅ Xiaosong Jia ⋅ Yichen Xie ⋅ Siyu Sun ⋅ Wenlong Liao ⋅ Xiaokang Yang ⋅ Junchi Yan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 339
TeFlow: Enabling Multi-frame Supervision for Self-Supervised Feed-forward Scene Flow Estimation
Qingwen Zhang ⋅ Chenhan Jiang ⋅ Xiaomeng Zhu ⋅ Yunqi Miao ⋅ Yushan Zhang ⋅ Olov Andersson ⋅ Patric Jensfelt
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 340
Think Before You Drive: World Model-Inspired Multimodal Grounding
Haicheng Liao ⋅ Huanming Shen ⋅ Bonan Wang ⋅ yong kang li ⋅ Yihong Tang ⋅ Chengyue Wang ⋅ Dingyi Zhuang ⋅ Kehua Chen ⋅ HAI YANG ⋅ Chengzhong Xu ⋅ Zhenning Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 341
DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Zhe Liu ⋅ Runhui Huang ⋅ Rui Yang ⋅ Siming Yan ⋅ Zining Wang ⋅ Lu Hou ⋅ Di Lin ⋅ Xiang Bai ⋅ Hengshuang Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 342
DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation
Zhechao Wang ⋅ Yiming Zeng ⋅ Lufan Ma ⋅ Zeqing Fu ⋅ Chen Bai ⋅ Dongshuo Yin ⋅ Ziyao Lin ⋅ Cheng Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 343
WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios
Runsheng Xu ⋅ Hubert Lin ⋅ Wonseok Jeon ⋅ Hao Feng ⋅ Yuliang Zou ⋅ Liting Sun ⋅ John Gorman ⋅ Kate Tolstaya ⋅ Sarah Tang ⋅ Brandyn White ⋅ Ben Sapp ⋅ Mingxing Tan ⋅ Jyh-Jing Hwang ⋅ Dragomir Anguelov
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 344
GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving
Lin Liu ⋅ Caiyan Jia ⋅ Guanyi Yu ⋅ Ziying Song ⋅ Junqiao Li ⋅ Feiyang Jia ⋅ Peiliang Wu ⋅ Xiaoshuai Hao ⋅ Yadan Luo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 345
ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving
Zhiyu Zheng ⋅ Shaoyu Chen ⋅ haoran yin ⋅ xinbang zhang ⋅ Jialv Zou ⋅ Xinggang Wang ⋅ Qian Zhang ⋅ Lefei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 346
KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System
Zhongyu Xia ⋅ Wenhao Chen ⋅ Yongtao Wang ⋅ Ming-Hsuan Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 347
FoSS: Modeling Long-Range Dependencies and Multimodal Uncertainty in Trajectory Prediction via Fourier–State Space Integration
Yizhou Huang ⋅ Genze Jiang ⋅ Yihua Cheng ⋅ Kezhi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 348
NexusFlow: Unifying Disparate Tasks under Partial Supervision via Invertible Flow Networks
Fangzhou Lin ⋅ Yuping Wang ⋅ Yuliang Guo ⋅ Zixun Huang ⋅ Xinyu Huang ⋅ Haichong Zhang ⋅ Kazunori Yamada ⋅ Zhengzhong Tu ⋅ Liu Ren ⋅ Ziming Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 349
Visual Prototype Conditioned Focal Region Generation for UAV-Based Object Detection
Wenhao Li ⋅ Zimeng Wu ⋅ Yu Wu ⋅ Zehua Fu ⋅ Jiaxin Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 350
Consistent Instance Field for Dynamic Scene Understanding
Junyi Wu ⋅ Van Nguyen Nguyen ⋅ Benjamin Planche ⋅ Jiachen Tao ⋅ Changchang Sun ⋅ Zhongpai Gao ⋅ Zhenghao Zhao ⋅ Anwesa Choudhuri ⋅ Gengyu Zhang ⋅ Meng Zheng ⋅ Feiran Wang ⋅ Terrence Chen ⋅ Yan Yan ⋅ Ziyan Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 351
CLP: A Real-World Dataset of Contaminated Lens Protectors for Robust Semantic Segmentation
Sungyong Park ⋅ Sooyoung Choi ⋅ Hyunseo Koh ⋅ Youngjae Choi ⋅ Heewon Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 352
ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images
Muhammad Naseer Subhani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 353
Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions
Shiqin Wang ⋅ Haoyang Chen ⋅ Huaizhou Huang ⋅ Yinkan He ⋅ Dongfang Sun ⋅ Xiaoqing Chen ⋅ Xingyu Liu ⋅ Zheng Wang ⋅ Kaiyan Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 354
SAM2Text: Towards Prompt-Free and Multi-Resolution Video Scene Text Segmentation
Jing-Yao Zhang ⋅ Heng Zhang ⋅ Mingsen Zhang ⋅ Binbin Yang ⋅ Fei Yin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 355
Reinforcing Video Reasoning Segmentation to Think Before It Segments
Sitong Gong ⋅ Yunzhi Zhuge ⋅ Lu Zhang ⋅ Jiazuo Yu ⋅ Pingping Zhang ⋅ Xu Jia ⋅ Huchuan Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 356
VideoMaMa: Mask-Guided Video Matting via Generative Prior
Sangbeom Lim ⋅ Seoung Wug Oh ⋅ Gabriel Huang ⋅ Heeji Yoon ⋅ Seungryong Kim ⋅ Joon-Young Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 357
Quantized Residuals to Continuous Prompts for Few-Shot Class Incremental Learning in Vision-Language Models
Abhishek Kumar Sinha ⋅ Nitant Dube ⋅ Soma Biswas
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 358
The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation
Guannan Lai ⋅ Da-Wei Zhou ⋅ Zhenguo Li ⋅ Han-Jia Ye
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 359
SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning
Yongkang Hu ⋅ Yu Cheng ⋅ YuShuo Zhang ⋅ Yuan Xie ⋅ Zhaoxia Yin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 360
Is Parameter Isolation Better for Prompt-Based Continual Learning?
Jiangyang Li ⋅ Chenhao Ding ⋅ SongLin Dong ⋅ Qiang Wang ⋅ Jianchao Zhao ⋅ Yuhang He ⋅ Yihong Gong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 361
Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models
Yuehao Liu ⋅ Shanyan Guan ⋅ Weijia Zhang ⋅ Xuanming Shang ⋅ Yanhao Ge ⋅ Wei Li ⋅ Chao Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 362
Affordance-First Decomposition for Continual Learning in Video–Language Understanding
Mengzhu xu ⋅ Hanzhi Liu ⋅ Ningkang Peng ⋅ qianyu Chen ⋅ Canran Xiao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 363
Quantum-Gated Task-interaction Knowledge Distillation for Pre-trained Model-based Class-Incremental Learning
Linjie Li ⋅ HUIYU XIAO ⋅ Jiarui Cao ⋅ Zhenyu Wu ⋅ Yang Ji
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 364
Elastic Weight Consolidation Done Right for Continual Learning
Xuan Liu ⋅ Xiaobin Chang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 365
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models
Chongyang Zhao ⋅ Mingsong Li ⋅ Haodong Lu ⋅ Dong Gong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 366
Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
Jiangning Zhang ⋅ junwei zhu ⋅ Zhenye Gan ⋅ Donghao Luo ⋅ Chuming Lin ⋅ FeiFan Xu ⋅ Xu Peng ⋅ Jianlong Hu ⋅ Yuansen Liu ⋅ Yijia Hong ⋅ Weijian Cao ⋅ Han Feng ⋅ Xu Chen ⋅ Chencan Fu ⋅ Keke He ⋅ Xiaobin Hu ⋅ Chengjie Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 367
Talking Together: Synthesizing Co-Located 3D Conversations from Audio
Mengyi Shan ⋅ Shouchieh Chang ⋅ Ziqian Bai ⋅ Shichen Liu ⋅ Yinda Zhang ⋅ Luchuan Song ⋅ Rohit Pandey ⋅ Sean Fanello ⋅ Zeng Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 368
InfinityHuman: Towards Long-Term Audio-Driven Human Animation
Xiaodi Li ⋅ Pan Xie ⋅ Yi Ren ⋅ Qijun Gan ⋅ Chen Zhang ⋅ Fangyuan Kong ⋅ Xiang Yin ⋅ Zehuan Yuan ⋅ BINGYUE PENG
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 369
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision
Hyunsoo Cha ⋅ Wonjung Woo ⋅ Byungjun Kim ⋅ Hanbyul Joo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 370
AudioAvatar: Personalized Audio-driven Whole-body Talking Avatars
Seungeun Lee ⋅ SeungJun Moon ⋅ Hah Min Lew ⋅ Ji-Su Kang ⋅ Gyeong-Moon Park
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 371
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
Shijun Shi ⋅ Jing Xu ⋅ Zhihang Li ⋅ Chunli Peng ⋅ Xiaoda Yang ⋅ Lijing Lu ⋅ Kai Hu ⋅ Jiangning Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 372
Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning
Zhenghao Peng ⋅ Wenhao Ding ⋅ Yurong You ⋅ Yuxiao Chen ⋅ Wenjie Luo ⋅ Thomas Tian ⋅ Yulong Cao ⋅ Apoorva Sharma ⋅ Danfei Xu ⋅ Boris Ivanovic ⋅ Boyi Li ⋅ Yan Wang ⋅ Marco Pavone
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 373
SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
jingyu li ⋅ Junjie Wu ⋅ Dongnan Hu ⋅ Xiangkai Huang ⋅ Bin Sun ⋅ Zhihui Hao ⋅ XianPeng Lang ⋅ Xiatian Zhu ⋅ Li Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 374
CapNav: Benchmarking Vision Language Models on Capability-conditioned Indoor Navigation
Xia Su ⋅ Ruiqi Chen ⋅ Benlin Liu ⋅ Jingwei Ma ⋅ Zonglin Di ⋅ Ranjay Krishna ⋅ Jon Froehlich
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 375
AutoTraces: Autoregressive Trajectory Forecasting via Multimodal Large Language Models
Teng Wang ⋅ Yanting Lu ⋅ Ruize Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 376
AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
Wenxuan Guo ⋅ Xiuwei Xu ⋅ Yichen Liu ⋅ Xiangyu Li ⋅ Hang Yin ⋅ Huangxing Chen ⋅ Wenzhao Zheng ⋅ Jianjiang Feng ⋅ Jie Zhou ⋅ Jiwen Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 377
Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation
Shuo Wang ⋅ Yucheng Wang ⋅ Guoxin Lian ⋅ Yongcai Wang ⋅ Maiyue Chen ⋅ Kaihui Wang ⋅ Bo Zhang ⋅ Zhizhong Su ⋅ Yutian Zhou ⋅ Wanting Li ⋅ Deying Li ⋅ Zhaoxin Fan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 378
Tavatar: Topology-Aware Gaussian Attribute Derivation for Animatable Human Avatars
Hailin Luo ⋅ Yifan Yang ⋅ Jiazhi Shu ⋅ Zixiong Huang ⋅ Qi Chen ⋅ Qing Du ⋅ Mingkui Tan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 379
PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing
Antonio Oroz ⋅ Matthias Nießner ⋅ Tobias Kirschstein
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 380
PhysHead: Simulation-Ready Gaussian Head Avatars
Berna Kabadayi ⋅ Vanessa Sklyarova ⋅ Wojciech Zielonka ⋅ Justus Thies ⋅ Gerard Pons-Moll
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 381
ReWeaver: Towards Simulation-Ready and Topology-Accurate Garment Reconstruction
Ming Li ⋅ Hui Shan ⋅ Kai Zheng ⋅ Chentao Shen ⋅ Siyu Liu ⋅ Yanwei Fu ⋅ Zhen Chen ⋅ Xiangru Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 382
FHAvatar: Fast and High-Fidelity Reconstruction of Face-and-Hair Composable 3D Head Avatar from Few Casual Captures
Yujie Sun ⋅ Zhuoqiang CAI ⋅ Chaoyue Niu ⋅ Jianchuan Chen ⋅ Zhiwen Chen ⋅ Chengfei Lv ⋅ Fan Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 383
Feed-Forward One-Shot Animatable Textured Mesh Avatar Reconstruction
Yisheng He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 384
Reallocating Attention Across Layers to Reduce Multimodal Hallucination
Haolang Lu ⋅ Bolun Chu ⋅ WeiYe Fu ⋅ Guoshun Nan ⋅ Junning Liu ⋅ Minghui Pan ⋅ Qiankun Li ⋅ Yi Yu ⋅ Hua Wang ⋅ Kun Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 385
VES-RFT: Rewarding Visual Evidence Sensitivity to Mitigate Hallucinations in Large Vision–Language Models
XUEGE HOU ⋅ Wenshuo Li ⋅ Yali Li ⋅ Han Shu ⋅ Yuan Wang ⋅ Xinghao Chen ⋅ Shengjin Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 386
Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression
Hamidreza Dastmalchi ⋅ Aijun An ⋅ Ali Cheraghian ⋅ Hamed Barzamini
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 387
Unstitching the Chimera: Frame-Level Risk and Train-Free Mitigation for Video Hallucination
Songyuan Yang ⋅ Guijian Tang ⋅ Kun Hu ⋅ Haotian Wang ⋅ Shixuan Liu ⋅ Wenjing Yang ⋅ Long Lan ⋅ Huibin Tan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 388
CausalLens: Sensitivity-Guided Multi-Head Causal Intervention for Hallucination Mitigation in Large Vision-Language Models
Junyang Ji ⋅ Qifan Liu ⋅ Wenming Yang ⋅ Zhihai He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 389
Breaking the Illusion: When Positive Meets Negative in Multimodal Decoding
Yubo Jiang ⋅ Yitong An ⋅ Xin Yang ⋅ Abudukelimu Wuerkaixi ⋅ Xuxin Cheng ⋅ Fengying Xie ⋅ Zhiguo Jiang ⋅ Cao Liu ⋅ Ke Zeng ⋅ Haopeng Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 390
FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control
Zhiyuan Zhang ⋅ Can Wang ⋅ Dongdong Chen ⋅ Jing Liao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 391
Diff4Splat: Repurposing Video Diffusion Models for Dynamic Scene Generation
Panwang Pan ⋅ Chenguo Lin ⋅ Chenxin Li ⋅ Jingjing Zhao ⋅ Yuchen Lin ⋅ Haopeng Li ⋅ yunlong lin ⋅ Kairun Wen ⋅ Yixuan Yuan ⋅ Yadong Mu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 392
Spatia: Video Generation with Updatable Spatial Memory
Jinjing Zhao ⋅ Fangyun Wei ⋅ Zhening Liu ⋅ Hongyang Zhang ⋅ Chang Xu ⋅ Yan Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 393
Geometry-as-context: Modulating Explicit 3D in Scene-consistent Video Generation to Geometry Context
JiaKui Hu ⋅ Jialun Liu ⋅ Liying Yang ⋅ Xinliang Zhang ⋅ Kaiwen Li ⋅ Shuang Zeng ⋅ Yuanwei Li ⋅ Haibin Huang ⋅ Chi Zhang ⋅ Yanye Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 394
EgoControl: Controllable Egocentric Video Generation via 3D Full-Body Poses
Enrico Pallotta ⋅ Sina Mokhtarzadeh Azar ⋅ Lars Doorenbos ⋅ Serdar Ozsoy ⋅ Umar Iqbal ⋅ Jürgen Gall
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 395
CustomTex: High-fidelity Indoor Scene Texturing via Multi-Reference Customization
Weilin Chen ⋅ Jiahao Rao ⋅ Wenhao Wang ⋅ Xinyang Li ⋅ Xuan Cheng ⋅ Liujuan Cao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 396
FoleyDesigner: Immersive Stereo Foley Generation with Precise Spatio-Temporal Alignment for Film Clips
Mengtian Li ⋅ Kunyan Dai ⋅ Yi Ding ⋅ Ruobing Ni ⋅ Ying Zhang ⋅ Wenwu Wang ⋅ Zhifeng Xie
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 397
Physical Simulator In-the-Loop Video Generation
Lin Geng Foo ⋅ Mark He Huang ⋅ Alexandros Lattas ⋅ Stylianos Moschoglou ⋅ Thabo Beeler ⋅ Christian Theobalt
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 398
Refracting Reality: Generating Images with Realistic Transparent Objects
Yue Yin ⋅ Enze Tao ⋅ Dylan Campbell
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 399
Generating Humanless Environment Walkthroughs from Egocentric Walking Tour Videos
Yujin Ham ⋅ Junho Kim ⋅ Vivek Boominathan ⋅ Guha Balakrishnan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 400
EgoFlow: Gradient-Guided Flow Matching for Egocentric 6DoF Object Motion Generation
Abhishek Saroha ⋅ Huajian Zeng ⋅ Xingxing Zuo ⋅ Daniel Cremers ⋅ Xi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 401
Spatial-Frequency Collaborative Learning for Occluded Visible-Infrared Person Re-Identification
Jian Yu ⋅ Yujian Feng ⋅ Shuai You ⋅ Zhongkai Zhou ⋅ Fei Wu ⋅ Zhengjun Jing ⋅ Yimu Ji
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 402
Mind the Gap: Transferring Labels to Align Object Detection Datasets
Mikhail Kennerley ⋅ Angelica I Aviles-Rivero ⋅ Carola-Bibiane Schönlieb ⋅ Robby T. Tan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 403
SSM-Aware Token-Efficient VMamba via Adaptive Patch Pruning and Merging for Person Re-Identification
Huiyuan Huang ⋅ SANG MIN YOON
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 404
Tri-Modal Fusion Transformers for UAV-based Object Detection
Craig Iaboni ⋅ Pramod Abichandani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 405
View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification
Quan Zhang ⋅ Zeqiang Cai ⋅ Peiming Zhao ⋅ Jingze Wu ⋅ Cailun Wu ⋅ Hongbo Chen ⋅ Jianhuang Lai
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 406
RHCNet: Residual-Guided Hierarchical Calibration Network for Robust Underwater Object Detection
Yueying Wang ⋅ Yiteng Guo ⋅ Weidong Zhang ⋅ Jie Wen ⋅ Liquan Shen ⋅ Huaicheng Yan ⋅ Xin Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 407
X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection
Youngseo Kim ⋅ Kwan Yun ⋅ Seokhyeon Hong ⋅ Sihun Cha ⋅ Colette Suhjung Koo ⋅ Junyong Noh
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 408
Beyond Duality: A Hybrid Framework of Leveraging Shared and Private Features for RGB-Event Object Detection
Keyao Wang ⋅ Shuai Liu ⋅ Hengda Shi ⋅ Lukui Shi ⋅ Haiyong Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 409
FVBench: Benchmarking Deepfake Video Detection Capability of Large Multimodal Models
Wang Jiarui ⋅ Huiyu Duan ⋅ Juntong Wang ⋅ Xiongkuo Min
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 410
AKCMamba-YOLO: Selective State Space Models For Real-Time Object Detection
Long Chen ⋅ Hui Wang ⋅ Man Xu ⋅ Zexuan Li ⋅ Zizhu Fan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 411
When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse
Yihuan Huang ⋅ Jun Xue ⋅ Liu Jiajun ⋅ Daixian Li ⋅ Tong Zhang ⋅ Zhuolin Yi ⋅ Yanzhen Ren ⋅ Kai Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 412
Your One-Stop Solution for AI-Generated Video Detection
Long Ma ⋅ Zihao Xue ⋅ Yan Wang ⋅ Zhiyuan Yan ⋅ Jin Xu ⋅ Xiaorui Jiang ⋅ Haiyang Yu ⋅ Yong Liao ⋅ Zhen Bi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 413
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
Jiehui Huang ⋅ Yuechen Zhang ⋅ Xu He ⋅ Yuan Gao ⋅ Zhi Cen ⋅ Bin Xia ⋅ Yan Zhou ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Jiaya Jia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 414
Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning
Yifei Li ⋅ Wenzhao Zheng ⋅ Yanran Zhang ⋅ Runze Sun ⋅ Yu Zheng ⋅ Lei Chen ⋅ Jie Zhou ⋅ Jiwen Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 415
HumanVBench: Probing Human-Centric Video Understanding in MLLMs with Automatically Synthesized Benchmarks
Ting Zhou ⋅ Daoyuan Chen ⋅ Qirui Jiao ⋅ Bolin Ding ⋅ Yaliang Li ⋅ Ying Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 416
HERBench: A Benchmark for Multi-Evidence Integration in Video Question Answering
Dan Ben Ami ⋅ Gabriele Serussi ⋅ Kobi Cohen ⋅ Chaim Baskin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 417
Seeing the Scene Matters: Revealing Forgetting in Video Understanding Models with a Scene-Aware Long-Video Benchmark
Seng Nam Chen ⋅ Hao Chen ⋅ Chenglam Ho ⋅ Xinyu Mao ⋅ Jinping Wang ⋅ Yu Zhang ⋅ Chao Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 418
Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model
Yuan Wang ⋅ Borui Liao ⋅ Huijuan Huang ⋅ Jinda Lu ⋅ Ouxiang Li ⋅ Kuien Liu ⋅ Meng Wang ⋅ Xiang Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 419
MovieRecapsQA: A Multimodal Open-Ended Video Question-Answering Benchmark
Shaden Shaar ⋅ Bradon Thymes ⋅ Sirawut Chaixanien ⋅ Claire Cardie ⋅ Bharath Hariharan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 420
Training-free, Perceptually Consistent Low-Resolution Previews with High-Resolution Image for Efficient Workflows of Diffusion Models
Wongi Jeong ⋅ Hoigi Seo ⋅ Se Young Chun
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 421
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers
Moayed Haji Ali ⋅ Willi Menapace ⋅ Ivan Skorokhodov ⋅ Dogyun Park ⋅ Anil Kag ⋅ Michael Vasilkovsky ⋅ Sergey Tulyakov ⋅ Vicente Ordonez ⋅ Aliaksandr Siarohin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 422
Reflection Separation from a Single Image via Joint Latent Diffusion
Zheng-Hui Huang ⋅ Zhixiang Wang ⋅ Yu-Lun Liu ⋅ Yung-Yu Chuang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 423
MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation
Bharath Krishnamurthy ⋅ Ajita Rattani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 424
DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
Chang Zou ⋅ Changlin Li ⋅ Songtao Liu ⋅ Zhao Zhong ⋅ Kailin Huang ⋅ Linfeng Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 425
MatLat: Material Latent Space for PBR Texture Generation
Kyeongmin Yeo ⋅ Yunhong Min ⋅ Jaihoon Kim ⋅ Minhyuk Sung
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 426
VMonarch: Efficient Video Diffusion Transformers with Structured Attention
Cheng Liang ⋅ Haoxian Chen ⋅ Liang Hou ⋅ Qi Fan ⋅ Gangshan Wu ⋅ Xin Tao ⋅ Limin Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 427
DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers
Zitong Wang ⋅ Hang Zhao ⋅ Qianyu Zhou ⋅ Xuequan Lu ⋅ Xiangtai Li ⋅ Hao Yang ⋅ Bo Yang ⋅ Yiren Song
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 428
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration
Danil Tokhchukov ⋅ Aysel Mirzoeva ⋅ Andrey Kuznetsov ⋅ Konstantin Sobolev
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 429
Transition Matching Distillation for Fast Video Generation
Weili Nie ⋅ Julius Berner ⋅ Nanye Ma ⋅ Chao Liu ⋅ Saining Xie ⋅ Arash Vahdat
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 430
Diffusion-Based Makeup Transfer with Facial Region-Aware Makeup Features
Zheng Gao ⋅ Debin Meng ⋅ Yunqi Miao ⋅ Zhensong Zhang ⋅ Songcen Xu ⋅ Ioannis Patras ⋅ Jifei Song
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 431
UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair
Chuanrui Zhang ⋅ Yingshuang Zou ⋅ ZhengXian Wu ⋅ Yonggen Ling ⋅ Yuxiao Yang ⋅ Ziwei Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 432
Query2Uncertainty: Robust Uncertainty Quantification and Calibration for 3D Object Detection under Distribution Shift
Till Beemelmanns ⋅ Alexey Nekrasov ⋅ Stefan Vilceanu ⋅ Jonas Steinhaus ⋅ Timo Woopen ⋅ Bastian Leibe ⋅ Lutz Eckstein
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 433
DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces
Li Zhang ⋅ Mingyu Mei ⋅ Ailing Wang ⋅ Xianhui Meng ⋅ Yan Zhong ⋅ Xinyuan Song ⋅ Liu Liu ⋅ Rujing Wang ⋅ Zaixing He ⋅ Cewu Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 434
PoseGaussian: 6D Pose Estimation for Unseen Objects via Sparse-View Object-Level 3D Gaussian Splatting
Wubin Shi ⋅ Shaoyan Gai ⋅ Feipeng Da
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 435
VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection
Yang Cao ⋅ Feize Wu ⋅ Dave Chen ⋅ Yingji Zhong ⋅ Lanqing Hong ⋅ Dan Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 436
MonoSAOD: Monocular 3D Object Detection with Sparsely Annotated Label
Junyoung Jung ⋅ Seokwon Kim ⋅ Jung Uk Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 437
V2U4Real: A Real-world Large-scale Dataset for Vehicle-to-UAV Cooperative Perception
Weijia Li ⋅ Haoen Xiang ⋅ Tianxu Wang ⋅ Shuaibing Wu ⋅ Qiming Xia ⋅ Cheng Wang ⋅ Chenglu Wen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 438
SketchVL: Policy Optimization via Fine-Grained Credit Assignment for Chart Understanding and More
Muye Huang ⋅ Lingling Zhang ⋅ Yifei Li ⋅ Yaqiang Wu ⋅ Jun Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 439
A Causal Marriage between VLM and IRM from Understanding to Reasoning
Ziliang Chen ⋅ Tianang Xiao ⋅ jusheng zhang ⋅ Yongsen Zheng ⋅ Yang Liu ⋅ Zhao-Rong Lai ⋅ Liang Lin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 440
Why Does RL Generalize Better Than SFT? A Data-Centric Perspective on VLM Post-Training
Aojun Lu ⋅ Tao Feng ⋅ Hangjie Yuan ⋅ Wei Li ⋅ Yanan Sun
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 441
SoC: Semantic Orthogonal Calibration for Test-Time Prompt Tuning
Leo Fillioux ⋅ Omprakash Chakraborty ⋅ Ismail Ben Ayed ⋅ Paul-Henry Cournède ⋅ Stergios Christodoulidis ⋅ Maria Vakalopoulou ⋅ Jose Dolz
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 442
Learning to Select Visual Tools from Experience
Zeyi Huang ⋅ Yuyang Ji ⋅ Anirudh Sundara Rajan ⋅ Zefan Cai ⋅ Wen Xiao ⋅ Haohan Wang ⋅ Junjie Hu ⋅ Yong Jae Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 443
Agile Deliberation: Concept Deliberation for Subjective Visual Classification
Leijie Wang ⋅ Otilia Stretcu ⋅ Wei Qiao ⋅ Thomas Denby ⋅ Krishnamurthy Viswanathan ⋅ Enming Luo ⋅ Chun-Ta Lu ⋅ Tushar Dogra ⋅ Ranjay Krishna ⋅ Ariel Fuxman
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 444
Tea-Adapter: Teacher Adapter for Efficient Conditional Generation
Yinhan Zhang ⋅ Yue Ma ⋅ Fangqiu Yi ⋅ Chenyang Qi ⋅ Chi Zhang ⋅ Kunyu Feng ⋅ Zeyu Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 445
From Failure to Feedback: Group Revision Unlocks Hard Cases in Object-Level Grounding
Yuyuan Liu ⋅ Yiping Ji ⋅ Anjie Le ⋅ Jiayuan Zhu ⋅ Jiazhen Pan ⋅ Can Peng ⋅ Jiajun Deng ⋅ Fengbei Liu ⋅ Junde Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 446
Perception Characteristics Distance: Measuring Stability and Robustness of Perception System in Dynamic Conditions under a Certain Decision Rule
Boyu Jiang ⋅ Liang Shi ⋅ Zhengzhi Lin ⋅ Lanxin Xiang ⋅ Loren Stowe ⋅ Feng Guo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 447
FinPercep-RM: A Fine-grained Reward Model and Co-evolutionary Curriculum for RL-based Real-world Super-Resolution
Yidi Liu ⋅ Zihao Fan ⋅ Jie Huang ⋅ Jie Xiao ⋅ Dong Li ⋅ Wenlong Zhang ⋅ Lei Bai ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 448
Twin-T & TwintVQA: A Reliable Structure–Detail Separating VLM and a Comprehensive Benchmark for Chart and Table Tasks
Jiahua Bao ⋅ Siyao Cheng ⋅ Jiaxing Du ⋅ Qingtao Xia ⋅ Changjiang He ⋅ Zeming Lang ⋅ Jie Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 449
SDGS: Spatial Difference Guided Gaussian Splatting for Simultaneous Localization and 3D Reconstruction
Yijian Tian ⋅ Mingtao Ou ⋅ Pan Zijian ⋅ Xinglong Ji
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 450
RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting
Ji Shi ⋅ Xianghua Ying ⋅ Bowei Xing ⋅ Ruohao Guo ⋅ Wenzhen Yue
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 451
Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors
Chuanqing Zhuang ⋅ Xin Lu ⋅ Zehui Deng ⋅ Zhengda Lu ⋅ Yiqun Wang ⋅ Junqi Diao ⋅ Jun Xiao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 452
Distilling Unsigned Distance Function for Surface Reconstruction from 3D Gaussian Splatting
Qian Li ⋅ Rao Fu ⋅ Jiangtao Li ⋅ Fan Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 453
Exact-GS: Mathematically Rigorous and Accurate 3D Gaussian Splatting for 3D X-ray Reconstruction
Guangpu Yang ⋅ Steffen Kieß ⋅ Hanxiang Luo ⋅ Xingyu Liu ⋅ Sven Simon
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 454
DualSplat: Robust 3D Gaussian Splatting via Pseudo-Mask Bootstrapping from Reconstruction Failures
Xu Wang ⋅ Zhiru Wang ⋅ Shiyun Xie ⋅ Chengwei Pan ⋅ Yisong Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 455
E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction
Yunsoo Kim ⋅ Changki Sung ⋅ Dasol Hong ⋅ Hyun Myung
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 456
Neural Gabor Splatting: Enhanced Gaussian Splatting with Neural Gabor for High-frequency Surface Reconstruction
Haato Watanabe ⋅ Nobuyuki Umetani
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 457
DirectFisheye-GS: Enabling Native Fisheye Input in Gaussian Splatting with Cross-View Joint Optimization
Zhengxian Yang ⋅ Fei Xie ⋅ Xutao Xue ⋅ Rui Zhang ⋅ Taicheng Huang ⋅ Yang Liu ⋅ Mengqi Ji ⋅ Tao Yu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 458
VAD-GS: Visibility-Aware Densification for 3D Gaussian Splatting in Dynamic Urban Scenes
Yikang Zhang ⋅ Rui Fan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 459
GauMVC: Generative Decoupled Gaussian Representation for Human-centric Multi-view Video Compression
Ruoke Yan ⋅ Mingjia Yang ⋅ Xinfeng Zhang ⋅ Haocheng Tang ⋅ Qian Yin ⋅ Zhipin Deng ⋅ Kai Zhang ⋅ Li zhang ⋅ Siwei Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 460
A Geometric Algebra-Informed 3DGS Framework for Wireless Channel Prediction
Jingzhou Shen ⋅ Tianya Zhao ⋅ Xuyu Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 461
RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cue for 3D Object Detection
Xiaokai Bai ⋅ Chenxu Zhou ⋅ Lianqing Zheng ⋅ Jianan Liu ⋅ Siyuan Cao ⋅ Xiaohan Zhang ⋅ Yiming Li ⋅ Zhengzhuang Zhang ⋅ Hui-Liang Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 462
Cross-Instance Gaussian Splatting Registration via Geometry-Aware Feature-Guided Alignment
Roy Amoyal ⋅ Oren Freifeld ⋅ Chaim Baskin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 463
ActivePolicy: Active Gaussian Reconstruction and Optimization Strategy Based on Global-Local Information Gain
Yingzhao Li ⋅ Yanjie Liu ⋅ lijun zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 464
Uncertainty-driven 3D Gaussian Splatting Active Mapping via Anisotropic Visibility Field
Shangjie Xue ⋅ Jesse Dill ⋅ Dhruv Ahuja ⋅ Frank Dellaert ⋅ Panagiotis Tsiotras ⋅ Danfei Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 465
SV-GS: Sparse View 4D Reconstruction with Skeleton-Driven Gaussian Splatting
Jun-Jee Chao ⋅ Volkan Isler
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 466
NimbusGS: Unified 3D Scene Reconstruction under Hybrid Weather
Yanying Li ⋅ Jinyang Li ⋅ Shengfeng He ⋅ Yangyang Xu ⋅ Junyu Dong ⋅ Yong Du
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 467
SparseSplat: Towards Applicable Feed-Forward 3D Gaussian Splatting with Pixel-Unaligned Prediction
Zicheng Zhang ⋅ Xiangting Meng ⋅ Ke Wu ⋅ Wenchao Ding
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 468
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
Jiaze Li ⋅ Hao Yin ⋅ Wenhui Tan ⋅ Jingyang Chen ⋅ Boshen Xu ⋅ Yuxun Qu ⋅ Yijing Chen ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Jian Luan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 469
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
Chi-Pin Huang ⋅ Yunze Man ⋅ Zhiding Yu ⋅ Min-Hung Chen ⋅ Jan Kautz ⋅ Yu-Chiang Frank Wang ⋅ Fu-En Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 470
Unlocking Token Rewards via Training-Free Reward Attribution
WU Sitong ⋅ Haoru Tan ⋅ Bin Xia ⋅ Xichen Zhang ⋅ Jingyao Li ⋅ Shaofeng Zhang ⋅ Xiaojuan Qi ⋅ Bei Yu ⋅ Jiaya Jia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 471
MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images
Ankan Deria ⋅ Komal Kumar ⋅ Adinath Madhavrao Dukre ⋅ Eran Segal ⋅ Salman Khan ⋅ Imran Razzak
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 472
When to Think and When to Look: Uncertainty-Guided Lookback
Jing Bi ⋅ Filippos Bellos ⋅ JunJia Guo ⋅ Yayuan Li ⋅ Chao Huang ⋅ Yolo Yunlong Tang ⋅ Luchuan Song ⋅ Susan Liang ⋅ Zhongfei Zhang ⋅ Jason J. Corso ⋅ Chenliang Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 473
StaR-KVQA: Structured Reasoning Traces for Implicit-Knowledge Visual Question Answering
Zhihao Wen ⋅ Wenkang Wei ⋅ Yuan Fang ⋅ Xingtong Yu ⋅ hui zhang ⋅ Weicheng Zhu ⋅ Xin Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 474
Understanding Counting Mechanisms in Large Language and Vision-Language Models
Hosein Hasani ⋅ Amirmohammad Izadi ⋅ Fatemeh Askari ⋅ Mobin Bagherian ⋅ Sadegh Mohammadian ⋅ Mohammad Izadi ⋅ Mahdieh Baghshah
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 475
CLiViS: Unleashing Cognitive Map through Linguistic-Visual Synergy for Embodied Visual Reasoning
Kailing Li ⋅ Qi'ao Xu ⋅ Tianwen Qian ⋅ Yuqian Fu ⋅ Yang Jiao ⋅ Xiaoling Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 476
Proof-of-Perception: Certified Tool-Using Multimodal Reasoning with Compositional Conformal Guarantees
Arya Fayyazi ⋅ Haleh Akrami
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 477
Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models
Keuntae Kim ⋅ Mingyu Kang ⋅ Yong Suk Choi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 478
Don’t Show Pixels, Show Cues: Unlocking Visual Tool Reasoning in Language Models via Perception Programs
Muhammad Kamran Janjua ⋅ Hugo Silva ⋅ Di Niu ⋅ Bahador Rashidi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 479
Hugging Visual Prompt and Segmentation Tokens: Consistency Learning for Fine-Grained Visual Understanding in MLLMs
jing yang ⋅ Sen Yang ⋅ Boqiang Duan ⋅ Ming Dai ⋅ Wei Zhang ⋅ Xiao Tan ⋅ Kunbin Chen ⋅ Wei He ⋅ Jingdong Wang ⋅ Hanli Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 480
VisionLeaf: Entropy-Guided Leaf-First Reasoning for Efficient and Accurate Think-with-Image
Haokun GUI ⋅ Senqiao Yang ⋅ Mingkang Zhu ⋅ Meng Chu ⋅ WU Sitong ⋅ Changsheng Lu ⋅ Zihao Wang ⋅ Zhuotao Tian ⋅ Jiaya Jia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 481
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
Jingxuan Wei ⋅ Caijun Jia ⋅ Xi Bai ⋅ Xinglong Xu ⋅ Siyuan Li ⋅ Linzhuang Sun ⋅ Bihui Yu ⋅ Conghui He ⋅ Lijun Wu ⋅ Cheng Tan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 482
Beyond Depth: Evaluating the Width-centric Reasoning Capability of MLLMs
Mingrui Chen ⋅ Hexiong Yang ⋅ Haogeng Liu ⋅ Huaibo Huang ⋅ Ran He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 483
GenSplat: Bridging the Generalization Gap in 3DGS Language Comprehension
Fang Liu ⋅ Yuhao Liu ⋅ Ke Xu ⋅ Gerhard Hancke ⋅ Rynson W.H. Lau
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 484
CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering
Yuyang Hong ⋅ Jiaqi Gu ⋅ Yujing Lou ⋅ Lubin Fan ⋅ Qi Yang ⋅ Ying Wang ⋅ Kun Ding ⋅ Yue Wu ⋅ Shiming Xiang ⋅ Jieping Ye
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 485
LoPrune: Efficient Data Pruning for LoRA-Based Fine-Tuning of Vision Transformer
Qiang He ⋅ Yaozong Yang ⋅ KAIBIN WANG ⋅ Ziteng Wei ⋅ Feifei Chen ⋅ Caslon Chua ⋅ Yun Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 486
Multi-Scale Local Speculative Decoding for Image Generation
Elia Peruzzo ⋅ Guillaume Sautiere ⋅ Amir Habibian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 487
Globscope: Toward a Global View of the Loss Landscape
Mashiat Mustaq ⋅ Xavier M.
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 488
RADAR: VQ-VAE Decoder of VAR is a Good Student for Restoring Against Degradation by Acceleration
Ziyang Wang ⋅ Yue Zhang ⋅ Mingdao Wang ⋅ Yasen Zhang ⋅ Teer Song ⋅ Yu Tian ⋅ Xueming LI
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 489
Beyond Single Solution: Multi-Hypothesis Deep Unfolding Network for Image Compressive Sensing
Wenxue Cui ⋅ Hualin Li ⋅ Yuhang Qin ⋅ Yifu Xu ⋅ Xiaopeng Fan ⋅ Debin Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 490
FlashDecoder: Real-Time Latent-to-Pixel Streaming Decoder with Transformers
Minguk Kang ⋅ Suha Kwak
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 491
MambaSIC: Mamba-based Stereo Image Compression with Bi-directional Multi-reference Entropy Model
Shiyu Qin ⋅ XINJIE ZHANG ⋅ Zhening Liu ⋅ Jinpeng Wang ⋅ Bin Chen ⋅ Jiawei Li ⋅ Yifan Ren ⋅ Shu-Tao Xia ⋅ Jun Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 492
Neural Dynamic GI: Random-Access Neural Compression for Temporal Lightmaps in Dynamic Lighting Environments
Jianhui Wu ⋅ Jian Zhou ⋅ Zhi Zhou ⋅ Zhangjin Huang ⋅ Chao Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 493
Discovering Adaptive Task Dependencies for Efficient Multi-Task Representation Compression
Zhimeng Huang ⋅ Rongao Yuan ⋅ Junlong Gao ⋅ Qi Mao ⋅ Siwei Ma ⋅ Wen Gao ⋅ Chuanmin Jia
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 494
OmniZip: Learning a Unified and Lightweight Lossless Compressor for Multi-Modal Data
Yan Zhao ⋅ Zhengxue Cheng ⋅ Junxuan Zhang ⋅ Dajiang Zhou ⋅ Qunshan Gu ⋅ Qi Wang ⋅ Li Song
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 495
Perceptual Neural Video Compression with Color Separation and Rank Chain
xiongzhuang liang ⋅ Chuanbo Tang ⋅ Zhuoyuan Li ⋅ Li Li ⋅ Dong Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 496
Beyond Matching to Tiles: Bridging Unaligned Aerial and Satellite Views for Vision-Only UAV Navigation
Liu Kejia ⋅ Haoyang Zhou ⋅ Ruoyu Xu ⋅ Peicheng Wang ⋅ Mingli Song ⋅ Haofei Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 497
GeoFlow: Real-Time Fine-Grained Cross-View Geolocalization via Iterative Flow Prediction
Ayesh Abu Lehyeh ⋅ Xiaohan Zhang ⋅ Ahmad Arrabi ⋅ Waqas Sultani ⋅ Chen Chen ⋅ Safwan Wshah
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 498
PiLoT: Neural Pixel-to-3D Registration for UAV-based Ego and Target Geo-localization
Xiaoya Cheng ⋅ Long Wang ⋅ Yan Liu ⋅ Xinyi Liu ⋅ Hanlin Tan ⋅ Yu Liu ⋅ Maojun Zhang ⋅ Shen Yan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 499
PAUL: Uncertainty-Guided Partition and Augmentation for Robust Cross-View Geo-Localization under Noisy Correspondence
Zheng Li ⋅ Xueyi Zhang ⋅ Yanming Guo ⋅ Yuxiang Xie ⋅ Ding Zhaoyun ⋅ Siqi Cai ⋅ Haizhou Li ⋅ Mingrui Lao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 500
UniGeoRS: A Unified Benchmark for Tri-view Geo-Localization
Xiao Liang ⋅ Huaizhi Tang ⋅ Feiyang Zhang ⋅ Shiji Yuan ⋅ Chun Hu ⋅ Dezhi Zheng ⋅ Kang Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 501
VGA: Empowering Aerial-Ground Localization by Visual Geometry Alignment
Tao Jun Lin ⋅ Yujiao Shi ⋅ Hongdong Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 502
Watch and Learn: Learning to Use Computers from Online Videos
Chan Hee Song ⋅ Yiwen Song ⋅ Palash Goyal ⋅ Yu Su ⋅ Oriana Riva ⋅ Hamid Palangi ⋅ Tomas Pfister
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 503
OneThinker: All-in-one Reasoning Model for Image and Video
Kaituo Feng ⋅ Manyuan Zhang ⋅ Hongyu Li ⋅ Kaixuan Fan ⋅ shuang chen ⋅ Yilei Jiang ⋅ Dian Zheng ⋅ Peiwen Sun ⋅ Yiyuan Zhang ⋅ Haoze Sun ⋅ Yan Feng ⋅ Peng Pei ⋅ Xunliang Cai ⋅ Xiangyu Yue
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 504
Incentivizing Versatile Video Reasoning in MLLMs via Data-Efficient Reinforcement Learning
Xiaodong Wang ⋅ Zhirong Wu ⋅ Langling Huang ⋅ Yuxi Zheng ⋅ Peixi Peng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 505
Act2See: Emergent Active Visual Perception for Video Reasoning
Martin Q. Ma ⋅ Yuxiao Qu ⋅ Aditya Agrawal ⋅ Willis Guo ⋅ Paul Pu Liang ⋅ Ruslan Salakhutdinov ⋅ Louis-Philippe Morency
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 506
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking
Jingyang Lin ⋅ Jialian Wu ⋅ Jiang Liu ⋅ Ximeng Sun ⋅ Ze Wang ⋅ Xiaodong Yu ⋅ Jiebo Luo ⋅ Zicheng Liu ⋅ Emad Barsoum
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 507
ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Weihao Bo ⋅ Shan Zhang ⋅ Yanpeng Sun ⋅ Jingjing Wu ⋅ Qunyi Xie ⋅ Xiao Tan ⋅ Kunbin Chen ⋅ Wei He ⋅ Xiaofan Li ⋅ Na Zhao ⋅ Jingdong Wang ⋅ Zechao Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 508
ReMoT: Reinforcement Learning with Motion Contrast Triplets
Cong Wan ⋅ Zeyu Guo ⋅ Jiangyang Li ⋅ SongLin Dong ⋅ Yifan Bai ⋅ Lin Peng ⋅ Zhiheng Ma ⋅ Yihong Gong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 509
Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues
Wenjin Hou ⋅ Xiaoxiao Sun ⋅ Hehe Fan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 510
Semantic-Guided Global-Local Collaborative Prompt Learning for Few-Shot Class Incremental Learning
yongxin yan ⋅ Weisen Chen ⋅ Xingye Chen ⋅ Yuanjie Shao ⋅ Zhengrong Zuo ⋅ Wenming Tan ⋅ Wenqi Ren ⋅ Changxin Gao ⋅ Nong Sang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 511
Beyond Heuristic Prompting: A Concept-Guided Bayesian Framework for Zero-Shot Image Recognition
Hui Liu ⋅ Kecheng Chen ⋅ Jialiang Wang ⋅ Xianming Liu ⋅ Wenya Wang ⋅ Haoliang Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 512
One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework
Lorenzo Bianchi ⋅ Giacomo Pacini ⋅ Fabio Carrara ⋅ Nicola Messina ⋅ Giuseppe Amato ⋅ Fabrizio Falchi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 513
Data-Centric Meta-Learning for Robust Few-Shot Generalization
Jongmin Lim ⋅ Soobin CHA ⋅ Jaehun Park ⋅ Inho Oh ⋅ Minho Park ⋅ Kwangsu Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 514
Bridging the Modality Gap in Compositional Zero-Shot Learning via Sparse Alignment and Unimodal Memory Bank
Yang Zhang ⋅ Zhixiang Chi ⋅ Xudong Yan ⋅ Yang Wang ⋅ Songhe Feng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 515
LIFT and PLACE: A Simple, Stable, and Effective Knowledge Distillation Framework for Lightweight Diffusion Models
Hyunsoo Han ⋅ Sangyeop Yeo ⋅ Jaejun Yoo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 516
WaDi: Weight Direction-aware Distillation for One-step Image Synthesis
Lei Wang ⋅ Yang Cheng ⋅ Senmao Li ⋅ Ge Wu ⋅ Yaxing Wang ⋅ Jian Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 517
Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models
Jingchen Sun ⋅ Shaobo Han ⋅ Deep Patel ⋅ Wataru Kohno ⋅ Can Jin ⋅ Changyou Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 518
Beyond Soft Label: Dataset Distillation via Orthogonal Gradient Matching
Deyu Bo ⋅ Xinchao Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 519
BHCast: Unlocking Black Hole Plasma Dynamics from a Single Blurry Image with Long-Term Forecasting
Renbo Tu ⋅ Ali SaraerToosi ⋅ Nicholas S. Conroy ⋅ Gennady Pekhimenko ⋅ Aviad Levis
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 520
RawMetaDiff: Unlocking Extreme Darkness from Dual-Exposure RAW with Meta-Guided Diffusion
Panjun Liu ⋅ Jiyuan Xia ⋅ YUANSHEN GUAN ⋅ Yong Li ⋅ Zhiqiang Lang ⋅ Ruikang Xu ⋅ Chang Chen ⋅ Dehua Song ⋅ Fenglong Song ⋅ Zhiwei Xiong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 521
Prospective Dynamic 3D MRI Reconstruction via Latent-Space Motion Tracking from Single Measurement
Lixuan Chen ⋅ Zhongnan Liu ⋅ Jesse Hamilton ⋅ James M. Balter ⋅ Jeong Joon Park ⋅ Liyue Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 522
Lens Component Deletion based on Differentiable Ray Tracing
Wenguan Zhang ⋅ Qirun Zhang ⋅ Tuo Sun ⋅ Jiajian He ⋅ Jiahui Xu ⋅ Huajun Feng ⋅ Qi Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 523
X-band Radar Non-Line-of-Sight Imaging
Dongyu Du ⋅ Mingkun Zhao ⋅ Yutong Yang ⋅ Dominik Scheuble ⋅ Xiaolong Huang ⋅ Zijian Shao ⋅ Mario Bijelic ⋅ Kaushik Sengupta ⋅ Felix Heide
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 524
3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion
Minchong Chen ⋅ Xiaoyun Yuan ⋅ Junzhe Wan ⋅ Jianing Zhang ⋅ Jun Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 525
UAVLight: A Benchmark for Illumination-Robust 3D Reconstruction in Unmanned Aerial Vehicle (UAV) Scenes
Kang DU ⋅ Xue Liao ⋅ Junpeng Xia ⋅ Chaozheng Guo ⋅ Yi Gu ⋅ Yirui Guan ⋅ Duotun Wang ⋅ Sheng Huang ⋅ Zeyu Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 526
Polarization State Tracing for Reflection Removal and Color-Consistent Reconstruction
Dongyue Wang ⋅ Yang Lu ⋅ Jiandong Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 527
GFRRN: Explore the Gaps in Single Image Reflection Removal
Yu Chen ⋅ Zewei He ⋅ Xingyu Liu ⋅ Zixuan Chen ⋅ Zhe-Ming Lu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 528
Efficient All-Pairs Correlation Volume Sampling for Optical Flow Estimation
Karlis Martins Briedis ⋅ Markus Gross ⋅ Christopher Schroers
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 529
Cross-Slice Knowledge Transfer via Masked Multi-Modal Heterogeneous Graph Contrastive Learning for Spatial Gene Expression Inference
Zhiceng Shi ⋅ Changmiao Wang ⋅ Jun Wan ⋅ Wenwen Min
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 530
Adapting a Pre-trained Single-Cell Foundation Model to Spatial Gene Expression Generation from Histology Images
Donghai Fang ⋅ Yongheng Li ⋅ Zhen WANG ⋅ Yuansong Zeng ⋅ Wenwen Min
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 531
HyperST: Hierarchical Hyperbolic Learning for Spatial Transcriptomics Prediction
Chen Zhang ⋅ Yilu An ⋅ Ying Chen ⋅ Hao Li ⋅ Xitong Ling ⋅ Lihao Liu ⋅ Junjun He ⋅ Yuxiang Lin ⋅ Zihui Wang ⋅ Rongshan Yu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 532
SO(3)-Equivariant ViT-Adapter for Data-Efficient Zero-Shot Sim-to-Real Indoor Panoramic Depth Estimation
Ziyan He ⋅ Qiudan Zhang ⋅ Lin Ma ⋅ Xu Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 533
Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion
Yu Xue ⋅ Longjun Gao ⋅ Yuanqi Su ⋅ HaoAng Lu ⋅ Xiaoning Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 534
XPaintNet: An eXtreme Lightweight Framework for Stereoscopic Conversion without Inpainting Network
Kihwan Yoon ⋅ Juyeon Shin ⋅ Jeongheum Kang ⋅ Sijung Kim ⋅ Minyong Jeon
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 535
MD2E: Modeling Depth-to-Edge Cues for Monocular Metric Depth Estimation
Chao Ning ⋅ Minghe Shen ⋅ Naoto Yokoya
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 536
LiteSense: Lifting Lightweight ToF with RGB for High-Resolution Metric Depth Estimation
Yusheng Li ⋅ Lizhi LOU ⋅ Yan Tang ⋅ Zekai Miao ⋅ Shaoming Zhang ⋅ Jianmei Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 537
3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding
Xiaoye Wang ⋅ Chen Tang ⋅ Xiangyu Yue ⋅ Wei-Hong Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 538
The Midas Touch for Metric Depth
Yu Ma ⋅ Zizhan Guo ⋅ Zuyi Xiong ⋅ Haoran Zhang ⋅ Yi Feng ⋅ Hongbo Zhao ⋅ Hanli Wang ⋅ Rui Fan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 539
Lifting Unlabeled Internet-level Data for 3D Scene Understanding
Yixin Chen ⋅ Yaowei Zhang ⋅ Huangyue Yu ⋅ Junchao He ⋅ Yan Wang ⋅ Jiangyong Huang ⋅ Hongyu Shen ⋅ Junfeng Ni ⋅ Shaofei Wang ⋅ Baoxiong Jia ⋅ Song-Chun Zhu ⋅ Siyuan Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 540
ObjectMorpher: 3D-Aware Image Editing via Deformable 3DGS
Yuhuan Xie ⋅ Aoxuan Pan ⋅ Yihua Huang ⋅ Chirui Chang ⋅ Peng Dai ⋅ Xin Yu ⋅ Xiaojuan Qi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 541
PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image
Ziang Cao ⋅ Fangzhou Hong ⋅ Zhaoxi Chen ⋅ Liang Pan ⋅ Ziwei Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 542
MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer
Weiyu Li ⋅ Antoine Toisoul ⋅ Tom Monnier ⋅ Roman Shapovalov ⋅ Rakesh Ranjan ⋅ Ping Tan ⋅ Andrea Vedaldi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 543
WonderZoom: Multi-Scale 3D World Generation
Jin Cao ⋅ Hong-Xing Yu ⋅ Jiajun Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 544
SceneTok: A Compressed, Diffusable Token Space for 3D Scenes
Mohammad Asim ⋅ Christopher Wewer ⋅ Jan Lenssen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 545
PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction
Xiang Zhang ⋅ Sohyun Yoo ⋅ Hongrui Wu ⋅ Chuan Li ⋅ Jianwen Xie ⋅ Zhuowen Tu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 546
Extend3D: Town-Scale 3D Generation
Seungwoo Yoon ⋅ Jinmo Kim ⋅ Jaesik Park
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 547
Pano3DComposer: Feed-Forward Compositional 3D Scene Generation from Single Panoramic Image
Zidian Qiu ⋅ Ancong Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 548
MeshWeaver: Sparse-Voxel-Guided Surface Weaving for Autoregressive Mesh Generation
Jiale Xu ⋅ Wang Zhao ⋅ Ying Shan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 549
CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation
Chenyu Liu ⋅ Hongze CHEN ⋅ Jingzhi Bao ⋅ Lingting Zhu ⋅ Runze Zhang ⋅ Weikai Chen ⋅ Zeyu HU ⋅ Yingda Yin ⋅ Keyang Luo ⋅ Xin Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 550
CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion
James Jincheng Hu ⋅ Yuxiao Wu ⋅ Youcheng Cai ⋅ Ligang Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 551
LoG3D: Ultra-High-Resolution 3D Shape Modeling via Local-to-Global Partitioning
Xinran Yang ⋅ Shuichang Lai ⋅ Jiangjing Lyu ⋅ Hongjie Li ⋅ Bowen Pan ⋅ Yuanqi Li ⋅ Jie Guo ⋅ Zhengkang Zhou ⋅ Yanwen Guo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 552
MaskFocus: Focusing Policy Optimization on Critical Steps for Masked Image Generation
Guohui Zhang ⋅ Hu Yu ⋅ Xiaoxiao Ma ⋅ Yaning Pan ⋅ Hang Xu ⋅ Jie Huang ⋅ Feng Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 553
Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning
Changlin Li ⋅ Jiawei Zhang ⋅ Shuhao Liu ⋅ Sihao Lin ⋅ Zeyi Shi ⋅ Zhihui Li ⋅ Xiaojun Chang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 554
PosterOmni: Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback
Sixiang Chen ⋅ Jianyu LAI ⋅ Jialin Gao ⋅ Hengyu Shi ⋅ Zhongying Liu ⋅ Tian Ye ⋅ Junfeng Luo ⋅ Xiaoming Wei ⋅ Lei Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 555
GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping
Jing Wang ⋅ Jiajun Liang ⋅ Jie Liu ⋅ Henglin Liu ⋅ Gongye Liu ⋅ Jun Zheng ⋅ Wanyuan Pang ⋅ Ao Ma ⋅ Zhenyu Xie ⋅ Xintao Wang ⋅ Meng Wang ⋅ Pengfei Wan ⋅ Xiaodan Liang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 556
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation
Weijia Mao ⋅ Hao Chen ⋅ Zhenheng Yang ⋅ Mike Zheng Shou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 557
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning
Guanjie Chen ⋅ Shirui Huang ⋅ Yifu Sun ⋅ Kai Liu ⋅ Jianchen Zhu ⋅ Xiaoye Qu ⋅ Yu Cheng ⋅ Peng Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 558
VISTA: A Test-Time Self-Improving Video Generation Agent
Do Xuan Long ⋅ Xingchen Wan ⋅ Hootan Nakhost ⋅ Chen-Yu Lee ⋅ Tomas Pfister ⋅ Sercan O Arik
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 559
Neighbor GRPO: Contrastive ODE Policy Optimization Aligns Flow Models
Dailan He ⋅ Guanlin Feng ⋅ Xingtong Ge ⋅ Yazhe Niu ⋅ Yi Zhang ⋅ Bingqi Ma ⋅ Guanglu Song ⋅ Yu Liu ⋅ Hongsheng Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 560
SMV-EAR: Bring Spatiotemporal Multi-View Representation Learning into Efficient Event-Based Action Recognition
Rui Fan ⋅ Weidong Hao ⋅ Juntao Guan ⋅ Lai Rui ⋅ Tong Wu ⋅ Fanhong Zeng ⋅ Lin Gu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 561
Hierarchical Action Learning for Weakly-Supervised Action Segmentation
Junxian Huang ⋅ Ruichu Cai ⋅ Juntao Fang ⋅ Hao Zhu ⋅ Boyan Xu ⋅ Weilin Chen ⋅ Zijian Li ⋅ Shenghua Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 562
Gamba: Mamba-based graph convolutional network with dynamic graph topology learning for action recognition
Rouyi Zhou ⋅ 漾之 吴 ⋅ Jiajun Wen ⋅ Can Gao ⋅ Feng Liu ⋅ Zhihui Lai ⋅ Linlin Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 563
Beyond Binary Contrast: Modeling Continuous Skeleton Action Spaces with Transitional Anchors
Yingjie Feng ⋅ Yi Wang ⋅ Jiaze Wang ⋅ Anfeng Liu ⋅ Zhuotao Tian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 564
PRISM: Learning a Shared Primitive Space for Transferable Skeleton Action Representation
Di Yang ⋅ Yaohui Wang ⋅ Shuai Shao ⋅ Francois Bremond ⋅ Jiangtao Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 565
TWEO: Transformers Without Extreme Outliers Enables FP8 Training And Quantization For Dummies
Guang Liang ⋅ Jie Shao ⋅ Ningyuan Tang ⋅ Xinyao Liu ⋅ Jianxin Wu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 566
Unified Spherical Frontend: Learning Rotation-Equivariant Representations of Spherical Images from Any Camera
Mukai Yu ⋅ Mosam Dabhi ⋅ Liuyue Xie ⋅ Sebastian Scherer ⋅ László A. Jeni
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 567
The Surprising Effectiveness of Noise Pretraining for Implicit Neural Representations
Kushal Vyas ⋅ Alper Kayabasi ⋅ Daniel Kim ⋅ Vishwanath Saragadam ⋅ Ashok Veeraraghavan ⋅ Guha Balakrishnan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 568
DABO: Difficulty-Aware Bayesian Optimization with Diffusion-Learned Priors
Mengyang Li ⋅ Pinlong Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 569
Towards Knowledge-augmented Bayesian Deep Learning For Computer Vision
Wang Ma ⋅ Hanjing Wang ⋅ Yufei Zhang ⋅ Darsha Udayanga ⋅ Qiang Ji
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 570
NESTOR: A Nested MOE-based Neural Operator for Large-Scale PDE Pre-Training
Dengdi Sun ⋅ Xiaoya Zhou ⋅ Xiao Wang ⋅ Hao Si ⋅ Wanli Lyu ⋅ Jin Tang ⋅ Bin Luo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 571
Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation
Yongchan Chun ⋅ Chanhee Park ⋅ Jeongho Yoon ⋅ Jaehyung Seo ⋅ Heuiseok Lim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 572
Beyond Euclidean Gossip: KL-Barycentric Consensus on Heterogeneous and Imbalanced Images
Lu Xu ⋅ Guosheng Yin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 573
Prime Once, then Reprogram Locally: An Efficient Alternative to Black-Box Service Model Adaptation
Yunbei Zhang ⋅ Chengyi Cai ⋅ Feng Liu ⋅ Jihun Hamm
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 574
Batch Loss Score for Dynamic Data Pruning
Qing Zhou ⋅ Bingxuan Zhao ⋅ Tao Yang ⋅ Hongyuan Zhang ⋅ Junyu Gao ⋅ Qi Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 575
Teacher-Guided Routing for Sparse Vision Mixture-of-Experts
Masahiro Kada ⋅ Ryota Yoshihashi ⋅ Satoshi Ikehata ⋅ Rei Kawakami ⋅ Ikuro Sato
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 576
WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces
Sicheng Fan ⋅ Rui Wan ⋅ Yifei Leng ⋅ Gaoning Liang ⋅ LI LING ⋅ Yanyi Shang ⋅ Dehan Kong
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 577
MangoBench: A Benchmark for Multi-Agent Goal-Conditioned Offline Reinforcement Learning
Yi Wang ⋅ Ningze Zhong ⋅ Zhiheng Fu ⋅ Longguang Wang ⋅ Ye Zhang ⋅ Yulan Guo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 578
iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive Perception
Sarthak Mehrotra ⋅ Sairam Rebbapragada ⋅ Mani Bonthu ⋅ Vineeth Balasubramanian
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 579
MMBench-GUI: A Unified Hierarchical Evaluation Framework for Multi-Platform GUI Agents
Xuehui Wang ⋅ Zhenyu Wu ⋅ JingJing Xie ⋅ Zichen Ding ⋅ Bowen Yang ⋅ Zehao Li ⋅ Zhaoyang Liu ⋅ Qingyun Li ⋅ Xuan Dong ⋅ Zhe Chen ⋅ Weiyun Wang ⋅ Xiangyu Zhao ⋅ Jixuan Chen ⋅ Haodong Duan ⋅ Tianbao Xie ⋅ Chenyu Yang ⋅ Shiqian Su ⋅ Yue Yu ⋅ Yanting Zhang ⋅ Xiangyu Yue ⋅ Weijie Su ⋅ Xizhou Zhu ⋅ Wei Shen ⋅ Jifeng Dai ⋅ Wenhai Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 580
Boosting Vision-Language Models Towards Cross-Domain Incremental Object Detection
Xu Wang ⋅ Zihan Lin ⋅ Yixin Zhang ⋅ Zilei Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 581
UniSpector: Towards Universal Open-set Defect Recognition via Spectral-Contrastive Visual Prompting
Geonuk Kim ⋅ Minhoi Kim ⋅ Kangil Lee ⋅ Minsu Kim ⋅ Hyeonseong Jeon ⋅ JEONGHOON HAN ⋅ Hyoungjoon Lim ⋅ Junho Yim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 582
Unlearning without Forgetting: Securely Removing Targeted Concepts from Large-Scale Vision-Language Open-Vocabulary Detectors
Zhongze Wu ⋅ Xiu Su ⋅ Feng Yang ⋅ Dan Niu ⋅ Shan You ⋅ Yueyi Luo ⋅ Jun Long
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 583
UNI-OOD: Unified Object- and Image-level Out-of-Distribution Detection via Cross-Context Attentive Vision-Language Modeling
Yuchuan Li ⋅ Azadeh Motamedi ⋅ Hyock Ju Kwon ⋅ Chul B Park ⋅ Il-Min Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 584
S2C2Seg: Semantic-Spatial Consistency and Category Optimization for Open-Vocabulary Segmentation
Yuhao Qing ⋅ Yueying Wang ⋅ Chaoyang Chen ⋅ Weidong Zhang ⋅ Jie Wen ⋅ Xin Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 585
NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection
Yupeng Zhang ⋅ Ruize Han ⋅ Zhiwei Chen ⋅ Wei Feng ⋅ Liang Wan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 586
The Missing Point in Vision Transformers for Universal Image Segmentation
Sajjad Shahabodini ⋅ Mobina Mansoori ⋅ Farnoush Bayatmakou ⋅ Jamshid Abouei ⋅ Konstantinos N. Plataniotis ⋅ Arash Mohammadi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 587
PromptMoE: A Segmentation Refinement Framework Leveraging Mixture of Experts for Improved Prompting
Stephen Price ⋅ Danielle L. Cote ⋅ Elke A. Rundensteiner
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 588
The Power of Prior: Training-Free Open-Vocabulary Semantic Segmentation with LLaVA
Bingfeng Zhang ⋅ Siyue Yu ⋅ Hui Li ⋅ Jiahua Lin ⋅ Wenwu Wang ⋅ Jimin Xiao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 589
Beyond Text: Visual Description Assembly by Probabilistic Model for CLIP-based Weakly Supervised Semantic Segmentation
Xianglin Qiu ⋅ Jian Wang ⋅ Xiaolei Wang ⋅ Zhen Zhang ⋅ Jimin Xiao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 590
High-Precision Dichotomous Image Segmentation via Depth Integrity-Prior and Fine-Grained Patch Strategy
Xianjie Liu ⋅ Keren Fu ⋅ Qijun Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 591
GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation
Ken Deng ⋅ Yunhan Yang ⋅ Jingxiang Sun ⋅ Xihui Liu ⋅ Yebin Liu ⋅ Ding Liang ⋅ Yan-Pei Cao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 592
Material Magic Wand: Material-Aware Grouping of 3D Parts in Untextured Meshes
Umangi Jain ⋅ Vladimir G. Kim ⋅ Matheus Gadelha ⋅ Igor Gilitschenski ⋅ Zhiqin Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 593
Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding
Weikai Huang ⋅ Jieyu Zhang ⋅ Taoyang Jia ⋅ Chenhao Zheng ⋅ Ziqi Gao ⋅ Jae Sung Park ⋅ Ranjay Krishna
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 594
Unlocking 3D Affordance Segmentation with 2D Semantic Knowledge
Yu Huang ⋅ Zelin Peng ⋅ Changsong Wen ⋅ Xiaokang Yang ⋅ Wei Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 595
HySeg: Learning Generative Priors for Structure-Aware Remote Sensing Segmentation
Jie Qiu ⋅ XIN LI ⋅ Fan Yang ⋅ Yan Wang ⋅ Dong Yu ⋅ Changying Wang ⋅ Linwei Dai ⋅ Yongxiang Chen ⋅ Youqin Chen ⋅ Jianzhang Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 596
Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization
Inha Kang ⋅ Eunki Kim ⋅ Wonjeong Ryu ⋅ Jaeyo Shin ⋅ Seungjun Yu ⋅ Yoon-Hee Kang ⋅ Seongeun Jeong ⋅ Eunhye Kim ⋅ Soontae Kim ⋅ Hyunjung Shim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 597
MMVIP: A Visible-infrared Paired Dataset for Multi-weather Marine Vision
Yunpeng Yin ⋅ Lihan Wang ⋅ Zhaoshen He ⋅ Xinqiang He ⋅ Xingming Liao ⋅ Zhuowei Wang ⋅ Lianglun Cheng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 598
Beyond Tie Points: Satellite Image Block Adjustment based on Dense Feature Consistency
Yi Liu ⋅ Yi Wan ⋅ Lei Yu ⋅ Panwang Xia ⋅ Qiong Wu ⋅ Yingying Pei ⋅ Xuejun Huang ⋅ Junjian Zhang ⋅ Xiangyuan Cai ⋅ Hongwei Hu ⋅ Yongjun Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 599
Spectrally Distilled Representations Aligned with Instruction-Augmented LLMs for Satellite Imagery
Minh Do ⋅ Wei Xiang ⋅ Kang Han ⋅ Di Wu ⋅ Khoa T. Phan ⋅ Yi-Ping Phoebe Chen ⋅ Gaowen Liu ⋅ Ramana Kompella
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 600
Global Underwater Geolocation from Time-Lapse Polarization Imagery
Sara Aghajanzadeh ⋅ Xiaoyang Bai ⋅ Zhongmin Zhu ⋅ David Forsyth ⋅ Viktor Gruev
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 601
Olbedo: An Albedo and Shading Aerial Dataset for Large-Scale Outdoor Environments
Shuang Song ⋅ Debao Huang ⋅ Deyan Deng ⋅ Haolin Xiong ⋅ Yang Tang ⋅ Yajie Zhao ⋅ Rongjun Qin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 602
PRUE: A Practical Recipe for Field Boundary Segmentation at Scale
Gedeon Muhawenayo ⋅ Caleb Robinson ⋅ Subash Khanal ⋅ Zhanpei Fang ⋅ Isaac Corley ⋅ Alexander Wollam ⋅ Tianyi Gao ⋅ Leonard Strnad ⋅ Ryan Avery ⋅ Lyndon Estes ⋅ Ana Tárano ⋅ Nathan Jacobs ⋅ Hannah Kerner
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 603
SARMAE: Masked Autoencoder for SAR Representation Learning
Danxu Liu ⋅ Di Wang ⋅ Hebaixu Wang ⋅ Haoyang Chen ⋅ Wentao Jiang ⋅ Yilin Cheng ⋅ Haonan Guo ⋅ Wei Cui ⋅ Jing Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 604
LNEM: Lunar Neural Elevation Model
Suwan Lee ⋅ Jo Ryeong Yim ⋅ Kibaek Park ⋅ Dong-Gyu Kim ⋅ Eunhyeuk Kim ⋅ Minsup Jeong ⋅ Chae Kyung Sim ⋅ Seokju Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 605
A Polarized Reflection and Material Dataset of Real World Objects
Jing Yang ⋅ Krithika Dharanikota ⋅ Emily Jia ⋅ Haiwei Chen ⋅ Yajie Zhao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 606
LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
Zihe Yan ⋅ Zhuosheng Zhang ⋅ Jiaping Gui ⋅ Gongshen Liu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 607
RaPA: Enhancing Transferable Targeted Attacks via Random Parameter Pruning
Tongrui Su ⋅ Qingbin Li ⋅ Shengyu Zhu ⋅ Wei Chen ⋅ Xueqi Cheng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 608
All Vehicles Can Lie: Efficient Adversarial Defense in Fully Untrusted-Vehicle Collaborative Perception via Pseudo-Random Bayesian Inference
Yi Yu ⋅ Libing Wu ⋅ Zhuangzhuang Zhang ⋅ Jing Qiu ⋅ Lijuan Huo ⋅ Jiaqi Feng
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 609
A Combination of Noise and Bilateral Filters Achieve Supralinear and Scalable Adversarial Robustness in CNNs
Nicolas Stalder ⋅ Benjamin F Grewe ⋅ Matteo Saponati ⋅ Pau Vilimelis Aceituno
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 610
DeepProtect: Proactive Face-Swapping Defense using Identity Blending and Attribute Distortion
Eungi Lee ⋅ Seung-hyeok Back ⋅ Hyung-Il Kim ⋅ Seok Bong Yoo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 611
Write Where It Matters: Policy-Guided Watermarks for 3D Gaussian Splatting
Nan Li ⋅ Yike Zeng ⋅ Qian Zhang ⋅ Qi Zhang ⋅ Zhiyi Pan ⋅ Wei Feng ⋅ Liang Wan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 612
Attack for Defense: Adversarial Agents for Point Prompt Optimization Empowering Segment Anything Model
Xueyu Liu ⋅ Xiaoyi Zhang ⋅ Meilin Liu ⋅ Guangze Shi ⋅ Jia Shen ⋅ Yujie Wang ⋅ Cai Zhao ⋅ Ziyuan He ⋅ Yongfei Wu ⋅ Mingqiang Wei ⋅ Yongle Chen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 613
RevINN: An End-to-End Invertible Neural Network for Reversible Adversarial Examples Generation
Jielun Huang ⋅ Chi-Man Pun ⋅ Guoheng Huang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 614
CamPI: Physical Adversarial Examples through Camera Power Signal Injection
Yanze Ren ⋅ Mingyuan Lv ⋅ Qinhong Jiang ⋅ Yan Jiang ⋅ Chen Yan ⋅ Xiaoyu Ji ⋅ Wenyuan Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 615
Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs
Lianyu Wang ⋅ Meng Wang ⋅ Huazhu Fu ⋅ Daoqiang Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 616
GraspALL: Adaptive Structural Compensation from Illumination Variation for Robotic Garment Grasping in Any Low-Light Conditions
Haifeng Zhong ⋅ Wenshuo Han ⋅ Zhouyu Wang ⋅ Runyang Feng ⋅ Fan Tang ⋅ Tong-yee Lee ⋅ Zipei Fan ⋅ Ruihai Wu ⋅ Yuran Wang ⋅ Hao Dong ⋅ Hechang Chen ⋅ Hyung Jin Chang ⋅ Yixing Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 617
Opening the Sim-to-Real Door for Humanoid Pixel-to-Action Policy Transfer
Haoru Xue ⋅ Tairan He ⋅ Zi Wang ⋅ Qingwei Ben ⋅ Wenli Xiao ⋅ Zhengyi Luo ⋅ Xingye Da ⋅ Fernando Castañeda ⋅ Guanya Shi ⋅ Shankar Sastry ⋅ Jim Fan ⋅ Yuke Zhu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 618
Learning Cross-View Object Correspondence via Cycle-Consistent Mask Prediction
Shannan Yan ⋅ Leqi Zheng ⋅ Keyu Lv ⋅ Jingchen Ni ⋅ Hongyang Wei ⋅ Jiajun Zhang ⋅ Guangting Wang ⋅ Jing LYU ⋅ Chun Yuan ⋅ Fengyun Rao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 619
RoboWheel: A Data Engine from Real-World Human Demonstrations for Cross-Embodiment Robotic Learning
Yuhong Zhang ⋅ Zihan Gao ⋅ Shengpeng Li ⋅ Ling-Hao Chen ⋅ Kaisheng Liu ⋅ Runqing Cheng ⋅ Xiao Lin ⋅ Junjia Liu ⋅ Zhuoheng Li ⋅ Jingyi Feng ⋅ Ziyan He ⋅ Jintian Lin ⋅ Zheyan Huang ⋅ Zhifang Liu ⋅ Haoqian Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 620
Chain of World: World Model Thinking in Latent Motion
Fuxiang Yang ⋅ Donglin Di ⋅ Lulu Tang ⋅ Xuancheng Zhang ⋅ Lei Fan ⋅ Hao Li ⋅ Wei Chen ⋅ Tonghua Su ⋅ Baorui Ma
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 621
Scalable Feature Matching via State Space Modeling and Sparse Correlation
Choo Sin Wai ⋅ Bo Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 622
Video2Robo: 3DGS-based Synthetic Data from One Video Enables Scalable Robot Learning
Yinan Deng ⋅ Kejia Hu ⋅ Ye Chen ⋅ Jianyu Dou ⋅ Jiahui Wang ⋅ Jingyu Zhao ⋅ Haojia Ao ⋅ Yi Yang ⋅ Yufeng Yue
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 623
ConsisVLA-4D: Advancing Spatiotemporal Consistency in Efficient 3D-Perception and 4D-Reasoning for Robotic Manipulation
Wei Li ⋅ Jizhihui Liu ⋅ Yixing Li ⋅ Junwen Tong ⋅ Rui Shao ⋅ Liqiang Nie
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 624
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models
Senyu Fei ⋅ Siyin Wang ⋅ Li Ji ⋅ Ao Li ⋅ Shiduo Zhang ⋅ Liming Liu ⋅ Jinlong Hou ⋅ Jingjing Gong ⋅ Xianzhong Zhao ⋅ Xipeng Qiu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 625
GeoDexGrasp: Geometry-aware Generation for Data-efficient and Physics-plausible Dexterous Grasping
Bing Han ⋅ Weiyuan Liu ⋅ Changlong Zhang ⋅ Chenxi Wang ⋅ Zhibin Zhao ⋅ Zhi Zhai
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 626
Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment
Yu Fanqi ⋅ Matteo Tiezzi ⋅ Tommaso Apicella ⋅ Cigdem Beyan ⋅ Vittorio Murino
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 627
From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings
Jiajie Zhang ⋅ Sören Schwertfeger ⋅ Alexander Kleiner
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 628
AGiLe: Learning Robust Long-Horizon Manipulation via Affordance-Grounded Bidirectional Latent Planning
Zixuan Chen ⋅ Xiangrong Feng ⋅ Jieqi Shi ⋅ Lin Shao ⋅ Jing Huo ⋅ Yang Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 629
Language-Grounded Decoupled Action Representation for Robotic Manipulation
WuDing Weng ⋅ Tongshu Wu ⋅ Liucheng Chen ⋅ Siyu Xie ⋅ Zheng Wang ⋅ Xing Xu ⋅ Jingkuan Song ⋅ Heng Tao Shen
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 630
Learning to Act Robustly with View-Invariant Latent Actions
Youngjoon Jeong ⋅ Junha Chun ⋅ Taesup Kim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 631
ORBIT: Benchmarking SfM in the Wild with 360° Video
Sara Sabour ⋅ Richard Tucker ⋅ Marcus Brubaker ⋅ Saurabh Saxena ⋅ Junhwa Hur ⋅ Andrea Tagliasacchi ⋅ Deqing Sun ⋅ David J. Fleet ⋅ Richard Szeliski ⋅ Noah Snavely
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 632
SpikeTrack: A Spike-driven Framework for Efficient Visual Tracking
Qiuyang Zhang ⋅ Jiujun Cheng ⋅ Qichao Mao ⋅ Cong Liu ⋅ Yu Fang ⋅ Yuhong Li ⋅ Mengying Ge ⋅ Shangce Gao
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 633
Time Without Time: Pseudo-Temporal Representation for Space-Time Super-Resolution
Hee Min Choi ⋅ Hyoa Kang ⋅ Suji Kim ⋅ Dokwan Oh ⋅ Nam Ik Cho
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 634
Envisioning the Future, One Step at a Time
Stefan Andreas Baumann ⋅ Jannik Wiese ⋅ Tommaso Martorella ⋅ M. Kalayeh ⋅ Björn Ommer
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 635
FlowFM: Advancing Dark Optical Flow Estimation with Flow Matching
Fengyuan Zuo ⋅ Haiyan Jin ⋅ Yuanlin Zhang ⋅ Zhaolin Xiao ⋅ Bin Wang ⋅ Yuerong Mu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 636
Drift-Resilient Temporal Priors for Visual Tracking
Yuqing Huang ⋅ Liting Lin ⋅ Weijun Zhuang ⋅ Zhenyu He ⋅ Xin Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 637
An Efficient Token Compression Framework for Visual Object Tracking
Weijing Wu ⋅ Qihua Liang ⋅ Bineng Zhong ⋅ Haiying Xia ⋅ Zhiyi Mo ⋅ Shuxiang Song
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 638
No Labels, No Look-Ahead: Unsupervised Online Video Stabilization with Classical Priors
Kan Ren ⋅ Gang Wan ⋅ TAO LIU
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 639
From Detection to Association: Learning Discriminative Object Embeddings for Multi-Object Tracking
Yuqing Shao ⋅ Yuchen Yang ⋅ Rui Yu ⋅ Weilong Li ⋅ Xu Guo ⋅ Huaicheng Yan ⋅ Wei Wang ⋅ Xiao Sun
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 640
Momentum Memory for Knowledge Distillation in Computational Pathology
Yongxin Guo ⋅ Hao Lu ⋅ Onur C. ⋅ Zhengjie Zhu ⋅ Muhammet F. ⋅ Metin N.
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 641
Modeling the Brain’s Grammar: ROI-Guided fMRI Pretraining for Transferable and Interpretable Vision Decoding
Yulong Liu ⋅ Hua Xu ⋅ Yiyang Cai ⋅ Chunyang Jiang ⋅ Sirui Han ⋅ Yike Guo
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 642
Joint Spectral Image Reconstruction and Semantic Segmentation with Cooperative Unfolding
Zijun He ⋅ Ping Wang ⋅ Xiaodong Wang ⋅ Chang Chen ⋅ Xin Yuan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 643
X-WIN: Building Chest Radiograph World Model via Predictive Sensing
Zefan Yang ⋅ Ge Wang ⋅ James Hendler ⋅ Mannudeep K. Kalra ⋅ Pingkun Yan
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 644
fMRI-LM: Towards a Universal Foundation Model for Language-Aligned fMRI Understanding
Yuxiang Wei ⋅ Yanteng Zhang ⋅ Xi Xiao ⋅ Chengxuan Qian ⋅ Tianyang Wang ⋅ Vince D. Calhoun
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 645
Tell2Adapt: A Unified Framework for Source Free Unsupervised Domain Adaptation via Vision Foundation Model
Yulong Shi ⋅ Shijie Li ⋅ Ziyi Li ⋅ Lin Qi
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 646
TIM: Temporal Decoupling with Iterative Mutual-Refinement Model for Longitudinal Radiology Report Generation
Yiheng Dong ⋅ Yi Lin ⋅ Shilong Huang ⋅ Xiyan Yang ⋅ Xin Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 647
Ultrasound-CLIP: Semantic-Aware Contrastive Pre-training for Ultrasound Image-Text Understanding
Jiayun Jin ⋅ Haolong Chai ⋅ Xueying Huang ⋅ Xiaoqing Guo ⋅ Zengwei Zheng ⋅ Zhan Zhou ⋅ Junmei Wang ⋅ Xinyu Wang ⋅ Jie Liu ⋅ Binbin Zhou
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 648
Act Like a Pathologist: Tissue-Aware Whole Slide Image Reasoning
Wentao Huang ⋅ Weimin Lyu ⋅ Peiliang Lou ⋅ Qingqiao Hu ⋅ Xiaoling Hu ⋅ Shahira Abousamra ⋅ Wenchao Han ⋅ Ruifeng Guo ⋅ Jiawei Zhou ⋅ Chao Chen ⋅ Chen Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 649
BiGMINT: Biologically-guided Hierarchical Multimodal Integration for Modeling Multiple Compound Activities in Drug Discovery
Pushpak Pati ⋅ Bo Li ⋅ Abbas Rayabat Khan ⋅ Tomé Albuquerque ⋅ Steffen Jaensch ⋅ Amina Mollaysa ⋅ Walid Hassan ⋅ Samantha J. Allen ⋅ Joke Reumers ⋅ Helai P. Mohammad ⋅ Scott Oloff ⋅ Tommaso Mansi ⋅ Rui Liao ⋅ Dmytro S. Lituiev ⋅ Zhoubing Xu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 650
Modeling Spatiotemporal Neural Frames for High Resolution Brain Dynamic
Wanying Qu ⋅ Jianxiong Gao ⋅ Wei Wang ⋅ Yanwei Fu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 651
CMR-RD: Long-Tailed Adaptive VLM for Explainable CMR Diagnosis
Yansong Li ⋅ Zhongxi Qiu ⋅ Yun Tian ⋅ Zheng Jinyu ⋅ Shuo Li
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 652
Clinically-Grounded Counterfactual Reasoning for Medical Video Diagnosis
Jianzhe Gao ⋅ Churan Wang ⋅ Weiyi Zhang ⋅ Jianghua Li ⋅ Lian Li ⋅ Wenguan Wang ⋅ Yixin Zhu ⋅ Yizhou Wang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 653
FBTA: Enabling Single-GPU End-to-End Gigapixel WSI Classification with Feature Bridging and Translation Alignment
Jiuyang Dong ⋅ Jiahan Li ⋅ Junjun Jiang ⋅ Yongbing Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 654
Ultra Diffusion Poser: Diffusion-Based Human Motion Tracking from Sparse Inertial Sensors and Ranging-based Between-sensor Distances
Dominik Hollidt ⋅ Tommaso Bendinelli ⋅ Christian Holz
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 655
Egocentric Visibility-Aware Human Pose Estimation
Peng Dai ⋅ Yu Zhang ⋅ Feng Yiqiang ⋅ ZhenFan Fan ⋅ Yang Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 656
Shoe Style-Invariant and Ground-Aware Learning for Dense Foot Contact Estimation
Daniel Jung ⋅ Kyoung Mu Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 657
OMG-Bench: A New Challenging Benchmark for Skeleton-based Online Micro Hand Gesture Recognition
Haochen Chang ⋅ Pengfei Ren ⋅ Buyuan Zhang ⋅ Da Li ⋅ Tianhao Han ⋅ HaoYang ZHANG ⋅ Liang Xie ⋅ Hongbo Chen ⋅ Erwei Yin
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 658
Recovering Physically Plausible Human-Object Interactions from Monocular Videos
Dingbang Huang ⋅ Etienne Vouga ⋅ Qixing Huang ⋅ Georgios Pavlakos
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 659
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos
Kehong Gong ⋅ Zhengyu Wen ⋅ Xiaoyu He ⋅ Mingxi Xu ⋅ Qi WANG ⋅ Ning Zhang ⋅ Zhengyu Li ⋅ Dongze Lian ⋅ Wei Zhao ⋅ He Xiaoyu ⋅ Mingyuan Zhang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 660
TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures
Hyeongjin Nam ⋅ Daniel Jung ⋅ Kyoung Mu Lee
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 661
SHOW3D: Capturing Scenes of 3D Hands and Objects in the Wild
Patrick Rim ⋅ Kevin Harris ⋅ Braden Copple ⋅ Shangchen Han ⋅ Xu Xie ⋅ Ivan Shugurov ⋅ Sizhe An ⋅ He Wen ⋅ Alex Wong ⋅ Tomas Hodan ⋅ Kun He
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 662
CrossHOI: Learning Cross-View Representations for Monocular 3D Human-Object Interaction Reconstruction
Pei Geng ⋅ Shanshan Zhang ⋅ Jian Yang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 663
Gaussian-Mixture Latent Flow for Stochastic 3D Human Motion Prediction
Yue Ma ⋅ Frederick W. B. Li ⋅ Xiaohui Liang
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 664
SGSoft: Learning Fused Semantic-Geometric Features for 3D Shape Correspondence via Template-Guided Soft Signals
Soyeon Yoon ⋅ Chang Wook Seo ⋅ Hyunjung Shim
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 665
Beyond Single-View Sufficiency: CVBench for Cross-View Human Understanding
Tianchen Guo ⋅ Chen Liu ⋅ Xin Yu
Poster
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F 666
Breaking Spurious Correlations: Uncertainty-Driven Causal Transformers for AU Detection
Yuru Wang ⋅ Yue Zhou
Poster Session
Fri Jun 05 09:45 AM -- 11:45 AM (PDT) @ ExHall A-F None
Poster Session 1 & Exhibit Hall
Art Program
Fri Jun 05 09:45 AM -- 05:00 PM (PDT) @ ExHall F None
Art Exhibition
Luba Elliott
Art Program
Fri Jun 05 10:00 AM -- 10:30 AM (PDT) @ ExHall F None
Art Gallery Tour with Curator and Artists
Luba Elliott
Oral
Fri Jun 05 12:00 PM -- 12:12 PM (PDT) @ Mile High Ballroom 3A - 4A None
4D Primitive-Mâché: Glueing Primitives for Persistent 4D Scene Reconstruction
Kirill Mazur ⋅ Marwan Taher ⋅ Andrew J. Davison
Oral
Fri Jun 05 12:00 PM -- 12:12 PM (PDT) @ Four Seasons Ballroom None
3DReflecNet: A Large-Scale Dataset for 3D Reconstruction of Reflective, Transparent, and Low-Texture Objects
Zhicheng Liang ⋅ Haoyi Yu ⋅ Boyan Li ⋅ Dayou Zhang ⋅ Zijian Cao ⋅ Tianyi Gong ⋅ Junhua Liu ⋅ Shuguang Cui ⋅ Fangxin Wang
Oral
Fri Jun 05 12:00 PM -- 12:12 PM (PDT) @ Bluebird Ballroom None
MAMMA: Markerless Accurate Multi-person Motion Acquisition
Hanz Cuevas Velasquez ⋅ Anastasios Yiannakidis ⋅ Soyong Shin ⋅ Giorgio Becherini ⋅ Markus Höschle ⋅ Joachim Tesch ⋅ Taylor Obersat ⋅ Tsvetelina Alexiadis ⋅ Eni Halilaj ⋅ Michael J. Black
Oral
Fri Jun 05 12:00 PM -- 12:12 PM (PDT) @ Mile High Ballroom 1A - 2A None
Energy-GS: Image Energy-guided Pose Alignment Gaussian Splatting with redesigned pose gradient flow
Yu Gao ⋅ Lutong Su ⋅ Ruixiang Huang ⋅ Tianji Jiang ⋅ Jiadong Tang ⋅ Yufeng Yue ⋅ Yi Yang
Oral Session
Fri Jun 05 12:00 PM -- 01:15 PM (PDT) @ Bluebird Ballroom None
Oral Session 2A: 3D Reconstruction
Oral Session
Fri Jun 05 12:00 PM -- 01:15 PM (PDT) @ Mile High Ballroom 3A - 4A None
Oral Session 2D: Spatio-Temporal Reconstruction
Oral Session
Fri Jun 05 12:00 PM -- 01:15 PM (PDT) @ Mile High Ballroom 1A - 2A None
Oral Session 2C: Gaussian Splatting & Reconstruction
Oral Session
Fri Jun 05 12:00 PM -- 01:15 PM (PDT) @ Four Seasons Ballroom None
Oral Session 2B: Materials & Lighting
Oral
Fri Jun 05 12:12 PM -- 12:25 PM (PDT) @ Mile High Ballroom 1A - 2A None
MeshSplatting: Differentiable Rendering with Opaque Meshes
Jan Held ⋅ Sanghyun Son ⋅ Renaud Vandeghen ⋅ Daniel Rebain ⋅ Matheus Gadelha ⋅ Yi Zhou ⋅ Anthony Cioppa ⋅ Ming C. Lin ⋅ Marc Van Droogenbroeck ⋅ Andrea Tagliasacchi
Oral
Fri Jun 05 12:12 PM -- 12:25 PM (PDT) @ Bluebird Ballroom None
Natural Human Motion Recovery by Aligning High-Order Temporal Dynamics from Monocular Videos
Dingkun Wei ⋅ Zehong Shen ⋅ Yan Xia ⋅ Yujun Shen ⋅ Georgios Pavlakos ⋅ Xiaowei Zhou
Oral
Fri Jun 05 12:12 PM -- 12:25 PM (PDT) @ Mile High Ballroom 3A - 4A None
Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
Chuhan Zhang ⋅ Guillaume Le Moing ⋅ Skanda Koppula ⋅ Ignacio Rocco ⋅ Liliane Momeni ⋅ Junyu Xie ⋅ Shuyang Sun ⋅ Rahul Sukthankar ⋅ Joëlle K. Barral ⋅ Raia Hadsell ⋅ Zoubin Ghahramani ⋅ Andrew Zisserman ⋅ Junlin Zhang ⋅ Mehdi S. M. Sajjadi
Oral
Fri Jun 05 12:12 PM -- 12:25 PM (PDT) @ Four Seasons Ballroom None
GLINT: Modeling Scene-Scale Transparency via Gaussian Radiance Transport
Youngju Na ⋅ Jaeseong Yun ⋅ Soohyun Ryu ⋅ Hyunsu Kim ⋅ Sung-Eui Yoon ⋅ Suyong Yeon
Oral
Fri Jun 05 12:25 PM -- 12:37 PM (PDT) @ Four Seasons Ballroom None
Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy
Shuo Chen ⋅ Yijin Li ⋅ Xi Zheng ⋅ Guofeng Zhang
Oral
Fri Jun 05 12:25 PM -- 12:37 PM (PDT) @ Mile High Ballroom 3A - 4A None
FUSER: Feed-Forward Multiview 3D Registration Transformer and SE(3)^N Diffusion Refinement
Haobo Jiang ⋅ Jin Xie ⋅ Jian Yang ⋅ Liang Yu ⋅ Jianmin Zheng
Oral
Fri Jun 05 12:25 PM -- 12:37 PM (PDT) @ Bluebird Ballroom None
PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning
Jianqi Chen ⋅ Biao Zhang ⋅ Xiangjun Tang ⋅ Peter Wonka
Oral
Fri Jun 05 12:25 PM -- 12:37 PM (PDT) @ Mile High Ballroom 1A - 2A None
Proxy-GS: Unified Occlusion Priors for Training and Inference in Structured 3D Gaussian Splatting
Yuanyuan Gao ⋅ YUNING GONG ⋅ Yifei Liu ⋅ Jingfeng Li ⋅ Dan Xu ⋅ Yanci Zhang ⋅ Dingwen Zhang ⋅ Xiao Sun ⋅ Zhihang Zhong
Art Program
Fri Jun 05 12:30 PM -- 01:30 PM (PDT) @ Room 201 None
Art Panel
Luba Elliott
Oral
Fri Jun 05 12:37 PM -- 12:50 PM (PDT) @ Bluebird Ballroom None
SAM 3D Body: Robust Full-Body Human Mesh Recovery
Xitong Yang ⋅ Devansh Kukreja ⋅ Don Pinkus ⋅ Taosha Fan ⋅ Jinhyung Park ⋅ Soyong Shin ⋅ Jinkun Cao ⋅ Jia-Wei Liu ⋅ Nicolás Ugrinovic ⋅ Anushka Sagar ⋅ Jitendra Malik ⋅ Matt Feiszli ⋅ Piotr Dollár ⋅ Kris Kitani
Oral
Fri Jun 05 12:37 PM -- 12:50 PM (PDT) @ Four Seasons Ballroom None
PhyGaP: Physically-Grounded Gaussians with Polarization Cues
Jiale Wu ⋅ Xiaoyang Bai ⋅ Zongqi He ⋅ Weiwei Xu ⋅ YIFAN PENG
Oral
Fri Jun 05 12:37 PM -- 12:50 PM (PDT) @ Mile High Ballroom 3A - 4A None
Residual Primitive Fitting of 3D Shapes with SuperFrusta
Aditya Ganeshan ⋅ Matheus Gadelha ⋅ Thibault Groueix ⋅ Zhiqin Chen ⋅ Siddhartha Chaudhuri ⋅ Vladimir G. Kim ⋅ Wang Yifan ⋅ Daniel Ritchie
Oral
Fri Jun 05 12:37 PM -- 12:50 PM (PDT) @ Mile High Ballroom 1A - 2A None
RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting
Xuezhen Wang ⋅ Li Ma ⋅ Yulin Shen ⋅ Zeyu Wang ⋅ Pedro V. Sander
Oral
Fri Jun 05 12:50 PM -- 01:02 PM (PDT) @ Bluebird Ballroom None
SAM 3D: 3Dfy Anything in Images
Xingyu Chen ⋅ Fu-Jen Chu ⋅ Pierre Gleize ⋅ Kevin J Liang ⋅ Alexander Sax ⋅ Hao Tang ⋅ Weiyao Wang ⋅ Michelle Guo ⋅ Thibaut Hardin ⋅ Xiang Li ⋅ Aohan Lin ⋅ Jia-Wei Liu ⋅ Ziqi Ma ⋅ Anushka Sagar ⋅ Bowen Song ⋅ Xiaodong Wang ⋅ Jianing "Jed" Yang ⋅ Bowen Zhang ⋅ Piotr Dollár ⋅ Georgia Gkioxari ⋅ Matt Feiszli ⋅ Jitendra Malik
Oral
Fri Jun 05 12:50 PM -- 01:02 PM (PDT) @ Four Seasons Ballroom None
PPISP: Physically-Plausible Compensation and Control of Photometric Variations in Radiance Field Reconstruction
Isaac Deutsch ⋅ Nicolas Moënne-Loccoz ⋅ Gavriel State ⋅ Žan Gojčič
Oral
Fri Jun 05 12:50 PM -- 01:02 PM (PDT) @ Mile High Ballroom 1A - 2A None
Selfi: Self-improving Reconstruction Engine via 3D Geometric Feature Alignment
Youming Deng ⋅ Songyou Peng ⋅ Junyi Zhang ⋅ Kathryn Heal ⋅ Tiancheng Sun ⋅ John Flynn ⋅ Steve Marschner ⋅ Lucy Chai
Oral
Fri Jun 05 12:50 PM -- 01:02 PM (PDT) @ Mile High Ballroom 3A - 4A None
SmokeSVD: Smoke Reconstruction from A Single View via Progressive Novel View Synthesis and Refinement with Diffusion Models
Chen Li ⋅ Shanshan Dong ⋅ Sheng Qiu ⋅ Jianmin Han ⋅ Yibo Zhao ⋅ Zan Gao ⋅ Taku Komura ⋅ Kemeng Huang
Oral
Fri Jun 05 01:02 PM -- 01:15 PM (PDT) @ Mile High Ballroom 3A - 4A None
SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model
Jiayuan Du ⋅ Yiming Zhao ⋅ Zhenglong Guo ⋅ Yong Pan ⋅ Wenbo Hou ⋅ Zhihui Hao ⋅ Kun Zhan ⋅ Qijun Chen
Oral
Fri Jun 05 01:02 PM -- 01:15 PM (PDT) @ Mile High Ballroom 1A - 2A None
Z-Order Transformer for Feed-Forward Gaussian Splatting
Can Wang ⋅ Lei Liu ⋅ Wei Jiang ⋅ Dong Xu
Oral
Fri Jun 05 01:02 PM -- 01:15 PM (PDT) @ Bluebird Ballroom None
SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge
Yumeng He ⋅ Ying Jiang ⋅ Jiayin Lu ⋅ Yin Yang ⋅ Chenfanfu Jiang
Oral
Fri Jun 05 01:02 PM -- 01:15 PM (PDT) @ Four Seasons Ballroom None
SeeGroup: Multi-Layer Depth Estimation of Transparent Surfaces via Self-Determined Grouping
Hongyu Wen ⋅ Jia Deng
Break
Fri Jun 05 01:15 PM -- 01:30 PM (PDT) None
Courtesy Break
Keynote
Fri Jun 05 01:45 PM -- 02:45 PM (PDT) @ Bluebird Ballroom None
Programmable Biology: Generative AI for Molecular Design
Simon Kohl
Poster Setup
Fri Jun 05 02:30 PM -- 03:00 PM (PDT) @ ExHall A None
Poster Setup
Demonstration
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall F None
Demos
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 1
MAMMA: Markerless Accurate Multi-person Motion Acquisition
Hanz Cuevas Velasquez ⋅ Anastasios Yiannakidis ⋅ Soyong Shin ⋅ Giorgio Becherini ⋅ Markus Höschle ⋅ Joachim Tesch ⋅ Taylor Obersat ⋅ Tsvetelina Alexiadis ⋅ Eni Halilaj ⋅ Michael J. Black
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 2
Natural Human Motion Recovery by Aligning High-Order Temporal Dynamics from Monocular Videos
Dingkun Wei ⋅ Zehong Shen ⋅ Yan Xia ⋅ Yujun Shen ⋅ Georgios Pavlakos ⋅ Xiaowei Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 3
PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning
Jianqi Chen ⋅ Biao Zhang ⋅ Xiangjun Tang ⋅ Peter Wonka
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 4
SAM 3D Body: Robust Full-Body Human Mesh Recovery
Xitong Yang ⋅ Devansh Kukreja ⋅ Don Pinkus ⋅ Taosha Fan ⋅ Jinhyung Park ⋅ Soyong Shin ⋅ Jinkun Cao ⋅ Jia-Wei Liu ⋅ Nicolás Ugrinovic ⋅ Anushka Sagar ⋅ Jitendra Malik ⋅ Matt Feiszli ⋅ Piotr Dollár ⋅ Kris Kitani
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 5
SAM 3D: 3Dfy Anything in Images
Xingyu Chen ⋅ Fu-Jen Chu ⋅ Pierre Gleize ⋅ Kevin J Liang ⋅ Alexander Sax ⋅ Hao Tang ⋅ Weiyao Wang ⋅ Michelle Guo ⋅ Thibaut Hardin ⋅ Xiang Li ⋅ Aohan Lin ⋅ Jia-Wei Liu ⋅ Ziqi Ma ⋅ Anushka Sagar ⋅ Bowen Song ⋅ Xiaodong Wang ⋅ Jianing "Jed" Yang ⋅ Bowen Zhang ⋅ Piotr Dollár ⋅ Georgia Gkioxari ⋅ Matt Feiszli ⋅ Jitendra Malik
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 6
SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge
Yumeng He ⋅ Ying Jiang ⋅ Jiayin Lu ⋅ Yin Yang ⋅ Chenfanfu Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 7
3DReflecNet: A Large-Scale Dataset for 3D Reconstruction of Reflective, Transparent, and Low-Texture Objects
Zhicheng Liang ⋅ Haoyi Yu ⋅ Boyan Li ⋅ Dayou Zhang ⋅ Zijian Cao ⋅ Tianyi Gong ⋅ Junhua Liu ⋅ Shuguang Cui ⋅ Fangxin Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 8
GLINT: Modeling Scene-Scale Transparency via Gaussian Radiance Transport
Youngju Na ⋅ Jaeseong Yun ⋅ Soohyun Ryu ⋅ Hyunsu Kim ⋅ Sung-Eui Yoon ⋅ Suyong Yeon
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 9
Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy
Shuo Chen ⋅ Yijin Li ⋅ Xi Zheng ⋅ Guofeng Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 10
PhyGaP: Physically-Grounded Gaussians with Polarization Cues
Jiale Wu ⋅ Xiaoyang Bai ⋅ Zongqi He ⋅ Weiwei Xu ⋅ YIFAN PENG
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 11
PPISP: Physically-Plausible Compensation and Control of Photometric Variations in Radiance Field Reconstruction
Isaac Deutsch ⋅ Nicolas Moënne-Loccoz ⋅ Gavriel State ⋅ Žan Gojčič
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 12
SeeGroup: Multi-Layer Depth Estimation of Transparent Surfaces via Self-Determined Grouping
Hongyu Wen ⋅ Jia Deng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 13
Energy-GS: Image Energy-guided Pose Alignment Gaussian Splatting with redesigned pose gradient flow
Yu Gao ⋅ Lutong Su ⋅ Ruixiang Huang ⋅ Tianji Jiang ⋅ Jiadong Tang ⋅ Yufeng Yue ⋅ Yi Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 14
MeshSplatting: Differentiable Rendering with Opaque Meshes
Jan Held ⋅ Sanghyun Son ⋅ Renaud Vandeghen ⋅ Daniel Rebain ⋅ Matheus Gadelha ⋅ Yi Zhou ⋅ Anthony Cioppa ⋅ Ming C. Lin ⋅ Marc Van Droogenbroeck ⋅ Andrea Tagliasacchi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 15
Proxy-GS: Unified Occlusion Priors for Training and Inference in Structured 3D Gaussian Splatting
Yuanyuan Gao ⋅ YUNING GONG ⋅ Yifei Liu ⋅ Jingfeng Li ⋅ Dan Xu ⋅ Yanci Zhang ⋅ Dingwen Zhang ⋅ Xiao Sun ⋅ Zhihang Zhong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 16
RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting
Xuezhen Wang ⋅ Li Ma ⋅ Yulin Shen ⋅ Zeyu Wang ⋅ Pedro V. Sander
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 17
Selfi: Self-improving Reconstruction Engine via 3D Geometric Feature Alignment
Youming Deng ⋅ Songyou Peng ⋅ Junyi Zhang ⋅ Kathryn Heal ⋅ Tiancheng Sun ⋅ John Flynn ⋅ Steve Marschner ⋅ Lucy Chai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 18
Z-Order Transformer for Feed-Forward Gaussian Splatting
Can Wang ⋅ Lei Liu ⋅ Wei Jiang ⋅ Dong Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 19
4D Primitive-Mâché: Glueing Primitives for Persistent 4D Scene Reconstruction
Kirill Mazur ⋅ Marwan Taher ⋅ Andrew J. Davison
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 20
Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
Chuhan Zhang ⋅ Guillaume Le Moing ⋅ Skanda Koppula ⋅ Ignacio Rocco ⋅ Liliane Momeni ⋅ Junyu Xie ⋅ Shuyang Sun ⋅ Rahul Sukthankar ⋅ Joëlle K. Barral ⋅ Raia Hadsell ⋅ Zoubin Ghahramani ⋅ Andrew Zisserman ⋅ Junlin Zhang ⋅ Mehdi S. M. Sajjadi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 21
FUSER: Feed-Forward Multiview 3D Registration Transformer and SE(3)^N Diffusion Refinement
Haobo Jiang ⋅ Jin Xie ⋅ Jian Yang ⋅ Liang Yu ⋅ Jianmin Zheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 22
Residual Primitive Fitting of 3D Shapes with SuperFrusta
Aditya Ganeshan ⋅ Matheus Gadelha ⋅ Thibault Groueix ⋅ Zhiqin Chen ⋅ Siddhartha Chaudhuri ⋅ Vladimir G. Kim ⋅ Wang Yifan ⋅ Daniel Ritchie
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 23
SmokeSVD: Smoke Reconstruction from A Single View via Progressive Novel View Synthesis and Refinement with Diffusion Models
Chen Li ⋅ Shanshan Dong ⋅ Sheng Qiu ⋅ Jianmin Han ⋅ Yibo Zhao ⋅ Zan Gao ⋅ Taku Komura ⋅ Kemeng Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 24
SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model
Jiayuan Du ⋅ Yiming Zhao ⋅ Zhenglong Guo ⋅ Yong Pan ⋅ Wenbo Hou ⋅ Zhihui Hao ⋅ Kun Zhan ⋅ Qijun Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 25
Affostruction: 3D Affordance Grounding with Generative Reconstruction
Chunghyun Park ⋅ Seunghyeon Lee ⋅ Minsu Cho
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 26
MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction
JongMin Lee ⋅ Seungyeop Kang ⋅ Sungjoo Yoo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 27
Unified Primitive Proxies for Structured Shape Completion
Zhaiyu Chen ⋅ Yuqing Wang ⋅ Xiao Xiang Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 28
ART: Articulated Reconstruction Transformer
Zizhang Li ⋅ Cheng Zhang ⋅ Zhengqin Li ⋅ Henry Howard-Jenkins ⋅ Zhaoyang Lv ⋅ Chen Geng ⋅ Jiajun Wu ⋅ Richard Newcombe ⋅ Jakob Engel ⋅ Zhao Dong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 29
SCE-SLAM: Scale-Consistent Monocular SLAM via Scene Coordinate Embeddings
Yuchen Wu ⋅ Jiahe Li ⋅ Xiaohan Yu ⋅ Lina Yu ⋅ Jin Zheng ⋅ Xiao Bai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 30
S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs
Yuzhou Ji ⋅ Qijian Tian ⋅ He Zhu ⋅ Xiaoqi Jiang ⋅ Guangzhi Cao ⋅ Lizhuang Ma ⋅ Yuan Xie ⋅ Xin Tan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 31
Pip-Stereo: Progressive Iterations Pruner for Iterative Optimization based Stereo Matching
Jintu Zheng ⋅ Qizhe Liu ⋅ Huangxin Xu ⋅ Zhuojie Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 32
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
Bowen Wen ⋅ Shaurya Dewan ⋅ Stan Birchfield
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 33
E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Qitao Zhao ⋅ Hao Tan ⋅ Qianqian Wang ⋅ Sai Bi ⋅ Kai Zhang ⋅ Kalyan Sunkavalli ⋅ Shubham Tulsiani ⋅ Hanwen Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 34
QVGGT: Post-Training Quantized Visual Geometry Grounded Transformer
Zhizhen Pan ⋅ Hesong Wang ⋅ Huan Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 35
SRGCD: Stability-Driven Region Growth Framework for 3D Change Detection
Yue Wu ⋅ Tao Peng ⋅ Yongzhe Yuan ⋅ Kaiyuan Feng ⋅ Hao Li ⋅ Maoguo Gong ⋅ Qiguang Miao ⋅ Wenping Ma
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 36
D-Prism: Differentiable Primitives for Structured Dynamic Modeling
Xingyuan Yu ⋅ Yijin Li ⋅ Chong Zeng ⋅ Yuhang Ming ⋅ Hujun Bao ⋅ Guofeng Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 37
STAC: Plug-and-Play Spatio-Temporal Aware Cache Compression for Streaming 3D Reconstruction
Runze Wang ⋅ Yuxuan Song ⋅ Youcheng Cai ⋅ Ligang Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 38
Stabilizing Streaming Video Geometry via Dynamic Feature Normalization
Xiaoyang Lyu ⋅ Muxin Liu ⋅ Xiaoshan Wu ⋅ Ruicheng Wang ⋅ Yihua Huang ⋅ Yangtian Sun ⋅ Shaoshuai Shi ⋅ Xiaojuan Qi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 39
LaS-Comp: Zero-shot 3D Completion with Latent–Spatial Consistency
Weilong Yan ⋅ Li Haipeng ⋅ Hao Xu ⋅ Nianjin Ye ⋅ Yihao Ai ⋅ Shuaicheng Liu ⋅ Jingyu Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 40
Pano360: Perspective to Panoramic Vision with Geometric Consistency
Zhengdong Zhu ⋅ Weiyi Xue ⋅ Zuyuan Yang ⋅ Wenlve Zhou ⋅ Zhiheng Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 41
EfficientMonoHair: Fast Strand-Level Reconstruction from Monocular Video via Multi-View Direction Fusion
Da Li ⋅ Dominik Engel ⋅ Deng Luo ⋅ Ivan Viola
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 42
OSPO: Object-Centric Self-Improving Preference Optimization for Text-to-Image Generation
Yoonjin Oh ⋅ Yongjin Kim ⋅ Hyomin Kim ⋅ Donghwan Chi ⋅ Sungwoong Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 43
MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis
Xiangyu Bai ⋅ He Liang ⋅ Bishoy Galoaa ⋅ Utsav Nandi ⋅ Shayda Moezzi ⋅ Yuhang He ⋅ Sarah Ostadabbas
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 44
StyleTextGen: Style-Conditioned Multilingual Scene Text Generation
Zeyu Chen ⋅ Fangmin Zhao ⋅ Yan Shu ⋅ Yichao Liu ⋅ Liu Yu ⋅ Yu ZHOU
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 45
CRAFT-LoRA: Content-Style Personalization via Rank-Constrained Adaptation and Training-Free Fusion
Yu Li ⋅ Yujun Cai ⋅ Chi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 46
OneHOI: Unifying Human-Object Interaction Generation and Editing
Jiun Tian Hoe ⋅ Weipeng Hu ⋅ Xudong Jiang ⋅ Yap-Peng Tan ⋅ Chee Seng Chan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 47
GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering
Xincheng Shuai ⋅ Ziye Li ⋅ Henghui Ding ⋅ Dacheng Tao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 48
Self-Paced and Self-Corrective Masked Prediction for Movie Trailer Generation
Sidan Zhu ⋅ Hongteng Xu ⋅ Dixin Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 49
TV2TV: A Unified Framework for Interleaved Language and Video Generation
Xiaofeng Zhang ⋅ Youssef Emad ⋅ Melissa Hall ⋅ John Nguyen ⋅ Karthik Padthe ⋅ Liam Robbins ⋅ Amir Bar ⋅ Delong Chen ⋅ Michal Drozdzal ⋅ Maha Elbayad ⋅ Yushi Hu ⋅ Shang-Wen Li ⋅ Jakob Verbeek ⋅ XuDong Wang ⋅ Marjan Ghazvininejad ⋅ Luke Zettlemoyer ⋅ Emily Dinan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 50
Narrative Weaver: Towards Controllable Long-Range Visual Consistency with Multi-Modal Conditioning
Zhengjian Yao ⋅ Yongzhi Li ⋅ Xinyuan Gao ⋅ Quan Chen ⋅ Peng Jiang ⋅ Yanye Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 51
Ref4D-VideoBench: Four-Dimensional Reference-Based Evaluation of Text-to-Video Generative Models
Jiajia Wei ⋅ YuJia He ⋅ Yuhan Hou ⋅ Hang Qi ⋅ Sihua Wang ⋅ Jincheng Shi ⋅ Kwok Fung Li ⋅ Zibin Zheng ⋅ Weibin Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 52
PureCC: Pure Learning for Text-to-Image Concept Customization
Zhichao Liao ⋅ Xiaole Xian ⋅ Qingyu Li ⋅ Wenyu Qin ⋅ Meng Wang ⋅ Weicheng Xie ⋅ Siyang Song ⋅ Pingfa Feng ⋅ Long ZENG ⋅ Liang Pan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 53
Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation
Shuang Li ⋅ Chao Deng ⋅ Hang Chen ⋅ Liqun Liu ⋅ Zhenyu Hu ⋅ Te Cao ⋅ Mengge Xue ⋅ Yuan Chen ⋅ Peng Shu ⋅ Huan Yu ⋅ Jie Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 54
Yume1.5: A Text-Controlled Interactive World Generation Model
Xiaofeng Mao ⋅ Zhen Li ⋅ Chuanhao Li ⋅ Xiaojie Xu ⋅ Kaining Ying ⋅ Kaipeng Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 55
PosterReward: Unlocking Accurate Evaluation for High-Quality Graphic Design Generation
Jianyu LAI ⋅ Sixiang Chen ⋅ Jialin Gao ⋅ Hengyu Shi ⋅ Zhongying Liu ⋅ Fuxiang Zhai ⋅ Junfeng Luo ⋅ Xiaoming Wei ⋅ Lujia Wang ⋅ Lei Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 56
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling
Yuran Wang ⋅ Bohan Zeng ⋅ Chengzhuo Tong ⋅ Wenxuan Liu ⋅ Yang Shi ⋅ Xiaochen Ma ⋅ Hao Liang ⋅ Yuanxing Zhang ⋅ Wentao Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 57
SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation
Ryosuke Matsuda ⋅ Keito Kudo ⋅ Haruto Yoshida ⋅ Nobuyuki Shimizu ⋅ Jun Suzuki
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 58
PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and VLM-Guided Optimization
Mingzhe Li ⋅ Renhao 'Norman' Zhang ⋅ Zhiyang Wen ⋅ Siqi Pan ⋅ Bruno da Silva ⋅ Juan Zhai ⋅ Shiqing Ma
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 59
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Guangzhao Li ⋅ Yanming Yang ⋅ Chenxi Song ⋅ Xiaohong Liu ⋅ Chi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 60
Self-Evaluation Unlocks Any-Step Text-to-Image Generation
Xin Yu ⋅ Xiaojuan Qi ⋅ Zhengqi Li ⋅ Kai Zhang ⋅ Richard Zhang ⋅ Zhe Lin ⋅ Eli Shechtman ⋅ Tianyu Wang ⋅ Yotam Nitzan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 61
Say Cheese! Detail-Preserving Portrait Collection Generation via Natural Language Edits
Zelong Sun ⋅ Jiahui Wu ⋅ Ying Ba ⋅ Dong Jing ⋅ Zhiwu Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 62
LVLM-Aided Alignment of Task-Specific Vision Models
Alexander Koebler ⋅ Lukas Kuhn ⋅ Ingo Thon ⋅ Florian Buettner
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 63
DeepAlign: Mitigating Modality Conflict through Modality-Specific Alignment
Shuo Li ⋅ Bingchen Miao ⋅ Wendong Bu ⋅ Juncheng Li ⋅ Hanwang Zhang ⋅ Fei Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 64
PG-VTON: Single-Pass Training-Free Virtual Try-On via Patch-Guided Reference Alignment
Guohao Zhao ⋅ Yuxin Peng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 65
Linguistic Priors for Visual Decoupling: Towards Symmetric Vision-Brain Alignment
Dongjun Liu ⋅ Weichen Dai ⋅ Jingsheng Qian ⋅ Honggang Liu ⋅ Hangjie Yi ⋅ Wanzeng Kong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 66
Scaling Spatial Intelligence with Multimodal Foundation Models
Zhongang Cai ⋅ Wang Ruisi ⋅ Chenyang Gu ⋅ Fanyi Pu ⋅ Junxiang Xu ⋅ YUBO WANG ⋅ Wanqi Yin ⋅ Zhitao Yang ⋅ Chen Wei ⋅ Tongxi Zhou ⋅ Qingping SUN ⋅ Hui En Pang ⋅ Jiaqi Li ⋅ Oscar Qian ⋅ Zhiqian Lin ⋅ Xuanke Shi ⋅ Kewang Deng ⋅ Xiaoyang Han ⋅ Zukai Chen ⋅ Xiangyu Fan ⋅ Hanming Deng ⋅ Lewei Lu ⋅ Liang Pan ⋅ Bo Li ⋅ Ziwei Liu ⋅ Quan Wang ⋅ Dahua Lin ⋅ Lei Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 67
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Qi Yang ⋅ Bolin Ni ⋅ Shiming Xiang ⋅ Houwen Peng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 68
SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
Xuankun Rong ⋅ Wenke Huang ⋅ Tingfeng Wang ⋅ Daiguo Zhou ⋅ Bo Du ⋅ Mang Ye
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 69
AVATAR: Reinforcement Learning to See, Hear, and Reason Over Video
Yogesh Kulkarni ⋅ Pooyan Fazli
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 70
CogniVerse: Revolutionizing Multi-Modal Retrieval-Augmented Generation with Cognitive Reflection and Geometric Reasoning
Xiang Fang ⋅ Wanlong Fang ⋅ Changshuo Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 71
FOZO: Forward-Only Zeroth-Order Prompt Optimization for Test-Time Adaptation
Xingyu Wang ⋅ Tao Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 72
Language Does Matter for Cross-Domain Few-Shot Visual Feature Enhancement
Fei Zhou ⋅ Xiwen Zhang ⋅ Qingqing Qiu ⋅ Lei Zhang ⋅ Wei Wei ⋅ Chen Ding ⋅ Yi Zhang ⋅ Liang Li ⋅ Xiangyu Yue ⋅ Yanning Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 73
Back to Source: Open-Set Continual Test-Time Adaptation via Domain Compensation
Yingkai Yang ⋅ Chaoqi Chen ⋅ Hui Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 74
Bridging Domain Expertise and Generalization for Performance Estimation
Shuxuan Li ⋅ Zhilin Zhao ⋅ Quyu Kong ⋅ Wei-Shi Zheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 75
Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition
Minxue Tang ⋅ Yangyang Yu ⋅ Aolin Ding ⋅ MAZIYAR BARAN POUYAN ⋅ Taha Belkhouja ⋅ Yujia Bao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 76
Bridging Domains through Subspace-Aware Model Merging
Levy Chaves ⋅ Chao Zhou ⋅ Rebekka Burkholz ⋅ Eduardo Valle ⋅ Sandra Avila
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 77
DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection
Haochen Li ⋅ Rui Zhang ⋅ Hantao Yao ⋅ Xin Zhang ⋅ Yifan Hao ⋅ Shaohui Peng ⋅ Yongwei Zhao ⋅ Ling Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 78
Scaling Dense Event-Stream Pretraining from Visual Foundation Models
Zhiwen Chen ⋅ Junhui Hou ⋅ Zhiyu Zhu ⋅ Jinjian Wu ⋅ Guangming Shi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 79
Event Stream Filtering via Probability Flux Estimation
Jinze Chen ⋅ Wei Zhai ⋅ Yang Cao ⋅ Bin Li ⋅ Zheng-Jun Zha
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 80
AIMDepth: Asymmetric Image-Event Mamba for Monocular Depth Estimation
Luoxi Jing ⋅ Dianxi Shi ⋅ YuShe Cao ⋅ Yuanze Wang ⋅ Junze Zhang ⋅ Yuning Cui ⋅ Mengzhu Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 81
Time-Specialized Event-Image Alignment for Blur-to-Video Decomposition
Zhijing Sun ⋅ Senyan Xu ⋅ Ruixuan Jiang ⋅ Kean Liu ⋅ Runze Tian ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 82
eRetinexGS: Retinex Modeling for Low-Light Scene Enhancement via Event Streams and 3D Gaussian Splatting
Haojie Yan ⋅ Zehao Chen ⋅ Yan Liu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 83
Unsupervised 3D Motion Estimation Using Event Camera
Han Han ⋅ Wei Zhai ⋅ Tiesong Zhao ⋅ Bin Li ⋅ Yang Cao ⋅ Zheng-Jun Zha
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 84
Goal-Driven Reward by Video Diffusion Models for Reinforcement Learning
Qi Wang ⋅ Mian Wu ⋅ Yuyang Zhang ⋅ Mingqi Yuan ⋅ Wenyao Zhang ⋅ Haoxiang You ⋅ Yunbo Wang ⋅ Xin Jin ⋅ Xiaokang Yang ⋅ Wenjun Zeng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 85
ModularAgent: A Task-Aware Modular Framework for Joint Optimization of Multimodal Large Language Models and World Models
Yu-Wei Zhan ⋅ Xin Wang ⋅ Pengzhe Mao ⋅ Tongtong Feng ⋅ Ren Wang ⋅ Wenwu Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 86
AstraNav-Memory: Contexts Compression for Long Memory
Junjun Hu ⋅ Xinda Xue ⋅ Botao Ren ⋅ Minghua Luo ⋅ Jintao Chen ⋅ Haochen Bai ⋅ Liangliang You ⋅ Mu Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 87
Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models
Zehua Zang ⋅ Xi Wang ⋅ Fuchun Sun ⋅ Xiao Xu ⋅ Lixiang Liu ⋅ Jiahuan Zhou ⋅ Jiangmeng Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 88
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
Tatiana Zemskova ⋅ Aleksei Staroverov ⋅ Dmitry Yudin ⋅ Aleksandr Panov
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 89
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands
Siyuan Hu ⋅ Kevin Qinghong Lin ⋅ Mike Zheng Shou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 90
ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation
Zhenyang Liu ⋅ Yongchong Gu ⋅ Yikai Wang ⋅ Xiangyang Xue ⋅ Yanwei Fu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 91
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models
Linqing Zhong ⋅ Yi Liu ⋅ Yifei Wei ⋅ Ziyu Xiong ⋅ Si Liu ⋅ Guangrui Ren
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 92
BridgeEQA: Virtual Embodied Agents for Real Bridge Inspections
Subin Varghese ⋅ Joshua Gao ⋅ Asad Ur Rahman ⋅ Vedhus Hoskere
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 93
SyncMos: Scalable Motion Synchronisation for Multi-Agent Scene Interaction
Lingxiao Li ⋅ Dongwon Kim ⋅ Lingyan Ruan ⋅ Taesoo Kwon ⋅ Bin Chen ⋅ Taehyun Rhee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 94
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model
Dongwon Kim ⋅ Gawon Seo ⋅ Jinsung Lee ⋅ Minsu Cho ⋅ Suha Kwak
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 95
Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization
Tsai-Shien Chen ⋅ Aliaksandr Siarohin ⋅ Gordon Guocheng Qian ⋅ Kuan-Chieh Jackson Wang ⋅ Egor Nemchinov ⋅ Moayed Haji Ali ⋅ Riza Alp Guler ⋅ Willi Menapace ⋅ Ivan Skorokhodov ⋅ Anil Kag ⋅ Jun-Yan Zhu ⋅ Sergey Tulyakov
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 96
IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting
Tao Zhang ⋅ Yuyang Hong ⋅ Yang Xia ⋅ Kun Ding ⋅ Zeyu Zhang ⋅ Ying Wang ⋅ Shiming Xiang ⋅ Chunhong Pan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 97
InstantRetouch: Efficient and High-Fidelity Instruction-Guided Image Retouching with Bilateral Space
Jiarui Wu ⋅ Yujin Wang ⋅ Ruikang Li ⋅ Fan Zhang ⋅ Mingde Yao ⋅ Tianfan Xue
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 98
MICON-Bench: Benchmarking and Enhancing Multi-Image Context Image Generation in Unified Multimodal Models
Mingrui Wu ⋅ Hang Liu ⋅ Jiayi Ji ⋅ Xiaoshuai Sun ⋅ Rongrong Ji
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 99
The Devil is in Attention Sharing: Improving Complex Non-rigid Image Editing Faithfulness via Attention Synergy
Zhuo Chen ⋅ Fanyue Wei ⋅ Runze Xu ⋅ Jingjing Li ⋅ Lixin Duan ⋅ Angela Yao ⋅ Wen Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 100
ShreddingNet: Coarse-to-Fine Restoration for Multi-Source Shredded Manuscripts
Haoyang Cui ⋅ Hao Jiang ⋅ Yadong Mu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 101
Image Guides Images: Consistent Video Amodal Completion with Rectified In-Context Exemplar Guidance
Xiaoyu Kong ⋅ Ketong Ren ⋅ Dongyu She ⋅ Weiming Dong ⋅ Miao Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 102
Radiance Meshes for Volumetric Reconstruction
Alexander Mai ⋅ Trevor Hedstrom ⋅ George Kopanas ⋅ Janne Kontkanen ⋅ Falko Kuester ⋅ Jonathan T. Barron
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 103
Aesthetic Camera Viewpoint Suggestion with 3D Aesthetic Field
Sheyang Tang ⋅ Armin Shafiee Sarvestani ⋅ Jialu Xu ⋅ Xiaoyu Xu ⋅ Zhou Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 104
CoRoGS: Contextual Gaussian Splatting for Robust Large-Deviation View Synthesis
Xin Ma ⋅ Peng Lu ⋅ Yisong Chen ⋅ Chengwei Pan ⋅ Sheng Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 105
ChronoGS: Disentangling Invariants and Changes in Multi-Period Scenes
Zhongtao Wang ⋅ Jiaqi Dai ⋅ Qingtian Zhu ⋅ Yilong Li ⋅ Mai Su ⋅ Fei Zhu ⋅ Meng GAI ⋅ Shaorong Wang ⋅ Chengwei Pan ⋅ Yisong Chen ⋅ Guoping Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 106
Real-Time Dynamic Scene Rendering with Controlled Compressibility and Contact Awareness
Boya Shi ⋅ Naiyang Guan ⋅ Xiaodong Yi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 107
Splatent: Splatting Diffusion Latents for Novel View Synthesis
Or Hirschorn ⋅ Omer Sela ⋅ Inbar Huberman-Spiegelglas ⋅ Netalee Efrat Sela ⋅ Eli Alshan ⋅ Ianir Ideses ⋅ Frederic Devernay ⋅ Yochai Zvik ⋅ Lior Fritz
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 108
ParticleGS: Learning Neural Gaussian Particle Dynamics from Videos for Prior-free Physical Motion Extrapolation
Jinsheng Quan ⋅ Qiaowei Miao ⋅ Yichao Xu ⋅ Zizhuo Lin ⋅ Ying Li ⋅ Wei Yang ⋅ Zhihui Li ⋅ Yawei Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 109
Dynamic-Static Decomposition for Novel View Synthesis of Dynamic Scenes with Spiking Neurons
Lingyun Dai ⋅ Zehao Chen ⋅ Yan Liu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 110
DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification
Kenji Tojo ⋅ Bernd Bickel ⋅ Nobuyuki Umetani
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 111
Gyro-based Deep Video Deblurring
Jaesung Rim ⋅ Woohyeok Kim ⋅ Haeyun Lee ⋅ Heemin Yang ⋅ Ke Wang ⋅ Sunghyun Cho
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 112
Residual Diffusion Bridge Model for Image Restoration
Hebaixu Wang ⋅ Jing Zhang ⋅ Haoyang Chen ⋅ Haonan Guo ⋅ Di Wang ⋅ Jiayi Ma ⋅ Bo Du
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 113
MMDIR: Multimodal Instruction-Driven Framework for Mixed-Degradation Document Image Restoration
Heng Li ⋅ Xingyuan Wang ⋅ Yang Fan ⋅ Yunan Zhang ⋅ Xiangping Wu ⋅ Qingcai Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 114
Rectifying Latent Space for Generative Single-Image Reflection Removal
Mingjia Li ⋅ Jin Hu ⋅ Hainuo Wang ⋅ Qiming Hu ⋅ Jiarui Wang ⋅ Xiaojie Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 115
Towards Generalized Multimodal Homography Estimation
Jinkun You ⋅ Jiaxin Cheng ⋅ Jie Zhang ⋅ Yicong Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 116
Edit-aware RAW reconstruction
Abhijith Punnappurath ⋅ Luxi Zhao ⋅ Ke Zhao ⋅ Hue Nguyen ⋅ Radek Grzeszczuk ⋅ Michael S. Brown
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 117
Face2Scene: Using Facial Degradation as an Oracle for Diffusion-Based Scene Restoration
Amirhossein Kazerouni ⋅ Maitreya Suin ⋅ Tristan T Aumentado-Armstrong ⋅ Sina Honari ⋅ Amanpreet Walia ⋅ Iqbal Mohomed ⋅ Kosta Derpanis ⋅ Babak TAATI ⋅ Alex Levinshtein
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 118
HG-Lane: High-Fidelity Generation of Lane Scenes under Adverse Weather and Lighting Conditions without Re-annotation
Daichao Zhao ⋅ Qiupu Chen ⋅ Feng He ⋅ Xin Ning ⋅ Qiankun Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 119
NanoSD: Edge Efficient Foundation Model for Real Time Image Restoration
Subhajit Sanyal ⋅ Srinivas Soumitri Miriyala ⋅ Akshay Janardan Bankar ⋅ Manjunath Arveti ⋅ Sowmya Vajrala ⋅ Shreyas Pandith ⋅ Sravanth Kodavanti ⋅ Abhishek Ameta ⋅ Harshit Harshit ⋅ Amit Unde
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 120
MR. Illuminate: Zero-Shot Low-Light Image Enhancement with Diffusion Prior
Joshua Cho ⋅ Sara Aghajanzadeh ⋅ Zhen Zhu ⋅ David Forsyth
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 121
FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model
Xiang Chen ⋅ Jinshan Pan ⋅ Jiangxin Dong ⋅ Jian Yang ⋅ Jinhui Tang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 122
SPEGC: Continual Test-Time Adaptation via Semantic-Prompt-Enhanced Graph Clustering for Medical Image Segmentation
Xiaogang Du ⋅ Jiawei Zhang ⋅ Tongfei Liu ⋅ Tao Lei ⋅ Yingbo Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 123
BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation
Rachit Saluja ⋅ Asli Cihangir ⋅ Ruining Deng ⋅ Johannes C. Paetzold ⋅ Fengbei Liu ⋅ Mert Sabuncu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 124
Divide, Conquer, and Aggregate: Asymmetric Experts for Class-Imbalanced Semi-Supervised Medical Image Segmentation
Yajun Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 125
CROWn: A Unified Framework for Anti‑Aliased Downsampling and Phase‑Calibrated Fusion in 3D Medical Segmentation
Xingru Huang ⋅ Shuanghua Ye ⋅ Zhao Huang ⋅ Wenwen Tang ⋅ Huiyu Zhou ⋅ Zhiwen Zheng ⋅ Jin Liu ⋅ Xiaoshuai Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 126
Rethinking Box Supervision: Bias-Free Weakly Supervised Medical Segmentation
Jun Wei ⋅ Hui Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 127
Semi-supervised Echocardiography Video Segmentation via Anchor Semantic Awareness and Continuous Pseudo-label Reforging
Yunpeng Fang ⋅ Yimu Sun ⋅ Jingxing Guo ⋅ Huisi Wu ⋅ Jing Qin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 128
TANGO: Learning Distribution-wise Foundation Prior Consistency and Instance-wise Style Calibration for Medical Image Generalization
Chuang Liu ⋅ Yichao Cao ⋅ Xiu Su ⋅ Haogang Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 129
MambaLiteUNet: Cross-Gated Adaptive Feature Fusion for Robust Skin Lesion Segmentation
Md Maklachur Rahman ⋅ Soon Ki Jung ⋅ Tracy Hammond
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 130
Breaking Multimodal LLM Safety via Video-Driven Prompting
Dong Wang ⋅ XIANGYU HE ⋅ Xinqi Lyu ⋅ Bin Xiao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 131
When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters
Liangwei Lyu ⋅ Jiaqi Xu ⋅ Jianwei Ding ⋅ Qiyao Deng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 132
RecoverMark: Robust Watermarking for Localization and Recovery of Manipulated Faces
Haonan An ⋅ Xiaohui Ye ⋅ Guang Hua ⋅ Yihang Tao ⋅ Hangcheng Cao ⋅ Xiangyu Yu ⋅ Yuguang Fang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 133
A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models
Mujtaba Hussain Mirza ⋅ Antonio D’Orazio ⋅ Odelia Melamed ⋅ Iacopo Masi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 134
FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction
Runqi Lin ⋅ Alasdair Paren ⋅ Suqin Yuan ⋅ Muyang Li ⋅ Philip H.S. Torr ⋅ Adel Bibi ⋅ Tongliang Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 135
PureProof: Diffusion-Resistant Black-box Targeted Attack on Large Vision-Language Models
Yiming CAO ⋅ Dong Wang ⋅ Xinqi Lyu ⋅ Bin Xiao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 136
UniDef: Universal Defense Against Unauthorized Image Manipulation
Mingwen Shao ⋅ Lingzhuang Meng ⋅ Xiang Lv ⋅ Mengyao Wu ⋅ Xinyuan Chen ⋅ Qiao Zhang ⋅ Chang Liu ⋅ Yuanjian Qiao ⋅ Chao Dong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 137
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
Tianyi Xiong ⋅ Yi Ge ⋅ Ming Li ⋅ Zuolong Zhang ⋅ Pranav Kulkarni ⋅ Kaishen Wang ⋅ Qi He ⋅ Zeying Zhu ⋅ Chenxi Liu ⋅ Ruibo Chen ⋅ Tong Zheng ⋅ Yanshuo Chen ⋅ Xiyao Wang ⋅ Ray Zhang ⋅ Wenhu Chen ⋅ Heng Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 138
MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals
Junyu Shen ⋅ Zhendong She ⋅ Chenghanyu Zhang ⋅ Yuchuang Sun ⋅ Luqing Luo ⋅ Dingwei Tan ⋅ Zonghao Guo ⋅ Bo Guo ⋅ Zehua Han ⋅ Wupeng Xie ⋅ Yaxin Mu ⋅ Peng Zhang ⋅ Peipei Li ⋅ Fengxiang Wang ⋅ Yangang Sun ⋅ Maosong Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 139
Rethinking Cross-Modal Anchor Alignment for Mitigating Error Accumulation
Bin Liu ⋅ Wei Sun ⋅ Qianqian Wang ⋅ Wei Feng ⋅ Yijie Chen ⋅ Haixi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 140
SOUPLE: Enhancing Audio-Visual Localization and Segmentation with Learnable Prompt Contexts
Khanh Binh Nguyen ⋅ Chae Jung Park
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 141
Omni-MMSI: Toward Identity-attributed Social Interaction Understanding
Xinpeng Li ⋅ Bolin Lai ⋅ Hardy Chen ⋅ Shijian Deng ⋅ Cihang Xie ⋅ Yuyin Zhou ⋅ James M. ⋅ Yapeng Tian
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 142
Inconsistency-aware Multimodal Schrödinger Bridge for Deepfake Localization
Jiayu Xiong ⋅ Jing Wang ⋅ Qi Zhang ⋅ Wanlong Wang ⋅ Jun Xue
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 143
MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models
lulu hu ⋅ Xiao Wenhu ⋅ Chen Xin ⋅ Xinhua Xu ⋅ Bowen Xu ⋅ Kun Li ⋅ Yongliang Tao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 144
Seeing Through Touch: Tactile-Driven Visual Localization of Material Regions
Seongyu Kim ⋅ Seungwoo Lee ⋅ Hyeonggon Ryu ⋅ Joon Chung ⋅ Arda Senocak
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 145
Seeing What Matters: A Training-Free Self-Guided Framework for Multimodal Detail Perception and Reasoning
Mingjie Ma ⋅ yichao ma ⋅ Zhong Yang ⋅ Guohui Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 146
Illuminating Visual Identity in Universal Multimodal Embeddings
Jiawei Cao ⋅ Junyi Feng ⋅ Jiashen Hua ⋅ Ziheng Huang ⋅ Bing Deng ⋅ Kaijie Wu ⋅ Chaochen Gu ⋅ Jieping Ye
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 147
Anti-Degradation Lifelong Multi-View Clustering
Xingfeng Li ⋅ Hao Pan ⋅ Honglin Yuan ⋅ Yuan Sun ⋅ Xujian Zhao ⋅ Jiaqi Lin ⋅ Zhenwen Ren
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 148
The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts
Yuchen Zhang ⋅ Yaxiong Wang ⋅ Yujiao Wu ⋅ Lianwei Wu ⋅ Li Zhu ⋅ Zhedong Zheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 149
Efficient and High-Fidelity Omni Modality Retrieval
Chuong Huynh ⋅ Manh Luong ⋅ Abhinav Shrivastava
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 150
Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs
Angela van Sprang ⋅ Laurens Samson ⋅ Ana Lucic ⋅ Erman Acar ⋅ Sennay Ghebreab ⋅ Yuki M Asano
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 151
Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis
Chunlei Meng ⋅ Jiabin Luo ⋅ Zhenglin Yan ⋅ Zhenyu Yu ⋅ Rong Fu ⋅ Zhongxue Gan ⋅ Chun Ouyang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 152
HAVE-Bench: Hierarchical Audio-Visual Evaluation from Perception to Interaction
Zhong Muyan ⋅ Erfei Cui ⋅ Sen Xing ⋅ Weiyun Wang ⋅ Wen Wu ⋅ Yuchen Hu ⋅ Yanting Zhang ⋅ Xiaowei Hu ⋅ Wenhai Wang ⋅ Chao Zhang ⋅ Jifeng Dai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 153
Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models
Enguang Wang ⋅ Qiang Wang ⋅ Yuanchen Wu ⋅ Ke Yan ⋅ Xinbin Yuan ⋅ Shouhong Ding ⋅ Xialei Liu ⋅ Mingming Cheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 154
The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment
Stefanos Koutoupis ⋅ Michaela Areti Zervou ⋅ Konstantinos Kontras ⋅ Maarten De Vos ⋅ Panagiotis Tsakalides ⋅ Grigorios Tsagkatakis
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 155
CineSRD: Leveraging Visual, Acoustic, and Linguistic Cues for Open-World Visual Media Speaker Diarization
Liangbin Huang ⋅ Xiaohua Liao ⋅ Chaoqun Cui ⋅ Shijing Wang ⋅ Zhaolong Huang ⋅ Yanlong Du ⋅ Wenji Mao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 156
HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance
Green Rosh ⋅ Prateek Kukreja ⋅ Vishakha SR ⋅ Pawan Prasad B H
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 157
UST-Hand: An Uncertainty-aware Spatiotemporal Point Cloud Interaction Network for 3D Self-supervised Hand Pose Estimation
Tianhao Han ⋅ HaoYang ZHANG ⋅ Liang Xie ⋅ Haochen Chang ⋅ Kun Gao ⋅ Yuan Cheng ⋅ Pengfei Ren ⋅ Erwei Yin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 158
ForeHOI: Feed-forward 3D Object Reconstruction from Daily Hand-Object Interaction Videos
Yuantao Chen ⋅ Jiahao Chang ⋅ Chongjie Ye ⋅ Chaoran Zhang ⋅ Zhaojie Fang ⋅ Chenghong Li ⋅ Xiaoguang Han
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 159
Hoi! - A Multimodal Dataset for Force-Grounded, Cross-View Articulated Manipulation
Tim Engelbracht ⋅ René Zurbrügg ⋅ Matteo Wohlrapp ⋅ Martin Büchner ⋅ Abhinav Valada ⋅ Marc Pollefeys ⋅ Hermann Blum ⋅ Zuria Bauer
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 160
Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands Modulator
Gyeongsik Moon
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 161
TouchDream: 3D Object Completion through Imagined Touch
Yuanbo Wang ⋅ Xinning Wang ⋅ Zhaoxuan Zhang ⋅ Changlong Wang ⋅ qianchen xia ⋅ Xiaopeng Wei ⋅ Xin Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 162
ForceVLA2: Unleashing Hybrid Force-Position Control with Force Awareness for Contact-Rich Manipulation
Yang Li ⋅ Zhaxizhuoma ⋅ Hongru Jiang ⋅ Junjie Xia ⋅ Hongquan Zhang ⋅ Jinda Du ⋅ Yunsong Zhou ⋅ Jia Zeng ⋅ Ce Hao ⋅ Jieji Ren ⋅ Qiaojun Yu ⋅ Cewu Lu ⋅ Yu Qiao ⋅ Jiangmiao Pang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 163
TokenHand: Discrete Token Representation for Efficient Hand Mesh Reconstruction
Xinguo He ⋅ Yixin Shen ⋅ Rahul Chaudhari
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 164
Artiverse: A Diverse and Physically Grounded Dataset for Articulated Objects
Denys Iliash ⋅ Jiayi Liu ⋅ Egor Fokin ⋅ Qirui Wu ⋅ Ali Mahdavi Amiri ⋅ Manolis Savva ⋅ Angel Xuan Chang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 165
MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis
Di Luo ⋅ Shuhui Yang ⋅ Mingxin Yang ⋅ Jiawei Lu ⋅ Yixuan Tang ⋅ Xintong Han ⋅ Zhuo Chen ⋅ Beibei Wang ⋅ Chunchao Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 166
LogCD: Local-to-global Consistency Distillation for Few-step Image Generation
Qingsong Xie ⋅ Zhenyi Liao ⋅ Chen Chen ⋅ Zhijie Deng ⋅ Haonan Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 167
EditCtrl: Disentangled Local and Global Control for Real-Time Generative Video Editing
Yehonathan Litman ⋅ Shikun Liu ⋅ Dario Seyb ⋅ Nicholas Milef ⋅ Yang Zhou ⋅ Carl Marshall ⋅ Shubham Tulsiani ⋅ Caleb Leak
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 168
Anchoring and Rescaling Attention for Semantically Coherent Inbetweening
Tae Eun Choi ⋅ Sumin Shim ⋅ Junhyeok Kim ⋅ Seong Jae Hwang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 169
FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance
Quanhao Li ⋅ Zhen Xing ⋅ Rui Wang ⋅ Haidong Cao ⋅ Qi Dai ⋅ Daoguo Dong ⋅ Zuxuan Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 170
LightMover: Generative Light Movement with Color and Intensity Controls
Gengze Zhou ⋅ Tianyu Wang ⋅ Soo Ye Kim ⋅ ZHIXIN SHU ⋅ Xin Yu ⋅ Yannick Hold-Geoffroy ⋅ Sumit Chaturvedi ⋅ Qi Wu ⋅ Zhe Lin ⋅ Scott Cohen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 171
Parallel Jacobi Decoding for Fast Autoregressive Image Generation
Boya Liao ⋅ Ying Li ⋅ Siyong Jian ⋅ Huan Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 172
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing
Yucheng Wang ⋅ Zedong Wang ⋅ Yuetong Wu ⋅ Yue Ma ⋅ Dan Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 173
CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions
Chonghuinan Wang ⋅ Zihan Chen ⋅ Yuxiang Wei ⋅ Tianyi Jiang ⋅ Xiaohe Wu ⋅ Fan Li ⋅ Wangmeng Zuo ⋅ Hongxun Yao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 174
EchoVDiff: Cardiac-Cycle Echocardiography Video Generation from Arbitrary Frame
Jiansong Zhang ⋅ Xiaying Yang ⋅ Xiaoling Luo ⋅ Linlin Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 175
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing
Runze He ⋅ YIJI CHENG ⋅ Tiankai Hang ⋅ Zhimin Li ⋅ Yu Xu ⋅ Zijin Yin ⋅ Shiyi Zhang ⋅ Wenxun Dai ⋅ Penghui Du ⋅ Ao Ma ⋅ Chunyu Wang ⋅ qinglin lu ⋅ Jizhong Han ⋅ Jiao Dai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 176
ChimeraLoRA: Multi-Head LoRA-Guided Synthetic Datasets
Hoyoung Kim ⋅ Minwoo Jang ⋅ Jabin Koo ⋅ Sangdoo Yun ⋅ Jungseul Ok
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 177
Frequency-Aware Flow Matching for High-Quality Image Generation
Sucheng Ren ⋅ Qihang Yu ⋅ Ju He ⋅ Xiaohui Shen ⋅ Liang-Chieh Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 178
STARFlow-V: End-to-End Video Generative Modeling with Autoregressive Normalizing Flows
Jiatao Gu ⋅ Ying Shen ⋅ Tianrong Chen ⋅ Laurent Dinh ⋅ Yuyang Wang ⋅ Miguel Ángel Bautista ⋅ David Berthelot ⋅ Joshua Susskind ⋅ Shuangfei Zhai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 179
MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture
Hui Li ⋅ Jiayue Lyu ⋅ Fu-Yun Wang ⋅ Kaihui Cheng ⋅ Siyu Zhu ⋅ Jingdong Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 180
Improving Controllable Generation: Faster Training and Better Performance via x0-Supervision
Amadou S. SANGARE ⋅ Adrien Maglo ⋅ Mohamed Chaouch ⋅ Bertrand Luvison
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 181
Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models
Zixuan Ye ⋅ Quande Liu ⋅ Cong Wei ⋅ Yuanxing Zhang ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Kun Gai ⋅ Wenhan Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 182
OrionEdit: Bridging Reference and Source Images for Generalized Cross-Image Editing
Zeyu Jiang ⋅ Lai-Man Po ⋅ XUYUAN XU ⋅ Yexin Wang ⋅ Guoping Gong ⋅ Haoxuan Wu ⋅ Chenbo Yan ⋅ Kun Li ⋅ Yuyang Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 183
PositionIC: Unified Position and Identity Consistency for Image Customization
Junjie Hu ⋅ Tianyang Han ⋅ Kai Ma ⋅ Jialin Gao ⋅ Yang Song ⋅ Xianhua He ⋅ Junfeng Luo ⋅ Xiaoming Wei ⋅ Wenqiang Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 184
P-Flow: Prompting Visual Effects Generation
Rui Zhao ⋅ Mike Zheng Shou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 185
Clair Obscur: an Illumination-Aware Method for Real-World Image Vectorization
Xingyue Lin ⋅ Shuai Peng ⋅ Xiangyu Xie ⋅ Jianhua Zhu ⋅ Yuxuan Zhou ⋅ Liangcai Gao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 186
SURF: Signature-Retained Fast Video Generation
Kaixin Ding ⋅ Xi Chen ⋅ Sihui Ji ⋅ Yuan Gao ⋅ Liang Hou ⋅ Xin Tao ⋅ Hengshuang Zhao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 187
The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection
Qingdong He ⋅ Xueqin Chen ⋅ Yanjie Pan ⋅ Peng Tang ⋅ Pengcheng Xu ⋅ Zhenye Gan ⋅ Chengjie Wang ⋅ Xiaobin Hu ⋅ Jiangning Zhang ⋅ Yabiao Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 188
Lynx: Towards High-Fidelity Personalized Video Generation
Shen Sang ⋅ Tiancheng Zhi ⋅ Tianpei Gu ⋅ Jing Liu ⋅ Linjie Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 189
VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis
Meng Chu ⋅ Senqiao Yang ⋅ Haoxuan Che ⋅ Suiyun Zhang ⋅ Xichen Zhang ⋅ Shaozuo Yu ⋅ Haokun GUI ⋅ Zhefan Rao ⋅ Dandan Tu ⋅ Rui Liu ⋅ Jiaya Jia
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 190
ClusterMark: Towards Robust Watermarking for Autoregressive Image Generators with Visual Token Clustering
Denis Lukovnikov ⋅ Andreas Müller ⋅ Erwin Quiring ⋅ Asja Fischer
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 191
Stable Mean Flow: Lyapunov-Inspired One-Step Flow Matching
Guangxun Zhang ⋅ Mason Haberle ⋅ Davi Geiger
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 192
OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation
Sanghyeon Lee ⋅ Minwoo Lee ⋅ Euijin Shin ⋅ Kangyeol Kim ⋅ Seunghwan Choi ⋅ Jaegul Choo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 193
First Frame Is the Place to Go for Video Content Customization
Jingxi Chen ⋅ Zongxia Li ⋅ Zhichao Liu ⋅ Guangyao Shi ⋅ Xiyang Wu ⋅ Fuxiao Liu ⋅ Cornelia Fermuller ⋅ Brandon Y. Feng ⋅ Yiannis Aloimonos
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 194
Scaling Zero-Shot Reference-to-Video Generation
Zijian Zhou ⋅ Shikun Liu ⋅ Haozhe Liu ⋅ Haonan Qiu ⋅ Zhaochong An ⋅ Weiming Ren ⋅ Zhiheng Liu ⋅ Xiaoke Huang ⋅ Kam-Woh Ng ⋅ Tian Xie ⋅ Xiao Han ⋅ Yuren Cong ⋅ Hang Li ⋅ Chuyan Zhu ⋅ Aditya Patel ⋅ Tao Xiang ⋅ Sen He
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 195
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
Yixin Wan ⋅ Lei Ke ⋅ Wenhao Yu ⋅ Kai-Wei Chang ⋅ Dong Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 196
VDOT: Efficient Unified Video Creation via Optimal Transport Distillation
Yutong Wang ⋅ Haiyu Zhang ⋅ Tianfan Xue ⋅ Yu Qiao ⋅ Yaohui Wang ⋅ Chang Xu ⋅ Xinyuan Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 197
Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs
Sicheng Xu ⋅ Yu Deng ⋅ Shoukang Hu ⋅ Yichuan Wang ⋅ Yizhong Zhang ⋅ Zhan Chen ⋅ Jiaolong Yang ⋅ Baining Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 198
RunawayEvil: Jailbreaking the Image-to-Video Generative Models
yueming lyu ⋅ Rufan Qian ⋅ Yueming Lyu ⋅ Qinglong Liu ⋅ Linzhuang Zou ⋅ Jie Qin ⋅ Songhua Liu ⋅ Caifeng Shan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 199
MultiAnimate: Pose-Guided Image Animation Made Extensible
Yingcheng Hu ⋅ Haowen Gong ⋅ Chuanguang Yang ⋅ Zhulin An ⋅ Yongjun Xu ⋅ Songhua Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 200
Translating Signals to Languages for sEMG-Based Activity Recognition
Ming Wang ⋅ Haoxuan Qu ⋅ Qiuhong Ke ⋅ Wei Zhou ⋅ Hossein Rahmani ⋅ Jun Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 201
Open the Motion Door: Atomic Motion Decomposition and Recomposition for Open-Vocabulary Motion Generation
Ke Fan ⋅ Jiangning Zhang ⋅ Ran Yi ⋅ Jingyu Gong ⋅ Yabiao Wang ⋅ yating wang ⋅ Xin Tan ⋅ Chengjie Wang ⋅ Lizhuang Ma
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 202
Multi-level Causal LLM-based Text-to-Motion Generation with Human Alignment
Chen Xiaodong ⋅ Qian Bao ⋅ Xudong Liu ⋅ Jianping Fang ⋅ Jintao Fang ⋅ Yongdong Zhang ⋅ Tao Mei ⋅ Wu Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 203
MotionHiFlow: Text-to-Motion via Hierarchical Flow Matching
Heng Li ⋅ Xiaotong Lin ⋅ Ling-An Zeng ⋅ Yulei Kang ⋅ Shuai Li ⋅ Jian-Fang Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 204
LaMoGen: Language to Motion Generation Through LLM-Guided Symbolic Inference
Junkun JIANG ⋅ Ho Yin Au ⋅ Jingyu Xiang ⋅ Jie Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 205
Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling
Euisoo Jung ⋅ Byunghyun Kim ⋅ Hyunjin Kim ⋅ Seonghye Cho ⋅ Jae-Gil Lee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 206
GVIS: Generative Vector Image Steganography
ZiHao Xu ⋅ Dawei xu ⋅ Zihan Li ⋅ Xixi Zheng ⋅ Chuan Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 207
MaxMark: High-Capacity Diffusion-Native Watermarking via Robust and Invertible Latent Embedding
Xuanhang Chang ⋅ Zhonghao Yang ⋅ Cheng Zhuo ⋅ YU LI
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 208
GeoRK2: Geometry-Guided Runge–Kutta Integration for Diffusion Transformer Acceleration
Chaoqun Sun ⋅ Zongjing Fu ⋅ Powei Chang ⋅ Jinpeng Zhang ⋅ JianXiang Xiang ⋅ Yukang Gao ⋅ Chenyu Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 209
Test-time Sparsity for Extreme Fast Action Diffusion
Kangye Ji ⋅ Yuan Meng ⋅ Jianbo Zhou ⋅ Ye Li ⋅ Chen Tang ⋅ Zhi Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 210
Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Yifan Zhou ⋅ Zeqi Xiao ⋅ Tianyi Wei ⋅ Shuai Yang ⋅ Xingang Pan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 211
A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation
Wentao Qu ⋅ Guofeng Mei ⋅ Yang Wu ⋅ Yongshun Gong ⋅ Xiaoshui Huang ⋅ Liang Xiao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 212
When Local Rules Create Global Order: Self-Organized Representation Learning for Latent Diffusion Models
Junrong Lian ⋅ Weijian Deng ⋅ Pengxu Wei ⋅ Yaqin Chen ⋅ Qixiang Ye ⋅ Liang Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 213
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization
Cailin Zhuang ⋅ Ailin Huang ⋅ Hu Yaoqi ⋅ Jingwei Wu ⋅ Wei Cheng ⋅ Jiaqi Liao ⋅ Hongyuan Wang ⋅ Xinyao Liao ⋅ Weiwei Cai ⋅ Hengyuan Xu ⋅ Xuanyang Zhang ⋅ Xianfang Zeng ⋅ Zhewei Huang ⋅ Gang Yu ⋅ Chi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 214
R4-CGQA: Retrieval-based Vision Language Models for Computer Graphics Image Quality Assessment
Zhuangzi Li ⋅ Jian Jin ⋅ Shilv Cai ⋅ Weisi Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 215
A³: Towards Advertising Aesthetic Assessment
Kaiyuan Ji ⋅ Yixuan Gao ⋅ Lu Sun ⋅ Yushuo Zheng ⋅ Zijian Chen ⋅ Jianbo Zhang ⋅ Xiangyang Zhu ⋅ Yuan Tian ⋅ Zicheng Zhang ⋅ Guangtao Zhai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 216
GraphVLM: Benchmarking Vision Language Models for Multimodal Graph Learning
Jiajin Liu ⋅ Dongzhe Fan ⋅ Chuanhao Ji ⋅ Daochen Zha ⋅ Qiaoyu Tan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 217
Phrase-Grounding-Aware Supervised Fine-Tuning for Chart Recognition via Side-Masked Attention
Koichiro Ito
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 218
VL-RouterBench: A Benchmark for Vision–Language Model Routing
Zhehao Huang ⋅ Baijiong Lin ⋅ Jingyuan Zhang ⋅ Jingying Wang ⋅ Yuhang Liu ⋅ Ning Lu ⋅ Tao Li ⋅ Xiaolin Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 219
CLIP Is Shortsighted: Paying Attention Beyond the First Sentence
Marc-Antoine Lavoie ⋅ Anas Mahmoud ⋅ Aldo Zaimi ⋅ Arsene Fansi Tchango ⋅ Steven L. Waslander
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 220
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
Wenbo hu ⋅ JINGLI LIN ⋅ Yilin Long ⋅ Yunlong Ran ⋅ Lihan Jiang ⋅ Yifan Wang ⋅ Chenming Zhu ⋅ Runsen Xu ⋅ Tai Wang ⋅ Jiangmiao Pang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 221
UZ3DVG: Unaided Zero-Shot 3D Visual Grounding with Generated Language Conditions
Wenbin Tan ⋅ Jiawen Lin ⋅ Yuan Xie ⋅ Yachao Zhang ⋅ Yanyun Qu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 222
LangField4D: Learning Identity-Adaptive and Spatio-Temporal Continuous 4D Language Fields for Dynamic Scenes
Yichao Xu ⋅ Qiaowei Miao ⋅ Jinsheng Quan ⋅ Wei Yang ⋅ Zhihui Li ⋅ Yawei Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 223
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning
Yuhong Liu ⋅ Beichen Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Long Xing ⋅ Xiaoyi Dong ⋅ Haodong Duan ⋅ Dahua Lin ⋅ Jiaqi Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 224
CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation
Mainak Singha ⋅ Sarthak Mehrotra ⋅ Paolo Casari ⋅ Subhasis Chaudhuri ⋅ Elisa Ricci ⋅ Biplab Banerjee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 225
GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning
Jiayin Sun ⋅ Caixia Sun ⋅ Boyu Yang ⋅ hailin li ⋅ Xiao Chen ⋅ Yi Zhang ⋅ Errui Ding ⋅ Liang Li ⋅ Chao Deng ⋅ Junlan Feng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 226
Keep it SymPL: Symbolic Projective Layout for Allocentric Spatial Reasoning in Vision-Language Models
Jaeyun Jang ⋅ Seunghui Shin ⋅ Taeho Park ⋅ Hyoseok Hwang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 227
Geometry-Guided 3D Visual Token Pruning for Video-Language Models
Han Li ⋅ Zehao Huang ⋅ Jiahui Fu ⋅ Naiyan Wang ⋅ Si Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 228
Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3D Spatial Reasoning for Instance Navigation
Won Shik Jang ⋅ Ue-Hwan Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 229
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Shengchao Zhou ⋅ Yuxin Chen ⋅ Yuying Ge ⋅ Wei Huang ⋅ Jiehong Lin ⋅ Ying Shan ⋅ Xiaojuan Qi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 230
PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning
Zekai Lin ⋅ Xu Zheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 231
Hilbert-Geo: Solving Solid Geometric Problems by Neural-Symbolic Reasoning
Ruoran Xu ⋅ Haoyu Cheng ⋅ Bin Dong ⋅ Qiufeng Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 232
Direction-aware 3D Large Multimodal Models
QUAN LIU ⋅ Weihao Xuan ⋅ Junjue Wang ⋅ Naoto Yokoya ⋅ Ling Shao ⋅ Shijian Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 233
CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space
Sohwi Lim ⋅ Lee Hyoseok ⋅ Jungjoon Park ⋅ Tae-Hyun Oh
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 234
Tackling Alignment Ambiguity in Person Retrieval through Conversational Attribute Mining
Hao Zou ⋅ Runqing Zhang ⋅ Jin Ding ⋅ xue zhou ⋅ Jianxiao Zou ⋅ Mingzhu Cai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 235
Beyond Global Similarity: Multi-Conditional Retrieval for Fine-Grained Cross-Modal Understanding
Xuan Lu ⋅ Kangle Li ⋅ Haohang Huang ⋅ Rui Meng ⋅ Wenjun Zeng ⋅ Xiaoyu Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 236
Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval
Jun Li ⋅ Xuhang Lou ⋅ Jinpeng Wang ⋅ Yuting Wang ⋅ Yaowei Wang ⋅ Shu-Tao Xia ⋅ Bin Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 237
What Is the Optimal Ranking Score Between Precision and Recall? We Can Always Find It and It Is Rarely F1
Sébastien Piérard ⋅ Adrien Deliege ⋅ Marc Van Droogenbroeck
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 238
Robust Remote Sensing Image–Text Retrieval with Noisy Correspondence
qiya song ⋅ Yiqiang Xie ⋅ Yuan Sun ⋅ Renwei Dian ⋅ Xudong Kang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 239
PinPoint: Evaluation of Composed Image Retrieval with Explicit Negatives, Multi-Image Queries, and Paraphrase Testing
Rohan Mahadev ⋅ Joyce Yuan ⋅ Patrick Poirson ⋅ David Xue ⋅ Hao-Yu Wu ⋅ Dmitry Kislyuk
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 240
Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance
Naifu Xue ⋅ Zhaoyang Jia ⋅ Jiahao Li ⋅ Bin Li ⋅ Zihan Zheng ⋅ Yuan Zhang ⋅ Yan Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 241
Memory Matters: Boosting Training-Free Zero-Shot Temporal Action Localization with a Learnable Lookup Table
Han Jiang ⋅ Haoyu Tang ⋅ Xiaoxuan Mu ⋅ Chen Li ⋅ Jihua Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 242
TVHighlights: LLM-Guided Human-Free Collaborative Training for Video Highlight Detection in Movies and TV Dramas
Qi Qiu ⋅ Xuan Wu ⋅ Jiawei Peng ⋅ Yuan Miao ⋅ Xu Yang ⋅ Yanlong Du
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 243
Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing
Weitong Cai ⋅ Hang Zhang ⋅ Yukai Huang ⋅ Shitong Sun ⋅ Jiankang Deng ⋅ Songcen Xu ⋅ Jifei Song ⋅ Zhensong Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 244
Reinforcing Structured Chain-of-Thought for Video Understanding
Peiyao Wang ⋅ Haotian Xu ⋅ Noranart Vesdapunt ⋅ Rui Hou ⋅ Jingyi Zhang ⋅ Haibin Ling ⋅ Oleksandr Obiednikov ⋅ Ning Zhou ⋅ Kah Fu Fu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 245
FlexiVideo: Variation-Aware Temporal Dynamics Modeling for Efficient Video Understanding
Da Peng ⋅ Xuesong Yang ⋅ Zonghao Guo ⋅ Yichen Zhang ⋅ Chi Chen ⋅ Yidan Zhang ⋅ Yuan Yao ⋅ Fang Wan ⋅ Wei Ke ⋅ Maosong Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 246
MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos
Arkaprava Sinha ⋅ Monish Soundar Raj ⋅ Pu Wang ⋅ Ahmed Helmy ⋅ Hieu Le ⋅ Srijan Das
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 247
Learning Effective Sign Features without Text for Gloss-free Sign Language Translation
Shiwei Gan ⋅ Xiao Liu ⋅ Yafeng Yin ⋅ Nan Liu ⋅ Kuizhuang Liu ⋅ Desibieer Tuerdaken ⋅ Zhiwei Jiang ⋅ Lei Xie ⋅ Sanglu Lu ⋅ Hongkai Wen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 248
META: Meta Evolution of Tool Trajectory Adaptation for Long-Video Understanding
Jing Huang ⋅ Luyuan Chen ⋅ Zhijie Xu ⋅ Yadong Li ⋅ Xingzhong Xu ⋅ Siye Chen ⋅ Jie Liu ⋅ Ming Kong ⋅ Qiang Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 249
GT-SVJ: Generative-Transformer-Based Self-Supervised Video Judge For Efficient Video Reward Modeling
Shivanshu Shekhar ⋅ Uttaran Bhattacharya ⋅ Raghavendra Addanki ⋅ Mehrab Tanjim ⋅ Somdeb Sarkhel ⋅ Tong Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 250
Local Motion Matters: A Deconstruct–Recompose Paradigm for Reinforcement Learning Pre-training from Videos
Jinwen Wang ⋅ Youfang Lin ⋅ Xiaobo Hu ⋅ Shuo Wang ⋅ Kai Lv
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 251
Align Once to Explain: Feature Alignment for Scalable B-cosification of Foundational Vision Transformers
Raphael Maser ⋅ Siddhartha Gairola ⋅ Sukrut Rao ⋅ Bernt Schiele
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 252
Rounded or Streamlined Head? Bridging Concept Bottleneck Models and Attribute-Described Object Parts
Yang Liu ⋅ Jiajin Zhang ⋅ Yaojun Hu ⋅ Bingguang Hao ⋅ Xin Cao ⋅ Yingda Xia ⋅ Danyang Tu ⋅ Shi Gu ⋅ Ling Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 253
CIGMA: Causal Information-Gain Mechanistic Attribution of Attention Heads in Vision Transformers
Maisha Maliha ⋅ Dean F. Hougen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 254
Rethinking Concept Bottleneck Models: From Pitfalls to Solutions
Merve Tapli ⋅ Quentin Bouniot ⋅ Wolfgang Stammer ⋅ Zeynep Akata ⋅ Emre Akbas
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 255
Make it SING: Analyzing Semantic Invariants in Classifiers
Harel Yadid ⋅ Meir Yossef Levi ⋅ Roy Betser ⋅ Guy Gilboa
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 256
Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations
Chao Wang ⋅ chengan che ⋅ Xinyue Chen ⋅ Sophia Tsoka ⋅ Luis Carlos Garcia Peraza Herrera
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 257
LEADER: Learning Reliable Local-to-Global Correspondences for LiDAR Relocalization
Jianshi Wu ⋅ Minghang Zhu ⋅ dq Liu ⋅ Wen Li ⋅ Sheng Ao ⋅ Siqi Shen ⋅ Chenglu Wen ⋅ Cheng Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 258
UniCorrn: Unified Correspondence Transformer Across 2D and 3D
Prajnan Goswami ⋅ Tianye Ding ⋅ Feng Liu ⋅ Huaizu Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 259
Probabilistic Discrepancy Learning for Roadside LiDAR Scene Completion
Xiaogang Wu ⋅ Jinchao Hu ⋅ Zixian Wang ⋅ Dun Liu ⋅ BoXiang Cheng ⋅ Yiqiang Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 260
TACO: Task-Aware Contrastive Learning for Joint LiDAR Localization and 3D Object Detection
Leyuan Xing ⋅ huanjia zhang ⋅ Dongyu Pan ⋅ Hai Wu ⋅ Qiming Xia ⋅ Kezheng Xiong ⋅ Wen Li ⋅ Chenglu Wen ⋅ Cheng Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 261
Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning
Xingyu Zhu ⋅ Yi Liang ⋅ Shuo Wang ⋅ Wenbo Zhu ⋅ Yongliang Wu ⋅ Beier Zhu ⋅ Hanwang Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 262
Learning Coordinate-based Convolutional Kernels for Continuous SE(3) Equivariant and Efficient Point Cloud Analysis
Jaein Kim ⋅ Hee Bin Yoo ⋅ Dong-Sig Han ⋅ Byoung-Tak Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 263
R3-PCQA: Ray-Reprojection-Reinforcement for No-Reference 3D Point Cloud Quality Assessment
Junhyuk Seo ⋅ Sanghyuk SEO ⋅ Dawoon Kim ⋅ Heeseok Oh
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 264
Geometric-Aware Hypergraph Reasoning for Novel Class Discovery in Point Cloud Segmentation
Zihao Zhang ⋅ Aming Wu ⋅ Li Yang ⋅ Yahong Han ⋅ Jialie Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 265
PointCSP: Cross-Sample Semantic Propagation and Stability Preservation in Self-Supervised Point Cloud Learning
Xinxing Yu ⋅ Ajian Liu ⋅ Sunyuan Qiang ⋅ Hui Ma ⋅ Liying Yang ⋅ Yuzhong Wang ⋅ Zhi Rao ⋅ Yanyan Liang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 266
U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences
Xiang Xu ⋅ Ao Liang ⋅ Youquan Liu ⋅ Linfeng Li ⋅ Lingdong Kong ⋅ Ziwei Liu ⋅ Qingshan Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 267
TerraSeg: Self-Supervised Ground Segmentation for Any LiDAR
Ted Lentsch ⋅ Santiago Montiel-Marín ⋅ Holger Caesar ⋅ Dariu M. Gavrila
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 268
Where Does Vision Meet Language? Understanding and Refining Visual Fusion in MLLMs via Contrastive Attention
Shezheng Song ⋅ Shasha Li ⋅ Shan Zhao ⋅ Xiaopeng Li ⋅ Qian Wan ⋅ Chengyu Wang ⋅ Tianwei Yan ⋅ Ma Jun ⋅ Jie Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 269
UniRefiner: Teaching Pre-trained ViTs to Self-Dispose Dross via Contrastive Register
Congpei Qiu ⋅ Zhaoyu Hu ⋅ Wei Ke ⋅ Zhuotao Tian ⋅ Yanhao Wu ⋅ Tong Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 270
SigLino: Efficient Multi-Teacher Distillation for Agglomerative Vision Foundation Models
Sofian Chaybouti ⋅ Sanath Narayan ⋅ Yasser Dahou ⋅ Phúc H. Lê Khắc ⋅ Ankit Singh ⋅ Ngoc Dung Huynh ⋅ Wamiq Reyaz Para ⋅ Hilde Kuehne ⋅ Hakim Hacid
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 271
Heuristic-inspired Reasoning Priors Facilitate Data-Efficient Referring Object Detection
Xu Zhang ⋅ Zhe Chen ⋅ Jing Zhang ⋅ Dacheng Tao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 272
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
Zebin You ⋅ Shen Nie ⋅ Xiaolu Zhang ⋅ JUN ZHOU ⋅ Zhiwu Lu ⋅ Ji-Rong Wen ⋅ Chongxuan Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 273
AVION: Aerial Vision–Language Instruction from Offline Teacher to Prompt-Tuned Network
Yu Hu ⋅ Jianyang Gu ⋅ Hao Liu ⋅ Yue Cao ⋅ Jozsef Hamari ⋅ Zheng Liu ⋅ Mohsen Zardadi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 274
CrossVL: Complexity-Aware Feature Routing and Paired Curriculum for Cross-View Vision-Language Detection
Zhipeng Liu ⋅ Chunbo Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 275
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models
Byung-Kwan Lee ⋅ Yu-Chiang Frank Wang ⋅ Ryo Hachiuma
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 276
Role-SynthCLIP: A Role-Play Driven Diverse Synthetic Data Approach
Yuanxiang Huangfu ⋅ Chaochao wang ⋅ weilei wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 277
BiMotion: B-spline Motion for Text-guided Dynamic 3D Character Generation
Miaowei Wang ⋅ Qingxuan Yan ⋅ Zhi Cao ⋅ Yayuan Li ⋅ Oisin Mac Aodha ⋅ Jason J. Corso ⋅ Amir Vaxman
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 278
PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow
Xincheng Shuai ⋅ Song Tang ⋅ Yutong Huang ⋅ Henghui Ding ⋅ Dacheng Tao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 279
CADFS: A Big CAD Program Dataset and Framework for Computer-Aided Design with Large Language Models
Vladislav Pyatov ⋅ Gleb Bobrovskikh ⋅ Saveliy Galochkin ⋅ Nikita Boldyrev ⋅ Oleg Voynov ⋅ Alexander Filippov ⋅ Gonzalo Ferrer ⋅ Peter Wonka ⋅ Evgeny Burnaev
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 280
MapRoute: Precise-Concept Erasing Mappers via Semantic Routing
Sihao Li ⋅ Baixi Baixi ⋅ Shuohong Xia ⋅ Yunyun Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 281
PhotoFramer: Multi-modal Image Composition Instruction
Zhiyuan You ⋅ Ke Wang ⋅ He Zhang ⋅ Xin Cai ⋅ Jinjin Gu ⋅ Tianfan Xue ⋅ Chao Dong ⋅ Zhoutong Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 282
Can We Build Scene Graphs, Not Classify Them? FlowSG: Progressive Image-Conditioned Scene Graph Generation with Flow Matching
Xin Hu ⋅ Ke Qin ⋅ Wen Yin ⋅ Yuan-Fang Li ⋅ Ming Li ⋅ Tao He
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 283
DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance
Peiying Zhang ⋅ Nanxuan Zhao ⋅ Matthew Fisher ⋅ Yiran Xu ⋅ Jing Liao ⋅ Difan Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 284
Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post‑hoc Debiasing in Vision-Language Models
Dachuan Zhao ⋅ Weiyue Li ⋅ Zhenda Shen ⋅ Yushu Qiu ⋅ Bowen Xu ⋅ Haoyu Chen ⋅ Yongchao Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 285
Frequency-domain Manipulation for Face Obfuscation
Jintae Kim ⋅ Keunsoo Ko ⋅ Chang-Su Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 286
Towards Reasoning-Preserving Unlearning in Multimodal Large Language Models
Hongji Li ⋅ Manjiang Yu ⋅ Junchi Yao ⋅ PRIYANKA SINGH ⋅ Xue Li ⋅ Di Wang ⋅ Lijie Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 287
Erasing Thousands of Concepts: Towards Scalable and Practical Concept Erasure for Text-to-Image Diffusion Models
Hoigi Seo ⋅ Byung Hyun Lee ⋅ Jaehyun Cho ⋅ Sungjin Lim ⋅ Se Young Chun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 288
POUR: A Provably Optimal Method for Unlearning Representation via Neural Collapse
Anjie Le ⋅ Can Peng ⋅ Yuyuan Liu ⋅ Alison Noble
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 289
Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks
Ngoc-Bao Nguyen ⋅ Sy-Tuyen Ho ⋅ Koh Jun Hao ⋅ Ngai-Man Cheung
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 290
Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure
Ziling Wang ⋅ Shuya Yang ⋅ Jialin Lu ⋅ Ka-Ho Chow
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 291
SPDMark: Selective Parameter Displacement for Robust Video Watermarking
Samar Fares ⋅ Nurbek Tastan ⋅ Karthik Nandakumar
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 292
Enhancing Visual Representation with Textual Semantics: Textual Semantics-Powered Prototypes for Heterogeneous Federated Learning
Xinghao Wu ⋅ Jianwei Niu ⋅ Xuefeng Liu ⋅ Guogang Zhu ⋅ Jiayuan Zhang ⋅ Shaojie Tang ⋅ Wei Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 293
FedHarmony: Harmonizing Heterogeneous Label Correlations in Federated Multi-Label Learning
Zhiqiang Kou ⋅ Junxiang Wu ⋅ Wenke Huang ⋅ Wenwen He ⋅ Ming-Kun Xie ⋅ Changwei Wang ⋅ Yuheng Jia ⋅ Di Jiang ⋅ Yang Liu ⋅ Xin Geng ⋅ Qiang Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 294
FedSST: Rethinking Fair Federated Graph Learning under Structural Shift
Dingyi Zhao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 295
GDFA: Geometry-Driven Federated Unlearning with Directional Task Vector Alignment
Xiuting Weng ⋅ Ruizhi Pu ⋅ Yuanhang Yao ⋅ Kun Yue ⋅ Zhiwen Tang ⋅ Lixing Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 296
FedARA: Resource-adaptive Low-rank Personalized Federated Learning via Anchor-driven Representation Alignment on Heterogeneous Edge Devices
Ruonan Zhao ⋅ Zheng Wang ⋅ Debin Liu ⋅ shijie lv ⋅ Laurence Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 297
InterRVOS: Interaction-Aware Referring Video Object Segmentation
Woojeong Jin ⋅ Seongchan Kim ⋅ Jaeho Lee ⋅ Seungryong Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 298
RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding
Hanqing Liu ⋅ Mingjie Liu ⋅ Luoping Cui ⋅ Endian Lin ⋅ Donghong Jiang ⋅ Chuang Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 299
RegFormer: Transferable Relational Grounding for Efficient Weakly-Supervised Human-Object Interaction Detection
Jihwan Park ⋅ Chanhyeong Yang ⋅ Jinyoung Park ⋅ Taehoon Song ⋅ Hyunwoo J. Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 300
Learning to Refuse: Refusal-Aware Reinforcement Fine-Tuning for Hard-Irrelevant Queries in Video Temporal Grounding
Jin-Seop Lee ⋅ Sungjoon Lee ⋅ SeongJun Jung ⋅ Boyang Li ⋅ Jee-Hyong Lee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 301
GroundVTS: Visual Token Sampling in Multimodal Large Language Models for Video Temporal Grounding
Rong Fan ⋅ Kaiyan Xiao ⋅ Minghao Zhu ⋅ Liuyi Wang ⋅ KAI DAI ⋅ Zhao Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 302
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Jun Zhang ⋅ Teng Wang ⋅ Yuying Ge ⋅ Yixiao Ge ⋅ Xinhao Li ⋅ Limin Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 303
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
Sizhong Qin ⋅ Ramon Elias Weber ⋅ Xinzheng Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 304
MeToM: Metadata-Guided Token Merging for Efficient Video LLMs
Zhuojie Wu ⋅ Shijie Wang ⋅ Xin Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 305
Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models
Jinlong Li ⋅ Liyuan Jiang ⋅ Haonan Zhang ⋅ Nicu Sebe
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 306
VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Kyle Sargent ⋅ Ruiqi Gao ⋅ Philipp Henzler ⋅ Charles Herrmann ⋅ Aleksander Holynski ⋅ Li Fei-Fei ⋅ Jiajun Wu ⋅ Jason Y. Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 307
Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models
Sijie Li ⋅ Biao Qian ⋅ Jungong Han
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 308
Attention-aware Inference Optimizations for Large Vision-Language Models with Memory-efficient Decoding
Fatih Ilhan ⋅ Gaowen Liu ⋅ Ramana Kompella ⋅ Selim Tekin ⋅ Tiansheng Huang ⋅ Zachary Yahn ⋅ Yichang Xu ⋅ Ling Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 309
CoIn: Coverage and Informativeness-Guided Token Reduction for Efficient Large Multimodal Models
Chenxi Du ⋅ Yongheng Deng ⋅ Jiani Liu ⋅ Yujia Zhang ⋅ Xi Chen ⋅ Ju Ren
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 310
TAMER: A Tri-Modal Contrastive Alignment and Multi-Scale Embedding Refinement Framework for Zero-Shot ECG Diagnosis
Xuewei Zhou ⋅ Yajie Meng ⋅ Pan Zeng ⋅ Xianfang Tang ⋅ Feifei Cui ⋅ Qiangguo Jin ⋅ Jialiang Yang ⋅ Junlin Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 311
Your Dissimilarities Define You: Complementary Learning Exploiting Class Diversities
Dimitrios Katsikas ⋅ Nikolaos Passalis ⋅ Anastasios Tefas
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 312
CGU-Bayes: Causal Graph Uncertainty-Guided Bayesian Inference for Domain Generalization
Naiyu Yin ⋅ Hanjing Wang ⋅ Yue Yu ⋅ Tian Gao ⋅ Amit Dhurandhar ⋅ Chung-Hao Lee ⋅ Qiang Ji
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 313
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
Shashanka Venkataramanan ⋅ Valentinos Pariza ⋅ Mohammadreza Salehi ⋅ Lukas Knobel ⋅ Elias Ramzi ⋅ Spyros Gidaris ⋅ Andrei Bursuc ⋅ Yuki M Asano
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 314
Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video
Yuting Tan ⋅ Xilong Cheng ⋅ Yunxiao Qin ⋅ Zhengnan Li ⋅ Jingjing Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 315
LRDUN: A Low-Rank Deep Unfolding Network for Efficient Spectral Compressive Imaging
HE HUANG ⋅ Yujun Guo ⋅ Wei He
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 316
Neural Collapse in Test-Time Adaptation
Xiao Chen ⋅ Zhongjing Du ⋅ Jiazhen Huang ⋅ Jiang Xu ⋅ Li Lu ⋅ Jingyan Jiang ⋅ Zhi Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 317
CLEX: Complementary Label Exchange Learning for Noisy Facial Expression Recognition
Lin Wang ⋅ Fang Liu ⋅ Xiaofen Xing ⋅ Kailing Guo ⋅ Xiangmin Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 318
TruckDrive: Long-Range Autonomous Highway Driving Dataset
Filippo Ghilotti ⋅ Edoardo Palladin ⋅ Samuel Brucker ⋅ Adam Sigal ⋅ Mario Bijelic ⋅ Felix Heide
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 319
Neuro-Cognitive Reward Modeling for Human-Centered Autonomous Vehicle Control
Zhuoli Zhuang ⋅ Yu-Cheng Chang ⋅ Yu-Kai Wang ⋅ Thomas Do ⋅ Chin-teng Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 320
E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
Yihong Tang ⋅ Haicheng Liao ⋅ Tong Nie ⋅ Junlin He ⋅ Ao Qu ⋅ Kehua Chen ⋅ Wei Ma ⋅ Zhenning Li ⋅ Lijun Sun ⋅ Chengzhong Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 321
The Blind Spot of Adaptation: Quantifying and Mitigating Forgetting in Fine-tuned Driving Models
Runhao Mao ⋅ Hanshi Wang ⋅ Yixiang Yang ⋅ Qianli Ma ⋅ Jingmeng Zhou ⋅ Zhipeng Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 322
Den-TP: A Density-Balanced Data Curation and Evaluation Framework for Trajectory Prediction
Ruining Yang ⋅ Yi Xu ⋅ Yun Fu ⋅ Lili Su
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 323
Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving
Jianhua Han ⋅ Meng Tian ⋅ Jiangtong Zhu ⋅ Fan He ⋅ Huixin Zhang ⋅ Sitong Guo ⋅ Dechang Zhu ⋅ Hao Tang ⋅ Pei Xu ⋅ Yuze Guo ⋅ Minzhe Niu ⋅ Haojie Zhu ⋅ Qichao Dong ⋅ Xuechao Yan ⋅ Siyuan Dong ⋅ Lu Hou ⋅ Qingqiu Huang ⋅ Xiaosong Jia ⋅ Hang Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 324
GaussianDWM: 3D Gaussian Driving World Model for Unified Scene Understanding and Multi-Modal Generation
Tianchen Deng ⋅ Xuefeng Chen ⋅ Yi Chen ⋅ Qu Chen ⋅ Yuyao Xu ⋅ Lijin Yang ⋅ Le Xu ⋅ Yu Zhang ⋅ Bo Zhang ⋅ Wuxiong Huang ⋅ Hesheng Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 325
Mind the Hitch: Dynamic Calibration and Articulated Perception for Autonomous Trucks
morui zhu ⋅ Yongqi Zhu ⋅ Song Fu ⋅ Qing Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 326
DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving
Zhenjie Yang ⋅ Yilin Chai ⋅ Xiaosong Jia ⋅ Qifeng Li ⋅ Yuqian Shao ⋅ Xuekai Zhu ⋅ Haisheng Su ⋅ Junchi Yan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 327
Beyond Rule-Based Agents: Active Markov Games for Realistic Multi-Agent Interaction in Autonomous Driving
Yuan Gui ⋅ Hongchen Luo ⋅ Jiao Wang ⋅ Qu Liqi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 328
Test-Time Multi-Prompt Adaptation for Open-Vocabulary Remote Sensing Image Segmentation
Ting Yang ⋅ Qilong Wang ⋅ Qibin Hou ⋅ Qinghua Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 329
ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes
Emily Steiner ⋅ Jianhao Zheng ⋅ Henry Howard-Jenkins ⋅ Chris Xie ⋅ Iro Armeni
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 330
CrackSSM: Reviving SSMs for Crack Segmentation via Dynamic Scanning
Yubin Gu ⋅ Boyang Hou ⋅ Yuan Meng ⋅ Wenting Luo ⋅ Jiayi Ji ⋅ Xiaoshuai Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 331
BiPA: Bilevel Prompt Adaptation for Underwater Instance Segmentation
Long Ma ⋅ Haoze Zheng ⋅ Yuhang Mao ⋅ Jinyuan Liu ⋅ Chengpei Xu ⋅ Xinwei Xue ⋅ Yi Wang ⋅ Xiangjian He ⋅ Weimin Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 332
RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation
Kai Zhu ⋅ Zhenyu Cui ⋅ Zehua Zang ⋅ Jiahuan Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 333
Scene-Centric Unsupervised Video Panoptic Segmentation
Christoph Reich ⋅ Oliver Hahn ⋅ Nikita Araslanov ⋅ Laura Leal-Taixe ⋅ Christian Rupprecht ⋅ Daniel Cremers ⋅ Stefan Roth
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 334
Bootstrapping Video Semantic Segmentation Model via Distillation-assisted Test-Time Adaptation
Jihun Kim ⋅ Hoyong Kwon ⋅ Hyeokjun Kweon ⋅ Kuk-Jin Yoon
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 335
GeoFree-CoSeg: Unsupervised Point Cloud-Image Cross-Modal Co-Segmentation Without Geometric Alignment
Xin Duan ⋅ Xiabi Liu ⋅ Liyuan Pan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 336
Parameter-efficient Continual Learning for Enhancing Plasticity without Forgetting under Limited Model Capacity
Yitian Chen ⋅ Shigeng Zhang ⋅ Xuan Liu ⋅ Mingming Lu ⋅ Kai Chen ⋅ Hongye Zhu ⋅ Xinning Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 337
Dual-Estimator: Decoupling Global and Local Semantic Shift for Drift Compensation in Class-Incremental Learning
Fankang Xu ⋅ Lu Jin ⋅ Yanpeng Sun ⋅ Shiyu Xuan ⋅ Zechao Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 338
Continual Distillation of Teachers from Different Domains
Nicolas Michel ⋅ Maorong Wang ⋅ Jiangpeng He ⋅ Toshihiko Yamasaki
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 339
Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance
Songze Li ⋅ Mingyu Gao ⋅ Tonghua Su ⋅ Xu-Yao Zhang ⋅ Zhongjie Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 340
Learning from Itself: Mining Internal Knowledge from Vision Language Models for Continual Learning
Yizheng Gong ⋅ Siyue Yu ⋅ Waleed Al-Nuaimy ⋅ Jimin Xiao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 341
AdaPrior: Bayesian-Inspired Adaptive Prior Correction for Long-Tailed Continual Learning
S Divakar Bhat ⋅ Amit Popat More ⋅ Mudit Soni ⋅ Bhuvan Aggarwal
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 342
An Optimal Transport-driven Approach for Cultivating Latent Space in Online Incremental Learning
Quyen Tran ⋅ Hai Nguyen ⋅ Minh Quan Dao ⋅ Hoang Phan ⋅ Linh Ngo Van ⋅ Khoat Than ⋅ Dinh Phung ⋅ Dimitris Metaxas ⋅ Trung Le
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 343
HAD: Heterogeneity-Aware Distillation for Lifelong Heterogeneous Learning
Xuerui Zhang ⋅ Xuehao Wang ⋅ Zhan Zhuang ⋅ Linglan Zhao ⋅ Ziyue Li ⋅ Xinmin Zhang ⋅ Zhihuan Song ⋅ Yu Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 344
U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation
xiang deng ⋅ Feng Gao ⋅ Yong Zhang ⋅ Youxin Pang ⋅ Xu Xiaoming ⋅ Zhuoliang Kang ⋅ Xiaoming Wei ⋅ Yebin Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 345
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
Zhiyao Sun ⋅ Ziqiao Peng ⋅ Yifeng Ma ⋅ Yi Chen ⋅ zhengguang zhou ⋅ Zixiang Zhou ⋅ Guozhen Zhang ⋅ Youliang Zhang ⋅ Yuan Zhou ⋅ qinglin lu ⋅ Yong-Jin Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 346
FlashLips: 100-FPS Mask-Free Latent Lip-Sync using Reconstruction Instead of Diffusion or GANs
Andreas Zinonos ⋅ Michał Stypułkowski ⋅ Antoni Bigata Casademunt ⋅ Stavros Petridis ⋅ Maja Pantic ⋅ Nikita Drobyshev
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 347
WildCap: Facial Albedo Capture in the Wild via Hybrid Inverse Rendering
Yuxuan Han ⋅ Xin Ming ⋅ Tianxiao Li ⋅ Zhuofan Shen ⋅ Qixuan Zhang ⋅ Lan Xu ⋅ Feng Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 348
EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization
Haolan Xu ⋅ Keli Cheng ⋅ Lei Wang ⋅ Ning Bi ⋅ Xiaoming Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 349
DyaDiT: A Multi-Modal Diffusion Transformer for Socially Favorable Dyadic Gesture Generation
YICHEN PENG ⋅ Jyun-Ting Song ⋅ Siyeol Jung ⋅ RUOFAN LIU ⋅ Haiyang Liu ⋅ Xuangeng Chu ⋅ Ruicong Liu ⋅ Erwin Wu ⋅ Hideki Koike ⋅ Kris Kitani
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 350
TRM-VLA: Temporal-Aware Chain-of-Thought Reasoning and Memorization for Vision-Language-Action Models
LI XIANG ⋅ Yali Li ⋅ Yuan Wang ⋅ Shengjin Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 351
VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving
Jie Wang ⋅ Guang Li ⋅ Zhijian Huang ⋅ Chenxu Dang ⋅ Hangjun Ye ⋅ Yahong Han ⋅ Long Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 352
NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning
Ishaan Rawal ⋅ Shubh Gupta ⋅ Yihan Hu ⋅ Wei Zhan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 353
HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation
Chengjie Fan ⋅ Cong Pan ⋅ Zijian Liu ⋅ Ningzhong Liu ⋅ Jie Qin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 354
CycleBEV: Regularizing View Transformation Networks via View Cycle Consistency for Bird’s-Eye-View Semantic Segmentation
Jeongbin Hong ⋅ Dooseop Choi ⋅ Taeg-Hyun An ⋅ KYOUNG AN AN ⋅ Kyoung-Wook Min
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 355
STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction
Jiankuo Zhao ⋅ Xiangyu Zhu ⋅ Zidu Wang ⋅ Zhen Lei
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 356
CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image
Yizheng Song ⋅ Yiyu Zhuang ⋅ Qipeng Xu ⋅ Haixiang Wang ⋅ Jiahe Zhu ⋅ Jing Tian ⋅ Siyu Zhu ⋅ Hao Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 357
OMG-Avatar: One-shot Multi-LOD Gaussian Head Avatar
Jianqiang Ren ⋅ Lin Liu ⋅ Steven Hoi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 358
Globally Optimal Pose from Orthographic Silhouettes
Agniva Sengupta ⋅ Dilara Kus ⋅ Jianning Li ⋅ Stefan Zachow
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 359
AvatarPointillist: AutoRegressive 4D Gaussian Avatarization
Hongyu Liu ⋅ Xuan Wang ⋅ Zijian Wu ⋅ yating wang ⋅ Ziyu Wan ⋅ Yue Ma ⋅ Runtao Liu ⋅ Boyao Zhou ⋅ Yujun Shen ⋅ Qifeng Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 360
COPO: Causal-Oriented Policy Optimization for Hallucinations of MLLMs
Peizheng Guo ⋅ Jingyao Wang ⋅ Wenwen Qiang ⋅ Jiahuan Zhou ⋅ Changwen Zheng ⋅ Gang Hua
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 361
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding
Zhongxing Xu ⋅ Zhonghua Wang ⋅ Zhe Qian ⋅ Dachuan Shi ⋅ feilong tang ⋅ Ming Hu ⋅ Shiyan Su ⋅ Xiaocheng Zou ⋅ Wei Feng ⋅ Dwarikanath Mahapatra ⋅ Yifan Peng ⋅ Minquan Lin ⋅ Zongyuan Ge
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 362
AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM
Lian Zhong ⋅ Ziqiang He ⋅ Jibin Zheng ⋅ Jin Li ⋅ Z. Wang ⋅ xiangui Kang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 363
HulluEdit: Single-Pass Evidence-Consistent Subspace Editing for Mitigating Hallucinations in Large Vision-Language Models
Yangguang Lin ⋅ Quan Fang ⋅ Yufei Li ⋅ Jiachen Sun ⋅ Junyu Gao ⋅ Jitao Sang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 364
SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
Chang-Hsun Wu ⋅ Kai-Po Chang ⋅ Yu-Yang Sheng ⋅ Hung-Kai Chung ⋅ Kuei-Chun Wang ⋅ Yu-Chiang Frank Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 365
One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination
Zhan Fa ⋅ Yue Duan ⋅ Jian Zhang ⋅ Lei Qi ⋅ Yinghuan Shi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 366
EgoX: Egocentric Video Generation from a Single Exocentric Video
Taewoong Kang ⋅ Kinam Kim ⋅ Dohyeon Kim ⋅ Minho Park ⋅ Junha Hyung ⋅ Jaegul Choo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 367
SymphoMotion: Joint Control of Camera Motion and Object Dynamics for Coherent Video Generation
Guiyu Zhang ⋅ Yabo Chen ⋅ Xunzhi Xiang ⋅ Junchao Huang ⋅ Zhongyu Wang ⋅ Li Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 368
Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion
Ting-Hsuan Chen ⋅ Ying-Huan Chen ⋅ Tao Tu ⋅ Jie-Ying Lee ⋅ Cho-Ying Wu ⋅ Fangzhou Lin ⋅ Hengyuan Zhang ⋅ David Paz ⋅ Xinyu Huang ⋅ Yuliang Guo ⋅ Yu-Lun Liu ⋅ Yue Wang ⋅ Liu Ren
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 369
SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation
Yu Yuan ⋅ Tharindu Wickremasinghe ⋅ Zeeshan Nadir ⋅ Xijun Wang ⋅ Yiheng Chi ⋅ Stanley H. Chan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 370
ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding
Byeongjun Park ⋅ Byung-Hoon Kim ⋅ Hyungjin Chung ⋅ Jong Chul
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 371
Scaling4D: Pushing the Frontier of Video Novel View Synthesis through Large-Scale Monocular Videos
Hongrui Cai ⋅ Junjie Luo ⋅ Zhihong Fu ⋅ Shengnan Zhu ⋅ Jiawei Wen ⋅ Wanquan Feng ⋅ Songtao Zhao ⋅ Qian HE
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 372
PHANTOM: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics
Ying Shen ⋅ Jerry Xiong ⋅ Tianjiao Yu ⋅ Ismini Lourentzou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 373
WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling
Shaoheng Fang ⋅ Hanwen Jiang ⋅ Yunpeng Bai ⋅ Niloy J. Mitra ⋅ Qixing Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 374
Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer
Li Yuze ⋅ Dong Gong ⋅ Xiao Cao ⋅ Junchao Yuan ⋅ Dongsheng Li ⋅ Lei Zhou ⋅ Yun Sing Koh ⋅ Cheng Yan ⋅ Xinyu Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 375
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
Zhening Huang ⋅ Hyeonho Jeong ⋅ Xuelin Chen ⋅ Yulia Gryaditskaya ⋅ Tuanfeng Wang ⋅ Joan Lasenby ⋅ Chun-Hao Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 376
D2FANet: Enhancing Video Object Detection with Dual-Domain Feature Aggregation Network
Qiang Qi ⋅ Wenqi Shang ⋅ Meifang Wang ⋅ Xiao Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 377
HierUQ: Hierarchical Uncertainty Quantification with Adaptive Granularity Reconciliation for Degraded Image Classification
YANG CHU ⋅ Xiaomeng Yang ⋅ Keli Deng ⋅ Yuntao Qian
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 378
ID-Sim: An Identity-Focused Similarity Metric
Julia Chae ⋅ Nick Kolkin ⋅ Jui-Hsien Wang ⋅ Richard Zhang ⋅ Sara Beery ⋅ Cusuh Ham
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 379
Hier-COS: Making Deep Features Hierarchy-aware via Composition of Orthogonal Subspaces
Depanshu Sani ⋅ Saket Anand
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 380
Towards Cross-Modal Preservation, Consistency and Alignment for Privacy-Preserving Visible-Infrared Person Re-Identification
Yudi Xie ⋅ Zhongao Zhou ⋅ Bin Yang ⋅ Zhenghan Chen ⋅ Mang Ye
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 381
Enhancing Mixture-of-Experts Specialization via Cluster-Aware Upcycling
Sanghyeok Chu ⋅ Pyunghwan Ahn ⋅ Gwangmo Song ⋅ Seung Hwan Kim ⋅ Honglak Lee ⋅ Bohyung Han
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 382
COPE: Consistent Occlusion and Prompt Enhancement Network for Occluded Person Re-identification
Sun Siyi ⋅ Jinliang Lin ⋅ Juanjuan Weng ⋅ Zhihui Liu ⋅ Shaozi Li ⋅ Zhiming Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 383
Assignment-Driven Hash Learning in a Hyper-Semantic Space for On-the-Fly Category Discovery
Kaibing Yang ⋅ Yucheng Wang ⋅ Tingzhang Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 384
DyFCLT: Dynamic Frequency-Decoupled Cross-Modal Learning Transformer for Multimodal Tiny Object Detection
Chaolang Li ⋅ Pengwen Dai ⋅ Jingyu Li ⋅ Siyuan Yao ⋅ Yuchen Jiang ⋅ Zhuoran Zheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 385
EW-DETR: Evolving World Object Detection via Incremental Low-Rank DEtection TRansformer
Munish Monga ⋅ Vishal Chudasama ⋅ Pankaj Wasnik ⋅ C.V. Jawahar
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 386
Building a Precise Video Language with Human–AI Oversight
Zhiqiu Lin ⋅ Siyuan Cen ⋅ Chancharik Mitra ⋅ Isaac Li ⋅ Yuhan Huang ⋅ Yu Tong Tiffany Ling ⋅ Hewei Wang ⋅ Irene Pi ⋅ Shihang Zhu ⋅ Yili Han ⋅ Yilun Du ⋅ Deva Ramanan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 387
CoCoVideo: The High-Quality Commercial-Model-Based Contrastive Benchmark for AI-Generated Video Detection
Huidong Feng ⋅ Wentao Chen ⋅ Jie Chen ⋅ Xinqi Cai ⋅ Ruolong Ma ⋅ Yinglin Zheng ⋅ Yuxin Lin ⋅ Ming Zeng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 388
Towards Sparse Video Understanding and Reasoning
Chenwei Xu ⋅ Zhen Ye ⋅ Shang Wu ⋅ Weijian Li ⋅ Zihan Wang ⋅ Zhuofan Xia ⋅ Lie Lu ⋅ Pranav Maneriker ⋅ Fan Du ⋅ Manling Li ⋅ Han Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 389
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
Jialuo Li ⋅ Bin Li ⋅ Jiahao Li ⋅ Yan Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 390
MuKV: Multi-Grained KV Cache Compression for Long Streaming Video Question-Answering
Junbin Xiao ⋅ Jiajun Chen ⋅ Tianxiang Sun ⋅ Xun Yang ⋅ Angela Yao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 391
ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding
Quan Kong ⋅ Yuhao Shen ⋅ Yicheng Ji ⋅ Huan Li ⋅ Cong Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 392
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generation
Harold Haodong Chen ⋅ Disen Lan ⋅ Wen-Jie Shu ⋅ Qingyang Liu ⋅ Zihan Wang ⋅ Sirui CHEN ⋅ Wenkai Cheng ⋅ Kanghao Chen ⋅ Hongfei (Faye) Zhang ⋅ Zixin Zhang ⋅ Rongjin Guo ⋅ Yu Cheng ⋅ Ying-Cong Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 393
What Are You Doing? A Closer Look at Controllable Human Video Generation
Emanuele Bugliarello ⋅ Anurag Arnab ⋅ Roni Paiss ⋅ Christy Koh ⋅ Pieter-Jan Kindermans ⋅ Cordelia Schmid
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 394
Score2Instruct: Scaling Up Video Quality-Centric Instructions via Automated Dimension Scoring
Qizhi Xie ⋅ Kun Yuan ⋅ Yunpeng Qu ⋅ Jiachao Gong ⋅ Mingda Wu ⋅ Ming Sun ⋅ Chao Zhou ⋅ Jihong Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 395
CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance
Hanyang Wang ⋅ Yiyang Liu ⋅ Jiawei Chi ⋅ Fangfu Liu ⋅ Ran Xue ⋅ Yueqi Duan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 396
Towards Holistic Modeling for Video Frame Interpolation with Auto-regressive Diffusion Transformers
Xinyu Peng ⋅ Han Li ⋅ Yuyang Huang ⋅ Ziyang Zheng ⋅ Yaoming Wang ⋅ Xin Chen ⋅ Wenrui Dai ⋅ Chenglin Li ⋅ Junni Zou ⋅ Hongkai Xiong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 397
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers
Dahye Kim ⋅ Deepti Ghadiyaram ⋅ Raghudeep Gadde
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 398
Towards High-resolution and Disentangled Reference-based Sketch Colorization
Dingkun Yan ⋅ Xinrui Wang ⋅ Ru Wang ⋅ Zhuoru Li ⋅ Jinze Yu ⋅ Yusuke Iwasawa ⋅ Yutaka Matsuo ⋅ Jiaxian Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 399
MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation
Yiren Song ⋅ Cheng Liu ⋅ Mike Zheng Shou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 400
Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers
Ruidong Chen ⋅ Yancheng Bai ⋅ Xuanpu Zhang ⋅ Jianhao Zeng ⋅ Lanjun Wang ⋅ Dan Song ⋅ Lei Sun ⋅ Xiangxiang Chu ⋅ An-An Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 401
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping
Sunghyun Park ⋅ Jeongho Kim ⋅ Hyoungwoo Park ⋅ Debasmit Das ⋅ Sungrack Yun ⋅ Munawar Hayat ⋅ Jaegul Choo ⋅ Fatih Porikli ⋅ Seokeon Choi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 402
COT-FM: Cluster-wise Optimal Transport Flow Matching
Chiensheng Chiang ⋅ Kuan-Hsun Tu ⋅ Jia-Wei Liao ⋅ Cheng-Fu Chou ⋅ Tsung-Wei Ke
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 403
Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers
Youngjun Jun ⋅ seil kang ⋅ Woojung Han ⋅ Seong Jae Hwang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 404
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
Xingyu Zhou ⋅ Qifan Li ⋅ Xiaobin Hu ⋅ Hai Chen ⋅ Shuhang Gu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 405
CoopDiff: A Diffusion-Guided Approach for Cooperation under Corruptions
Gong Chen ⋅ Chaokun Zhang ⋅ Pengcheng Lv
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 406
RARE: Learn to RAnk and REtrieve for Monocular 3D Object Detection
Hyeonjeong Park ⋅ Peixi Xiong ⋅ Xiaoqian Ruan ⋅ Dian Jia ⋅ Pei Yu ⋅ Wei Tang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 407
COG: Confidence-aware Optimal Geometric Correspondence for Unsupervised Single-reference Novel Object Pose Estimation
Yuchen Che ⋅ JINGTU WU ⋅ Hao ZHENG ⋅ Asako Kanezaki
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 408
Learnability-Driven Submodular Optimization for Active Roadside 3D Detection
Ruiyu Mao ⋅ Baoming Zhang ⋅ Nicholas Ruozzi ⋅ Yunhui Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 409
Look Before You Fuse: 2D-Guided Cross-Modal Alignment for Robust 3D Detection
Xiang Li ⋅ Zhangchi Hu ⋅ Xu Xiao ⋅ Bin Kong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 410
Long-SCOPE: Fully Sparse Long-Range Cooperative 3D Perception
Jiahao Wang ⋅ Zikun Xu ⋅ Yuner Zhang ⋅ Zhongwei Jiang ⋅ Chenyang Lu ⋅ Shuocheng Yang ⋅ Yuxuan Wang ⋅ Jiaru Zhong ⋅ Chuang Zhang ⋅ Shaobing Xu ⋅ Jianqiang Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 411
Dynamics-Aware Preference Optimization for Vision-Language Models
jusheng zhang ⋅ Kaitong Cai ⋅ Jing Yang ⋅ Jian Wang ⋅ Keze Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 412
Selection-as-Nonlinearity: Bridging Attention and Activation via a Joint Game–Decision Lens for Interpretable, Discriminative Visual Representations
Sudong Cai ⋅ Shuai Yuan ⋅ Bingzhi Chen ⋅ Rui Mao ⋅ Bing Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 413
Learning What Helps: Task-Aligned Context Selection for Vision Tasks
Jingyu Guo ⋅ Emir Konuk ⋅ Fredrik Strand ⋅ Christos Matsoukas ⋅ Kevin Smith
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 414
Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR
Yulong Zhang ⋅ Tianyi Liang ⋅ Erfei Cui ⋅ Guoqing Wang ⋅ Xu Guo ⋅ Chenhui Li ⋅ Gongshen Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 415
NeuroRule: Bridging Vision and Logic with Differentiable Rule Induction
Muhammad Zarar ⋅ Mingzheng Zhang ⋅ Xiaowang Zhang ⋅ Zhiyong Feng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 416
Beyond Graph Model: Reliable VLM Fine-Tuning via Random Graph Adapter
Bo Jiang ⋅ Xueyang Ze ⋅ Beibei Wang ⋅ Xixi Wang ⋅ Xixi Wan ⋅ Bin Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 417
Ego: Embedding-Guided Personalization of Vision-Language Models
Soroush Seifi ⋅ Simon Gardier ⋅ Vaggelis Dorovatas ⋅ Daniel Olmeda Reino ⋅ Rahaf Aljundi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 418
JoPPO: Hierarchical Photography Assessment via Contrastive Joint Conditional Probabilistic Reinforcement Learning
Yifan Yang ⋅ Juntuo Wang ⋅ Yuming Qiao ⋅ Xudong Zhang ⋅ Chunyang Yu ⋅ Yan Li ⋅ Xiao Lin ⋅ Liang Luo ⋅ Dan Meng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 419
AeroAgent: A Vision–Physics–Decision Framework for Aerodynamic Vehicle Design
Ye Liu ⋅ Shouyi Li ⋅ Huiyu Yang ⋅ Jianghang gu ⋅ Wenhao Fan ⋅ Zhongxin Yang ⋅ Ding Wang ⋅ Simeng Chen ⋅ Zirun Jiang ⋅ Yuanwei Bin ⋅ Shiyi Chen ⋅ Yuntian Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 420
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
Tianyu Yu ⋅ Zefan Wang ⋅ Chongyi Wang ⋅ Fuwei Huang ⋅ Wenshuo Ma ⋅ Zhihui He ⋅ Tianchi Cai ⋅ Weize Chen ⋅ Yuxiang Huang ⋅ Ranchi Zhao ⋅ Bokai Xu ⋅ Junbo Cui ⋅ Yingjing Xu ⋅ Liqing Ruan ⋅ Luoyuan Zhang ⋅ Hanyu Liu ⋅ Jingkun Tang ⋅ Hongyuan Liu ⋅ Qining Guo ⋅ Wenhao Hu ⋅ Bingxiang He ⋅ Jie Zhou ⋅ Jie Cai ⋅ Ji Qi ⋅ Zonghao Guo ⋅ Chi Chen ⋅ Guoyang Zeng ⋅ Yuxuan Li ⋅ Ganqu Cui ⋅ Ning Ding ⋅ Xu Han ⋅ Yuan Yao ⋅ Zhiyuan Liu ⋅ Maosong Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 421
Prune Wisely, Reconstruct Sharply: Compact 3D Gaussian Splatting via Adaptive Pruning and Difference-of-Gaussian Primitives
Haoran Wang ⋅ Guoxi Huang ⋅ Fan Zhang ⋅ David Bull ⋅ Nantheera Anantrasirichai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 422
MSCD-GS: Motion-Separated Cooperative Deblurring Dynamic Reconstruction via Gaussian Splatting
yongjian liao ⋅ Xu Zou ⋅ Wenjun Chen ⋅ Huixuan Li ⋅ Xiaoen Xie ⋅ Chunxi Li ⋅ Shixiang Huang ⋅ Gang Zhang ⋅ Jiahuan Zhou ⋅ Sheng Zhong ⋅ Luxin Yan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 423
P2GS: Physical Prior-guided Gaussian Splatting for Photometrically Consistent Urban Reconstruction
Kota Shimomura ⋅ Hidehisa Arai ⋅ Tsubasa Takahashi ⋅ Takayoshi Yamashita ⋅ Hironobu Fujiyoshi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 424
iSplat: Iterative Learning for Fine-Grained Gaussian Splatting
Haifeng Wu ⋅ Wei Long ⋅ Shuhang Gu ⋅ Lixin Duan ⋅ Wen Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 425
Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Arthur Moreau ⋅ Richard Shaw ⋅ Michal Nazarczuk ⋅ Jisu Shin ⋅ Thomas Tanay ⋅ Zhensong Zhang ⋅ Songcen Xu ⋅ Eduardo Pérez-Pellitero
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 426
MAPo: Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction
Han Jiao ⋅ Jiakai Sun ⋅ Yexing Xu ⋅ Lei Zhao ⋅ Wei Xing ⋅ Huaizhong Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 427
FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario
Hang Dai ⋅ Hongwei Fan ⋅ Han Zhang ⋅ Duojin Wu ⋅ Jiyao Zhang ⋅ Hao Dong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 428
HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views
Jiashu Li ⋅ Xumeng Han ⋅ Zhaoyang Wei ⋅ Zipeng Wang ⋅ Kuiran Wang ⋅ Guorong Li ⋅ Zhenjun Han ⋅ Jianbin Jiao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 429
SharpTimeGS: Sharp and Stable Dynamic Gaussian Splatting via Lifespan Modulation
Zhanfeng Liao ⋅ Jiajun Zhang ⋅ Hanzhang Tu ⋅ Zhixi Wang ⋅ Yunqi Gao ⋅ Hongwen Zhang ⋅ Yebin Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 430
Physically Inspired Gaussian Splatting for HDR Novel View Synthesis
Huimin Zeng ⋅ Yue Bai ⋅ hailing wang ⋅ Yun Fu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 431
PhysIR-Splat: Physically Consistent Thermal Infrared Radiative Transfer in 3D Gaussian Splatting
Jingyuan Gao ⋅ Yumeng Hu ⋅ Fei Gao ⋅ Mingjin Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 432
4C4D: 4 Camera 4D Gaussian Splatting
Junsheng Zhou ⋅ Zhifan Yang ⋅ Liang Han ⋅ Wenyuan Zhang ⋅ Kanle Shi ⋅ Shenkun Xu ⋅ Yu-Shen Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 433
SplatSuRe: Selective Super-Resolution for Multi-view Consistent 3D Gaussian Splatting
Pranav Asthana ⋅ Alex Hanson ⋅ Allen Tu ⋅ Tom Goldstein ⋅ Matthias Zwicker ⋅ Amitabh Varshney
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 434
GaussianZoom: Progressive Zoom-in Generative 3D Gaussian Splatting with Geometric and Semantic Guidance
Jiale Shi ⋅ Jiarui Hu ⋅ Zesong Yang ⋅ Kaixuan Luan ⋅ Hujun Bao ⋅ Zhaopeng Cui
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 435
MotionScale: Reconstructing Appearance, Geometry, and Motion of Dynamic Scenes with Scalable 4D Gaussian Splatting
Haoran Zhou ⋅ Gim Hee Lee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 436
PRIMU: Uncertainty Estimation for Novel Views in Gaussian Splatting from Primitive-Based Representations of Error and Coverage
Thomas Gottwald ⋅ Edgar Heinert ⋅ Peter Stehr ⋅ Chamuditha Jayanga Galappaththige ⋅ Matthias Rottmann
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 437
TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion
Rui Qian ⋅ Haozhi Cao ⋅ Tianchen Deng ⋅ TIANXIN HU ⋅ Weixiang Guo ⋅ Shenghai Yuan ⋅ Lihua Xie
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 438
Disco-GS: Gaussian Splatting in Dynamic Color Lighting
Ashish Kumar ⋅ A. N. Rajagopalan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 439
ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
Alberto Compagnoni ⋅ Marco Morini ⋅ Sara Sarto ⋅ Federico Cocchi ⋅ Davide Caffagni ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Rita Cucchiara
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 440
GuardTrace-VL: Detecting Unsafe Multimodel Reasoning via Iterative Safety Supervision
Yuxiao Xiang ⋅ Junchi Chen ⋅ Zhenchao Jin ⋅ Changtao Miao ⋅ Haojie Yuan ⋅ Qi Chu ⋅ Tao Gong ⋅ Nenghai Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 441
AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
Zichuan Lin ⋅ Yicheng Liu ⋅ Yang Yang ⋅ Lvfang Tao ⋅ Deheng Ye
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 442
See It, Say It, Sorted: An Iterative Training-Free Framework for Visually-Grounded Multimodal Reasoning in LVLMs
Yongchang Zhang ⋅ Xianzheng Ma ⋅ Tianyi Liu ⋅ Guangquan Zhou ⋅ Yang Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 443
Will Multimodal Models Be Dazzled by Multi-Image Visual Puzzles?
zhi zhu ⋅ YaoQi Fan ⋅ Zhe Chen ⋅ Yue Cao ⋅ Yangzhou Liu ⋅ Tong Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 444
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking
Yufei Zhan ⋅ Ziheng Wu ⋅ Yousong Zhu ⋅ Rongkun Xue ⋅ Guanghao Zhou ⋅ Ruipu Luo ⋅ Zhenghao Chen ⋅ Can Zhang ⋅ Yifan Li ⋅ Zhentao he ⋅ Zheming Yang ⋅ Ming Tang ⋅ Minghui Qiu ⋅ Jinqiao Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 445
Visual Grounding for Object Questions
Martin Nicolas Everaert ⋅ Xiruo Liu ⋅ Hiroyuki Takeda ⋅ Raja Bala ⋅ Vivek Yadav ⋅ Vidya Narayanan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 446
CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal Reasoning
Yongxin Wang ⋅ Zhicheng Yang ⋅ Meng Cao ⋅ Mingfei Han ⋅ Haokun Lin ⋅ Yingying Zhu ⋅ Xiaojun Chang ⋅ Xiaodan Liang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 447
What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models
Yingqi Fan ⋅ Junlong Tong ⋅ Anhao Zhao ⋅ Xiaoyu Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 448
Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models
Jialiang Zhang ⋅ Junlong Tong ⋅ Junyan Lin ⋅ Hao Wu ⋅ Yirong Sun ⋅ Yunpu Ma ⋅ Xiaoyu Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 449
Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Rui Liu ⋅ Dian Yu ⋅ Lei Ke ⋅ Haolin Liu ⋅ Yujun Zhou ⋅ Zhenwen Liang ⋅ Haitao Mi ⋅ Pratap Tokekar ⋅ Dong Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 450
Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization
Yifan Du ⋅ Kun Zhou ⋅ Yingqian Min ⋅ Yue Ling ⋅ Wayne Xin Zhao ⋅ Youbin Wu ⋅ Ji-Rong Wen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 451
Monet: Reasoning in Latent Visual Space Beyond Image and Language
Qixun Wang ⋅ Yang Shi ⋅ Yifei Wang ⋅ Yuanxing Zhang ⋅ Pengfei Wan ⋅ Kun Gai ⋅ Xianghua Ying ⋅ Yisen Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 452
STAR-R1: Multi-View Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs
Zongzhao Li ⋅ Zongyang Ma ⋅ Mingze Li ⋅ Songyou Li ⋅ Yu Rong ⋅ Tingyang Xu ⋅ Ziqi Zhang ⋅ Deli Zhao ⋅ Wenbing Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 453
From Where Things Are to What They Are For: Benchmarking Spatial–Functional Intelligence in Multimodal LLMs
Le Zhang ⋅ Jihan Yang ⋅ Soundarya Krishnan ⋅ Jimit Majmudar ⋅ Xiou Ge ⋅ Prasoon Puri ⋅ Prathamesh Saraf ⋅ Shruti Bhargava ⋅ Dhivya Piraviperumal ⋅ Yinan Ling ⋅ Cindy Pan ⋅ Hong Yu ⋅ Aishwarya Agrawal ⋅ Bo-Hsiang Tseng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 454
Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models
Ruiying Peng ⋅ Xueyu Wu ⋅ Jing Lei ⋅ Lu Hou ⋅ Yuanzheng Ma ⋅ Xiaohui Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 455
S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations
Arnav Chavan ⋅ Nahush Lele ⋅ Udbhav Bamba ⋅ Sankalp Dayal ⋅ Aditi Raghunathan ⋅ Deepak Gupta
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 456
OneSparse: A Unified Framework for Sparse Activation Layers in Vision Models
Xingkui Zhu ⋅ Dingkang Liang ⋅ Cheng Chen ⋅ Daoxin Zhang ⋅ lv hanxiang ⋅ Zhe Xu ⋅ Yao Hu ⋅ Xiang Bai
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 457
What Matters in Practical Learned Image Compression
Kedar Tatwawadi ⋅ Parisa Rahimzadeh ⋅ Zhanghao Sun ⋅ Zhiqi Chen ⋅ Ziyun Yang ⋅ Sanjay Nair ⋅ Divija Hasteer ⋅ Oren Rippel
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 458
BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
Chaodong XIAO ⋅ Zhengqiang ZHANG ⋅ Lei Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 459
Ultra-Low Bitrate Perceptual Image Compression with Shallow Encoder
Tianyu Zhang ⋅ Dong Liu ⋅ Chang Wen Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 460
LazyVAR: Accelerating Visual Autoregressive Models via Scale-wise Token Pruning and Parallel Group Decoding
Rongge Mao ⋅ Chengqi Dong ⋅ S Kevin Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 461
Spk2VidNet: A Hierarchical Recurrent Architecture for High-Fidelity Video Reconstruction from Long Spike-Camera Streams
Yuanlin Wang ⋅ Ruiqin Xiong ⋅ Jiyu Xie ⋅ Zhenkun Zhu ⋅ Zhaofei Yu ⋅ Xiaopeng Fan ⋅ Tiejun Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 462
Adaptive Learned Image Compression with Graph Neural Networks
Yunuo Chen ⋅ Bing He ⋅ Zezheng Lyu ⋅ Hongwei Hu ⋅ Qunshan Gu ⋅ Yuan Tian ⋅ Guo Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 463
SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation
Zixuan Pan ⋅ Kaiyuan Tang ⋅ Jun Xia ⋅ Yifan Qin ⋅ Lin Gu ⋅ Chaoli Wang ⋅ Jianxu Chen ⋅ Yiyu Shi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 464
VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping
Haotian Dong ⋅ Ye Li ⋅ Rongwei Lu ⋅ Chen Tang ⋅ Shu-Tao Xia ⋅ Zhi Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 465
HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition
Suhan Woo ⋅ Seongwon Lee ⋅ jinwoo jang ⋅ Euntai Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 466
LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment
Shuaibang Peng ⋅ Juelin Zhu ⋅ Xia Li ⋅ Kun Yang ⋅ Yu Liu ⋅ Maojun Zhang ⋅ Shen Yan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 467
CoLoR: The Devil is in Scene Coordinate Regression for Large-Scale Visual Localization
Xindong Mao ⋅ Hang Li ⋅ Yuchen Wu ⋅ Jiahe Li ⋅ Xiao Bai ⋅ Jin Zheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 468
Affine Perspective-Three-Point Problem
Gaku Nakano
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 469
Sky2Ground: A Benchmark for Site Modeling under Varying Altitude
Zengyan Wang ⋅ Sirshapan Mitra ⋅ Rajat Modi ⋅ Hui Xian Grace Lim ⋅ Yogesh Rawat
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 470
SemanticVLA: Towards Semantic Reasoning over Action Memorization via Synergistic Explicit Trace and Latent Action Planning
Fei Ni ⋅ Zhuo Chen ⋅ Yifu Yuan ⋅ Zibin Dong ⋅ Xianze Yao ⋅ Shan Luo ⋅ Jianye Hao ⋅ Jiankang Deng ⋅ Stefanos Zafeiriou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 471
WebGym: Scaling Training Environments for Long-Horizon Visual Web Agents with Realistic Tasks
Hao Bai ⋅ Alexey Taymanov ⋅ Tong Zhang ⋅ Aviral Kumar ⋅ Spencer Whitehead
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 472
Beyond Perceptual Shortcuts: Causal-Inspired Debiasing Optimization for Generalizable Video Reasoning in Lightweight MLLMs
Jingze Wu ⋅ Quan Zhang ⋅ Hongfei Suo ⋅ Zeqiang Cai ⋅ Hongbo Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 473
APPO: Attention-guided Perception Policy Optimization for Video Reasoning
Henghui Du ⋅ Chang Zhou ⋅ Xi Chen ⋅ Di Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 474
RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward
Qiucheng Wu ⋅ Jing Shi ⋅ Simon Jenni ⋅ Kushal Kafle ⋅ Tianyu Wang ⋅ Shiyu Chang ⋅ Handong Zhao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 475
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
Yaolun Zhang ⋅ Ruohui Wang ⋅ Jiahao Wang ⋅ Yepeng Tang ⋅ Xuanyu Zheng ⋅ Haonan Duan ⋅ Hao Lu ⋅ Hanming Deng ⋅ Lewei Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 476
Visual Document Understanding and Reasoning: A Multi-Agent Collaboration Framework with Agent-Wise Adaptive Test-Time Scaling
Xinlei Yu ⋅ Chengming Xu ⋅ Zhangquan Chen ⋅ Yudong Zhang ⋅ Shilin Lu ⋅ Cheng Yang ⋅ Jiangning Zhang ⋅ Shuicheng Yan ⋅ Xiaobin Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 477
GazeOnce360: Fisheye-Based 360° Multi-Person Gaze Estimation with Global–Local Feature Fusion
Zhuojiang Cai ⋅ Zhenghui Sun ⋅ Feng Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 478
Bridging Human Evaluation to Infrared and Visible Image Fusion
Jinyuan Liu ⋅ Xingyuan Li ⋅ Qingyun Mei ⋅ HaoYuan Xu ⋅ Zhiying Jiang ⋅ Long Ma ⋅ Risheng Liu ⋅ Xin Fan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 479
Beyond Strict Pairing: Arbitrarily Paired Training for High-Performance Infrared and Visible Image Fusion
Yanglin Deng ⋅ Tianyang Xu ⋅ Chunyang Cheng ⋅ Hui Li ⋅ Xiao-Jun Wu ⋅ Josef Kittler
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 480
Semantic-Adaptive Diffusion for Dynamic Spatiotemporal Fusion
Jinsong Zhang ⋅ Ying Qu ⋅ Yuan Liao ⋅ Hairong Qi ⋅ Zhenzhou Shao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 481
Bayesian Decomposition and Semantic Completion for Few-shot Semantic Segmentation
Guangchen Shi ⋅ Yirui Wu ⋅ Wei Zhu ⋅ Tao Wang ⋅ Hao Zhang ⋅ Bo Li ⋅ Tong Lu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 482
From Few-way to Many-way: Rethinking Few-shot Fine-grained Image Classification
Li-Jun Zhao ⋅ Zhen-Duo Chen ⋅ Xin Luo ⋅ Xin-Shun Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 483
STiTch: Semantic Transition and Transportation in Collaboration for Training-Free Zero-Shot Composed Image Retrieval
Miaoge Li ⋅ Dongsheng Wang ⋅ Zening Sun ⋅ Jinsen Zhang ⋅ Wenhan Luo ⋅ Jingcai Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 484
Selective, Regularized, and Calibrated: Harnessing Vision Foundation Models for Cross-Domain Few-Shot Semantic Segmentation
junyuan ma ⋅ Xunzhi Xiang ⋅ Wenbin Li ⋅ Qi Fan ⋅ Yang Gao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 485
FlowComposer: Composable Flows for Compositional Zero-Shot Learning
Zhenqi He ⋅ Lin Li ⋅ Long Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 486
ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation
Ayush Roy ⋅ Wei-Yang Alex Lee ⋅ Rudrasis Chakraborty ⋅ Vishnu Suresh Lokhande
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 487
DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models
Qichao Wang ⋅ Yunhong Lu ⋅ Hengyuan Cao ⋅ Junyi Zhang ⋅ Min Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 488
UniRain: Unified Image Deraining with RAG-based Dataset Distillation and Multi-objective Reweighted Optimization
Qianfeng Yang ⋅ Qiyuan Guan ⋅ Xiang Chen ⋅ Jiyu Jin ⋅ Guiyue Jin ⋅ Jiangxin Dong
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 489
Leveraging Multispectral Sensors for Color Correction in Mobile Cameras
Luca Cogo ⋅ Marco Buzzelli ⋅ Simone Bianco ⋅ Javier Vazquez-Corral ⋅ Raimondo Schettini
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 490
Differentiable Adaptive 4D Structured Illumination for Joint Capture of Shape and Reflectance
Huakeng Ding ⋅ Yaowen Chen ⋅ Kun Zhou ⋅ Hongzhi Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 491
Optical Diffraction-based Convolution for Semiconductor Lithography
Young-Han Son ⋅ Dong-Hee Shin ⋅ Deok-Joong Lee ⋅ Hyun Jung Lee ⋅ Tae-Eui Kam
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 492
GSNR: Graph Smooth Null-Space Representation for Inverse Problems
Romario Gualdrón-Hurtado ⋅ Roman Jacome ⋅ Rafael S. Suárez ⋅ Henry Arguello
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 493
MatE: Material Extraction from Single-Image via Geometric Prior
Zeyu Zhang ⋅ Wei Zhai ⋅ Jian Yang ⋅ Yang Cao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 494
αMatte4K & µMatting: Dataset and Model for Ultra-Micro Precision Alpha Video Matting
Xinyi Chen ⋅ Hang Dong ⋅ Baowei Jiang ⋅ Shenkun Xu ⋅ Youqi Guan ⋅ Kanle Shi ⋅ Kun Gai ⋅ Haichuan Song
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 495
Revisiting Optimal Coding for I-ToF under Practical Sensor Constraints
WENBIN LUO ⋅ Takafumi Iwaguchi ⋅ Ryusuke Sagawa ⋅ Hiroshi Kawasaki
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 496
Dynamic Black-hole Emission Tomography with Physics-informed Neural Fields
Berthy T. Feng ⋅ Andrew A. Chael ⋅ David Bromley ⋅ Aviad Levis ⋅ William Freeman ⋅ Katherine L. Bouman
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 497
Exploring Spatiotemporal Feature Propagation for Video-Level Compressive Spectral Reconstruction: Dataset, Model and Benchmark
Lijing Cai ⋅ Zhan Shi ⋅ Chenglong Huang ⋅ Jinyao Wu ⋅ Qiping Li ⋅ Zikang Huo ⋅ Linsen Chen ⋅ Chongde Zi ⋅ Xun Cao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 498
Generalizable Radio-Frequency Radiance Fields for Spatial Spectrum Synthesis
Kang Yang ⋅ Yuning Chen ⋅ Wan Du
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 499
SAR2Net: Learning Spatially Anchored Representations for Retrieval-Guided Cross-Stain Alignment
Tianle Shen ⋅ Fang Yan ⋅ Xiaofan Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 500
Advancing Cancer Prognosis with Hierarchical Fusion of Genomic, Proteomic and Pathology Imaging Data from a Systems Biology Perspective
Junjie Zhou ⋅ Bao Xue ⋅ Meiling Wang ⋅ WEI SHAO ⋅ Daoqiang Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 501
PromptStereo: Zero-Shot Stereo Matching via Structure and Motion Prompts
Xianqi Wang ⋅ Hao Yang ⋅ Hangtian Wang ⋅ JunDa Cheng ⋅ Gangwei Xu ⋅ Min Lin ⋅ Xin Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 502
Any Resolution Any Geometry: From Multi-View To Multi-Patch
Wenqing Cui ⋅ Zhenyu Li ⋅ Mykola Lavreniuk ⋅ Jian Shi ⋅ Ramzi Idoughi ⋅ Xiangjun Tang ⋅ Peter Wonka
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 503
Paparazzo: Active Mapping of Moving 3D Objects
Davide Allegro ⋅ Shiyao Li ⋅ Stefano Ghidoni ⋅ Vincent Lepetit
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 504
DepthFocus: Controllable Depth Estimation for See-Through Scenes
junhong min ⋅ Jimin Kim ⋅ Minwook Kim ⋅ Cheol-Hui Min ⋅ YOUNGPIL JEON ⋅ Minyong Choi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 505
OVI-MAP: Open-Vocabulary Instance-Semantic Mapping
Zilong Deng ⋅ Federico Tombari ⋅ Marc Pollefeys ⋅ Johanna Wald ⋅ Daniel Barath
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 506
PTC-Depth: Pose-Refined Monocular Depth Estimation with Temporal Consistency
Leezy Han ⋅ Seunggyu Kim ⋅ Dongseok Shim ⋅ Hyeonbeom Lee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 507
SceneScribe-1M: A Large-Scale Video Dataset with Comprehensive Geometric and Semantic Annotations
Yunnan Wang ⋅ Kecheng Zheng ⋅ Jianyuan Wang ⋅ Minghao Chen ⋅ David Novotny ⋅ Christian Rupprecht ⋅ Yinghao Xu ⋅ Xing Zhu ⋅ Wenjun Zeng ⋅ Xin Jin ⋅ Yujun Shen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 508
Omni-3DEdit: Generalized Versatile 3D Editing in One-Pass
Liyi Chen ⋅ Pengfei Wang ⋅ Guowen Zhang ⋅ Zhiyuan Ma ⋅ Lei Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 509
Ani3DHuman: Photorealistic 3D Human Animation with Self-guided Stochastic Sampling
Qi Sun ⋅ Can Wang ⋅ Jiaxiang Shang ⋅ Yingchun Liu ⋅ Jing Liao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 510
Variational Graph-based Normal Integration
Lixiong Chen ⋅ Bohan Yu ⋅ Victor Adrian Prisacariu ⋅ Imari Sato
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 511
Vinedresser3D: Towards Agentic Text-guided 3D Editing
Yankuan Chi ⋅ Xiang Li ⋅ Zixuan Huang ⋅ James M.
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 512
MV2UV: Generating High-quality UV Texture Maps with Multiview Prompts
Zheng Zhang ⋅ Qinchuan Zhang ⋅ Yuteng Ye ⋅ Zhi Chen ⋅ Penglei Ji ⋅ Mengfei Li ⋅ Wenxiao ZHANG ⋅ Yuan Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 513
Learning Hierarchical Hyperbolic Mixture Model for Part-aware 3D Generation
Qitong Yang ⋅ Mingtao Feng ⋅ Zijie Wu ⋅ Huixin Zhu ⋅ Weisheng Dong ⋅ Yaonan Wang ⋅ Ajmal Mian
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 514
MeshRipple: Structured Autoregressive Generation of Artist-Meshes
JunKai Lin ⋅ Hang Long ⋅ Huipeng Guo ⋅ Jielei Zhang ⋅ JiaYi Yang ⋅ Tianle Guo ⋅ Yang Yang ⋅ Jianwen Li ⋅ Wenxiao ZHANG ⋅ Matthias Nießner ⋅ Wei Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 515
FACE: A Face-based Autoregressive Representation for High-Fidelity and Efficient Mesh Generation
Hanxiao Wang ⋅ Yuanchen Guo ⋅ Ying-Tian Liu ⋅ Zi-Xin Zou ⋅ Biao Zhang ⋅ Weize Quan ⋅ Ding Liang ⋅ Yan-Pei Cao ⋅ Dong-Ming Yan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 516
Easy3E: Feed-Forward 3D Asset Editing via Rectified Voxel Flow
Shimin Hu ⋅ Yuanyi Wei ⋅ Fei Zha ⋅ Yudong Guo ⋅ Juyong Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 517
CUPID: Generative 3D Reconstruction via Joint Object and Pose Modeling
Binbin Huang ⋅ Haobin Duan ⋅ Yiqun Zhao ⋅ Zibo Zhao ⋅ Yi Ma ⋅ Shenghua Gao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 518
3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image
Ze-Xin Yin ⋅ Liu Liu ⋅ Xinjie wang ⋅ Wei Sui ⋅ Zhizhong Su ⋅ Jian Yang ⋅ Jin Xie
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 519
DRM: Diffusion-based Reward Model With Step-wise Guidance
Jaxon Zhang ⋅ Binxin Yang ⋅ Hubery Yin ⋅ Chen Li ⋅ Jing LYU
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 520
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning
Chubin Chen ⋅ Sujie Hu ⋅ Jiashu Zhu ⋅ Meiqi Wu ⋅ Jintao Chen ⋅ Yanxun Li ⋅ Nisha Huang ⋅ Chengyu Fang ⋅ Jiahong Wu ⋅ Xiangxiang Chu ⋅ Xiu Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 521
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation
Xinyao Liao ⋅ QIYUAN HE ⋅ Kai Xu ⋅ Xiaoye Qu ⋅ Yicong Li ⋅ Wei Wei ⋅ Angela Yao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 522
SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models
Jiesong Lian ⋅ Ruizhe Zhong ⋅ Zixiang Zhou ⋅ Xiaoyue Mi ⋅ Long Hu ⋅ Yuan Zhou ⋅ qinglin lu ⋅ yixue Hao ⋅ Junchi Yan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 523
AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
Jiahao Wang ⋅ Hualian Sheng ⋅ Sijia Cai ⋅ Yuxiao Yang ⋅ Weizhan Zhang ⋅ Caixia Yan ⋅ Bing Deng ⋅ Jieping Ye
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 524
Style-GRPO: Semantic-Aware Preference Optimization for Image Style Transfer Guided by Reward Modeling
Jianbin Zhao ⋅ Chaoran Feng ⋅ Miao Yu ⋅ Yingtao Li ⋅ Zhenyu Tang ⋅ Wangbo Yu ⋅ Yian Zhao ⋅ Xiaomin Li ⋅ Li Yuan ⋅ Yonghong Tian
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 525
LAMP: Language-Assisted Motion Planning for Controllable Video Generation
Muhammed Burak Kizil ⋅ Enes Şanlı ⋅ Niloy J. Mitra ⋅ Erkut Erdem ⋅ Aykut Erdem ⋅ Duygu Ceylan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 526
Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization
Tahira Kazimi ⋅ Connor Dunlop ⋅ Pinar Yanardag
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 527
Spectral Scalpel: Amplifying Adjacent Action Discrepancy via Frequency-Selective Filtering for Skeleton-Based Action Segmentation
Haoyu Ji ⋅ Bowen Chen ⋅ Zhihao Yang ⋅ Wenze Huang ⋅ Yu Gao ⋅ Xueting Liu ⋅ Weihong Ren ⋅ Zhiyong Wang ⋅ Honghai LIU
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 528
DETACH: Decomposed Spatio-Temporal Alignment for Exocentric Video and Ambient Sensors with Staged Learning
Junho Yoon ⋅ Jaemo Jeong ⋅ Hyunju Kim ⋅ Dongman Lee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 529
Learning a Unified Latent Action Space from Videos with Action-centric Cycle Consistency
Guangyan Chen ⋅ Qi Shao ⋅ Te Cui ⋅ Zichen Zhou ⋅ Weixin Mao ⋅ Luojie Yang ⋅ Meiling Wang ⋅ Yi Yang ⋅ Hua Chen ⋅ Yufeng Yue
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 530
VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition
Tanush Yadav ⋅ Reza Salehi ⋅ Jae Sung Park ⋅ Vivek Ramanujan ⋅ Hannaneh Hajishirzi ⋅ Yejin Choi ⋅ Ali Farhadi ⋅ Rohun Tripathi ⋅ Ranjay Krishna
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 531
BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning
Yuhan Xie ⋅ Chen Lyu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 532
Dynamic Momentum Recalibration in Online Gradient Learning
Zhipeng Yao ⋅ Rui Yu ⋅ Guisong Chang ⋅ Ying Li ⋅ Yu Zhang ⋅ Dazhou Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 533
Spherical Leech Quantization for Visual Tokenization and Generation
Yue Zhao ⋅ Hanwen Jiang ⋅ Zhenlin Xu ⋅ Chutong Yang ⋅ Ehsan Adeli ⋅ Philipp Krähenbühl
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 534
MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention
Pedro M. P. Curvo ⋅ Jan-Willem van de Meent ⋅ Maksim Zhdanov
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 535
GR-Gauge: Cost-efficient Training Configuration By Gauging the Gradient Redundancy
Guanjie Wang ⋅ Chen Chen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 536
E^2-SCI: Elastic Edge–Cloud Speculative Decoding via Credit Inertia
Senyao Li ⋅ Haozhao Wang ⋅ Zhaobai Jiang ⋅ Zhanbo Jin ⋅ Hao Fan ⋅ Ruixuan Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 537
HyperNAS: Enhancing Architecture Representation for NAS Predictor via Hypernetwork
Jindi Lv ⋅ Yuhao Zhou ⋅ Yuxin Tian ⋅ Qing Ye ⋅ Wentao Feng ⋅ Jiancheng Lv
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 538
NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity
Weijian Mai ⋅ Mu Nan ⋅ Yu Zhu ⋅ Jiahang Cao ⋅ Rui Zhang ⋅ Yuqin Dai ⋅ Chunfeng Song ⋅ Andrew F. Luo ⋅ Jiamin Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 539
Spectral Conformal Risk Control: Distribution-Free Tail Guarantees via Bayesian Quadrature
Mohammad Mahdi Kazemi Esfeh ⋅ Qi Yan ⋅ Yongxing Zhang ⋅ Zahra Gholami ⋅ Renjie Liao ⋅ Purang Abolmaesumi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 540
Edge-RecViT: Efficient Vision Transformer via Semantic-Refined Dynamic Recursion
YiZhou Li ⋅ Jinyi Xu ⋅ Mingyu Yin ⋅ Xianyi Zhao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 541
ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization
Anzhe Cheng ⋅ Shukai Duan ⋅ Shixuan Li ⋅ Chenzhong Yin ⋅ Mingxi Cheng ⋅ Heng Ping ⋅ Tamoghna Chattopadhyay ⋅ Sophia Thomopoulos ⋅ Shahin Nazarian ⋅ Paul Thompson ⋅ Paul Bogdan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 542
GUI-SAGE: Enhancing GUI Automation with Self-Explanatory Learning
Fei Tang ⋅ Zhangxuan Gu ⋅ Zhengxi Lu ⋅ Shangzhan Zhang ⋅ Zhengwen Zeng ⋅ Shuheng Shen ⋅ Changhua Meng ⋅ Yuchen Yan ⋅ Wenqi Zhang ⋅ Yongliang Shen ⋅ Weiming Lu ⋅ Yueting Zhuang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 543
GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks
Saelyne Yang ⋅ Jaesang Yu ⋅ Yi-Hao Peng ⋅ Kevin Qinghong Lin ⋅ Jae Won Cho ⋅ Yale Song ⋅ Juho Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 544
HiconAgent: History Context-aware Policy Optimization for GUI Agents
Xurui Zhou ⋅ Gongwei Chen ⋅ Yuquan Xie ⋅ Zaijing Li ⋅ Kaiwen Zhou ⋅ Shuai Wang ⋅ Shuo Yang ⋅ Zhuotao Tian ⋅ Rui Shao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 545
PET-DINO: Unifying Visual Cues into Grounding DINO with Prompt-Enriched Training
Weifu Fu ⋅ Jinyang Li ⋅ Bin-Bin Gao ⋅ Jialin Li ⋅ Yuhuan Lin ⋅ Hanqiu Deng ⋅ Wenbing Tao ⋅ Yong Liu ⋅ Chengjie Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 546
SDDF: Specificity-Driven Dynamic Focusing for Open-Vocabulary Camouflaged Object Detection
Jiaming Liang ⋅ Yifeng Zhan ⋅ Chunlin Liu ⋅ Weihua Zheng ⋅ bingye Peng ⋅ Qiwei Liang ⋅ Boyang Cai ⋅ Xiaochun Mai ⋅ Qiang Nie
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 547
Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset
Tsai-Ching Ni ⋅ ZhenQi Chen ⋅ YuanFu Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 548
Common Inpainted Objects In-N-Out of Context
Tianze Yang ⋅ Tyson Jordan ⋅ Ruitong Sun ⋅ Ninghao Liu ⋅ Jin Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 549
Prompt-Free Universal Region Proposal Network
Qihong Tang ⋅ Changhan Liu ⋅ Shaofeng Zhang ⋅ Wenbin Li ⋅ Qi Fan ⋅ Yang Gao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 550
Rewis3d: Reconstruction Improves Weakly-Supervised Semantic Segmentation
Jonas Ernst ⋅ Wolfgang Boettcher ⋅ Lukas Hoyer ⋅ Jan Lenssen ⋅ Bernt Schiele
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 551
PaNDaS: Learnable Shape Interpolation Modeling with Localized Control
Thomas Besnier ⋅ Emery Pierson ⋅ Sylvain Arguillere ⋅ Maks Ovsjanikov ⋅ Mohamed Daoudi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 552
Hilbert Curve-Based Attention Enabling Topology-Preserving Image Tensor Representation for Semantic Segmentation Network
Linkang Xu ⋅ Gang Li ⋅ Yue Song ⋅ Xiangxin Ji
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 553
Towards High-Quality Image Segmentation: Improving Topology Accuracy by Penalizing Neighbor Pixels
J. Miguel Valverde ⋅ Dim P. Papadopoulos ⋅ Rasmus Larsen ⋅ Anders Bjorholm Dahl
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 554
SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains
Qingmei Li ⋅ Yang Zhang ⋅ peifeng zhang ⋅ Haohuan Fu ⋅ Juepeng Zheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 555
Better than Average: Spatially-Aware Aggregation of Segmentation Uncertainty Improves Downstream Performance
Vanessa Emanuela Guarino ⋅ Claudia Winklmayr ⋅ Jannik Franzen ⋅ Josef Rumberger ⋅ Manuel Pfeuffer ⋅ Sonja Greven ⋅ Klaus Maier-Hein ⋅ Dagmar Kainmueller ⋅ Christoph Karg ⋅ Carsten T. Lüth
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 556
Universal 3D Shape Matching via Coarse-to-Fine Language Guidance
Qinfeng Xiao ⋅ Guofeng Mei ⋅ Bo Yang ⋅ Zhang Liying ⋅ Liying Zhang ⋅ Kit-lun Yick
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 557
Direct Segmentation without Logits Optimization for Training-Free Open-Vocabulary Semantic Segmentation
Jiahao Li ⋅ Yang Lu ⋅ Yachao Zhang ⋅ Fangyong Wang ⋅ Yuan Xie ⋅ Yanyun Qu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 558
CDICS: Delving Into Fine-Grained Attribute for In-Context Segmentation via Compositional Prompts and Phased Decoupling
Zhiyu Li ⋅ Dianmo Sheng ⋅ Qi Chu ⋅ Shilong Chen ⋅ Tao Gong ⋅ Zhou Wei ⋅ Nenghai Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 559
Discriminative Perception via Anchored Description for Reasoning Segmentation
Tao Yang ⋅ Qing Zhou ⋅ Yanliang Li ⋅ Qi Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 560
SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images
Zepeng Xin ⋅ Kaiyu Li ⋅ Luodi Chen ⋅ Wanchen Li ⋅ Xiao Yuchen ⋅ Hui Qiao ⋅ Weizhan Zhang ⋅ Deyu Meng ⋅ Xiangyong Cao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 561
Cross-Scale Pansharpening via ScaleFormer and the PanScale Benchmark
Ke Cao ⋅ Xuanhua He ⋅ Xueheng Li ⋅ Lingting Zhu ⋅ Yingying Wang ⋅ Ao Ma ⋅ Zhanjie Zhang ⋅ Man Zhou ⋅ Chengjun Xie ⋅ Jie Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 562
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
Shilei Cao ⋅ Ziyang Gong ⋅ Hehai Lin ⋅ Yang Liu ⋅ Jiashun Cheng ⋅ Xiaoxing Hu ⋅ Haoyuan Liang ⋅ Guowen Li ⋅ Chengwei Qin ⋅ Hong Cheng ⋅ Xue Yang ⋅ Juepeng Zheng ⋅ Haohuan Fu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 563
Multigrain-aware Semantic Prototype Scanning and Tri-Token Prompt Learning Embraced High-Order RWKV for Pan-Sharpening
Junfeng Li ⋅ Wenyang Zhou ⋅ Xueheng Li ⋅ Xuanhua He ⋅ Jianhou Gan ⋅ Wenqi Ren
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 564
ACPV-Net: All-Class Polygonal Vectorization for Seamless Vector Map Generation from Aerial Imagery
Weiqin Jiao ⋅ Hao Cheng ⋅ George Vosselman ⋅ Claudio Persello
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 565
Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction
wenfei guan ⋅ Jilin Mei ⋅ Tong Shen ⋅ Xumin Wu ⋅ Shuo Wang ⋅ Chen Min ⋅ Yu Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 566
Rotation Invariant and Symmetry Aware Pixel Difference Network for Remote Sensing Object Detection
Jialei Zhan ⋅ Li Liu ⋅ Jiehua Zhang ⋅ Yuhang Xie ⋅ Yongxiang Liu ⋅ Jiangming Chen ⋅ Mingming Cheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 567
F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation
Hengzhi Chen ⋅ Liqian Feng ⋅ Wenhua Wu ⋅ Xiaogang Zhu ⋅ Qiuxia Wu ⋅ Lianlei Shan ⋅ Kun Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 568
RoadGIE: Towards A Global-Scale Aerial Benchmark for Generalizable Interactive Road Extraction
Chenxu Peng ⋅ Chenxu Wang ⋅ Yimian Dai ⋅ Yongxiang Liu ⋅ Mingming Cheng ⋅ Xiang Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 569
PGA: Prior-free Generative Attack for Practical No-box Scenario
hongyu peng ⋅ Xiang Yuan ⋅ Gong Cheng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 570
Lipschitz Optimization for Formal Verification of Homographies
Jean-Guillaume Durand ⋅ Panagiotis Kouvaros ⋅ Maxime Gariel ⋅ Alessio Lomuscio
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 571
Batman: Benign Knowledge Alignment Through Malicious Null Space in Federated Backdoor Attack
Wenwen He ⋅ Wenke Huang ⋅ Yiyang Fang ⋅ Wenjie Qu ⋅ Jiaheng Zhang ⋅ Mang Ye
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 572
Out of Sight, Out of Track: Adversarial Attacks on Propagation-based Multi-Object Trackers via Query State Manipulation
Halima Bouzidi ⋅ Haoyu Liu ⋅ Yonatan Achamyeleh ⋅ Praneetsai Iddamsetty ⋅ Mohammad Al Faruque
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 573
Eliminate Distance Differences Induced by Backdoor Attacks: Layer-Selective Training and Clipping to Mask Backdoor Models
Xuzeng Li ⋅ Tao Zhang ⋅ Xiangyun Tang ⋅ JIACHENG WANG ⋅ Jian Wang ⋅ Jiawen Kang ⋅ Jiqiang Liu ⋅ Zhen Han ⋅ Dusit Niyato ⋅ Dong In Kim
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 574
Mitigating Error Amplification in Fast Adversarial Training
Mengnan Zhao ⋅ Lihe Zhang ⋅ Bo Wang ⋅ Tianhang Zheng ⋅ Hong Zhong ⋅ Geyong Min
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 575
Physical Adversarial Clothing Evades Visible-Thermal Detectors via Non-Overlapping RGB-T Pattern
Xiaopei Zhu ⋅ Guanning Zeng ⋅ Zhanhao Hu ⋅ Jun Zhu ⋅ Xiaolin Hu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 576
What Your Features Reveal: Data-Efficient Black-Box Feature Inversion Attack for Split DNNs
Zhihan Ren ⋅ Lijun He ⋅ Jiaxi Liang ⋅ Xinzhu Fu ⋅ Haixia Bi ⋅ Fan Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 577
Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures
Zeyao Liu ⋅ Zhendong Zhao ⋅ Xiaojun Chen ⋅ Xin Zhao ⋅ Yuexin Xuan ⋅ XIAOSHUANG JI
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 578
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation
Yongjie Bai ⋅ Zhouxia Wang ⋅ Yang Liu ⋅ Kaijun Luo ⋅ Yifan Wen ⋅ Mingtong Dai ⋅ weixing chen ⋅ Ziliang Chen ⋅ Lingbo Liu ⋅ Guanbin Li ⋅ Liang Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 579
Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment
Tao Lin ⋅ Yilei Zhong ⋅ Yuxin Du ⋅ Jingjing Zhang ⋅ Jiting Liu ⋅ Yinxinyu Chen ⋅ Encheng Gu ⋅ Ziyan Liu ⋅ Hongyi Cai ⋅ Yanwen Zou ⋅ Lixing Zou ⋅ Zhaoye Zhou ⋅ Gen Li ⋅ Bo Zhao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 580
FM-Steer: Enhance Generalist Policies with Value-Guided Cascaded Denoising
Haoming Song ⋅ Delin Qu ⋅ Yuanqi Yao ⋅ Qizhi Chen ⋅ Jiarui Li ⋅ Qi Lv ⋅ Yiwen Tang ⋅ Li Kang ⋅ Heng Zhou ⋅ Xianqiang Gao ⋅ Yuhang Tang ⋅ Xiaofan Li ⋅ Modi Shi ⋅ Guangrui Ren ⋅ Maoqing Yao ⋅ Bin Zhao ⋅ Dong Wang ⋅ Xuelong Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 581
Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning
Qiwei Liang ⋅ Boyang Cai ⋅ Minghao Lai ⋅ Sitong Zhuang ⋅ Tao Lin ⋅ Yan Qin ⋅ Yixuan Ye ⋅ Jiaming Liang ⋅ Renjing Xu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 582
Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation
Tairan He ⋅ Zi Wang ⋅ Haoru Xue ⋅ Qingwei Ben ⋅ Zhengyi Luo ⋅ Wenli Xiao ⋅ Ye Yuan ⋅ Xingye Da ⋅ Fernando Castañeda ⋅ Shankar Sastry ⋅ Changliu Liu ⋅ Guanya Shi ⋅ Jim Fan ⋅ Yuke Zhu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 583
Contact-Aware Neural Dynamics
Changwei Jing ⋅ Jai Krishna Bandi ⋅ Jianglong Ye ⋅ Yan Duan ⋅ Pieter Abbeel ⋅ Xiaolong Wang ⋅ Sha Yi
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 584
AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
Lei Xiao ⋅ Jifeng Li ⋅ Juntao Gao ⋅ Feiyang Ye ⋅ Yan Jin ⋅ Jingjing Qian ⋅ Jing Zhang ⋅ Yong Wu ⋅ Xiaoyuan Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 585
UAST: Unified Active Search and Tracking for Arbitrary Targets with UAVs
Liang Qin ⋅ Min Wang ⋅ Xingyu Lu ⋅ Aowen Qiu ⋅ Wengang Zhou ⋅ Houqiang Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 586
SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
Chaojun Ni ⋅ Chen Cheng ⋅ Xiaofeng Wang ⋅ Zheng Zhu ⋅ Wenzhao Zheng ⋅ Boyuan Wang ⋅ Tianrun Chen ⋅ Guosheng Zhao ⋅ Haoyun Li ⋅ Zhehao Dong ⋅ Qiang Zhang ⋅ Yun Ye ⋅ Yang Wang ⋅ Guan Huang ⋅ Wenjun Mei
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 587
Visual-RRT: Finding Paths toward Visual-Goals via Differentiable Rendering
Sebin Lee ⋅ Jumin Lee ⋅ Taeyeon Kim ⋅ Youngju Na ⋅ Woobin Im ⋅ Sung-Eui Yoon
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 588
Cross-Hand Latent Representation for Vision-Language-Action Models
Guangqi Jiang ⋅ Yutong Liang ⋅ Jianglong Ye ⋅ Jia-Yang Huang ⋅ Changwei Jing ⋅ Yan Duan ⋅ Pieter Abbeel ⋅ Xiaolong Wang ⋅ Xueyan Zou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 589
Beyond Success: Refining Elegant Robot Manipulation from Mixed-Quality Data via Just-in-Time Intervention
Yanbo Mao ⋅ Jianlong Fu ⋅ Ruoxuan Zhang ⋅ Hongxia Xie ⋅ Meibao Yao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 590
Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
Jiude Wei ⋅ Yuxuan Li ⋅ Cewu Lu ⋅ Jianhua Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 591
GeoPredict: Leveraging Predictive Kinematics and 3D Gaussian Geometry for Precise VLA Manipulation
Jingjing Qian ⋅ Boyao Han ⋅ Chen Shi ⋅ Lei Xiao ⋅ Long Yang ⋅ Shaoshuai Shi ⋅ Li Jiang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 592
From Manuals to Actions: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation
Chenyang Gu ⋅ Jiaming Liu ⋅ Hao Chen ⋅ Runzhong Huang ⋅ Qingpo Wuwu ⋅ Xiaoqi Li ⋅ Zhuoyang Liu ⋅ Ying Li ⋅ Ray Zhang ⋅ Peng Jia ⋅ Pheng-Ann Heng ⋅ Shanghang Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 593
Real-World Point Tracking with Verifier-Guided Pseudo-Labeling
Görkay Aydemir ⋅ Fatma Güney ⋅ Weidi Xie
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 594
Rethinking Occlusion Modeling for UAV Tracking
Jian Zhang ⋅ Xincheng Yu ⋅ Yi Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 595
Adaptive Capacity Autoregressive Visual Tracking
Tong Lin ⋅ Yifan Bai ⋅ Shiyi Liang ⋅ Ruigang Niu ⋅ Xing Wei
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 596
Spatio-Temporal Conditional Denoising Transformer for Modality-Missing RGBT Tracking
Andong Lu ⋅ Ziyi Zha ⋅ Jiandong Jin ⋅ Shihao Li ⋅ Chenglong Li ⋅ Jin Tang ⋅ Bin Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 597
Breaking Smooth-Motion Assumptions: A UAV Benchmark for Multi-Object Tracking in Complex and Adverse Conditions
Jingtao Ye ⋅ Kexin Zhang ⋅ Xunchi Ma ⋅ Johann Li ⋅ Guangming Zhu ⋅ Peiyi Shen ⋅ Linhua Jiang ⋅ Xiangdong Zhang ⋅ Liang Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 598
TrackMAE: Video Representation Learning via Track Mask and Predict
Renaud Vandeghen ⋅ Fida Mohammad Thoker ⋅ Marc Van Droogenbroeck ⋅ Bernard Ghanem
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 599
Dual-branch Distilled Transformer for Efficient Asymmetric UAV Tracking
Hongtao Yang ⋅ Bineng Zhong ⋅ Qihua Liang ⋅ Yaozong Zheng ⋅ Xiantao Hu ⋅ Yuanliang Xue ⋅ Shuxiang Song
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 600
Multi-view Crowd Tracking Transformer with View-Ground Interactions Under Large Real-World Scenes
Qi Zhang ⋅ Jixuan Chen ⋅ Zhang Kaiyi ⋅ Xinquan Yu ⋅ Antoni B. Chan ⋅ Hui Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 601
Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers
Cris Claessens ⋅ Christiaan Viviers ⋅ Giacomo D'Amicantonio ⋅ Egor Bondarev ⋅ Fons van der Sommen
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 602
MuViT: Multi-Resolution Vision Transformers for Learning Across Scales in Microscopy
Albert Dominguez Mantes ⋅ Gioele La Manno ⋅ Martin Weigert
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 603
SemVideo: Reconstructs What You Watch from Brain Activity via Hierarchical Semantic Guidance
Minghan Yang ⋅ LAN YANG ⋅ Ke Li ⋅ Honggang Zhang ⋅ Kaiyue Pang ⋅ Yi-Zhe Song
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 604
Multimodal Causality-Driven Representation Learning for Generalizable Medical Image Segmentation
XUSHENG LIANG ⋅ Lihua Zhou ⋅ Nianxin Li ⋅ miao xu ⋅ Ziyang Song ⋅ Dong Yi ⋅ Jinlin Wu ⋅ Jiawei Ma ⋅ Hongbin Liu ⋅ Zhen Lei ⋅ Jiebo Luo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 605
Simple Agents Outperform Experts in Biomedical Imaging Workflow Optimization
Xuefei Wang ⋅ Kai A. Horstmann ⋅ Ethan Lin ⋅ Jonathan Chen ⋅ Alexander Farhang ⋅ Sophia Stiles ⋅ Atharva Sehgal ⋅ Jonathan Light ⋅ David Valen ⋅ Yisong Yue ⋅ Jennifer J. Sun
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 606
TopoSlide: Topologically-Informed Histopathology Whole Slide Image Representation Learning
Shahira Abousamra ⋅ Asmita Sood ⋅ Sylvia Plevritis
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 607
Beyond the Static-World: Lifelong Learning for All-in-One Medical Image Restoration
Shihao Shan ⋅ Hongying Liu ⋅ Fanhua Shang ⋅ Liang Wan ⋅ Jingjing Deng
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 608
Hyperbolic Relational Prompts for Intersectional Fairness in Medical VLMs
Jiayu Qian ⋅ Zongxian Yang ⋅ Guanxing Chen ⋅ Pengwei Hu ⋅ KC Tan ⋅ Yan Wang ⋅ Yu-An Huang ⋅ Zhi-An Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 609
RNED: Rotary Number Encoding and Decoding for Quantitative Medical VLM Analysis
Fengbei Liu ⋅ Sunwoo Kwak ⋅ Nusrat Binta Nizam ⋅ Ilan Richter ⋅ Ashley Beecy ⋅ Jayant Raikhelkar ⋅ Deborah Estrin ⋅ Mert Sabuncu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 610
MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding
Basit Alawode ⋅ Arif Mahmood ⋅ Muaz Radi ⋅ Shahad Albastaki ⋅ Asim Khan ⋅ Muhammad Bilal ⋅ Moshira Ali Abdalla ⋅ Mohammed Bennamoun ⋅ Sajid Javed
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 611
Learning Generalizable 3D Medical Image Representations from Mask-Guided Self-Supervision
Yunhe Gao ⋅ Yabin Zhang ⋅ Chong Wang ⋅ Jiaming Liu ⋅ Maya Varma ⋅ Jean-Benoit Delbrouck ⋅ Akshay Chaudhari ⋅ Curtis Langlotz
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 612
BiOTPrompt: Bidirectional Optimal Transport Guided Prompting for Disease Evolution-aware Radiology Report Generation
Tengfei Liu ⋅ Yijian Fan ⋅ Boyue Wang ⋅ Yongli Hu ⋅ Mingjie Li ⋅ Jinghua Li ⋅ Junbin Gao ⋅ Xiaojun Chang ⋅ Zhihui Li ⋅ Baocai Yin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 613
Learning to See Through a Baby’s Eyes: Early Visual Diets Enable Robust Visual Intelligence in Humans and Machines
Yusen Cai ⋅ Qing Lin ⋅ BHARGAVA SATYA NUNNA ⋅ Mengmi Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 614
UDAPose: Unsupervised Domain Adaptation for Low-Light Human Pose Estimation
Haopeng Chen ⋅ Yihao Ai ⋅ Kabeen Kim ⋅ Robby T. Tan ⋅ Yixin Chen ⋅ Bo Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 615
Enhancing Accuracy of Uncertainty Estimation in Appearance-based Gaze Tracking with Probabilistic Evaluation and Calibration
Qiaojie Zheng ⋅ Jiucai Zhang ⋅ Amy Zhang ⋅ Xiaoli Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 616
SCAPO: Self-Supervised Category-Level Articulated Pose Estimation from a Single 3D Observation
Can Zhang ⋅ Gim Hee Lee
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 617
Composite-Attribute Person Re-Identification via Pose-Guided Disentanglement
Kartik Patwari ⋅ Noranart Vesdapunt ⋅ Chien-Yi Wang ⋅ Dawei Li ⋅ Cong Phuoc Huynh ⋅ Ning Zhou ⋅ Chen-Nee Chuah ⋅ Kah Fu Fu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 618
Representing 3D Faces with Learnable B-Spline Volumes
Prashanth Chandran ⋅ Daoye Wang ⋅ Timo Bolkart
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 619
RHINO: Reconstructing Human Interactions with Novel Objects from Monocular Videos
Lixin Xue ⋅ Chengwei Zheng ⋅ Georgios Paschalidis ⋅ Chen Guo ⋅ Manuel Kaufmann ⋅ Juan Zarate ⋅ Dimitrios Tzionas
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 620
HumanBA: Human-Aware Bundle Adjustment via Global Human-Camera Decoupling
Tanuj Sur ⋅ Tze Ho Elden Tse ⋅ Angela Yao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 621
HamiPose: Hamiltonian Optimization for Unsupervised Domain Adaptive Pose Estimation
Jiawen Li ⋅ Fei Jiang ⋅ Dandan Zhu ⋅ Aimin Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 622
KASALv2: Fully Automatic 3D Rotational Symmetry Classification and Axis Localization
Mengxin Zhang ⋅ Yulin Wang ⋅ Chen LUO ⋅ Yongzhe Li ⋅ Yijun Zhou
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 623
AnyLift: Scaling Motion Reconstruction from Internet Videos via 2D Diffusion
Hongjie Li ⋅ Heng Yu ⋅ Jiaman Li ⋅ Hong-Xing Yu ⋅ Ehsan Adeli ⋅ C. Karen Liu ⋅ Jiajun Wu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 624
Active Inference for Micro-Gesture Recognition: EFE-Guided Temporal Sampling and Adaptive Learning
Weijia Feng ⋅ Jingyu Yang ⋅ Ruojia Zhang ⋅ Fengtao Sun ⋅ Qian Gao ⋅ Chenyang Wang ⋅ tongtong Su ⋅ Jia Guo ⋅ Xiaobai Li ⋅ Minglai Shao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 625
ArtPro: Self-Supervised Articulated Object Reconstruction with Adaptive Integration of Mobility Proposals
Xuelu Li ⋅ Zhaonan Wang ⋅ Xiaogang Wang ⋅ Lei Wu ⋅ Manyi Li ⋅ Changhe Tu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 626
Similarity-Consistent Likelihood Diffusion enables Hidden Person Detection from Wall Reflections
Zhiwen Zheng ⋅ Hao Zhou ⋅ Huiyu Qi ⋅ Zhao Huang ⋅ Guangyuan Zhang ⋅ Shaowei Jiang ⋅ Wenwen Tang ⋅ Bin Yang ⋅ Jin Liu ⋅ Xiaoshuai Zhang ⋅ Xingru Huang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 627
VLM-Guided Group Preference Alignment for Diffusion-based Human Mesh Recovery
Wenhao Shen ⋅ Hao Wang ⋅ Wanqi Yin ⋅ Fayao Liu ⋅ Xulei Yang ⋅ Chao Liang ⋅ Zhongang Cai ⋅ Guosheng Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 628
Occluded Human Body Capture with Frequency Domain Denoising Prior
Buzhen Huang ⋅ Chongyang Xu ⋅ Wentao Tang ⋅ Yuan Shu ⋅ Jingyi Ju ⋅ Binghui Zuo ⋅ Yangang Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 629
ResiHMR: Residual-Limb Aware Single-Image 3D Human Mesh Recovery for Individuals with Limb Loss
Jiaying Ying ⋅ Heming Du ⋅ Kaihao Zhang ⋅ Sean M. Tweedy ⋅ Xin Yu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 630
OnlineHMR: Video-based Online World-Grounded Human Mesh Recovery
Yiwen Zhao ⋅ Ce Zheng ⋅ Yufu Wang ⋅ Hsueh-Han Daniel Yang ⋅ Liting Wen ⋅ László A. Jeni
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 631
MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer
Zenghao Chai ⋅ Chen Tang ⋅ Yongkang Wong ⋅ Xulei Yang ⋅ Mohan Kankanhalli
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 632
Exploring Adaptive Masked Reconstruction for Self-Supervised Skeleton-Based Action Recognition
Shengkai Sun ⋅ Zhiyong Cheng ⋅ Zefan Zhang ⋅ Jianfeng Dong ⋅ Zhihui Li ⋅ Meng Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 633
DFD-HR: Generalizable Deepfake Detection via Hierarchical Routing Learning
JIAMU SUN ⋅ Zhiyuan Yan ⋅ Ke-Yue Zhang ⋅ Taiping Yao ⋅ Shouhong Ding
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 634
MGDHand: Multi-Granularity Prior-to-Inertial Distillation Framework for Sequential 3D Hand Pose Estimation from Sparse IMUs
Xinyi Wang ⋅ Pengfei Ren ⋅ HaoYang ZHANG ⋅ Hanling Zhan ⋅ Yingxi Li ⋅ Liang Xie ⋅ Yue Gao ⋅ Erwei Yin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 635
CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction
Xianghui Xie ⋅ Bowen Wen ⋅ Yan Chang ⋅ Hesam Rabeti ⋅ Jiefeng Li ⋅ Ye Yuan ⋅ Gerard Pons-Moll ⋅ Stan Birchfield
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 636
E-3DPSM: A State Machine for Event-based Egocentric 3D Human Pose Estimation
Mayur Deshmukh ⋅ Hiroyasu Akada ⋅ Helge Rhodin ⋅ Christian Theobalt ⋅ Vladislav Golyanik
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 637
Bézier Degradation Modeling for LiDAR-based Human Motion Capture
Xiaoqi An ⋅ Lin Zhao ⋅ Jun Li ⋅ Chen Gong ⋅ Jian Yang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 638
UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass
Mengfei Li ⋅ Peng Li ⋅ Zheng Zhang ⋅ Jiahao Lu ⋅ Chengfeng Zhao ⋅ Wei Xue ⋅ Qifeng Liu ⋅ Sida Peng ⋅ Wenxiao ZHANG ⋅ Wenhan Luo ⋅ Yuan Liu ⋅ Yike Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 639
Illumination-Consistent Human-Scene Reconstruction from Monocular Video
Rongbin Zheng ⋅ Wensheng Li ⋅ Lingzhe Zeng ⋅ Dong Wang ⋅ Chengying Gao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 640
Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution
Hongsong Wang ⋅ Renxi Cheng ⋅ Chaolei Han ⋅ Jie Gui
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 641
Agent4FaceForgery: Multi-Agent LLM Framework for Realistic Face Forgery Detection
Yingxin Lai ⋅ Zitong YU ⋅ Jun Wang ⋅ Linlin Shen ⋅ Yong Xu ⋅ Xiaochun Cao
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 642
Enabling Supervised Learning of Generative Signatures for Generalized Synthetic Image Detection
Jianwei Fei ⋅ Yunshu Dai ⋅ Xiaoyu Zhou ⋅ Zhihua Xia ⋅ Alessandro Piva
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 643
DiffusionFF: A Diffusion-based Framework for Joint Face Forgery Detection and Fine-Grained Artifact Localization
Siran Peng ⋅ Haoyuan Zhang ⋅ Li Gao ⋅ Tianshuo Zhang ⋅ Xiangyu Zhu ⋅ Bao Li ⋅ Weisong Zhao ⋅ Zhen Lei
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 644
All in One: Unifying Deepfake Detection, Tampering Localization, and Source Tracing with a Robust Landmark-Identity Watermark
Junjiang Wu ⋅ Liejun Wang ⋅ Zhiqing Guo
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 645
Towards an Incremental Unified Multimodal Anomaly Detection: Augmenting Multimodal Denoising From an Information Bottleneck Perspective
Kaifang Long ⋅ Lianbo Ma ⋅ Jiaqi Liu ⋅ liming liu ⋅ Guoyang Xie
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 646
AG-VAS: Anchor-Guided Zero-Shot Visual Anomaly Segmentation with Large Multimodal Models
Zhen Qu ⋅ Xian Tao ⋅ Xiaoyi Bao ⋅ Dingrong Wang ⋅ ShiChen Qu ⋅ Zhengtao Zhang ⋅ Xingang Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 647
Dual-Prototype-Guided Multi-task Learning for Unsupervised Anomaly Detection and Classification
Qianhao Luo ⋅ Jiajia Mi ⋅ Mingtao Yan ⋅ JingSheng Liu ⋅ ShuYang Pang ⋅ Weiling Li
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 648
The Road Less Seen: Segment Exploration for Weakly Supervised Video Anomaly Detection
Anusha Achaya ⋅ Hitesh Sapkota ⋅ Qi Yu ⋅ Xumin Liu
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 649
Omni-AD: A Large-scale and Versatile Benchmark for Industrial Anomaly Detection
Dahu Shi ⋅ Chengshen He ⋅ Shaochen Zhang ⋅ Bo Qian ⋅ Xiaochen Quan ⋅ Wencong Zhang ⋅ Xing Wei
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 650
Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
Kaiqiang Li ⋅ Gang Li ⋅ Mingle Zhou ⋅ Min Li ⋅ Delong Han ⋅ Jin Wan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 651
Complementary Prototype Mapping for Efficient Multimodal Anomaly Detection
Yuan Zhao ⋅ Xiaoqin Zhang ⋅ Huchuan Lu ⋅ Lihe Zhang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 652
LiDAS: Lighting-driven Dynamic Active Sensing for Nighttime Perception
Simon de Moreau ⋅ Andrei Bursuc ⋅ Hafid EL IDRISSI ⋅ Fabien Moutarde
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 653
Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction
Chengxin Lv ⋅ Yihui Li ⋅ Hongyu Yang ⋅ Yunhong Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 654
OpenVO: Open-World Visual Odometry with Temporal Dynamics Awareness
Phuc Nguyen ⋅ Anh N Nhu ⋅ Ming C. Lin
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 655
An Instance-Centric Panoptic Occupancy Prediction Benchmark for Autonomous Driving
Yi Feng ⋅ Junwu E ⋅ Zizhan Guo ⋅ Yu Ma ⋅ Hanli Wang ⋅ Rui Fan
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 656
OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera
Hao Shi ⋅ Ze Wang ⋅ Shangwei Guo ⋅ Mengfei Duan ⋅ Song Wang ⋅ Teng Chen ⋅ Kailun Yang ⋅ Lin Wang ⋅ Kaiwei Wang
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 657
ProOOD: Prototype-Guided Out-of-Distribution 3D Occupancy Prediction
Yuheng Zhang ⋅ Mengfei Duan ⋅ Kunyu Peng ⋅ Yuhang Wang ⋅ Di Wen ⋅ Danda Paudel ⋅ Luc Van Gool ⋅ Kailun Yang
Poster Session
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F None
Poster Session 2 & Exhibit Hall w/ Coffee Break
Art Program
Fri Jun 05 04:00 PM -- 04:30 PM (PDT) @ ExHall F None
Art Gallery Tour with Curator and Artists
Luba Elliott
Break
Sat Jun 06 06:30 AM -- 08:00 AM (PDT) @ ExHall C None
Breakfast
Registration
Sat Jun 06 06:30 AM -- 04:00 PM (PDT) @ Lobby A None
Registration / Badge Pickup
Oral
Sat Jun 06 08:00 AM -- 08:12 AM (PDT) @ Four Seasons Ballroom None
ComPose: A Unified Completion-Pose Framework for Robust Category-Level Object Pose Estimation
Huan Ren ⋅ Yihan Chen ⋅ Chuxin Wang ⋅ Nailong Liu ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Oral
Sat Jun 06 08:00 AM -- 08:12 AM (PDT) @ Mile High Ballroom 1A - 2A None
3D-LATTE: Latent Space 3D Editing from Textual Instructions
Maria Parelli ⋅ Michael Oechsle ⋅ Michael Niemeyer ⋅ Federico Tombari ⋅ Andreas Geiger
Oral
Sat Jun 06 08:00 AM -- 08:12 AM (PDT) @ Mile High Ballroom 3A - 4A None
Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression
SHIYIN JIANG ⋅ Wei Long ⋅ Minghao Han ⋅ Zhenghao Chen ⋅ Ce Zhu ⋅ Shuhang Gu
Oral
Sat Jun 06 08:00 AM -- 08:12 AM (PDT) @ Bluebird Ballroom None
Breaking Semantic Boundaries: Distribution-Guided Semantic Exploration for Creative Generation
Fu Feng ⋅ Yucheng Xie ⋅ Ruixiao Shi ⋅ Xu Yang ⋅ Jing Wang ⋅ Xin Geng
Oral Session
Sat Jun 06 08:00 AM -- 09:15 AM (PDT) @ Four Seasons Ballroom None
Oral Session 3B: Spatial Understanding
Oral Session
Sat Jun 06 08:00 AM -- 09:15 AM (PDT) @ Mile High Ballroom 1A - 2A None
Oral Session 3C: Generative Editing
Oral Session
Sat Jun 06 08:00 AM -- 09:15 AM (PDT) @ Bluebird Ballroom None
Oral Session 3A: Generative Diffusion Modeling
Oral Session
Sat Jun 06 08:00 AM -- 09:15 AM (PDT) @ Mile High Ballroom 3A - 4A None
Oral Session 3D: Multimodal Modeling
Oral
Sat Jun 06 08:12 AM -- 08:25 AM (PDT) @ Mile High Ballroom 1A - 2A None
AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows
Zhenglin Zhou ⋅ Fan Ma ⋅ Chengzhuo Gui ⋅ Xiaobo Xia ⋅ Hehe Fan ⋅ Yi Yang ⋅ Tat-seng Chua
Oral
Sat Jun 06 08:12 AM -- 08:25 AM (PDT) @ Four Seasons Ballroom None
CoSMo3D: Open-World Promptable 3D Semantic Segmentation through LLM-Guided Canonical Spatial Modeling
Li Jin ⋅ Weikai Chen ⋅ Yujie Wang ⋅ Yingda Yin ⋅ Zeyu HU ⋅ Runze Zhang ⋅ Keyang Luo ⋅ Shengju Qian ⋅ Xin Wang ⋅ Xueying Qin
Oral
Sat Jun 06 08:12 AM -- 08:25 AM (PDT) @ Mile High Ballroom 3A - 4A None
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
Rui Xiao ⋅ Sanghwan Kim ⋅ Yongqin Xian ⋅ Zeynep Akata ⋅ Stephan Alaniz
Oral
Sat Jun 06 08:12 AM -- 08:25 AM (PDT) @ Bluebird Ballroom None
Guiding a Diffusion Model by Swapping Its Tokens
Weijia Zhang ⋅ Yuehao Liu ⋅ Shanyan Guan ⋅ Wu Ran ⋅ Yanhao Ge ⋅ Wei Li ⋅ Chao Ma
Oral
Sat Jun 06 08:25 AM -- 08:37 AM (PDT) @ Bluebird Ballroom None
PixelDiT: Pixel Diffusion Transformers for Image Generation
Yongsheng Yu ⋅ Wei Xiong ⋅ Weili Nie ⋅ Yichen Sheng ⋅ Shiqiu Liu ⋅ Jiebo Luo
Oral
Sat Jun 06 08:25 AM -- 08:37 AM (PDT) @ Mile High Ballroom 3A - 4A None
MDCS-MoAME: Multi-directional Composite Scanning with Mixture of Attention and Mamba Experts for Cancer Survival Prediction
Linjie Qu ⋅ Jin Xiao ⋅ Xiangrong Liu ⋅ Changming Sun ⋅ Hui Cui ⋅ Yuqi Fang ⋅ Ran Su ⋅ Qiangguo Jin ⋅ leyi wei
Oral
Sat Jun 06 08:25 AM -- 08:37 AM (PDT) @ Mile High Ballroom 1A - 2A None
ChordEdit: One-Step Low-Energy Transport for Image Editing
Liangsi Lu ⋅ Xuhang Chen ⋅ Minzhe Guo ⋅ Shichu Li ⋅ Jingchao Wang ⋅ Yang Shi
Oral
Sat Jun 06 08:25 AM -- 08:37 AM (PDT) @ Four Seasons Ballroom None
GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding
Peirong Zhang ⋅ Yidan Zhang ⋅ Luxiao Xu ⋅ Jinliang Lin ⋅ Zonghao Guo ⋅ Fengxiang Wang ⋅ Xue Yang ⋅ Kaiwen Wei ⋅ Lei Wang
Oral
Sat Jun 06 08:37 AM -- 08:50 AM (PDT) @ Mile High Ballroom 1A - 2A None
Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Yihao Luo ⋅ Xianglong He ⋅ Chuanyu Pan ⋅ Yiwen Chen ⋅ Jiaqi Wu ⋅ Yangguang Li ⋅ Wanli Ouyang ⋅ Yuanming Hu ⋅ Guang Yang ⋅ Choon Hwai Yap
Oral
Sat Jun 06 08:37 AM -- 08:50 AM (PDT) @ Four Seasons Ballroom None
RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
Haiyang Mei ⋅ Qiming Huang ⋅ Hai Ci ⋅ Mike Zheng Shou
Oral
Sat Jun 06 08:37 AM -- 08:50 AM (PDT) @ Bluebird Ballroom None
SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models
Jiwoo Chung ⋅ Sangeek Hyun ⋅ MinKyu Lee ⋅ Byeongju Han ⋅ Geonho Cha ⋅ Dongyoon Wee ⋅ Youngjun Hong ⋅ Jae-Pil Heo
Oral
Sat Jun 06 08:37 AM -- 08:50 AM (PDT) @ Mile High Ballroom 3A - 4A None
PAS: A Training-Free Stabilizer for Temporal Encoding in Video LLMs
Bowen Sun ⋅ Yujun Cai ⋅ Ming-Hsuan Yang ⋅ Hang Wu ⋅ Yiwei Wang
Oral
Sat Jun 06 08:50 AM -- 09:02 AM (PDT) @ Mile High Ballroom 1A - 2A None
Native and Compact Structured Latents for 3D Generation
Jianfeng XIANG ⋅ Xiaoxue Chen ⋅ Sicheng Xu ⋅ Ruicheng Wang ⋅ Zelong Lv ⋅ Yu Deng ⋅ Hongyuan Zhu ⋅ Yue Dong ⋅ Hao Zhao ⋅ Nicholas Jing Yuan ⋅ Jiaolong Yang
Oral
Sat Jun 06 08:50 AM -- 09:02 AM (PDT) @ Mile High Ballroom 3A - 4A None
PAVAS: Physics-Aware Video-to-Audio Synthesis
Oh Hyun-Bin ⋅ Yuhta Takida ⋅ Toshimitsu Uesaka ⋅ Tae-Hyun Oh ⋅ Yuki Mitsufuji
Oral
Sat Jun 06 08:50 AM -- 09:02 AM (PDT) @ Bluebird Ballroom None
SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
Yasaman Haghighi ⋅ Alex Alahi
Oral
Sat Jun 06 08:50 AM -- 09:02 AM (PDT) @ Four Seasons Ballroom None
S^2AM3D: Scale-controllable Part Segmentation of 3D Point Clouds
Han Su ⋅ Tianyu Huang ⋅ Zichen Wan ⋅ Xiaohe Wu ⋅ Wangmeng Zuo
Oral
Sat Jun 06 09:02 AM -- 09:15 AM (PDT) @ Bluebird Ballroom None
Streaming Diffusion Model for Fast Infrared and Visible Video Fusion
Jinyuan Liu ⋅ Ludan Sun ⋅ Tengyu Ma ⋅ Chunyan Yang ⋅ Zhiying Jiang ⋅ Long Ma ⋅ Risheng Liu ⋅ Xin Fan
Oral
Sat Jun 06 09:02 AM -- 09:15 AM (PDT) @ Mile High Ballroom 3A - 4A None
ProPhy: Progressive Physical Alignment for Dynamic World Simulation
Zijun Wang ⋅ Panwen Hu ⋅ Jing Wang ⋅ Terry Jingchen Zhang ⋅ Yuhao Cheng ⋅ Long Chen ⋅ Yiqiang Yan ⋅ Zutao Jiang ⋅ Hanhui Li ⋅ Xiaodan Liang
Oral
Sat Jun 06 09:02 AM -- 09:15 AM (PDT) @ Four Seasons Ballroom None
Scalable Multi-View Subspace Clustering with Tensorized Anchor Guidance
Miao Jia ⋅ Xingchen Hu ⋅ Jiyuan Liu ⋅ Siwei Wang ⋅ Min Wang ⋅ Zijian Chen
Oral
Sat Jun 06 09:02 AM -- 09:15 AM (PDT) @ Mile High Ballroom 1A - 2A None
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
Arman Zarei ⋅ Samyadeep Basu ⋅ Mobina Pournemat ⋅ Sayan Nag ⋅ Ryan A. Rossi ⋅ Soheil Feizi
Break
Sat Jun 06 09:15 AM -- 09:30 AM (PDT) None
Courtesy Break
Keynote
Sat Jun 06 09:30 AM -- 10:30 AM (PDT) @ Bluebird Ballroom None
Transforming Computing with Quantum-Centric Supercomputing
Jerry Chow
Poster Setup
Sat Jun 06 10:15 AM -- 10:45 AM (PDT) @ ExHall A None
Poster Setup
Demonstration
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F None
Demos
Doctoral Consortium
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ 207 None
Doctoral Consortium (By invitation only)
Paola Cascante-Bonilla ⋅ Abby Stylianou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 1
Breaking Semantic Boundaries: Distribution-Guided Semantic Exploration for Creative Generation
Fu Feng ⋅ Yucheng Xie ⋅ Ruixiao Shi ⋅ Xu Yang ⋅ Jing Wang ⋅ Xin Geng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 2
Guiding a Diffusion Model by Swapping Its Tokens
Weijia Zhang ⋅ Yuehao Liu ⋅ Shanyan Guan ⋅ Wu Ran ⋅ Yanhao Ge ⋅ Wei Li ⋅ Chao Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 3
PixelDiT: Pixel Diffusion Transformers for Image Generation
Yongsheng Yu ⋅ Wei Xiong ⋅ Weili Nie ⋅ Yichen Sheng ⋅ Shiqiu Liu ⋅ Jiebo Luo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 4
SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models
Jiwoo Chung ⋅ Sangeek Hyun ⋅ MinKyu Lee ⋅ Byeongju Han ⋅ Geonho Cha ⋅ Dongyoon Wee ⋅ Youngjun Hong ⋅ Jae-Pil Heo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 5
SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
Yasaman Haghighi ⋅ Alex Alahi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 6
Streaming Diffusion Model for Fast Infrared and Visible Video Fusion
Jinyuan Liu ⋅ Ludan Sun ⋅ Tengyu Ma ⋅ Chunyan Yang ⋅ Zhiying Jiang ⋅ Long Ma ⋅ Risheng Liu ⋅ Xin Fan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 7
ComPose: A Unified Completion-Pose Framework for Robust Category-Level Object Pose Estimation
Huan Ren ⋅ Yihan Chen ⋅ Chuxin Wang ⋅ Nailong Liu ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 8
CoSMo3D: Open-World Promptable 3D Semantic Segmentation through LLM-Guided Canonical Spatial Modeling
Li Jin ⋅ Weikai Chen ⋅ Yujie Wang ⋅ Yingda Yin ⋅ Zeyu HU ⋅ Runze Zhang ⋅ Keyang Luo ⋅ Shengju Qian ⋅ Xin Wang ⋅ Xueying Qin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 9
GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding
Peirong Zhang ⋅ Yidan Zhang ⋅ Luxiao Xu ⋅ Jinliang Lin ⋅ Zonghao Guo ⋅ Fengxiang Wang ⋅ Xue Yang ⋅ Kaiwen Wei ⋅ Lei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 10
RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
Haiyang Mei ⋅ Qiming Huang ⋅ Hai Ci ⋅ Mike Zheng Shou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 11
S^2AM3D: Scale-controllable Part Segmentation of 3D Point Clouds
Han Su ⋅ Tianyu Huang ⋅ Zichen Wan ⋅ Xiaohe Wu ⋅ Wangmeng Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 12
Scalable Multi-View Subspace Clustering with Tensorized Anchor Guidance
Miao Jia ⋅ Xingchen Hu ⋅ Jiyuan Liu ⋅ Siwei Wang ⋅ Min Wang ⋅ Zijian Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 13
3D-LATTE: Latent Space 3D Editing from Textual Instructions
Maria Parelli ⋅ Michael Oechsle ⋅ Michael Niemeyer ⋅ Federico Tombari ⋅ Andreas Geiger
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 14
AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows
Zhenglin Zhou ⋅ Fan Ma ⋅ Chengzhuo Gui ⋅ Xiaobo Xia ⋅ Hehe Fan ⋅ Yi Yang ⋅ Tat-seng Chua
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 15
ChordEdit: One-Step Low-Energy Transport for Image Editing
Liangsi Lu ⋅ Xuhang Chen ⋅ Minzhe Guo ⋅ Shichu Li ⋅ Jingchao Wang ⋅ Yang Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 16
Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Yihao Luo ⋅ Xianglong He ⋅ Chuanyu Pan ⋅ Yiwen Chen ⋅ Jiaqi Wu ⋅ Yangguang Li ⋅ Wanli Ouyang ⋅ Yuanming Hu ⋅ Guang Yang ⋅ Choon Hwai Yap
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 17
Native and Compact Structured Latents for 3D Generation
Jianfeng XIANG ⋅ Xiaoxue Chen ⋅ Sicheng Xu ⋅ Ruicheng Wang ⋅ Zelong Lv ⋅ Yu Deng ⋅ Hongyuan Zhu ⋅ Yue Dong ⋅ Hao Zhao ⋅ Nicholas Jing Yuan ⋅ Jiaolong Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 18
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
Arman Zarei ⋅ Samyadeep Basu ⋅ Mobina Pournemat ⋅ Sayan Nag ⋅ Ryan A. Rossi ⋅ Soheil Feizi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 19
Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression
SHIYIN JIANG ⋅ Wei Long ⋅ Minghao Han ⋅ Zhenghao Chen ⋅ Ce Zhu ⋅ Shuhang Gu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 20
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
Rui Xiao ⋅ Sanghwan Kim ⋅ Yongqin Xian ⋅ Zeynep Akata ⋅ Stephan Alaniz
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 21
MDCS-MoAME: Multi-directional Composite Scanning with Mixture of Attention and Mamba Experts for Cancer Survival Prediction
Linjie Qu ⋅ Jin Xiao ⋅ Xiangrong Liu ⋅ Changming Sun ⋅ Hui Cui ⋅ Yuqi Fang ⋅ Ran Su ⋅ Qiangguo Jin ⋅ leyi wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 22
PAS: A Training-Free Stabilizer for Temporal Encoding in Video LLMs
Bowen Sun ⋅ Yujun Cai ⋅ Ming-Hsuan Yang ⋅ Hang Wu ⋅ Yiwei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 23
PAVAS: Physics-Aware Video-to-Audio Synthesis
Oh Hyun-Bin ⋅ Yuhta Takida ⋅ Toshimitsu Uesaka ⋅ Tae-Hyun Oh ⋅ Yuki Mitsufuji
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 24
ProPhy: Progressive Physical Alignment for Dynamic World Simulation
Zijun Wang ⋅ Panwen Hu ⋅ Jing Wang ⋅ Terry Jingchen Zhang ⋅ Yuhao Cheng ⋅ Long Chen ⋅ Yiqiang Yan ⋅ Zutao Jiang ⋅ Hanhui Li ⋅ Xiaodan Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 25
V-DPM: 4D Video Reconstruction with Dynamic Point Maps
Edgar Sucar ⋅ Eldar Insafutdinov ⋅ Zihang Lai ⋅ Andrea Vedaldi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 26
Registration-Free Learnable Multi-View Capture of Faces in Dense Semantic Correspondence
Panagiotis P. Filntisis ⋅ George Retsinas ⋅ Radek Daněček ⋅ Vanessa Sklyarova ⋅ Petros Maragos ⋅ Timo Bolkart
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 27
Mesh4D: 4D Mesh Reconstruction and Tracking from Monocular Video
Zeren Jiang ⋅ Chuanxia Zheng ⋅ Iro Laina ⋅ Diane Larlus ⋅ Andrea Vedaldi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 28
SPE-MVS: Spatial Position Encoding Enhanced Multi-View Stereo with Monocular Depth Priors
Shaoqian Wang ⋅ Jiadai Sun ⋅ Bosen Hou ⋅ Qiang Wang ⋅ Bin Fan ⋅ Bo Li ⋅ Bin Lu ⋅ Yuchao Dai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 29
Block-Sparse Global Attention for Efficient Multi-View Geometry Transformers
Chung-Shien Brian Wang ⋅ Christian Schmidt ⋅ Jens Piekenbrinck ⋅ Bastian Leibe
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 30
SMVRT: Implicit Human 3D Modeling Using Sparse Multi-View Volumetric Reconstruction with Transformer Fusion
Chuanmao Fan ⋅ Chenxi Zhao ⋅ Ye Duan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 31
LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving
Qihao Sun ⋅ Jiarun Liu ⋅ Ziqian Ni ⋅ Jianyun Xu ⋅ Sheng Yang ⋅ Tao Xie ⋅ lijun zhao ⋅ Ruifeng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 32
Any4D: Unified Feed-Forward Metric 4D Reconstruction
Jay Karhade ⋅ Nikhil Keetha ⋅ Yuchen Zhang ⋅ Tanisha Gupta ⋅ Akash Sharma ⋅ Sebastian Scherer ⋅ Deva Ramanan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 33
Co-Me: Confidence Guided Token Merging for Visual Geometric Transformers
Yutian Chen ⋅ Yuheng Qiu ⋅ Ruogu Li ⋅ Jay Patrikar ⋅ Sebastian Scherer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 34
Point4Cast: Streaming Dynamic Scene Reconstruction and Forecasting
Xinhang Liu ⋅ Pedro Miraldo ⋅ Suhas Lohit ⋅ Huaizu Jiang ⋅ Naoko Sawada ⋅ Yu-Wing Tai ⋅ Chi-Keung Tang ⋅ Moitreya Chatterjee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 35
AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend
Hengyi Wang ⋅ Lourdes Agapito
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 36
AlignPose: Generalizable 6D Pose Estimation via Multi-view Feature-metric Alignment
Anna Šárová Mikeštíková ⋅ Médéric Fourmy ⋅ Martin Cífka ⋅ Josef Sivic ⋅ Vladimir Petrik
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 37
Parallelised Differentiable Straightest Geodesics for 3D Meshes
Hippolyte Verninas ⋅ Caner Korkmaz ⋅ Stefanos Zafeiriou ⋅ Tolga Birdal ⋅ Simone Foti
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 38
Geometry-Aligned and Anomaly-Aware Reconstruction for 3D Anomaly Detection
linchun wu ⋅ Qin Zou ⋅ Yuanhao Yue ⋅ Zhongyuan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 39
DVGT: Driving Visual Geometry Transformer
Sicheng Zuo ⋅ Zixun Xie ⋅ Wenzhao Zheng ⋅ Shaoqing Xu ⋅ Fang Li ⋅ Shengyin Jiang ⋅ Long Chen ⋅ Zhi-xin Yang ⋅ Jiwen Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 40
FMPose3D: monocular 3D pose estimation via flow matching
Ti Wang ⋅ Xiaohang Yu ⋅ Mackenzie Weygandt Mathis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 41
MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts
Jingnan Gao ⋅ Zhe Wang ⋅ Xianze Fang ⋅ Xingyu Ren ⋅ Zhuo Chen ⋅ Shengqi Liu ⋅ Yuhao Cheng ⋅ Jiangjing Lyu ⋅ Xiaokang Yang ⋅ Yichao Yan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 42
Foundation Encoders Are All You Need for Preference-Aware Personalization
Hyungjin Kim ⋅ Seokho Ahn ⋅ Young-Duk Seo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 43
Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
Chuancheng Shi ⋅ Shangze Li ⋅ Shiming Guo ⋅ Simiao Xie ⋅ Wenhua Wu ⋅ Jingtong Dou ⋅ Chao Wu ⋅ Canran Xiao ⋅ Cong Wang ⋅ Zifeng Cheng ⋅ Fei Shen ⋅ Tat-seng Chua
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 44
ThinkGen: Generalized Thinking for Visual Generation
Siyu Jiao ⋅ Yiheng Lin ⋅ Yujie Zhong ⋅ Qi She ⋅ Wei zhou ⋅ Xiaohan Lan ⋅ Zilong Huang ⋅ Fei Yu ⋅ Yingchen Yu ⋅ Yunqing Zhao ⋅ Yao Zhao ⋅ Yunchao Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 45
CoLoGen: Progressive Learning of Concept–Localization Duality for Unified Image Generation
YuXin Song ⋅ Yu Lu ⋅ Haoyuan Sun ⋅ Huanjin Yao ⋅ Fanglong Liu ⋅ Yifan Sun ⋅ Haocheng Feng ⋅ Hang Zhou ⋅ Jingdong Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 46
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes
Jing Tan ⋅ Zhaoyang Zhang ⋅ Yantao Shen ⋅ Jiarui Cai ⋅ Shuo Yang ⋅ Jiajun Wu ⋅ Wei Xia ⋅ Zhuowen Tu ⋅ Stefano Soatto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 47
When Safety Collides: Resolving Multi-Category Harmful Conflicts in Text-to-Image Diffusion via Adaptive Safety Guidance
Yongli Xiang ⋅ Ziming Hong ⋅ Zhaoqing Wang ⋅ Xiangyu Zhao ⋅ Bo Han ⋅ Tongliang Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 48
PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards
Shulei Wang ⋅ Longhui Wei ⋅ XIN HE ⋅ Jianbo Ouyang ⋅ Hui Lu ⋅ Zhou Zhao ⋅ Qi Tian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 49
HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation
Xiang Wang ⋅ Zhifei Zhang ⋅ He Zhang ⋅ Zhe Lin ⋅ Yuqian Zhou ⋅ Qing Liu ⋅ Shiwei Zhang ⋅ Yijun Li ⋅ Shaoteng Liu ⋅ Haitian Zheng ⋅ Jason Kuen ⋅ Yuehuan Wang ⋅ Changxin Gao ⋅ Nong Sang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 50
Multimodal Semantic Bias Mitigation for Diverse Text-To-3D Generation
Yukuan Min ⋅ Muli Yang ⋅ Jinhao Zhang ⋅ Yuxuan Wang ⋅ Yihang Zhu ⋅ Jiexi Yan ⋅ Cheng Deng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 51
Visual Personalization Turing Test
Rameen Abdal ⋅ James Burgess ⋅ Sergey Tulyakov ⋅ Kuan-Chieh Jackson Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 52
Composing Concepts from Images and Videos via Concept-prompt Binding
Xianghao Kong ⋅ Zeyu Zhang ⋅ Yuwei Guo ⋅ Zhuoran ZHAO ⋅ Songchun Zhang ⋅ Anyi Rao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 53
Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation
Shihan Cheng ⋅ Nilesh Kulkarni ⋅ David Hyde ⋅ Dmitriy Smirnov
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 54
Semantic Derivative Flow: Graph-Guided Diffusion for Controllable Instance Interactions
Shibin Mei ⋅ Hang Wang ⋅ Bingbing Ni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 55
Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards
Seungwook Kim ⋅ Minsu Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 56
Hierarchical Enhancement of Semantic Priors for Disentangled Text-Driven Motion Generation
Wenhan Lv ⋅ Shaopan Wang ⋅ Xiangyu Wu ⋅ Tianchu Hang ⋅ Zhongquan Jian ⋅ Qingqiang Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 57
Simpleposter: A Simple Baseline For Product Poster Generation
Benlei Cui ⋅ Fangao Zeng ⋅ Weitao Jiang ⋅ Yuwen Zhai ⋅ Haiwen Hong ⋅ Longtao Huang ⋅ Hui Xue ⋅ Wenxiang Shang ⋅ Pipei Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 58
Prompt Yourself: Awakening Textual Semantics in 1D Visual Tokenizers
hualiang wang ⋅ Siming Fu ⋅ Weinan Jia ⋅ Yuning Lu ⋅ Mu Liu ⋅ Jidong Jiang ⋅ Xiaomeng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 59
SkyReels-Text: Fine-Grained Font-Controllable Text Editing for Poster Design
Yunjie Yu ⋅ Jingchen Wu ⋅ Junchen Zhu ⋅ Chunze Lin ⋅ Guibin Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 60
Image Generation from Contextually-Contradictory Prompts
Saar Huberman ⋅ Or Patashnik ⋅ Omer Dahary ⋅ Ron Mokady ⋅ Daniel Cohen-Or
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 61
PromptEnhancer: Taming Your Rewriter for Text-to-Image Generation via Fine-Grained Reward
Linqing Wang ⋅ zhiyong xu ⋅ XiMing Xing ⋅ YIJI CHENG ⋅ Zhiyuan Zhao ⋅ Donghao Li ⋅ Tiankai Hang ⋅ Zhenxi Li ⋅ Jiale Tao ⋅ Qixun Wang ⋅ Ruihuang Li ⋅ Comi Chen ⋅ Xin LI ⋅ Mingrui Wu ⋅ Xinchi Deng ⋅ Shuyang Gu ⋅ Chunyu Wang ⋅ qinglin lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 62
Aligning Text, Images and 3D Structure Token-by-Token
Aadarsh Sahoo ⋅ Vansh Tibrewal ⋅ Georgia Gkioxari
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 63
RefTon: Reference person shot assist virtual Try-on
Liuzhuozheng Li ⋅ Yue Gong ⋅ Shanyuan Liu ⋅ Zanyi Wang ⋅ Dengyang Jiang ⋅ Liebucha Wu ⋅ Bo Cheng ⋅ Yuhang Ma ⋅ Dawei Leng ⋅ Yuhui Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 64
GaussianVision: Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting
Yasmine Omri ⋅ Connor Ding ⋅ Tsachy Weissman ⋅ Thierry Tambe
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 65
Copy-Transform-Paste: Zero-Shot Object-Object Alignment Guided by Vision-Language and Geometric Constraints
Rotem Gatenyo ⋅ Ohad Fried
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 66
Gravitation-Driven Semantic Alignment for Text Video Retrieval
Yi YANG ⋅ Zheng Wang ⋅ Xing Xu ⋅ Jingkuan Song ⋅ Heng Tao Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 67
MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models
Dohwan Ko ⋅ Jinyoung Park ⋅ Seoung Choi ⋅ Sanghyeok Lee ⋅ Seohyun Lee ⋅ Hyunwoo J. Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 68
M^3KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation
Hyeongcheol Park ⋅ Jiyoung Seo ⋅ Jaewon Mun ⋅ Hogun Park ⋅ Wonmin Byeon ⋅ Sung June Kim ⋅ Hyeonsoo Im ⋅ JeungSub Lee ⋅ Sangpil Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 69
Evolutionary Multimodal Reasoning via Hierarchical Semantic Representation for Intent Recognition
Qianrui Zhou ⋅ Hua Xu ⋅ Yunjin Gu ⋅ Yifan Wang ⋅ Songze Li ⋅ Hanlei Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 70
ReFAct: Empowering Multimodal Web Agents with Visual and Context Focusing
Rui Wu ⋅ Shuo Zhang ⋅ Xiaoxuan Tang ⋅ Ruirui Zhang ⋅ Yi Liu ⋅ Tao Jiang ⋅ Wenhao Xu ⋅ Yong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 71
PersonaVLM: Long-Term Personalized Multimodal LLMs
Chang Nie ⋅ Chaoyou Fu ⋅ Yi-Fan Zhang ⋅ Haihua Yang ⋅ Caifeng Shan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 72
MR-RAG: Multimodal Relevance-Aware Retrieval-Augmented Generation for Medical Visual Question Answering
Xuze Li ⋅ Haozhao Wang ⋅ Zhenyu Huang ⋅ Zhongxu Wang ⋅ Zhang Jinghua ⋅ Ruixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 73
Decoupling Stability and Plasticity for Multi-Modal Test-Time Adaptation
Yongbo He ⋅ Zirun Guo ⋅ Tao Jin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 74
CUE: Concept-Aware Multi-Label Expansion to Mitigate Concept Confusion in Long-Tailed Learning
Ruichi Zhang ⋅ Chikai Shang ⋅ jiacheng yang ⋅ Mengke Li ⋅ Yang Zhou ⋅ Junlong Gao ⋅ Yang Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 75
Energy Waveify and Redistribution for Test-Time Adaptation: A Control System Perspective
Zhenbin Wang ⋅ Lei Zhang ⋅ Lituan Wang ⋅ Zhenwei Zhang ⋅ Guangwu Qian ⋅ Yan Wang ⋅ Wei Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 76
CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection
Youngjun Song ⋅ Hyeongyu Kim ⋅ Dosik Hwang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 77
CoFiDA-M: Concept-Aware Feature Modulation for Cross-Domain Adaptation with Image-Only Inference
Nurjahan Sultana ⋅ Moi Hoon Yap ⋅ Xinqi Fan ⋅ Wenqi Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 78
Towards Multimodal Domain Generalization with Few Labels
Hongzhao Li ⋅ Hao Dong ⋅ Hualei Wan ⋅ Shupan Li ⋅ Mingliang Xu ⋅ Muhammad Haris Khan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 79
Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning
ZHENYU ZHANG ⋅ Guangyao Chen ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 80
Event6D: Event-based Novel Object 6D Pose Tracking
Jae-Young Kang ⋅ Hoonhee Cho ⋅ Taeyeop Lee ⋅ Minjun Kang ⋅ Bowen Wen ⋅ Youngho Kim ⋅ Kuk-Jin Yoon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 81
EV-CGNet: Co-visible Focused 3D-guided 2D Event Keypoint Detection Network
Yuan Gao ⋅ Tianle Ding ⋅ Yuqing Zhu ⋅ Tianzhu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 82
AE2VID: Event-based Video Reconstruction via Aperture Modulation
Chenxu Bai ⋅ Boyu Li ⋅ Peiqi Duan ⋅ xinyu zhou ⋅ Hanyue Lou ⋅ Boxin Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 83
From Contrast to Consistency: Rethinking Event-based Continuous-Time Optical Flow Estimation
rui hu ⋅ Song Wu ⋅ Wen Yang ⋅ Jinjian Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 84
Spike-driven Discrete Aggregation for Event-based Object Detection
Huaning Li ⋅ Ziming Wang ⋅ Runhao Jiang ⋅ Yan Rui ⋅ Huajin Tang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 85
x^2-Fusion: Cross-Modality and Cross-Dimension Flow Estimation in Event Edge Space
Ruishan Guo ⋅ Ciyu Ruan ⋅ Haoyang Wang ⋅ Zihang GONG ⋅ Jingao Xu ⋅ Xinlei Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 86
FloVerse: Floor Plan-Guided Multi-Modal Navigation
weiqi Huang ⋅ Shuangyi Dong ⋅ Jiaxin Li ⋅ Yifei Guo ⋅ Zan Wang ⋅ Wei Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 87
TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation
Yiyao Wang ⋅ Sixian Zhang ⋅ Keming Zhang ⋅ Xinhang Song ⋅ Songjie Du ⋅ Shuqiang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 88
History to Future: Evolving Agent with Experience and Thought for Zero-shot Vision-and-Language Navigation
Guangzhao Dai ⋅ Shuo Wang ⋅ Zihan Wang ⋅ Guo-Sen Xie ⋅ Yang Yang ⋅ Jinshan Pan ⋅ Qianru Sun ⋅ Xiangbo Shu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 89
DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration
Jinzhou Tang ⋅ Fan Feng ⋅ Minghao Fu ⋅ Wenjun Lin ⋅ Jing Yang ⋅ Biwei Huang ⋅ Keze Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 90
Beyond Scanpaths: Graph-Based Gaze Simulation in Dynamic Scenes
Luke Palmer ⋅ Petar Palasek ⋅ Hazem Abdelkawy
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 91
CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning
Zhenquan Yao ⋅ Zitong Huang ⋅ yihan zeng ⋅ Jianhua Han ⋅ Hang Xu ⋅ Chun-Mei Feng ⋅ Jianwei Ma ⋅ Wangmeng Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 92
Rethinking Visual Rearrangement from A Diffusion Perspective
Tianliang Qi ⋅ Xinhang Song ⋅ Yuyi Liu ⋅ Shuqiang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 93
APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation
Daoxuan Zhang ⋅ Ping Chen ⋅ Xiaobo Xia ⋅ Xiu Su ⋅ Ruichen Zhen ⋅ Jianqiang Xiao ⋅ Shuo Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 94
Bridging the 2D-3D Gap: A Hierarchical Semantic-Geometric Map for Vision Language Navigation
Kailing Li ⋅ Tianwen Qian ⋅ Lijin Yang ⋅ Yuqian Fu ⋅ Jingyu Gong ⋅ Xiaoling Wang ⋅ Liang He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 95
InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs
Bin Li ⋅ Ruichi Zhang ⋅ Han Liang ⋅ Jingyan Zhang ⋅ Juze Zhang ⋅ Xin Chen ⋅ Lan Xu ⋅ Jingyi Yu ⋅ Jingya Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 96
When Robots Should Say "I Don’t Know": Benchmarking Abstention in Embodied Question Answering
Tao Wu ⋅ Chuhao Zhou ⋅ Guangyu Zhao ⋅ Haozhi Cao ⋅ Yewen Pu ⋅ Jianfei Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 97
RoboAgent: Chaining Basic Capabilities for Embodied Task Planning
Peiran Xu ⋅ Jiaqi Zheng ⋅ Yadong Mu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 98
Towards Training-free Scene Text Editing
Yubo Li ⋅ Xugong Qin ⋅ peng zhang ⋅ Hailun Lin ⋅ Gangyan Zeng ⋅ Kexin Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 99
VINS-120K: Ultra High-Resolution Image Editing with A Large-Scale Dataset
Zhizhou Chen ⋅ Shanyan Guan ⋅ Zhanxin Gao ⋅ En Ci ⋅ Yanhao Ge ⋅ Wei Li ⋅ Zhenyu Zhang ⋅ Jian Yang ⋅ Ying Tai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 100
ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding
Shuo Cao ⋅ Nan Ma ⋅ Jiayang Li ⋅ Xiaohui Li ⋅ Lihao Shao ⋅ Kaiwen Zhu ⋅ Yu Zhou ⋅ Yuandong Pu ⋅ Jiarui Wu ⋅ Jiaquan Wang ⋅ Bo Qu ⋅ Wenhai Wang ⋅ Yu Qiao ⋅ Dajuin Yao ⋅ Yihao Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 101
Charge: A Comprehensive Novel View Synthesis Benchmark and Dataset to Bind Them All
Michal Nazarczuk ⋅ Thomas Tanay ⋅ Arthur Moreau ⋅ Zhensong Zhang ⋅ Eduardo Pérez-Pellitero
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 102
Region-Wise Correspondence Prediction between Manga Line Art Images
Yingxuan Li ⋅ Jiafeng Mao ⋅ Qianru Qiu ⋅ Yusuke Matsui
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 103
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Wei Chow ⋅ Jiachun Pan ⋅ Yongyuan Liang ⋅ Mingze Zhou ⋅ Xue Song ⋅ Liyu Jia ⋅ Saining Zhang ⋅ Siliang Tang ⋅ Juncheng Li ⋅ Fengda Zhang ⋅ Weijia Wu ⋅ Hanwang Zhang ⋅ Tat-seng Chua
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 104
I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models
Juntong Wang ⋅ Wang Jiarui ⋅ Huiyu Duan ⋅ Jiaxiang Kang ⋅ Guangtao Zhai ⋅ Xiongkuo Min
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 105
TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens
Jiawei Ren ⋅ Michal Tyszkiewicz ⋅ Jiahui Huang ⋅ Žan Gojčič
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 106
Hermite Radial Basis Function for Surface Reconstruction via Differentiable Rendering
Hugo Blanc ⋅ Jean-Emmanuel Deschaud ⋅ Alexis Paljic
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 107
RF4D: Neural Radar Fields for Novel View Synthesis in Outdoor Dynamic Scenes
Jiarui Zhang ⋅ Zhihao Li ⋅ Chong Wang ⋅ Bihan Wen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 108
Voxify3D: Pixel Art Meets Volumetric Rendering
Yi-Chuan Huang ⋅ Jiewen Chan ⋅ Hao-Jen Chien ⋅ Yu-Lun Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 109
Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs
Hiran Sarkar ⋅ Liming Kuang ⋅ Yordanka Velikova ⋅ Benjamin Busam
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 110
FluidGaussian: Propagating Simulation-Based Uncertainty Toward Functionally-Intelligent 3D Reconstruction
Yuqiu Liu ⋅ Jialin Song ⋅ Marissa Ramirez de Chanlatte ⋅ Rochishnu Chowdhury ⋅ Rushil Paresh Desai ⋅ Wuyang Chen ⋅ Daniel Martin ⋅ Michael Mahoney
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 111
GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
Liyuan Zhu ⋅ Manjunath Narayana ⋅ Michal Stary ⋅ Will Hutchcroft ⋅ Gordon Wetzstein ⋅ Iro Armeni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 112
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis
Stanislaw Szymanowicz ⋅ Minghao Chen ⋅ Jianyuan Wang ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 113
Turbo-GS: Accelerating 3D Gaussian Fitting for High-Resolution Radiance Fields
Ankit Dhiman ⋅ Tao Lu ⋅ Srinath Ravi ⋅ Emre Arslan ⋅ Angela Xing ⋅ Yuanbo Xiangli ⋅ R. Venkatesh Babu ⋅ Srinath Sridhar
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 114
BiProLoRA: Bilevel Prompt LoRA for Real Scene Recovery
Nan An ⋅ Long Ma ⋅ Tengyu Ma ⋅ Zhu Liu ⋅ Yingchi Liu ⋅ Risheng Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 115
Degradation-Consistent Test-Time Adaptation for All-in-One Image Restoration
Ni Tang ⋅ Shenghao nie ⋅ Xiaotong Luo ⋅ Yuan Xie ⋅ Yanyun Qu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 116
CanonCGT: Reference-Based Color Grading via Canonical Pivot Representation
JINWON KO ⋅ Keunsoo Ko ⋅ Chang-Su Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 117
2-Shots in the Dark: Low-Light Denoising with Minimal Data Acquisition
Liying Lu ⋅ Raphael Achddou ⋅ Sabine Süsstrunk
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 118
Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration
I-Hsiang (Aaron) Chen ⋅ Isma Hadji ⋅ Enrique Sanchez ⋅ Adrian Bulat ⋅ Sy-Yen Kuo ⋅ Radu Timofte ⋅ Georgios Tzimiropoulos ⋅ Brais Martinez
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 119
It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal
lishen qu ⋅ Shihao Zhou ⋅ Jie Liang ⋅ Hui Zeng ⋅ Lei Zhang ⋅ Jufeng Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 120
Scan Clusters, Not Pixels: A Cluster-Centric Paradigm for Efficient Ultra-high-definition Image Restoration
Chen Wu ⋅ Ling Wang ⋅ Zhuoran Zheng ⋅ Yuning Cui ⋅ Zhixiong Yang ⋅ Xiangyu Chen ⋅ Yue Zhang ⋅ Weidong Jiang ⋅ Jingyuan Xia
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 121
Seeing Beyond 8bits: Subjective and Objective Quality Assessment of HDR-UGC Videos
SHRESHTH SAINI ⋅ Bowen Chen ⋅ Yilin Wang ⋅ Neil Birkbeck ⋅ Balu Adsumilli ⋅ Alan C.
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 122
Dynamic Exposure Burst Image Restoration
Woohyeok Kim ⋅ Jaesung Rim ⋅ Daeyeon Kim ⋅ Sunghyun Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 123
FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration
Jingren Liu ⋅ Shuning Xu ⋅ Qirui Yang ⋅ Yun wang ⋅ Xiangyu Chen ⋅ Zhong Ji
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 124
ColorFLUX: A Structure-Color Decoupling Framework for Old Photo Colorization
Bingchen Li ⋅ Zhixin Wang ⋅ Fan Li ⋅ Jiaqi Xu ⋅ Jiaming Guo ⋅ Renjing Pei ⋅ Xin Li ⋅ Zhibo Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 125
VEMamba: Efficient Isotropic Reconstruction of Volume Electron Microscopy with Axial-Lateral Consistent Mamba
Longmi Gao ⋅ Pan Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 126
Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models
Karim Kadry ⋅ Abdalla Abdelwahed ⋅ Ajay Manicka ⋅ Naravich Chutisilp ⋅ Farhad R. Nezami ⋅ Elazer R Edelman
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 127
EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy
Yumeng He ⋅ Zanwei Zhou ⋅ Yekun Zheng ⋅ Chen Liang ⋅ Yunbo Wang ⋅ Xiaokang Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 128
Underground Plant Exploration: Non-Destructive 3D Root Assessment with GPR Based on Point Graph Neural Network
Yuwei Zhou ⋅ Guoyu Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 129
Uni-Encoder Meets Multi-Encoders: Representation Before Fusion for Brain Tumor Segmentation with Missing Modalities
Peibo Song ⋅ Xiaotian Xue ⋅ Jinshuo Zhang ⋅ zihao wang ⋅ Jinhua liu ⋅ Shujun Fu ⋅ Fangxun Bao ⋅ Si Yong Yeo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 130
MicroFM: Physics-guided Flow Matching for Isotropic Microscopy Reconstruction
Xingzu Zhan ⋅ Runmin Jiang ⋅ Vatsal Gupta ⋅ Tanush Swaminathan ⋅ Yanwen Wang ⋅ Genpei Zhang ⋅ Haili Wang ⋅ Min Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 131
Dynamic Stream Network for Combinatorial Explosion Problem in Deformable Medical Image Registration
Shaochen Bi ⋅ Yuting He ⋅ Weiming Wang ⋅ Hao Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 132
PMRNet: Physics-informed Multi-scale Refinement Network for Medical Image Segmentation
Boce Kang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 133
Towards Robust Vision Transformers: Path Dependency Analysis and a Simple Two-Stage Adversarial Training
Seongmin Kim ⋅ Byung Cheol Song
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 134
PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention
Hefei Mei ⋅ Zirui Wang ⋅ Chang Xu ⋅ Jianyuan Guo ⋅ Minjing Dong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 135
When CLIP Sees More, It Fights Back Harder: Multi-View Guided Adaptive Counterattacks for Test-Time Adversarial Robustness
Sunoh Kim ⋅ Daeho Um
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 136
Hidden Dangers of Compositional Generation: Diagnosing Semantic Safety Failures in Text-to-Image Models
Haoming Yang ⋅ Ke Ma ⋅ ligonf zhang ⋅ Xiaojun Jia ⋅ Yingfei Sun ⋅ Qianqian Xu ⋅ Qingming Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 137
VisiLock: Authorizing Instruction-based Image editing with Dual Score Distillation
Van Thanh ⋅ Yun Fu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 138
JANUS: A Lightweight Framework for Jailbreaking Text-to-Image Models via Distribution Optimization
Haolun Zheng ⋅ Yu He ⋅ Tailun Chen ⋅ Shuo Shao ⋅ Zhixuan Chu ⋅ Hongbin zhou ⋅ Lan Tao ⋅ Zhan Qin ⋅ Kui Ren
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 139
GenBreak: Red Teaming Text-to-Image Generation Using Large Language Models
Zilong Wang ⋅ Xiang Zheng ⋅ Xiaosen Wang ⋅ Bo Wang ⋅ Xingjun Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 140
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Zhiheng Liu ⋅ Weiming Ren ⋅ Haozhe Liu ⋅ Zijian Zhou ⋅ Shoufa Chen ⋅ Haonan Qiu ⋅ Xiaoke Huang ⋅ Zhaochong An ⋅ Fanny Yang ⋅ Aditya Patel ⋅ Viktar Atliha ⋅ Tony Ng ⋅ Xiao Han ⋅ Chuyan Zhu ⋅ Chenyang Zhang ⋅ Ding Liu ⋅ Juan-Manuel Pérez-Rúa ⋅ Sen He ⋅ Jürgen Schmidhuber ⋅ Wenhu Chen ⋅ Ping Luo ⋅ Wei Liu ⋅ Tao Xiang ⋅ Jonas Schult ⋅ Yuren Cong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 141
Generate, Analyze, and Refine: Training-Free Sound Source Localization via MLLM Meta-Reasoning
Subin Park ⋅ Jung Uk Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 142
MMCP-GEN: A Modality-Extensible Diffusion Language Model for Conditional Protein Sequence Generation
Zeyu An ⋅ Wanyu Lin ⋅ Feng Tan ⋅ Shujun Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 143
Few-shot Acoustic Synthesis with Multimodal Flow Matching
Amandine Brunetto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 144
CLIP-like Model as a Foundational Density Ratio Estimator
Fumiya Uchiyama ⋅ Rintaro Yanagi ⋅ Shohei Taniguchi ⋅ Shota Takashiro ⋅ Masahiro Suzuki ⋅ Hirokatsu Kataoka ⋅ Yusuke Iwasawa ⋅ Yutaka Matsuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 145
Learning What Matters: Prioritized Concept Learning via Relative Error-driven Sample Selection
Qian Yang ⋅ Shivam Chandhok ⋅ Oscar Mañas ⋅ Kanishk Jain ⋅ Aishwarya Agrawal ⋅ Leonid Sigal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 146
EgoAVU: Egocentric Audio-Visual Understanding
Ashish Seth ⋅ Xinhao Mei ⋅ Changsheng Zhao ⋅ Varun Nagaraja ⋅ Ernie Chang ⋅ Gregory P. Meyer ⋅ Gael Le Lan ⋅ Yunyang Xiong ⋅ Vikas Chandra ⋅ Yangyang Shi ⋅ Dinesh Manocha ⋅ zhipeng cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 147
Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs
Jinqi Luo ⋅ Jinyu Yang ⋅ Tal Neiman ⋅ Lei Fan ⋅ Bing Yin ⋅ Son Dinh Tran ⋅ Mubarak Shah ⋅ Rene Vidal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 148
Multimodal Protein Language Models for Enzyme Kinetic Parameters: From Substrate Recognition to Conformational Adaptation
Fei Wang ⋅ Xinye Zheng ⋅ Kun Li ⋅ Yanyan Wei ⋅ Yuxin Liu ⋅ Ganpeng Hu ⋅ Tong Bao ⋅ Jingwen Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 149
Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models
Christian Simon ⋅ Masato Ishii ⋅ Wei-Yao Wang ⋅ Koichi Saito ⋅ Akio Hayakawa ⋅ Dongseok Shim ⋅ Zhi Zhong ⋅ Shuyang Cui ⋅ Takashi Shibuya ⋅ Shusuke Takahashi ⋅ Yuki Mitsufuji
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 150
Adaptive Confidence Regularization for Multimodal Failure Detection
Moru Liu ⋅ Hao Dong ⋅ Olga Fink ⋅ Mario Trapp
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 151
Factorize, Reconstruct, Enhance: A Unified Framework for Multimodal Sentiment Analysis
Zhilu Yang ⋅ Mingcheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 152
PhenoYieldNet: Learning Crop-Aware Phenological Responses for Multi-Crop Yield Prediction
Yu Luo ⋅ Xiaogang Zhu ⋅ Shan Zeng ⋅ Wei Xiang ⋅ Thomas Francis Bishop ⋅ Zhiyong Wang ⋅ Kun Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 153
Conflict-Aware Adaptive Cross-Reconstruction for Multimodal Sentiment Analysis
Yan Wang ⋅ Fuyuan Cao ⋅ Xingwang Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 154
EduDiag: A Benchmark for Educational Diagnostic Reasoning with Error Tracing and Correction on Large Multimodal Models
Jiali Chen ⋅ Yuqi Xue ⋅ Xusen Hei ⋅ DingBa Fu ⋅ wei yuancheng ⋅ Jiayuan Xie ⋅ Yi Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 155
UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark
Yanlin Li ⋅ Minghui Guo ⋅ Kaiwen Zhang ⋅ Shize Zhang ⋅ Yiran Zhao ⋅ Haodong Li ⋅ Congyue Zhou ⋅ Weijie Zheng ⋅ Yushen Yan ⋅ Shengqiong Wu ⋅ Wei Ji ⋅ Lei Cui ⋅ Furu Wei ⋅ Hao Fei ⋅ Mong-Li Lee ⋅ Wynne Hsu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 156
Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement
Chunlei Zhang ⋅ Jiahao Xia ⋅ Yun Xiao ⋅ Bo Jiang ⋅ Liying Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 157
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
Jovana Kondic ⋅ Pengyuan Li ⋅ Dhiraj Joshi ⋅ Isaac Sanchez ⋅ Ben wiesel ⋅ Shafiq Abedin ⋅ Amit Alfassy ⋅ Eli Schwartz ⋅ Daniel Caraballo ⋅ Yagmur Gizem Cinar ⋅ Florian Scheidegger ⋅ Steven I. Ross ⋅ Daniel Karl I. Weidele ⋅ Hang Hua ⋅ Ekaterina Arutyunova ⋅ Roei Herzig ⋅ Zihan Wang ⋅ Xinyue Yu ⋅ Yunfei Zhao ⋅ Sicong Jiang ⋅ Minghao Liu ⋅ Qunshu Lin ⋅ Aude Oliva ⋅ Rogerio Feris
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 158
Cross-Modal Guided Visual Synthesis for Data-Efficient Multimodal Depression Recognition
Shanliang Yang ⋅ Xiaoxiao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 159
AffordGrasp: Cross-Modal Diffusion for Affordance-Aware Grasp Synthesis
Xiaofei Wu ⋅ Yi Zhang ⋅ Yumeng Liu ⋅ Yuexin Ma ⋅ Yujiao Shi ⋅ Xuming He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 160
PAM: A Pose–Appearance–Motion Engine for Sim-to-Real HOI Video Generation
Mingju Gao ⋅ Kaisen Yang ⋅ Huan-ang Gao ⋅ Bohan Li ⋅ Ao Ding ⋅ Wenyi Li ⋅ Yangcheng Yu ⋅ Jinkun Liu ⋅ Shaocong Xu ⋅ Yike Niu ⋅ Haohan Chi ⋅ Hao Chen ⋅ Hao Tang ⋅ Yu Zhang ⋅ Li Yi ⋅ Hao Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 161
AffordGen: Generating Diverse Demonstrations for Generalizable Object Manipulation with Affordance Correspondence
Jiawei Zhang ⋅ Kaizhe Hu ⋅ Yingqian Huang ⋅ Yuanchen Ju ⋅ Zhengrong Xue ⋅ Huazhe Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 162
HandWorld: Hand-Centric Unified Video Action Generation
Zhihao Sun ⋅ Zhiying Du ⋅ Xitong Yang ⋅ Zuxuan Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 163
HVG-3D: Bridging Real and Simulation Domains for 3D-Conditional Hand-Object Interaction Video Synthesis
Mingjin Chen ⋅ Junhao Chen ⋅ Zhaoxin Fan ⋅ Yujian Lee ⋅ Zichen Dang ⋅ Lili Wang ⋅ Yawen Cui ⋅ Lap-Pui Chau ⋅ Yi Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 164
ArtHOI: Taming Foundation Models for Monocular 4D Reconstruction of Hand-Articulated-Object Interactions
Zikai Wang ⋅ Zhilu Zhang ⋅ Yiqing Wang ⋅ Hui Li ⋅ Wangmeng Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 165
LAM: Language Articulated Object Modelers
Yipeng Gao ⋅ Yunhao Ge ⋅ Peilin Cai ⋅ Daniel Seita ⋅ Laurent Itti
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 166
Haptic Neural Fields: Bringing Tactile Interactions to 3D Rendered Scenes
Antonio Luigi Stefani ⋅ Niccolò Bisagno ⋅ Nicola Conci ⋅ Eckehard Steinbach ⋅ Francesco De Natale
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 167
Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation
Haodong Yan ⋅ Hang Yu ⋅ Zhide Zhong ⋅ Weilin Yuan ⋅ Xin Gong ⋅ Zehang Luo ⋅ Chengxi Heyu ⋅ Junfeng Li ⋅ Wenxuan Song ⋅ Shunbo Zhou ⋅ Haoang Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 168
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing
Runjia Li ⋅ Moayed Haji Ali ⋅ Ashkan Mirzaei ⋅ Chaoyang Wang ⋅ Arpit Sahni ⋅ Ivan Skorokhodov ⋅ Aliaksandr Siarohin ⋅ Tomas Jakab ⋅ Junlin Han ⋅ Sergey Tulyakov ⋅ Philip H.S. Torr ⋅ Willi Menapace
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 169
From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition
Jingxi Chen ⋅ Yixiao Zhang ⋅ Xiaoye qian ⋅ Zongxia Li ⋅ Cornelia Fermuller ⋅ Caren Chen ⋅ Yiannis Aloimonos
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 170
Temporal Equilibrium MeanFlow: Bridging the Scale Gap for One-Step Generation
Yuanpeng Tu ⋅ Yunpeng Chen ⋅ Xinyu Zhang ⋅ Chao Liao ⋅ Hengshuang Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 171
PROMO: Promptable Outfitting for Efficient High-Fidelity Virtual Try-On
Haohua Chen ⋅ Tianze Zhou ⋅ Wei Zhu ⋅ Runqi Wang ⋅ Yandong Guan ⋅ Dejia Song ⋅ Yibo Chen ⋅ Xu Tang ⋅ Yao Hu ⋅ Lu Sheng ⋅ Zhiyong Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 172
Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy
Teng Hu ⋅ Zhentao Yu ⋅ Guozhen Zhang ⋅ Zihan Su ⋅ zhengguang zhou ⋅ Youliang Zhang ⋅ Yuan Zhou ⋅ qinglin lu ⋅ Ran Yi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 173
UniSER: A Foundation Model for Unified Soft Effects Removal
Jingdong Zhang ⋅ Lingzhi Zhang ⋅ Qing Liu ⋅ Mang Tik Chiu ⋅ Connelly Barnes ⋅ Yizhou Wang ⋅ Haoran You ⋅ Xiaoyang Liu ⋅ Yuqian Zhou ⋅ Zhe Lin ⋅ Eli Shechtman ⋅ Sohrab Amirghodsi ⋅ Xin Li ⋅ Wenping Wang ⋅ Xiaohang Zhan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 174
EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
Shiyuan Yang ⋅ Ruihuang Li ⋅ Jiale Tao ⋅ Shuai Shao ⋅ qinglin lu ⋅ Jing Liao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 175
Inference-time Physics Alignment of Video Generative Models with Latent World Models
Jianhao Yuan ⋅ Zhang Xiaofeng ⋅ Felix Friedrich ⋅ Nicolas Beltran-Velez ⋅ Melissa Hall ⋅ Reyhane Askari ⋅ Xiaofeng Zhang ⋅ Nicolas Ballas ⋅ Michal Drozdzal ⋅ Adriana Romero-Soriano
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 176
SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
Xuancheng Xu ⋅ Li Yaning ⋅ Sisi You ⋅ Bing-Kun Bao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 177
Plenoptic Video Generation
Xiao Fu ⋅ Shitao Tang ⋅ Min Shi ⋅ Xian Liu ⋅ Jinwei Gu ⋅ Ming-Yu Liu ⋅ Dahua Lin ⋅ Chen-Hsuan Lin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 178
PyramidalWan: On Making Pretrained Video Model Pyramidal for Efficient Inference
Denis Korzhenkov ⋅ Adil Karjauv ⋅ Animesh Karnewar ⋅ Mohsen Ghafoorian ⋅ Amir Habibian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 179
AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
Yan Li ⋅ Changyao TIAN ⋅ Renqiu Xia ⋅ Ning Liao ⋅ Weiwei Guo ⋅ Hongsheng Li ⋅ Jifeng Dai ⋅ Hao Li ⋅ Xue Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 180
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Zhaochong An ⋅ Menglin Jia ⋅ Haonan Qiu ⋅ Zijian Zhou ⋅ Xiaoke Huang ⋅ Zhiheng Liu ⋅ Weiming Ren ⋅ Kumara Kahatapitiya ⋅ Ding Liu ⋅ Sen He ⋅ Chenyang Zhang ⋅ Tao Xiang ⋅ Fanny Yang ⋅ Serge Belongie ⋅ Tian Xie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 181
Flowception: Temporally Expansive Flow Matching for Video Generation
Tariq Berrada Ifriqi ⋅ John Nguyen ⋅ Karteek Alahari ⋅ Jakob Verbeek ⋅ Ricky T. Q. Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 182
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
Shengming Yin ⋅ Zekai Zhang ⋅ Zecheng Tang ⋅ Kaiyuan Gao ⋅ Xiao Xu ⋅ Kun Yan ⋅ Jiahao Li ⋅ Yilei chen ⋅ Yuxiang Chen ⋅ Heung-Yeung Shum ⋅ Lionel M. Ni ⋅ Junyang Lin ⋅ Chenfei Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 183
Linear Image Generation by Synthesizing Exposure Brackets
Yuekun Dai ⋅ Zhoutong Zhang ⋅ Shangchen Zhou ⋅ Nanxuan Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 184
Low-Resolution Editing is All You Need for High-Resolution Editing
Junsung Lee ⋅ Hyunsoo Lee ⋅ Yong Jae Lee ⋅ Bohyung Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 185
UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
Yanran Zhang ⋅ Wenzhao Zheng ⋅ Yifei Li ⋅ Bingyao Yu ⋅ Yu Zheng ⋅ Lei Chen ⋅ Jiwen Lu ⋅ Jie Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 186
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
ZHOUJIE FU ⋅ Xianfang Zeng ⋅ jinghong lan ⋅ Xinyao Liao ⋅ Chen Cheng ⋅ Junyi Chen ⋅ Jiacheng Wei ⋅ Wei Cheng ⋅ Shiyu Liu ⋅ Yunuo Chen ⋅ Gang Yu ⋅ Guosheng Lin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 187
VENI: Variational Encoder for Natural Illumination
Paul Walker ⋅ James A. D. Gardner ⋅ Andreea Ardelean ⋅ William A. P. Smith ⋅ Bernhard Egger
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 188
SketchAssist: A Practical Assistant for Semantic Edits and Precise Local Redrawing
Han Zou ⋅ Yan Zhang ⋅ Ruiqi Yu ⋅ Cong Xie ⋅ Jie Huang ⋅ Zhan Zhenpeng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 189
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Qinghe Wang ⋅ Xiaoyu Shi ⋅ Baolu Li ⋅ Weikang Bian ⋅ Quande Liu ⋅ Huchuan Lu ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Kun Gai ⋅ Xu Jia
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 190
MoCha: End-to-End Video Character Replacement without Structural Guidance
Zhengbo Xu ⋅ Jie Ma ⋅ Ziheng Wang ⋅ Zhan Peng ⋅ Jun Liang ⋅ Jing Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 191
Negative Binomial Variational Autoencoders for Overdispersed Latent Modeling
Yixuan Zhang ⋅ Jinhao Sheng ⋅ Wenxin Zhang ⋅ Quyu Kong ⋅ Feng Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 192
Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods
Omer Ben Hayun ⋅ Roy Betser ⋅ Meir Yossef Levi ⋅ Levi Kassel ⋅ Guy Gilboa
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 193
VOSR: A Vision-Only Generative Model for Image Super-Resolution
Rongyuan Wu ⋅ Lingchen Sun ⋅ Zhengqiang ZHANG ⋅ Xiangtao Kong ⋅ Jixin Zhao ⋅ Shihao Wang ⋅ Lei Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 194
Dual Graph Regularized Deep Unfolding Network for Guided Depth Map Super-resolution
Zhiwei Zhong ⋅ Peilin CHEN ⋅ Qiangqiang Shen ⋅ Bo Li ⋅ Shiqi Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 195
DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
Zhengyao Lv ⋅ Menghan Xia ⋅ Xintao Wang ⋅ Kwan-Yee K. Wong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 196
VSRELL: A Simple Baseline for Video Super-Resolution and Enhancement in Low-Light Environment
Yanming hui ⋅ Fanhua Shang ⋅ Hongying Liu ⋅ Ben Wang ⋅ Zhenwei Zhang ⋅ Liang Wan ⋅ Wei Feng ⋅ Tong Xue ⋅ Bingqin Lv
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 197
Gradient Knows Best: Mixed-Precision Quantization via Gradient-Guided Bit Allocation for Super-Resolution
Jun Young Kim ⋅ Joo Jeon ⋅ Sangyeon Ahn ⋅ Yoonseo Park ⋅ Yong Oh ⋅ Bogyeong Kim ⋅ Sung In Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 198
Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset
Yang Zou ⋅ Jun Ma ⋅ Zhidong Jiao ⋅ Xingyuan Li ⋅ Zhiying Jiang ⋅ Jinyuan Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 199
Next-Scale Autoregressive Models for Text-to-Motion Generation
Zhiwei Zheng ⋅ Shibo Jin ⋅ Lingjie Liu ⋅ Mingmin Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 200
Push-and-Step: From RL-Based Balance Recovery to Physical Simulation of Dense Crowds
Alexis Jensen ⋅ Pei Xu ⋅ Ioannis Karamouzas ⋅ Charles Pontonnier ⋅ Julien Pettré
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 201
Iterative Closed-Loop Motion Synthesis for Scaling the Capabilities of Humanoid Control
Weisheng Xu ⋅ Qiwei Wu ⋅ Jiaxi Zhang ⋅ Jing Tan ⋅ Yangfan Li ⋅ Yuetong Fang ⋅ Jiaqi Xiong ⋅ Kai Wu ⋅ Rong OU ⋅ Renjing Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 202
RoMo: A Large-Scale, Richly Organized Dataset and Semantic Taxonomy for Human Motion Generation
Jiahao Zhang ⋅ Joseph Liu ⋅ Young-Yoon Lee ⋅ Seonghyeon Moon ⋅ Victor Zordan ⋅ Guy Tevet ⋅ C. Karen Liu ⋅ Stephen Gould ⋅ Oren Jacob ⋅ Haomiao Jiang ⋅ Mubbasir Kapadia ⋅ Yizhak Ben-Shabat
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 203
FrankenMotion: Part-level Human Motion Generation and Composition
Chuqiao Li ⋅ Xianghui Xie ⋅ Yong Cao ⋅ Andreas Geiger ⋅ Gerard Pons-Moll
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 204
HSI-GPT2: A Dual-Granularity Large Motion Reasoning Model with Diffusion Refinement for Human–Scene Interaction
Yuan Wang ⋅ LI XIANG ⋅ Yali Li ⋅ XUEGE HOU ⋅ Shengjin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 205
SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens
Anindita Ghosh ⋅ Vladislav Golyanik ⋅ Taku Komura ⋅ Philipp Slusallek ⋅ Christian Theobalt ⋅ Rishabh Dabral
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 206
Progressive Guessing to Fixed Point: Rethinking Human Motion Prediction with Deep Equilibrium Models
Dong Wei ⋅ Huaijiang Sun ⋅ Fan Liu ⋅ Yuhui Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 207
Archon: A Unified Multimodal Model for Holistic Digital Human Generation
Chong Bao ⋅ Shichen Liu ⋅ Lijun Yu ⋅ David Futschik ⋅ Stylianos Moschoglou ⋅ Shefali Srivastava ⋅ Ziqian Bai ⋅ Feitong Tan ⋅ Guofeng Zhang ⋅ Zhaopeng Cui ⋅ Sean Fanello ⋅ Yinda Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 208
ReMoGen: Real-time Human Interaction-to-Reaction Generation via Modular Learning from Diverse Data
Yaoqin Ye ⋅ Yiteng Xu ⋅ Qin Sun ⋅ Xinge Zhu ⋅ YUJING SUN ⋅ Yuexin Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 209
Towards Motion Turing Test: Evaluating Human-Likeness in Humanoid Robots
Mingzhe Li ⋅ Mengyin Liu ⋅ Zekai Wu ⋅ Xincheng Lin ⋅ Junsheng Zhang ⋅ Ming Yan ⋅ Zengye Xie ⋅ Changwang Zhang ⋅ Chenglu Wen ⋅ Lan Xu ⋅ Siqi Shen ⋅ Cheng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 210
PatchScene: Patch-based Voxel Diffusion Model for Large-Scale Scene Completion
Qingdong Xu ⋅ Jiajun Zhu ⋅ Shilin Zhu ⋅ Xinjing He ⋅ Chao Lu ⋅ Huanran Wang ⋅ Jiyao Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 211
Prototype-Guided Concept Erasure in Diffusion Models
Yuze Cai ⋅ Jiahao Lu ⋅ Hongxiang Shi ⋅ Yichao Zhou ⋅ Hong Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 212
Any2Any 3D Diffusion Models with Knowledge Transfer: A Radiotherapy Planning Study
Yuhan Wang ⋅ Zihan Li ⋅ Han Liu ⋅ Simon Arberet ⋅ Martin F. Kraus ⋅ Yuyin Zhou ⋅ Florin-Cristian Ghesu ⋅ Dorin Comaniciu ⋅ Ali Kamen ⋅ Riqiang Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 213
CARD: Correlation Aware Restoration with Diffusion
Niki Nezakati ⋅ Arnab Ghosh ⋅ Amit K. Roy-Chowdhury ⋅ Vishwanath Saragadam
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 214
DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis
Xinglong Luo ⋅ Ao Luo ⋅ Zhengning Wang ⋅ Yueqi Yang ⋅ Chaoyu Feng ⋅ Lei Lei ⋅ Bing Zeng ⋅ Shuaicheng Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 215
DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease
Runsheng Bai ⋅ Chengyu Zhang ⋅ Yangdong Deng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 216
Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?
Renye Yan ⋅ Jikang Cheng ⋅ Shikun Sun ⋅ Yi Sun ⋅ You Wu ⋅ Wei Peng ⋅ Zongwei Wang ⋅ Ling Liang ⋅ Junliang Xing ⋅ Yimao Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 217
CSF: Black-box Fingerprinting via Compositional Semantics for Text-to-Image Models
Junhoo Lee ⋅ Mijin Koo ⋅ Nojun Kwak
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 218
InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior
Weimin Bai ⋅ Suzhe Xu ⋅ Yiwei Ren ⋅ Jinhua Hao ⋅ Ming Sun ⋅ Wenzheng Chen ⋅ He Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 219
MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition–Perception–Reasoning Guided Text-Image Machine Translation
Gengluo Li ⋅ Chengquan Zhang ⋅ Yupu Liang ⋅ Huawen Shen ⋅ Yaping Zhang ⋅ Pengyuan Lyu ⋅ Weinong Wang ⋅ Xingyu Wan ⋅ Gangyan Zeng ⋅ Han Hu ⋅ Can Ma ⋅ Yu ZHOU
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 220
M3DocDep: Multi-modal, Multi-page, Multi-document Dependency Chunking with Large Vision-Language Models
Joongmin Shin ⋅ Jeongbae Park ⋅ Jaehyung Seo ⋅ Heuiseok Lim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 221
Towards Policy-Adaptive Image Guardrail: Benchmark and Method
Caiyong Piao ⋅ Zhiyuan Yan ⋅ Haoming Xu ⋅ Yunzhen Zhao ⋅ Kaiqing Lin ⋅ Feiyang Xu ⋅ Shuigeng Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 222
Flat-Pack Bench: Evaluating Spatio-Temporal Understanding in Large Vision-Language Models through Furniture Assembly
Aditya Chetan ⋅ Eric Cai ⋅ Peeyush Kushwaha ⋅ Bharath Raj Nagoor Kani ⋅ Utkarsh Mall ⋅ Qianqian Wang ⋅ Noah Snavely ⋅ Bharath Hariharan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 223
TextFM: Robust Semi-dense Feature Matching with Language Guidance
Zhihao Zheng ⋅ Jinglun Feng ⋅ Nirav Savaliya ⋅ Zheng-Hang Yeh ⋅ Bo Lang ⋅ Mooi Choo Chuah
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 224
What’s Wrong with Synthetic Data for Scene Text Recognition? A Strong Synthetic Engine with Diverse Simulations and Self-Evolution
Xingsong Ye ⋅ Yongkun Du ⋅ Jiaxin Zhang ⋅ Chen Li ⋅ Jing LYU ⋅ Zhineng Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 225
Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Cheng Cui ⋅ Ting Sun ⋅ Suyin Liang ⋅ Tingquan Gao ⋅ Zelun Zhang ⋅ Jiaxuan Liu ⋅ Xueqing Wang ⋅ Changda Zhou ⋅ Hongen Liu ⋅ Manhui Lin ⋅ Yue Zhang ⋅ yubo zhang ⋅ Jing Zhang ⋅ Jun Zhang ⋅ Xing Wei ⋅ Yi Liu ⋅ Dianhai Yu ⋅ Yanjun Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 226
SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation
Jialiang Kang ⋅ Han Shu ⋅ Wenshuo Li ⋅ Yingjie Zhai ⋅ Xinghao Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 227
Point Cloud as a Foreign Language for Multi-modal Large Language Model
Sneha Paul ⋅ Zachary Patterson ⋅ Nizar Bouguila
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 228
Grounded 3D-Aware Spatial Vision-Language Modeling
An-Chieh Cheng ⋅ Yang Fu ⋅ Yatai Ji ⋅ Ligeng Zhu ⋅ Guanqi Zhan ⋅ Zhuoyang Zhang ⋅ Zhaojing Yang ⋅ Song Han ⋅ Yao Lu ⋅ Pavlo Molchanov ⋅ Vidya Nariyambut Murali ⋅ Jan Kautz ⋅ Xiaolong Wang ⋅ Danny Yin ⋅ Sifei Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 229
SpatialTree: How Spatial Intelligence Branches Out in MLLMs
Yuxi Xiao ⋅ longfei li ⋅ Shen Yan ⋅ Xinhang Liu ⋅ Sida Peng ⋅ Yunchao Wei ⋅ Xiaowei Zhou ⋅ Bingyi Kang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 230
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation
Yan Shu ⋅ Bin Ren ⋅ Zhitong Xiong ⋅ Xiao Xiang Zhu ⋅ Begüm Demir ⋅ Nicu Sebe ⋅ Paolo Rota
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 231
Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
Chun-Hsiao Yeh ⋅ Shengyi Qian ⋅ Manchen Wang ⋅ Yi Ma ⋅ Joseph Tighe ⋅ Fanyi Xiao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 232
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
Sheng-Yu Huang ⋅ Jaesung Choe ⋅ Yu-Chiang Frank Wang ⋅ Cheng Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 233
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
Vineet Bhat ⋅ Sungsu Kim ⋅ Valts Blukis ⋅ Greg Heinrich ⋅ Prashanth Krishnamurthy ⋅ Ramesh Karri ⋅ Stan Birchfield ⋅ Farshad Khorrami ⋅ Jonathan Tremblay
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 234
Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models
Shengli Zhou ⋅ Minghang Zheng ⋅ Feng Zheng ⋅ Yang Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 235
Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
Hao Zhong ⋅ Muzhi Zhu ⋅ Shenyan Zeng ⋅ Anzhou Li ⋅ Cong Chen ⋅ Hua Geng ⋅ Duochao Shi ⋅ Wentao Ye ⋅ Tao Lin ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 236
REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting
Changyue Shi ⋅ Minghao Chen ⋅ Yiping Mao ⋅ Chuxiao Yang ⋅ Xinyuan Hu ⋅ Jiajun Ding ⋅ Zhou Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 237
From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs
Mingrui Wu ⋅ Zhaozhi Wang ⋅ Fangjinhua Wang ⋅ Jiaolong Yang ⋅ Marc Pollefeys ⋅ Tong Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 238
MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
Changli Wu ⋅ Haodong Wang ⋅ Jiayi Ji ⋅ Yutian Yao ⋅ Chunsai Du ⋅ Jihua Kang ⋅ Yanwei Fu ⋅ Liujuan Cao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 239
SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
Ruosen Zhao ⋅ Zhikang Zhang ⋅ Jialei Xu ⋅ Jiahao Chang ⋅ Dong Chen ⋅ Lingyun Li ⋅ Weijian Sun ⋅ Zizhuang Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 240
ReMatch: Boosting Representation through Matching for Multimodal Retrieval
Qianying Liu ⋅ Xiao Liang ⋅ Zhiqiang Zhang ⋅ Yibo Chen ⋅ Xu Tang ⋅ Zhongfei Qing ⋅ Fengfan Zhou ⋅ Yao Hu ⋅ Paul Henderson
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 241
RI-Mamba: Rotation-Invariant Mamba for Robust Text-to-Shape Retrieval
Khanh Nguyen ⋅ Dasith de Silva Edirimuni ⋅ Ghulam Mubashar Hassan ⋅ Ajmal Mian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 242
Revisiting F-measure Optimization in Multi-Label Classification: A Sampling-based Approach
Zixun Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 243
Thinking Beyond Labels: Vocabulary-Free Fine-Grained Recognition using Reasoning-Augmented LMMs
Dmitry Demidov ⋅ Muhammad Zaigham Zaheer ⋅ Zongyan Han ⋅ Omkar Thawakar ⋅ Rao Anwer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 244
WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval
Tianyue Wang ⋅ Leigang Qu ⋅ tianyu yang ⋅ xiangzhao hao ⋅ Yifan Xu ⋅ Haiyun Guo ⋅ Jinqiao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 245
Modeling the Visual Ambiguity of Human Sketches
Yang Zhou ⋅ Ping Ni ⋅ Jin Wang ⋅ Senyun Jia ⋅ Jingdan Yan ⋅ Kaixiang Huang ⋅ Guodong Lu ⋅ Jingru Yang ⋅ Shengfeng He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 246
SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval
Qunjie Huang ⋅ Weina Zhu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 247
ConeSep: Cone-based Robust Noise-Unlearning Compositional Network for Composed Image Retrieval
Zixu Li ⋅ Yupeng Hu ⋅ Zhiwei Chen ⋅ Mingyu Zhang ⋅ Zhiheng Fu ⋅ Liqiang Nie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 248
V^2-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence
Jiancheng Pan ⋅ Runze Wang ⋅ Tianwen Qian ⋅ Mohammad Mahdi ⋅ Yanwei Fu ⋅ Xiangyang Xue ⋅ Xiaomeng Huang ⋅ Luc Van Gool ⋅ Danda Paudel ⋅ Yuqian Fu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 249
WeaveTime: Streaming from Earlier Frames into Emergent Memory in VideoLLMs
Yulin Zhang ⋅ Cheng Shi ⋅ Sibei Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 250
Streaming Video Crime Anticipation with Spatio-Temporal Causal Reasoning
Yusong Wang ⋅ Zheyuan Gu ⋅ Keyu Mao ⋅ Minghao Shao ⋅ Mingkun Xu ⋅ Prayag Tiwari ⋅ Jiawei Shao ⋅ qingsong zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 251
Efficient Frame Selection for Long Video Understanding via Reinforcement Learning
Yaxuan Qin ⋅ Hefei Li ⋅ Wenqi Mu ⋅ Yancheng He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 252
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
Joungbin An ⋅ Kristen Grauman
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 253
InternVideo-Next: Towards World-Understanding Video Models
Chenting Wang ⋅ Yuhan Zhu ⋅ Yicheng Xu ⋅ Jiange Yang ⋅ ziang yan ⋅ Yali Wang ⋅ Yi Wang ⋅ Limin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 254
Condensed Test-Time Adaptation of VLMs for Action Recognition
Wenxuan Ge ⋅ Qu Hongyu ⋅ Rui Yan ⋅ Guo-Sen Xie ⋅ Yazhou Yao ⋅ Xiangbo Shu ⋅ Jinhui Tang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 255
Test-time Ego-Exo-centric Adaptation for Action Anticipation via Multi-Label Prototype Growing and Dual-Clue Consistency
Zhaofeng Shi ⋅ Heqian Qiu ⋅ Lanxiao Wang ⋅ Qingbo Wu ⋅ Fanman Meng ⋅ Lili Pan ⋅ Hongliang Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 256
A Stitch in Time: Learning Procedural Workflow via Self-Supervised Plackett–Luce Ranking
chengan che ⋅ Chao Wang ⋅ Xinyue Chen ⋅ Sophia Tsoka ⋅ Luis Carlos Garcia Peraza Herrera
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 257
SurgCoT: Advancing Spatiotemporal Reasoning in Surgical Videos through a Chain-of-Thought Benchmark
Gui Wang ⋅ YongSong Zhou ⋅ Kaijun Deng ⋅ Wooi Ping Cheah ⋅ Rong Qu ⋅ Jianfeng Ren ⋅ Linlin Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 258
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing
Baifeng Shi ⋅ Stephanie Fu ⋅ Long Lian ⋅ Hanrong Ye ⋅ David Eigen ⋅ Aaron Reite ⋅ Jan Kautz ⋅ Boyi Li ⋅ David Chan ⋅ Trevor Darrell ⋅ Pavlo Molchanov ⋅ Danny Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 259
Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness
Yehonatan Elisha ⋅ Oren Barkan ⋅ Noam Koenigstein
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 260
Explaining Object Detectors via Collective Contribution of Pixels
Toshinori Yamauchi ⋅ Hiroshi Kera ⋅ Kazuhiko Kawamoto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 261
Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation
Ruoyu Chen ⋅ Xiaoqing Guo ⋅ Kangwei Liu ⋅ Siyuan Liang ⋅ Shiming Liu ⋅ Qunli Zhang ⋅ Laiyuan Wang ⋅ Hua Zhang ⋅ Xiaochun Cao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 262
H-Sets: Hessian-Guided Discovery of Set-Level Feature Interactions in Image Classifiers
Ayushi Mehrotra ⋅ Dipkamal Bhusal ⋅ Michael Clifford ⋅ Nidhi Rastogi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 263
Evaluating Generative Models via One-Dimensional Code Distributions
Zexi Jia ⋅ Pengcheng Luo ⋅ Yijia Zhong ⋅ Jinchao Zhang ⋅ Jie Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 264
TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection
Jian-Yu Jiang-Lin ⋅ Kang-Yang Huang ⋅ Ling Zou ⋅ Ling Lo ⋅ Sheng-Ping Yang ⋅ Yu-Wen Tseng ⋅ Kun-Hsiang Lin ⋅ Chia-Ling Chen ⋅ Yu-Ting Ta ⋅ Yan-Tsung Wang ⋅ Po-Ching Chen ⋅ Hongxia Xie ⋅ Hong-Han Shuai ⋅ Wen-Huang Cheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 265
BuildAnyPoint: 3D Building Structured Abstraction from Diverse Point Clouds
Tongyan Hua ⋅ Haoran Gong ⋅ Yuan Liu ⋅ Di Wang ⋅ Ying-Cong Chen ⋅ Wufan Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 266
LiDAR-to-4DRadar Diffusion Bridge via Cross-Modal Alignment and Translation in Latent Space
Dazhong Shen ⋅ Jingjing Gu ⋅ Qiang Zhou ⋅ Meng Zhao ⋅ Ying Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 267
Edges Compete for Trust: Group Relative Edge Optimization for Building Reconstruction from Point Clouds
Yujun Liu ⋅ Ruisheng Wang ⋅ Xiang Ao ⋅ Haoyuan Shen ⋅ Kuihao Wang ⋅ Kun Zhou ⋅ Qingquan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 268
Unsupervised Monocular 3D Keypoint Discovery from Multi-View Diffusion Priors
Subin Jeon ⋅ In Cho ⋅ Junyoung Hong ⋅ Woong Oh ⋅ Seon Joo Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 269
QD-PCQA: Quality-Aware Domain Adaptation for Point Cloud Quality Assessment
Guohua Zhang ⋅ Jian Jin ⋅ Meiqin Liu ⋅ Chao Yao ⋅ Weisi Lin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 270
L3DR: 3D-aware LiDAR Diffusion and Rectification
QUAN LIU ⋅ Xiaoqin Zhang ⋅ Ling Shao ⋅ Shijian Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 271
Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal
Kazuma Ikeda ⋅ Ryosei Hara ⋅ Rokuto Nagata ⋅ Ozora Sako ⋅ Zihao Ding ⋅ Takahiro Kado ⋅ Ibuki Fujioka ⋅ Taro Beppu ⋅ Mariko Isogawa ⋅ Kentaro Yoshioka
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 272
Ghosts in the Point Clouds: De-glaring LiDAR in the Transient Domain
Avery gump ⋅ Connor Henley ⋅ Sungjin Cheong ⋅ Akarsh Prabhakara ⋅ Mohit Gupta
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 273
MS^2Gait: A Multi-Scale Spatio-Temporal Fusion Network for LiDAR-based Gait Recognition
Shenyin Xu ⋅ Yishan Wang ⋅ Xinyu Li ⋅ Rui Liu ⋅ Zhongyuan Wang ⋅ Xin Tian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 274
Foundry: Distilling 3D Foundation Models for the Edge
Guillaume Letellier ⋅ Siddharth Srivastava ⋅ Frederic Jurie ⋅ Gaurav Sharma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 275
Learning to Identify Out-of-Distribution Objects for 3D LiDAR Anomaly Segmentation
Simone Mosco ⋅ Daniel Fusaro ⋅ Alberto Pretto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 276
Dual-Level Confidence based Implicit Self-Refinement for Medical Visual Question Answering
Meihong Pan ⋅ Yefeng Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 277
FedMPT: Federated Multi-Label Prompt Tuning of Vision-Language Models
Xucong Wang ⋅ Pengkun Wang ⋅ Zhe Zhao ⋅ Liheng Yu ⋅ Shuang Wang ⋅ Yang Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 278
Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance
Muyang Li ⋅ Yucheng Liu ⋅ Jianbo Ma ⋅ Elliot Osborne ⋅ Bo Han ⋅ Tongliang Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 279
NTK-Guided Implicit Neural Teaching
Chen Zhang ⋅ Wei Zuo ⋅ Bingyang Cheng ⋅ Yikun Wang ⋅ Wei-Bin Kou ⋅ Yik-Chung WU ⋅ Ngai Wong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 280
SynthRGB-T: Language-Vision Guided Image Translation for Diversity Synthesis
Jiangang Ding ⋅ Yiquan Du ⋅ Pengxiang Li ⋅ Lili Pei ⋅ Yuanlin Zhao ⋅ Wei Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 281
Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
Shojiro Yamabe ⋅ Futa Waseda ⋅ Daiki Shiono ⋅ Tsubasa Takahashi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 282
Harmonious Parameter Adaptation in Continual Visual Instruction Tuning for Safety-Aligned MLLMs
Ziqi Wang ⋅ Chang Che ⋅ Qi Wang ⋅ Hui Ma ⋅ Zenglin Shi ⋅ Cees G. M. Snoek ⋅ Meng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 283
StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues
Zanxi Ruan ⋅ Songqun Gao ⋅ Qiuyu Kong ⋅ Yiming Wang ⋅ Marco Cristani
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 284
Same or Not? Enhancing Visual Perception in Vision-Language Models
Damiano Marsili ⋅ Aditya Mehta ⋅ Ryan Y. ⋅ Georgia Gkioxari
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 285
Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure
Jooyeol Yun ⋅ Jaegul Choo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 286
AssemblyBench: Physics-Aware Assembly of Complex Industrial Objects
Danrui Li ⋅ Jiahao Zhang ⋅ Bernhard Egger ⋅ Moitreya Chatterjee ⋅ Suhas Lohit ⋅ Tim Marks ⋅ Anoop Cherian
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 287
Animator-Centric Skeleton Generation on Objects with Fine-Grained Details
Mingze Sun ⋅ Cheng Zeng ⋅ Pei Jiansong ⋅ Junhao Chen ⋅ Chaoyue Song ⋅ Shaohui Wang ⋅ Tianyuan Chang ⋅ Bin Huang ⋅ Zijiao Zeng ⋅ Ruqi Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 288
Synthesizing Visual Concepts as Vision-Language Programs
Antonia Wüst ⋅ Wolfgang Stammer ⋅ Hikaru Shindo ⋅ Lukas Helff ⋅ Devendra Singh Dhami ⋅ Kristian Kersting
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 289
Self-Consistency for LLM-Based Motion Trajectory Generation and Verification
Jiaju Ma ⋅ R. Kenny Jones ⋅ Jiajun Wu ⋅ Maneesh Agrawala
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 290
Semantic Scale Space: A Framework for Controllable Image Abstraction
Kazu Mishiba
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 291
Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection
Dacheng Qi ⋅ Chenyu Wang ⋅ Jingwei Xu ⋅ Tianzhe Chu ⋅ Zibo Zhao ⋅ Wen Liu ⋅ Wenrui Ding ⋅ Yi Ma ⋅ Shenghua Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 292
DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime
Julian Lorenz ⋅ Vladyslav Kovganko ⋅ Elias Kohout ⋅ Mrunmai Phatak ⋅ Daniel Kienzle ⋅ Rainer Lienhart
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 293
SIF: Semantically In-Distribution Fingerprints for Large Vision-Language Models
Yifei Zhao ⋅ Qian Lou ⋅ Mengxin Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 294
Designing to Forget: Deep Semi-parametric Models for Unlearning
Amber Yija Zheng ⋅ YU-SHAN TAI ⋅ Raymond A. Yeh
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 295
Meta-FC: Meta-Learning with Feature Consistency for Robust and Generalizable Watermarking
Yuheng Li ⋅ Weitong Chen ⋅ chengcheng zhu ⋅ Jiale Zhang ⋅ Chunpeng Ge ⋅ Di Wu ⋅ Guodong Long
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 296
PrivSynth: Alternating and Control-Based Optimization for Privacy and Utility in Synthetic Data
Xinyuan Zhao ⋅ Hanlin Gu ⋅ Guibao Song ⋅ Gongxi Zhu ⋅ Yifei Zou ⋅ Lixin Fan ⋅ Yuxing Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 297
Neighbor-Aware Localized Concept Erasure in Text-to-Image Diffusion Models
Zhuan Shi ⋅ Alireza Dehghanpour Farashah ⋅ Rik de Vries ⋅ Golnoosh Farnadi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 298
EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment
Ruoxi Cheng ⋅ Hao-Xuan Ma ⋅ Teng Ma ⋅ Hongyi Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 299
Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models
Yabin Zhang ⋅ Maya Varma ⋅ Yunhe Gao ⋅ Jean-Benoit Delbrouck ⋅ Jiaming Liu ⋅ Chong Wang ⋅ Curtis Langlotz
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 300
A Polynomial Chaos Framework for Causal Discovery in Nonlinear Uncertain Systems
Liang Cao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 301
Domain-Skewed Federated Learning with Feature Decoupling and Calibration
Huan Wang ⋅ Jun Shen ⋅ Jun Yan ⋅ Guansong Pang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 302
From Selection to Scheduling: Federated Geometry-Aware Correction Makes Exemplar Replay Work Better under Continual Dynamic Heterogeneity
Zhuang Qi ⋅ Yingpeng Tang ⋅ Lei Meng ⋅ Guoqing Chao ⋅ Lei Wu ⋅ Han Yu ⋅ Xiangxu Meng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 303
Fine-Tuning Impairs the Balancedness of Foundation Models in Long-tailed Personalized Federated Learning
Shihao Hou ⋅ Chikai Shang ⋅ Zhiheng Yang ⋅ jiacheng yang ⋅ Xinyi Shang ⋅ Junlong Gao ⋅ Yiqun Zhang ⋅ Yang Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 304
Few-for-Many Personalized Federated Learning
Ping Guo ⋅ ZHANG Tiantian ⋅ Xi Lin ⋅ Xiang Li ⋅ Zhi-Ri Tang ⋅ Qingfu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 305
ProxyFL: A Proxy-Guided Framework for Federated Semi-Supervised Learning
Duowen Chen ⋅ Yan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 306
Domain Sensitive Federated Learning with Fisher-Informed Pruning
Chenchen Lin ⋅ Wenhao Yuan ⋅ Zhengji Xu ⋅ Xuehe Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 307
SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs
Mohamad Alansari ⋅ Naufal Suryanto ⋅ Divya Velayudhan ⋅ Sajid Javed ⋅ Naoufel Werghi ⋅ Muzammal Naseer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 308
Bridging Facial Understanding and Animation via Language Models
Luchuan Song ⋅ Pinxin Liu ⋅ Haiyang Liu ⋅ Zhenchao Jin ⋅ Yolo Yunlong Tang ⋅ Zichong Xu ⋅ Susan Liang ⋅ Jing Bi ⋅ Jason J. Corso ⋅ Chenliang Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 309
AR²-4FV: Anchored Referring and Re-identification for Long-Term Grounding in Fixed-View Videos
Teng Yan ⋅ Yihan Liu ⋅ Jiongxu Chen ⋅ Teng Wang ⋅ Jiaqi LI ⋅ Bingzhuo Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 310
CVA: Context-aware Video-text Alignment for Video Temporal Grounding
Sungho Moon ⋅ Seunghun Lee ⋅ Jiwan Seo ⋅ Sunghoon Im
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 311
OmniGround: A Comprehensive Spatio-Temporal Grounding Benchmark for Real-World Complex Scenarios
Hong Gao ⋅ Jingyu Wu ⋅ Xiangkai Xu ⋅ Kangni Xie ⋅ Yunchen Zhang ⋅ Bin Zhong ⋅ Xurui Gao ⋅ Min-Ling Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 312
ST4R-Splat: Spatio-Temporal Referring Segmentation in 4D Gaussian Splatting
Yuming Meng ⋅ Dong Wu ⋅ Hongbin Zha
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 313
WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
Jian Yang ⋅ Dacheng Yin ⋅ Xiaoxuan He ⋅ Yong Li ⋅ Fengyun Rao ⋅ Jing LYU ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 314
Rejection Mixing: Fast Semantic Propagation of Mask Tokens for Efficient DLLM Inference
Yushi Ye ⋅ Feng Hong ⋅ Huangjie Zheng ⋅ Xu Chen ⋅ Zhiyong Chen ⋅ Yanfeng Wang ⋅ Jiangchao Yao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 315
Towards Unified Human Perception and Machine Understanding: Token Flow Guided Compression Framework
Li Xu ⋅ YingFu Zhang ⋅ Kepeng Xu ⋅ Gang He ⋅ Yunsong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 316
A More Word-like Image Tokenization for MLLMs
Hyun Lee ⋅ Hyemin Jeong ⋅ Yejin Kim ⋅ Hyungwook Choi ⋅ Hyunsoo Cho ⋅ Soo Kyung Kim ⋅ Joonseok Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 317
DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
Aditya Kumar Singh ⋅ Hitesh Kandala ⋅ Pratik Prabhanjan Brahma ⋅ Zicheng Liu ⋅ Emad Barsoum
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 318
Unified Spatiotemporal Token Compression for Video-LLMs at Ultra-Low Retention
Junhao Du ⋅ XUE JIALONG ⋅ Anqi Li ⋅ Jincheng Dai ⋅ Guo Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 319
One Layer’s Trash is Another Layer’s Treasure: Adaptive Layer-wise Visual Token Selection in LVLMs
Yongru Chen ⋅ Kai Zhang ⋅ Zeliang Zong ⋅ Yuchen Lu ⋅ Wenming Tan ⋅ Ye Ren ⋅ Jilin Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 320
OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
Keda Tao ⋅ Kele Shao ⋅ Bohan Yu ⋅ Weiqiang Wang ⋅ Jian liu ⋅ Huan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 321
Tunable Soft Equivariance with Guarantees
Md Ashiqur Rahman ⋅ Lim Jun Hao ⋅ Jeremiah Jiang ⋅ Teck-Yian Lim ⋅ Raymond A. Yeh
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 322
Semi-Supervised Conformal Prediction With Unlabeled Nonconformity Score
Xuanning Zhou ⋅ Zihao Shi ⋅ Hao Zeng ⋅ Xiaobo Xia ⋅ Bingyi Jing ⋅ Hongxin Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 323
Cluster-aware Anchor Learning for Multi-View Clustering
Zhe Chen ⋅ Fanhui Meng ⋅ Tianyang Xu ⋅ Xiao-Jun Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 324
Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning
Chongjie Si ⋅ Yidan Cui ⋅ Fuchao Yang ⋅ Wei Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 325
Weight Space Representation Learning via Neural Field Adaptation
Zhuoqian Yang ⋅ Mathieu Salzmann ⋅ Sabine Süsstrunk
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 326
Recurrent Video Masked Autoencoders
Daniel Zoran ⋅ Nikhil Parthasarathy ⋅ Yi Yang ⋅ Drew A Hudson ⋅ Joao Carreira ⋅ Andrew Zisserman
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 327
Revisiting Unknowns: Towards Effective and Efficient Open-Set Active Learning
Chen-Chen Zong ⋅ Yu-Qi Chi ⋅ Xie-Yang Wang ⋅ Yan Cui ⋅ Shengjun Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 328
Seeing Through the Shift: Causality-Inspired Robust Generalized Category Discovery
Wei Feng ⋅ Yiwen Jiang ⋅ Sijin Zhou ⋅ Zhuang Qi ⋅ Zhongxing Xu ⋅ Zhonghua Wang ⋅ feilong tang ⋅ Zongyuan Ge
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 329
From Exploration to Exploitation: A Two-Stage Entropy RLVR Approach for Noise-Tolerant MLLM Training
Donglai Xu ⋅ Hongzheng Yang ⋅ Yuzhi Zhao ⋅ Pingping Zhang ⋅ Jinpeng Chen ⋅ Wenao Ma ⋅ Zhijian Hou ⋅ Mengyang Wu ⋅ Xiaolei Li ⋅ Senkang Hu ⋅ Ziyi Guan ⋅ Jason Chun Lok Li ⋅ Lai-Man Po
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 330
Spatial Retrieval Augmented Autonomous Driving
Xiaosong Jia ⋅ Chenhe Zhang ⋅ Yule Jiang ⋅ Songbur Wong ⋅ Zhiyuan Zhang ⋅ chen chen ⋅ Shaofeng Zhang ⋅ Xuanhe Zhou ⋅ Xue Yang ⋅ Junchi Yan ⋅ Yu-Gang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 331
Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems
Tolga Dimlioglu ⋅ Nadine Chang ⋅ Maying Shen ⋅ Rafid Mahmood ⋅ Jose M. Alvarez
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 332
ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving
Qihang Peng ⋅ Xuesong Chen ⋅ Chenye Yang ⋅ Shaoshuai Shi ⋅ Hongsheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 333
CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography
Gasser Elazab ⋅ Frank Neuhaus ⋅ Tilman Koß ⋅ Malte Splietker ⋅ Aditya Date ⋅ Michael Unterreiner ⋅ Maximilian Jansen ⋅ Olaf Hellwich
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 334
MindDriver: Introducing Progressive Multimodal Reasoning for Autonomous Driving
Lingjun Zhang ⋅ Yujian Yuan ⋅ Changjie Wu ⋅ Xinyuan Chang ⋅ Xin Cai ⋅ Shuang Zeng ⋅ Linzhe Shi ⋅ Sijin Wang ⋅ Hang Zhang ⋅ Mu Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 335
WPT: World-to-Policy Transfer via Online World Model Distillation
Guangfeng Jiang ⋅ Yueru Luo ⋅ Jun Liu ⋅ Yi Huang ⋅ Yiyao Zhu ⋅ zhan qu ⋅ Dave Chen ⋅ Bingbing Liu ⋅ Xu Yan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 336
ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data
Yuxing Liu ⋅ Zheng Li ⋅ Huanhuan Liang ⋅ Ji Zhang ⋅ Zeyu Sun ⋅ Yong Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 337
Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction
Hao Zhou ⋅ Lu Qi ⋅ Xiangtai Li ⋅ Jie Zhang ⋅ Yi Liu ⋅ Xu Yang ⋅ Mingyu Fan ⋅ Fei Luo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 338
URScenes: A Multi-scenario Dataset for Unstructured Road Environments
runsen liu ⋅ Aizemaitijiang Baoerhan ⋅ Zhangyu Wang ⋅ Jie Wang ⋅ Jinghao Cui ⋅ Guizhen Yu ⋅ Songyue Yang ⋅ WanCheng Sun ⋅ Mingjun Tang ⋅ Zhanbo Hua ⋅ Wenwen Luo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 339
MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Driving
junli wang ⋅ Yinan Zheng ⋅ Xueyi Liu ⋅ Zebin Xing ⋅ Pengfei Li ⋅ Kun Ma ⋅ Hangjun Ye ⋅ Guang Chen ⋅ Guang Li ⋅ Long Chen ⋅ Zhongpu Xia ⋅ Qichao Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 340
SAMosaic3D: Modular Scene Assembly for Real-Time 3D Segment Anything
Peng Wang ⋅ Yongcai Wang ⋅ Wang Chen ⋅ Hualong Cao ⋅ Kang Yang ⋅ Chunxu Li ⋅ Jie Wen ⋅ Deying Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 341
Mitigating Objectness Bias and Region-to-Text Misalignment for Open-Vocabulary Panoptic Segmentation
Nikolay Kormushev ⋅ Josip Šarić ⋅ Matej Kristan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 342
MV3DIS: Multi-View Mask Matching via 3D Guides for Zero-Shot 3D Instance Segmentation
yibo zhao ⋅ Yigong Zhang ⋅ Jin Xie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 343
PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation
Gensheng Pei ⋅ Xiruo Jiang ⋅ Xinhao Cai ⋅ Tao Chen ⋅ Yazhou Yao ⋅ Byeungwoo Jeon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 344
RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation
Anuvab Sen ⋅ Mir Sayeed ⋅ Saibal Mukhopadhyay
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 345
SAMIX: Reinforcing SAM2 with Semantic Adapter and Reference Selecting Policy for Mix-Supervised Segmentation
Qiang Hu ⋅ Jiajie Wei ⋅ Zhenyu Yi ⋅ Zhifen Yan ⋅ Yingjie Guo ⋅ Hongkuan Shi ⋅ Ge-Peng Ji ⋅ Qiang Li ⋅ Zhiwei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 346
MARSS: Radar Semantic Segmentation via Modular Attention and State Space Models
fengyu chen ⋅ Tiao Tan ⋅ Teng Li ⋅ Yuantian Quan ⋅ Qingmin Liao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 347
MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention
Zilong Zhao ⋅ Zhengming Ding ⋅ Pei Niu ⋅ Wenhao Sun ⋅ Feng Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 348
Exemplar-Free Class Incremental Learning via Preserving Class-Discriminative Structure
Xin Zhang ⋅ Liang Bai ⋅ Guanchao Wang ⋅ Xian Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 349
Critical Patch-Aware Sparse Prompting with Decoupled Training for Continual Learning on the Edge
Wonseon Lim ⋅ Jaesung Lee ⋅ Dae-Won Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 350
PACT: Phase-Like Transition Constraints in Adapter-Based Continual Learning of Vision-Language Models
Xuan Wang ⋅ Guiguang Ding ⋅ Jungong Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 351
Representation-Steered Incremental Adapter-Tuning for Class-Incremental Learning with Pre-Trained Models
Jiarui Zhao ⋅ Libo Huang ⋅ Xiangqi Li ⋅ Zhulin An ⋅ Chuanguang Yang ⋅ Yu Wang ⋅ boyu diao ⋅ Yongjun Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 352
Re-evaluating Continual VQA: Toward Fair and Robust Evaluation for Multimodal Continual Learning
Zijian Gao ⋅ Zicheng Sun ⋅ Xingxing Zhang ⋅ Kele Xu ⋅ Huaimin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 353
Distilling Balanced Knowledge from a Biased Teacher
Seonghak Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 354
Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting
Hyeonseo Jang ⋅ Hyuk Kwon ⋅ Kibok Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 355
Beyond Myopic Alignment: Lookahead Optimization for Online Class-Incremental Learning
Song Lai ⋅ Zhe Zhao ⋅ Fei Zhu ⋅ Ji Cheng ⋅ Xi Lin ⋅ Qingfu Zhang ⋅ Gaofeng Meng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 356
EmoDiffTalk: Emotion-aware Diffusion for Editable 3D Gaussian Talking Head
Chang Liu ⋅ Tianjiao Jing ⋅ Chengcheng Ma ⋅ Xuanqi Zhou ⋅ Zhengxuan Lian ⋅ Qin Jin ⋅ Hongliang Yuan ⋅ Shi-Sheng Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 357
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
Taekyung Ki ⋅ Sangwon Jang ⋅ Jaehyeong Jo ⋅ Jaehong Yoon ⋅ Sung Ju
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 358
D^3FER: Dual Channel and Dual Branch Network for Robust Facial Expression Recognition under Dual Challenges
Hui Tang ⋅ Yifan He ⋅ Zhong Jin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 359
HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image
Hezhen Hu ⋅ Wangbo Zhao ⋅ Lanqing Guo ⋅ Hanwen Jiang ⋅ Jonathan C. Liu ⋅ Zhiwen Fan ⋅ Kai Wang ⋅ Zhangyang Wang ⋅ Georgios Pavlakos
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 360
ExpPortrait: Expressive Portrait Generation via Personalized Representation
Junyi Wang ⋅ Yudong Guo ⋅ Boyang Guo ⋅ Shengming Yang ⋅ Juyong Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 361
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Zhiyuan Li ⋅ Chi-Man Pun ⋅ Chen Fang ⋅ Jue Wang ⋅ Xiaodong Cun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 362
ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation
Wei Xue ⋅ Mingcheng Li ⋅ Xuecheng Wu ⋅ Jingqun Tang ⋅ Dingkang Yang ⋅ Lihua Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 363
OptiMVMap: Offline Vectorized Map Construction via Optimal Multi-vehicle Perspectives
Zedong Dan ⋅ Zijie Wang ⋅ Wei Zhang ⋅ Xiangru Lin ⋅ Weiming Zhang ⋅ Xiao Tan ⋅ Jingdong Wang ⋅ Liang Lin ⋅ Guanbin Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 364
CogDriver: Integrating Cognitive Inertia for Temporally Coherent Planning in Autonomous Driving
Pei Liu ⋅ Qingtian Ning ⋅ Xinyan Lu ⋅ Haipeng LIU ⋅ Weiliang Ma ⋅ Dangen She ⋅ XianPeng Lang ⋅ Jun Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 365
TopoHR: Hierarchical Centerline Representation for Cyclic Topology Reasoning in Driving Scenes with Point-to-Instance Relations
Yifeng Bai ⋅ Zhirong Chen ⋅ Bo Song ⋅ Erkang Cheng ⋅ Haibin Ling
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 366
AURA: Multi-modal Shared Autonomy for Urban Navigation
Yukai Ma ⋅ Honglin He ⋅ Selina Song ⋅ Wayne Wu ⋅ Bolei Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 367
Zero-Shot Reconstruction of Animatable 3D Avatars with Cloth Dynamics from a Single Image
Joohyun Kwon ⋅ Geonhee Sim ⋅ Gyeongsik Moon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 368
FlexAvatar: Learning Complete 3D Head Avatars with Partial Supervision
Tobias Kirschstein ⋅ Simon Giebenhain ⋅ Matthias Nießner
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 369
Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining
Junxuan Li ⋅ Rawal Khirodkar ⋅ Egor Zakharov ⋅ Jihyun Lee ⋅ Zhaoen Su ⋅ Yuan Dong ⋅ Julieta Martinez ⋅ Kai Li ⋅ Qingyang Tan ⋅ Takaaki Shiratori ⋅ Matthew Hu ⋅ Peihong Guo ⋅ Xuhua Huang ⋅ Zhongshi Jiang ⋅ LINGCHEN YANG ⋅ Ariyan Zarei ⋅ Marco Pesavento ⋅ Yichen Xu ⋅ Chengan He ⋅ He Wen ⋅ Giljoo Nam ⋅ Teng Deng ⋅ Wyatt Borsos ⋅ Anjali Thakrar ⋅ Jean-Charles Bazin ⋅ Rinat Abdrashitov ⋅ Carsten Stoll ⋅ Ginés Hidalgo ⋅ James Booth ⋅ Lucy Wang ⋅ Xiaowen Ma ⋅ Yu Rong ⋅ Sairanjith Thalanki ⋅ Chen Cao ⋅ Christian Häne ⋅ Abhishek Kar ⋅ Sofien Bouaziz ⋅ Jason Saragih ⋅ Yaser Sheikh ⋅ Shunsuke Saito
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 370
UIKA: Fast Universal Head Avatar from Pose-Free Images
Zijian Wu ⋅ Boyao Zhou ⋅ Liangxiao Hu ⋅ Hongyu Liu ⋅ Yuan Sun ⋅ Xuan Wang ⋅ Xun Cao ⋅ Yujun Shen ⋅ Hao Zhu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 371
FlexAvatar: Flexible Large Reconstruction Model for Animatable Gaussian Head Avatars with Detailed Deformation
Cheng Peng ⋅ Zhuo Su ⋅ Liao Wang ⋅ Chen Guo ⋅ Zhaohu Li ⋅ Chengjiang Long ⋅ Zheng Lv ⋅ Jingxiang Sun ⋅ Chenyangguang Zhang ⋅ Yebin Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 372
First Logit Boosting: Visual Grounding Method to Mitigate Object Hallucination in Large Vision-Language Models
Jiwoo Ha ⋅ Jongwoo Baek ⋅ Jinhyun So
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 373
Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation
Tiantian Dang ⋅ Chao Bi ⋅ Shufan Shen ⋅ Jinzhe Liu ⋅ Qingming Huang ⋅ Shuhui Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 374
Envision, Attend, Then Respond: Counterfactual Hallucination Mitigation in Large Vision-Language Models
Yuxuan Liang ⋅ Fan Shi ⋅ Rui Zhu ⋅ Xu Li ⋅ Xiaolei Chen ⋅ Zhe Liu ⋅ Bin Li ⋅ Xiangyang Xue
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 375
PAS: Prelim Attention Score for Detecting Object Hallucinations in Large Vision-Language Models
Nhat Hoang ⋅ Minh Vu ⋅ My T. Thai ⋅ Manish Bhattarai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 376
MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization
Ashutosh Chaubey ⋅ Jiacheng Pang ⋅ Mohammad Soleymani
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 377
Fine-Grained Multi Image Object Hallucination Benchmark
Joonki Min ⋅ Chaeyun Kim ⋅ Hyungwook Choi ⋅ Yejin Kim ⋅ Kihyun Kim ⋅ Yohan Jo ⋅ Joonseok Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 378
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee ⋅ Zhoutong Zhang ⋅ Gabriel Huang ⋅ Jui-Hsien Wang ⋅ Joon-Young Lee ⋅ Jia-Bin Huang ⋅ Eli Shechtman ⋅ Zhengqi Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 379
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
Yiming Wang ⋅ Qihang Zhang ⋅ Shengqu Cai ⋅ Tong Wu ⋅ Jan Ackermann ⋅ Zhengfei Kuang ⋅ Yang Zheng ⋅ Frano Rajič ⋅ Siyu Tang ⋅ Gordon Wetzstein
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 380
Learning to Generate Highly Dynamic Videos using Synthetic Motion Data
Wonjoon Jin ⋅ Jiyun Won ⋅ Janghyeok Han ⋅ Qi Dai ⋅ Chong Luo ⋅ Seung-Hwan Baek ⋅ Sunghyun Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 381
Stereo World Model: Camera-Guided Stereo Video Generation
Yangtian Sun ⋅ Zehuan Huang ⋅ Yifan Niu ⋅ Lin Ma ⋅ Yan-Pei Cao ⋅ Yuewen Ma ⋅ Xiaojuan Qi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 382
CG-Floor: Centroid-Guided Diffusion for Large-Scale Floorplan Generation
Hongjin Lian ⋅ Jian Ma ⋅ Hongjie Chen ⋅ Jia Li ⋅ Ruizhen Hu ⋅ Yu-Kun Lai ⋅ Kun Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 383
MAD: Motion Appearance Decoupling for efficient Driving World Models
Ahmad Rahimi ⋅ Valentin Gerard ⋅ Éloi Zablocki ⋅ Matthieu Cord ⋅ Alex Alahi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 384
VDFE: Difference-Aware 3D Scene Editing with Non-Intrusive Video Diffusion Priors for Multi-View Consistency and Efficiency
Chao Zhang ⋅ Fang Liu ⋅ Shuo Li ⋅ Yang Liu ⋅ Jiahao Wang ⋅ Xinyan Huang ⋅ Lingling Li ⋅ Puhua Chen ⋅ Xu Liu ⋅ Wenping Ma ⋅ Siqi Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 385
Endless World: Real-Time 3D-Aware Long Video Generation
Ke Zhang ⋅ Jiacong Xu ⋅ Yiqun Mei ⋅ Vishal M. Patel
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 386
SpatialDiff: 3D-Aware Object Movement via Implicit Spatial Modeling
Zheng Liu ⋅ Zijian He ⋅ Huiguo He ⋅ Weizhi Zhong ⋅ Yejun Tang ⋅ Huan Yang ⋅ Kun Gai ⋅ Guanbin Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 387
Towards Realistic and Consistent Orbital Video Generation via 3D Foundation Priors
Rong Wang ⋅ Ruyi Zha ⋅ Ziang Cheng ⋅ Jiayu Yang ⋅ Pulak Purkait ⋅ Hongdong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 388
YOLO-ULM: Ultra-Lightweight Models for Real-Time Object Detection
Shasha Han ⋅ Chong Li ⋅ Xinning Wang ⋅ Xuebo Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 389
CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild
Alex Hoi Hang Chan ⋅ Neha Singhal ⋅ Onur Kocahan ⋅ Andrea Meltzer ⋅ Saverio Lubrano ⋅ Miya Warrington ⋅ Michael Griesser ⋅ Fumihiro Kano ⋅ Hemal Naik
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 390
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection
Xu Lin ⋅ Jinlong Peng ⋅ Zhenye Gan ⋅ Jiawen Zhu ⋅ Jun Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 391
VLM4RSDet: Collaborative Optimization with Vision-Language Model for Enhancing Remote Sensing Object Detection
Shuohao Shi ⋅ Qiang Fang ⋅ Xin Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 392
WiTTA-Bench: Benchmarking Test-Time Adaptation for WiFi Sensing
Bing Li ⋅ Qiang Wang ⋅ JUNDA LU ⋅ Le Zhang ⋅ Yun Liu ⋅ Ce Zhu ⋅ Wei Cui
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 393
MFEN: Multi-Frequency Expert Network for Visible-Infrared Person Re-ID
Xulin Li ⋅ Yan Lu ⋅ Bin Liu ⋅ Qinhong Yang ⋅ Qi Chu ⋅ Tao Gong ⋅ Nenghai Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 394
Object-Generalized Re-Identification: A Step Towards Universal Instance Perception
Shuoyi Chen ⋅ Yurui Wu ⋅ Mang Ye
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 395
When Transformers Meet Mamba: A Hybrid Transformer-Mamba Network for Video Object Detection
Qiang Qi ⋅ Xiao Wang ⋅ Zongyuan Du ⋅ Yu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 396
Prompt-Anchored Vision–Text Distillation for Lifelong Person Re-identification
Wen Wen ⋅ Hao CHEN ⋅ Shiliang Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 397
HyperGait: Unleashing the Power of Parsing for Gait Recognition in the Wild via Hypergraph
Jinkai Zheng ⋅ jiaqing wei ⋅ Xinxiang Jin ⋅ Yaoqi Sun ⋅ Xichun Sheng ⋅ Ming Li ⋅ Liangqiong Qu ⋅ Xinchen Liu ⋅ Wu Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 398
Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
Yiyu Wang ⋅ Xuyang Liu ⋅ Xiyan Gui ⋅ Xinying Lin ⋅ Boxue Yang ⋅ Chenfei Liao ⋅ Tailai Chen ⋅ Linfeng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 399
Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering
Yura Choi ⋅ Roy Miles ⋅ Rolandos Alexandros Potamias ⋅ Ismail Elezi ⋅ Jiankang Deng ⋅ Stefanos Zafeiriou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 400
Beyond Caption-Based Queries in Video Moment Retrieval
David Pujol-Perich ⋅ Albert Clapés ⋅ Dima Damen ⋅ Sergio Escalera ⋅ Michael Wray
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 401
Neural-Centric Video Processing Pipeline for Unified Multi-Task Inference
Seyeon Lee ⋅ Juncheol Ye ⋅ Jaehong Kim ⋅ Dongsu Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 402
VideoRealBench: A Chain-of-Thought Realism Evaluation Benchmark for Generated Human-Centric Videos
Min Yang ⋅ Xinwen Zhang ⋅ Jialei Tang ⋅ Xin Zhou ⋅ Kehan Li ⋅ Zeyi Huang ⋅ Limin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 403
VAST: Video Ability-Stratified Taxonomy for Data-Efficient Video Reasoning
Zhongan Wang ⋅ Xiaoyu Wen ⋅ Lingxiao Du ⋅ Kun Li ⋅ zhiliang wu ⋅ Xingcheng Xu ⋅ Qiaosheng Zhang ⋅ Chaochao Lu ⋅ Hehe Fan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 404
An Empirical Study on How Video-LLMs Answer Video Questions
Chenhui Gou ⋅ Ziyu Ma ⋅ Zicheng Duan ⋅ Haoyu He ⋅ Feng Chen ⋅ Liyang Liu ⋅ Bohan Zhuang ⋅ Jianfei Cai ⋅ Hamid Rezatofighi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 405
FPSBench: A Benchmark for Video Understanding at High Frame Rates
Rohan Choudhury ⋅ Jean Dandurand ⋅ Kai Qiu ⋅ Kshitij Madhav Bhat ⋅ Kartik Sharma ⋅ Liza Dahiya ⋅ Yizhou Zhao ⋅ Souraja Kundu ⋅ Chun-Hsien Lin ⋅ Kris Kitani ⋅ László A. Jeni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 406
UniComp: Rethinking Video Compression Through Informational Uniqueness
Chao Yuan ⋅ Shimin Chen ⋅ Minliang Lin ⋅ Limeng Qiao ⋅ Guanglu Wan ⋅ Lin Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 407
NaTex: Seamless Texture Generation as Latent Color Diffusion
Zeqiang Lai ⋅ Yunfei Zhao ⋅ Zibo Zhao ⋅ Xin Yang ⋅ Xin Huang ⋅ Jingwei Huang ⋅ Xiangyu Yue ⋅ Chunchao Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 408
Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models
Rowan Bradbury ⋅ Dazhi Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 409
Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
jian ma ⋅ Qirong Peng ⋅ Xujie Zhu ⋅ Peixing Xie ⋅ Chen Chen ⋅ Haonan Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 410
Attribute-Preserving Pseudo-Labeling for Diffusion-Based Face Swapping
Jiwon Kang ⋅ Yeji Choi ⋅ JoungBin Lee ⋅ Wooseok Jang ⋅ Jinhyeok Choi ⋅ Taekeun Kang ⋅ Yongjae Park ⋅ Myungin Kim ⋅ Seungryong Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 411
Delta Rectified Flow Sampling for Text-to-Image Editing
Gaspard Beaudouin ⋅ Minghan LI ⋅ Jaeyeon Kim ⋅ Sung-Hoon Yoon ⋅ Mengyu Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 412
Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers
Wongi Jeong ⋅ Kyungryeol Lee ⋅ Hoigi Seo ⋅ Se Young Chun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 413
SpotEdit: Selective Region Editing in Diffusion Transformers
ZHIBIN QIN ⋅ Zhenxiong Tan ⋅ Zeqing Wang ⋅ Songhua Liu ⋅ Xinchao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 414
All-in-One Slider for Attribute Manipulation in Diffusion Models
Weixin Ye ⋅ Hongguang Zhu ⋅ Wei Wang ⋅ Yahui Liu ⋅ Mengyu Wang ⋅ Xuecheng Nie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 415
DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment
Xin Cai ⋅ Zhiyuan You ⋅ Zhoutong Zhang ⋅ Tianfan Xue
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 416
From Sketch to Fresco: Efficient Diffusion Transformer with Progressive Resolution
Shikang Zheng ⋅ Guantao Chen ⋅ Landis He ⋅ Jiacheng Liu ⋅ Yuqi Lin ⋅ Chang Zou ⋅ Linfeng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 417
CATNet: Collaborative Alignment and Transformation Network for Cooperative Perception
Gong Chen ⋅ Chaokun Zhang ⋅ Tao Tang ⋅ Pengcheng Lv ⋅ Feng Li ⋅ Xin Xie
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 418
Scene Reconstruction as Mapping Priors for 3D Detection
Yang Fu ⋅ Yuliang Zou ⋅ Hao Xiang ⋅ Xin Huang ⋅ Yijing Bai ⋅ Chen Song ⋅ Weijing Shi ⋅ Govind Thattai ⋅ Dragomir Anguelov ⋅ Mingxing Tan ⋅ Yingwei Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 419
CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection
Yuchen Wu ⋅ Kun Wang ⋅ Yining Pan ⋅ Na Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 420
Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection
Zhihao Zhang ⋅ Abhinav Kumar ⋅ Girish Chandar ⋅ Xiaoming Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 421
R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection
Zhongyu Xia ⋅ Yousen Tang ⋅ Yongtao Wang ⋅ Zhifeng Wang ⋅ Weijun Qin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 422
Revisiting Token Compression for Accelerating ViT-based Sparse Multi-View 3D Object Detectors
Mingqian Ji ⋅ Shanshan Zhang ⋅ Jian Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 423
Few-Shot Incremental 3D Object Detection in Dynamic Indoor Environments
Yun Zhu ⋅ Jianjun Qian ⋅ Jian Yang ⋅ Jin Xie ⋅ Na Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 424
Learning from Synthetic Data via Provenance-Based Input Gradient Guidance
Koshiro Nagano ⋅ Ryo Fujii ⋅ Ryo Hachiuma ⋅ Fumiaki Sato ⋅ Taiki Sekii ⋅ HIDEO SAITO
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 425
Seeing Clearly, Reasoning Confidently: Plug-and-Play Remedies for Vision Language Model Blindness
Xin Hu ⋅ Haomiao Ni ⋅ Yunbei Zhang ⋅ Jihun Hamm ⋅ Zechen Li ⋅ Zhengming Ding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 426
Draft and Refine with Visual Experts
SungHeon Jeong ⋅ Ryozo Masukawa ⋅ Jihong Park ⋅ Sanggeon Yun ⋅ Wenjun Huang ⋅ Hanning Chen ⋅ Mahdi Imani ⋅ Mohsen Imani
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 427
R2G: A Multi-View Circuit Graph Benchmark Suite from RTL to GDSII
ZEWEI ZHOU ⋅ Jiajun Zou ⋅ Jiajia Zhang ⋅ Ao Yang ⋅ Ruichao He ⋅ Haozheng Zhou ⋅ Ao Liu ⋅ Jiawei Liu ⋅ Leilei Jin ⋅ Shan Shen ⋅ Daying Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 428
VQ-VA World: Towards High-Quality Visual Question-Visual Answering
Chenhui Gou ⋅ Zilong Chen ⋅ Zeyu Wang ⋅ Feng Li ⋅ Deyao Zhu ⋅ Zicheng Duan ⋅ Kunchang Li ⋅ Chaorui Deng ⋅ Hongyi Yuan ⋅ Haoqi Fan ⋅ Cihang Xie ⋅ Jianfei Cai ⋅ Hamid Rezatofighi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 429
Cross-Domain Demo-to-Code via Neurosymbolic Counterfactual Reasoning
Jooyoung Kim ⋅ Wonje Choi ⋅ Younguk Song ⋅ Honguk Woo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 430
Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT
Yesheng Liu ⋅ Hao Li ⋅ Haiyu Xu ⋅ Baoqi Pei ⋅ Jiahao Wang ⋅ Mingxuan Zhao ⋅ Jing-Shu Zheng ⋅ Zheqi He ⋅ JG Yao ⋅ Xi Yang ⋅ Bowen Qin ⋅ Jiajun Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 431
See Further, Think Deeper: Advancing VLM's Reasoning Ability with Low-level Visual Cues and Reflection
Zhiheng Wu ⋅ Tong Wang ⋅ Shuning Wang ⋅ Naiming Liu ⋅ Yumeng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 432
PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning
Hee Suk Yoon ⋅ Eunseop Yoon ⋅ Ji Woo Hong ⋅ SooHwan Eom ⋅ Gwanhyeong Koo ⋅ Mark Hasegawa-Johnson ⋅ Qi Dai ⋅ Chong Luo ⋅ Chang D. Yoo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 433
μVLM: A Vision Language Model for μNPUs
Zijie Chen ⋅ Guiyun Fan ⋅ Zhaoxing Yang ⋅ Rong Ding ⋅ Haiming Jin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 434
Gaussian Mapping for Evolving Scenes
Vladimir Yugay ⋅ Thies Kersten ⋅ Luca Carlone ⋅ Theo Gevers ⋅ Martin R. Oswald ⋅ Lukas Schmid
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 435
Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting
Tianjiao Yu ⋅ Vedant Shah ⋅ Muntasir Wahed ⋅ Ying Shen ⋅ Kiet A. Nguyen ⋅ Ismini Lourentzou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 436
AnchorSplat: Feed-Forward 3D Gaussian Splatting With 3D Geometric Priors
Xiaoxue Zhang ⋅ Xiaoxu Zheng ⋅ Yixuan Yin ⋅ Tiao Zhao ⋅ Kaihua Tang ⋅ Michael Bi Mi ⋅ Zhan Xu ⋅ Dave Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 437
SGAD-SLAM: Splatting Gaussians at Adjusted Depth for Better Radiance Fields in RGBD SLAM
Pengchong Hu ⋅ Zhizhong Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 438
Faster-GS: Analyzing and Improving Gaussian Splatting Optimization
Florian Hahlbohm ⋅ Linus Franke ⋅ Martin Eisemann ⋅ Marcus Magnor
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 439
Layered 4D-Rotor Gaussian Splatting: A Compressed Representation for Long Dynamic Scenes
Hanjie Xu ⋅ Yuanxing Duan ⋅ Qiyu Dai ⋅ Ge Li ⋅ Baoquan Chen ⋅ He Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 440
GaussianGrow: Geometry-aware Gaussian Growing from 3D Point Clouds with Text Guidance
Weiqi Zhang ⋅ Junsheng Zhou ⋅ Haotian Geng ⋅ Kanle Shi ⋅ Shenkun Xu ⋅ Yi Fang ⋅ Yu-Shen Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 441
PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation
Samarth Chopra ⋅ Jing Liang ⋅ Gershom Seneviratne ⋅ Dinesh Manocha
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 442
3D Gaussian Splatting at Arbitrary Resolutions with Compact Proxy Anchors
Mingyun Jeong ⋅ Seongro Yoon ⋅ Francois Bremond ⋅ Donghyeon Cho
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 443
Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting
Peiyu Xu ⋅ Shuang Zhao ⋅ Xin Sun ⋅ Krishna Mullia ⋅ Raymond Fei ⋅ Iliyan Georgiev
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 444
AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction
Hanyang Liu ⋅ Rongjun Qin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 445
GaussianPile: A Unified Sparse Gaussian Splatting Framework for Slice-based Volumetric Reconstruction
Di Kong ⋅ Yikai Wang ⋅ Wenjie Guo ⋅ Yifan Bu ⋅ Boya Zhang ⋅ Yuexin Duan ⋅ Xiawei Yue ⋅ Wenbiao Du ⋅ Yiman Zhong ⋅ Yuwen Chen ⋅ Cheng Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 446
More Natural, More Real: Object-aware Gaussian Splatting for 3D Visual Decoding from Human Brain
Haodong Jing ⋅ Dongyao Jiang ⋅ Jixin Wang ⋅ Junhao Jia ⋅ Yanshu Li ⋅ Yongqiang Ma ⋅ Nanning Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 447
Eulerian Gaussian Splatting using Hashed Probability Pyramids
Mia Gaia Polansky ⋅ George Kopanas ⋅ Stephan Garbin ⋅ Todd Zickler ⋅ Dor Verbin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 448
Confidence-Guided Multi-Scale Aggregation for Sparse-View High-Resolution 3D Gaussian Splatting
Qinzheng Zhou ⋅ Zaychik Liu ⋅ Lijing Lu ⋅ Zhihang Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 449
ULF-Loc: Unbiased Landmark Feature for Robust Visual Localization with 3D Gaussian Splatting
Yingdong Gu ⋅ Shaocheng Yan ⋅ Zhenjun Zhao ⋅ Yuan Kou ⋅ Jianxin Luo ⋅ Pengcheng Shi ⋅ Jiayuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 450
Robust3DGSW: Toward Robust Watermarking for Quantization-Aware 3D Gaussian Splatting
Boyu Wang ⋅ Jun Xia ⋅ Mingsong Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 451
ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking
Xiaobao Wei ⋅ Zhangjie Ye ⋅ Yuxiang Gu ⋅ Zunjie Zhu ⋅ Yunfei Guo ⋅ Yingying Shen ⋅ Shan Zhao ⋅ Ming Lu ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Guang Chen ⋅ Rongfeng Lu ⋅ Hangjun Ye
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 452
L^2DGS: Low-Light Dynamic Gaussian Splatting
Ashish Kumar ⋅ A. N. Rajagopalan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 453
Probabilistic Concept Graph Reasoning for Multimodal Misinformation Detection
Ruichao Yang ⋅ Wei Gao ⋅ Xiaobin Zhu ⋅ Jing Ma ⋅ Hongzhan Lin ⋅ Ziyang Luo ⋅ Bo-Wen Zhang ⋅ Xu-Cheng Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 454
POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs
Haicheng Wang ⋅ Yuan Liu ⋅ Yikun Liu ⋅ Zhemeng Yu ⋅ Zhongyin Zhao ⋅ Yangxiu You ⋅ Zilin Yu ⋅ Le Tian ⋅ Zhou Xiao ⋅ Jie Zhou ⋅ Weidi Xie ⋅ Yanfeng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 455
SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation
Zhenyu Lu ⋅ Liupeng Li ⋅ Jinpeng Wang ⋅ Haoqian Kang ⋅ Yan Feng ⋅ Ke Chen ⋅ Yaowei Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 456
CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning
Junyoung Sung ⋅ Seungwoo Lyu ⋅ Minjun Kim ⋅ Sumin An ⋅ Arsha Nagrani ⋅ Paul Hongsuck Seo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 457
DeepScan: A Training-Free Framework for Visually Grounded Reasoning in Large Vision-Language Models
Yangfu Li ⋅ Hongjian Zhan ⋅ Jiawei Chen ⋅ YUNING GONG ⋅ Qi Liu ⋅ Yue Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 458
Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images
Yikun Ji ⋅ Yan Hong ⋅ Bowen Deng ⋅ Jun Lan ⋅ Huijia Zhu ⋅ Weiqiang Wang ⋅ Liqing Zhang ⋅ Jianfu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 459
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
Jiajun Zhang ⋅ Shijia Luo ⋅ Ruikang Zhang ⋅ Qi Su
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 460
CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning
Qi Song ⋅ Honglin Li ⋅ Yingchen Yu ⋅ Haoyi Zhou ⋅ Lin Yang ⋅ Song Bai ⋅ Qi She ⋅ Zilong Huang ⋅ Yunqing Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 461
Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token
Anqi Zhang ⋅ Xiaokang Ji ⋅ Guangyu Gao ⋅ Jianbo Jiao ⋅ Chi Harold Liu ⋅ Yunchao Wei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 462
Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models
SIQI LIU ⋅ Xinyang Li ⋅ Bochao Zou ⋅ Junbao Zhuo ⋅ Huimin Ma ⋅ Jiansheng Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 463
Mario: Multimodal Graph Reasoning with Large Language Models
Yuanfu Sun ⋅ Kang Li ⋅ Pengkang Guo ⋅ Jiajin Liu ⋅ Qiaoyu Tan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 464
Boosting Reasoning in Large Multimodal Models via Activation Replay
Yun Xing ⋅ Xiaobin Hu ⋅ Qingdong He ⋅ Jiangning Zhang ⋅ Shuicheng Yan ⋅ Shijian Lu ⋅ Yu-Gang Jiang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 465
Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought
Shin'ya Yamaguchi ⋅ Kosuke Nishida ⋅ Daiki Chijiwa
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 466
Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding
Jianghao Yin ⋅ Qingbin Li ⋅ KUN SUN ⋅ Cheng Ding ⋅ Jie Wang ⋅ Qin Chen ⋅ Jie Zhou ⋅ Nan Wang ⋅ Changqing Li ⋅ Pei Wu ⋅ Jian Xu ⋅ Zheming Yang ⋅ Liang He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 467
ROSE: Rotate Your Large Language Model to See
Tongtian Yue ⋅ Xuange Gao ⋅ Longteng Guo ⋅ Zijia Zhao ⋅ Zikang Liu ⋅ Jie Jiang ⋅ Hua Huang ⋅ Jing Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 468
OpenMMReasoner: Pushing the Frontiers in Multimodal Reasoning with an Open and General Recipe
Kaichen Zhang ⋅ Keming Wu ⋅ Zuhao Yang ⋅ Bo Li ⋅ Kairui Hu ⋅ Bin Wang ⋅ Xingxuan Li ⋅ Lidong Bing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 469
SelecTKD: Selective Token-Weighted Knowledge Distillation for LLMs
Haiduo Huang ⋅ Jiangcheng Song ⋅ Yadong Zhang ⋅ Pengju Ren
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 470
Sparsity as a Key: Unlocking New Insights from Latent Structures for Out-of-Distribution Detection
Ahyoung Oh ⋅ Wonseok Shin ⋅ Songkuk Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 471
SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration
Zekun Li ⋅ wang ning ⋅ Tongxin Bai ⋅ Changwang Mei ⋅ Ning Wang ⋅ Shuang Qiu ⋅ Jian Cheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 472
Suppressing Non-Semantic Noise in Masked Image Modeling Representations
Martine Hjelkrem-Tan ⋅ Marius Aasan ⋅ Rwiddhi Chakraborty ⋅ Gabriel Y. Arteaga ⋅ Changkyu Choi ⋅ Adín Ramírez Rivera
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 473
Block-based Learned Image Compression without Blocking Artifacts
Jong Wook Kim ⋅ Suyong Bahk ⋅ TaeHwa Lee ⋅ HyunDong CHO ⋅ Donghyun Kim ⋅ Sung-Chang Lim ⋅ Jin Soo Choi ⋅ Hui Yong Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 474
DeDelayed: Deleting Remote Inference Delay via On-Device Correction
Dan Jacobellis ⋅ Mateen Ulhaq ⋅ Fabien Racapé ⋅ Hyomin Choi ⋅ Neeraja Yadwadkar
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 475
AdaRadar: Rate Adaptive Spectral Compression for Radar-based Perception
Jinho Park ⋅ Se Young Chun ⋅ Mingoo Seok
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 476
Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery
Yiming Zeng ⋅ Xile Zhao ⋅ Wei-Hao Wu ⋅ Teng-Yu Ji ⋅ Chao Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 477
Precise Object and Effect Removal with Adaptive Target-Aware Attention
Jixin Zhao ⋅ Zhouxia Wang ⋅ Peiqing Yang ⋅ Shangchen Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 478
Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression
Nazia Tasnim ⋅ Shrimai Prabhumoye ⋅ Bryan A. Plummer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 479
FreqSIC: Frequency-aware Stereo Image Compression with Bi-directional Checkerboard Context Model
Shiyu Qin ⋅ Yongkang Lu ⋅ Yimin Zhou ⋅ Jiawei Li ⋅ Yifan Ren ⋅ Yuerong Xue ⋅ Shu-Tao Xia ⋅ Bin Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 480
SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
CHEN Yang ⋅ Xieyuanli Chen ⋅ Junxiang Li ⋅ Jie Tang ⋅ Tao Wu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 481
Fusion of Depth and Semantics for Probabilistic Floorplan Localization
Kecheng Ye ⋅ Mao Chen ⋅ Xiangkai Zhang ⋅ Xu Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 482
A2GC: Asymmetric Aggregation with Geometric Constraints for Locally Aggregated Descriptors
Zhenyu Li ⋅ Tianyi Shang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 483
Geo2: Geometry-Guided Cross-view Geo-Localization and Image Synthesis
Yancheng Zhang ⋅ Xiaohan Zhang ⋅ Guangyu Sun ⋅ Zonglin Lyu ⋅ Safwan Wshah ⋅ Chen Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 484
Coverage Optimization for Camera View Selection
Timothy Chen ⋅ Adam Dai ⋅ Maximilian Adang ⋅ Grace Gao ⋅ Mac Schwager
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 485
Resolving Evidence Sparsity: Agentic Context Engineering for Long-Document Understanding
Keliang Liu ⋅ Zizhi Chen ⋅ Mingcheng Li ⋅ Jingqun Tang ⋅ Dingkang Yang ⋅ Lihua Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 486
Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs
Rujiao Long ⋅ Yang Li ⋅ Xingyao Zhang ⋅ Weixun Wang ⋅ Tianqianjin Lin ⋅ Xi Zhao ⋅ Yuchi Xu ⋅ Wenbo Su ⋅ Junchi Yan ⋅ Bo Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 487
ORCA: Orchestrated Reasoning with Collaborative Agents for Document Visual Question Answering
Aymen Lassoued ⋅ Mohamed Ali Souibgui ⋅ Yousri Kessentini
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 488
MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding
Wenhui Tan ⋅ Xiaoyi Yu ⋅ Jiaze Li ⋅ Yijing Chen ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Ruihua Song ⋅ Jian Luan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 489
A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning
Yichang Xu ⋅ Gaowen Liu ⋅ Ramana Kompella ⋅ Tiansheng Huang ⋅ Sihao Hu ⋅ Fatih Ilhan ⋅ Selim Tekin ⋅ Zachary Yahn ⋅ Ling Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 490
Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning
Jingbo Sun ⋅ Qichao Zhang ⋅ Songjun Tu ⋅ Xing Fang ⋅ Yupeng Zheng ⋅ Haoran Li ⋅ Ke Chen ⋅ Dongbin Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 491
LensWalk: Agentic Video Understanding by Planning How You See in Videos
Keliang Li ⋅ Yansong Li ⋅ Hongze Shen ⋅ Mengdi Liu ⋅ Hong Chang ⋅ Shiguang Shan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 492
DPGF-Net: Dual-Prior Guided Fusion Network for Joint Assessment of Perceptual Quality and Semantic Consistency in AI-Generated Images
Tao Li ⋅ Xingran LIAO ⋅ Mingliang Zhou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 493
RegionFuse: Region-Adaptive Pixel Distribution Learning for Infrared and Visible Image Fusion
Jianghan Xia ⋅ Hong Song ⋅ Jinfu Li ⋅ Yucong Lin ⋅ Shihan Ma ⋅ Jingfan Fan ⋅ Danni Ai ⋅ Tianyu Fu ⋅ Deqiang Xiao ⋅ Jian Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 494
Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared
Yafei Zhang ⋅ Meng Ma ⋅ Huafeng Li ⋅ Yu Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 495
VideoFusion: A Spatio-Temporal Collaborative Network for Multi-modal Video Fusion
Linfeng Tang ⋅ Yeda Wang ⋅ Meiqi Gong ⋅ Zizhuo Li ⋅ Yuxin Deng ⋅ Xunpeng Yi ⋅ Chunyu Li ⋅ Han Xu ⋅ HAO ZHANG ⋅ Jiayi Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 496
TAPE: Task-Adaptive Prototype Evolution in Audio-Language Models for Fully Few-shot Class-incremental Audio Classification
Yunlong Gao ⋅ Wenxin Liang ⋅ Guanglu Wang ⋅ Senqi Guan ⋅ Linlin Zong ⋅ Dongyu Zhang ⋅ Xinyue Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 497
Remedying Target-Domain Astigmatism for Cross-Domain Few-Shot Object Detection
Yongwei Jiang ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 498
DDSF: Robust Few-Shot Learning via Disentangled Subspaces with Determinantal Point Process
xulun ye ⋅ Yifan Mei ⋅ Kun Zhou ⋅ Zelei Wu ⋅ Jieyu Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 499
Hyperbolic Defect Feature Synthesis for Few-Shot Defect Classification
Huimin Li ⋅ Boxuan Hu ⋅ Yulin Zhang ⋅ Xiuzhuang Zhou ⋅ Junlin Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 500
Training-Only Heterogeneous Image-Patch-Text Graph Supervision for Advancing Few-Shot Learning Adapters
Mohammed Rahman Sherif Khan Mohammad ⋅ Ardhendu Behera ⋅ Sandip Pradhan ⋅ Swagat Kumar ⋅ Amr Ahmed
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 501
Learning to Learn Weight Generation via Local Consistency Diffusion
Yunchuan Guan ⋅ Yu Liu ⋅ Ke Zhou ⋅ Zhiqi Shen ⋅ Jenq-Neng Hwang ⋅ Lei Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 502
Balanced Dataset Distillation via Modeling Multiple Visual Pattern Distribution
Guanghui Shi ⋅ Xuefeng Liang ⋅ Qixiang Wen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 503
Grid Distillation: Compositional Image Distillation via Structured Generative Grids
Biplab Ch Das ⋅ Shouvik Das ⋅ Viswanath Gopalakrishnan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 504
Dataset Distillation by Influence Matching
Haoru Tan ⋅ Wang Wang ⋅ WU Sitong ⋅ Xiuzhe Wu ⋅ Yangtian Sun ⋅ Chirui Chang ⋅ Shaofeng Zhang ⋅ Xiaojuan Qi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 505
StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning
Giuseppe Vecchio
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 506
Seeing Through Blur: Tackling Defocus in Spike-Based Imaging
Xiantao Ma ⋅ Siwei Dong ⋅ Lin Zhu ⋅ Lizhi Wang ⋅ Hua Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 507
Distilling Quasi-Conformal Mapping: A Generalizable and Efficient Solution for Wide-Angle Correction
Chengyang Liu ⋅ Zixuan Lin ⋅ Miaolin Han ⋅ Michael K. Ng ⋅ huibin Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 508
Lighting in Motion: Spatiotemporal HDR Lighting Estimation
Christophe Bolduc ⋅ Julien Philip ⋅ Li Ma ⋅ Mingming He ⋅ Paul Debevec ⋅ Jean-François Lalonde
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 509
LightRR: A Lightweight Network for Single Image Reflection Removal
Wenbin Yin ⋅ Junkang Zhang ⋅ Sunzhe Yang ⋅ Faming Fang ⋅ Guixu Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 510
HFR and HDR Video from Multi-Attenuated Spikes Using a Rapidly Rotating SpokeND Filter
Yakun Chang ⋅ Zhaojun Huang ⋅ Siqi Yang ⋅ Yeliduosi Xiaokaiti ⋅ Shikui Wei ⋅ Yao Zhao ⋅ Tiejun Huang ⋅ Boxin Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 511
Coded-E2LF: Coded Aperture Light Field Imaging from Events
Tomoya Tsuchida ⋅ Keita Takahashi ⋅ Chihiro Tsutake ⋅ Toshiaki Fujii ⋅ Hajime Nagahara
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 512
TokenLight: Precise Lighting Control in Images using Attribute Tokens
Sumit Chaturvedi ⋅ Yannick Hold-Geoffroy ⋅ Mengwei Ren ⋅ Jingyuan Liu ⋅ He Zhang ⋅ Yiqun Mei ⋅ Julie Dorsey ⋅ ZHIXIN SHU
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 513
Kaleidoscopic Scintillation Event Imaging
Alex Bocchieri ⋅ John Mamish ⋅ David Appleyard ⋅ Andreas Velten
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 514
gQIR: Generative Quanta Image Reconstruction
Aryan Garg ⋅ Sizhuo Ma ⋅ Mohit Gupta
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 515
Solving Minimal Problems Without Matrix Inversion Using FFT-Based Interpolation
Haidong Wu ⋅ Snehal Bhayani ⋅ Janne Heikkilä
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 516
Predicting Spatial Transcriptomics from Histology Images via High-Order Multi-Cell Interaction Modeling
Youhan Sun ⋅ Jiahua Rao ⋅ Kangrui Du ⋅ Jiancong Xie ⋅ Yuedong Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 517
From Spots to Pixels: Dense Spatial Gene Expression Prediction from Histology Images
Ruikun Zhang ⋅ Yan Yang ⋅ Liyuan Pan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 518
Cell-Type Prototype-Informed Neural Network for Gene Expression Estimation from Pathology Images
Kazuya Nishimura ⋅ Ryoma Bise ⋅ Shinnosuke Matsuo ⋅ Haruka Hirose ⋅ Yasuhiro Kojima
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 519
LightSplat: Fast and Memory-Efficient Open-Vocabulary 3D Scene Understanding in Five Seconds
Jaehun Bang ⋅ Jinhyeok Kim ⋅ Minji Kim ⋅ Seungheon Jeong ⋅ Kyungdon Joo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 520
Guardians of the Hair: Rescuing Soft Boundaries in Depth, Stereo, and Novel Views
Xiang Zhang ⋅ Yang Zhang ⋅ Lukas Mehl ⋅ Markus Gross ⋅ Christopher Schroers
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 521
Zero-Shot Depth Completion with Vision-Language Model
Zhiqiang Yan ⋅ Yuan Wu ⋅ Gim Hee Lee
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 522
FE2E: From Editor to Dense Geometry Estimator
jiyuan WANG ⋅ Chunyu Lin ⋅ Lei Sun ⋅ Rongying Liu ⋅ Lang Nie ⋅ Mingxing Li ⋅ Kang Liao ⋅ Xiangxiang Chu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 523
Ego-1K – A Large-Scale Multiview Video Dataset for Egocentric Vision
Jae Yong Lee ⋅ Daniel Scharstein ⋅ Akash Bapat ⋅ Hao Hu ⋅ Andrew Fu ⋅ Haoru Zhao ⋅ Paul Sammut ⋅ Xiang Li ⋅ Stephen Jeapes ⋅ Anik Gupta ⋅ Lior David ⋅ Saketh Madhuvarasu ⋅ Jay Girish Joshi ⋅ Jason Wither
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 524
Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing
SeongRae Noh ⋅ SeungWon Seo ⋅ Gyeong-Moon Park ⋅ HyeongYeop Kang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 525
VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation
Jiayi Yuan ⋅ Haobo Jiang ⋅ De Wen Soh ⋅ Na Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 526
NI-Tex: Non-isometric Image-based Garment Texture Generation
Hui Shan ⋅ Ming Li ⋅ Haitao Yang ⋅ Kai Zheng ⋅ Sizhe Zheng ⋅ Yanwei Fu ⋅ Xiangru Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 527
Velox: Learning Representations of 4D Geometry and Appearance
Anagh Malik ⋅ Dorian Chan ⋅ Xiaoming Zhao ⋅ David B. Lindell ⋅ Oncel Tuzel ⋅ Rick Chang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 528
UniPixie: Unified and Probabilistic 3D Physics Learning via Flow Matching
Qilin Huang ⋅ Quynh Anh Huynh ⋅ Long Le ⋅ Chen Wang ⋅ Chuhao Chen ⋅ Ryan Lucas ⋅ Eric Eaton ⋅ Lingjie Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 529
UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
Yixun Liang ⋅ Kunming Luo ⋅ Xiao Chen ⋅ Rui Chen ⋅ Jiawei Zhou ⋅ Weiyu Li ⋅ Jiarui Liu ⋅ Fei-Peng Tian ⋅ Ping Tan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 530
Points-to-3D: Structure-Aware 3D Generation with Point Cloud Priors
Jiatong Xia ⋅ Zicheng Duan ⋅ Anton van den Hengel ⋅ Lingqiao Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 531
PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion
Yichen Yang ⋅ Hong Li ⋅ Haodong Zhu ⋅ linin ⋅ guojun lei ⋅ Sheng Xu ⋅ Baochang Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 532
LoST: Level of Semantics Tokenization for 3D Shapes
Niladri Shekhar Dutt ⋅ Zifan Shi ⋅ Paul Guerrero ⋅ Chun-Hao Huang ⋅ Duygu Ceylan ⋅ Niloy J. Mitra ⋅ Xuelin Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 533
Lafite: A Generative Latent Field for 3D Native Texturing
Chia-Hao Chen ⋅ Yuanchen Guo ⋅ Zi-Xin Zou ⋅ Ze Yuan ⋅ Guan Luo ⋅ Xiaojuan Qi ⋅ Ding Liang ⋅ Yan-Pei Cao ⋅ Song-Hai Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 534
Image-Guided Geometric Stylization of 3D Meshes
Changwoon Choi ⋅ Hyunsoo Lee ⋅ Clément Jambon ⋅ Yael Vinker ⋅ Young Min Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 535
LATTICE: Democratize High-Fidelity 3D Generation at Scale
Zeqiang Lai ⋅ Yunfei Zhao ⋅ Zibo Zhao ⋅ Haolin Liu ⋅ Qingxiang Lin ⋅ Jingwei Huang ⋅ Chunchao Guo ⋅ Xiangyu Yue
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 536
Dehallu3D: Hallucination-Mitigated 3D Generation from a Single Image via Cyclic View Consistency Refinement
Xiwen Wang ⋅ Shichao Zhang ⋅ Ruowei Wang ⋅ mao li ⋅ Chenyu Zhou ⋅ Ji-Zhe Zhou ⋅ Qijun Zhao ⋅ Hailun Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 537
MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly
Rui Xu ⋅ Tianyang Xue ⋅ Qiujie Dong ⋅ Le Wan ⋅ Zhe Zhu ⋅ Peng Li ⋅ Zhiyang Dou ⋅ Cheng Lin ⋅ Shiqing Xin ⋅ Yuan Liu ⋅ Wenping Wang ⋅ Taku Komura
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 538
TacSIm: A Dataset and Benchmark for Football Tactical Style Imitation
Peng Wen ⋅ Yuting Wang ⋅ Qiurui Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 539
DynamicsBoost: Dynamic Plausible Video Generation via Annotation-Free Continuation Preference Optimization
Jiaxing Li ⋅ Jiepeng Wang ⋅ Junyao Gao ⋅ Yang Liu ⋅ Eric Li ⋅ Bo An ⋅ Hao-Xiang Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 540
Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition
Xuemei Jia ⋅ Jiawei Du ⋅ Hui Wei ⋅ Jun Chen ⋅ Joey Tianyi Zhou ⋅ Zheng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 541
Fine-Grained GRPO for Precise Preference Alignment in Flow Models
Yujie Zhou ⋅ Pengyang Ling ⋅ Jiazi Bu ⋅ Yibin Wang ⋅ Yuhang Zang ⋅ Jiaqi Wang ⋅ Li Niu ⋅ Guangtao Zhai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 542
Lighting-grounded Video Generation with Renderer-based Agent Reasoning
Ziqi Cai ⋅ Taoyu Yang ⋅ Zheng Chang ⋅ Si Li ⋅ Han Jiang ⋅ Shuchen Weng ⋅ Boxin Shi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 543
RewardFlow: Generate Images by Optimizing What You Reward
Onkar Susladkar ⋅ Dong-Hwan Jang ⋅ Tushar Prakash ⋅ Adheesh Juvekar ⋅ Vedant Shah ⋅ Ayush Barik ⋅ Nabeel Bashir ⋅ Muntasir Wahed ⋅ Ritish Shrirao ⋅ Ismini Lourentzou
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 544
Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals
Nate Gillman ⋅ Yinghua Zhou ⋅ Zitian Tang ⋅ Evan Luo ⋅ Arjan Chakravarthy ⋅ Daksh Aggarwal ⋅ Michael Freeman ⋅ Chen Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 545
Self-Corrected Image Generation with Explainable Latent Rewards
Yinyi Luo ⋅ Hrishikesh Gokhale ⋅ Marios Savvides ⋅ Jindong Wang ⋅ Shengfeng He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 546
Polyphony: Diffusion-based Dual-Hand Action Segmentation with Alternating Vision Transformer and Semantic Conditioning
Hao Zheng ⋅ Hu Wang ⋅ Tiantian Zheng ⋅ Prajjwal Bhattarai ⋅ Tuka Alhanai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 547
Reading Your Actions: Learning Generalizable Action Representations via Pre-training AEMG
Zhenghao Huang ⋅ Kaikai Wang ⋅ HUILIN YAO ⋅ Lin Shu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 548
MA-Bench: Towards Fine-grained Micro-Action Understanding
Kun Li ⋅ Jihao Gu ⋅ Fei Wang ⋅ zhiliang wu ⋅ Hehe Fan ⋅ Dan Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 549
OpenMarcie: Dataset for Multimodal Action Recognition in Industrial Environments
Hymalai Bello ⋅ Lala Ray ⋅ Joanna Sorysz ⋅ Sungho Suh ⋅ Paul Lukowicz
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 550
Action Motifs: Self-Supervised Hierarchical Representation of Human Body Movements
Genki Kinoshita ⋅ Shu Nakamura ⋅ Ryo Kawahara ⋅ Shohei Nobuhara ⋅ Yasutomo Kawanishi ⋅ Ko Nishino
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 551
DarkShake-DVS: Event-based Human Action Recognition under Low-light and Shaking Camera Conditions
Jiaqi Chen ⋅ Qinfu Xu ⋅ Liyuan Pan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 552
Protect to Adapt: Subspace-Constrained Adaptation with Ranked Negative Prompt Feedback for Few-Shot Action Recognition
Hantao Qi ⋅ Yan Yan ⋅ Junlong Gao ⋅ Hanzi Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 553
SkeletonContext: Skeleton-side Context Prompt Learning for Zero-Shot Skeleton-based Action Recognition
Ning Wang ⋅ Tieyue Wu ⋅ Naeha Sharif ⋅ Farid Boussaid ⋅ Guangming Zhu ⋅ Lin Mei ⋅ Mohammed Bennamoun ⋅ Liang Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 554
InTrain: Intrinsic Trainability for Zero-Cost Neural Architecture Search
Qinqin Zhou ⋅ Fuhai Chen ⋅ Jipeng Wu ⋅ Zhiwei Chen ⋅ Zhikai Hu ⋅ Weiwei Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 555
S^2FT: Parameter-Efficient Fine-Tuning in Sparse Spectrum Domain
Baoquan Zhang ⋅ Zhehao Yu ⋅ Lisai Zhang ⋅ Kenghong Lin ⋅ Tianran Chen ⋅ Yuxi Sun ⋅ Yunming Ye ⋅ Yao He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 556
Rethinking SNN Online Training and Deployment: Gradient-Coherent Learning via Hybrid-Driven LIF Model
Zecheng Hao ⋅ Yifan Huang ⋅ Zijie Xu ⋅ Wenxuan Liu ⋅ Yuanhong Tang ⋅ Zhaofei Yu ⋅ Tiejun Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 557
Gated KalmaNet: A Fading Memory Layer through Test-time Ridge Regression
Liangzu Peng ⋅ Aditya Chattopadhyay ⋅ Luca Zancato ⋅ Elvis Nunez ⋅ Wei Xia ⋅ Stefano Soatto
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 558
Towards Efficient Medical Reasoning with Minimal Fine-Tuning Data
Xinlin Zhuang ⋅ feilong tang ⋅ Haolin Yang ⋅ Xiwei Liu ⋅ Ming Hu ⋅ Huifa Li ⋅ Haochen Xue ⋅ Junjun He ⋅ Zongyuan Ge ⋅ Yichen Li ⋅ Ying Qian ⋅ Imran Razzak
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 559
AdaBet: Gradient-free Layer Selection for Efficient Training of Deep Neural Networks
Irene Tenison ⋅ Soumyajit Chatterjee ⋅ Fahim Kawsar ⋅ Mohammad Malekzadeh
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 560
TAS-LoRA: Transformer Architecture Search with Mixture-of-LoRA Experts
Jeimin Jeon ⋅ Hyunju Lee ⋅ Bumsub Ham
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 561
QuCNet: Quantum Deep Learning Driven Multi-Circuit Network for Remote Sensing Image Classification
Komal Komal ⋅ Mukul Gupta ⋅ Saumya Singh ⋅ SANTOSH VIPPARTHI ⋅ Chakradhar Reddy Chandupatla ⋅ Subrahmanyam Murala
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 562
Learning to Solve PDEs on Neural Shape Representations
Lilian Welschinger ⋅ Yilin Liu ⋅ Zican Wang ⋅ Niloy J. Mitra
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 563
Frequency Switching Mechanism for Parameter-Efficient Multi-Task Learning
Shih-Wen Liu ⋅ Yen-Chang Chen ⋅ Wei-Ta Chu ⋅ Fu-En Yang ⋅ Yu-Chiang Frank Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 564
Reconstructing Spiking Neural Networks Using a Single Neuron with Autapses
Wuque Cai ⋅ Hongze Sun ⋅ Quan Tang ⋅ Shifeng Mao ⋅ Zhenxing Wang ⋅ Jiayi He ⋅ Duo Chen ⋅ Dezhong Yao ⋅ Daqing Guo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 565
Widget2Code: From Visual Widgets to UI Code via Multimodal LLMs
Houston H. Zhang ⋅ TAO ZHANG ⋅ Baoze Lin ⋅ Yuanqi Xue ⋅ Yincheng Zhu ⋅ Huan Liu ⋅ Li Gu ⋅ Linfeng Ye ⋅ Ziqiang Wang ⋅ Xinxin Zuo ⋅ Yang Wang ⋅ YUANHAO YU ⋅ Zhixiang Chi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 566
GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents
Yang Li ⋅ Yuchen Liu ⋅ Haoyu Lu ⋅ Zhiqiang Xia ⋅ Hongzhen Wang ⋅ Kaiyang Han ⋅ Changpeng Yang ⋅ Jinyang Wu ⋅ Jiaming Xu ⋅ Runyu Shi ⋅ Ying Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 567
FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
Mingyu Ouyang ⋅ Kevin Qinghong Lin ⋅ Mike Zheng Shou ⋅ Hwee Tou Ng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 568
Streamlined Open-Vocabulary Human-Object Interaction Detection
Chang Sun ⋅ Dongliang Liao ⋅ Changxing Ding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 569
Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection
SA ZHU ⋅ Wanqian Zhang ⋅ Lin Wang ⋅ Xiaohua Chen ⋅ Chenxu Cui ⋅ Jinchao Zhang ⋅ Bo Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 570
Mitigating Simplicity Bias in OOD Detection through Object Co-occurrence Analysis
Boyang Dai ⋅ Chaoqi Chen ⋅ Yizhou Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 571
Boosting Quantitive and Spatial Awareness for Zero-Shot Object Counting
Da Zhang ⋅ Bingyu Li ⋅ Feiyu Wang ⋅ Zhiyuan Zhao ⋅ Junyu Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 572
Parameter-Efficient Semantic Augmentation for Enhancing Open-Vocabulary Object Detection
Weihao Cao ⋅ Runqi Wang ⋅ Xiaoyue Duan ⋅ Jinchao Zhang ⋅ Ang Yang ⋅ Liping Jing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 573
WeDetect: Fast Open-Vocabulary Object Detection as Retrieval
Shenghao Fu ⋅ Yukun Su ⋅ Fengyun Rao ⋅ Jing LYU ⋅ Xiaohua Xie ⋅ Wei-Shi Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 574
Open-Vocabulary Domain Generalization in Urban-Scene Segmentation
Dong Zhao ⋅ Qi Zang ⋅ Nan Pu ⋅ Wenjing Li ⋅ Nicu Sebe ⋅ Zhun Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 575
OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery
Qi Guo ⋅ Jue Wang ⋅ Yinhe Liu ⋅ Yanfei Zhong
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 576
Annotation-Efficient Coreset Selection for Context-dependent Segmentation
jin zhang ⋅ Zhe Cao ⋅ Biwen Yang ⋅ Ruiheng Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 577
ALLNet: Multi-task Dense Prediction for Degraded Images
Weiran Wang ⋅ Jialing Wu ⋅ Yaqi Chang ⋅ Gang He ⋅ Li Xu ⋅ Chang Wu ⋅ Yunsong Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 578
Geometry-Aware Cross-Modal Graph Alignment for Referring Segmentation in 3D Gaussian Splatting
Yuwen Tao ⋅ Kanglei Zhou ⋅ Chang Li ⋅ Liyuan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 579
Volumetric Functional Maps
Filippo Maggioli ⋅ Simone Melzi ⋅ Marco Livesu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 580
GenMask: Adapting DiT for Segmentation via Direct Mask Generation
Yang yuhuan ⋅ Xianwei Zhuang ⋅ Yuxuan Cai ⋅ Chaofan Ma ⋅ Shuai Bai ⋅ Jiangchao Yao ⋅ Ya Zhang ⋅ Junyang Lin ⋅ Yanfeng Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 581
Frequency-Aware Affinity for Weakly Supervised Semantic Segmentation
Ziqian Yang ⋅ Xianglin Qiu ⋅ Xinqiao Zhao ⋅ Xiaolei Wang ⋅ Quan Zhang ⋅ Jimin Xiao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 582
Learning and Aligning Click-Aware Shape Prior for Interactive Amodal Instance Segmentation
Junjie Chen ⋅ Junwei Lin ⋅ Ren Hong ⋅ Shengjie Liu ⋅ Yuming Fang ⋅ Feng Qian ⋅ Yifan Zuo
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 583
Beyond Reassembly: Fractured Object Recovery with Missing Parts
Qun-Ce Xu ⋅ Jiahui Li ⋅ Yan-Pei Cao ⋅ Weihao Cheng ⋅ Tai-Jiang Mu ⋅ Ying Shan ⋅ Chuan Li ⋅ Da Chen ⋅ Yong-Liang Yang ⋅ Shi-Min Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 584
Best Segmentation Buddies for Image-Shape Correspondence
Itai Lang ⋅ Dongwei Lyu ⋅ Dale Decatur ⋅ Rana Hanocka
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 585
RMAE-ProGRess: Advancing Semantic Segmentation in Unstructured Environments
Manish Bhurtel ⋅ Danda B. Rawat
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 586
Local Precise Refinement: A Dual-Gated Mixture-of-Experts for Enhancing Foundation Model Generalization against Spectral Shifts
Xi Chen ⋅ Maojun Zhang ⋅ Yu Liu ⋅ Shen Yan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 587
Orthogonal Spatial-Aware Multi-View Anchor Graph Clustering for Incomplete Remote Sensing Data
Yongshan Zhang ⋅ Xiaohuan Lin ⋅ Lefei Zhang ⋅ Zhihua Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 588
SIGMA: A Physics-Based Benchmark for Gas Chimney Understanding in Seismic Images
Bao Truong ⋅ Quang Nguyen ⋅ Baoru Huang ⋅ Jinpei Han ⋅ Van Nguyen ⋅ Ngan Le ⋅ Minh-Tan Pham ⋅ Doan Huy Hien ⋅ Anh Nguyen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 589
SkySense-VITA: Towards Universal In-context Segmentation of Multi-modal Remote Sensing Imagery
Kang Wu ⋅ Lei Yu ⋅ Junwei Luo ⋅ Bo Dang ⋅ Junjian Zhang ⋅ Xiangyuan Cai ⋅ Hongwei Hu ⋅ Jingdong Chen ⋅ Yansheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 590
ProM3E: Probabilistic Masked MultiModal Embedding Model for Ecology
Srikumar Sastry ⋅ Subash Khanal ⋅ Aayush Dhakal ⋅ Jiayu Lin ⋅ Daniel Cher ⋅ Phoenix Jarosz ⋅ Nathan Jacobs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 591
GeoCoT: Towards Reliable Remote Sensing Reasoning with Manifold Perspective
Daixun Li ⋅ Zirui Li ⋅ Sibo He ⋅ Jiayun Tian ⋅ Mingxiang Cao ⋅ Weiying Xie ⋅ Yunke Wang ⋅ Xin Zhang ⋅ Yusi Zhang ⋅ Yunsong Li ⋅ Chang Xu ⋅ Leyuan Fang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 592
STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting
Hao Chen ⋅ Tao Han ⋅ Jie ZHANG ⋅ Song Guo ⋅ Lei Bai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 593
NeighborMAE: Exploiting Spatial Dependencies between Neighboring Earth Observation Images in Masked Autoencoders Pretraining
Liang Zeng ⋅ Valerio Marsocci ⋅ Wufan Zhao ⋅ Andrea Nascetti ⋅ Maarten Vergauwen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 594
GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding
Jiaqi Liu ⋅ Ronghao Fu ⋅ Haoran Liu ⋅ Lang Sun ⋅ Qipeng Wang ⋅ Bo Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 595
Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images
Jingzhou Chen ⋅ Dexin Chen ⋅ Fengchao Xiong ⋅ Yuntao Qian ⋅ Liang Xiao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 596
Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting
Alabi Mehzabin Anisha ⋅ Guangjing Wang ⋅ Sriram Chellappan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 597
Improving Adversarial Transferability with Local Perturbation Augmentation
Jian-Xun Mi ⋅ Xuanhui Zhong ⋅ Weisheng Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 598
Echoes of Ownership: Adversarial-Guided Dual Injection for Copyright Protection in MLLMs
Chengwei Xia ⋅ Fan Ma ⋅ Ruijie Quan ⋅ Yunqiu Xu ⋅ Kun Zhan ⋅ Yi Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 599
Stealing Split Learning Bottom Models by Recovering Embedding Geometry
Qinbo Zhang ⋅ Yanhang Shi ⋅ Ziyi Zhang ⋅ Hao Wang ⋅ Sai Qian Zhang ⋅ Jian Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 600
PoInit-of-View: Poisoning Initialization of Views Transfers Across Multiple 3D Reconstruction Systems
Weijie Wang ⋅ Songlong Xing ⋅ Zhengyu Zhao ⋅ Nicu Sebe ⋅ Bruno Lepri
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 601
No Way To Steal My Face: Proactive Defense Against Identity-Preserving Personalized Generation
Lizhi Xiong ⋅ Jun Li ⋅ Ziqiang Li ⋅ Weiwei Jiang ⋅ Zhangjie Fu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 602
Towards Reliable Evaluation of Adversarial Robustness for Spiking Neural Networks
Jihang Wang ⋅ Dongcheng Zhao ⋅ Ruolin Chen ⋅ Qian Zhang ⋅ Yi Zeng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 603
Where, What, Why: Toward Explainable 3D-GS Watermarking
Mingshu Cai ⋅ Jiajun Li ⋅ Osamu Yoshie ⋅ Yuya Ieiri ⋅ Yixuan Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 604
Robust Spiking Neural Networks by Temporal Mutual Information
Mengting Xu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 605
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
Seungjae Lee ⋅ Yoonkyo Jung ⋅ Inkook Chun ⋅ Yao-Chih Lee ⋅ Zikui Cai ⋅ Hongjia Huang ⋅ Aayush Talreja ⋅ Tan Dao ⋅ Yongyuan Liang ⋅ Jia-Bin Huang ⋅ Furong Huang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 606
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
Minghui Lin ⋅ Pengxiang Ding ⋅ Shu Wang ⋅ Zifeng Zhuang ⋅ Yang Liu ⋅ Xinyang Tong ⋅ Wenxuan Song ⋅ Shangke Lyu ⋅ Siteng Huang ⋅ Donglin Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 607
AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots
Likui Zhang ⋅ Tao Tang ⋅ Zhihao Zhan ⋅ xiuwei chen ⋅ Zisheng Chen ⋅ Jianhua Han ⋅ Jiangtong Zhu ⋅ Pei Xu ⋅ Hang Xu ⋅ Hefeng Wu ⋅ Liang Lin ⋅ Xiaodan Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 608
Obstruction Reasoning for Robotic Grasping
Runyu Jiao ⋅ Matteo Bortolon ⋅ Francesco Giuliari ⋅ Alice Fasoli ⋅ Sergio Povoli ⋅ Guofeng Mei ⋅ Yiming Wang ⋅ Fabio Poiesi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 609
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Wenlong Huang ⋅ Yu-Wei Chao ⋅ Arsalan Mousavian ⋅ Ming-Yu Liu ⋅ Dieter Fox ⋅ Kaichun Mo ⋅ Li Fei-Fei
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 610
CycleManip: Enabling Cycle-based Manipulation via Effective History Perception and Understanding
Yi-Lin Wei ⋅ Haoran Liao ⋅ Yuhao Lin ⋅ Pengyue Wang ⋅ Zhizhao Liang ⋅ Guiliang Liu ⋅ Wei-Shi Zheng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 611
SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models
Haowen Liu ⋅ Shaoxiong Yao ⋅ Haonan Chen ⋅ Jiawei Gao ⋅ Jiayuan Mao ⋅ Jia-Bin Huang ⋅ Yilun Du
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 612
Adaptive Action Chunking at Inference-time for Vision-Language-Action Models
Yuanchang Liang ⋅ Xiaobo Wang ⋅ Kai Wang ⋅ Shuo Wang ⋅ Xiaojiang Peng ⋅ Haoyu Chen ⋅ David Kim Huat Chua ⋅ Prahlad Vadakkepat
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 613
Localizing, Structuring, and Rendering: Bridging 3D and 2D Vision-Language-Action Models for Robotic Manipulation
Yunlong Zhao ⋅ Xiaoheng Deng ⋅ Yichao Cao ⋅ Yi Chen ⋅ Xiangjian He ⋅ Shan You ⋅ Shuo Yang ⋅ Lei Fan ⋅ Fei Wang ⋅ Xiu Su
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 614
NIL: No-data Imitation Learning
Mert Albaba ⋅ Chenhao Li ⋅ Markos Diomataris ⋅ Omid Taheri ⋅ Andreas Krause ⋅ Michael J. Black
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 615
Humanoid Generative Pre-Training for Zero-Shot Motion Tracking
Zekun Qi ⋅ Xuchuan Chen ⋅ Jilong Wang ⋅ Chenghuai Lin ⋅ Yunrui Lian ⋅ Wenyao Zhang ⋅ XinQiang Yu ⋅ He Wang ⋅ Li Yi
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 616
EnergyAction: Unimanual to Bimanual Composition with Energy-Based Models
Mingchen Song ⋅ Xiang Deng ⋅ Jie Wei ⋅ Dongmei Jiang ⋅ Liqiang Nie ⋅ Weili Guan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 617
CUBic: Coordinated Unified Bimanual Perception and Control Framework
Xingyu Wang ⋅ Pengxiang Ding ⋅ Jingkai Xu ⋅ Donglin Wang ⋅ Zhaoxin Fan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 618
RehearseVLA: Simulated Post-Training for VLAs with Physically-Consistent World Model
Junjin Xiao ⋅ Yandan Yang ⋅ Xinyuan Chang ⋅ Ronghan Chen ⋅ Feng Xiong ⋅ Mu Xu ⋅ Wei-Shi Zheng ⋅ Qing Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 619
GraspGen-X: Cross-Embodiment 6-DOF Diffusion-based Grasping
Beining Han ⋅ Yu-Wei Chao ⋅ Erwin Coumans ⋅ Clemens Eppner ⋅ Jia Deng ⋅ Stan Birchfield ⋅ Adithya Murali
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 620
UETrack: A Unified and Efficient Framework for Single Object Tracking
Ben Kang ⋅ Jie Zhao ⋅ Xin Chen ⋅ Wanting Geng ⋅ Bin Zhang ⋅ Lu Zhang ⋅ Dong Wang ⋅ Huchuan Lu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 621
ProgTrack: A Multi-Object Tracking Algorithm with Progressive Matching Strategy
Chenhui Zhang ⋅ Guoqing Dong ⋅ Weijie Peng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 622
Efficient Video Object Segmentation and Tracking with Recurrent Dynamic Submodel
Weidong Tang ⋅ Zhiyuan Liang ⋅ Xinyan Wan ⋅ Chen Zhu ⋅ Zhaopan Xu ⋅ Pengfei Zhou ⋅ Yan Song ⋅ Yang You ⋅ Wangbo Zhao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 623
Learning to Track Instance from Single Nature Language Description
Yaozong Zheng ⋅ Bineng Zhong ⋅ Qihua Liang ⋅ Shuimu Zeng ⋅ Haiying Xia ⋅ Shuxiang Song
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 624
MV-TAP: Tracking Any Point in Multi-View Videos
Jahyeok Koo ⋅ Inès Hyeonsu Kim ⋅ Mungyeom Kim ⋅ Junghyun Park ⋅ Seohyeon Park ⋅ Jaeyeong Kim ⋅ Jung Yi ⋅ Seokju Cho ⋅ Seungryong Kim
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 625
Adaptive Depth Lightweight RGB-T Tracking with Holistic Token Routing
Tian Ding ⋅ Hongtao Yang ⋅ Liangtao Shi ⋅ Jun Li ⋅ Xiantao Hu ⋅ Jian Yang ⋅ Ying Tai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 626
Content-Adaptive Hierarchical Hyperprior for Neural Video Coding
Junqi Liao ⋅ Yaojun Wu ⋅ Chaoyi Lin ⋅ Zhipin Deng ⋅ Li Li ⋅ Dong Liu ⋅ Xiaoyan Sun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 627
UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking
Hao Wu ⋅ Xudong Wang ⋅ Jialiang Zhang ⋅ Junlong Tong ⋅ Xinghao Chen ⋅ Junyan Lin ⋅ Yunpu Ma ⋅ Xiaoyu Shen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 628
Similarity-as-Evidence: Calibrating Overconfident VLMs for Interpretable and Label-Efficient Medical Active Learning
Zhuofan Xie ⋅ Zishan Lin ⋅ Jinliang Lin ⋅ Jie Qi ⋅ Shaohua Hong ⋅ Shuo Li
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 629
From Infusion to Assimilation Distillation for Medical Image Segmentation
Jiankang Hong ⋅ Ye Luo ⋅ Yinan Liu ⋅ Junsong Yuan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 630
IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation
Yankai Jiang ⋅ Qiaoru Li ⋅ BinLu Xu ⋅ Haoran Sun ⋅ Chao Ding ⋅ Junting Dong ⋅ Yuxiang Cai ⋅ Xuhong Zhang ⋅ Jianwei Yin
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 631
Unlocking Positive Transfer in Incrementally Learning Surgical Instruments: A Self-reflection Hierarchical Prompt Framework
Yu ZHU ⋅ Kang LI ⋅ Zheng Li ⋅ Pheng-Ann Heng
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 632
Keep It Frozen: Domain-Routed Conditional Residual Modulation for Multi-Domain Vision Transformers
Ufaq Khan ⋅ Umair Nawaz ⋅ Massimo Caputo ⋅ Muhammad Bilal ⋅ Junaid Qadir ⋅ Muhammad Haris Khan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 633
Virtual Full-stack Scanning of Brain MRI via Imputing Any Quantised Code
Yicheng Wu ⋅ Tao Song ⋅ Zhonghua Wu ⋅ Jin Ye ⋅ Zongyuan Ge ⋅ Wenjia Bai ⋅ Zhaolin Chen ⋅ Jianfei Cai
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 634
MedLoc-R1: Performance-Aware Curriculum Reward Scheduling for GRPO-Based Medical Visual Grounding
Yang Guangjing ⋅ Ziyuan Qin ⋅ Chaoran Zhang ⋅ Chenlin Du ⋅ Jinglin Wang ⋅ Wanran Sun ⋅ Zhenyu Zhang ⋅ Bing Ji ⋅ Qicheng Lao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 635
Turning Pre-Trained Vision Transformers into End-to-End Histopathology Whole Slide Image Models for Survival Prediction
Jiawen Li ⋅ Jiali Hu ⋅ Xitong Ling ⋅ Renao Yan ⋅ Yuxuan Chen ⋅ Tian Guan ⋅ Yonghong He
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 636
A Supervised Multi-task Framework for Joint cryo-ET Restoration Enabled by Generative Physical Simulation
Xinsheng Wang ⋅ Zhidong Yang ⋅ Xiaohua Wan ⋅ Renmin Han ⋅ Shuai Tang ⋅ Hao Dong ⋅ Fa Zhang ⋅ Bin Hu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 637
KAMP: Knowledge-Anchored Multimodal Pretraining Framework for Medical Image Representation
Feiyu Huang ⋅ Jia Li ⋅ Zhao CHEN ⋅ Yang WU ⋅ Caleb Chen Cao ⋅ Lei Chen
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 638
CARE: A Molecular-Guided Foundation Model with Adaptive Region Modeling for Whole Slide Image Analysis
Di Zhang ⋅ Zhangpeng Gong ⋅ Xiaobo Pang ⋅ Jiashuai Liu ⋅ Junbo Lu ⋅ Hao Cui ⋅ Jiusong Ge ⋅ Zhi Zeng ⋅ Kai Yi ⋅ Yinghua Li ⋅ Si Liu ⋅ Tingsong Yu ⋅ Haoran Wang ⋅ Mireia Crispin-Ortuzar ⋅ Weimiao Yu ⋅ Chen Li ⋅ Zeyu Gao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 639
Contrastive Cross-Bag Augmentation for Multiple Instance Learning-based Whole Slide Image Classification
Bo Zhang ⋅ Xu Xinan ⋅ Shuo Yan ⋅ Yu Bai ⋅ Zheng Zhang ⋅ Wufan Wang ⋅ Hui Gao ⋅ Wendong Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 640
OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging
meilin liu ⋅ Jiaying Wang ⋅ Jing Shan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 641
Learning complete and explainable visual representations from itemized text supervision
Yiwei Lyu ⋅ Chenhui Zhao ⋅ Soumyanil Banerjee ⋅ Shixuan Liu ⋅ Akshay Rao ⋅ Akhil Kondepudi ⋅ Honglak Lee ⋅ Todd C. Hollon
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 642
EgoPoseFormer v2: Accurate Egocentric Human Motion Estimation for AR/VR
Zhenyu Li ⋅ Sai Kumar Dwivedi ⋅ Filip Maric ⋅ Carlos Chacón ⋅ Nadine Bertsch ⋅ Filippo Arcadu ⋅ Tomas Hodan ⋅ Michael Ramamonjisoa ⋅ Peter Wonka ⋅ Amy Zhao ⋅ Robin Kips ⋅ Cem Keskin ⋅ Anastasia Tkach ⋅ Chenhongyi Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 643
MetricHMSR: Metric Human Mesh and Scene Recovery from Monocular Images
Chentao Song ⋅ He Zhang ⋅ Yuan Haolei ⋅ Haozhe Lin ⋅ Jianhua Tao ⋅ Hongwen Zhang ⋅ Tao Yu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 644
Differentially Private 2D Human Pose Estimation
Kaushik Bhargav Sivangi ⋅ Paul Henderson ⋅ Fani Deligianni
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 645
TROPHIES: Temporal Reconstruction of Places, Humans, and Cameras from Multi-view Videos
Jinpeng Liu ⋅ Yukang Xu ⋅ Yutong Li ⋅ Xingyu Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 646
PoseD-Flow: Versatile and Guided Flow Matching Model of Human Pose
Jebastin Nadar ⋅ Simone Foti ⋅ Tolga Birdal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 647
SIMSPINE: A Biomechanics-Aware Simulation Framework for 3D Spine Motion Annotation and Benchmarking
Muhammad Saif Ullah Khan ⋅ Didier Stricker
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 648
HUMAPS-4D: A Multimodal Dataset for HUman Motion Analysis with Physiological and Semantic informations
Matthieu Dabrowski ⋅ Ouala Ben Jemaa ⋅ Benjamin Allaert
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 649
PHASE-Net: Physics-Grounded Harmonic Attention System for Efficient Remote Photoplethysmography Measurement
bo zhao ⋅ Dan Guo ⋅ Junzhe Cao ⋅ Yong Xu ⋅ Bochao Zou ⋅ Tao Tan ⋅ Yue Sun ⋅ Zitong YU
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 650
LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
Nan Yang ⋅ Julian Straub ⋅ Fan Zhang ⋅ Richard Newcombe ⋅ Jakob Engel ⋅ Lingni Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 651
Expanding mmWave Datasets for Human Pose Estimation with Unlabeled Data and LiDAR Datasets
Zhuoxuan Peng ⋅ Boan Zhu ⋅ Xingjian Zhang ⋅ Wenying Li ⋅ Gary Chan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 652
Towards Balanced Multi-Modal Learning in 3D Human Pose Estimation
Mengshi Qi ⋅ Jiaxuan Peng ⋅ Xianlin Zhang ⋅ Huadong Ma
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 653
OMGTex: One-stage Multi-style Facial Texture Reconstruction without Geometry Guidance
Xiao Zitong ⋅ Yuda Qiu ⋅ Zisheng Ye ⋅ Xiaoguang Han
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 654
Human Interaction-Aware 3D Reconstruction from a Single Image
Gwanghyun Kim ⋅ Junghun James Kim ⋅ Suh Yoon Jeon ⋅ Jason Park ⋅ Se Young Chun
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 655
Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning
Yiheng Li ⋅ Zichang Tan ⋅ Guoqing Xu ⋅ Zhen Lei ⋅ Xu Zhou ⋅ Yang Yang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 656
SAGA: Source Attribution of Generative AI Videos
Rohit Kundu ⋅ Vishal Mohanty ⋅ Hao Xiong ⋅ Shan Jia ⋅ Athula Balachandran ⋅ Amit K. Roy-Chowdhury
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 657
VMD-FACT: A New Video Dataset and MLLM-based method for Detecting Realistic AI-Generated Video Misinformation
Yongkang Zhang ⋅ Dongyu She ⋅ Baiyu Ji ⋅ Qichuan Geng ⋅ Zhong Zhou ⋅ Yan Wang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 658
ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation
Qing Huang ⋅ Zhipei Xu ⋅ Xuanyu Zhang ⋅ Xiangyu Yu ⋅ Jian Zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 659
A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World
Jikang Cheng ⋅ Renye Yan ⋅ Zhiyuan Yan ⋅ Yaozhong Gan ⋅ Xueyi Zhang ⋅ Wei Peng ⋅ Zhongyuan Wang ⋅ Ling Liang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 660
PPM-CLIP: Probabilistic Prompt Modeling for Generalizable AI-Generated Image Detection
WANG XINYUAN ⋅ Yingxin Lai ⋅ Zhiming Luo ⋅ Zhihui Liu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 661
Learning from Noisy Supervision: A Denoising–Debiasing Framework for Weakly Supervised Video Anomaly Detection
Yaxin Zhao ⋅ Yang Wang ⋅ Wenya Guo ⋅ Sihan Xu ⋅ Xiangrui Cai ⋅ Xi Lin ⋅ Ying Zhang ⋅ Xiaojie Yuan
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 662
Anomaly as Non-Conformity via Training-Free Graph Laplacian Energy Minimization
Jungwook Seo ⋅ Minjeong Kim ⋅ Younkwan Lee ⋅ Seungho Shin ⋅ Sungyong Baik
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 663
VisualAD: Language-Free Zero-Shot Anomaly Detection via Vision Transformer
Yanning Hou ⋅ Peiyuan Li ⋅ Zirui Liu ⋅ Yitong Wang ⋅ Yanran Ruan ⋅ Jianfeng Qiu ⋅ Ke Xu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 664
CHAL: Causal-guided Hierarchical Anomaly-aware Learning for Moving Infrared Small Target Detection
Weiwei Duan ⋅ Luping Ji ⋅ Shipeng Lei ⋅ Sicheng Zhu ⋅ Jianghong Huang ⋅ Mao Ye
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 665
RAID: Retrieval-Augmented Anomaly Detection
Mingxiu Cai ⋅ Zhe Zhang ⋅ Gaochang Wu ⋅ Tianyou Chai ⋅ Xiatian Zhu
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 666
ADSeeker: A Knowledge-Grounded Reasoning Framework for Industry Anomaly Detection and Reasoning
Kai Zhang ⋅ Zekai Zhang ⋅ Xihe Sun ⋅ Anpeng Wang ⋅ Jingmeng Nie ⋅ Qinghui Chen ⋅ Han Hao ⋅ Jianyuan Guo ⋅ jinglin zhang
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 667
InvAD: Inversion-based Reconstruction-Free Anomaly Detection with Diffusion Models
Shunsuke Sakai ⋅ Xiangteng He ⋅ Chunzhi Gu ⋅ Leonid Sigal ⋅ Tatsuhito Hasegawa
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 668
QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy
Adam Lilja ⋅ Ji Lan ⋅ Junsheng Fu ⋅ Lars Hammarstrand
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 669
GSV2X: Geometry-Aware Uncertainty Modeling and Orthogonal Fusion for Robust Roadside Perception
jianqiang xu ⋅ Gensheng Pei ⋅ Huafeng Liu ⋅ Yazhou Yao
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 670
Grounded Latents for Entity-Centric 4D Scene Generation
Jinhyung Park ⋅ Navyata Sanghvi ⋅ Erica Weng ⋅ Shawn Hunt ⋅ Shinya Tanaka ⋅ Hironobu Fujiyoshi ⋅ Kris Kitani
Poster Session
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F
Poster Session 3 & Exhibit Hall
Art Program
Sat Jun 06 10:45 AM -- 05:00 PM (PDT) @ ExHall F
Art Exhibition
Luba Elliott
Art Program
Sat Jun 06 10:45 AM -- 11:15 AM (PDT) @ ExHall F
Art Gallery Tour with Curator and Artists
Luba Elliott
Art Program
Sat Jun 06 12:45 PM -- 01:45 PM (PDT) @ Room 201
Art Panel
Luba Elliott
Oral
Sat Jun 06 01:00 PM -- 01:12 PM (PDT) @ Four Seasons Ballroom
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization
Xinhai Hou ⋅ Shaoyuan Xu ⋅ Manan Biyani ⋅ Moyan Li ⋅ Jia Liu ⋅ Todd C. Hollon ⋅ Bryan Wang
Oral
Sat Jun 06 01:00 PM -- 01:12 PM (PDT) @ Bluebird Ballroom
Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding
Yue Li ⋅ Qi Ma ⋅ Runyi Yang ⋅ Mengjiao Ma ⋅ Bin Ren ⋅ Nikola Popovic ⋅ Nicu Sebe ⋅ Theo Gevers ⋅ Luc Van Gool ⋅ Danda Paudel ⋅ Martin R. Oswald
Oral
Sat Jun 06 01:00 PM -- 01:12 PM (PDT) @ Mile High Ballroom 1A - 2A
Breaking the Scalability Limit of Multi-Projector Calibration with Embedded Cameras
Takumi Kawano ⋅ Kohei Miura ⋅ Daisuke Iwai
Oral
Sat Jun 06 01:00 PM -- 01:12 PM (PDT) @ Mile High Ballroom 3A - 4A
INSID3: Training-Free In-Context Segmentation with DINOv3
Claudia Cuttano ⋅ Gabriele Trivigno ⋅ Christoph Reich ⋅ Daniel Cremers ⋅ Carlo Masone ⋅ Stefan Roth
Oral Session
Sat Jun 06 01:00 PM -- 02:15 PM (PDT) @ Mile High Ballroom 3A - 4A
Oral Session 4D: Visual Segmentation
Oral Session
Sat Jun 06 01:00 PM -- 02:15 PM (PDT) @ Four Seasons Ballroom
Oral Session 4B: Embodied & Agentic Intelligence
Oral Session
Sat Jun 06 01:00 PM -- 02:15 PM (PDT) @ Mile High Ballroom 1A - 2A
Oral Session 4C: Spatial Reasoning
Oral Session
Sat Jun 06 01:00 PM -- 02:15 PM (PDT) @ Bluebird Ballroom
Oral Session 4A: Geometric Understanding
Oral
Sat Jun 06 01:12 PM -- 01:25 PM (PDT) @ Four Seasons Ballroom
NitroGen: An Open Foundation Model for Generalist Gaming Agents
Loïc Magne ⋅ Anas Awadalla ⋅ Guanzhi Wang ⋅ Yinzhen Xu ⋅ Joshua Belofsky ⋅ Fengyuan Hu ⋅ Joohwan Kim ⋅ Ludwig Schmidt ⋅ Georgia Gkioxari ⋅ Jan Kautz ⋅ Yisong Yue ⋅ Yejin Choi ⋅ Yuke Zhu ⋅ Jim Fan
Oral
Sat Jun 06 01:12 PM -- 01:25 PM (PDT) @ Mile High Ballroom 1A - 2A
GaussianFluent: Gaussian Simulation for Dynamic Scenes with Mixed Materials
Bei Huang ⋅ Yixin Chen ⋅ Ruijie Lu ⋅ Gang Zeng ⋅ Hongbin Zha ⋅ Yuru Pei ⋅ Siyuan Huang
Oral
Sat Jun 06 01:12 PM -- 01:25 PM (PDT) @ Bluebird Ballroom
Featurising Pixels from Dynamic 3D Scenes with Linear In-Context Learners
Nikita Araslanov ⋅ Martin Sundermeyer ⋅ Hidenobu Matsuki ⋅ David Joseph Tan ⋅ Federico Tombari
Oral
Sat Jun 06 01:12 PM -- 01:25 PM (PDT) @ Mile High Ballroom 3A - 4A
MARCO: Navigating the Unseen Space of Semantic Correspondence
Claudia Cuttano ⋅ Gabriele Trivigno ⋅ Carlo Masone ⋅ Stefan Roth
Oral
Sat Jun 06 01:25 PM -- 01:37 PM (PDT) @ Mile High Ballroom 1A - 2A
InfiniBench: Infinite Benchmarking for Visual Spatial Reasoning with Customizable Scene Complexity
Haoming Wang ⋅ Qiyao Xue ⋅ Wei Gao
Oral
Sat Jun 06 01:25 PM -- 01:37 PM (PDT) @ Bluebird Ballroom
From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection
yepeng liu ⋅ Hao Li ⋅ Liwen Yang ⋅ Fangzhen Li ⋅ Xudi Ge ⋅ Yuliang Gu ⋅ kuang Gao ⋅ Bing Wang ⋅ Guang Chen ⋅ Hangjun Ye ⋅ Yongchao Xu
Oral
Sat Jun 06 01:25 PM -- 01:37 PM (PDT) @ Four Seasons Ballroom
PAI-Bench: A Comprehensive Benchmark For Physical AI
Fengzhe Zhou ⋅ Jiannan Huang ⋅ Jialuo Li ⋅ Deva Ramanan ⋅ Humphrey Shi
Oral
Sat Jun 06 01:25 PM -- 01:37 PM (PDT) @ Mile High Ballroom 3A - 4A
PR-MaGIC: Prompt Refinement Via Mask Decoder Gradient Flow For In-Context Segmentation
Minjae Lee ⋅ Sungwoo Hur ⋅ Soojin Hwang ⋅ Won Hwa Kim
Oral
Sat Jun 06 01:37 PM -- 01:50 PM (PDT) @ Mile High Ballroom 3A - 4A
R^2-Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection
Shuaike Shen ⋅ Ke Liu ⋅ Jiaqing Xie ⋅ Shangde Gao ⋅ Chunhua Shen ⋅ Ge Liu ⋅ Mireia Crispin-Ortuzar ⋅ Shangqi Gao
Oral
Sat Jun 06 01:37 PM -- 01:50 PM (PDT) @ Mile High Ballroom 1A - 2A
MAGICIAN: Efficient Long-Term Planning with Imagined Gaussians for Active Mapping
Shiyao Li ⋅ Antoine Guédon ⋅ Shizhe Chen ⋅ Vincent Lepetit
Oral
Sat Jun 06 01:37 PM -- 01:50 PM (PDT) @ Four Seasons Ballroom
RefAV: Towards Planning-Centric Scenario Mining
Cainan Davidson ⋅ Deva Ramanan ⋅ Neehar Peri
Oral
Sat Jun 06 01:37 PM -- 01:50 PM (PDT) @ Bluebird Ballroom
Linear Fundamental Matrix Estimation from 7 or 5 Points
Taci Ata Kucukpinar ⋅ Juan Mogollon ⋅ Joshua Fraser ⋅ Timothy Duff ⋅ Kannappan Palaniappan
Oral
Sat Jun 06 01:50 PM -- 02:02 PM (PDT) @ Bluebird Ballroom
OccuFly: A 3D Vision Benchmark for Semantic Scene Completion from the Aerial Perspective
Markus Gross ⋅ Sai B. Matha ⋅ Aya Fahmy ⋅ Rui Song ⋅ Daniel Cremers ⋅ Henri Meeß
Oral
Sat Jun 06 01:50 PM -- 02:02 PM (PDT) @ Mile High Ballroom 1A - 2A
Memory-Augmented Scene Understanding and Exploration for Open-World Aerial Object-Goal Navigation
Jiacong Zhou ⋅ Jiaxu Miao ⋅ Yourun Lin ⋅ Xianyun Wang ⋅ Jun Xiao ⋅ Jun Yu
Oral
Sat Jun 06 01:50 PM -- 02:02 PM (PDT) @ Four Seasons Ballroom
SoccerMaster: A Vision Foundation Model for Soccer Understanding
Haolin Yang ⋅ Jiayuan Rao ⋅ Haoning Wu ⋅ Weidi Xie
Oral
Sat Jun 06 01:50 PM -- 02:02 PM (PDT) @ Mile High Ballroom 3A - 4A
The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification
Dante Wasmuht ⋅ Otto Brookes ⋅ Maximilian Schall ⋅ Pablo Palencia ⋅ Christopher Beirne ⋅ Tilo Burghardt ⋅ Majid Mirmehdi ⋅ Hjalmar Kühl ⋅ Mimi Arandjelovic ⋅ Sam Pottie ⋅ Peter Bermant ⋅ Brandon Asheim ⋅ Yi Jin Toh ⋅ Adam Elzinga ⋅ Jason Allan Holmberg ⋅ Andrew Whitworth ⋅ Eleanor Flatt ⋅ Laura Gustafson ⋅ Chaitanya Ryali ⋅ Yuan-Ting Hu ⋅ Baishan Guo ⋅ Andrew Westbury ⋅ Kate Saenko ⋅ Dídac Surís
Oral
Sat Jun 06 02:02 PM -- 02:15 PM (PDT) @ Mile High Ballroom 3A - 4A
VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation
Yulu Gao ⋅ Bohao Zhang ⋅ Zongheng Tang ⋅ Jitong Liao ⋅ wenjun wu ⋅ Si Liu
Oral
Sat Jun 06 02:02 PM -- 02:15 PM (PDT) @ Mile High Ballroom 1A - 2A
Monocular Open Vocabulary Occupancy Prediction for Indoor Scenes
Changqing Zhou ⋅ Yueru Luo ⋅ Han Zhang ⋅ Zeyu Jiang ⋅ Changhao Chen
Oral
Sat Jun 06 02:02 PM -- 02:15 PM (PDT) @ Four Seasons Ballroom
VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments
Zelai Xu ⋅ Zhexuan Xu ⋅ Xiangmin Yi ⋅ Huining Yuan ⋅ Mo Guang ⋅ Kaiwen Long ⋅ Xinlei Chen ⋅ Yi Wu ⋅ Chao Yu ⋅ Yu Wang
Oral
Sat Jun 06 02:02 PM -- 02:15 PM (PDT) @ Bluebird Ballroom
VGGT-Ω
Jianyuan Wang ⋅ Minghao Chen ⋅ Shangzhan Zhang ⋅ Nikita Karaev ⋅ Johannes Schönberger ⋅ Patrick Labatut ⋅ Piotr Bojanowski ⋅ David Novotny ⋅ Andrea Vedaldi ⋅ Christian Rupprecht
Break
Sat Jun 06 02:15 PM -- 02:30 PM (PDT)
Courtesy Break
Poster Setup
Sat Jun 06 03:15 PM -- 03:45 PM (PDT) @ ExHall A
Poster Setup
Demonstration
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall F
Demos
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 1
Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding
Yue Li ⋅ Qi Ma ⋅ Runyi Yang ⋅ Mengjiao Ma ⋅ Bin Ren ⋅ Nikola Popovic ⋅ Nicu Sebe ⋅ Theo Gevers ⋅ Luc Van Gool ⋅ Danda Paudel ⋅ Martin R. Oswald
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 2
Featurising Pixels from Dynamic 3D Scenes with Linear In-Context Learners
Nikita Araslanov ⋅ Martin Sundermeyer ⋅ Hidenobu Matsuki ⋅ David Joseph Tan ⋅ Federico Tombari
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 3
From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection
yepeng liu ⋅ Hao Li ⋅ Liwen Yang ⋅ Fangzhen Li ⋅ Xudi Ge ⋅ Yuliang Gu ⋅ kuang Gao ⋅ Bing Wang ⋅ Guang Chen ⋅ Hangjun Ye ⋅ Yongchao Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 4
Linear Fundamental Matrix Estimation from 7 or 5 Points
Taci Ata Kucukpinar ⋅ Juan Mogollon ⋅ Joshua Fraser ⋅ Timothy Duff ⋅ Kannappan Palaniappan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 5
OccuFly: A 3D Vision Benchmark for Semantic Scene Completion from the Aerial Perspective
Markus Gross ⋅ Sai B. Matha ⋅ Aya Fahmy ⋅ Rui Song ⋅ Daniel Cremers ⋅ Henri Meeß
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 6
VGGT-Ω
Jianyuan Wang ⋅ Minghao Chen ⋅ Shangzhan Zhang ⋅ Nikita Karaev ⋅ Johannes Schönberger ⋅ Patrick Labatut ⋅ Piotr Bojanowski ⋅ David Novotny ⋅ Andrea Vedaldi ⋅ Christian Rupprecht
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 7
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization
Xinhai Hou ⋅ Shaoyuan Xu ⋅ Manan Biyani ⋅ Moyan Li ⋅ Jia Liu ⋅ Todd C. Hollon ⋅ Bryan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 8
NitroGen: An Open Foundation Model for Generalist Gaming Agents
Loïc Magne ⋅ Anas Awadalla ⋅ Guanzhi Wang ⋅ Yinzhen Xu ⋅ Joshua Belofsky ⋅ Fengyuan Hu ⋅ Joohwan Kim ⋅ Ludwig Schmidt ⋅ Georgia Gkioxari ⋅ Jan Kautz ⋅ Yisong Yue ⋅ Yejin Choi ⋅ Yuke Zhu ⋅ Jim Fan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 9
PAI-Bench: A Comprehensive Benchmark For Physical AI
Fengzhe Zhou ⋅ Jiannan Huang ⋅ Jialuo Li ⋅ Deva Ramanan ⋅ Humphrey Shi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 10
RefAV: Towards Planning-Centric Scenario Mining
Cainan Davidson ⋅ Deva Ramanan ⋅ Neehar Peri
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 11
SoccerMaster: A Vision Foundation Model for Soccer Understanding
Haolin Yang ⋅ Jiayuan Rao ⋅ Haoning Wu ⋅ Weidi Xie
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 12
VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments
Zelai Xu ⋅ Zhexuan Xu ⋅ Xiangmin Yi ⋅ Huining Yuan ⋅ Mo Guang ⋅ Kaiwen Long ⋅ Xinlei Chen ⋅ Yi Wu ⋅ Chao Yu ⋅ Yu Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 13
Breaking the Scalability Limit of Multi-Projector Calibration with Embedded Cameras
Takumi Kawano ⋅ Kohei Miura ⋅ Daisuke Iwai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 14
GaussianFluent: Gaussian Simulation for Dynamic Scenes with Mixed Materials
Bei Huang ⋅ Yixin Chen ⋅ Ruijie Lu ⋅ Gang Zeng ⋅ Hongbin Zha ⋅ Yuru Pei ⋅ Siyuan Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 15
InfiniBench: Infinite Benchmarking for Visual Spatial Reasoning with Customizable Scene Complexity
Haoming Wang ⋅ Qiyao Xue ⋅ Wei Gao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 16
MAGICIAN: Efficient Long-Term Planning with Imagined Gaussians for Active Mapping
Shiyao Li ⋅ Antoine Guédon ⋅ Shizhe Chen ⋅ Vincent Lepetit
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 17
Memory-Augmented Scene Understanding and Exploration for Open-World Aerial Object-Goal Navigation
Jiacong Zhou ⋅ Jiaxu Miao ⋅ Yourun Lin ⋅ Xianyun Wang ⋅ Jun Xiao ⋅ Jun Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 18
Monocular Open Vocabulary Occupancy Prediction for Indoor Scenes
Changqing Zhou ⋅ Yueru Luo ⋅ Han Zhang ⋅ Zeyu Jiang ⋅ Changhao Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 19
INSID3: Training-Free In-Context Segmentation with DINOv3
Claudia Cuttano ⋅ Gabriele Trivigno ⋅ Christoph Reich ⋅ Daniel Cremers ⋅ Carlo Masone ⋅ Stefan Roth
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 20
MARCO: Navigating the Unseen Space of Semantic Correspondence
Claudia Cuttano ⋅ Gabriele Trivigno ⋅ Carlo Masone ⋅ Stefan Roth
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 21
PR-MaGIC: Prompt Refinement Via Mask Decoder Gradient Flow For In-Context Segmentation
Minjae Lee ⋅ Sungwoo Hur ⋅ Soojin Hwang ⋅ Won Hwa Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 22
R^2-Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection
Shuaike Shen ⋅ Ke Liu ⋅ Jiaqing Xie ⋅ Shangde Gao ⋅ Chunhua Shen ⋅ Ge Liu ⋅ Mireia Crispin-Ortuzar ⋅ Shangqi Gao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 23
The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification
Dante Wasmuht ⋅ Otto Brookes ⋅ Maximilian Schall ⋅ Pablo Palencia ⋅ Christopher Beirne ⋅ Tilo Burghardt ⋅ Majid Mirmehdi ⋅ Hjalmar Kühl ⋅ Mimi Arandjelovic ⋅ Sam Pottie ⋅ Peter Bermant ⋅ Brandon Asheim ⋅ Yi Jin Toh ⋅ Adam Elzinga ⋅ Jason Allan Holmberg ⋅ Andrew Whitworth ⋅ Eleanor Flatt ⋅ Laura Gustafson ⋅ Chaitanya Ryali ⋅ Yuan-Ting Hu ⋅ Baishan Guo ⋅ Andrew Westbury ⋅ Kate Saenko ⋅ Dídac Surís
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 24
VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation
Yulu Gao ⋅ Bohao Zhang ⋅ Zongheng Tang ⋅ Jitong Liao ⋅ wenjun wu ⋅ Si Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 25
DAGE: Dual-Stream Architecture for Efficient and Fine-Grained Geometry Estimation
Tuan Duc Ngo ⋅ Gabriel Huang ⋅ Seoung Wug Oh ⋅ Kevin Blackburn-Matzen ⋅ Evangelos Kalogerakis ⋅ Chuang Gan ⋅ Joon-Young Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 26
Wave-Former: Through-Occlusion 3D Reconstruction via Wireless Shape Completion
Laura Dodds ⋅ Maisy Lam ⋅ Waleed Akbar ⋅ Yibo Cheng ⋅ Fadel Adib
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 27
Lite Any Stereo: Efficient Zero-Shot Stereo Matching
Junpeng Jing ⋅ Weixun Luo ⋅ Ye Mao ⋅ Krystian Mikolajczyk
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 28
MuM: Multi-View Masked Image Modeling for 3D Vision
David Nordström ⋅ Johan Edstedt ⋅ Fredrik Kahl ⋅ Georg Bökman
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 29
ZipMap: Linear-Time Stateful 3D Reconstruction via Test-Time Training
Haian Jin ⋅ Rundi Wu ⋅ Tianyuan Zhang ⋅ Ruiqi Gao ⋅ Jonathan T. Barron ⋅ Noah Snavely ⋅ Aleksander Holynski
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 30
Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction
Tao Xie ⋅ Peishan Yang ⋅ Yudong Jin ⋅ Yingfeng Cai ⋅ Wei Yin ⋅ Weiqiang Ren ⋅ Qian Zhang ⋅ Wei Hua ⋅ Sida Peng ⋅ Xiaoyang Guo ⋅ Xiaowei Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 31
LaRP: Efficient Multi-View Inpainting with Latent Reprojection Priors
Gaoyang Zhang ⋅ Xinguo Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 32
TopoMA: Topology-Guided Multi-Agent Dense RGB 3D Reconstruction via Distributed Inference
Xuanxuan Zhang ⋅ ShuHui Shi ⋅ Tianxiang Zhang ⋅ Zhetao Guo ⋅ Zixuan Huang ⋅ You Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 33
Sparse–View Localization via Online Neural 3D Regression
Ludvig Dillén ⋅ Magnus Oskarsson ⋅ Viktor Larsson
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 34
Dynamic Visual SLAM using a General 3D Prior
Xingguang Zhong ⋅ Liren Jin ⋅ Marija Popovic ⋅ Jens Behley ⋅ Cyrill Stachniss
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 35
Learning Scene Coordinate Reconstruction from Unposed Images via Pose Graph Optimization
Tze Ho Elden Tse ⋅ Jizong Peng ⋅ Angela Yao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 36
FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention
Zipeng Wang ⋅ Dan Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 37
No Calibration, No Depth, No Problem: Cross-Sensor View Synthesis with 3D Consistency
Cho-Ying Wu ⋅ Zixun Huang ⋅ Xinyu Huang ⋅ Liu Ren
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 38
UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling
Kaiyuan Tan ⋅ Yingying Shen ⋅ Ziyue Zhu ⋅ Mingfei Tu ⋅ HAOHUI ZHU ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Guang Chen ⋅ Hangjun Ye
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 39
Reliev3R: Relieving Feed-forward 3D Reconstruction from Multi-View Geometric Annotations
Youyu Chen ⋅ Junjun Jiang ⋅ Yueru Luo ⋅ Kui Jiang ⋅ Xianming Liu ⋅ Xu Yan ⋅ Dave Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 40
TALO: Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction
Fengyi Zhang ⋅ Tianjun Zhang ⋅ Kasra Khosoussi ⋅ Zheng Zhang ⋅ Zi Huang ⋅ Yadan Luo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 41
Global Structure-from-Motion Meets Feedforward Reconstruction
Linfei Pan ⋅ Johannes Schönberger ⋅ Marc Pollefeys
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 42
POCA: Pareto-Optimal Curriculum Alignment for Visual Text Generation
Yaohou Fan ⋅ Qingzhong Wang ⋅ Yongsong Huang ⋅ Junyi Liu ⋅ Tomo Miyazaki ⋅ Shinichiro Omachi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 43
DuoGen: Towards Autonomous Interleaved Multimodal Generation
Min Shi ⋅ Xiaohui Zeng ⋅ Jiannan Huang ⋅ Yin Cui ⋅ Francesco Ferroni ⋅ Jialuo Li ⋅ Max Li ⋅ Yogesh Balaji ⋅ Haoxiang Wang ⋅ Tsung-Yi Lin ⋅ Xiao Fu ⋅ Yue Zhao ⋅ Chieh-Yun Chen ⋅ Ming-Yu Liu ⋅ Humphrey Shi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 44
Vibe Spaces for Creatively Connecting and Expressing Visual Concepts
Huzheng Yang ⋅ Katherine Xu ⋅ Andrew Lu ⋅ Michael D. Grossberg ⋅ Yutong Bai ⋅ Jianbo Shi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 45
StoryTailor: A Zero-Shot Pipeline for Action-Rich Multi-Subject Visual Narratives
Jinghao Hu ⋅ Yuhe Zhang ⋅ GuoHua Geng ⋅ Kang Li ⋅ Han Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 46
CREward: A Type-Specific Creativity Reward Model
Jiyeon Han ⋅ Ali Mahdavi Amiri ⋅ Hao Zhang ⋅ Haedong Jeong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 47
LumiX: Structured and Coherent Text-to-Intrinsic Generation
Xu Han ⋅ Biao Zhang ⋅ Xiangjun Tang ⋅ Xianzhi Li ⋅ Peter Wonka
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 48
Synthetic Curriculum Reinforces Compositional Text-to-Image Generation
Shijian Wang ⋅ Runhao Fu ⋅ Siyi Zhao ⋅ Qingqin Zhan ⋅ Xingjian Wang ⋅ Jiarui Jin ⋅ Yuan Lu ⋅ Hanqian Wu ⋅ Cunjian Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 49
OmniGen2: Towards Instruction-Aligned Multimodal Generation
Chenyuan Wu ⋅ Jiahao Wang ⋅ PengFei Zheng ⋅ Ruiran Yan ⋅ Shitao Xiao ⋅ Xin Luo ⋅ Yueze Wang ⋅ Wanli Li ⋅ Xiyan Jiang ⋅ Yexin Liu ⋅ Junjie Zhou ⋅ Ziyi Xia ⋅ Ze Liu ⋅ Chaofan Li ⋅ Haoge Deng ⋅ Kun Luo ⋅ Bo Zhang ⋅ Jiajun Zhang ⋅ Dong Liu ⋅ Defu Lian ⋅ Xinlong Wang ⋅ Zhongyuan Wang ⋅ Tiejun Huang ⋅ Zheng Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 50
Selectively Extracting and Injecting Visual Attributes into Text-to-Image Models
Seunghwan Choi ⋅ Jooyeol Yun ⋅ Youngdo Lee ⋅ Jaegul Choo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 51
LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models
Yiming Hao ⋅ Mutian Xu ⋅ Chongjie Ye ⋅ Jie Qin ⋅ Shunlin Lu ⋅ Yipeng Qin ⋅ Xiaoguang Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 52
UniVerse: Empower Unified Generation with Reasoning and Knowledge
Kaiyue Sun ⋅ Weiyang Jin ⋅ Chengqi Duan ⋅ Rongyao Fang ⋅ Xian Liu ⋅ Yuwei Niu ⋅ Chunwei Wang ⋅ Aoxue Li ⋅ Xihui Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 53
UniVerse: A Unified Modulation Framework for Segmentation-Free, Disentangled Multi-Concept Personalization
Quynh Phung ⋅ Sandesh Ghimire ⋅ Minsi Hu ⋅ Charles Tsai ⋅ Jia-Bin Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 54
Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering
Dongxing Mao ⋅ Jinpeng Wang ⋅ Jiahao Tang ⋅ Kevin Qinghong Lin ⋅ Linjie Li ⋅ Zhengyuan Yang ⋅ Lijuan Wang ⋅ Min Li ⋅ Jingru Tan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 55
TGT: Text-Grounded Trajectories for Locally Controlled Video Generation
Guofeng Zhang ⋅ Angtian Wang ⋅ Jacob Fang Fang ⋅ Liming Jiang ⋅ Haotian Yang ⋅ Bo Liu ⋅ Yiding Yang ⋅ Guang Chen ⋅ Longyin Wen ⋅ Alan L. Yuille ⋅ Chongyang Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 56
RAISE: Requirement-Adaptive Evolutionary Refinement for Training-Free Text-to-Image Alignment
Liyao Jiang ⋅ Ruichen Chen ⋅ Chao Gao ⋅ Di Niu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 57
FlowFixer: Towards Detail-Preserving Subject-Driven Generation
Jinyoung Jun ⋅ Wondong Jang ⋅ Wenbin Ouyang ⋅ Raghudeep Gadde ⋅ Jungbeom Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 58
TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering
Hanshen Zhu ⋅ Yuliang Liu ⋅ Xuecheng Wu ⋅ An-Lan Wang ⋅ Chao Feng ⋅ Dingkang Yang ⋅ ChaoFeng ChaoFeng ⋅ Can Huang ⋅ Jingqun Tang ⋅ Xiang Bai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 59
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
Tian Ye ⋅ Song Fei ⋅ Lei Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 60
FEAT: Fashion Editing and Try-On from Any Design
Soye Kwon ⋅ Keonyoung Lee ⋅ Dahuin Jung ⋅ Jaekoo Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 61
Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation
Subin Kim ⋅ Sangwoo Mo ⋅ Mamshad Nayeem Rizve ⋅ Yiran Xu ⋅ Difan Liu ⋅ Jinwoo Shin ⋅ Tobias Hinz
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 62
PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models
Yuanhao Su ⋅ Shaofeng Zhang ⋅ Xiaosong Jia ⋅ Qi Fan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 63
PowerCLIP: Powerset Alignment for Contrastive Pre-Training
Masaki Kawamura ⋅ Nakamasa Inoue ⋅ Rintaro Yanagi ⋅ Hirokatsu Kataoka ⋅ Rio Yokota
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 64
MoBind: Motion Binding for Fine-Grained IMU–Video Pose Alignment
Duy Nguyen ⋅ Tat-Jun Chin ⋅ Minh Nguyen Nguyen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 65
The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models
Shivang Chopra ⋅ Shaunak Halbe ⋅ Chengyue Huang ⋅ Brisa Maneechotesuwan ⋅ Zsolt Kira
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 66
Tackling Model Bias via Game-theoretic Multi-agent Collaboration Framework for Hateful Meme Classification
Yiwei Wei ⋅ Zhengliang Guo ⋅ Shaozu Yuan ⋅ Chengyin Hu ⋅ Zhiyang Jia ⋅ Jiujiang Guo ⋅ Meng Chen ⋅ Peiying Wang ⋅ Longbiao Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 67
CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning
Zhijiang Tang ⋅ Linhua Wang ⋅ JIAXIN QI ⋅ Weihao Jiang ⋅ Peng Hou ⋅ Anxiang Zeng ⋅ Jianqiang Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 68
MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction
Zitian Tang ⋅ Xu Zhang ⋅ Jianbo Yuan ⋅ Yang Zou ⋅ Varad Gunjal ⋅ Songyao Jiang ⋅ Davide Modolo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 69
Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models
Jiadong Pan ⋅ Liang Li ⋅ Yuxin Peng ⋅ Yu-Ming Tang ⋅ Shuohuan Wang ⋅ Yu Sun ⋅ Hua Wu ⋅ Qingming Huang ⋅ Haifeng Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 70
Hierarchical Process Reward Models are Symbolic Vision Learners
Shan Zhang ⋅ Aotian Chen ⋅ Kai Zou ⋅ Jindong Gu ⋅ Yuan Xue ⋅ Anton van den Hengel
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 71
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
Shengyuan Ding ⋅ Xinyu Fang ⋅ Ziyu Liu ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Xiangyu Zhao ⋅ Haodong Duan ⋅ Xiaoyi Dong ⋅ Jianze Liang ⋅ Bin Wang ⋅ Conghui He ⋅ Dahua Lin ⋅ Jiaqi Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 72
SG-LoRA: Semantic-guided LoRA Parameters Generation
Miaoge Li ⋅ Yang Chen ⋅ Zhijie Rao ⋅ Can Jiang ⋅ Kang Wei ⋅ Jingcai Guo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 73
AcTTA: Rethinking Test-Time Adaptation via Dynamic Activation
Hyeongyu Kim ⋅ GeonHui Han ⋅ Dosik Hwang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 74
Reframing Long-Tailed Learning via Loss Landscape Geometry
shenghan chen ⋅ Yiming Liu ⋅ Yanzhen Wang ⋅ Yujia Wang ⋅ Xiankai Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 75
Cleaning the Pool: Progressive Filtering of Unlabeled Pools in Deep Active Learning
Denis Huseljic ⋅ Marek Herde ⋅ Lukas Rauch ⋅ Paul Hahn ⋅ Bernhard Sick
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 76
DC-Merge: Improving Model Merging with Directional Consistency
Han-Chen Zhang ⋅ Zi-Hao Zhou ⋅ Mao-Lin Luo ⋅ Shimin Di ⋅ Min-Ling Zhang ⋅ Tong Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 77
TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery
Yanan Wu ⋅ Yuhan Yan ⋅ Tailai Chen ⋅ Zhixiang Chi ⋅ ZiZhang Wu ⋅ Yi Jin ⋅ Yang Wang ⋅ Zhenbo Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 78
Event-Illumination Collaborative Low-light Image Enhancement with a High-resolution Real-world Dataset
Senyan Xu ⋅ Zhijing Sun ⋅ Kean Liu ⋅ Xin Lu ⋅ Ruixuan Jiang ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 79
NEC-Diff: Noise-Robust Event-RAW Complementary Diffusion for Seeing Motion in Extreme Darkness
Haoyue Liu ⋅ Jinghan Xu ⋅ Luxin Feng ⋅ Hanyu Zhou ⋅ Haozhi Zhao ⋅ Yi Chang ⋅ Luxin Yan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 80
Towards Persistence: Learning Topological Constraints for Event-based Small Object Detection
Shiman He ⋅ Nuo Chen ⋅ Xinyi Ying ⋅ Yihang Luo ⋅ Yangsi Shi ⋅ Zaiping Lin ⋅ Miao Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 81
Geometric-Photometric Event-based 3D Gaussian Ray Tracing
Kai Kohyama ⋅ Yoshimitsu Aoki ⋅ Guillermo Gallego ⋅ Shintaro Shiba
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 82
EventDrive: Event Cameras for Vision-Language Driving Intelligence
Dongyue Lu ⋅ Rong Li ⋅ Ao Liang ⋅ Lingdong Kong ⋅ Wei Yin ⋅ Lai Xing Ng ⋅ Benoit R. Cottereau ⋅ Camille Simon Chane ⋅ Wei Tsang Ooi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 83
EventGait: Towards Robust Gait Recognition with Event Streams
Senyan Xu ⋅ Shuai Chen ⋅ Chuanfu Shen ⋅ Kean Liu ⋅ Zhijing Sun ⋅ Chengzhi Cao ⋅ Xueyang Fu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 84
MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent
Yuxia Fu ⋅ Zhizhen Zhang ⋅ Yuqi Zhang ⋅ Zijian Wang ⋅ Zi Huang ⋅ Yadan Luo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 85
Resolving the Stability-Plasticity Dilemma in Reinforcement Learning via Complementary Continual Critics
Bo Sun ⋅ Peixi Peng ⋅ Guang Tan ⋅ Haoran Xu ⋅ Yaokun Li ⋅ Yiqian Chang ⋅ Shuaixian Wang ⋅ Luntong Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 86
SAGE: Scalable Agentic 3D Scene Generation for Embodied AI
Hongchi Xia ⋅ Xuan Li ⋅ Max Li ⋅ Qianli Ma ⋅ Jiashu Xu ⋅ Ming-Yu Liu ⋅ Yin Cui ⋅ Tsung-Yi Lin ⋅ Wei-Chiu Ma ⋅ Shenlong Wang ⋅ Shuran Song ⋅ Fangyin Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 87
Semantic Audio-Visual Navigation in Continuous Environments
Yichen Zeng ⋅ Hebaixu Wang ⋅ Meng Liu ⋅ Yu ZHOU ⋅ Chen Gao ⋅ Kehan Chen ⋅ Gongping Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 88
Unifying Perception and Action: A Hybrid-Modality Pipeline with Implicit Visual Chain-of-Thought for Robotic Action Generation
Xiangkai Ma ⋅ Lekai Xing ⋅ Han Zhang ⋅ Wenzhong Li ⋅ Sanglu Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 89
FLARE: A Failure-Aware Framework for Autonomous Correction and Recovery in Visual-Language Robotic Manipulation
Ganlong Zhao ⋅ Zijia Tang ⋅ Xingping Chen ⋅ Zhanghui Kuang ⋅ Ye Tian ⋅ Guanbin Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 90
Learning to Adapt: Self-Improving Web Agent via Cognitive-Aware Exploration
Weile Chen ⋅ Bingchen Miao ⋅ Qifan Yu ⋅ Wendong Bu ⋅ Guoming Wang ⋅ Wenqiao Zhang ⋅ Shengyu Zhang ⋅ Juncheng Li ⋅ Siliang Tang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 91
General Process Reward Modeling for Robotic Reinforcement Learning
Huajie Tan ⋅ Sixiang Chen ⋅ Yijie Xu ⋅ Zixiao Wang ⋅ Cheng Chi ⋅ Yuheng Ji ⋅ Yaoxu Lyu ⋅ Zhongxia Zhao ⋅ Xiansheng Chen ⋅ Peterson Co ⋅ Shaoxuan Xie ⋅ Guocai Yao ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 92
DynBridge: Bridging Imagination and Control through Interaction Dynamics for Robot Manipulation
Alex Wang ⋅ Zhiwei Dong ⋅ Qicheng Bai ⋅ Chenshi Zhang ⋅ Yujie Yi ⋅ Guang Dai ⋅ Yong Liu ⋅ Mengmeng Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 93
Action-Sketcher: From Reasoning to Action via Visual Sketches for Robotic Manipulation
Huajie Tan ⋅ Peterson Co ⋅ Yijie Xu ⋅ Shanyu Rong ⋅ Yuheng Ji ⋅ Cheng Chi ⋅ Xiansheng Chen ⋅ Zhongxia Zhao ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 94
Thinking in 360°: Humanoid Visual Search in the Wild
Heyang Yu ⋅ Yinan Han ⋅ Xiangyu Zhang ⋅ Baiqiao Yin ⋅ Bowen Chang ⋅ Xiangyu Han ⋅ Xinhao Liu ⋅ Jing Zhang ⋅ Marco Pavone ⋅ Chen Feng ⋅ Saining Xie ⋅ Yiming Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 95
Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation
Imanol G. Estepa ⋅ Jesús M Rodríguez-de-Vera ⋅ Bhalaji Nagarajan ⋅ Petia Radeva
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 96
MagicQuill V2: Precise and Interactive Image Editing with Layered Visual Cues
Zichen Liu ⋅ Yue Yu ⋅ Hao Ouyang ⋅ Qiuyu Wang ⋅ Shuailei Ma ⋅ Ka Leong Cheng ⋅ Wen Wang ⋅ Qingyan Bai ⋅ Yuxuan Zhang ⋅ Yanhong Zeng ⋅ Yixuan LI ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Qifeng Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 97
Cycle-Consistent Tuning for Layered Image Decomposition
Zheng Gu ⋅ Min Lu ⋅ Zhida Sun ⋅ Dani Lischinski ⋅ Daniel Cohen-Or ⋅ Hui Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 98
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
Yang Shi ⋅ Yuhao Dong ⋅ Yue Ding ⋅ Yuran Wang ⋅ Xuanyu Zhu ⋅ Sheng Zhou ⋅ Wenting Liu ⋅ Haochen Tian ⋅ rundong wang ⋅ Huanqian Wang ⋅ Zuyan Liu ⋅ Bohan Zeng ⋅ Ruizhe Chen ⋅ Qixun Wang ⋅ Zhuoran Zhang ⋅ Xinlong Chen ⋅ Chengzhuo Tong ⋅ bozhou li ⋅ Qiang Liu ⋅ Haotian Wang ⋅ Wenjing Yang ⋅ Yuanxing Zhang ⋅ Pengfei Wan ⋅ Yi-Fan Zhang ⋅ Ziwei Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 99
Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification
William Yang ⋅ Xindi Wu ⋅ Zhiwei Deng ⋅ Esin Tureci ⋅ Olga Russakovsky
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 100
NEAF: Natural Image Editing with Attention Fusion for Generalizable Test-time Optimization in Text-Guided Image Editing
Jisoo Kim ⋅ Heeseok Oh
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 101
OntoAug: Rethinking Generative Data Augmentation via Ontology Guidance
Shuo Wang ⋅ Zhichuan Wang ⋅ Jun Luo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 102
Spherical Voronoi: Directional Appearance as a Differentiable Partition of the Sphere
Francesco Di Sario ⋅ Daniel Rebain ⋅ Dor Verbin ⋅ Marco Grangetto ⋅ Andrea Tagliasacchi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 103
4DSurf: High-Fidelity Dynamic Scene Surface Reconstruction
Renjie Wu ⋅ Hongdong Li ⋅ Jose M. Alvarez ⋅ Miaomiao Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 104
Learning 3D Representations for Spatial Intelligence from Unposed Multi-View Images
bo zhou ⋅ Qiuxia Lai ⋅ Zeren Sun ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Wenguan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 105
Depth Peeling for High-Fidelity Gaussian-Enhanced Surfel Rendering
Keyang Ye ⋅ Hongzhi Wu ⋅ Kun Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 106
Intrinsic Image Fusion for Multi-View 3D Material Reconstruction
Peter Kocsis ⋅ Lukas Höllein ⋅ Matthias Nießner
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 107
PackUV: Packed Gaussian UV Maps for 4D Volumetric Video
Aashish Rai ⋅ Angela Xing ⋅ Anushka Agarwal ⋅ Xiaoyan Cong ⋅ Zekun Li ⋅ Tao Lu ⋅ Aayush Prakash ⋅ Srinath Sridhar
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 108
Opti-NeuS: Neural Reconstruction for Dual-Layered Transparent and Opaque Objects
Yi Yang ⋅ Gaoyang Zhang ⋅ Jun Tan ⋅ Xinguo Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 109
PhysGaia: A Physics-aware Benchmark with Multi-Body Interactions for Dynamic Novel View Synthesis
Mijeong Kim ⋅ Gunhee Kim ⋅ Jungyoon Choi ⋅ WonJae Roh ⋅ Bohyung Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 110
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry
Philipp Langsteiner ⋅ Jan-Niklas Dihlmann ⋅ Hendrik Lensch
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 111
OMoBlur: An Object Motion Blur Dataset and Benchmark for Real-World Local Motion Deblurring
Dingchuan Yu ⋅ Jiatong Li ⋅ Jingwen Zhou ⋅ Zhengyue Zhuge ⋅ Yueting Chen ⋅ Qi Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 112
Hybrid Agents for Image Restoration
Bingchen Li ⋅ Xin Li ⋅ Yiting Lu ⋅ Zhibo Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 113
Zero-Shot Image Denoising via Hybrid Prior-Guided Pseudo Sample Generation
Xiaole Zhao ⋅ Qingsong Pang ⋅ Xiaobo Zhang ⋅ Xun Xu ⋅ Xun Gong ⋅ Yan Yang ⋅ Tianrui Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 114
Self-supervised Dynamic Heterogeneous Degradation Modeling for Unified Zero-Shot Image Restoration
Xiaowan Hu ⋅ Jing Yang ⋅ Henan Liu ⋅ HuaQiu Li ⋅ Mai Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 115
Next-Scale Prediction: A Self-Supervised Approach for Real-World Image Denoising
Yiwen Shan ⋅ Haiyu Zhao ⋅ Peng Hu ⋅ Xi Peng ⋅ Yuanbiao Gou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 116
PhaSR: Generalized Image Shadow Removal with Physically Aligned Priors
Chia-Ming Lee ⋅ Yu-Fan Lin ⋅ Yu-Jou Hsiao ⋅ Jin-Hui Jiang ⋅ Yu-Lun Liu ⋅ Chih-Chung Hsu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 117
UARE: A Unified Vision-Language Model for Image Quality Assessment, Restoration, and Enhancement
Weiqi Li ⋅ Xuanyu Zhang ⋅ Bin Chen ⋅ Jingfen Xie ⋅ Yan Wang ⋅ Kexin Zhang ⋅ Junlin Li ⋅ Li zhang ⋅ Jian Zhang ⋅ Shijie Zhao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 118
FastGaMer: Efficient GainMap Learning for Practical Inverse Tone Mapping
YUANSHEN GUAN ⋅ Ruikang Xu ⋅ Chang Chen ⋅ Yinuo Liao ⋅ Dehua Song ⋅ Fenglong Song ⋅ Zhiwei Xiong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 119
MDS-VQA: Model-Informed Data Selection for Video Quality Assessment
Jian Zou ⋅ Xiaoyu Xu ⋅ Zhihua Wang ⋅ Yilin Wang ⋅ Balu Adsumilli ⋅ Kede Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 120
Seeing through Light and Darkness: Sensor-Physics Grounded Deblurring HDR NeRF from Single-Exposure Images and Events
Yunshan Qi ⋅ Lin Zhu ⋅ Nan Bao ⋅ Yifan Zhao ⋅ Jia Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 121
Disentanglement-wise Image Dehazing through Cross-Domain Manifold Consensus
Tianyi Lyu ⋅ Mingye Ju ⋅ Kai-Kuang Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 122
Unsupervised Multi-Scale Segmentation of 3D Subcellular World with Stable Diffusion Foundation Model
Mostofa Uddin Uddin ⋅ HM Shadman Tabib ⋅ Thanh-Huy Nguyen ⋅ Kashish Gandhi ⋅ Min Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 123
EchoPOSE: 6D Pose Estimation of Sparse Echocardiograms for Left-Ventricular 3D Shape Reconstruction
Lucas Iijima ⋅ Yihao Luo ⋅ Dario Sesia ⋅ Amit Kaura ⋅ Jamil Mayet ⋅ Choon Hwai Yap
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 124
Spatial-SAM: Spatially Consistent 3D Electron Microscopy Segmentation with SDF Memory and Semi-Supervised Learning
Yikai Huang ⋅ Renmin Han ⋅ Yuxuan Wang ⋅ Youcheng Cai ⋅ Ligang Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 125
LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding
XUANZHAO DONG ⋅ Wenhui Zhu ⋅ Xiwen Chen ⋅ Zhipeng Wang ⋅ Peijie Qiu ⋅ Shao Tang ⋅ Xin Li ⋅ Yalin Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 126
TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning
Yunbi Liu ⋅ Enqi Tang ⋅ Shiyu Li ⋅ hui shuai ⋅ Lei Ma ⋅ Juncheng Li ⋅ Kuai Yu ⋅ Shu Lou ⋅ Yongchu Pan ⋅ Qingshan Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 127
Harmonized Feature Conditioning and Frequency-Prompt Personalization for Multi-Rater Medical Segmentation
Sanaz Karimijafarbigloo ⋅ Armin Khosravi ⋅ Alireza Kheyrkhah ⋅ Reza Azad ⋅ Mauricio Reyes ⋅ Dorit Merhof
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 128
Masked-Diffusion Autoencoders for 3D Medical Vision Representation Learning
Jiachen Tu ⋅ Guanghui Qin ⋅ Theodore Zhengde Zhao ⋅ Jeya Maria Jose Valanarasu ⋅ Sheng Zhang ⋅ Tristan Naumann ⋅ Fan Lam ⋅ Sheng Wang ⋅ Hoifung Poon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 129
PGR-Net: Prior-Guided ROI Reasoning Network for Brain Tumor MRI Segmentation
Jiacheng Lu ⋅ Hui Ding ⋅ Shiyu Zhang ⋅ Guoping Huo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 130
Test-Time Attention Purification for Backdoored Large Vision Language Models
Zhifang Zhang ⋅ Yang Bojun ⋅ Shuo He ⋅ Weitong Chen ⋅ Wei Emma Zhang ⋅ Olaf Maennel ⋅ Lei Feng ⋅ Miao Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 131
AGFT: Alignment-Guided Fine-Tuning for Zero-Shot Adversarial Robustness of Vision-Language Models
Yubo Cui ⋅ Xianchao Guan ⋅ Zijun Xiong ⋅ Zheng Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 132
Towards Robust Multimodal Large Language Models Against Jailbreak Attacks
ZIYI YIN ⋅ Yuanpu Cao ⋅ Han Liu ⋅ Ting Wang ⋅ Jinghui Chen ⋅ Fenglong Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 133
R^2TUA: Reconstruction-residual Based Targeted and Untargeted Attack Against Text-Image Person Re-Identification
Yubo Wang ⋅ Yan Lu ⋅ Bin Liu ⋅ Xulin Li ⋅ Jixiang Niu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 134
When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models
Hui Lu ⋅ Yi Yu ⋅ Yiming Yang ⋅ Chenyu Yi ⋅ Qixin Zhang ⋅ Bingquan Shen ⋅ Alex C. Kot ⋅ Xudong Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 135
FlowHijack: A Dynamics-Aware Backdoor Attack on Flow-Matching Vision-Language-Action Models
Xinyuan An ⋅ Tao Luo ⋅ gengyun peng ⋅ Yaobing Wang ⋅ Kui Ren ⋅ Dongxia Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 136
Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models
Xingyu Zhu ⋅ Beier Zhu ⋅ Shuo Wang ⋅ Junfeng Fang ⋅ Kesen Zhao ⋅ Hanwang Zhang ⋅ Xiangnan He
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 137
Enhancing Part-Level Point Grounding for Any Open-Source MLLMs
Jin-Cheng Jhang ⋅ Fu-En Wang ⋅ Xin Yang ⋅ Nan Qiao ⋅ Lu Xia ⋅ Min Sun ⋅ Cheng-Hao Kuo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 138
MeteorPred: A Meteorological Multimodal Large Model and Dataset for Severe Weather Event Prediction
Shuo Tang ⋅ Jian Xu ⋅ Jiadong Zhang ⋅ yi chen ⋅ Qizhao Jin ⋅ Lingdong Shen ⋅ Chenglin Liu ⋅ Shiming Xiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 139
YieldSAT: A Multimodal Benchmark Dataset for High-Resolution Crop Yield Prediction
Miro Miranda ⋅ Deepak Pathak ⋅ Patrick Helber ⋅ Benjamin Bischke ⋅ Hiba Najjar ⋅ Francisco Mena ⋅ Cristhian Sanchez ⋅ Akshay Pai ⋅ Diego Arenas ⋅ Matias Valdenegro ⋅ Marcela Charfuelan ⋅ Marlon Nuske ⋅ Andreas Dengel
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 140
How Far Can We Go With Synthetic Data for Audio-Visual Sound Source Localization?
Arda Senocak ⋅ Sooyoung Park ⋅ Tae-Hyun Oh ⋅ Joon Chung
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 141
Modeling Cross-vision Synergy for Unified Large Vision Model
Shengqiong Wu ⋅ Lanhu Wu ⋅ Mingyang Bao ⋅ Wenhao Xu ⋅ Hanwang Zhang ⋅ Shuicheng Yan ⋅ Hao Fei ⋅ Tat-seng Chua
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 142
Beyond Missing Modalities: Hypergraph Conditioned Diffusion for Uncertainty-Aware Multimodal Emotion Recognition
Xihang Qiu ⋅ Yuhao Fang ⋅ Qing Zhou ⋅ Bin Zhai ⋅ Jialong Hong ⋅ Wanpeng Zhang ⋅ Yao Lu ⋅ Ye Zhang ⋅ Chun Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 143
Rosetta Stone For Unified MLLMs: A Unified Tokenizer to Decipher Understanding and Generation
Wenyu Sun ⋅ Hufei Li ⋅ Ruijin Jin ⋅ Xiangheng Kong ⋅ Yuning Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 144
MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding
Zhanheng Nie ⋅ Chenghan Fu ⋅ Daoze Zhang ⋅ Junxian Wu ⋅ Wanxian Guan ⋅ Pengjie Wang ⋅ Jian Xu ⋅ Bo Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 145
Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy
Jiahao Huang ⋅ Fengyan Lin ⋅ Xuechao Yang ⋅ Chen Feng ⋅ Kexin Zhu ⋅ Xu Yang ⋅ Zhide chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 146
AMusE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding
Sanjoy Chowdhury ⋅ Karren Dai Yang ⋅ Xudong Liu ⋅ Fartash Faghri ⋅ Pavan Kumar Anasosalu Vasu ⋅ Oncel Tuzel ⋅ Dinesh Manocha ⋅ Chun-Liang Li ⋅ Raviteja Vemulapalli
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 147
Prototype-as-Prompt: Multimodal Sentiment Prototypes Endowing Large Language Models the Capability to Perform Multimodal Sentiment Analysis
Xianbing Zhao ⋅ Lan Luo ⋅ Hengyang Lu ⋅ Buzhou Tang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 148
CF-IPT: Cross-Modal Fusion Interactive Prompt Tuning of Vision-Language Pre-Trained Model for Multisource Remote Sensing Data Classification
Jinheng Ji ⋅ Jiahui Qu ⋅ Wenqian Dong ⋅ Yunsong Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 149
EMAD: Evidence-Centric Grounded Multimodal Diagnosis for Alzheimer’s Disease
Qiuhui Chen ⋅ Xuancheng Yao ⋅ Zhenglei Zhou ⋅ Xinyue Hu ⋅ Yi Hong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 150
Multimodal Learning on Low-Quality Data with Conformal Predictive Self-Calibration
Xun Jiang ⋅ Yufan Gu ⋅ Disen Hu ⋅ Yuqing Hou ⋅ Yazhou Yao ⋅ Fumin Shen ⋅ Heng Tao Shen ⋅ Xing Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 151
Cross-View Distillation and Adaptive Masking for Incomplete Multi-View Multi-Label Classification
Yadong Liu ⋅ Qiaoqi Li ⋅ Yueying Wang ⋅ Lunke Fei ⋅ Jie Wen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 152
Bootstrap Your Own AV-Proxies: Adaptive Contrastive and Prototype Learning for Audio-Visual Segmentation
Junbo Zhang ⋅ Hang Su ⋅ Zhaofan Li ⋅ Hang Dong ⋅ Chao Sun
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 153
Multimodal Distribution Matching for Vision-Language Dataset Distillation
Jongoh Jeong ⋅ Hoyong Kwon ⋅ Minseok Kim ⋅ Kuk-Jin Yoon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 154
M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG
David Anugraha ⋅ Patrick Irawan ⋅ Anshul Singh ⋅ En-Shiun Annie Lee ⋅ Genta Indra Winata
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 155
Text-Driven 3D Hand Motion Generation from Sign Language Data
Léore Bensabath ⋅ Mathis Petrovich ⋅ Gul Varol
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 156
Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface
Yujie Zhao ⋅ Hongwei Fan ⋅ Di Chen ⋅ Shengcong Chen ⋅ Liliang Chen ⋅ Xiaoqi Li ⋅ Guangrui Ren ⋅ Hao Dong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 157
GenHOI: Towards Object-Consistent Hand–Object Interaction with Temporally Balanced and Spatially Selective Object Injection
Xuan Huang ⋅ Mochu Xiang ⋅ Zhelun Shen ⋅ Jinbo Wu ⋅ Chenming Wu ⋅ Chen Zhao ⋅ Kaisiyuan Wang ⋅ Hang Zhou ⋅ Shanshan Liu ⋅ Haocheng Feng ⋅ Wei He ⋅ Jingdong Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 158
Clay-to-Stone: Phase-wise 3D Gaussian Splatting for Monocular Articulated Hand-Object Manipulation Modeling
Xingyu Liu ⋅ Pengfei Ren ⋅ Qi Qi ⋅ Haifeng Sun ⋅ Zirui Zhuang ⋅ Jianxin Liao ⋅ Jingyu Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 159
Training-free Motion Factorization for Compositional Video Generation
Zixuan Wang ⋅ Ziqin Zhou ⋅ Feng Chen ⋅ DUO PENG ⋅ Yixin Hu ⋅ Changsheng Li ⋅ Yinjie Lei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 160
Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner
Haojie Zheng ⋅ Shuchen Weng ⋅ Jingqi Liu ⋅ Siqi Yang ⋅ Boxin Shi ⋅ Xinlong Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 161
CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization
Yitong Chen ⋅ Zuxuan Wu ⋅ Xipeng Qiu ⋅ Yu-Gang Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 162
FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
Xijie Huang ⋅ Chengming Xu ⋅ Donghao Luo ⋅ Xiaobin Hu ⋅ Peng Tang ⋅ Xu Peng ⋅ Jiangning Zhang ⋅ Chengjie Wang ⋅ Yanwei Fu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 163
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties
Ye Fang ⋅ Tong Wu ⋅ Valentin Deschaintre ⋅ Duygu Ceylan ⋅ Iliyan Georgiev ⋅ Chun-Hao Huang ⋅ Yiwei Hu ⋅ Xuelin Chen ⋅ Tuanfeng Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 164
PoseAnything: General Pose-guided Video Generation with Part-aware Temporal Coherence
Ruiyan Wang ⋅ Teng Hu ⋅ Kaihui Huang ⋅ Zihan Su ⋅ Ran Yi ⋅ Lizhuang Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 165
FastHybrid: Accelerating Hybrid Autoregressive Image Generation with Lookahead and Guided Decoding
j zg ⋅ Fang Zhang ⋅ YongXiang Hua ⋅ Bocheng Li ⋅ Wentao Zhang ⋅ Linli Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 166
DPAR: Dynamic Patchification for Efficient Autoregressive Visual Generation
Divyansh Srivastava ⋅ Akshay Mehra ⋅ Pranav Maneriker ⋅ Debopam Sanyal ⋅ Vishnu Raj ⋅ Vijay Kamarshi ⋅ Fan Du ⋅ Joshua Kimball
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 167
AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation
Sharath Girish ⋅ Viacheslav Ivanov ⋅ Tsai-Shien Chen ⋅ Hao Chen ⋅ Aliaksandr Siarohin ⋅ Sergey Tulyakov
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 168
LeapAlign: Post-training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories
Zhanhao Liang ⋅ Tao Yang ⋅ Jie Wu ⋅ Chengjian Feng ⋅ Liang Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 169
EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation
Tianwei Xiong ⋅ Jun Hao Liew ⋅ Zilong Huang ⋅ Zhijie Lin ⋅ Jiashi Feng ⋅ Xihui Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 170
Flow Matching for Multimodal Distributions
Gaoxiang Luo ⋅ Frank Cole ⋅ Sihang Zhang ⋅ Yuxiang Wan ⋅ Yulong Lu ⋅ Ju Sun
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 171
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
Xiangyan Qu ⋅ Zhenlong Yuan ⋅ Jing Tang ⋅ Rui Chen ⋅ Datao Tang ⋅ Meng Yu ⋅ Lei Sun ⋅ Yancheng Bai ⋅ Xiangxiang Chu ⋅ Gaopeng Gou ⋅ Gang Xiong ⋅ Yujun Cai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 172
ReasonEdit: Towards Reasoning-Enhanced Image Editing Models
Fukun Yin ⋅ Shiyu Liu ⋅ Yucheng Han ⋅ Zhibo Wang ⋅ Peng Xing ⋅ Rui Wang ⋅ Wei Cheng ⋅ Yingming Wang ⋅ Aojie Li ⋅ Zixin Yin ⋅ Pengtao Chen ⋅ Xianfang Zeng ⋅ Gang Yu ⋅ Daxin Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 173
Cross-Subject EEG-to-Video Reconstruction and Beyond
Runduo Han ⋅ Hongchen Tan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 174
Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot Video Generation
Binyuan Huang ⋅ Yuning Lu ⋅ Weinan Jia ⋅ hualiang wang ⋅ Mu Liu ⋅ Daiqing Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 175
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation
Bowen Xue ⋅ Zheng-Peng Duan ⋅ Qixin Yan ⋅ Wenjing Wang ⋅ Hao Liu ⋅ Chunle Guo ⋅ Chongyi Li ⋅ Chen Li ⋅ Jing LYU
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 176
BiFM: Bidirectional Flow Matching for Few-Step Image Editing and Generation
Yasong Dai ⋅ Zeeshan Hayder ⋅ David Ahmedt-Aristizabal ⋅ Hongdong Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 177
DTG-Restore: Training-Free Diffusion Refinement for Generative Video Super-Resolution
Hidir Yesiltepe ⋅ Koutilya PNVR ⋅ Gaurav Suresh Pathak ⋅ Navaneeth Bodla ⋅ Bharat Singh ⋅ Pinar Yanardag ⋅ Jinrong Xie
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 178
VABench: A Comprehensive Benchmark for Audio-Video Generation
Daili Hua ⋅ Xizhi Wang ⋅ Bohan Zeng ⋅ Xinyi Huang ⋅ Hao Liang ⋅ Junbo Niu ⋅ Xinlong Chen ⋅ Quanqing Xu ⋅ Wentao Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 179
Relightful Video Portrait Harmonization
Jun Myeong Choi ⋅ Jae Shin Yoon ⋅ Luchao Qi ⋅ Roni Sengupta ⋅ Joon-Young Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 180
DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training
Haoran Feng ⋅ Dizhe Zhang ⋅ Xiangtai Li ⋅ Bo Du ⋅ Lu Qi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 181
DVAR: Dynamic Visual Autoregressive Modeling for Image Super-Resolution
Yu Zheng ⋅ Kai Zhang ⋅ Wei Zhu ⋅ Qingguo Liu ⋅ Xiantao Hu ⋅ Jun Li ⋅ Jian Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 182
Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers
Yuhe Liu ⋅ Zhenxiong Tan ⋅ Yujia Hu ⋅ Songhua Liu ⋅ Xinchao Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 183
LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation
yushi Huang ⋅ Xingtong Ge ⋅ RUIHAO GONG ⋅ Chengtao Lv ⋅ Jun Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 184
UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution
Thien Tan Cao ⋅ Phan Thi Thu Trang ⋅ Nghiem Duc ⋅ Ho Ngoc Anh ⋅ Nguyen Duc Dung ⋅ Duc Dung Nguyen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 185
EMR-Diff: Edge-aware Multimodal Residual Diffusion Model for Hyperspectral Image Super-resolution
Tao Zhang ⋅ Shengtao Yao ⋅ Rong Zeng ⋅ Zunjie Zhu ⋅ Bolun Zheng ⋅ Yaoqi Sun ⋅ Ying Fu ⋅ Chenggang Yan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 186
RAW-Domain Degradation Models for Realistic Smartphone Super-Resolution
Ali Mosleh ⋅ Faraz Ali ⋅ Fengjia Zhang ⋅ Stavros Tsogkas ⋅ Junyong Lee ⋅ Michael S. Brown ⋅ Alex Levinshtein
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 187
One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution
Yushun Fang ⋅ Yuxiang Chen ⋅ Shibo Yin ⋅ Qiang Hu ⋅ Jiangchao Yao ⋅ Ya Zhang ⋅ Xiaoyun Zhang ⋅ Yanfeng Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 188
FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution
Seungho Choi ⋅ Jeahun Sung ⋅ Jihyong Oh
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 189
HDW-SR: High-Frequency Guided Diffusion Model based on Wavelet Decomposition for Image Super-Resolution
Chao Yang ⋅ Boqian Zhang ⋅ Jinghao Xu ⋅ Guang Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 190
Unifying Precise Keyframes and Semantic Control via Multi-level Diffusion
Linjun Wu ⋅ Jiejia Yu ⋅ Leyang Jin ⋅ He Wang ⋅ Bowen Zheng ⋅ Xu Yang ⋅ Hao Jiang ⋅ Fei Xia ⋅ Fei Ling ⋅ Jun Deng ⋅ Xiaogang Jin
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 191
CIGPose: Causal Intervention Graph Neural Network for Whole-Body Pose Estimation
Bohao Li ⋅ Zhicheng Cao ⋅ Huixian Li ⋅ Yangming Guo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 192
Pressure2Motion: Hierarchical Human Motion Reconstruction from Ground Pressure with Text Guidance
Zhengxuan Li ⋅ Qinhui Yang ⋅ Yiyu Zhuang ⋅ Chuan Guo ⋅ Xinxin Zuo ⋅ Xiaoxiao Long ⋅ Yao Yao ⋅ Xun Cao ⋅ Qiu Shen ⋅ Hao Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 193
From 3D Pose to Prose: Biomechanics-Grounded Vision–Language Coaching
Yuyang Ji ⋅ Yixuan Shen ⋅ Shengjie Zhu ⋅ Yu Kong ⋅ Feng Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 194
InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions
Sirui Xu ⋅ Samuel Schulter ⋅ Morteza Ziyadi ⋅ Xialin He ⋅ Xiaohan Fei ⋅ Yu-Xiong Wang ⋅ Liang-Yan Gui
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 195
MoCoDiff: A Controllable Autoregressive Diffusion Model for Expressive Motion Generation
Wenfeng Song ⋅ Xuehan Wang ⋅ Shuai Li ⋅ Yi Chen ⋅ Yuting Guo ⋅ Zhenyu Wu ⋅ Xingliang Jin ⋅ Chenglizhao Chen ⋅ Fei Hou ⋅ Hongyu Wu ⋅ Aimin Hao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 196
W2W: Language-Model-Based Trajectory Prediction with Reinforcement Learning
Zirui Xu ⋅ Biao Yang ⋅ rongrong Ni ⋅ Zhongkai Zhou ⋅ Shaobo Shen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 197
ParTY: Part-Guidance for Expressive Text-to-Motion Synthesis
KunHo Heo ⋅ SuYeon Kim ⋅ Yonghyun Gwon ⋅ Youngbin Kim ⋅ MyeongAh Cho
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 198
Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models
Pablo Ruiz-Ponce ⋅ Sergio Escalera ⋅ Jose Garcia-Rodriguez ⋅ Jiankang Deng ⋅ Rolandos Alexandros Potamias
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 199
Unified Number-Free Text-to-Motion Generation Via Flow Matching
Guanhe Huang ⋅ Oya Celiktutan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 200
Generative Diffusion Priors for 3D Mapping of the Dark Universe
Brandon Zhao ⋅ Diana Scognamiglio ⋅ Olivier Doré ⋅ Katherine L. Bouman
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 201
FlowPalm: Optical Flow Driven Non-Rigid Deformation for Geometrically Diverse Palmprint Generation
yuchen zou ⋅ Huikai Shao ⋅ Lihuang Fang ⋅ Zhipeng Xiong ⋅ Dexing Zhong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 202
DiffuView: Multi-View Diffusion Pretraining for 3D Aware Robotic Manipulation
Kaizhao Zhang ⋅ Tian Niu ⋅ Tianyu Liu ⋅ Chenen Guo ⋅ Zijun Xu ⋅ Qingda Hu ⋅ Wenchao Ding
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 203
Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers
Binxu Wang ⋅ Jingxuan Fan ⋅ Xu Pan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 204
Dual Ascent Diffusion for Inverse Problems
Minseo Kim ⋅ Axel Levy ⋅ Gordon Wetzstein
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 205
Forecast the Principal, Stabilize the Residual: Subspace-Aware Feature Caching for Diffusion Transformers
Guantao Chen ⋅ Shikang Zheng ⋅ Yuqi Lin ⋅ Linfeng Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 206
Spatial-Spectral Residuals Informed Diffusion Neural Operator for Pan-sharpening
jiahan huang ⋅ Ran Ran ⋅ Junming Hou ⋅ Zihao Chen ⋅ Xiaofeng Cong ⋅ Junling Li ⋅ Liang-Jian Deng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 207
PhyOceanCast: Global Ocean Forecasting with Physics-Informed Diffusion
Qixiu Li ⋅ Xiang Zhu ⋅ Xiaoyong Li ⋅ Xiaolong Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 208
Pixel Motion Diffusion is What We Need for Robot Control
E-Ro Nguyen ⋅ Yichi Zhang ⋅ Kanchana Ranasinghe ⋅ Xiang Li ⋅ Michael Ryoo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 209
ORIC: Benchmarking Object Recognition under Contextual Incongruity in Large Vision-Language Models
Zhaoyang Li ⋅ Zhan Ling ⋅ Yuchen Zhou ⋅ Litian Gong ⋅ Erdem Biyik ⋅ Hao Su
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 210
M3Grounder: Mask-Based Multi-Span and Multi-Granular Grounding for Document QA
Venkata Kesav Venna ⋅ Sai Madhusudan Gunda ⋅ Jyothi Swaroopa Jinka ⋅ Hrithik Sagar Rachakonda ⋅ Anirudh Srinivasan ⋅ Ravi Kiran Sarvadevabhatla
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 211
BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models
Shengao Wang ⋅ Wenqi Wang ⋅ Zecheng Wang ⋅ Max Whitton ⋅ Michael Wakeham ⋅ Arjun Chandra ⋅ Joey Huang ⋅ Pengyue Zhu ⋅ Helen Chen ⋅ David Li ⋅ Jeffrey Li ⋅ Shawn Li ⋅ Andrew Zagula ⋅ Amy Zhao ⋅ Andrew Zhu ⋅ Sayaka Nakamura ⋅ Yuki Yamamoto ⋅ Jerry Yokono ⋅ Aaron Mueller ⋅ Bryan A. Plummer ⋅ Kate Saenko ⋅ Venkatesh Saligrama ⋅ Boqing Gong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 212
Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training
Gengluo Li ⋅ Pengyuan Lyu ⋅ Chengquan Zhang ⋅ Huawen Shen ⋅ Liang Wu ⋅ Xingyu Wan ⋅ Gangyan Zeng ⋅ Han Hu ⋅ Can Ma ⋅ Yu ZHOU
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 213
RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding
Xiyan Liu ⋅ Han Wang ⋅ Yuhu Wang ⋅ JUNJIE CAI ⋅ Zhe Cao ⋅ Jianzhong Yang ⋅ Zhen Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 214
UNICBench: UNIfied Counting Benchmark for MLLM
Chenggang Rong ⋅ Tao Han ⋅ Zhiyuan Zhao ⋅ Yaowu Fan ⋅ Jia Wan ⋅ Song Guo ⋅ Yuan Yuan ⋅ Junyu Gao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 215
CaptionQA: Is Your Caption as Useful as the Image Itself?
Shijia Yang ⋅ Yunong Liu ⋅ Bohan Zhai ⋅ Ximeng Sun ⋅ Zicheng Liu ⋅ Emad Barsoum ⋅ Manling Li ⋅ Chenfeng Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 216
EgoProx: Evaluating MLLMs on Egocentric 3D Proximity Reasoning Across a Cognitive Hierarchy
Jinzhao Li ⋅ Yinuo Chen ⋅ Dongxu Piao ⋅ Panwang Pan ⋅ Yifan Yu ⋅ Dong Wang ⋅ Honglei Yan ⋅ Liang Yue ⋅ Shaofei Wang ⋅ Yixin Chen ⋅ Siyuan Huang ⋅ Miao Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 217
VULCAN: Tool-Augmented Multi Agents for Iterative 3D Object Arrangement
Zhengfei Kuang ⋅ Rui Lin ⋅ Long Zhao ⋅ Gordon Wetzstein ⋅ Saining Xie ⋅ Sanghyun Woo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 218
EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding
Seungjun Lee ⋅ Zihan Wang ⋅ Yunsong Wang ⋅ Gim Hee Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 219
Efficient Encoder-Free Fourier-based 3D Large Multimodal Model
Guofeng Mei ⋅ Wei Lin ⋅ Luigi Riz ⋅ Yujiao Wu ⋅ Yiming Wang ⋅ Fabio Poiesi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 220
Socratic-Geo: Synthetic Data Generation and Cross-Modal Geometric Reasoning via Multi-Agent Interaction
Zhengbo Jiao ⋅ Zifan Zhang ⋅ Shaobo Wang ⋅ Wei Wang ⋅ Bing Zhao ⋅ hu wei ⋅ Linfeng Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 221
HAMMER: Harnessing MLLMs via Cross-Modal Integration for Intention-Driven 3D Affordance Grounding
Lei Yao ⋅ Yong Chen ⋅ YUEJIAO SU ⋅ Yi Wang ⋅ Moyun Liu ⋅ Lap-Pui Chau
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 222
Proxy3D: Efficient 3D Representations for Vision-Language Models via Semantic Clustering and Alignment
Jerry Jiang ⋅ Haowen Sun ⋅ Denis Gudovskiy ⋅ Yohei Nakata ⋅ Tomoyuki Okuno ⋅ Kurt Keutzer ⋅ Wenzhao Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 223
ReLaGS: Relational Language Gaussian Splatting
Yaxu Xie ⋅ Abdalla Arafa ⋅ Alireza Javanmardi ⋅ Christen Millerdurai ⋅ Jia Cheng Hu ⋅ Shaoxiang Wang ⋅ Alain Pagani ⋅ Didier Stricker
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 224
3D-IDE: 3D Implicit Depth Emergent
Chushan Zhang ⋅ Ruihan Lu ⋅ Jinguang Tong ⋅ Yikai Wang ⋅ Hongdong Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 225
FunFact: Building Probabilistic Functional 3D Scene Graphs via Factor-Graph Reasoning
Zhengyu Fu ⋅ René Zurbrügg ⋅ Kaixian Qu ⋅ Marc Pollefeys ⋅ Marco Hutter ⋅ Hermann Blum ⋅ Zuria Bauer
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 226
Parse, Search, and Confirmation: Training-Free Aerial Vision-and-Dialog Navigation with Chain-of-Thought Reasoning and Structured Spatial Memory
Yu Qi ⋅ Hongyu Li ⋅ Shaofei Huang ⋅ Tianrui Hui ⋅ Yaxiong Wang ⋅ Lechao Cheng ⋅ Zhun Zhong ⋅ Si Liu ⋅ Meng Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 227
4DP-QA: Scalable QA for 4D Perception in Vision Language Models
Seokju Cho ⋅ Abhishek Badki ⋅ Hang Su ⋅ Jindong Jiang ⋅ Ziyao Zeng ⋅ Seungryong Kim ⋅ Sifei Liu ⋅ Orazio Gallo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 228
LASAR: Towards Spatio-temporal Reasoning with Latent Cognitive Map
Jinzhou Tang ⋅ Sidi Liu ⋅ Waikit Xiu ⋅ weixing chen ⋅ Keze Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 229
Text-Phase Synergy Network with Dual Priors for Unsupervised Cross-Domain Image Retrieval
Jing Yang ⋅ Hui Xue ⋅ Shipeng Zhu ⋅ Pengfei Fang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 230
EagleNet: Energy-Aware Fine-Grained Relationship Learning Network for Text-Video Retrieval
Yuhan Chen ⋅ Pengwen Dai ⋅ Chuan Wang ⋅ Dayan Wu ⋅ Xiaochun Cao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 231
PIX-TAB: Efficient PIXel-Precise TABle Structure Recognition Approach with Speculative Decoding and Region-Based Image Segmentation
Viktor Zaytsev ⋅ Olena Vynokurova ⋅ Pavlo Tytarchuk ⋅ Dmytro Kozii ⋅ Vitalii Pohribnyi ⋅ Olga Radyvonenko ⋅ Artem Shcherbina
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 232
CARLoS: Retrieval via Concise Assessment Representation of LoRAs at Scale
Shahar Sarfaty ⋅ Adi Haviv ⋅ Uri Y. Hacohen ⋅ Niva Elkin-Koren ⋅ Roi Livni ⋅ Amit H. Bermano
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 233
Camouflage-aware Image-Text Retrieval via Expert Collaboration
Yao Jiang ⋅ Zhongkuan Mao ⋅ xuan wu ⋅ Keren Fu ⋅ Qijun Zhao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 234
TriSim: Tri-Dimensional Similarity Modeling with Extreme Value Theory for False-Negative Mitigation in Remote Sensing Image-Text Retrieval
Chengyu Zheng ⋅ Hanzhang Lu ⋅ Jie Nie ⋅ Shan Du
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 235
TIGER: A Unified Framework for Time, Images and Geo-location Retrieval
David G. ⋅ Sirnam Swetha ⋅ Mubarak Shah
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 236
Mistake Attribution: Fine-Grained Mistake Understanding in Egocentric Videos
Yayuan Li ⋅ Aadit Jain ⋅ Filippos Bellos ⋅ Jason J. Corso
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 237
VidTAG: Temporally Aligned Video to GPS Geolocalization with Denoising Sequence Prediction at a Global Scale
Parth Parag Kulkarni ⋅ Rohit Gupta ⋅ Prakash Chandra Chhipa ⋅ Mubarak Shah
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 238
Stitch-a-Demo: Creating Video Demonstrations from Multistep Descriptions
Chi Hsuan Wu ⋅ Kumar Ashutosh ⋅ Kristen Grauman
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 239
Prototypical Action Reasoning Facilitated by Vision-Language Alignment for Egocentric Action Anticipation
jiang shao ⋅ Xinbo Zhao ⋅ Wenyin Tuo ⋅ XiaoChun Zou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 240
AdaSpot: Spend Resolution Where It Matters for Precise Event Spotting
Artur Xarles i Esparraguera ⋅ Sergio Escalera ⋅ Thomas B. Moeslund ⋅ Albert Clapés
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 241
Unique Lives, Shared World: Learning from Single-Life Videos
Tengda Han ⋅ Sayna Ebrahimi ⋅ Dilara Gokay ⋅ Li Yang Ku ⋅ Maks Ovsjanikov ⋅ Iva Babukova ⋅ Daniel Zoran ⋅ Viorica Patraucean ⋅ Joao Carreira ⋅ Andrew Zisserman ⋅ Dima Damen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 242
Symphony: A Cognitively-Inspired Multi-Agent System for Long-Video Understanding
海洋 闫 ⋅ Hongyun Zhou ⋅ Peng Xu ⋅ Xiaoxue Feng ⋅ Mengyi Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 243
VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding
Yufei Yin ⋅ Qianke Meng ⋅ Minghao Chen ⋅ Jiajun Ding ⋅ Zhenwei Shao ⋅ Zhou Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 244
Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding
Wang Chen ⋅ Yuhui zeng ⋅ Yongdong Luo ⋅ Tianyu Xie ⋅ Luojun Lin ⋅ Jiayi Ji ⋅ Yan Zhang ⋅ Xiawu Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 245
SVAgent: Storyline-guided Long Video Understanding via Cross-Modal Multi-Agent Collaboration
zhongyu yang ⋅ Zuhao Yang ⋅ SHUO ZHAN ⋅ Tan Yue ⋅ Wei Pang ⋅ Yingfang Yuan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 246
Frame2Freq: Spectral Adapters for Fine-Grained Video Understanding
Thinesh Thiyakesan Ponbagavathi ⋅ Constantin Seibold ⋅ Alina Roitberg
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 247
Structural Graph Probing of Vision–Language Models
Haoyu He ⋅ Yue Zhuo ⋅ Yu Zheng ⋅ Qi R. Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 248
Saliency-R1: Enforcing Interpretable and Faithful Vision-language Reasoning via Saliency-map Alignment Reward
Shizhan Gong ⋅ Minda Hu ⋅ Qiyuan Zhang ⋅ Chen Ma ⋅ Qi Dou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 249
Hidden Monotonicity: Explaining Deep Neural Networks via their DC Decomposition
Jakob Paul Zimmermann ⋅ Georg Loho
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 250
MaskDiME: Adaptive Masked Diffusion for Precise and Efficient Visual Counterfactual Explanations
Changlu Guo ⋅ Anders Nymark Christensen ⋅ Anders Bjorholm Dahl ⋅ Morten Hannemose
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 251
TRANSPORTER: Transferring Visual Semantics from VLM Manifolds
Alexandros Stergiou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 252
Relational Visual Similarity
Thao Nguyen ⋅ Sicheng Mo ⋅ Krishna Kumar Singh ⋅ Yilin Wang ⋅ Jing Shi ⋅ Nick Kolkin ⋅ Eli Shechtman ⋅ Yong Jae Lee ⋅ Yuheng Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 253
PointCNN++: Performant Convolution on Native Points
Lihan Li ⋅ Haofeng Zhong ⋅ Rui Bu ⋅ Mingchao Sun ⋅ Wenzheng Chen ⋅ Baoquan Chen ⋅ Yangyan Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 254
Fast Markov Random Field Optimisation for Topologically Noisy 3D Shape Matching
Paul Roetzer ⋅ Johan Thunberg ⋅ Zorah Lähner ⋅ Florian Bernard
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 255
LitePT: Lighter Yet Stronger Point Transformer
Yuanwen Yue ⋅ Damien Robert ⋅ Jianyuan Wang ⋅ Sunghwan Hong ⋅ Jan D. Wegner ⋅ Christian Rupprecht ⋅ Konrad Schindler
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 256
SuP: Sub-cloud Driven Point Cloud Registration
Sheldon Fung ⋅ Wei Pan ⋅ Ling Cao ⋅ Fei Hou ⋅ Ling Chen ⋅ Shasha Mao ⋅ Hongdong Li ⋅ Xuequan Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 257
PQDT: Pseudo-Query Dual Transformer for Robust Point Cloud Restoration
Haoqing Wu ⋅ Alexa Nawotki ⋅ Jochen Garcke
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 258
Test-Time Training for LiDAR Semantic Segmentation under Corruption via Geometric Inlier Discrimination
Hyeonseong Kim ⋅ Hyun-Kurl Jang ⋅ Kuk-Jin Yoon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 259
MHopReg: Efficient Hierarchical Multi-Hop Graph Search for Point Cloud Registration
Yue Wu ⋅ Feng Xiao ⋅ Yongzhe Yuan ⋅ Hao Li ⋅ Kaiyuan Feng ⋅ Maoguo Gong ⋅ Qiguang Miao ⋅ Wenping Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 260
GEM: Generating LiDAR World Model via Deformable Mamba
Yang Wu ⋅ Zhaojiang Liu ⋅ Qiang Meng ⋅ Youquan Liu ⋅ renliang Weng ⋅ Jianjun Qian ⋅ Jian Yang ⋅ Jin Xie
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 261
Hybrid Robust Collaborative Perception with LiDAR-4D Radar Fusion under Adverse Weather Conditions
Yuquan Yang ⋅ hui zhang ⋅ Wenyu Lu ⋅ Ziyin Zhang ⋅ Chuanming Zhang ⋅ Xiaohua Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 262
Task-Driven Implicit Representations for Automated Design of LiDAR Systems
Nikhil Behari ⋅ Aaron Young ⋅ Tzofi Klinghoffer ⋅ Akshat Dave ⋅ Ramesh Raskar
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 263
Hierarchical Point-Patch Fusion with Adaptive Patch Codebook for 3D Shape Anomaly Detection
Xueyang Kang ⋅ Zizhao Li ⋅ Tian Lan ⋅ Dong Gong ⋅ Kourosh Khoshelham ⋅ Liangliang Nan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 264
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
Zhengyang Sun ⋅ Yu Chen ⋅ Xin Zhou ⋅ Xiaofan Li ⋅ Xiwu Chen ⋅ Dingkang Liang ⋅ Xiang Bai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 265
Beyond Layer-Wise Merging: Chain-of-Merging for Vision-Language Models
Xinyu Zhang ⋅ Yuxuan Dong ⋅ Lingling Zhang ⋅ Chengyou Jia ⋅ Zhuohang Dang ⋅ YiXing Yao ⋅ Yaqiang Wu ⋅ Basura Fernando ⋅ Jun Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 266
GazeShift: Unsupervised Gaze Estimation and Dataset for VR
Gil Shapira ⋅ Ishay Goldin ⋅ Evgeny Artyomov ⋅ Donghoon Kim ⋅ Yosi Keller ⋅ Niv Zehngut
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 267
Improving Calibration in Test-Time Prompt Tuning for Vision-Language Models via Data-Free Flatness-Aware Prompt Pretraining
Hyeonseo Jang ⋅ Jaebyeong Jeon ⋅ Joong-won Hwang ⋅ Kibok Lee
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 268
Reevaluating the Intra-Modal Misalignment Hypothesis in CLIP
Jonas Herzog ⋅ Yue Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 269
Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented Design
Haoxiang Sun ⋅ Tao Wang ⋅ Chenwei Tang ⋅ Li Yuan ⋅ Jiancheng Lv
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 270
Soft Modality-Guided Expert Specialization in MoE-VLMs
Zi-Hao Bo ⋅ Yaqian Li ⋅ Anzhou Hou ⋅ rinyoichi takezoe ⋅ Ertao Zhao ⋅ Tianxiang Pan ⋅ Jiale Yan ⋅ Mo Guang ⋅ Kaiwen Long
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 271
CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models
Nan Zhou ⋅ Huiqun Wang ⋅ Yaoyan Zheng ⋅ Di Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 272
Retrieving Counterfactuals Improves Visual In-Context Learning
Guangzhi Xiong ⋅ Sanchit Sinha ⋅ Zhenghao He ⋅ Aidong Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 273
AutoRegressive Generation with B-rep Holistic Token Sequence Representation
Jiahao Li ⋅ Yunpeng Bai ⋅ Yongkang Dai ⋅ Hao Guo ⋅ Hongping Gan ⋅ Yilei Shi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 274
VecGlypher: Unified Vector Glyph Generation with Language Models
Xiaoke Huang ⋅ Bhavul Gauri ⋅ Kam-Woh Ng ⋅ Tony Ng ⋅ Mengmeng Xu ⋅ Zhiheng Liu ⋅ Weiming Ren ⋅ Zhaochong An ⋅ Zijian Zhou ⋅ Haonan Qiu ⋅ Yuyin Zhou ⋅ Sen He ⋅ Ziheng Wang ⋅ Tao Xiang ⋅ Xiao Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 275
NERFIFY: A Multi-Agent Framework for Turning NeRF Papers into Code
Seemandhar Jain ⋅ Keshav Gupta ⋅ Kunal Gupta ⋅ Manmohan Chandraker
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 276
Diagram2Structure: Unlocking LLMs' Diagram Comprehension through DiagramDiff, an Offline Diagram Structuring Framework
Haoxiang Hu ⋅ Yaokun Li ⋅ Zeyuan Huang ⋅ Cangjun Gao ⋅ Qiang He ⋅ Qingkun Li ⋅ Xiaoming Deng ⋅ Cuixia Ma ⋅ Yu-Kun Lai ⋅ Yong-Jin Liu ⋅ Hongan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 277
ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement
Zhihang Liu ⋅ Xiaoyi Bao ⋅ Pandeng Li ⋅ Junjie Zhou ⋅ Zhaohe Liao ⋅ Yefei He ⋅ Kaixun Jiang ⋅ Chenwei Xie ⋅ Yun Zheng ⋅ Hongtao Xie
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 278
GardenDesigner: Encoding Aesthetic Principles into Jiangnan Garden Construction via a Chain of Agents
Mengtian Li ⋅ Fan Yang ⋅ Ruixue Xiong ⋅ Yiyan Fan ⋅ Zhifeng Xie ⋅ Zeyu Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 279
ShadowDraw: From Any Object to Shadow-Drawing Compositional Art
Rundong Luo ⋅ Noah Snavely ⋅ Wei-Chiu Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 280
End-to-End Hyper-Relational Information Extraction for Engineering Diagrams via Dynamically Tokenized Relation Transformer
Tianyou Bai ⋅ Yan-Ming Zhang ⋅ Zixiang Zhang ⋅ Jibin Zhou ⋅ Fei Yin ⋅ Chenglin Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 281
When Anonymity Breaks: Identifying Models Behind Text-to-Image Leaderboards
Ali Naseh ⋅ Anshuman Suri ⋅ Yuefeng Peng ⋅ Harsh Chaudhari ⋅ Alina Oprea ⋅ Amir Houmansadr
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 282
Bias at the End of the Score
Salma Abdel Magid ⋅ Grace Guo ⋅ Esin Tureci ⋅ Amaya Dharmasiri ⋅ Vikram V. Ramaswamy ⋅ Hanspeter Pfister ⋅ Olga Russakovsky
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 283
PECCVAI: Overcoming the Brittleness of AI Image Watermarking Under Visual Paraphrasing Attacks
Shreyas Dixit ⋅ Ashhar Aziz ⋅ Shashwat Bajpai ⋅ Vasu Sharma ⋅ Aman Chadha ⋅ Vinija Jain ⋅ Amitava Das
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 284
Dynamic Token Reweighting for Robust Vision-Language Models
Tanqiu Jiang ⋅ Jiacheng Liang ⋅ Rongyi Zhu ⋅ Jiawei Zhou ⋅ Fenglong Ma ⋅ Ting Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 285
COPYLENS: Towards Copyrighted Characters Infringement Detection via Copyright-Aware Prompt Learning
Yaoyu Jin ⋅ Xiaochun Yang ⋅ Hong Liu ⋅ Leixia Wang ⋅ Jian Li ⋅ Rui Ding ⋅ Bin Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 286
Closed-Form Concept Erasure via Double Projections
CHI ZHANG ⋅ Jingpu Cheng ⋅ Zhixian Wang ⋅ Ping Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 287
Adaptive Bayesian Early-Exit Networks for Efficient Non-Transferable Learning
Siyu Luan ⋅ Yan Li ⋅ Zhong Chen ⋅ Zhenyi Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 288
Stake the Points: Structure-Faithful Instance Unlearning
Kiseong Hong ⋅ JungKyoo Shin ⋅ Eunwoo Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 289
Federated Active Learning Under Extreme Non-IID and Global Class Imbalance
Chen-Chen Zong ⋅ Shengjun Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 290
FedRG: Unleashing the Representation Geometry for Federated Learning with Noisy Clients
Tian Wen ⋅ Zhiqin Yang ⋅ Yonggang Zhang ⋅ Xuefeng Jiang ⋅ Hao Peng ⋅ Yuwei Wang ⋅ Bo Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 291
FedCART: Tackling Long-Tailed Distributions in Federated Adversarial Training via Classifier Refinement
Yuchen Qin ⋅ Yizhi Zhou ⋅ Junxiao Wang ⋅ Xin Xie ⋅ Heng QI
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 292
Generalized and Personalized Federated Learning with Black-Box Foundation Models via Orthogonal Transformations
Eun Gyung Kong ⋅ Jewon Yeom ⋅ Yonghoon Jeon ⋅ Taesup Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 293
Fully Decentralized Certified Unlearning
Hithem Lamri ⋅ Michail Maniatakos
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 294
Fed-ADE: Adaptive Learning Rate for Federated Post-adaptation under Distribution Shift
Heewon Park ⋅ Mugon Joe ⋅ Miru Kim ⋅ Kyungjin Im ⋅ Minhae Kwon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 295
Towards Streaming Referring Video Segmentation via Large Language Model
Wenkang Zhang ⋅ Kaicheng Yang ⋅ Xiang An ⋅ Qiang Li ⋅ Ziyong Feng ⋅ Wankou Yang ⋅ Jiankang Deng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 296
Multi-speaker Attention Alignment for Multimodal Social Interaction
LIANGYANG OUYANG ⋅ Yifei Huang ⋅ Mingfang Zhang ⋅ Caixin Kang ⋅ Ryosuke Furuta ⋅ Yoichi Sato
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 297
OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding
Minghang Zheng ⋅ Zihao Yin ⋅ Yi Yang ⋅ Yuxin Peng ⋅ Yang Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 298
SARL-STG: A Spatially Aware Reinforcement Learning Framework for Refining MLLMs in Spatio-Temporal Video Grounding
Hong Gao ⋅ Xiangkai Xu ⋅ Bin Zhong ⋅ Junjie Yin ⋅ Fangyu Kang ⋅ Yutong Xu ⋅ Xiugang Dong ⋅ Xurui Gao ⋅ Min-Ling Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 299
VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding
Shihao Wang ⋅ Guo Chen ⋅ De-An Huang ⋅ Zhiqi Li ⋅ Minghan LI ⋅ Guilin Liu ⋅ Jan Kautz ⋅ Jose M. Alvarez ⋅ Lei Zhang ⋅ Zhiding Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 300
DeRVOS: Decoupling Consistent Trajectory Generation and Multimodal Understanding for Referring Video Object Segmentation
WENXUAN CHENG ⋅ Ming Dai ⋅ Huimin Lu ⋅ Wankou Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 301
UniCompress: Token Compression for Unified Vision–Language Understanding and Generation
Ziyao Wang ⋅ Chen Chen ⋅ Jingtao Li ⋅ Weiming Zhuang ⋅ Jiabo Huang ⋅ Ang Li ⋅ Lingjuan Lv
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 302
StreamingTOM: Streaming Token Compression for Efficient Video Understanding
Xueyi Chen ⋅ Keda Tao ⋅ Kele Shao ⋅ Huan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 303
SCoRe: Salience-Coverage Reduction for Vision Token Pruning in Vision-Language Models
Tong Xu ⋅ Hailong Shi ⋅ Xingyu Gao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 304
VLM-PTQ: Efficient Post-Training Quantization for Large Vision-Language Models
Juncan Deng ⋅ Kejie Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 305
Aligning What Vision-Language Models See and Perceive with Adaptive Information Flow
Chengxin Liu ⋅ Wonseok Choi ⋅ Chenshuang Zhang ⋅ Tae-Hyun Oh
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 306
Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization
Chenwei Jia ⋅ Baoting Li ⋅ Xuchong Zhang ⋅ Mingzhuo Wei ⋅ Bochen Lin ⋅ Hongbin Sun
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 307
Rethinking Token Reduction for Large Vision-Language Models
Yi Wang ⋅ Haofei Zhang ⋅ Qihan Huang ⋅ Anda Cao ⋅ Gongfan Fang ⋅ Wei Wang ⋅ Xuan Jin ⋅ Jie Song ⋅ Mingli Song ⋅ Xinchao Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 308
Prototype-based Causal Intervention for Multi-Label Image Classification
Yanmin Li ⋅ Zhilong Mao ⋅ Mao Wang ⋅ Lihua Liu ⋅ Jibing Wu ⋅ Weidong Bao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 309
FAST: Topology-Aware Frequency-Domain Distribution Matching for Coreset Selection
Jin Cui ⋅ Boran Zhao ⋅ Jiajun Xu ⋅ Jiaqi Guo ⋅ Shuo Guan ⋅ Pengju Ren
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 310
Face-Guided Sentiment Boundary Enhancement for Weakly-Supervised Temporal Sentiment Localization
Cailing Han ⋅ Zhangbin Li ⋅ Jinxing Zhou ⋅ Wei Qian ⋅ Jingjing Hu ⋅ Yanghao Zhou ⋅ Zhangling Duan ⋅ Dan Guo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 311
Evidential Deep Partial Label Learning to Quantify Disambiguation Uncertainty
Jinfu Fan ⋅ Jiangnan Li ⋅ Xiaohui Zhong ⋅ Kangrui Ren ⋅ Zhencun Jiang ⋅ 福建话 赣方言 ⋅ Tianhao Gu ⋅ Linqing Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 312
Unlocking Strong Supervision: A Data-Centric Study of General-Purpose Audio Pre-Training Methods
Xuanru Zhou ⋅ Yiwen Shao ⋅ Wei-Cheng Tseng ⋅ Dong Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 313
Revisiting Learning with Noisy Labels: Active Forgetting and Noise Suppression
Mengmeng Sheng ⋅ Zeren Sun ⋅ Tao Chen ⋅ Jinshan Pan ⋅ Yazhou Yao ⋅ Fumin Shen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 314
PAF: Perturbation-Aware Filtering for Open-Set Semi-Supervised Learning
Yinan Han ⋅ Qing-Yuan Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 315
Global-Graph Guided and Local-Graph Weighted Contrastive Learning for Unified Clustering on Incomplete and Noise Multi-View Data
Hongqing He ⋅ Jie Xu ⋅ Wenyuan Yang ⋅ Yonghua Zhu ⋅ Guoqiu Wen ⋅ Xiaofeng Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 316
Enhancing Out-of-Distribution Detection with Extended Logit Normalization
Yifan Ding ⋅ Xixi Liu ⋅ Jonas Unger ⋅ Gabriel Eilertsen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 317
Unleashing VLA Potentials in Autonomous Driving via Explicit Learning from Failures
Yuechen Luo ⋅ Fang Li ⋅ Qimao Chen ⋅ Shaoqing Xu ⋅ Jiaxin Liu ⋅ Ziying Song ⋅ Zhi-xin Yang ⋅ Fuxi Wen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 318
Unposed-to-3D: Learning Simulation-Ready Vehicles from Real-World Images
Hongyuan Liu ⋅ Bochao Zou ⋅ Qiankun Liu ⋅ Haochen Yu ⋅ Qi Mei ⋅ Jianfei Jiang ⋅ Chen Liu ⋅ Cheng Bi ⋅ Zhao Wang ⋅ Xueyang Zhang ⋅ Yifei Zhan ⋅ Jiansheng Chen ⋅ Huimin Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 319
SafeDrive: Fine-Grained Safety Reasoning for End-to-End Driving in a Sparse World
Jungho Kim ⋅ Jiyong Oh ⋅ Seunghoon Yu ⋅ Hongjae Shin ⋅ Donghyuk Kwak ⋅ Jun Won Choi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 320
RAG-TP: A General Framework for Vehicle Trajectory Prediction via Retrieval-Augmented Generation
Ziyi Wang ⋅ Yang Zhang ⋅ Guijian Tang ⋅ Chao Zhang ⋅ Shibo Zhang ⋅ Xueqiong Li ⋅ Shaowu Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 321
Perceiving the Near, Reasoning the Distant: Coherent Long-Horizon Trajectory Prediction for Autonomous Driving
Hua Hu ⋅ Zikang Zhou ⋅ Qian Zhou ⋅ Zihao WEN ⋅ Junjie Hu ⋅ Xinhong Chen ⋅ Zhengmin JIANG ⋅ Yung-Hui Li ⋅ Jianping Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 322
Dual-Agent Reinforcement Learning for Adaptive and Cost-Aware Visual–Inertial Odometry
Feiyang Pan ⋅ Shenghe Zheng ⋅ Chunyan Yin ⋅ Guangbin Dou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 323
HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles
Yifan Wang ⋅ Francesco Pittaluga ⋅ Zaid Tasneem ⋅ Chenyu You ⋅ Manmohan Chandraker ⋅ Ziyu Jiang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 324
AMap: Distilling Future Priors for Ahead-Aware Online HD Map Construction
Ruikai Li ⋅ Xinrun Li ⋅ Mengwei Xie ⋅ Hao Shan ⋅ Shoumeng Qiu ⋅ Xinyuan Chang ⋅ Yizhe Fan ⋅ Feng Xiong ⋅ Han Jiang ⋅ Yilong Ren ⋅ Haiyang Yu ⋅ Mu Xu ⋅ Yang Long ⋅ Varun Ojha ⋅ Zhiyong Cui
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 325
WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving
Yifang Xu ⋅ Jiahao Cui ⋅ Zhihao Zhu ⋅ Hanlin Shang ⋅ Shan Luan ⋅ Mingwang Xu ⋅ Feipeng Cai ⋅ Neng Zhang ⋅ Yaoyi Li ⋅ Jia Cai ⋅ Siyu Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 326
PlannerRFT: Reinforcing Diffusion Planners through Closed-Loop and Sample-Efficient Fine-Tuning
Hongchen Li ⋅ Tianyu Li ⋅ Jiazhi Yang ⋅ Mingyang Shang ⋅ Gaoqiang Wu ⋅ Caojun Wang ⋅ Haochen Tian ⋅ Zengrong Lin ⋅ Zhihui Hao ⋅ XianPeng Lang ⋅ Jia Hu ⋅ Hongyang Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 327
MARIS: Marine Open-Vocabulary Instance Segmentation
Bingyu Li ⋅ Feiyu Wang ⋅ Da Zhang ⋅ Zhiyuan Zhao ⋅ Junyu Gao ⋅ Xuelong Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 328
XSeg: A Large-scale X-ray Contraband Segmentation Benchmark For Real-World Security Screening
Hongxia Gao ⋅ Yixin Chen ⋅ Jiali Wen ⋅ Litao Li ⋅ Qianyun Liu ⋅ Kaijie Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 329
Training-Free Open-Vocabulary Camouflaged Object Segmentation via Fine-Grained Object Binding and Adaptive Hybrid Prompt
Peng Ren ⋅ Cheng Jiang ⋅ Chuande Yang ⋅ Fuming Sun ⋅ Tian Bai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 330
M⁴-SAM: Multi-Modal Mixture-of-Experts with Memory-Augmented SAM for RGB-D Video Salient Object Detection
Jiyuan Liu ⋅ Jia Lin ⋅ Xiaofei Zhou ⋅ Runmin Cong ⋅ Deyang Liu ⋅ Zhi Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 331
ReAttnCLIP: Training-Free Open-Vocabulary Remote Sensing Image Segmentation via Re-defined Attention in CLIP
Xin Niu ⋅ Manqi Zhao ⋅ Dongsheng Jiang ⋅ Yingying Wu ⋅ Bing Su
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 332
Mixture of Prototypes for Test-time Adaptive Segmentation
Guangrui Li ⋅ Zhengyu Zhu ⋅ Yongxin Ge
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 333
Reconstruction-Guided Slot Curriculum: Addressing Object Over-Fragmentation in Video Object-Centric Learning
WonJun Moon ⋅ Hyun Seok Seong ⋅ Jae-Pil Heo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 334
ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark
Joanne Lin ⋅ Ruirui Lin ⋅ Yini Li ⋅ David Bull ⋅ Nantheera Anantrasirichai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 335
Decouple Your Discovery and Memory in Continual Generalized Category Discovery
Jiawei Yu ⋅ Zijian Gao ⋅ Xingxing Zhang ⋅ Xuan Liu ⋅ Huaimin Wang ⋅ Kele Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 336
Beyond the Static World: Continual Category Discovery under Visual Drift
Wei Feng ⋅ Yiwen Jiang ⋅ Sijin Zhou ⋅ Zongyuan Ge
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 337
Memory-Efficient Transfer Learning with Fading Side Networks via Masked Dual Path Distillation
Yutong Zhang ⋅ Jiaxin Chen ⋅ Honglin Chen ⋅ Kaiqi Zheng ⋅ Shengcai Liao ⋅ Hanwen Zhong ⋅ Weixin Li ⋅ Yunhong Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 338
SAME: Sparse and Anchored Model Editing for Heterogeneous Incremental Learning under Limited Data
Zixuan Duan ⋅ Zeyu Zhang ⋅ Fengyuan Lu ⋅ Shaofeng Zhang ⋅ Wenbin Li ⋅ Qi Fan ⋅ Yang Gao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 339
CHEEM: Continual Learning by Reuse, New, Adapt and Skip - A Hierarchical Exploration-Exploitation Approach
Chinmay Savadikar ⋅ Michelle Dai ⋅ Tianfu Wu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 340
Exemplar-Free Continual Learning for State Space Models
ISAAC NING LEE ⋅ Leila Mahmoodi ⋅ Trung Le ⋅ Mehrtash Harandi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 341
A Faster Path to Continual Learning
Wei Li ⋅ Hangjie Yuan ⋅ Zixiang Zhao ⋅ Borui Kang ⋅ Ziwei Liu ⋅ Tao Feng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 342
Continual Learning for fMRI-Based Brain Disorder Diagnosis via Functional Connectivity Matrices Generative Replay
Qianyu Chen ⋅ Shujian Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 343
BeautyGRPO: Aesthetic Alignment for Face Retouching via Dynamic Path Guidance and Fine-Grained Preference Modeling
Jiachen Yang ⋅ Xianhui Lin ⋅ Yi Dong ⋅ Zebiao Zheng ⋅ Xing Liu ⋅ Hong Gu ⋅ Yanmei Fang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 344
SyncDreamer: Controllable and Expressive Avatar Generation Beyond the Talking Head
Fatemeh Nazarieh ⋅ Zhenhua Feng ⋅ Diptesh Kanojia ⋅ Josef Kittler ⋅ Muhammad Awais
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 345
PerformRecast: Expression and Head Pose Disentanglement for Portrait Video Editing
Jiadong Liang ⋅ Bojun Xiong ⋅ Jie Tian ⋅ Hua Li ⋅ Xiao Long ⋅ Yong Zheng ⋅ Huan Fu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 346
UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
Xuangeng Chu ⋅ Ruicong Liu ⋅ Yifei Huang ⋅ Yun Liu ⋅ YICHEN PENG ⋅ Bo Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 347
PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation
Baiqin Wang ⋅ Xiangyu Zhu ⋅ Fan Shen ⋅ HAO XU ⋅ Zhen Lei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 348
FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction
Shuyuan Tu ⋅ Yueming Pan ⋅ Yinming Huang ⋅ Xintong Han ⋅ Zhen Xing ⋅ Qi Dai ⋅ Kai Qiu ⋅ Chong Luo ⋅ Zuxuan Wu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 349
DriveVLN: Towards Mapless Vision-and-Language Navigation in Autonomous Driving
Dongqian Guo ⋅ Haoran Wei ⋅ Wencheng Han ⋅ Runzhou Tao ⋅ Zhongying Qiu ⋅ Jianfei Yang ⋅ Jianbing Shen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 350
Towards Open Environments and Instructions: General Vision-Language Navigation via Fast-Slow Interactive Reasoning
Li Yang ⋅ Aming Wu ⋅ Zihao Zhang ⋅ Yahong Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 351
Unifying Language-Action Understanding and Generation for Autonomous Driving
Xinyang Wang ⋅ Qian Liu ⋅ WENJIE DING ⋅ Zhao Yang ⋅ Wei Li ⋅ Chang Liu ⋅ Bailin Li ⋅ Kun Zhan ⋅ XianPeng Lang ⋅ Wei Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 352
Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving
Zehao Wang ⋅ Huaide Jiang ⋅ Shuaiwu Dong ⋅ Yuping Wang ⋅ Hang Qiu ⋅ Jiachen Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 353
Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving
Minhao Xiong ⋅ Zichen Wen ⋅ Zhuangcheng Gu ⋅ Xuyang Liu ⋅ Rui Zhang ⋅ Hengrui Kang ⋅ Jiabing Yang ⋅ JUNYUAN ZHANG ⋅ Weijia Li ⋅ Conghui He ⋅ Linfeng Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 354
CGHair: Compact Gaussian Hair Reconstruction with Card Clustering
Haimin Luo ⋅ Srinjay Sarkar ⋅ Albert Mosella-Montoro ⋅ Francisco Vicente Carrasco ⋅ Fernando De la Torre
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 355
HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars
Gent Serifi ⋅ Marcel C.
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 356
Skullptor: High Fidelity 3D Head Reconstruction in Seconds with Multi-View Normal Prediction
Noé Artru ⋅ Rukhshanda Hussain ⋅ Emeline Got ⋅ Alexandre Messier ⋅ David B. Lindell ⋅ Abdallah Dib
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 357
RelightAnyone: A Generalized Relightable 3D Gaussian Head Model
Yingyan Xu ⋅ Pramod Rao ⋅ Sebastian Weiss ⋅ Gaspard Zoss ⋅ Markus Gross ⋅ Christian Theobalt ⋅ Marc Habermann ⋅ Derek Bradley
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 358
Feed-forward Gaussian Registration for Head Avatar Creation and Editing
Malte Prinzler ⋅ Paulo Gotardo ⋅ Siyu Tang ⋅ Timo Bolkart
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 359
Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware Residual Guidance
Xinrong Chen ⋅ Xu Chu ⋅ Yingmin Qiu ⋅ Hengyuan Zhang ⋅ Jing Xiong ⋅ Shiyu Tang ⋅ Shuai Liu ⋅ Shaokang Yang ⋅ Cheng Yang ⋅ Hayden Kwok-Hay So ⋅ Ngai Wong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 360
Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
Chengsheng Zhang ⋅ Chenghao Sun ⋅ Xinyan Jiang ⋅ Wei Li ⋅ Xinmei Tian
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 361
SVHalluc: Benchmarking Speech–Vision Hallucination in Audio-Visual Large Language Models
Chenshuang Zhang ⋅ Kyeong Seon Kim ⋅ Chengxin Liu ⋅ Tae-Hyun Oh
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 362
Same Attention, Different Truths: Put Logit-Lens over Visual Attention to Detect and Mitigate LVLM Object Hallucination
Zichuan Wang ⋅ Songlin Yang ⋅ Bo Peng ⋅ Zhenchen Tang ⋅ Yang Li ⋅ Beibei Dong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 363
Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models
Gengwei Zhang ⋅ Jie Peng ⋅ Zhen Tan ⋅ Mufan Qiu ⋅ Hossein Nourkhiz Mahjoub ⋅ Vaishnav Tadiparthi ⋅ Kwonjoon Lee ⋅ Yanyong Zhang ⋅ Tianlong Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 364
Lyapunov Probes for Hallucination Detection in Large Foundation Models
Bozhi Luan ⋅ Gen Li ⋅ Yalan Qin ⋅ Jifeng Guo ⋅ Yun Zhou ⋅ Faguo Wu ⋅ Hongwei Zheng ⋅ Wenjun Wu ⋅ Zhaoxin Fan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 365
Captain Safari: A World Engine with Pose-Aligned 3D Memory
Yu-Cheng Chou ⋅ Xingrui Wang ⋅ Yitong Li ⋅ Jiahao Wang ⋅ Hanting Liu ⋅ Cihang Xie ⋅ Alan L. Yuille ⋅ Junfei Xiao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 366
Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction
Jiaxin Huang ⋅ Yuanbo Yang ⋅ Bangbang Yang ⋅ Lin Ma ⋅ Yuewen Ma ⋅ Yiyi Liao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 367
PerpetualWonder: Long-horizon Action-conditioned 4D Scene Generation
Jiahao Zhan ⋅ Zizhang Li ⋅ Hong-Xing Yu ⋅ Jiajun Wu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 368
CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation
Kaiyi Huang ⋅ Yukun Huang ⋅ Yu Li ⋅ Jianhong Bai ⋅ Xintao Wang ⋅ Zinan Lin ⋅ Xuefei Ning ⋅ Jiwen Yu ⋅ Yu Wang ⋅ Xihui Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 369
DreamStereo: Towards Real-Time Stereo Inpainting for HD Videos
Huang Yuan ⋅ Sijie Zhao ⋅ Jing Cheng ⋅ Hao Xu ⋅ Shaohui Jiao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 370
SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation
Vaibhav Agrawal ⋅ Rishubh Parihar ⋅ Pradhaan S Bhat ⋅ Ravi Kiran Sarvadevabhatla ⋅ R. Venkatesh Babu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 371
RecEdit-Drive: 3D Reconstruction-Guided Spatiotemporal Video Editing for Autonomous Driving Scenes
Yipeng Wu ⋅ Xin WANG ⋅ Chenghan Yang ⋅ Chong Wang ⋅ Dongdong Wu ⋅ Wanchao Su ⋅ Hengshuang Zhao ⋅ Wei Feng ⋅ Kairui Yang ⋅ Di Lin
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 372
RAYNOVA: Scale-Temporal Autoregressive World Modeling in Ray Space
Yichen Xie ⋅ Chensheng Peng ⋅ Mazen Abdelfattah ⋅ Yihan Hu ⋅ Jiezhi Yang ⋅ Eric Higgins ⋅ Ryan Brigden ⋅ Masayoshi Tomizuka ⋅ Wei Zhan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 373
RigMo: Unifying Rig and Motion Learning for Generative Animation
Hao Zhang ⋅ Jiahao Luo ⋅ Bohui Wan ⋅ Yizhou Zhao ⋅ Zongrui Li ⋅ Michael Vasilkovsky ⋅ Chaoyang Wang ⋅ Jian Wang ⋅ Narendra Ahuja ⋅ Bing Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 374
LaVR: Scene Latent Conditioned Generative Video Trajectory Re-Rendering using Large 4D Reconstruction Models
Mingyang Xie ⋅ Numair Khan ⋅ Tianfu Wang ⋅ Naina Dhingra ⋅ Seonghyeon Nam ⋅ Haitao Yang ⋅ Zhuo Hui ⋅ Christopher Metzler ⋅ Andrea Vedaldi ⋅ Hamed Pirsiavash ⋅ Lei Luo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 375
WHU-MARS: A Multispectral Aerial-Ground Benchmark Towards Any-Scenario Person Re-Identification
Yuxuan Zhao ⋅ Zhongao Zhou ⋅ Bin Yang ⋅ He Li ⋅ Jian Liang ⋅ Jun Chen ⋅ Bo Du ⋅ Mang Ye
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 376
Detect Anything via Next Point Prediction
Qing Jiang ⋅ Junan Huo ⋅ Xingyu Chen ⋅ Yuda Xiong ⋅ Zhaoyang Zeng ⋅ Yihao Chen ⋅ Tianhe Ren ⋅ Junzhi Yu ⋅ Lei Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 377
Text-guided Feature Disentanglement for Cross-modal Gait Recognition
Zhiyang Lu ⋅ Ming Cheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 378
Distribution-Aligned Multimodal Fusion for Robust Object Detection
XIAOHUI HAO ⋅ Yanglin Pu ⋅ Yongjun Wang ⋅ Rui She
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 379
PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection
Zhengjian Kang ⋅ Jun Zhuang ⋅ Kangtong Mo ⋅ Qi Chen ⋅ Rui Liu ⋅ Ye Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 380
Portable Active Learning for Object Detection
Rashi Sharma ⋅ Justin Timothy C. Bersamin ⋅ Karthikk Subramanian
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 381
Efficiency Follows Global-Local Decoupling
Zhenyu Yang ⋅ Gensheng Pei ⋅ Tao Chen ⋅ Yichao Zhou ⋅ Tianfei Zhou ⋅ Yazhou Yao ⋅ Fumin Shen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 382
VRCLIP: Multimodal Canonical Correlation Alignment for CLIP-Driven Vision-Radio Person Re-Identification
Rui Zhang ⋅ Yaqi Wang ⋅ Yadong Li ⋅ Ruixu Geng ⋅ Jianyang Wang ⋅ Qijun Ying ⋅ Dongheng Zhang ⋅ Yang Hu ⋅ Yan Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 383
EReCu: Pseudo-label Evolution Fusion and Refinement with Multi-Cue Learning for Unsupervised Camouflage Detection
Jiang Shuo ⋅ Gaojia Zhang ⋅ Min Tan ⋅ Yufei Yin ⋅ Gang Pan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 384
Expert-Teacher-Student Collaborative Learning for Domain Adaptive Object Detection
Yiming Cui ⋅ Liang Li ⋅ Haibing Yin ⋅ Yuhan Gao ⋅ Xichun Sheng ⋅ Chenggang Yan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 385
CI-VID: A Coherent Interleaved Text-Video Dataset
Yiming Ju ⋅ Jijin Hu ⋅ Zhengxiong Luo ⋅ Haoge Deng ⋅ Hanyu Zhao ⋅ Li Du ⋅ Wenbo Xiao ⋅ Chengwei Wu ⋅ Donglin Hao ⋅ Xinlong Wang ⋅ Tengfei Pan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 386
Generalizable Video Quality Assessment via Weak-to-Strong Learning
Linhan Cao ⋅ Wei Sun ⋅ Xiangyang Zhu ⋅ Kaiwei Zhang ⋅ Jun Jia ⋅ Yicong Peng ⋅ Dandan Zhu ⋅ Guangtao Zhai ⋅ Xiongkuo Min
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 387
EgoSound: Benchmarking Sound Understanding in Egocentric Videos
Bingwen Zhu ⋅ Yuqian Fu ⋅ Qiaole Dong ⋅ Guolei Sun ⋅ Tianwen Qian ⋅ Yuzheng Wu ⋅ Danda Paudel ⋅ Yanwei Fu ⋅ Xiangyang Xue
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 388
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
Woongyeong Yeo ⋅ Kangsan Kim ⋅ Jaehong Yoon ⋅ Sung Ju
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 389
GIFT: Global Irreplaceability Frame Targeting for Efficient Video Understanding
Ma Junpeng ⋅ Sashuai Zhou ⋅ Guanghao Li ⋅ Xin Gao ⋅ Yue Cao ⋅ Hengyu Zeng ⋅ Yuxiang Yan ⋅ Zhibin Wang ⋅ Jun Song ⋅ Bo Zheng ⋅ Shanghang Zhang ⋅ Jian Pu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 390
Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning
Xuchen Li ⋅ Xuzhao Li ⋅ Shiyu Hu ⋅ Kaiqi Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 391
Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
Shoubin Yu ⋅ Lei Shu ⋅ Antoine Yang ⋅ Yao Fu ⋅ Srinivas Sunkara ⋅ Maria Wang ⋅ Jindong Chen ⋅ Mohit Bansal ⋅ Boqing Gong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 392
Compositional Transformation Reasoning for Composed Video Retrieval
Sihong Huang ⋅ Jiaxin Wu ⋅ Dongmei Jiang ⋅ Yi Cai ⋅ Yaowei Wang ⋅ Xiaoyong Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 393
UniVBench: Towards Unified Evaluation for Video Foundation Models
Jianhui Wei ⋅ Xiaotian Zhang ⋅ Yichen Li ⋅ Yuan Wang ⋅ Yan Zhang ⋅ Ziyi Chen ⋅ Zhihang Tang ⋅ Wei Xu ⋅ Zuozhu Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 394
NAMI: Efficient Image Generation via Bridged Progressive Rectified Flow Transformers
Yuhang Ma ⋅ Bo Cheng ⋅ Shanyuan Liu ⋅ Hongyi Zhou ⋅ Liebucha Wu ⋅ Dawei Leng ⋅ Yuhui Yin
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 395
InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting
Hong Duc Vu ⋅ Kien Nguyen ⋅ Trong-Tung Nguyen ⋅ Ngan Nguyen ⋅ Phong Nguyen ⋅ Khoi Nguyen ⋅ Cuong Pham ⋅ Anh Tran
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 396
TimeRipples: Accelerating vDiTs by Understanding the Spatio-Temporal Correlations in Latent Space
Wenxuan Miao ⋅ Yulin Sun ⋅ Aiyue Chen ⋅ Jing Lin ⋅ Yiwu Yao ⋅ Yiming Gan ⋅ Jieru Zhao ⋅ Jingwen Leng ⋅ Minyi Guo ⋅ Yu Feng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 397
ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers
Mengling Xu ⋅ Sisi You ⋅ Li Yaning ⋅ Bing-Kun Bao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 398
MeanFlow Transformers with Representation Autoencoders
Zheyuan Hu ⋅ Chieh-Hsin Lai ⋅ Ge Wu ⋅ Yuki Mitsufuji ⋅ Stefano Ermon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 399
DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression
Junqi Shi ⋅ Ming Lu ⋅ Xingchen Li ⋅ Anle Ke ⋅ Ruiqi Zhang ⋅ Zhan Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 400
FARMER: Flow AutoRegressive Transformer over Pixels
GuangTing Zheng ⋅ Qinyu Zhao ⋅ Tao Yang ⋅ Fei Xiao ⋅ Zhijie Lin ⋅ Jie Wu ⋅ Jiajun Deng ⋅ Yanyong Zhang ⋅ Rui Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 401
Probabilistic Precipitation Nowcasting with Rectified Flow Transformers
Johannes Schusterbauer ⋅ Jannik Wiese ⋅ Nick Stracke ⋅ Timy Phan ⋅ Björn Ommer
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 402
FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
Yilei Jiang ⋅ Zhen Wang ⋅ Yanghao Wang ⋅ Jun Yu ⋅ Yueting Zhuang ⋅ Jun Xiao ⋅ Long Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 403
High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning
Dailan He ⋅ Xiahong Wang ⋅ Shulun Wang ⋅ Hao Shao ⋅ Bingqi Ma ⋅ Guanglu Song ⋅ Yu Liu ⋅ Hongsheng Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 404
3D-Object Perception Transformer (3PT)
Agastya Kalra ⋅ Tim Salzmann ⋅ Guy Stoppi ⋅ Dmitrii Marin ⋅ Rishav Agarwal ⋅ Vage Taamazyan ⋅ Martin Bokeloh ⋅ Stefan Hinterstoisser ⋅ Anton Boykov ⋅ Alberto Dall'Olio ⋅ Pravin Dangol ⋅ Kartik Venkataraman ⋅ Huaijin Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 405
SemLT3D: Semantic-Guided Expert Distillation for Camera-only Long-Tailed 3D Object Detection
Hao Vo ⋅ Khoa Vo ⋅ Tran Phan Phan ⋅ Ngo Xuan Cuong ⋅ Gianfranco Doretto ⋅ Hien Nguyen ⋅ Anh Nguyen ⋅ Ngan Le
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 406
Spe-BEVHead: Rethinking the Detection Head Design for Bird’s-Eye-View Object Detection
Junshu Zhang ⋅ Sicheng Zhao ⋅ Xin Zhao ⋅ Fan Yang ⋅ Ruike Chen ⋅ Jungong Han ⋅ Guiguang Ding
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 407
Unsupervised Multi-agent and Single-agent Perception from Cooperative Views
Haochen Yang ⋅ Baolu Li ⋅ Lei Li ⋅ Delin Ren ⋅ Jiacheng Guo ⋅ Minghai Qin ⋅ Tianyun Zhang ⋅ Hongkai Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 408
Zoo3D: Zero-Shot 3D Object Detection at Scene Level
Andrey Lemeshko ⋅ Bulat Gabdullin ⋅ Nikita Drozdov ⋅ Anton Konushin ⋅ Danila Rukhovich ⋅ Maksim Kolodiazhnyi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 409
Beyond Appearance: Camouflaged Object Detection via Geometric Structure
Jinyu Han ⋅ Changguang Wu ⋅ Fuming Sun ⋅ Jinhui Tang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 410
SABER: Spatially Consistent 3D Universal Adversarial Objects for BEV Detectors
Aixuan Li ⋅ Mochu Xiang ⋅ Bosen Hou ⋅ Zhexiong Wan ⋅ Jing Zhang ⋅ Yuchao Dai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 411
AceTone: Bridging Words and Colors for Conditional Image Grading
Tianren Ma ⋅ Mingxiang Liao ⋅ Xijin Zhang ⋅ Qixiang Ye
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 412
Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions
Xiaoxiao Sun ⋅ Mingyang Li ⋅ Kun Yuan ⋅ Min Woo ⋅ Mark Endo ⋅ Shengguang Wu ⋅ Changlin Li ⋅ Yuhui Zhang ⋅ Zeyu Wang ⋅ Serena Yeung
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 413
Pixels Don't Lie (But Your Detector Might): Bootstrapping MLLM-as-a-Judge for Trustworthy Deepfake Detection and Reasoning Supervision
Kartik Kuckreja ⋅ Parul Gupta ⋅ Muhammad Haris Khan ⋅ Abhinav Dhall
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 414
UI-Lens: Assessing General MLLMs’ Potential to Automate UI Display Quality Assurance
Wei Xiang ⋅ Yexinrui WU ⋅ Xinli Chen ⋅ Xinran Li ⋅ Shi Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 415
Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement
Junrong Guo ⋅ Shancheng Fang ⋅ Yadong Qu ⋅ Hongtao Xie
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 416
Is your VLM Sky-Ready? A Comprehensive Spatial Intelligence Benchmark for UAV Navigation
Lingfeng Zhang ⋅ Yuchen Zhang ⋅ Hongsheng Li ⋅ Haoxiang Fu ⋅ Yingbo Tang ⋅ Hangjun Ye ⋅ Long Chen ⋅ Xiaojun Liang ⋅ Xiaoshuai Hao ⋅ Wenbo Ding
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 417
Linking Perception, Confidence and Accuracy in MLLMs
Yuetian Du ⋅ Yucheng Wang ⋅ Rongyu Zhang ⋅ Zhijie Xu ⋅ BOYU YANG ⋅ Ming Kong ⋅ Jie Liu ⋅ Qiang Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 418
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
Zheda Mai ⋅ Arpita Chowdhury ⋅ Zihe Wang ⋅ Sooyoung Jeon ⋅ Lemeng Wang ⋅ Jiacheng Hou ⋅ Jihyung Kil ⋅ Wei-Lun Chao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 419
Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs
Xuanpu Zhao ⋅ Zhentao Tan ⋅ Dianmo Sheng ⋅ Tianxiang Chen ⋅ Yao Liu ⋅ Yue Wu ⋅ Tao Gong ⋅ Qi Chu ⋅ Nenghai Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 420
From Pixel to Precision: Enhancing Handwritten Mathematical Expression Recognition with Image-Level Reward
Ze Liu ⋅ Kai Zhang ⋅ Xianquan Wang ⋅ Shuochen Liu ⋅ Jiaxian Yan ⋅ Yupeng Han ⋅ Qi Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 421
Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty
ManGyu Kong ⋅ Jaewon Lee ⋅ Seongwon Lee ⋅ Euntai Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 422
Revisiting Pose Sensitivity in Splat-based Computed Tomography under Sparse-view Reconstruction
Kiseok Choi ⋅ Hyeongjun Cho ⋅ Inchul Kim ⋅ Min H. Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 423
Seele: A Unified Acceleration Framework for Real-Time Gaussian Splatting on Mobile Devices
He Zhu ⋅ Xiaotong Huang ⋅ Zihan Liu ⋅ Weikai Lin ⋅ Xiaohong Liu ⋅ Zhezhi He ⋅ Jingwen Leng ⋅ Minyi Guo ⋅ Yu Feng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 424
GHPT: Real-Time Relightable Gaussian Splatting using Hybrid Path Tracing
Jinyang Bo ⋅ Fan Dou ⋅ Wenrui Quan ⋅ Shangxun Liu ⋅ Yang Xu ⋅ Yuhe Zhang ⋅ Kang Li ⋅ GuoHua Geng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 425
PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes
Derui Shan ⋅ Qian Qiao ⋅ Hao Lu ⋅ Tao Du ⋅ Peng Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 426
EcoSplat: Efficiency-controllable Feed-forward 3D Gaussian Splatting from Multi-view Images
Minh-Quan Viet Bui ⋅ Jongmin Park ⋅ Juan Luis Gonzalez Bello ⋅ Jaeho Moon ⋅ Jihyong Oh ⋅ Munchurl Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 427
SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering
Jiahao Niu ⋅ Rongjia Zheng ⋅ Wenju Xu ⋅ Wei-Shi Zheng ⋅ Qing Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 428
GIFSplat: Generative Prior-Guided Iterative Feed-Forward 3D Gaussian Splatting from Sparse Views
Tianyu Chen ⋅ Wei Xiang ⋅ Kang Han ⋅ Yu Lu ⋅ Di Wu ⋅ Gaowen Liu ⋅ Ramana Kompella
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 429
3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction
Takeshi Noda ⋅ Yu-Shen Liu ⋅ Zhizhong Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 430
FilterGS: Traversal-Free Parallel Filtering and Adaptive Shrinking for Large-Scale LoD 3D Gaussian Splatting
Yixian Wang ⋅ HaoLin Yu ⋅ Jiadong Tang ⋅ Yu Gao ⋅ Xihan Wang ⋅ Yufeng Yue ⋅ Yi Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 431
TWINGS: Thin Plate Splines Warp-aligned Initialization for Sparse-View Gaussian Splatting
Hyeseong Kim ⋅ Geonhui Son ⋅ Deukhee Lee ⋅ Dosik Hwang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 432
VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM
Anh Thuan Tran ⋅ Jana Kosecka
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 433
SpeeDe3DGS: Speedy Deformable 3D Gaussian Splatting with Temporal Pruning and Motion Grouping
Allen Tu ⋅ Haiyang Ying ⋅ Alex Hanson ⋅ Yonghan Lee ⋅ Tom Goldstein ⋅ Matthias Zwicker
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 434
FastGS: Training 3D Gaussian Splatting in 100 Seconds
Shiwei Ren ⋅ Tianci Wen ⋅ Yongchun Fang ⋅ Biao Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 435
BrepGaussian: CAD reconstruction from Multi-View Images with Gaussian Splatting
Jiaxing Yu ⋅ Dongyang Ren ⋅ Hangyu Xu ⋅ Zhouyuxiao Yang ⋅ Yuanqi Li ⋅ Jie Guo ⋅ Zhengkang Zhou ⋅ Yanwen Guo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 436
ODGS-SLAM: Omnidirectional Gaussian Splatting SLAM
Stefan Spiss ⋅ Joey Hieronimy ⋅ Marcel Ritter ⋅ Matthias Harders
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 437
BA-GS: Bayesian Adaptive Gaussian Splatting for SFM-Free 3D Reconstruction
Zhongjie Ma ⋅ Di Lin ⋅ Xin WANG ⋅ Haotian Dong ⋅ Chong Wang ⋅ Dongdong Wu ⋅ Changqing Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 438
FSFSplatter: Geometrically Accurate Reconstruction with Free Sparse-view Images within 2 minutes
Yibin Zhao ⋅ Yihan Pan ⋅ Jun Nan ⋅ Liwei Chen ⋅ Jianjun YI
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 439
ViRC: Enhancing Visual Interleaved Mathematical CoT with Reason Chunking
Lihong Wang ⋅ Liangqi Li ⋅ Weiwei Feng ⋅ Jiamin Wu ⋅ Changtao Miao ⋅ Tieru Wu ⋅ Rui Ma ⋅ Bo Zhang ⋅ Zhe Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 440
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought
Yiyang Zhou ⋅ Haoqin Tu ⋅ Zijun Wang ⋅ Zeyu Wang ⋅ Niklas Muennighoff ⋅ Fan Nie ⋅ Chaorui Deng ⋅ Shen Yan ⋅ Haoqi Fan ⋅ Yejin Choi ⋅ James Zou ⋅ Cihang Xie ⋅ Huaxiu Yao ⋅ Qinghao Ye
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 441
PixDLM: A Dual-Path Multimodal Language Model for UAV Reasoning Segmentation
shuyan ke ⋅ Yifan Mei ⋅ Changli Wu ⋅ yonghan zheng ⋅ Jiayi Ji ⋅ Liujuan Cao ⋅ Rongrong Ji
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 442
Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection
Chuang Peng ⋅ Renshuai Tao ⋅ Zhongwei Ren ⋅ Xianglong Liu ⋅ Yunchao Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 443
VCU-Bridge: Hierarchical Visual Connotation Understanding via Semantic Bridging
Ming Zhong ⋅ Yuanlei Wang ⋅ Liuzhou Zhang ⋅ Ruichuan An ⋅ Ray Zhang ⋅ Hao Liang ⋅ Ming Lu ⋅ Ying Shen ⋅ Wentao Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 444
Learning to See through Illumination Extremes with Event Streaming in Multimodal Large Language Models
Baoheng Zhang ⋅ Jiahui Liu ⋅ Zhao Gui ⋅ Zhang Weizhou ⋅ YIXUAN MA ⋅ Jun Jiang ⋅ Yingxian Chen ⋅ Wilton W.T Fok ⋅ Xiaojuan Qi ⋅ Hayden Kwok-Hay So
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 445
VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation
Walid Bousselham ⋅ Hilde Kuehne ⋅ Cordelia Schmid
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 446
Cut to the Chase: Training-free Multimodal Summarization via Chain-of-Events
Xiaoxing You ⋅ Qiang Huang ⋅ Lingyu Li ⋅ Xiaojun Chang ⋅ Jun Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 447
UVU: Improving Multimodal Understanding via Vision-Language Unified Autoregressive Paradigm
Zhehan Kan ⋅ Xinghua Jiang ⋅ Yanlin Liu ⋅ Xiaochen Yang ⋅ ZHIXIANG WEI ⋅ Shifeng Liu ⋅ Yubo Zhu ⋅ Qingmin Liao ⋅ Wenming Yang ⋅ Xin Li ⋅ Yinsong Liu ⋅ Deqiang Jiang ⋅ Xing Sun
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 448
PointThinker: Point-Incentivized Parallel Thinking for Multimodal Large Language Model
Zhengdong Hu ⋅ Chao Wang ⋅ Fengyun Rao ⋅ Jing LYU ⋅ Hehe Fan ⋅ Yi Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 449
OctoMed: Data Recipes for State-of-the-Art Multimodal Medical Reasoning
Timothy Ossowski ⋅ Sheng Zhang ⋅ Qianchu Liu ⋅ Guanghui Qin ⋅ Reuben Tan ⋅ Tristan Naumann ⋅ Junjie Hu ⋅ Hoifung Poon
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 450
HoneyBee: Data Recipes for Vision-Language Reasoners
Hritik Bansal ⋅ Devendra Singh Sachan ⋅ Kai-Wei Chang ⋅ Aditya Grover ⋅ Gargi Ghosh ⋅ Wen-tau Yih ⋅ Ramakanth Pasunuru
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 451
VisPlay: Self-Evolving Vision-Language Models
Yicheng He ⋅ Chengsong Huang ⋅ Zongxia Li ⋅ Jiaxin Huang ⋅ Yonghui Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 452
Chart-FR1: Visual Focus-Driven Fine-Grained Reasoning on Dense Charts
Hongkun Pan ⋅ Yuwei Wu ⋅ Wanyi Hong ⋅ ShengHui Hu ⋅ Qitong Yan ⋅ Yi Yang ⋅ Rufei Han ⋅ Changju Zhou ⋅ Minfeng Zhu ⋅ Dongming Han ⋅ Wei Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 453
Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation
Ziyu Guo ⋅ Ray Zhang ⋅ Hongyu Li ⋅ Manyuan Zhang ⋅ Xinyan Chen ⋅ Sifan Wang ⋅ Yan Feng ⋅ Peng Pei ⋅ Pheng-Ann Heng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 454
ApET: Approximation-Error Guided Token Compression for Efficient VLMs
Qiankun Ma ⋅ Ziyao Zhang ⋅ Haofei Wang ⋅ Zhen Song ⋅ Jie Chen ⋅ Hairong Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 455
Granulon: Awakening Pixel-Level Visual Encoders with Adaptive Multi-Granularity Semantics for MLLM
Junyuan Mao ⋅ Qiankun Li ⋅ Linghao Meng ⋅ Zhicheng He ⋅ Xinliang Zhou ⋅ Kun Wang ⋅ Yang Liu ⋅ Yueming Jin
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 456
Vision Transformers Need More Than Registers
Cheng Shi ⋅ Yizhou Yu ⋅ Sibei Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 457
Head-wise Adaptive Rotary Positional Encoding for Fine-Grained Image Generation
Li jiaye ⋅ Baoyou Chen ⋅ Hui Li ⋅ Zilong Dong ⋅ Jingdong Wang ⋅ Siyu Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 458
PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion
Jaehyun Choi ⋅ Jiwan Hur ⋅ Gyojin Han ⋅ Jaemyung Yu ⋅ Junmo Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 459
AdaSVD: Singular Value Decomposition with Adaptive Mechanisms for Large Multimodal Models
Zhiteng Li ⋅ Mingyuan Xia ⋅ Jingyuan Zhang ⋅ Zheng Hui ⋅ Haotong Qin ⋅ Linghe Kong ⋅ Yulun Zhang ⋅ Xiaokang Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 460
ReFTA: Breaking the Weight Reconstruction Bottleneck in Tensorized Parameter-Efficient Fine-Tuning
Jingjing Zheng ⋅ Anda Tang ⋅ Qiangqiang Mao ⋅ Zhouchen Lin ⋅ Yankai Cao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 461
HTTM: Head-wise Temporal Token Merging for Faster VGGT
Weitian Wang ⋅ Lukas Meiner ⋅ Rai Shubham ⋅ Cecilia De La Parra ⋅ Akash Kumar
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 462
Reparameterized Tensor Ring Functional Decomposition for Multi-Dimensional Data Recovery
Yangyang Xu ⋅ Junbo Ke ⋅ You-Wei Wen ⋅ Chao Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 463
Self-Attention Driven Tensor Representation for High-Order Data Recovery
Zhi-Wei SHI ⋅ Yu-Bang Zheng ⋅ Heng-Chao Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 464
PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching
Hanqiao Ye ⋅ Yuzhou Liu ⋅ Yangdong Liu ⋅ Shuhan Shen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 465
MOGeo: Beyond One-to-One Cross-View Object Geo-localization
Bo Lv ⋅ Qingwang Zhang ⋅ Le Wu ⋅ Yuanyuan Li ⋅ YINGYING ZHU
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 466
Homaloidal parametrization for detecting critical two-view configurations
Rakshith Madhavan ⋅ Matteo Forlivesi ⋅ Marina Bertolini ⋅ Cristina Turrini ⋅ Federica Arrigoni ⋅ Luca Magri
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 467
AsymLoc: Towards Asymmetric Feature Matching for Efficient Visual Localization
Mohammad Omama ⋅ Gabriele Berton ⋅ Eric Foxlin ⋅ Yelin Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 468
MMLandmarks: a Cross-View Instance-Level Benchmark for Geo-Spatial Understanding
Oskar Kristoffersen ⋅ Alba Reinders Sánchez ⋅ Morten Hannemose ⋅ Anders Bjorholm Dahl ⋅ Dim P. Papadopoulos
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 469
Asking like Socrates: Socrates helps VLMs understand remote sensing images
Run Shao ⋅ Ziyu Li ⋅ Zhaoyang Zhang ⋅ Linrui Xu ⋅ Xinran He ⋅ Hongyuan Yuan ⋅ Bolei He ⋅ Yongxing Dai ⋅ Yiming Yan ⋅ Yijun Chen ⋅ Wang Guo ⋅ Haifeng Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 470
GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training
Tong Wei ⋅ Yijun Yang ⋅ Changhao Zhang ⋅ Junliang Xing ⋅ Yuanchun Shi ⋅ Zongqing Lu ⋅ Deheng Ye
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 471
Let VLMs Grade Their Own Thoughts: A Self-Quantification Approach to Reasoning-Aware Reward Modeling
Xing Xi ⋅ Yu Qiu ⋅ Ronghua Luo ⋅ Peixian Chen ⋅ peilin tong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 472
SciEducator: Scientific Video Understanding and Educating via Deming-Cycle Multi-Agent System
Zhiyu Xu ⋅ Weilong Yan ⋅ YUFEI SHI ⋅ Xin Meng ⋅ Tao He ⋅ Huiping Zhuang ⋅ Ming Li ⋅ Hehe Fan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 473
SenseSearch: Empowering Vision-Language Models with High-Resolution Agentic Search-Reasoning via Reinforcement Learning
Yong Xien Chng ⋅ Tao Hu ⋅ Wenwen Tong ⋅ Xueheng Li ⋅ Jiandong Chen ⋅ Haojia Yu ⋅ Jiefan Lu ⋅ Hewei Guo ⋅ Hanming Deng ⋅ Chengjun Xie ⋅ Gao Huang ⋅ Lewei Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 474
Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs
Meng Lu ⋅ Ran Xu ⋅ Yi Fang ⋅ Wenxuan Zhang ⋅ Yue Yu ⋅ Gaurav Srivastava ⋅ Yuchen Zhuang ⋅ Mohamed Elhoseiny ⋅ Charles Fleming ⋅ Carl Yang ⋅ Zhengzhong Tu ⋅ Yang Xie ⋅ Guanghua Xiao ⋅ Di Jin ⋅ Wenqi Shi ⋅ Xuan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 475
VideoSSR: Video Self-Supervised Reinforcement Learning
Zefeng He ⋅ Xiaoye Qu ⋅ Yafu Li ⋅ Siyuan Huang ⋅ Daizong Liu ⋅ Yu Cheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 476
Neurodynamics-Driven Coupled Neural P Systems for Multi-Focus Image Fusion
Bo Li ⋅ Yunkuo Lei ⋅ Tingting Bao ⋅ Hang Yan ⋅ Yaxian Wang ⋅ Weiping Fu ⋅ Lingling Zhang ⋅ Jun Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 477
MagicFuse: Single Image Fusion for Visual and Semantic Reinforcement
HAO ZHANG ⋅ Yanping Zha ⋅ Zizhuo Li ⋅ Meiqi Gong ⋅ Jiayi Ma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 478
Bridging Pixels and Words: Mask-Aware Local Semantic Fusion for Multimodal Media Verification
Zizhao Chen ⋅ Ping Wei ⋅ Ziyang Ren ⋅ Huan Li ⋅ Xiangru Yin
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 479
Human-Centric Multi-Exposure Fusion: Benchmark and Bi-level Cognition Distillation Framework
Jingjie Shang ⋅ Tengyu Ma ⋅ Heng Zhang ⋅ Jinyuan Liu ⋅ Risheng Liu ⋅ Yuan Wang ⋅ Xiaochen Bo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 480
ConceptPose: Training-Free Zero-Shot Object Pose Estimation using Concept Vectors
Liming Kuang ⋅ Yordanka Velikova ⋅ Mahdi Saleh ⋅ Jan-Nico Zaech ⋅ Danda Paudel ⋅ Benjamin Busam
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 481
A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps
Xuanlong Yu ⋅ Youyang Sha ⋅ Longfei Liu ⋅ Xi Shen ⋅ Di Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 482
NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering
Loick Chambon ⋅ Paul Couairon ⋅ Éloi Zablocki ⋅ Alexandre Boulch ⋅ Nicolas THOME ⋅ Matthieu Cord
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 483
Universal-to-Specific: Dynamic Knowledge-Guided Multiple Instance Learning for Few-Shot Whole Slide Image Classification
Junjian Li ⋅ Hulin Kuang ⋅ Jin Liu ⋅ Hailin Yue ⋅ Mengshen He ⋅ Jianxin Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 484
SOTA: Self-adaptive Optimal Transport for Zero-Shot Classification with Multiple Foundation Models
Zhanxuan Hu ⋅ Qiyu Xu ⋅ Yu Duan ⋅ Yonghang Tai ⋅ Huafeng Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 485
Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation
Yara Bahram ⋅ Mélodie Desbos ⋅ Mohammadhadi Shateri ⋅ Eric Granger
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 486
Streamlined Knowledge Distillation
Hyeon-Jin Jung ⋅ Han-Jin Lee ⋅ Seok-Hwan Choi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 487
Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation
Chonghua Lv ⋅ Dong Zhao ⋅ Shuang Wang ⋅ Dou Quan ⋅ Ning Huyan ⋅ Nicu Sebe ⋅ Zhun Zhong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 488
IMS3: Breaking Distributional Aggregation in Diffusion-Based Dataset Distillation
Chenru Wang ⋅ Yunyi Chen ⋅ Zijun Yang ⋅ Joey Tianyi Zhou ⋅ Chi Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 489
Continuous Exposure-Time Modeling for Realistic Atmospheric Turbulence Synthesis
junwei zeng ⋅ Dong Liang ⋅ Shengjun Huang ⋅ Kun Zhan ⋅ Songcan Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 490
240FPS Stereo Vision from Monocular Mixed Spikes
Yeliduosi Xiaokaiti ⋅ Yakun Chang ⋅ Yang Bai ⋅ Zhaojun Huang ⋅ Peiqi Duan ⋅ Boxin Shi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 491
D^2-FOSA: Dual-Diffusion Guided EEG-to-Image Reconstruction with Frequency-Oriented Semantic Alignment
Yu Chenglong ⋅ Shuai Shen ⋅ Xiangsheng Li ⋅ Yang Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 492
Self-Diffusion Driven Blind Imaging
Yanlong Yang ⋅ Guanxiong Luo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 493
Differentiable Stroke Planning with Dual Parameterization for Efficient and High-Fidelity Painting Creation
Jinfan Liu ⋅ Wuze Zhang ⋅ Zhangli Hu ⋅ Zhehan Zhao ⋅ Ye Chen ⋅ Bingbing Ni
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 494
Solvability of the Viewing Graph Under the Affine Camera Model
Gabriele Pedroni ⋅ Rakshith Madhavan ⋅ Federica Arrigoni
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 495
DiffBMP: Differentiable Rendering with Bitmap Primitives
Seongmin Hong ⋅ Junghun James Kim ⋅ Daehyeop Kim ⋅ Insoo Chung ⋅ Se Young Chun
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 496
Splat-Based Metal Artifact Reduction in Cone-Beam CT via Compact Attenuation Modeling
Kiseok Choi ⋅ Jaemin Cho ⋅ Inchul Kim ⋅ Min H. Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 497
Lumosaic: Hyperspectral Video via Active Illumination and Coded-Exposure Pixels
Dhruv Verma ⋅ Andrew Qiu ⋅ Roberto Rangel ⋅ Ayandev Barman ⋅ Hao Yang ⋅ Chenjia Hu ⋅ Fengqi Zhang ⋅ Roman Genov ⋅ David B. Lindell ⋅ Kiriakos N. Kutulakos ⋅ Alex Mariakakis
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 498
Towards Universal Computational Aberration Correction in Photographic Cameras: A Comprehensive Benchmark Analysis
Xiaolong Qian ⋅ Qi Jiang ⋅ Yao Gao ⋅ Lei Sun ⋅ Zhonghua Yi ⋅ Kailun Yang ⋅ Luc Van Gool ⋅ Kaiwei Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 499
Multi-View Hierarchical Alignment Learning for Spatial Transcriptomics
Zhengzhong Zhu ⋅ Liangjin Liu ⋅ Pei Zhou ⋅ Shiquan min ⋅ Jiangping Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 500
FEAST: Fully Connected Expressive Attention for Spatial Transcriptomics
Taejin Jeong ⋅ Joohyeok Kim ⋅ Jinyeong Kim ⋅ Chanyoung Kim ⋅ Seong Jae Hwang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 501
TRIDENT: A Trimodal Cascade Generative Framework for Drug and RNA-Conditioned Cellular Morphology Synthesis
Rui Peng ⋅ Ziru Liu ⋅ Lingyuan Ye ⋅ Yuxing Lu ⋅ Boxin Shi ⋅ Jinzhuo Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 502
OrienPose: Orientation-Guided Novel View Synthesis for Single-Image Unseen Object Pose Estimation
Yating Liu ⋅ Zhaoshuai Qi ⋅ Yang Zou ⋅ Yongnan Yang ⋅ Shizhou Zhang ⋅ Yanning Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 503
Illustrator’s Depth: Monocular Layer Index Prediction for Image Decomposition
Nissim Maruani ⋅ Peiying Zhang ⋅ Siddhartha Chaudhuri ⋅ Matthew Fisher ⋅ Nanxuan Zhao ⋅ Vladimir G. Kim ⋅ Pierre Alliez ⋅ Mathieu Desbrun ⋅ Wang Yifan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 504
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Xin Lin ⋅ Meixi Song ⋅ Dizhe Zhang ⋅ Wenxuan Lu ⋅ Haodong Li ⋅ Bo Du ⋅ Ming-Hsuan Yang ⋅ Truong Nguyen ⋅ Lu Qi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 505
Seeing Depth Through Frequency and Motion: A Progressive Training Paradigm for Monocular Depth Estimation
Ke Li ⋅ Bolin Song ⋅ Hongbo Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 506
GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation
Xujing Tao ⋅ Chuxin Wang ⋅ Yubo Ai ⋅ Zhixin Cheng ⋅ Zhuoyuan Li ⋅ Liangsheng Liu ⋅ Yujia Chen ⋅ Xinjun Li ⋅ Qiao Li ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 507
B^3-Seg: Camera-Free, Training-Free 3DGS Segmentation via Analytic EIG and Beta-Bernoulli Bayesian Updates
Hiromichi Kamata ⋅ Samuel Arthur Munro ⋅ Fuminori Homma
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 508
PE3R: Perception-Efficient 3D Reconstruction
Jie Hu ⋅ Shizun Wang ⋅ Xinchao Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 509
GS-ASM: 2DGS-Supervised Active Stereo Matching
Zhengling Wu ⋅ Rongfeng Lu ⋅ Quan Chen ⋅ Longjian Zeng ⋅ Ming Lu ⋅ Yaoqi Sun ⋅ Yahong Chen ⋅ Baofeng Ji ⋅ Chenggang Yan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 510
Real2Sim2Real: RetinalDepth-64K for Depth Estimation in Posterior Segment Ophthalmic Surgery
Bingwen Dong ⋅ Gan Liu ⋅ Xiaoxi Lu ⋅ Guangcheng Chen ⋅ Jialu ZHANG ⋅ Yan Hu ⋅ Xiaoqing Zhang ⋅ Jiang Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 511
Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation
Xinhao Cai ⋅ Gensheng Pei ⋅ Zeren Sun ⋅ Yazhou Yao ⋅ Fumin Shen ⋅ Wenguan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 512
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields
Hao Yu ⋅ Haotong Lin ⋅ Jiawei Wang ⋅ Jiaxin Li ⋅ Yida Wang ⋅ Xueyang Zhang ⋅ Yue Wang ⋅ Xiaowei Zhou ⋅ Ruizhen Hu ⋅ Sida Peng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 513
AirSim360: A Panoramic Simulation Platform within Drone View
Xian Ge ⋅ Yuling Pan ⋅ Yuhang Zhang ⋅ Xiang Li ⋅ Weijun Zhang ⋅ Dizhe Zhang ⋅ Zhaoliang Wan ⋅ Xin Lin ⋅ Xiangkai Zhang ⋅ Juntao Liang ⋅ Xiangtai Li ⋅ jerett Jiang ⋅ Bo Du ⋅ Ming-Hsuan Yang ⋅ Lu Qi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 514
Radar-Guided Polynomial Fitting for Metric Depth Estimation
Patrick Rim ⋅ Hyoungseob Park ⋅ Vadim Ezhov ⋅ Jeffrey Moon ⋅ Alex Wong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 515
UniDAC: Universal Metric Depth Estimation for Any Camera
Girish Chandar ⋅ Yuliang Guo ⋅ Liu Ren ⋅ Xiaoming Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 516
SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation
Yi Zhu ⋅ Hao Xiong ⋅ Lin Xiao ⋅ Ranfeng Shi ⋅ Qinying Gu ⋅ Leilei Gu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 517
I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners
Lu Ling ⋅ Yunhao Ge ⋅ Yichen Sheng ⋅ Aniket Bera
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 518
REVIVE 3D: Refinement via Encoded Voluminous Inflated prior for Volume Enhancement
Hankyeol Lee ⋅ WOOYEOL BAEK ⋅ Seongdo Kim ⋅ Jongyoo Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 519
Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training
Hexiao Lu ⋅ Xiaokun Sun ⋅ Zeyu Cai ⋅ Hao Guo ⋅ Ying Tai ⋅ Jian Yang ⋅ Zhenyu Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 520
EI-Part: Explode for Completion and Implode for Refinement
wanhu sun ⋅ Zhongjin Luo ⋅ Heliang Zheng ⋅ Jiahao Chang ⋅ Chongjie Ye ⋅ Huiang He ⋅ Shengchu Zhao ⋅ Rongfei Jia ⋅ Xiaoguang Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 521
MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing
Xiaokun Sun ⋅ Zeyu Cai ⋅ Hao Tang ⋅ Ying Tai ⋅ Jian Yang ⋅ Zhenyu Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 522
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration
Mengyu Yang ⋅ Yanming Yang ⋅ Chenyi Xu ⋅ Chenxi Song ⋅ Yufan Zuo ⋅ Tong Zhao ⋅ Ruibo Li ⋅ Chi Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 523
ViLearn: Accelerating Training Convergence of Image-to-3D Generation via Visibility Learning
Rui Chen ⋅ Jianfeng Zhang ⋅ Jing Lin ⋅ Xuanyu Yi ⋅ Yixun Liang ⋅ Guan Luo ⋅ Xiu Li ⋅ Zeming Li ⋅ Ping Tan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 524
FlashMesh: Faster and Better Autoregressive Mesh Synthesis via Structured Speculation
Tingrui Shen ⋅ Yiheng Zhang ⋅ Chen Tang ⋅ Chuan Ping ⋅ Zixing Zhao ⋅ Le Wan ⋅ Yuwang Wang ⋅ Ronggang Wang ⋅ Shengfeng He
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 525
X-Part: High Fidelity And Structure Coherent Shape Decomposition And Completion
XINHAO YAN ⋅ Jiachen Xu ⋅ Yang Li ⋅ Changfeng Ma ⋅ Yunhan Yang ⋅ Chunshi Wang ⋅ Zibo Zhao ⋅ Zeqiang Lai ⋅ Yunfei Zhao ⋅ Zhuo Chen ⋅ Chunchao Guo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 526
Realiz3D: 3D Generation Made Photorealistic via Domain-Aware Learning
Ido Sobol ⋅ Kihyuk Sohn ⋅ Yoav Blum ⋅ Egor Zakharov ⋅ Max Bluvstein ⋅ Andrea Vedaldi ⋅ Or Litany
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 527
TopoMesh: High-Fidelity Mesh Autoencoding via Topological Unification
Guan Luo ⋅ Xiu Li ⋅ Rui Chen ⋅ Xuanyu Yi ⋅ Jing Lin ⋅ Chia-Hao Chen ⋅ Jiahang Liu ⋅ Song-Hai Zhang ⋅ Jianfeng Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 528
Nestwork: Conditional 3D Furnished House Layout Generation through Latent Heterogeneous Graph Diffusion
Shuhan Miao ⋅ Biru Cao ⋅ Junling Zhuang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 529
TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond
Yifei Zeng ⋅ Yajie Bao ⋅ Jiachen Qian ⋅ Shuang Wu ⋅ Youtian Lin ⋅ Hao Zhu ⋅ Buyu Li ⋅ Feihu Zhang ⋅ Xun Cao ⋅ Yao Yao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 530
Beyond Geometry: Artistic Disparity Synthesis for Immersive 2D-to-3D
Ping Chen ⋅ Zezhou Chen ⋅ Xingpeng Zhang ⋅ Yanlin Qian ⋅ Huan Hu ⋅ Xiang Liu ⋅ Zipeng Wang ⋅ Xin Wang ⋅ Zhaoxiang Liu ⋅ Kai Wang ⋅ Shiguo Lian
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 531
WorldGen: From Text to Traversable and Interactive 3D Worlds
Dilin Wang ⋅ Hyunyoung Jung ⋅ Tom Monnier ⋅ Kihyuk Sohn ⋅ Chuhang Zou ⋅ Xiaoyu Xiang ⋅ Yu-Ying Yeh ⋅ Di Liu ⋅ Zixuan Huang ⋅ Thu Nguyen-Phuoc ⋅ Yuchen Fan ⋅ Sergiu Oprea ⋅ Ziyan Wang ⋅ Roman Shapovalov ⋅ Nikolaos Sarafianos ⋅ Thibault Groueix ⋅ Antoine Toisoul ⋅ Prithviraj Dhar ⋅ Xiao Chu ⋅ Minghao Chen ⋅ Geon Yeong Park ⋅ Rakesh Ranjan ⋅ Andrea Vedaldi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 532
ExMesh: EXplicit Mesh Reconstruction with Topology Adaptation
Chuanjin Fan ⋅ Lifan Wu ⋅ Wenjie Chang ⋅ Hanzhi Chang ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 533
SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model
Yukai Shi ⋅ Weiyu Li ⋅ Zihao Wang ⋅ Hongyang Li ⋅ Xingyu Chen ⋅ Ping Tan ⋅ Lei Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 534
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
Mohd Yawar Nihal Siddiqui ⋅ Duncan Frost ⋅ Samir Aroudj ⋅ Armen Avetisyan ⋅ Henry Howard-Jenkins ⋅ Daniel DeTone ⋅ Pierre Moulon ⋅ Qirui Wu ⋅ Zhengqin Li ⋅ Julian Straub ⋅ Richard Newcombe ⋅ Jakob Engel
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 535
SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation
Phuc Pham ⋅ Uy Dieu Tran ⋅ Binh-Son Hua ⋅ Phong Nguyen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 536
3DrawAgent: Teaching LLM to Draw in 3D with Early Contrastive Experience
Hongcan Xiao ⋅ Xinyue Xiao ⋅ Yilin Wang ⋅ Yue Zhang ⋅ Yonggang Qi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 537
Sculpt4D: Generating 4D Shapes via Sparse-Attention Diffusion Transformers
Minghao Yin ⋅ Wenbo Hu ⋅ Jiale Xu ⋅ Ying Shan ⋅ Kai Han
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 538
HiFi-BRep: High-Fidelity Latent Representation for Robust B-Rep Generation
Junhao Hou ⋅ Chenqi Luo ⋅ PuFan Wang ⋅ Jiaying Lu ⋅ Yusheng Liu ⋅ Feiwei Qin ⋅ Meie Fang ⋅ Kun Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 539
PhysGen: Physically Grounded 3D Shape Generation for Industrial Design
Yingxuan You ⋅ Chen Zhao ⋅ Hantao Zhang ⋅ Ming Xu ⋅ Pascal Fua
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 540
Perceptual 3D Simulation With Physical World Modeling
Wanhee Lee ⋅ Klemen Kotar ⋅ Rahul Venkatesh ⋅ Jared Watrous ⋅ Daniel L.K. Yamins
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 541
EchoFoley: Event-Centric Hierarchical Control for Video Grounded Creative Sound Generation
Bingxuan Li ⋅ Yiming Cui ⋅ Yicheng He ⋅ Yiwei Wang ⋅ Shu Zhang ⋅ Longyin Wen ⋅ Yulei Niu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 542
Active Intelligence in Video Avatars via Closed-loop World Modeling
Xuanhua He ⋅ Tianyu Yang ⋅ Ke Cao ⋅ Rui-Qi Wu ⋅ Cheng Meng ⋅ Yong Zhang ⋅ Zhuoliang Kang ⋅ Xiaoming Wei ⋅ Qifeng Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 543
Enhancing Spatial Understanding in Image Generation via Reward Modeling
Zhenyu Tang ⋅ Chaoran Feng ⋅ Yufan Deng ⋅ Jie Wu ⋅ Xiaojie Li ⋅ Rui Wang ⋅ Yunpeng Chen ⋅ Daquan Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 544
Seeing What Matters: Visual Preference Policy Optimization for Visual Generation
Ziqi Ni ⋅ Yuanzhi Liang ⋅ Rui Li ⋅ Yi Zhou ⋅ Haibin Huang ⋅ Chi Zhang ⋅ Xuelong Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 545
TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts
Yu Xu ⋅ Hongbin Yan ⋅ Juan Cao ⋅ YIJI CHENG ⋅ Tiankai Hang ⋅ Runze He ⋅ Zijin Yin ⋅ Shiyi Zhang ⋅ Yuxin Zhang ⋅ Jintao Li ⋅ Chunyu Wang ⋅ qinglin lu ⋅ Tong-yee Lee ⋅ Fan Tang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 546
Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization
Liao Shen ⋅ Wentao Jiang ⋅ Yiran Zhu ⋅ Jiahe Li ⋅ Tiezheng Ge ⋅ Zhiguo Cao ⋅ Bo Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 547
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization
yunlong lin ⋅ Linqing Wang ⋅ Kunjie Lin ⋅ Zixu Lin ⋅ Kaixiong Gong ⋅ Wenbo Li ⋅ Bin Lin ⋅ Zhenxi Li ⋅ Shiyi Zhang ⋅ Yuyang Peng ⋅ Wenxun Dai ⋅ Xinghao Ding ⋅ Chunyu Wang ⋅ qinglin lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 548
Learning Latent Proxies for Controllable Single-Image Relighting
Haoze Zheng ⋅ Zihao Wang ⋅ Xianfeng Wu ⋅ Yajing Bai ⋅ Yexin Liu ⋅ Yun LI ⋅ Xiaogang Xu ⋅ Harry Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 549
MoVie: Broaden Your Views with Human Motion for Action Detection
Di Yang ⋅ Mahmoud Ali ⋅ Xuanlong Yu ⋅ Xi Shen ⋅ Quan Kong ⋅ Gianpiero Francesca ⋅ Francois Bremond
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 550
MooCap: A Multi-View Benchmark for Cow-Object-Human Interaction and Behavior Dynamics
Ian Noronha ⋅ Heather Neave ⋅ Upinder Kaur
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 551
LAOF: Robust Latent Action Learning with Optical Flow Constraints
Xizhou Bu ⋅ Jiexi Lyu ⋅ Fulei Sun ⋅ Ruichen Yang ⋅ Zhiqiang Ma ⋅ Wei Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 552
DarkAct: A RGB-Thermal Dataset and Fusion Framework for Multimodal Low-Light Action Recognition
Yuanjun Tan ⋅ Aoran Xiao ⋅ Liqian Deng ⋅ Zhigang Tu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 553
Random Wins All: Rethinking Grouping Strategies for Vision Tokens
Qihang Fan ⋅ Yuang Ai ⋅ Huaibo Huang ⋅ Ran He
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 554
Steering Where to Diffuse: Generative Modeling of Phenotypic Response Simulation with Steered Diffusion Bridge
Rongchao Zhang ⋅ Chengxin Li ⋅ Yiwei Lou ⋅ Yuling Shi ⋅ Hanpin Wang ⋅ Yu Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 555
Deep Feature Deformation Weights
Richard Liu ⋅ Itai Lang ⋅ Rana Hanocka
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 556
Resolving Endpoint Underfitting in Diffusion Bridges via Noise Alignment
Yurong Gao ⋅ Zicheng Zhang ⋅ Congying Han ⋅ Tiande Guo ⋅ Xinmin QIu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 557
RNN as Linear Transformer: A Closer Investigation into Representational Potentials of Visual Mamba Models
Timing Yang ⋅ Feng Wang ⋅ Guoyizhe Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 558
Coupling Liquid Time‑Constant Encoders with Modern Hopfield Memory
Bishal Ranjan Swain ⋅ Kyung Joo Cheoi ⋅ Jaepil Ko
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 559
Stronger Normalization-Free Transformers
Mingzhi Chen ⋅ Taiming Lu ⋅ Jiachen Zhu ⋅ Mingjie Sun ⋅ Zhuang Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 560
HCL-FF: Hierarchical and Contrastive Learning for Forward-Forward Algorithm
Jie-En Yao ⋅ Hong-En Chen ⋅ C.-C. Jay Kuo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 561
Can You Learn to See Without Images? Procedural Warm-Up for Vision Transformers
Zachary Shinnick ⋅ Liangze Jiang ⋅ Hemanth Saratchandran ⋅ Damien Teney ⋅ Anton van den Hengel
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 562
Convolutional Neural Networks Driven by Content Similarity
Ligeng Zou ⋅ Guihu Zhao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 563
MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration
Runxun Zhang ⋅ Yizhou Liu ⋅ Dongrui Li ⋅ Bo XU ⋅ Jingwei Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 564
HATS: Hardness-Aware Trajectory Synthesis for GUI Agents
Rui Shao ⋅ RUIZE GAO ⋅ Bin Xie ⋅ Yixing Li ⋅ Kaiwen Zhou ⋅ Shuai Wang ⋅ Weili Guan ⋅ Gongwei Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 565
MVP: Multiple View Prediction Improves GUI Grounding
Yunzhu Zhang ⋅ Zeyu Pan ⋅ Zhengwen Zeng ⋅ Shuheng Shen ⋅ Changhua Meng ⋅ Linchao Zhu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 566
Towards GUI Agents: Vision-Language Diffusion Models for GUI Grounding
Shrinidhi Kumbhar ⋅ Haofu Liao ⋅ srikar appalaraju ⋅ Kunwar Yashraj Singh
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 567
ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence On Mobile Devices
Dezhi Kong ⋅ Zhengzhao Feng ⋅ Qiliang Liang ⋅ Hao Wang ⋅ haofei Sun ⋅ Changpeng Yang ⋅ Yang Li ⋅ Peng Zhou ⋅ Shuai Nie ⋅ Hongzhen Wang ⋅ Linfeng Zhou ⋅ Hao Jia ⋅ Jiaming Xu ⋅ Runyu Shi ⋅ Ying Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 568
OS-Oracle: A Comprehensive Framework for Cross-Platform GUI Critic Models
Zhenyu Wu ⋅ JingJing Xie ⋅ Zehao Li ⋅ Bowen Yang ⋅ Qiushi Sun ⋅ Zhaoyang Liu ⋅ Zhoumianze Liu ⋅ Yu Qiao ⋅ Xiangyu Yue ⋅ Zun Wang ⋅ Zichen Ding
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 569
Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation
Zehao Deng ⋅ Tianjie Ju ⋅ Zheng Wu ⋅ Zhuosheng Zhang ⋅ Gongshen Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 570
See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles
Zongru Wu ⋅ Rui Mao ⋅ Zhiyuan Tian ⋅ Pengzhou Cheng ⋅ Tianjie Ju ⋅ Zheng Wu ⋅ Lingzhong Dong ⋅ Haiyue Sheng ⋅ Zhuosheng Zhang ⋅ Gongshen Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 571
Beyond Weak Supervision: MLLMs-Guided Graded Knowledge Distillation for Unsupervised Camouflaged Object Detection
Huafeng Chen ⋅ Chenguang Zhu ⋅ Yueming Lyu ⋅ Caifeng Shan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 572
Detecting Unknown Objects via Energy-based Separation for Open World Object Detection
JunWoo Heo ⋅ Keonhee Park ⋅ Gyeong-Moon Park
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 573
Beyond Prompt Degradation: Prototype-guided Dual-pool Prompting for Incremental Object Detection
Yaoteng Zhang ⋅ Qing Zhou ⋅ Junyu Gao ⋅ Qi Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 574
SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation
Naomi Kombol ⋅ Ivan Martinović ⋅ Siniša Šegvić ⋅ Giorgos Tolias
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 575
TTL: Test-time Textual Learning for OOD Detection with Pretrained Vision-Language Models
Jinlun Ye ⋅ Jiang Liao ⋅ Runhe Lai ⋅ Xinhua Lu ⋅ Jiaxin Zhuang ⋅ Zhiyong Gan ⋅ Ruixuan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 576
Parameterized Prompt for Incremental Object Detection
Zijia An ⋅ boyu diao ⋅ RuiQi Liu ⋅ Libo Huang ⋅ Chuanguang Yang ⋅ Fei Wang ⋅ Zhulin An ⋅ Yongjun Xu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 577
SRA-Det: Learning Omni-Grained Open-Vocabulary Detection Beyond Category Names
Li Yang ⋅ Boyu Cai ⋅ Wei Liu ⋅ Yan Wang ⋅ Chunfeng Yuan ⋅ Bing Li ⋅ Weiming Hu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 578
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?
Tilemachos Aravanis ⋅ Vladan Stojnić ⋅ Vasileios Psomas ⋅ Nikos Komodakis ⋅ Giorgos Tolias
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 579
PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation
Jianjian Yin ⋅ Tao Chen ⋅ Yi Chen ⋅ Gensheng Pei ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Fumin Shen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 580
Partial Weakly-Supervised Oriented Object Detection
Mingxin Liu ⋅ Peiyuan Zhang ⋅ Yuan Liu ⋅ Wei Zhang ⋅ Yue Zhou ⋅ Ning Liao ⋅ Ziyang Gong ⋅ Junwei Luo ⋅ Zhirui Wang ⋅ Yi Yu ⋅ Xue Yang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 581
Seeing Both Sides: Towards Bidirectional Semantic Alignment for Open-Vocabulary Camouflaged Object Segmentation
Guohui Zhang ⋅ Fuming Sun ⋅ Yu Zhao ⋅ Yuqiu Kong ⋅ Jing Sun ⋅ Ganggang Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 582
Towards Robust Multi-Modal Semantic Segmentation with Teacher-Student Framework and Hybrid Prototype Distillation
jiaqi tan ⋅ Xu Zheng ⋅ Yang Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 583
REL-SF4PASS: Panoramic Semantic Segmentation with REL Depth Representation and Spherical Fusion
Xuewei Li ⋅ Xinghan Bao ⋅ Zhimin Chen ⋅ Xi Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 584
Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation
ByeongCheol Lee ⋅ Hyun Seok Seong ⋅ Sangeek Hyun ⋅ Gilhan Park ⋅ WonJun Moon ⋅ Jae-Pil Heo
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 585
From Softmax to Dirichlet: Evidential Learning for Semi-supervised Semantic Segmentation
Huayu Mai ⋅ Rui Sun ⋅ Yujia Chen ⋅ Wangkai Li ⋅ Bingzhou Wang ⋅ Aibing Li ⋅ Zhangyu He ⋅ Yuan Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 586
Particulate: Feed-Forward 3D Object Articulation
Ruining Li ⋅ YUXIN YAO ⋅ Chuanxia Zheng ⋅ Christian Rupprecht ⋅ Joan Lasenby ⋅ Shangzhe Wu ⋅ Andrea Vedaldi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 587
HOPS: Hierarchical Open-vocabulary Part Segmentation with Attention-Aware Filtering and Affinity-Guided Enhancement
Xinlong Li ⋅ Di Lin ⋅ Shaoyiyi Gao ⋅ Yaxuan Liu ⋅ Jixian He ⋅ Jiaxin Li ⋅ Ruonan Liu ⋅ Qing Guo ⋅ Kairui Yang ⋅ Wei Feng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 588
Shape-of-You: Fused Gromov-Wasserstein Optimal Transport for Semantic Correspondence in-the-Wild
Jiin Im ⋅ Sisung Liu ⋅ Je Hyeong Hong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 589
MEMO: Human-like Crisp Edge Detection Using Masked Edge Prediction
Jiaxin Cheng ⋅ Yue Wu ⋅ Yicong Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 590
MUFASA: A Multi-Layer Framework for Slot Attention
Sebastian Bock ⋅ Leonie Schüßler ⋅ Krishnakant Singh ⋅ Simone Schaub-Meyer ⋅ Stefan Roth
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 591
ChangeBridge: Spatiotemporal Image Generation with Multimodal Controls for Remote Sensing
Zhenghui Zhao ⋅ Chen Wu ⋅ Xiangyong Cao ⋅ Di Wang ⋅ Hongruixuan Chen ⋅ Datao Tang ⋅ Liangpei Zhang ⋅ Zhuo Zheng
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 592
MOMO: Mars Orbital MOdel Foundation Model for Mars Orbital Applications
Mirali Purohit ⋅ Bimal Gajera ⋅ Irish Mehta ⋅ Bhanu Tokas ⋅ Jacob Adler ⋅ Steven Lu ⋅ Scott Dickenshied ⋅ Serina Diniega ⋅ Brian Bue ⋅ Umaa Rebbapragada ⋅ Hannah Kerner
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 593
Seeing Through the Noise: Improving Infrared Small Target Detection and Segmentation from Noise Suppression Perspective
Maoxun Yuan ⋅ Duanni Meng ⋅ Ziteng Xi ⋅ Tianyi Zhao ⋅ Shiji Zhao ⋅ Yimian Dai ⋅ Xingxing Wei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 594
GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization
Zixuan Song ⋅ Jing Zhang ⋅ Di Wang ⋅ Zidie Zhou ⋅ Wenbin Liu ⋅ Haonan Guo ⋅ En Wang ⋅ Bo Du
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 595
GeoSANE: Learning Geospatial Representations from Models, Not Data
Joëlle Hanna ⋅ Damian Falk ⋅ Stella X. Yu ⋅ Damian Borth
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 596
Brewing Stronger Features: Dual-Teacher Distillation for Multispectral Earth Observation
Filip Wolf ⋅ Blaz Rolih ⋅ Luka Cehovin Zajc
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 597
Spectral Super-Resolution via Adversarial Unfolding and Data-Driven Spectrum Regularization: From Multispectral Satellite Data to NASA Hyperspectral Image
Si-Sheng Yang ⋅ Chia-Hsiang Lin
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 598
RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation
Nicolas Houdré ⋅ Diego Marcos ⋅ Hugo Riffaud de Turckheim ⋅ Dino Ienco ⋅ Laurent Wendling ⋅ Camille Kurtz ⋅ Sylvain Lobry
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 599
ORSATR-X: A Foundation Model based on Differential-and-Excitation Networks for Optical Remote Sensing Object Recognition
Canyu Mo ⋅ Yongxiang Liu ⋅ Jiehua Zhang ⋅ Zilong Yu ⋅ Zhen Liu ⋅ Tianpeng Liu ⋅ Li Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 600
SEBA: Sample-Efficient Black-Box Attacks on Visual Reinforcement Learning
Tairan HUANG ⋅ Yulin Jin ⋅ Junxu Liu ⋅ Qingqing Ye ⋅ Haibo Hu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 601
IAG: Input-aware Backdoor Attack on VLM-based Visual Grounding
Junxian Li ⋅ Beining Xu ⋅ Simin Chen ⋅ Jiatong LI ⋅ Jingdi Lei ⋅ Haodong Zhao ⋅ Di Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 602
DASH: A Meta-Attack Framework for Synthesizing Effective and Stealthy Adversarial Examples
Abdullah Al Nomaan Nafi ⋅ Habibur Rahaman ⋅ Zafaryab Haider ⋅ Tanzim Mahfuz ⋅ Fnu Suya ⋅ Swarup Bhunia ⋅ Prabuddha Chakraborty
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 603
AdapAction: Adaptive Target Action Backdoor Attack against GUI Agents
Baicheng Chen ⋅ Mingda Zhang ⋅ Min Zhang ⋅ Haizhou Li ⋅ Baoyuan Wu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 604
Phantom: Physical Object Interactions as Dynamic Triggers for NMS-Exploited Backdoors
Tianlin Huo ⋅ Dongchuan Ran ⋅ Ranjie Duan ⋅ Yao Zhu ⋅ Peilun Du ⋅ ningbo yao ⋅ Huanqian Yan ⋅ Xu Han ⋅ Qiang Yun ⋅ Yuzheng Tan ⋅ Yang Bao ⋅ Yuan He
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 605
Verifying Neural Network Robustness with Dual Perturbations
Hai Duong ⋅ Son Vu ⋅ Thanh Le ⋅ ThanhVu Nguyen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 606
Defending Unauthorized Model Merging via Dual-Stage Weight Protection
Wei-Jia Chen ⋅ Min-Yan Tsai ⋅ Cheng-Yi Lee ⋅ Chia-Mu Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 607
AntiStyler: Defending Object Detection Models Against Adversarial Patch Attacks Using Style Removal
Idan Yankelev ⋅ Edita Grolman ⋅ Yarin Yerushalmi Levi ⋅ Amit Giloni ⋅ Omer Hofman ⋅ Toshiya Shimizu ⋅ Yuval Elovici ⋅ Asaf Shabtai
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 608
On the Role of Temporal Granularity in the Robustness of Spiking Neural Networks
Mengting Xu ⋅ Shi Gu ⋅ Peng Lin ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 609
Boosting Vision-Language-Action Finetuning with Feasible Action Neighborhood Prior
Haochen Niu ⋅ Kanyu Zhang ⋅ Shuyu Yin ⋅ Qinghai Guo ⋅ Peilin Liu ⋅ Fei Wen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 610
Exploring Conditions for Diffusion Models in Robotic Control
Heeseong Shin ⋅ Byeongho Heo ⋅ Dongyoon Han ⋅ Seungryong Kim ⋅ Taekyung Kim
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 611
A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
Tommie Kerssies ⋅ Gabriele Berton ⋅ Ju He ⋅ Qihang Yu ⋅ Wufei Ma ⋅ Daan de Geus ⋅ Gijs Dubbelman ⋅ Liang-Chieh Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 612
Efficient Hybrid SE(3)-Equivariant Visuomotor Flow Policy via Spherical Harmonics for Robot Manipulation
Qinglun Zhang ⋅ Shen Cheng ⋅ Tian Dan ⋅ Haoqiang Fan ⋅ Guanghui Liu ⋅ Shuaicheng Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 613
TSTM: Temporal Segmentation for Task-relevant Mask in Visual Reinforcement Learning Generalization
Weicheng Du ⋅ Wenjia Meng ⋅ Zhengzhe Zhang ⋅ Yilong Yin ⋅ Xiankai Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 614
Scaling Spatial and Temporal Context for Robotic Imitation Learning Policies With Scene Graphs
Jianing Qian ⋅ Qinhe Peng ⋅ Emmanuel Panov ⋅ Leonor Fermoselle ⋅ Dinesh Jayaraman ⋅ Bernadette Bucher ⋅ Tarik Kelestemur
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 615
AdaDexTrack: Dynamic Modulation for Adaptive and Generalizable Dexterous Manipulation Tracking
Jianibieke Adalibieke ⋅ Qianwei Han ⋅ Xueyi Liu ⋅ Yuzhe Qin ⋅ Li Yi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 616
GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
Enda Xiang ⋅ Haoxiang Ma ⋅ Xinzhu Ma ⋅ Zicheng Liu ⋅ Di Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 617
MoEActok: A MoE-based Action Tokenizer for Vision-Language-Action Models
Chunpu Xu ⋅ Zhixuan Liang ⋅ Tianshuo Yang ⋅ Chi-Min Chan ⋅ Yang Xiao ⋅ Jessie Wang ⋅ Xiaokang Yang ⋅ Yao Mu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 618
A Cross-view Fusion Framework for Robust 6-DoF Grasp Pose Estimation
Kangjian Zhu ⋅ Haobo Jiang ⋅ Jianjun Qian ⋅ Jin Xie
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 619
SAVA-X: Ego-to-Exo Imitation Error Detection via Scene-Adaptive View Alignment and Bidirectional Cross View Fusion
Xiang Li ⋅ Heqian Qiu ⋅ Lanxiao Wang ⋅ Benliu Qiu ⋅ Fanman Meng ⋅ Linfeng Xu ⋅ Hongliang Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 620
PromptDepth: Efficient and Promptable Geometric 3D Vision Model for Embodied Intelligence
Xianyun Wang ⋅ Jiaxu Miao ⋅ Tian Xu ⋅ Siyuan Wang ⋅ Yuehao Li ⋅ Haoyang Hu ⋅ Jun Xiao ⋅ Yonghong Tian ⋅ Jun Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 621
Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3-D Constrained Terrains
Qingwei Ben ⋅ Botian Xu ⋅ Kailin Li ⋅ Feiyu Jia ⋅ Wentao Zhang ⋅ Jingping Wang ⋅ Jingbo Wang ⋅ Dahua Lin ⋅ Jiangmiao Pang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 622
PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation
Yuanzhe Liu ⋅ Jingyuan Zhu ⋅ Yuchen Mo ⋅ Gen Li ⋅ Xu Cao ⋅ Jin Jin ⋅ Yifan Shen ⋅ Zhengyuan Li ⋅ Tianjiao Yu ⋅ Wenzhen Yuan ⋅ Fangqiang Ding ⋅ Ismini Lourentzou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 623
IGen: Scalable Data Generation for Robot Learning from Open-World Images
Chenghao Gu ⋅ Haolan Kang ⋅ Junchao Lin ⋅ Jinghe Wang ⋅ Duo Wu ⋅ Shuzhao Xie ⋅ Fanding Huang ⋅ Junchen Ge ⋅ Ziyang Gong ⋅ Letian Li ⋅ Hongying Zheng ⋅ Changwei Lv ⋅ Zhi Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 624
Hypergraph-State Collaborative Reasoning for Multi-Object Tracking
Zikai Song ⋅ Junqing Yu ⋅ Yi-Ping Phoebe Chen ⋅ Wei Yang ⋅ Xinchao Wang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 625
TGTrack: Temporal Generative Learning for Unified Single Object Tracking
Wanting Geng ⋅ Xin Chen ⋅ Chuanyu Sun ⋅ Jie Zhao ⋅ Ben Kang ⋅ Dong Wang ⋅ Huchuan Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 626
GeoMotion: Rethinking Motion Segmentation via Latent 4D Geometry
Xiankang He ⋅ Peile Lin ⋅ Ying Cui ⋅ Dongyan Guo ⋅ Chunhua Shen ⋅ Xiaoqin Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 627
Generalizable Structure-Aware Keypoint Correspondence for Category-Unified 3D Single Object Tracking
Jie Xiao ⋅ Yinchao Ma ⋅ Yuyang Tang ⋅ Dengqing Yang ⋅ Jianpeng Yang ⋅ Xu Zhou ⋅ Qiao Li ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 628
Generative Point Tracking and Forecasting
Xuanchen Lu ⋅ Ang Cao ⋅ Chao Feng ⋅ Andrew Owens
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 629
RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation
Hao Li ⋅ Yuhao Wang ⋅ Wenning Hao ⋅ Pingping Zhang ⋅ Dong Wang ⋅ Huchuan Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 630
Dual-level Adaptation for Multi-Object Tracking: Building Test-Time Calibration from Experience and Intuition
Wen Guo ⋅ Pengfei Zhao ⋅ Zongmeng Wang ⋅ Yufan Hu ⋅ Junyu Gao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 631
GMT: Effective Global Framework for Multi-Target Multi-Camera Tracking
Yihao Zhen ⋅ Mingyue Xu ⋅ Qiang Wang ⋅ Baojie Fan ⋅ Jiahua Dong ⋅ Tinghui Zhao ⋅ Huijie Fan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 632
Bridging Brain and Semantics: A Hierarchical Framework for Semantically Enhanced fMRI-to-Video Reconstruction
Yujie Wei ⋅ Chenglong Ma ⋅ Jianxiong Gao ⋅ Chenhui Wang ⋅ Shiwei Zhang ⋅ Biao Gong ⋅ Shuai Tan ⋅ Hangjie Yuan ⋅ Hongming Shan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 633
GraPHFormer: A Multimodal Graph Persistent Homology Transformer for the Analysis of Neuroscience Morphologies
Uzair Shah ⋅ Marco Agus ⋅ Mahmoud Gamal ⋅ Mahmood Alzubaidi ⋅ Corrado Cali ⋅ PIERRE MAGISTRETTI ⋅ Abdesselam Bouzerdoum ⋅ Mowafa Househ
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 634
DARC: Dual Adjustment Reasoning with Counterfactuals for Trustworthy Chest X-ray Classification
Zhifang Liao ⋅ Junhao Li ⋅ HaoKang Ding ⋅ Yucheng Song
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 635
Every Error has Its Magnitude: Asymmetric Mistake Severity Training for Multiclass Multiple Instance Learning
Sungrae Hong ⋅ Jiwon Jeong ⋅ Jisu Shin ⋅ Donghee Han ⋅ Sol Lee ⋅ Kyungeun Kim ⋅ Mun Yong Yi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 636
Phrase-grounded APO for Improving Chest X-ray Report Generation
Raziuddin Mahmood ⋅ Tanveer Syeda-Mahmood
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 637
Focus-to-Perceive Representation Learning: A Cognition-Inspired Hierarchical Framework for Endoscopic Video Analysis
Yuan Zhang ⋅ Sihao Dou ⋅ Kai Hu ⋅ Shuhua Deng ⋅ Chunhong Cao ⋅ Fen Xiao ⋅ Xieping Gao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 638
OraPO: Oracle-educated Reinforcement Learning for Data-efficient and Factual Radiology Report Generation
Zhuoxiao Chen ⋅ Hongyang Yu ⋅ Ying Xu ⋅ Yadan Luo ⋅ Long Duong ⋅ Yuan-Fang Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 639
FluoCLIP: Stain-Aware Focus Quality Assessment in Fluorescence Microscopy
Hyejin Park ⋅ Jiwon Yoon ⋅ Sumin Park ⋅ Suree Kim ⋅ Sinae Jang ⋅ Eunsoo Lee ⋅ Dongmin Kang ⋅ Dongbo Min
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 640
CryoKRAQEN: Kernel-Regularized Annealing for Quantized Embedding Networks in Cryo-EM Heterogeneous Reconstruction
Wenyuan Gao ⋅ Yutan Wu ⋅ Xuming He
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 641
Building Robust Vision Encoders for Cross-Dataset Evaluation in Immunofluorescent Microscopy
Umar Marikkar ⋅ Syed Sameed Husain ⋅ Muhammad Awais ⋅ Sara Atito
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 642
H2-Surv: Hierarchical Hyperbolic Multimodal Representation Learning for Survival Prediction
Jiaqi Yang ⋅ Wenting Chen ⋅ Xiangjian He ⋅ Yuanbai Li ⋅ Sen Yang ⋅ Linlin Shen ⋅ Xiaohan Xing
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 643
Dual-Level Hypergraph Generation for Addressing Feature Scarcity in Whole-Slide Image Classification
Shuilian Yao ⋅ Qi Jia ⋅ Qi Jia ⋅ Pengshuo Zhang ⋅ Lili Sun ⋅ Weimin Wang ⋅ Yanmei Zhu ⋅ Bo Zhang ⋅ Xin Fan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 644
Temporal Inversion for Learning Interval Change in Chest X-Rays
Hanbin Ko ⋅ Kyungmin Jeon ⋅ Doowoong Choi ⋅ Chang Min Park
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 645
JUMP-Hand: Learning Joint-wise Uncertainty to Gate Mixture of View Experts for Multi-View 3D Hand Reconstruction
Haohong Kuang ⋅ Yang Xiao ⋅ Changlong Jiang ⋅ Jinghong Zheng ⋅ Hang Xu ⋅ Ran Wang ⋅ Zhiguo Cao ⋅ Joey Tianyi Zhou
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 646
PAD-Hand: Physics-Aware Diffusion for Hand Motion Recovery
Elkhan Ismayilzada ⋅ Yufei Zhang ⋅ Zijun Cui
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 647
Anatomical Domain Shifts: Test-time Heterogeneous Adaptation for 3D Human Pose Prediction
Qiongjie Cui ⋅ Pan Zhou ⋅ Jingjing Chen ⋅ Na Zhao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 648
Unlocking Motion from Large Vision Models with a Semantic and Kinematic Duality for Gait Recognition
Zhanbo Huang ⋅ Dingqiang Ye ⋅ Xiaoming Liu ⋅ Yu Kong
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 649
Learning 3D Shape Fidelity Metric from Real-world Distortions
Xuelu Feng ⋅ Tianyu Luan ⋅ Zixin Zhu ⋅ Akshobhya Sharma ⋅ Phani Nuney ⋅ Junsong Yuan ⋅ Chunming Qiao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 650
BarbieGait: An Identity-Consistent Synthetic Human Dataset with Versatile Cloth-Changing for Gait Recognition
Qingyuan Cai ⋅ Saihui Hou ⋅ Xuecai Hu ⋅ Yongzhen Huang
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 651
FisherPoser: Human Motion Estimation from Sparse Observations with Hierarchical Region-Wise Fisher-Matrix Uncertainty Modeling
Songpengcheng Xia ⋅ Qingyu Zhang ⋅ Zhuo Su ⋅ Jiarui Yang ⋅ Zengyuan Lai ⋅ Qi Wu ⋅ Ling Pei
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 652
EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents
Wenjia Wang ⋅ Liang Pan ⋅ Huaijin Pi ⋅ Yuke Lou ⋅ Xuqian Ren ⋅ Yifan Wu ⋅ Zhouyingcheng Liao ⋅ Lei Yang ⋅ Rishabh Dabral ⋅ Christian Theobalt ⋅ Taku Komura
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 653
Ground Reaction Inertial Poser: Physics-based Human Motion Capture from Sparse IMUs and Insole Pressure Sensors
Ryosuke Hori ⋅ Jyun-Ting Song ⋅ Zhengyi Luo ⋅ Jinkun Cao ⋅ Soyong Shin ⋅ HIDEO SAITO ⋅ Kris Kitani
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 654
FUN REC: Reconstructing Functional 3D Scenes from Egocentric Interaction Videos
Alexandros Delitzas ⋅ Chenyangguang Zhang ⋅ Alexey Gavryushin ⋅ Tommaso Di Mario ⋅ Boyang Sun ⋅ Rishabh Dabral ⋅ Leonidas Guibas ⋅ Christian Theobalt ⋅ Marc Pollefeys ⋅ Francis Engelmann ⋅ Daniel Barath
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 655
VIMCAN: Visual-Inertial 3D Human Pose Estimation with Hybrid Mamba-Cross-Attention Network
Zepeng Yang ⋅ Junxuan Bai ⋅ Hao Li ⋅ Ju Dai ⋅ Junjun Pan ⋅ Yongfeng Yin ⋅ Bin Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 656
Bringing Your Portrait to 3D Presence
Jiawei Zhang ⋅ Lei Chu ⋅ Jiahao Li ⋅ Zhenyu Zang ⋅ Chong Li ⋅ Xiao Li ⋅ Xun Cao ⋅ Hao Zhu ⋅ Yan Lu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 657
FLOW: Feature-Level Optimal Warping for Generalized Remote Physiological Measurement
bo zhao ⋅ Junzhe Cao ⋅ Dan Guo ⋅ Dongmin Huang ⋅ Wenjin Wang ⋅ Tao Tan ⋅ Yue Sun ⋅ Zitong YU
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 658
One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control
Haoxiang Rao ⋅ Zhao Wang ⋅ Chenyang Si ⋅ Yan LYU ⋅ Yuanyi Duan ⋅ Fang Zhao ⋅ Caifeng Shan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 659
UniMMAD: Unified Multi-Modal and Multi-Class Anomaly Detection via MoE-Driven Feature Decompression
Yuan Zhao ⋅ Youwei Pang ⋅ Lihe Zhang ⋅ Hanqi Liu ⋅ Jiaming Zuo ⋅ Huchuan Lu ⋅ Xiaoqi Zhao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 660
BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection
Melissa Schween ⋅ Mathis Kruse ⋅ Bodo Rosenhahn
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 661
Multi-Prototype Compactness and Boundary-Aware Synthesis for Unsupervised Anomaly Detection
Liao Kailun ⋅ Jianfeng Yang ⋅ Tao Tao ⋅ Wenfei Wu ⋅ Jiaming Jiang ⋅ Jinsheng Xiao
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 662
PDD: Manifold-Prior Diverse Distillation for Medical Anomaly Detection
Xijun Lu ⋅ Hongying Liu ⋅ Fanhua Shang ⋅ Yanming hui ⋅ Liang Wan
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 663
Weakly Supervised Video Anomaly Detection with Anomaly-Connected Components and Intention Reasoning
Yu Wang ⋅ Hongli Liu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 664
SubspaceAD: Training-Free Few-Shot Anomaly Detection via Subspace Modeling
Camile Lendering ⋅ Erkut Akdag ⋅ Egor Bondarev
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 665
Learning Spatial-Temporal Consistency for 3D Semantic Scene Completion
Yujie Xue ⋅ Meng Wang ⋅ Ruihui Li ⋅ Fan Wu ⋅ Zhizhong Liu ⋅ Zhuo Tang ⋅ Kenli Li
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 666
Generalizing Visual Geometry Priors to Sparse Gaussian Occupancy Prediction
Changqing Zhou ⋅ Yueru Luo ⋅ Changhao Chen
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 667
Deformable Gaussian Occupancy: Decoupling Rigid and Nonrigid Motion with Factorized Distillation
Yang Gao ⋅ Wuyang Li ⋅ Po-Chien Luan ⋅ Alex Alahi
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 668
OccAny: Generalized Unconstrained Urban 3D Occupancy
Anh Quan Cao ⋅ Vu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 669
Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving
Xubo Zhu ⋅ Haoyang Zhang ⋅ Fei He ⋅ Rui Wu ⋅ Yanhu Shan ⋅ Wen Yang ⋅ Huai Yu
Poster
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A & F 670
ShelfOcc: Native 3D Supervision beyond LiDAR for Vision-Based Occupancy Estimation
Simon Boeder ⋅ Fabian Gigengack ⋅ Simon Roesler ⋅ Holger Caesar ⋅ Benjamin Risse
Poster Session
Sat Jun 06 03:45 PM -- 05:45 PM (PDT) @ ExHall A None
Poster Session 4 & Exhibit Hall w/ Coffee Break
Art Program
Sat Jun 06 04:00 PM -- 04:30 PM (PDT) @ ExHall F None
Art Gallery Tour with Curator and Artists
Luba Elliott
Reception
Sat Jun 06 06:00 PM -- 08:00 PM (PDT) @ Bluebird Ballroom & ExHall C None
Reception
Break
Sun Jun 07 06:30 AM -- 08:00 AM (PDT) @ ExHall C None
Breakfast
Registration
Sun Jun 07 06:30 AM -- 12:00 PM (PDT) @ Lobby A None
Registration / Badge Pickup
Oral
Sun Jun 07 08:00 AM -- 08:12 AM (PDT) @ Four Seasons Ballroom None
AToken: A Unified Tokenizer for Vision
Jiasen Lu ⋅ Liangchen Song ⋅ Mingze Xu ⋅ Byeongjoo Ahn ⋅ Yanjun Wang ⋅ Chen Chen ⋅ Afshin Dehghan ⋅ Yinfei Yang
Oral
Sun Jun 07 08:00 AM -- 08:12 AM (PDT) @ Bluebird Ballroom None
Evidential Neural Radiance Fields
Ruxiao Duan ⋅ Alex Wong
Oral
Sun Jun 07 08:00 AM -- 08:12 AM (PDT) @ Mile High Ballroom 3A - 4A None
BoostSLT: Boosting Sign Language Translation via a Plug-and-Play Diffusion-Based Semantic Enhancer
Changzhou Han ⋅ Wanlun Ma ⋅ XI TANG ⋅ Kun Hu ⋅ Sheng Wen ⋅ Yang Xiang
Oral
Sun Jun 07 08:00 AM -- 08:12 AM (PDT) @ Mile High Ballroom 1A - 2A None
AT-VLA: Adaptive Tactile Injection for Enhanced Feedback Reaction in Vision-Language-Action Models
Xiaoqi Li ⋅ Muhe Cai ⋅ Jiadong Xu ⋅ Juan Zhu ⋅ Hongwei Fan ⋅ Yan Shen ⋅ Guangrui Ren ⋅ Hao Dong
Oral Session
Sun Jun 07 08:00 AM -- 09:15 AM (PDT) @ Mile High Ballroom 1A - 2A None
Oral Session 5C: Geometry and Robotics
Oral Session
Sun Jun 07 08:00 AM -- 09:15 AM (PDT) @ Four Seasons Ballroom None
Oral Session 5B: Generalization and Adaptation
Oral Session
Sun Jun 07 08:00 AM -- 09:15 AM (PDT) @ Mile High Ballroom 3A - 4A None
Oral Session 5D: Human-Centric Modeling & Lighting
Oral Session
Sun Jun 07 08:00 AM -- 09:15 AM (PDT) @ Bluebird Ballroom None
Oral Session 5A: Dynamic Perception
Oral
Sun Jun 07 08:12 AM -- 08:25 AM (PDT) @ Mile High Ballroom 3A - 4A None
ImmerIris: A Large-Scale Dataset and Benchmark for Off-Axis and Unconstrained Iris Recognition in Immersive Applications
Yuxi Mi ⋅ Qiuyang Yuan ⋅ Zhizhou Zhong ⋅ Xuan Zhao ⋅ Jiaogen Zhou ⋅ Fubao Zhu ⋅ Jihong Guan ⋅ Shuigeng Zhou
Oral
Sun Jun 07 08:12 AM -- 08:25 AM (PDT) @ Bluebird Ballroom None
Global-Aware Edge Prioritization for Pose Graph Initialization
Tong Wei ⋅ Giorgos Tolias ⋅ Jiri Matas ⋅ Daniel Barath
Oral
Sun Jun 07 08:12 AM -- 08:25 AM (PDT) @ Mile High Ballroom 1A - 2A None
Learning Diffeomorphism for Medical Image Registration with Time-Embedded Architectures Using Semigroup Regularization
Mohammadjavad Matinkia ⋅ Nilanjan Ray
Oral
Sun Jun 07 08:12 AM -- 08:25 AM (PDT) @ Four Seasons Ballroom None
Confusion-Aware Spectral Regularizer for Long-Tailed Recognition
Ziquan Zhu ⋅ Gaojie Jin ⋅ Hanruo Zhu ⋅ Si-Yuan Lu ⋅ Yunxiao Zhang ⋅ ZEYU FU ⋅ Ronghui Mu ⋅ Guoqiang Zhang ⋅ Zhao Sun ⋅ Yuhang Xia ⋅ Jiaxing Shang ⋅ Xiang Li ⋅ Lu Liu ⋅ Tianjin Huang
Oral
Sun Jun 07 08:25 AM -- 08:37 AM (PDT) @ Four Seasons Ballroom None
Learning Latent Concepts for Detecting Out-of-Distribution Objects
Ting Peng ⋅ Junhao Dong ⋅ Yew-Soon Ong
Oral
Sun Jun 07 08:25 AM -- 08:37 AM (PDT) @ Mile High Ballroom 3A - 4A None
OLATverse: A Large-scale Real-world Object Dataset with Precise Lighting Control
Xilong Zhou ⋅ Jianchun Chen ⋅ Pramod Rao ⋅ Timo Teufel ⋅ Linjie Lyu ⋅ Tigran Minasian ⋅ Oleksandr Sotnychenko ⋅ Xiaoxiao Long ⋅ Marc Habermann ⋅ Christian Theobalt
Oral
Sun Jun 07 08:25 AM -- 08:37 AM (PDT) @ Bluebird Ballroom None
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
Christopher Clark ⋅ Jieyu Zhang ⋅ Zixian Ma ⋅ Jae Sung Park ⋅ Rohun Tripathi ⋅ Sangho Lee ⋅ Reza Salehi ⋅ Jason Ren ⋅ Chris Dongjoo Kim ⋅ Yinuo Yang ⋅ Vincent Shao ⋅ Yue Yang ⋅ Weikai Huang ⋅ Ziqi Gao ⋅ Taira Anderson ⋅ Jianrui Zhang ⋅ Jitesh Jain ⋅ George Stoica ⋅ Ali Farhadi ⋅ Ranjay Krishna
Oral
Sun Jun 07 08:25 AM -- 08:37 AM (PDT) @ Mile High Ballroom 1A - 2A None
QuadSync: Quadrifocal Tensor Synchronization via Tucker Decomposition
Daniel Miao ⋅ Gilad Lerman ⋅ Joe Kileel
Oral
Sun Jun 07 08:37 AM -- 08:50 AM (PDT) @ Bluebird Ballroom None
Optical Flow Matching: Reframing Optical Flow as Continuous Transport Dynamics
Ao Luo ⋅ XIN LI ⋅ Fan Yang ⋅ Yuezun Li ⋅ Zhaoquan Yuan ⋅ SHAN ZHAO ⋅ Bing Su ⋅ Xiao WU
Oral
Sun Jun 07 08:37 AM -- 08:50 AM (PDT) @ Four Seasons Ballroom None
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
Jizhou Han ⋅ Chenhao Ding ⋅ Yuhang He ⋅ Qiang Wang ⋅ Shaokun Wang ⋅ SongLin Dong ⋅ Yihong Gong
Oral
Sun Jun 07 08:37 AM -- 08:50 AM (PDT) @ Mile High Ballroom 1A - 2A None
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
Ziyi Chen ⋅ Yingnan Guo ⋅ Zedong Chu ⋅ Minghua Luo ⋅ Yanfen Shen ⋅ Mingchao Sun ⋅ Junjun Hu ⋅ Shichao Xie ⋅ Yang Kuan ⋅ Pei Shi ⋅ Zhining Gu ⋅ Lu Liu ⋅ Honglin Han ⋅ Xiaolong Wu ⋅ Mu Xu ⋅ Yu Zhang
Oral
Sun Jun 07 08:37 AM -- 08:50 AM (PDT) @ Mile High Ballroom 3A - 4A None
OpenDance: Multimodal Controllable 3D Dance Generation with Large-scale Internet Data
Jinlu Zhang ⋅ Zixi Kang ⋅ Libin Liu ⋅ Jianlong Chang ⋅ Qi Tian ⋅ Feng Gao ⋅ Yizhou Wang
Oral
Sun Jun 07 08:50 AM -- 09:02 AM (PDT) @ Mile High Ballroom 3A - 4A None
POLAR: A Portrait OLAT Dataset and Generative Framework for Illumination-Aware Face Modeling
Zhuo Chen ⋅ Chengqun Yang ⋅ Zhuo Su ⋅ Zheng Lv ⋅ Jingnan Gao ⋅ Xiaoyuan Zhang ⋅ Xiaokang Yang ⋅ Yichao Yan
Oral
Sun Jun 07 08:50 AM -- 09:02 AM (PDT) @ Mile High Ballroom 1A - 2A None
Structural Action Transformer for 3D Dexterous Manipulation
Xiaohan Lei ⋅ Min Wang ⋅ Bohong Weng ⋅ Wengang Zhou ⋅ Houqiang Li
Oral
Sun Jun 07 08:50 AM -- 09:02 AM (PDT) @ Bluebird Ballroom None
SEATrack: Simple, Efficient, and Adaptive Multimodal Tracker
Junbin Su ⋅ Ziteng Xue ⋅ Shihui Zhang ⋅ Kun Chen ⋅ Weiming Hu ⋅ Zhipeng Zhang
Oral
Sun Jun 07 08:50 AM -- 09:02 AM (PDT) @ Four Seasons Ballroom None
Understanding and Enforcing Weight Disentanglement in Task Arithmetic
Shangge Liu ⋅ Yuehan Yin ⋅ Lei Wang ⋅ Qi Fan ⋅ Yinghuan Shi ⋅ Wenbin Li ⋅ Yang Gao ⋅ Dacheng Tao
Oral
Sun Jun 07 09:02 AM -- 09:15 AM (PDT) @ Mile High Ballroom 1A - 2A None
TESO: Online Tracking of Essential Matrix by Stochastic Optimization
Jaroslav Moravec ⋅ Radim Sara ⋅ Akihiro Sugimoto
Oral
Sun Jun 07 09:02 AM -- 09:15 AM (PDT) @ Mile High Ballroom 3A - 4A None
Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
Kunwar Maheep Singh ⋅ Jianchun Chen ⋅ Vladislav Golyanik ⋅ Stephan Garbin ⋅ Thabo Beeler ⋅ Rishabh Dabral ⋅ Marc Habermann ⋅ Christian Theobalt
Oral
Sun Jun 07 09:02 AM -- 09:15 AM (PDT) @ Bluebird Ballroom None
U^2Flow: Uncertainty-Aware Unsupervised Optical Flow Estimation
Xunpei Sun ⋅ Wenwei Lin ⋅ Yi Chang ⋅ Gang Chen
Oral
Sun Jun 07 09:02 AM -- 09:15 AM (PDT) @ Four Seasons Ballroom None
Understanding Task Transfer in Vision-Language Models
Bhuvan Sachdeva ⋅ Karan Uppal ⋅ Abhinav Java ⋅ Vineeth Balasubramanian
Break
Sun Jun 07 09:15 AM -- 09:30 AM (PDT) None
Courtesy Break
Keynote
Sun Jun 07 09:30 AM -- 10:30 AM (PDT) @ Bluebird Ballroom None
Scaling Laws vs. Neural Laws: Toward More Natural Artificial Vision
Thomas Serre
Poster Setup
Sun Jun 07 10:15 AM -- 10:45 AM (PDT) @ ExHall A None
Poster Setup
Demonstration
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F None
Demos
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 1
Evidential Neural Radiance Fields
Ruxiao Duan ⋅ Alex Wong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 2
Global-Aware Edge Prioritization for Pose Graph Initialization
Tong Wei ⋅ Giorgos Tolias ⋅ Jiri Matas ⋅ Daniel Barath
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 3
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
Christopher Clark ⋅ Jieyu Zhang ⋅ Zixian Ma ⋅ Jae Sung Park ⋅ Rohun Tripathi ⋅ Sangho Lee ⋅ Reza Salehi ⋅ Jason Ren ⋅ Chris Dongjoo Kim ⋅ Yinuo Yang ⋅ Vincent Shao ⋅ Yue Yang ⋅ Weikai Huang ⋅ Ziqi Gao ⋅ Taira Anderson ⋅ Jianrui Zhang ⋅ Jitesh Jain ⋅ George Stoica ⋅ Ali Farhadi ⋅ Ranjay Krishna
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 4
Optical Flow Matching: Reframing Optical Flow as Continuous Transport Dynamics
Ao Luo ⋅ XIN LI ⋅ Fan Yang ⋅ Yuezun Li ⋅ Zhaoquan Yuan ⋅ SHAN ZHAO ⋅ Bing Su ⋅ Xiao WU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 5
SEATrack: Simple, Efficient, and Adaptive Multimodal Tracker
Junbin Su ⋅ Ziteng Xue ⋅ Shihui Zhang ⋅ Kun Chen ⋅ Weiming Hu ⋅ Zhipeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 6
U^2Flow: Uncertainty-Aware Unsupervised Optical Flow Estimation
Xunpei Sun ⋅ Wenwei Lin ⋅ Yi Chang ⋅ Gang Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 7
AToken: A Unified Tokenizer for Vision
Jiasen Lu ⋅ Liangchen Song ⋅ Mingze Xu ⋅ Byeongjoo Ahn ⋅ Yanjun Wang ⋅ Chen Chen ⋅ Afshin Dehghan ⋅ Yinfei Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 8
Confusion-Aware Spectral Regularizer for Long-Tailed Recognition
Ziquan Zhu ⋅ Gaojie Jin ⋅ Hanruo Zhu ⋅ Si-Yuan Lu ⋅ Yunxiao Zhang ⋅ ZEYU FU ⋅ Ronghui Mu ⋅ Guoqiang Zhang ⋅ Zhao Sun ⋅ Yuhang Xia ⋅ Jiaxing Shang ⋅ Xiang Li ⋅ Lu Liu ⋅ Tianjin Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 9
Learning Latent Concepts for Detecting Out-of-Distribution Objects
Ting Peng ⋅ Junhao Dong ⋅ Yew-Soon Ong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 10
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
Jizhou Han ⋅ Chenhao Ding ⋅ Yuhang He ⋅ Qiang Wang ⋅ Shaokun Wang ⋅ SongLin Dong ⋅ Yihong Gong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 11
Understanding and Enforcing Weight Disentanglement in Task Arithmetic
Shangge Liu ⋅ Yuehan Yin ⋅ Lei Wang ⋅ Qi Fan ⋅ Yinghuan Shi ⋅ Wenbin Li ⋅ Yang Gao ⋅ Dacheng Tao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 12
Understanding Task Transfer in Vision-Language Models
Bhuvan Sachdeva ⋅ Karan Uppal ⋅ Abhinav Java ⋅ Vineeth Balasubramanian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 13
AT-VLA: Adaptive Tactile Injection for Enhanced Feedback Reaction in Vision-Language-Action Models
Xiaoqi Li ⋅ Muhe Cai ⋅ Jiadong Xu ⋅ Juan Zhu ⋅ Hongwei Fan ⋅ Yan Shen ⋅ Guangrui Ren ⋅ Hao Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 14
Learning Diffeomorphism for Medical Image Registration with Time-Embedded Architectures Using Semigroup Regularization
Mohammadjavad Matinkia ⋅ Nilanjan Ray
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 15
QuadSync: Quadrifocal Tensor Synchronization via Tucker Decomposition
Daniel Miao ⋅ Gilad Lerman ⋅ Joe Kileel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 16
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
Ziyi Chen ⋅ Yingnan Guo ⋅ Zedong Chu ⋅ Minghua Luo ⋅ Yanfen Shen ⋅ Mingchao Sun ⋅ Junjun Hu ⋅ Shichao Xie ⋅ Yang Kuan ⋅ Pei Shi ⋅ Zhining Gu ⋅ Lu Liu ⋅ Honglin Han ⋅ Xiaolong Wu ⋅ Mu Xu ⋅ Yu Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 17
Structural Action Transformer for 3D Dexterous Manipulation
Xiaohan Lei ⋅ Min Wang ⋅ Bohong Weng ⋅ Wengang Zhou ⋅ Houqiang Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 18
TESO: Online Tracking of Essential Matrix by Stochastic Optimization
Jaroslav Moravec ⋅ Radim Sara ⋅ Akihiro Sugimoto
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 19
BoostSLT: Boosting Sign Language Translation via a Plug-and-Play Diffusion-Based Semantic Enhancer
Changzhou Han ⋅ Wanlun Ma ⋅ XI TANG ⋅ Kun Hu ⋅ Sheng Wen ⋅ Yang Xiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 20
ImmerIris: A Large-Scale Dataset and Benchmark for Off-Axis and Unconstrained Iris Recognition in Immersive Applications
Yuxi Mi ⋅ Qiuyang Yuan ⋅ Zhizhou Zhong ⋅ Xuan Zhao ⋅ Jiaogen Zhou ⋅ Fubao Zhu ⋅ Jihong Guan ⋅ Shuigeng Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 21
OLATverse: A Large-scale Real-world Object Dataset with Precise Lighting Control
Xilong Zhou ⋅ Jianchun Chen ⋅ Pramod Rao ⋅ Timo Teufel ⋅ Linjie Lyu ⋅ Tigran Minasian ⋅ Oleksandr Sotnychenko ⋅ Xiaoxiao Long ⋅ Marc Habermann ⋅ Christian Theobalt
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 22
OpenDance: Multimodal Controllable 3D Dance Generation with Large-scale Internet Data
Jinlu Zhang ⋅ Zixi Kang ⋅ Libin Liu ⋅ Jianlong Chang ⋅ Qi Tian ⋅ Feng Gao ⋅ Yizhou Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 23
POLAR: A Portrait OLAT Dataset and Generative Framework for Illumination-Aware Face Modeling
Zhuo Chen ⋅ Chengqun Yang ⋅ Zhuo Su ⋅ Zheng Lv ⋅ Jingnan Gao ⋅ Xiaoyuan Zhang ⋅ Xiaokang Yang ⋅ Yichao Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 24
Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
Kunwar Maheep Singh ⋅ Jianchun Chen ⋅ Vladislav Golyanik ⋅ Stephan Garbin ⋅ Thabo Beeler ⋅ Rishabh Dabral ⋅ Marc Habermann ⋅ Christian Theobalt
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 25
Scaling View Synthesis Transformers
Evan Kim ⋅ Hyunwoo Ryu ⋅ Thomas W. Mitchel ⋅ Vincent Sitzmann
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 26
WildPose: A Unified Framework for Robust Pose Estimation in the Wild
Jianhao Zheng ⋅ Liyuan Zhu ⋅ Zihan Zhu ⋅ Iro Armeni
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 27
MoRe: Motion-aware Feed-forward 4D Reconstruction Transformer
Juntong Fang ⋅ Zequn Chen ⋅ Weiqi Zhang ⋅ Donglin Di ⋅ Xuancheng Zhang ⋅ Chengmin Yang ⋅ Yu-Shen Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 28
Revisiting Monocular SLAM with Spatio-Temporal Scene Modeling
Valter Piedade ⋅ Lalit Manam ⋅ Masashi Yamazaki ⋅ Pedro Miraldo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 29
Minimal Constraint Relaxation for Multiview Autocalibration
Norio Kosaka ⋅ Timothy Duff ⋅ Tomas Pajdla
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 30
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis
hongyuan chen ⋅ Xingyu Chen ⋅ Zexiang Xu ⋅ Anpei Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 31
GGPT: Geometry-Grounded Point Transformer
Yutong Chen ⋅ Yiming Wang ⋅ Xucong Zhang ⋅ Sergey Prokudin ⋅ Siyu Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 32
MERG3R: A Divide-and-Conquer Approach to Large-Scale Neural Visual Geometry
Leo Kaixuan Cheng ⋅ Abdus Shaikh ⋅ Ruofan Liang ⋅ Zhijie Wu ⋅ Yushi Guan ⋅ Nandita Vijaykumar
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 33
Unlocking the Power of Critical Factors for 3D Visual Geometry Estimation
Guangkai Xu ⋅ Hua Geng ⋅ Huanyi Zheng ⋅ Songyi Yin ⋅ Yanlong Sun ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 34
KV-Tracker: Real-Time Pose Tracking with Transformers
Marwan Taher ⋅ Ignacio Alzugaray ⋅ Kirill Mazur ⋅ Xin Kong ⋅ Andrew J. Davison
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 35
InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization
Daniel Gilo ⋅ Or Litany
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 36
From Rays to Projections: Better Inputs for Feed-Forward View Synthesis
Zirui Wu ⋅ Zeren Jiang ⋅ Martin R. Oswald ⋅ Jie Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 37
SLARM: Streaming and Language-Aligned Reconstruction Model for Dynamic Scenes
ZhiCheng Qiu ⋅ Jiarui Meng ⋅ Tong-an Luo ⋅ Yican Huang ⋅ Xuan Feng ⋅ Xuanfu Li ⋅ Zhan Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 38
Parallel Rigidity Matters for Bundle Adjustment
Lalit Manam ⋅ Venu Madhav Govindu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 39
Simple but Effective Triplet-Based Compression Strategies for Compact Visual Localization
Torsten Sattler ⋅ Zuzana Kukelova
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 40
VIAFormer: Voxel-Image Alignment Transformer for High-Fidelity Voxel Refinement
Tiancheng Fang ⋅ Bowen Pan ⋅ Lingxi Chen ⋅ Jiangjing Lyu ⋅ Chengfei Lv ⋅ Chaoyue Niu ⋅ Fan Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 41
Mining Attribute Subspaces for Efficient Fine-tuning of 3D Foundation Models
Yu Jiang ⋅ Hanwen Jiang ⋅ Ahmed Abdelkader ⋅ Wen-Sheng Chu ⋅ Brandon Y. Feng ⋅ Zhangyang Wang ⋅ Qixing Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 42
DualPrim: Compact 3D Reconstruction with Positive and Negative Primitives
Xiaoxu Meng ⋅ Zhongmin Chen ⋅ Bo Yang ⋅ Weikai Chen ⋅ Weixiao Liu ⋅ Lin Gao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 43
StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References
Boyu He ⋅ Yunfan Ye ⋅ Chang Liu ⋅ Weishang Wu ⋅ FANG LIU ⋅ Zhiping Cai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 44
DynFusion: Rethinking Condition Fusion for Adaptive Multi-Conditional Text-to-Image Generation
Zheng Fang ⋅ Lichuan Xiang ⋅ Xu Cai ⋅ Bing Wang ⋅ Bo Yang ⋅ Hongkai Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 45
Agentic Retoucher for Text-To-Image Generation
Shaocheng Shen ⋅ Jianfeng Liang ⋅ Chunlei Cai ⋅ Cong Geng ⋅ Huiyu Duan ⋅ Xiaoyun Zhang ⋅ Qiang Hu ⋅ Guangtao Zhai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 46
StyleDoctor: Towards Specialist Reward Model for Style-centric Generation Tasks
Xilin He ⋅ Xiaole Xian ⋅ Xiangyu Yue ⋅ Muhammad Haris Khan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 47
SwitchCraft: Training-Free Multi-Event Video Generation with Attention Controls
Qianxun Xu ⋅ Chenxi Song ⋅ Yujun Cai ⋅ Chi Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 48
Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation
Zihao Wang ⋅ Yuxiang Wei ⋅ Xinpeng Zhou ⋅ Tianyu Zhang ⋅ Tao Liang ⋅ Yalong Bai ⋅ Hongzhi Zhang ⋅ Wangmeng Zuo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 49
Paper2Figure: A Multi-Agent Collaborative System for Figure Generation Towards Academic Research Paper
Siwei Han ⋅ Haonian Ji ⋅ Siyang Xin ⋅ Juanquan Shi ⋅ Shi Qiu ⋅ Xinyu Ye ⋅ Peng Xia ⋅ Jiaqi Liu ⋅ Zhaorun Chen ⋅ Yiyang Zhou ⋅ Linjie Li ⋅ Lijuan Wang ⋅ Huaxiu Yao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 50
Adapting In-context Generation for Enhanced Composed Image Retrieval
Haiwen Li ⋅ Zining Chen ⋅ Delong Liu ⋅ Zhaohui Hou ⋅ Zhicheng Zhao ⋅ Fei Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 51
Transition Models: Rethinking the Generative Learning Objective
ZiDong Wang ⋅ Yiyuan Zhang ⋅ Xiaoyu Yue ⋅ Xiangyu Yue ⋅ Yangguang Li ⋅ Wanli Ouyang ⋅ Lei Bai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 52
Rethinking Glyph Spatial Information in Font Generation
Peng Su ⋅ Xi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 53
StreamDiT: Real-Time Streaming Text-to-Video Generation
Akio Kodaira ⋅ Tingbo Hou ⋅ Ji Hou ⋅ Markos Georgopoulos ⋅ Felix Juefei-Xu ⋅ Masayoshi Tomizuka ⋅ Yue Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 54
ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control
Shishi Xiao ⋅ Tongyu Zhou ⋅ David H. Laidlaw ⋅ Gromit Yeuk-Yin Chan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 55
Camera Control for Text-to-Image Generation via Learning Viewpoint Tokens
Xinxuan Lu ⋅ Charless Fowlkes ⋅ Alex Berg
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 56
3D Space as a Scratchpad for Editable Text-to-Image Generation
Oindrila Saha ⋅ Vojtech Krs ⋅ Radomir Mech ⋅ Subhransu Maji ⋅ Matheus Gadelha ⋅ Kevin Blackburn-Matzen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 57
Aligning Multi-Character Narrative Image Generation with Multi-Aspect Human Preferences
Ziyi Gao ⋅ Zhipeng Wei ⋅ Jingjing Chen ⋅ Stewart Tan ⋅ Hao li ⋅ Yi-Ping Phoebe Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 58
FoleyDirector: Directing Temporal Controllable Video-to-Audio Generation via Fine-Grained Temporal Scripts
You Li ⋅ Dewei Zhou ⋅ Fan Ma ⋅ Fu Li ⋅ Dongliang He ⋅ Yi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 59
DCoAR: Deep Concept Injection into Unified Autoregressive Models for Personalized Text-to-Image Generation
Fangtai Wu ⋅ Mushui Liu ⋅ Weijie He ⋅ Zhao Wang ⋅ Yunlong Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 60
DreamOmni2: Multimodal Instruction-based Generation and Editing
Bin Xia ⋅ Bohao Peng ⋅ Yuechen Zhang ⋅ Junjia Huang ⋅ Jiyang Liu ⋅ Jingyao Li ⋅ Haoru Tan ⋅ WU Sitong ⋅ Chengyao Wang ⋅ Yitong Wang ⋅ Bei Yu ⋅ Jiaya Jia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 61
AutoDebias: An Automated Framework for Detecting and Mitigating Backdoor Biases in Text-to-Image Models
Hongyi Cai ⋅ MingKang Dong ⋅ Muxin Pu ⋅ Moayad Aloqaily ⋅ jie li ⋅ Xinfeng Li ⋅ Jialie Shen ⋅ Meikang Qiu ⋅ Qingsong Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 62
PosterIQ: A Design Perspective Benchmark for Poster Understanding and Generation
Yuheng Feng ⋅ Wen Zhang ⋅ Haodong Duan ⋅ Xingxing Zou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 63
IVAAN: Instance-level Vision-Language Alignment via Attribute-Guided Text Prompts Generation for Nuclei Analysis
Jaehoon Jeong ⋅ Yi Hu ⋅ Soopil Kim ⋅ Jongseong Jang ⋅ Soonyoung Lee ⋅ Sang Hyun Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 64
IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
Simone Magistri ⋅ Dipam Goswami ⋅ Marco Mistretta ⋅ Bartłomiej Twardowski ⋅ Joost van de Weijer ⋅ Andrew Bagdanov
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 65
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment
Bingyi Cao ⋅ Koert Chen ⋅ Kevis-kokitsi Maninis ⋅ Kaifeng Chen ⋅ Arjun Karpur ⋅ Ye Xia ⋅ Sahil Dua ⋅ Tanmaya Dabral ⋅ Guangxing Han ⋅ Bohyung Han ⋅ Joshua Ainslie ⋅ Alex Bewley ⋅ Mithun Jacob ⋅ René Wagner ⋅ Washington Ramos ⋅ Krzysztof Choromanski ⋅ Mojtaba Seyedhosseini ⋅ Howard Zhou ⋅ André Araujo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 66
BioVITA: Biological Dataset, Model, and Benchmark for Visual-Textual-Acoustic Alignment
Risa Shinoda ⋅ Kaede Shiohara ⋅ Nakamasa Inoue ⋅ Kuniaki Saito ⋅ Hiroaki Santo ⋅ Fumio Okura
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 67
Boosting Visual Reprogramming for CLIP with Dual Granularity Alignment
Jiayang Wu ⋅ Xinyang Chen ⋅ Ke Lv ⋅ Weili Guan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 68
Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning
Tingyu Li ⋅ Zheng Sun ⋅ Jingxuan Wei ⋅ Conghui He ⋅ Lijun Wu ⋅ Cheng Tan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 69
UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in RL
Rui Tian ⋅ Mingfei Gao ⋅ Haiming Gang ⋅ Jiasen Lu ⋅ Zhe Gan ⋅ Yinfei Yang ⋅ Zuxuan Wu ⋅ Afshin Dehghan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 70
PolySLGen: Online Multimodal Speaking-Listening Reaction Generation in Polyadic Interaction
Zhi-Yi Lin ⋅ Thomas Markhorst ⋅ Jouh Yeong Chew ⋅ Xucong Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 71
Label What Matters: Modality-Balanced and Difficulty-Aware Multimodal Active Learning
Yuqiao Zeng ⋅ Xu Wang ⋅ Tengfei Liang ⋅ Yiqing Hao ⋅ Yi Jin ⋅ Hui Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 72
Unified Personalized Understanding, Generating and Editing
Yu Zhong ⋅ Tianwei Lin ⋅ Ruike Zhu ⋅ Yuqian Yuan ⋅ Haoyu Zheng ⋅ Liang Liang ⋅ Wenqiao Zhang ⋅ Feifei Shao ⋅ Haoyuan Li ⋅ Wanggui He ⋅ Hao Jiang ⋅ Yueting Zhuang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 73
MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning
Chenglong Wang ⋅ Yifu Huo ⋅ Yang Gan ⋅ Qiaozhi He ⋅ Qi Meng ⋅ Bei Li ⋅ Yan Wang ⋅ Junfu Liu ⋅ Tianjua Zhou ⋅ JingBo Zhu ⋅ Tong Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 74
Towards Uncertainty-aware Unsupervised Domain Adaptation for Videos and Time-Series with Causal Optimal Transport
Khushboo Mishra ⋅ Varun Trivedi ⋅ Tanima Dutta
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 75
Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection
Sairam Rebbapragada ⋅ Rishabh Lalla ⋅ Aveen Dayal ⋅ Tejal Kulkarni ⋅ Anuj Lalla ⋅ Vineeth Balasubramanian ⋅ Muhammad Haris Khan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 76
Decision Boundary-aware Generation for Long-tailed Learning
jiacheng yang ⋅ Ruichi Zhang ⋅ Chikai Shang ⋅ Mengke Li ⋅ Xinyi Shang ⋅ Junlong Gao ⋅ Yonggang Zhang ⋅ Yang Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 77
Towards Stable Federated Continual Test-Time Adaptation in Wild World
Liwen Wang ⋅ Xingbo Dong ⋅ Yi Liao ⋅ Zhe Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 78
HyCal: A Training-Free Prototype Calibration Method for Cross-Discipline Few-Shot Class-Incremental Learning
Eunju Lee ⋅ MiHyeon Kim ⋅ Junehyoung Kwon ⋅ Yoonji Lee ⋅ JiHyun Kim ⋅ Soojin Jang ⋅ YoungBin Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 79
ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation
Bo Xu ⋅ Haotian Wu ⋅ Hehai Lin ⋅ Weiquan Huang ⋅ Beier Zhu ⋅ Yao Shu ⋅ Chengwei Qin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 80
CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection
Xinlin Zhuang ⋅ Yichen Li ⋅ Xiwei Liu ⋅ Haolin Yang ⋅ Yifan Lu ⋅ Ziyun Zou ⋅ Yulong Li ⋅ Huifa Li ⋅ Dongliang Chen ⋅ Qinglei Wang ⋅ Weiyang Liu ⋅ Ying Qian ⋅ Jiangming Shi ⋅ Imran Razzak
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 81
Addressing Exacerbated Attention Sink for Source-Free Cross-Domain Few-Shot Learning
Shuai Yi ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 82
Depth Hypothesis Guided Iterative Refinement for Event–Image Monocular Depth Estimation
Daikun Liu ⋅ Teng Wang ⋅ Changyin Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 83
High-Quality and Efficient Turbulence Mitigation with Events
Xiaoran Zhang ⋅ Jian Ding ⋅ Yuxing Duan ⋅ Haoyue Liu ⋅ Gang Chen ⋅ Yi Chang ⋅ Luxin Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 84
Tracking through Severe Occlusion via Event-Derived Transient Cues
Hao Dong ⋅ Yujin Liu ⋅ Haoyue Liu ⋅ Zhenyu Wang ⋅ Shihan Peng ⋅ Zhiwei Shi ⋅ Yi Chang ⋅ Luxin Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 85
FastEventDGS: Deformable Gaussian Splatting for Fast Dynamic Scenes from a Single Event Camera
Zijia Dai ⋅ Nico Messikommer ⋅ Rong Zou ⋅ Nikola Zubic ⋅ Davide Scaramuzza ⋅ Laurent Kneip
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 86
Event-Based Motion Deblurring Using Task-Oriented 3D Gaussian Event Representations
Shengdong Xue ⋅ Haoxiang Ma ⋅ Hao Chen ⋅ Zhen Yang ⋅ Yongjian Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 87
From Corners to Fiducial Tags: Revisiting Checkerboard Calibration for Event Cameras
Taehun Ryu ⋅ Changwoo Kang ⋅ Kyungdon Joo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 88
Extending Embodied Question Answering from Perception to Decision
Xicheng Gong ⋅ Qiwei Li ⋅ Peiran Xu ⋅ Yadong Mu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 89
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
Shaokai Wu ⋅ Yanbiao Ji ⋅ Qiuchang Li ⋅ Zhiyi Zhang ⋅ Qichen He ⋅ Wenyuan XIE ⋅ Guodong Zhang ⋅ Bayram Bayramli ⋅ Yue Ding ⋅ Hongtao Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 90
Demo2Tutorial: From Human Experience to Multimodal Software Tutorials
Zechen Bai ⋅ Zhiheng Chen ⋅ Yiqi Lin ⋅ Kevin Qinghong Lin ⋅ Difei Gao ⋅ Xiangwu Guo ⋅ Xin Wang ⋅ Mike Zheng Shou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 91
MaskDexGrasp: Generative Masked Modeling for Part-Aware Dexterous Grasp Synthesis
Binghui Zuo ⋅ Lin Zhou ⋅ Haoxuan Xu ⋅ Jianan Yan ⋅ ZhiPeng Yu ⋅ Zekai Liu ⋅ Yangang Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 92
Predict Before You Explore: Predictive Planning with Specialized Memory for Embodied Question Answering
Bowen Yuan ⋅ Sisi You ⋅ Bing-Kun Bao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 93
VideoWeaver: Multimodal Multi-View Video-to-Video Transfer for Embodied Agents
George Eskandar ⋅ Fengyi Shen ⋅ Mohammad Altillawi ⋅ Dong Chen ⋅ Yang Bai ⋅ Liudi Yang ⋅ Ziyuan Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 94
MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents
Ruoxuan Zhang ⋅ Qiyun Zheng ⋅ Zhiyu Zhou ⋅ Ziqi Liao ⋅ Siyu Wu ⋅ Jian-Yu Jiang-Lin ⋅ Bin Wen ⋅ Hongxia Xie ⋅ Jianlong Fu ⋅ Wen-Huang Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 95
Align While Search: Belief-Guided Exploratory Inference for World-Grounded Embodied Agents
Seohui Bae ⋅ Jeonghye Kim ⋅ Youngchul Sung ⋅ Woohyung Lim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 96
Rethinking Intermediate Representation for VLM-based Robot Manipulation
Weiliang Tang ⋅ Jialin Gao ⋅ Jia-Hui Pan ⋅ Gang Wang ⋅ Li Erran Li ⋅ Yun-Hui Liu ⋅ Mingyu Ding ⋅ Pheng-Ann Heng ⋅ Chi-Wing Fu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 97
Dexterous World Models
Byungjun Kim ⋅ Taeksoo Kim ⋅ Junyoung Lee ⋅ Hanbyul Joo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 98
FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-and-Language Navigation
Jing Zuo ⋅ Lingzhou Mu ⋅ Fan Jiang ⋅ Chengcheng Ma ⋅ Mu Xu ⋅ Yonggang Qi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 99
UniLight: A Unified Representation for Lighting
Zitian Zhang ⋅ Iliyan Georgiev ⋅ Michael Fischer ⋅ Yannick Hold-Geoffroy ⋅ Jean-François Lalonde ⋅ Valentin Deschaintre
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 100
MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Xinyu Wei ⋅ Kangrui Cen ⋅ Hongyang Wei ⋅ Zhen Guo ⋅ Bairui Li ⋅ Zeqing Wang ⋅ Jinrui Zhang ⋅ Lei Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 101
Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling
Minseok Seo ⋅ Mark Hamilton ⋅ Changick Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 102
Hist2Style: Histogram-Guided Stylization with Bilateral Grids
Dekel Galor ⋅ Adam Pikielny ⋅ Zhoutong Zhang ⋅ Ke Wang ⋅ Laura Waller ⋅ Jiawen Chen ⋅ Ilya Chugunov
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 103
Harmonic Canvas: Inversion-Free Editing for Visually-Guided Music Style Transfer
Yue Lei ⋅ Siqi Yang ⋅ Ting Zhong ⋅ Fan Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 104
How to Take a Memorable Picture? Empowering Users with Actionable Feedback
Francesco Laiti ⋅ Davide Talon ⋅ Jacopo Staiano ⋅ Elisa Ricci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 105
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying
Bai Chengyu ⋅ Jintao Chen ⋅ Xiang Bai ⋅ Yilong Chen ⋅ Qi She ⋅ Ming Lu ⋅ Shanghang Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 106
SCIEval: Evaluating and Benchmarking the Faithfulness of Scientific Image Generation and Interpretation with Large Multimodal Models
Guanghui Ye ⋅ Huan Zhao ⋅ Zhixue Zhao ⋅ Tengfei Ma ⋅ Kehan Wang ⋅ Steffen Eger ⋅ Zhihua Jiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 107
GeoRelight: Learning Joint Geometrical Reconstruction and Relighting with Flexible Multi-Modal Diffusion Transformers
Yuxuan Xue ⋅ Ruofan Liang ⋅ Egor Zakharov ⋅ Timur Bagautdinov ⋅ Chen Cao ⋅ Giljoo Nam ⋅ Shunsuke Saito ⋅ Gerard Pons-Moll ⋅ Javier Romero
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 108
HAD: Hallucination-Aware Diffusion Priors for 3D Reconstruction
Xi Liu ⋅ Weiwei Sun ⋅ Joe Ren ⋅ Christopher Broaddus ⋅ Siyu Huang ⋅ Laurent Guigues
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 109
Catalyst4D: High-Fidelity 3D-to-4D Scene Editing via Dynamic Propagation
Shifeng Chen ⋅ Yihui Li ⋅ Jun Liao ⋅ Hongyu Yang ⋅ Di Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 110
ReFlow: Self-correction Motion Learning for Dynamic Scene Reconstruction
Yanzhe Liang ⋅ Ruijie Zhu ⋅ Hanzhi Chang ⋅ Zhuoyuan Li ⋅ Jiahao Lu ⋅ Tianzhu Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 111
Semantic Foam: Unifying Spatial and Semantic Scene Decomposition
Amr Sharafeldin ⋅ Aryan Mikaeili ⋅ Thomas Walker ⋅ Shrisudhan Govindarajan ⋅ Daniel Rebain ⋅ Kwang Moo Yi ⋅ Andrea Tagliasacchi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 112
NVGS: Neural Visibility for Occlusion Culling in 3D Gaussian Splatting
Brent Zoomers ⋅ Florian Hahlbohm ⋅ Joni Vanherck ⋅ Lode Jorissen ⋅ Marcus Magnor ⋅ Nick Michiels
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 113
NeAR: Coupled Neural Asset–Renderer Stack
Hong Li ⋅ Chongjie Ye ⋅ Houyuan Chen ⋅ Weiqing Xiao ⋅ Ziyang Yan ⋅ Lixing Xiao ⋅ Zhaoxi Chen ⋅ Jianfeng XIANG ⋅ Shaocong Xu ⋅ Xuhui Liu ⋅ Yikai Wang ⋅ Baochang Zhang ⋅ Xiaoguang Han ⋅ Jiaolong Yang ⋅ Hao Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 114
Thermal is Always Wild: Characterizing and Addressing Challenges in Thermal-Only Novel View Synthesis
M. Kerem Aydin ⋅ Vishwanath Saragadam ⋅ Emma Alexander
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 115
PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis
chunji lv ⋅ Zequn Chen ⋅ Donglin Di ⋅ Weinan Zhang ⋅ Hao Li ⋅ Wei Chen ⋅ Yinjie Lei ⋅ Changsheng Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 116
Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling
Tang Long ⋅ Huiyu Duan ⋅ Guoquan Zheng ⋅ Jianbo Zhang ⋅ Jie Hao ⋅ Liang Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 117
TM-BSN: Triangular-Masked Blind-Spot Network for Real-World Self-Supervised Image Denoising
Junyoung Park ⋅ Youngjin Oh ⋅ Nam Ik Cho
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 118
Multinex: Lightweight Low-light Image Enhancement via Multi-prior Retinex
Alexandru Brateanu ⋅ Tingting Mu ⋅ Codruta O. Ancuti ⋅ Cosmin Ancuti
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 119
Beyond Ground-Truth: Leveraging Image Quality Priors for Real-World Image Restoration
Fengyang Xiao ⋅ Peng Hu ⋅ Lei Xu ⋅ XingE Guo ⋅ Guanyi Qin ⋅ Yuqi Shen ⋅ Chengyu Fang ⋅ Rihan Zhang ⋅ Chunming He ⋅ Sina Farsiu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 120
ExpoCM: Exposure-Aware One-Step Generative Single-Image HDR Reconstruction
Aoyu Liu ⋅ Zhen Liu ⋅ Ziyi Wang ⋅ Dian Chen ⋅ Bing Zeng ⋅ Shuaicheng Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 121
Physically-Grounded Turbulence Mitigation with Frame-Shared Degradation Parameters
Dongxin Xie ⋅ Yan Huang ⋅ Yong Xu ⋅ Hui Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 122
Convexity-Aware Noise Calibration: A Self-Supervised Framework for Noise-Level-Unknown Image Denoising
Zhan Wang ⋅ Wang Leiquan ⋅ Chunlei Wu ⋅ Yu Meng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 123
UCMNet: Uncertainty-Aware Context Memory Network for Under-Display Camera Image Restoration
DAEHYUN KIM ⋅ Youngmin Kim ⋅ Yoon Ju Oh ⋅ Tae Hyun Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 124
Beyond the Ground Truth: Enhanced Supervision for Image Restoration
Donghun Ryou ⋅ Inju Ha ⋅ Sanghyeok Chu ⋅ Bohyung Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 125
ShiftLUT: Spatial Shift Enhanced Look-Up Tables for Efficient Image Restoration
ZENG XIAOLONG ⋅ Yitong Yu ⋅ Shiyao Xiong ⋅ Jinhua Hao ⋅ Ming Sun ⋅ Chao Zhou ⋅ Bin Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 126
Bilevel Layer-Positioning LoRA for Real Image Dehazing
Yan Zhang ⋅ Long Ma ⋅ Yuxin Feng ⋅ Zhe Huang ⋅ Fan Zhou ⋅ Zhuo Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 127
SD-FSMIS: Adapting Stable Diffusion for Few-Shot Medical Image Segmentation
Meihua Li ⋅ Yang Zhang ⋅ Weizhao He ⋅ Hu Qu ⋅ Yisong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 128
GeoSemba: Reconstructing State Space Model for Cross Paradigm Representation in Medical Image Segmentation
Xutao Sun ⋅ Jiarui Li ⋅ Junwen Liu ⋅ Yonggong Ren
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 129
SHAPE: Structure-aware Hierarchical Unsupervised Domain Adaptation with Plausibility Evaluation for Medical Image Segmentation
Linkuan Zhou ⋅ Yinghao Xia ⋅ Yufei Shen ⋅ Xiangyu Li ⋅ Wenjie Du ⋅ Cong Cong ⋅ leyi wei ⋅ Ran Su ⋅ Qiangguo Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 130
Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models
Ruiyang Li ⋅ Fang Liu ⋅ Licheng Jiao ⋅ Xinglin Xie ⋅ Jiayao Hao ⋅ Shuo Li ⋅ Xu Liu ⋅ Jingyi yang ⋅ Lingling Li ⋅ Puhua Chen ⋅ Wenping Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 131
Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification
Han Liu ⋅ Bogdan Georgescu ⋅ Yanbo Zhang ⋅ Youngjin Yoo ⋅ Michael Baumgartner ⋅ Riqiang Gao ⋅ Jianing Wang ⋅ Gengyan Zhao ⋅ Eli Gibson ⋅ Dorin Comaniciu ⋅ Sasa Grbic
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 132
Focus on Background: Exploring SAM's Potential in Few-shot Medical Image Segmentation with Background-centric Prompting
Yuntian Bo ⋅ Yazhou Zhu ⋅ Piotr Koniusz ⋅ Haofeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 133
Simple-ViLMedSAM: Simple Text Prompts Meet Vision-Language Models for Medical Image Segmentation
Chengcan Qian ⋅ Dong Nie ⋅ Geng Chen ⋅ Daoqiang Zhang ⋅ Xuyun Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 134
NeuroSeg Meets DINOv3: Transferring 2D Self-Supervised Visual Priors to 3D Neuron Segmentation via DINOv3 Initialization
Yik San Cheng ⋅ Runkai Zhao ⋅ Weidong Cai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 135
Multi-Paradigm Collaborative Adversarial Attack Against Multi-Modal Large Language Models
Yuanbo Li ⋅ Tianyang Xu ⋅ Cong Hu ⋅ Tao Zhou ⋅ Xiao-Jun Wu ⋅ Josef Kittler
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 136
TINA: Text-Free Inversion Attack for Unlearned Text-to-Image Diffusion Models
Qianlong Xiang ⋅ Miao Zhang ⋅ Haoyu Zhang ⋅ Kun Wang ⋅ Junhui Hou ⋅ Liqiang Nie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 137
Jailbreaking Vision-Language Models via Dissonance-Guided Suffix Optimization and Image–Phrase Injection
Jiacheng Pi ⋅ Zhiguo Yang ⋅ Xingxing Huang ⋅ Dongsheng Xu ⋅ Ruizhi Zhong ⋅ Wenjie Ruan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 138
BlackMirror: Black-Box Backdoor Detection for Text-to-Image Models via Instruction-Response Deviation
Feiran Li ⋅ Qianqian Xu ⋅ Shilong Bao ⋅ Zhiyong Yang ⋅ Xilin Zhao ⋅ Xiaochun Cao ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 139
VCP-Attack: Visual-Contrastive Projection for Transferable Black-Box Targeted Attacks on Large Vision-Language Models
Jiawei Zhao ⋅ Minjie Du ⋅ Zihan Qin ⋅ Zhuoran Wang ⋅ Lizhe Xie ⋅ Yining Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 140
Adapter Shield: A Unified Framework with Built-in Authentication for Preventing Unauthorized Zero-Shot Image-to-Image Generation
Jun Jia ⋅ Hongyi Miao ⋅ Yingjie Zhou ⋅ Wangqiu Zhou ⋅ Jianbo Zhang ⋅ Linhan Cao ⋅ Dandan Zhu ⋅ Hua Yang ⋅ Xiongkuo Min ⋅ Wei Sun ⋅ Guangtao Zhai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 141
LLaVAShield: Safeguarding Multimodal Multi-Turn Dialogues in Vision-Language Models
Guolei Huang ⋅ Qinzhi Peng ⋅ Gan Xu ⋅ Yao Huang ⋅ Yuxuan Lu ⋅ Yongjun Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 142
Transform to Transfer: Boosting Adversarial Attack Transferability on Vision-Language Pre-training Models
Yang Li ⋅ Jia-Li Yin ⋅ Luojun Lin ⋅ Wei Lin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 143
Mask to Align, Weight to Disambiguate: Reliable Unsupervised Cross-Modal Hashing with Masked-Weight Contrast
Fan Yang ⋅ Yuanzhi Zhao ⋅ Haimei Zhao ⋅ Yudong Zhao ⋅ Haikun Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 144
Reliable Clustering Number Estimation for Contrastive Multi-View Clustering
Zhengzhong Zhu ⋅ Pei Zhou ⋅ Lanxi Bai ⋅ Li Cheng ⋅ Jia Nie ⋅ Shiquan min ⋅ Jiangping Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 145
Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning
Apoorv Vyas ⋅ Heng-Jui Chang ⋅ Cheng-Fu Yang ⋅ Po-Yao Huang ⋅ Luya Gao ⋅ Julius Richter ⋅ Sanyuan Chen ⋅ Matthew Le ⋅ Piotr Dollár ⋅ Christoph Feichtenhofer ⋅ Ann Lee ⋅ Wei-Ning Hsu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 146
Enhance-then-Balance Modality Collaboration for Robust Multimodal Sentiment Analysis
Kang He ⋅ Yuzhe Ding ⋅ Xinrong Wang ⋅ Fei Li ⋅ Chong Teng ⋅ Donghong Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 147
SonoWorld: From One Image to a 3D Audio-Visual Scene
Derong Jin ⋅ Xiyi Chen ⋅ Ming C. Lin ⋅ Ruohan Gao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 148
MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
yushi Huang ⋅ Zining Wang ⋅ Zhihang Yuan ⋅ Yifu Ding ⋅ RUIHAO GONG ⋅ Jinyang Guo ⋅ Xianglong Liu ⋅ Jun Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 149
EXOTIC: External Vision-driven Incomplete Multi-view Classification
Shilin Xu ⋅ Dezhong Peng ⋅ Zhenwen Ren ⋅ Yuan Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 150
Easy2Hard: From Partially to Fully Unmatched Modalities as Negative Samples in Contrastive Learning
Zhicheng Yang ⋅ Yichen Liu ⋅ Chang Ge ⋅ Xiaopeng Jiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 151
OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Han Li ⋅ Xinyu Peng ⋅ Yaoming Wang ⋅ Zelin Peng ⋅ Xin Chen ⋅ Rongxiang Weng ⋅ Jingang Wang ⋅ Xunliang Cai ⋅ Wenrui Dai ⋅ Hongkai Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 152
BALM: A Model-Agnostic Framework for Balanced Multimodal Learning under Imbalanced Missing Rates
Phuong-Anh Nguyen ⋅ Tien Anh Pham ⋅ Duc-Trong Le ⋅ Van Nguyen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 153
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling
Leon Liangyu Chen ⋅ Haoyu Ma ⋅ Zhipeng Fan ⋅ Ziqi Huang ⋅ Animesh Sinha ⋅ Xiaoliang Dai ⋅ Jialiang Wang ⋅ Zecheng He ⋅ Jianwei Yang ⋅ Chunyuan Li ⋅ Junzhe Sun ⋅ Chu Wang ⋅ Serena Yeung ⋅ Felix Juefei-Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 154
Multi-modal Test-time Adaptation via Adaptive Probabilistic Gaussian Calibration
Jinglin Xu ⋅ Yi Li ⋅ Chuxiong Sun ⋅ Xiao Xu ⋅ Jiangmeng Li ⋅ Fanjiang Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 155
Information-Theoretic Decomposition for Multimodal Interaction Learning
Zequn Yang ⋅ Yake Wei ⋅ HaoTian Ni ⋅ Zhihao Xu ⋅ Di Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 156
Is the Modality Gap a Bug or a Feature? A Robustness Perspective
Rhea Chowers ⋅ Oshri Naparstek ⋅ Udi Barzelay ⋅ Yair Weiss
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 157
Omni-Fake: Benchmarking Unified Multimodal Social Media Deepfake Detection
Tianxiao Li ⋅ Zhenglin Huang ⋅ Haiquan Wen ⋅ Yiwei He ⋅ Xinze Li ⋅ BINGYU ZHU ⋅ WUHUI DUAN ⋅ Congang CHEN ⋅ ZEYU FU ⋅ Yi Dong ⋅ Baoyuan Wu ⋅ Xiangtai Li ⋅ Guangliang Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 158
MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality
Kyungwon Kim ⋅ Dosik Hwang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 159
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
SiNan Du ⋅ JiaHao Guo ⋅ Bo Li ⋅ Shuhao Cui ⋅ Zhengzhuo Xu ⋅ Yifu Luo ⋅ Yongxian Wei ⋅ Kun Gai ⋅ Xinggang Wang ⋅ Kai Wu ⋅ Chun Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 160
MOS: Mitigating Optical-SAR Modality Gap for Cross-Modal Ship Re-Identification
Yujian Zhao ⋅ Hankun Liu ⋅ Guanglin Niu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 161
SeD-UD: An Influence-Driven and Hierarchically-Decoupled Information Bottleneck for Multimodal Intent Recognition
Qin Li ⋅ Wenbo Zhang ⋅ Limei Liu ⋅ Han Peng ⋅ Junfeng Yang ⋅ Guanying Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 162
MultiModalPFN: Extending Prior-Data Fitted Networks for Multimodal Tabular Learning
Wall Kim ⋅ Chaeyoung Song ⋅ Hanul Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 163
LacTokGen: Latent Consistency Tokenizer for 1024-pixel Image Generation by 256 Tokens
Qingsong Xie ⋅ Luyuan Zhang ⋅ Zhao Zhang ⋅ Siyuan Li ⋅ Zhe Huang ⋅ Zhenyu Yang ⋅ Haonan Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 164
FlowSteer: Guiding Few-Step Image Synthesis with Authentic Trajectories
Lei Ke ⋅ Hubery Yin ⋅ Gongye Liu ⋅ Zhengyao Lv ⋅ Jingcai Guo ⋅ Chen Li ⋅ Wenhan Luo ⋅ Yujiu Yang ⋅ Jing LYU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 165
Visual Autoregressive Modeling via Next Focus Prediction
Xiaofan Li ⋅ Chenming Wu ⋅ Yanpeng Sun ⋅ Jiaming Zhou ⋅ Delin Qu ⋅ Yansong Qu ⋅ Weihao Bo ⋅ Haibao Yu ⋅ Dingkang Liang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 166
Semantic Context Matters: Improving Conditioning for Autoregressive Models
Dongyang Jin ⋅ Ryan Xu ⋅ Jianhao Zeng ⋅ Rui Lan ⋅ Yancheng Bai ⋅ Lei Sun ⋅ Xiangxiang Chu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 167
TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction
Yukuo Ma ⋅ Cong Liu ⋅ Junke Wang ⋅ Junqi Liu ⋅ Haibin Huang ⋅ Zuxuan Wu ⋅ Chi Zhang ⋅ Xuelong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 168
FlashIn: Fast and Accurate Image Inversion for Real-time Image Editing
Guangzhi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 169
EasyV2V: A High-quality Instruction-based Video Editing Framework
Jinjie Mai ⋅ Chaoyang Wang ⋅ Gordon Guocheng Qian ⋅ Willi Menapace ⋅ Sergey Tulyakov ⋅ Bernard Ghanem ⋅ Peter Wonka ⋅ Ashkan Mirzaei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 170
One Algorithm to Align Them All
Boyi Pang ⋅ Savva Ignatyev ⋅ Vladimir Ippolitov ⋅ Ramil Khafizov ⋅ Yurii Melnik ⋅ Oleg Voynov ⋅ Maksim Nakhodnov ⋅ Aibek Alanov ⋅ Xiaopeng Fan ⋅ Peter Wonka ⋅ Evgeny Burnaev
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 171
VGA-Bench: A Unified Benchmark and Multi-Model Framework for Video Aesthetics and Generation Quality Evaluation
Longteng Jiang ⋅ DanDan Zheng ⋅ Qianqian Qiao ⋅ Heng Huang ⋅ Huaye Wang ⋅ Yihang Bo ⋅ Bao Peng ⋅ Jingdong Chen ⋅ JUN ZHOU ⋅ Xin Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 172
Improved Mean Flows: On the Challenges of Fastforward Generative Models
ZHENGYANG GENG ⋅ Yiyang Lu ⋅ Zongze Wu ⋅ Eli Shechtman ⋅ Zico Kolter ⋅ Kaiming He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 173
SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation
Shuai Tan ⋅ Biao Gong ⋅ Yujie Wei ⋅ Shiwei Zhang ⋅ Zhuoxin Liu ⋅ Ke Ma ⋅ Yan Wang ⋅ Kecheng Zheng ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Hengshuang Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 174
Match-and-Fuse: Consistent Generation from Unstructured Image Sets
Kate Feingold ⋅ Omri Kaduri ⋅ Tali Dekel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 175
Mixture of Style Experts for Diverse Image Stylization
Shihao Zhu ⋅ Ziheng Ouyang ⋅ Yijia Kang ⋅ Qilong Wang ⋅ Mi Zhou ⋅ Bo Li ⋅ Mingming Cheng ⋅ Qibin Hou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 176
Mirai: Autoregressive Visual Generation Needs Foresight
Yonghao Yu ⋅ Lang Huang ⋅ Zerun Wang ⋅ Runyi Li ⋅ Toshihiko Yamasaki
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 177
Align Images Before You Generate
Shihua Zhang ⋅ Qiuhong Shen ⋅ Xinchao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 178
Bridging the Perception Gap in Image Super-Resolution Evaluation
Shaolin Su ⋅ Josep M. ⋅ Danna Xue ⋅ David Serrano-Lozano ⋅ Lei Sun ⋅ Javier Vazquez-Corral
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 179
Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution
Tianyi Zhang ⋅ Zheng-Peng Duan ⋅ Chunle Guo ⋅ Peng-Tao Jiang ⋅ Bo Li ⋅ Mingming Cheng ⋅ Chongyi Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 180
Restore Text First, Enhance Image Later: Two-Stage Scene Text Image Super-Resolution with Glyph Structure Guidance
Minxing Luo ⋅ Linlong Fan ⋅ Qiushi Wang ⋅ Ge Wu ⋅ Yiyan Luo ⋅ Yuhang Yu ⋅ Jinwei Chen ⋅ Yaxing Wang ⋅ Qingnan Fan ⋅ Jian Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 181
IAFMNet: Information-Aware Feature Modulation for Efficient Super-Resolution
Junwei Xu ⋅ Mengzu Liu ⋅ Zhenyu Wang ⋅ Fangfang Wu ⋅ Sijia Wu ⋅ Tao Huang ⋅ Weisheng Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 182
Physics-Consistent Diffusion for Efficient Fluid Super-Resolution via Multiscale Residual Correction
Zhihao LI ⋅ Shengwei Dong ⋅ Chuang Yi ⋅ Junxuan Gao ⋅ Zhilu Lai ⋅ Zhiqiang Liu ⋅ Wei Wang ⋅ Guangtao Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 183
Bridging Fidelity-Reality with Controllable One-Step Diffusion for Image Super-Resolution
Hao Chen ⋅ Junyang Chen ⋅ Jinshan Pan ⋅ Jiangxin Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 184
Omni-Supervised Motion Editing: Balancing Change and Invariance through Positive-Negative Learning
Zhenwu Shi ⋅ Jingyu Gong ⋅ Peiwei Wang ⋅ Xingzan Wang ⋅ Tianwen Qian ⋅ Wenxi Li ⋅ Yuan Fang ⋅ Jiao Xie ⋅ Lizhuang Ma ⋅ Shaohui Lin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 185
FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning
Weijie Lyu ⋅ Ming-Hsuan Yang ⋅ ZHIXIN SHU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 186
Cross-Axis Feature Fusion with Joint-Wise Motion Difference Prediction for Text-Based 3D Human Motion Editing
Gyojin Han ⋅ Junmo Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 187
MotionMaster: Generalizable Text-Driven Motion Generation and Editing
Nan Jiang ⋅ yunhao li ⋅ Lexi Pang ⋅ Zimo He ⋅ Siyuan Huang ⋅ Yixin Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 188
OpenT2M: No-frill Motion Generation with Open-source, Large-scale, High-quality Data
Bin Cao ⋅ Sipeng Zheng ⋅ Hao Luo ⋅ Boyuan Li ⋅ Jing Liu ⋅ Zongqing Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 189
Towards Decompositional Human Motion Generation with Energy-Based Diffusion Models
Jianrong Zhang ⋅ Hehe Fan ⋅ Yi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 190
PAMotion: Physics-Aware Motion Generation for Full-Body Interaction with Multiple Objects
Yan Di ⋅ Yuheng Li ⋅ Yaoxing Wang ⋅ Mengge Liu ⋅ Shan Gao ⋅ Xiangyang Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 191
Sketch2Colab: Sketch-Conditioned Multi-Human Animation via Controllable Flow Distillation
Divyanshu Daiya ⋅ Aniket Bera
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 192
ViHOI: Human-Object Interaction Synthesis with Visual Priors
Songjin Cai ⋅ Linjie Zhong ⋅ Ling Guo ⋅ Changxing Ding
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 193
CLEP: Contrastive Language-Pose Pretraining
Sen Jia ⋅ Huayu Wang ⋅ Hsiang-Wei Huang ⋅ Zhaochong An ⋅ Jenq-Neng Hwang ⋅ Huaping Zhang ⋅ Lei Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 194
OpenFS: Multi-Hand-Capable Fingerspelling Recognition with Implicit Signing-Hand Detection and Frame-Wise Letter-Conditioned Synthesis
Junuk Cha ⋅ Jihyeon Kim ⋅ Han-Mu Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 195
ARMFlow: AutoRegressive MeanFlow for Online 3D Human Reaction Generation
Zichen Geng ⋅ Zeeshan Hayder ⋅ Wei Liu ⋅ Hesheng Wang ⋅ Ajmal Mian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 196
InterPhys: Physics-aware Human Motion Synthesis in a Dynamic Scene
Chaoyue Xing ⋅ Wei Mao ⋅ Miaomiao Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 197
Beyond Mimicry: Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations
Wei-Jin Huang ⋅ Yue-Yi Zhang ⋅ Yi-Lin Wei ⋅ Zhi-Wei Xia ⋅ Juantao Tan ⋅ Yuanming Li ⋅ Zhilin Zhao ⋅ Wei-Shi Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 198
PHAC: Promptable Human Amodal Completion
Seung Young ⋅ Ju Yong Chang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 199
CoordSpeaker: Exploiting Gesture Captioning for Coordinated Caption-Empowered Co-Speech Gesture Generation
Fengyi Fang ⋅ Sicheng Yang ⋅ Wenming Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 200
IntrinsicWeather: Controllable Weather Editing in Intrinsic Space
Yixin Zhu ⋅ Zuo-Liang Zhu ⋅ Jian Yang ⋅ Milos Hasan ⋅ Jin Xie ⋅ Beibei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 201
Outlier-Robust Diffusion Solvers for Inverse Problems
Yang Zheng ⋅ Jiahua Liu ⋅ Tongyao Pang ⋅ Wen Li ⋅ Zhaoqiang Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 202
Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models
Zhirong Shen ⋅ Rui Huang ⋅ Jiacheng Liu ⋅ Chang Zou ⋅ Peiliang Cai ⋅ Shikang Zheng ⋅ zhengyi shi ⋅ Liang Feng ⋅ Linfeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 203
ReasonX: MLLM-Guided Intrinsic Image Decomposition
Alara Dirik ⋅ Tuanfeng Wang ⋅ Duygu Ceylan ⋅ Stefanos Zafeiriou ⋅ Anna Frühstück
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 204
Diff-SemiER: Transparency-Aware Adaptive Fusion Diffusion Model with Generative Prior for Semi-Transparent Eyeglasses Removal
Jiahao Li ⋅ Shiqi Yin ⋅ Zhenxiang Lian ⋅ jingtao guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 205
KLIP: Localized Distribution Shift Detection via KL-Divergence with Diffusion Priors in Inverse Problems
Alireza Kheirandish ⋅ Jihoon Hong ⋅ Sara Fridovich-Keil
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 206
Elucidating the Design Space of Arbitrary-Noise-Based Diffusion Models
Xingyu Qiu ⋅ Mengying Yang ⋅ Xinghua Ma ⋅ Dong Liang ⋅ Fanding Li ⋅ Gongning Luo ⋅ wei wang ⋅ Kuanquan Wang ⋅ Shuo Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 207
Taming Generative Diffusion Model for Task-Oriented Infrared Imaging
Tengyu Ma ⋅ Zhilong Dai ⋅ Yubo Diao ⋅ Guanming An ⋅ Long Ma ⋅ Jinyuan Liu ⋅ Risheng Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 208
Attention, May I Have Your Decision? Localizing Generative Choices in Diffusion Models
Katarzyna Zaleska ⋅ Łukasz Popek ⋅ Monika Wysoczańska ⋅ Kamil Deja
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 209
RxnCaption: Reformulating Reaction Diagram Parsing as Visual Prompt Guided Captioning
Jiahe Song ⋅ Chuang Wang ⋅ Bowen Jiang ⋅ Yinfan Wang ⋅ Hao Zheng ⋅ Xingjian Wei ⋅ Chengjin Liu ⋅ Rui Nie ⋅ Junyuan Gao ⋅ Jiaxing Sun ⋅ Yubin Wang ⋅ Lijun Wu ⋅ Zhenhua Huang ⋅ Jiang Wu ⋅ Qian Yu ⋅ Conghui He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 210
More than the Sum: Panorama-Language Models for Adverse Omni-Scenes
Weijia Fan ⋅ Ruiping Liu ⋅ Jiale Wei ⋅ Yufan Chen ⋅ Junwei Zheng ⋅ Zichao Zeng ⋅ Jiaming Zhang ⋅ Qiufu Li ⋅ Linlin Shen ⋅ Rainer Stiefelhagen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 211
DiGraphHal-Bench: Evaluating Multimodal Large Language Models on Complex Directed Graphs
Yixin Fan ⋅ He Zhao ⋅ Yuxin Hou ⋅ Changhua Zhou ⋅ Zihao Liu ⋅ Peng Wang ⋅ Lu ChengLong ⋅ Xu Zhang ⋅ Wei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 212
SEA-Vision: A Multilingual Benchmark for Comprehensive Document and Scene Text Understanding in Southeast Asia
Pengfei Yue ⋅ Xingran Zhao ⋅ Juntao Chen ⋅ Peng Hou ⋅ Wang Longchao ⋅ Jianghang Lin ⋅ Shengchuan Zhang ⋅ Anxiang Zeng ⋅ Liujuan Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 213
Time Blindness: Why Video-Language Models Can’t See What Humans Can?
Ujjwal Upadhyay ⋅ Mukul Ranjan ⋅ Zhiqiang Shen ⋅ Mohamed Elhoseiny
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 214
Spot The Ball: A Benchmark for Visual Social Inference
Neha Balamurugan ⋅ Sarah Wu ⋅ Cristobal Eyzaguirre ⋅ Tobias Gerstenberg
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 215
MM-SeR: Multimodal Self-Refinement for Lightweight Image Captioning
Junha Song ⋅ Yongsik Jo ⋅ So Yeon Min ⋅ Quanting Xie ⋅ Taehwan Kim ⋅ Yonatan Bisk ⋅ Jaegul Choo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 216
E-comIQ-ZH: A Human-Aligned Dataset and Benchmark for Fine-Grained Evaluation of E-commerce Posters with Chain-of-Thought
Meiqi Sun ⋅ mingyu Li ⋅ Junxiong Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 217
GeoWorld: Geometric World Models
Zeyu Zhang ⋅ Danning Li ⋅ Ian Reid ⋅ Richard Hartley
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 218
ORD: Object-Relation Decoupling for Generalized 3D Visual Grounding
Ronggang Huang ⋅ FanSen Meng ⋅ Huaidong Zhang ⋅ Xuemiao Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 219
Benchmarking PhD-Level Coding in 3D Geometric Computer Vision
Wenyi Li ⋅ Renkai Luo ⋅ Yue Yu ⋅ Huan-ang Gao ⋅ Mingju Gao ⋅ Li Yuan ⋅ Chaoyou Fu ⋅ Hao Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 220
MonoVLM: Monocular 3D Visual Grounding with Vision Language Models
Huaizhi Qu ⋅ Hossein Nourkhiz Mahjoub ⋅ Vaishnav Tadiparthi ⋅ Kwonjoon Lee ⋅ Tianlong Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 221
Curvature-Aware Captioning: Leveraging Geodesic Attention for 3D Scene Understanding
Ziyao He ⋅ Yingjie Liu ⋅ Zhang Yangrui ⋅ Mingsong Chen ⋅ Xuan Tang ⋅ Xian Wei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 222
SPREAD: Spatial-Physical REasoning via geometry Aware Diffusion
Minzhang Li ⋅ Kuixiang Shao ⋅ xuebing li ⋅ Yuyang Jiao ⋅ Yinuo Bai ⋅ Hengan Zhou ⋅ Sixian Shen ⋅ Jiayuan Gu ⋅ Jingyi Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 223
ExtrinSplat: Decoupling Geometry and Semantics for Open-Vocabulary Understanding in 3D Gaussian Splatting
Jiayu Ding ⋅ Xinpeng Liu ⋅ Zhiyi Pan ⋅ Shiqiang Long ⋅ Ge Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 224
SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence
Haoning Wu ⋅ Xiao Huang ⋅ Yaohui Chen ⋅ Ya Zhang ⋅ Yanfeng Wang ⋅ Weidi Xie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 225
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation
Chiao-An Yang ⋅ Ryo Hachiuma ⋅ Sifei Liu ⋅ Subhashree Radhakrishnan ⋅ Raymond A. Yeh ⋅ Yu-Chiang Frank Wang ⋅ Min-Hung Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 226
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
Zhiwen Fan ⋅ Jian Zhang ⋅ Renjie Li ⋅ Junge Zhang ⋅ Runjin Chen ⋅ Hezhen Hu ⋅ Kevin Wang ⋅ Peihao Wang ⋅ Huaizhi Qu ⋅ Shijie Zhou ⋅ Dilin Wang ⋅ Zhicheng Yan ⋅ Hongyu Xu ⋅ Justin Theiss ⋅ Tianlong Chen ⋅ Jiachen Li ⋅ Zhengzhong Tu ⋅ Zhangyang Wang ⋅ Rakesh Ranjan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 227
Merge3D: Efficient 3D Multimodal LLMs via Joint 2D-3D Token Merging
Tianbo Pan ⋅ Xingyi Yang ⋅ Xinchao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 228
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Runsen Xu ⋅ Weiyao Wang ⋅ Hao Tang ⋅ Xingyu Chen ⋅ Xiaodong Wang ⋅ Fu-Jen Chu ⋅ Matt Feiszli ⋅ Kevin J Liang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 229
LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight
Yunze Man ⋅ Shihao Wang ⋅ Guowen Zhang ⋅ Johan Bjorck ⋅ Liang-Yan Gui ⋅ Jim Fan ⋅ Jan Kautz ⋅ Yu-Xiong Wang ⋅ Zhiding Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 230
Quota-Calibrated Fine-Grained Alignment with Context-Aware Marginals for Text-based Person Retrieval
Dongsheng Li ⋅ Xinyuan Guo ⋅ Huijie Zhang ⋅ Pingting Hao ⋅ Qiushi Xia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 231
Evo-Retriever: LLM-Guided Curriculum Evolution with Viewpoint-Pathway Collaboration for Multimodal Document Retrieval
Li Weiqing ⋅ Jinyue Guo ⋅ Yaqi Wang ⋅ HAIYANG XIAO ⋅ Yuewei Zhang ⋅ Guohua Liu ⋅ Hao Henry Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 232
Taxonomy-Aware Representation Alignment for Hierarchical Visual Recognition with Large Multimodal Models
Hulingxiao He ⋅ Zhi Tan ⋅ Yuxin Peng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 233
FAAR: Efficient Frequency-Aware Multi-Task Fine-Tuning via Automatic Rank Selection
Maxime Fontana ⋅ Michael Spratling ⋅ Miaojing Shi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 234
Model Merging in the Essential Subspace
Longhua Li ⋅ Lei Qi ⋅ Qi Tian ⋅ Xin Geng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 235
Beyond Semantic Search: Towards Referential Anchoring in Composed Image Retrieval
Yuxin Yang ⋅ Yinan Zhou ⋅ Yuxin Chen ⋅ Ziqi Zhang ⋅ Zongyang Ma ⋅ Chunfeng Yuan ⋅ Bing Li ⋅ Jun Gao ⋅ Weiming Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 236
SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval
Ruixiang Zhao ⋅ Zhihao Xu ⋅ Bangxiang Lan ⋅ Zijie Xin ⋅ Jingyu Liu ⋅ Xirong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 237
MarkushGrapher-2: End-to-end Multimodal Recognition of Chemical Structures
Tim Strohmeyer ⋅ Lucas Morin ⋅ Gerhard Ingmar Meijer ⋅ Valery Weber ⋅ Ahmed Nassar ⋅ Peter Staar
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 238
Progressive Cross-Modal Causal Intervention for Long-Term Action Recognition
Shaowu Xu ⋅ Xibin Jia ⋅ Chao Fan ⋅ Junyu Gao ⋅ Jing Chang ⋅ Qianmei Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 239
EthoCLIP: Ontology-Enhanced Video-Language Pretraining for Animal Behavior Understanding
Yinuo Jing ⋅ Jinyan Wu ⋅ Zixi Yang ⋅ Kongming Liang ⋅ Xiatian Zhu ⋅ Zhanyu Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 240
TrajTok: Learning Trajectory Tokens Enhances Video Understanding
Chenhao Zheng ⋅ Jieyu Zhang ⋅ Jianing Zhang ⋅ Weikai Huang ⋅ Ashutosh Kumar ⋅ Quan Kong ⋅ Oncel Tuzel ⋅ Chun-Liang Li ⋅ Ranjay Krishna
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 241
Streaming Video Instruction Tuning
Jiaer Xia ⋅ Peixian Chen ⋅ Mengdan Zhang ⋅ Xing Sun ⋅ Kaiyang Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 242
VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer
Rui Lin ⋅ Chuanming Wang ⋅ Huadong Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 243
ViterbiPlanNet: Injecting Procedural Knowledge via Differentiable Viterbi for Planning in Instructional Videos
Luigi Seminara ⋅ Davide Moltisanti ⋅ Antonino Furnari
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 244
From Static to Dynamic: Exploring Self-supervised Image-to-Video Representation Transfer Learning
Yang Liu ⋅ Qianqian Xu ⋅ Peisong Wen ⋅ Siran Dai ⋅ Xilin Zhao ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 245
Learnable Motion-Focused Tokenization for Effective and Efficient Video Unsupervised Domain Adaptation
Tzu Ling Liu ⋅ Ian Stavness ⋅ Mrigank Rochan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 246
FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding
Yiweng Xie ⋅ Bo He ⋅ Junke Wang ⋅ Xiangyu Zheng ⋅ Ziyi Ye ⋅ Zuxuan Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 247
Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos
Sontao Jiang ⋅ Sibo Song ⋅ Chenyi Zhou ⋅ Yuan Wang ⋅ Ruizhe Chen ⋅ Tongkun Guan ⋅ Ruilin Luo ⋅ Yan Zhang ⋅ Zhihang Tang ⋅ Yuchong Sun ⋅ Hang Zhang ⋅ Zhibo Yang ⋅ Shuai Bai ⋅ Junyang Lin ⋅ Zuozhu Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 248
Video Panels for Long Video Understanding
Lars Doorenbos ⋅ Federico Spurio ⋅ Jürgen Gall
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 249
Gaze Target Estimation Anywhere with Concepts
Xu Cao ⋅ Houze Yang ⋅ Vipin Gunda ⋅ Zhongyi Zhou ⋅ Tianyu Xu ⋅ Adarsh Kowdle ⋅ Inki Kim ⋅ James M.
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 250
Select, Hypothesize and Verify: Towards Verified Neuron Concept Interpretation
ZeBin Ji ⋅ Yang Hu ⋅ Xiuli Bi ⋅ Bo Liu ⋅ Bin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 251
Finding Distributed Object-Centric Properties in Self-Supervised Transformers
Samyak Rawlekar ⋅ Amitabh Swain ⋅ Yujun Cai ⋅ Yiwei Wang ⋅ Ming-Hsuan Yang ⋅ Narendra Ahuja
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 252
Explaining CLIP Zero-shot Predictions Through Concepts
Onat Ozdemir ⋅ Anders Christensen ⋅ Stephan Alaniz ⋅ Zeynep Akata ⋅ Emre Akbas
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 253
See Through the Noise: Improving Domain Generalization in Gaze Estimation
Yanming Peng ⋅ Shijing Wang ⋅ Yaping Huang ⋅ Yi Tian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 254
Mechanisms of Object Localization in Vision–Language Models
Timothy Schaumlöffel ⋅ Martina G. Vilas ⋅ Gemma Roig
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 255
mmWaveFlow: Unified Enhancement and Generation of mmWave Human Point Clouds
Chang Su ⋅ Beihong Jin ⋅ Qiwen Shi ⋅ Zhi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 256
From Feature Learning to Spectral Basis Learning: A Unifying and Flexible Framework for Efficient and Robust Shape Matching
Feifan Luo ⋅ Hongyang Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 257
Topology-aware Feature Propagation for Unsupervised Non-rigid Point Cloud Correspondence
Haozhe Chen ⋅ Rui Li ⋅ 正宝 王 ⋅ Xinhao Zhu ⋅ Linjie Li ⋅ Tianyu Xiong ⋅ Xuan Ouyang ⋅ Jiaqi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 258
BEV-SLD: Self-Supervised Scene Landmark Detection for Global Localization with LiDAR Bird’s-Eye View Images
David Skuddis ⋅ Vincent Ress ⋅ Wei Zhang ⋅ Vincent Ofosu Nyako ⋅ Norbert Haala
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 259
SAG-GNN: Semantic-Aware Guided GNN for Descriptor-Free 2D-3D Matching
Shihua Zhang ⋅ Tianhao Xu ⋅ Zizhuo Li ⋅ Qing Ma ⋅ Jiayi Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 260
LiREC-Net: A Target-Free and Learning-Based Network for LiDAR, RGB, and Event Calibration
Aditya Ranjan Dash ⋅ Ramy Battrawy ⋅ René Schuster ⋅ Didier Stricker
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 261
GM-R^2: Generative Matching Learning for Unsupervised Geometric Representation and Registration
Haobo Jiang ⋅ Liang Yu ⋅ Jianmin Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 262
4D Local Modeling Toward Dynamic Global Perception for Ambiguity-free Rotation-Invariant Point Cloud Analysis
JIAXUN GUO ⋅ Wentao Fan ⋅ Manar Amayri ⋅ Nizar Bouguila
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 263
PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction
Ziqiao Meng ⋅ Qichao Wang ⋅ Zhiyang Dou ⋅ Zixing Song ⋅ Zhipeng Zhou ⋅ Irwin King ⋅ Peilin Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 264
MORE-STEM: Long-Short MemOry REcall and Spatio-TEmporal Consistency Model for Query-Driven 3D/4D Point Cloud Segmentation
Chade Li ⋅ Haida Feng ⋅ Pengju Zhang ⋅ Yihong Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 265
Low-Rank Test-Time Training for Pre-Trained Point Cloud Models
Ouyangzi Ye ⋅ Feifei Shao ⋅ Kexin Li ⋅ Yawei Luo ⋅ Zikai Song ⋅ Ping Liu ⋅ Fengda Zhang ⋅ Hongwei Wang ⋅ Jun Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 266
STAR: Test-Time Adaptation Can Enhance Universal Prompt Learning for Vision-Language Models
Yiwei Fu ⋅ Hui Wan ⋅ Xiao Luo ⋅ Minghua Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 267
Exploring Visual Pretraining for Learning Language Intelligence
Zhonghan Zhao ⋅ Yiming Zhang ⋅ Wenwei Zhang ⋅ Haiteng Zhao ⋅ Xingguang Wei ⋅ Zhangwei Gao ⋅ Kuikun Liu ⋅ Yuzhe Gu ⋅ Size Wu ⋅ Haian Huang ⋅ Jianfei Gao ⋅ haijun Lv ⋅ Demin Song ⋅ Yunhua Zhou ⋅ Qipeng Guo ⋅ Gaoang Wang ⋅ Kai Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 268
VL-Eraser: Vacuum Distillation for Machine Unlearning in Vision-Language Models
Yili Wang ⋅ Lu Dai ⋅ Tairan Huang ⋅ Yijie Xu ⋅ Hui Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 269
DeAR: Fine-Grained VLM Adaptation by Decomposing Attention Head Roles
Yiming Ma ⋅ Hongkun Yang ⋅ Lionel Z. Wang ⋅ BIN CHEN ⋅ Weizhi Xian ⋅ Jianzhi Teng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 270
SynCLIP: Synonym-Coherent Language-Image Pretraining for Robust Open-Vocabulary Dense Perception
Mingjie Xie ⋅ Guangjun He ⋅ Dongli Xu ⋅ Youtian Lin ⋅ Hongjue Li ⋅ Pengming Feng ⋅ Jian Guan ⋅ Yue Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 271
MODIX: A Training-Free Multimodal Information-Driven Positional Index Scaling for Vision-Language Models
Ruoxiang Huang ⋅ Zhen Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 272
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Xinlei Yu ⋅ Chengming Xu ⋅ Guibin Zhang ⋅ Zhangquan Chen ⋅ Yudong Zhang ⋅ Yongbo He ⋅ Peng-Tao Jiang ⋅ Jiangning Zhang ⋅ Xiaobin Hu ⋅ Shuicheng Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 273
ORION: ORthonormal Text Encoding for Universal VLM AdaptatION
Omprakash Chakraborty ⋅ Jose Dolz ⋅ Ismail Ben Ayed
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 274
CASPA: Graph-Structured Concept Anchors for Modality-Agnostic Adaptation in Vision–Language Models
Abhiroop Chatterjee ⋅ Susmita Ghosh ⋅ Ashish Ghosh ⋅ Emmett Ientilucci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 275
Mirror Illusion Art
Xiaopei Zhu ⋅ Zeyuan Li ⋅ Jun Zhu ⋅ Xiaolin Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 276
HOG-Layout: Hierarchical 3D Scene Generation, Optimization and Editing via Vision-Language Models
Haiyan Jiang ⋅ Deyu Zhang ⋅ dongdong weng ⋅ Weitao Song ⋅ Henry Been-Lirn Duh
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 277
Towards Human-Like Robot Handwriting via Contour-Aware Generation
Yutao Qin ⋅ Gang Dai ⋅ Yifan Zhang ⋅ Youwei Han ⋅ Qisheng He ⋅ Shuangping Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 278
MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts
Zilong Huang ⋅ Jun He ⋅ Xiaobin Huang ⋅ Ziyi Xiong ⋅ Yang Luo ⋅ Junyan Ye ⋅ Weijia Li ⋅ Yiping Chen ⋅ Ting Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 279
VectorArk: Learning Practical Image Vectorization with Rounded Polygon Representation
Tarun Gehlaut ⋅ Difan Liu ⋅ Charu Bansal ⋅ Krutik Malani ⋅ Souymodip Chakraborty ⋅ Ankit Phogat ⋅ Matthew Fisher ⋅ Vineet Batra
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 280
OctoT2I: A Self-Evolving Agentic Text-to-Image Router
Jiang Xu ⋅ Bin Chen ⋅ Gehui Li ⋅ Yule Duan ⋅ Ronggang Wang ⋅ Jian Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 281
LottieGPT: Tokenizing Vector Animation for Autoregressive Generation
Junhao Chen ⋅ Gao Kejun ⋅ Yuehan Cui ⋅ Mingze Sun ⋅ Mingjin Chen ⋅ Shaohui Wang ⋅ Xiaoxiao Long ⋅ Fei Ma ⋅ Qi Tian ⋅ Hao Zhao ⋅ Ruqi Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 282
SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering
Jiho Park ⋅ Sieun Choi ⋅ Jaeyoon Seo ⋅ Minho Sohn ⋅ Yeana Kim ⋅ Jihie Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 283
Selective Amnesia using Contrastive Subnet Erasure for Class Level Unlearning in Vision Models
Vishal Pramanik ⋅ Maisha Maliha ⋅ Susmit Jha ⋅ Alvaro Velasquez ⋅ Olivera Kotevska ⋅ Sumit Jha
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 284
A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks
Tangzheng Lian ⋅ Guanyu Hu ⋅ Yijing Ren ⋅ Dimitrios Kollias ⋅ Oya Celiktutan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 285
Rank-Guided Pseudo-Bias Learning for Robust Black-Box Adaptation
Rajeev Ranjan Dwivedi ⋅ Anshuman Dangwal ⋅ Vinod Kurmi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 286
Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection
Jinhu Fu ⋅ Yihang Lou ⋅ Qingyi Si ⋅ Shudong Zhang ⋅ Sen Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 287
WaTeRFlow: Watermark Temporal Robustness via Flow Consistency
Utae Jeong ⋅ Sumin In ⋅ Hyunju Ryu ⋅ Jaewan Choi ⋅ Feng Yang ⋅ Jongheon Jeong ⋅ Seungryong Kim ⋅ Sangpil Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 288
DSO: Direct Steering Optimization for Bias Mitigation
Lucas Monteiro Paes ⋅ Nivedha Sivakumar ⋅ Yinong Oliver Wang ⋅ Masha Fedzechkina ⋅ Barry-John Theobald ⋅ Luca Zappella ⋅ Nicholas Apostoloff
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 289
SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video Attribution
Chao Wang ⋅ Zijin Yang ⋅ Yaofei Wang ⋅ Yuang Qi ⋅ Weiming Zhang ⋅ Nenghai Yu ⋅ Kejiang Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 290
SineProject: Machine Unlearning for Stable Vision-Language Alignment
Arpit Garg ⋅ Hemanth Saratchandran ⋅ Simon Lucey
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 291
HiLoRA: Hierarchical Low-Rank Adaptation for Personalized Federated Learning
Zihao Peng ⋅ Nan Zou ⋅ Jiandian Zeng ⋅ Guo Li ⋅ Ke Chen ⋅ Boyuan Li ⋅ Tian Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 292
OS-Fed: One Snapshot Is All You Need
Xuwei Qian ⋅ Jinghui Zhang ⋅ Yuchuan Tan ⋅ Wenbo Huang ⋅ Zhen Wu ⋅ Shen Zhou ⋅ LiSha Gao ⋅ Ding Ding ⋅ Fang Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 293
FedAlign: Differentially Private Distribution Alignment for Non-IID Federated Learning
Peng Wu ⋅ Jiapeng Zhang ⋅ Yingjie Song ⋅ Xiong Xiao ⋅ Zhuo Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 294
Guiding Diffusion Models with Fine-Grained Conditions and Semantics-Preserving Sampling for One-Shot Federated Learning
Xiaojun Deng ⋅ Tianchi Liao ⋅ Zhiyuan Liu ⋅ Chuan Chen ⋅ Zibin Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 295
Personalized Federated Training of Diffusion Models with Privacy Guarantees
Kumar Kshitij Patel ⋅ Bingqing Jiang ⋅ A F M Mahfuzul Kabir ⋅ Weitong Zhang ⋅ Difan Zou ⋅ Lingxiao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 296
FedRAC: Rolling Submodel Allocation for Collaborative Fairness in Federated Learning
Zihui Wang ⋅ Yuhang Fu ⋅ Mengmeng Du ⋅ Zhimin Yuan ⋅ Yachen Liu ⋅ Weisheng Liao ⋅ Kaiyu Wang ⋅ Zheng Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 297
Understanding Temporal Logic Consistency in Video-Language Models through Cross-Modal Attention Discriminability
Chengzhi Li ⋅ Heyan Huang ⋅ Ping Jian ⋅ Zhen Yang ⋅ Yaning Tian ⋅ Zhongbin Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 298
Small Object, Great Challenge: A Benchmark for Small Object Visual Grounding
Wenqi Jia ⋅ Ruifan Li ⋅ Pengyue Lin ⋅ Fangxiang Feng ⋅ Zhanyu Ma ⋅ Xiaojie Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 299
UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
Hewen Pan ⋅ Cong Wei ⋅ Dashuang Liang ⋅ Zepeng Huang ⋅ Pengfei Gao ⋅ Ziqi Zhou ⋅ Lulu Xue ⋅ Pengfei Yan ⋅ Xiaoming Wei ⋅ Minghui Li ⋅ Shengshan Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 300
ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding
Daichi Yashima ⋅ Shuhei Kurita ⋅ Yusuke Oda ⋅ Komei Sugiura
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 301
CaST-Bench: Benchmarking Causal Chain-Grounded Spatio-Temporal Reasoning for Video Question Answering
Mingfang Zhang ⋅ Jingjing Pan ⋅ Ashutosh Kumar ⋅ Rajat Saini ⋅ Mustafa Erdogan ⋅ Hsuan-Kung Yang ⋅ Caixin Kang ⋅ Yifei Huang ⋅ Yoichi Sato ⋅ Quan Kong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 302
HERO: Hierarchical Embedding-Refinement for Open-Vocabulary Temporal Sentence Grounding in Videos
Tingting Han ⋅ Xinsong Tao ⋅ Yufei Yin ⋅ Min Tan ⋅ Sicheng Zhao ⋅ Zhou Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 303
Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism
Tao Chen ⋅ Kun Zhang ⋅ Qiong Wu ⋅ Xiao Chen ⋅ Chao Chang ⋅ Xiaoshuai Sun ⋅ Yiyi Zhou ⋅ Rongrong Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 304
Hybrid Token Compression for Vision-Language Models
jusheng zhang ⋅ Xiaoyang Guo ⋅ Kaitong Cai ⋅ Qinhan Lv ⋅ Yijia Fan ⋅ Wenhao Chai ⋅ Jian Wang ⋅ Keze Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 305
Focus, Don’t Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding
Mincheol Kwon ⋅ MINSEUNG LEE ⋅ Seonga Choi ⋅ Miso Choi ⋅ Kyeongjin Oh ⋅ Hyunyoung Lee ⋅ Cheonyoung Park ⋅ Yongho Song ⋅ Seunghyun Park ⋅ Jinkyu Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 306
When Token Pruning is Worse than Random: Understanding Visual Token Information in VLLMs
Yahong Wang ⋅ Juncheng Wu ⋅ Zhangkai Ni ⋅ Longzhen Yang ⋅ Yihang Liu ⋅ Chengmei Yang ⋅ Ying Wen ⋅ Lianghua He ⋅ Xianfeng Tang ⋅ Hui Liu ⋅ Yuyin Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 307
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions
Adrian Bulat ⋅ Alberto Baldrati ⋅ Ioannis Maniadis Metaxas ⋅ Yassine Ouali ⋅ Georgios Tzimiropoulos
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 308
BiGain: Unified Token Compression for Joint Generation and Classification
Jiacheng Liu ⋅ Shengkun Tang ⋅ Jiacheng Cui ⋅ Dongkuan Xu ⋅ Zhiqiang Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 309
Hi-Lo Prune: Look at What You'll Lose before Pruning with Hierarchical Token Selection
Zixun Sun ⋅ Yubo Dong ⋅ Hehe Fan ⋅ Yi Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 310
VLM-Pruner: Buffering for Spatial Sparsity in an Efficient VLM Centrifugal Token Pruning Paradigm
Zhenkai Wu ⋅ Xiaowen Ma ⋅ ZHENLIANG NI ⋅ Dengming Zhang ⋅ Han Shu ⋅ Xin Jiang ⋅ Xinghao Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 311
Bridge: Basis-Driven Causal Inference Marries VFMs for Domain Generalization
Mingbo Hong ⋅ Feng Liu ⋅ Caroline Gevaert ⋅ George Vosselman ⋅ Hao Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 312
In Pursuit of Pixel Supervision for Visual Pre-training
Lihe Yang ⋅ Shang-Wen Li ⋅ Yang Li ⋅ Xinjie Lei ⋅ Dong Wang ⋅ Abdelrahman Mohamed ⋅ Saining Xie ⋅ Hengshuang Zhao ⋅ Kaiming He ⋅ Hu Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 313
GaussianMatch: Semi-Supervised Regression with Pseudo-Label Filtering via Multi-View Gaussian Consistency
Yin Wang ⋅ Hao Lu ⋅ Zixuan Wang ⋅ Zhen Qin ⋅ Li Kuang ⋅ Mengchu Zhou ⋅ Shuiguang Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 314
TAR: Token-Aware Refinement for Fine-grained Generalized Category Discovery
XingYu Yang ⋅ Yu Zhang ⋅ Siya Mi ⋅ Xiu-Shen Wei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 315
Semantic Noise Reduction via Teacher-Guided Dual-Path Audio-Visual Representation Learning
Linge Wang ⋅ Yingying Chen ⋅ Bingke Zhu ⋅ Lu Zhou ⋅ Jinqiao Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 316
The Universal Normal Embedding
Chen Tasker ⋅ Roy Betser ⋅ Eyal Gofer ⋅ Meir Yossef Levi ⋅ Guy Gilboa
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 317
Bypassing the Transport Plan: Dynamic Reweighting for Out-of-Distribution Detection with Optimal Transport
Yang Xiao ⋅ Weiming Liu ⋅ Jun Dan ⋅ Tengyue Xu ⋅ Fan Wang ⋅ Hua Yu ⋅ Junhao Dong ⋅ Jiao Liu ⋅ Shunjie Dong ⋅ Lianyong Qi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 318
Cross-domain Dual-stream Feature Disentanglement for Brain Disorder Prediction with Sparsely Labeled PET
Huabin Wang ⋅ Xinyu Chen ⋅ Yuan Zhou ⋅ Fei Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 319
Debiased Sample Selection for Learning with Noisy Labels
Weiran Pan ⋅ Wei Wei ⋅ Wenfeng xie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 320
Driving on Registers
Ellington Kirby ⋅ Alexandre Boulch ⋅ Yihong Xu ⋅ Yuan Yin ⋅ Gilles Puy ⋅ Éloi Zablocki ⋅ Andrei Bursuc ⋅ Spyros Gidaris ⋅ Renaud Marlet ⋅ Florent Bartoccioni ⋅ Anh Quan Cao ⋅ Nermin Samet ⋅ Vu ⋅ Matthieu Cord
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 321
Open-Ended Instruction Realization with LLM-Enabled Multi-Planner Scheduling in Autonomous Vehicles
Jiawei Liu ⋅ Xun Gong ⋅ Fen Fang ⋅ Muli Yang ⋅ Bohao Qu ⋅ Yunfeng hu ⋅ Hong Chen ⋅ Xulei Yang ⋅ Qing Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 322
EE-RL: Vision Language Guided Reinforcement Learning with Explorer and Expert model for End-to-End Autonomous Driving
Xiaolong Li ⋅ Lan Yang ⋅ Ruyang Li ⋅ Shan Fang ⋅ Yang Liu ⋅ Xiangmo Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 323
Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving
Jiahao Wang ⋅ Bo Sun ⋅ Yijing Bai ⋅ Vincent Casser ⋅ Songyou Peng ⋅ Zehao Zhu ⋅ Meng-Li Shih ⋅ Xander Masotto ⋅ Shih-Yang Su ⋅ Kanaad Parvate ⋅ Tiancheng Ge ⋅ Linn Bieske ⋅ Dragomir Anguelov ⋅ Mingxing Tan ⋅ Chiyu “Max” Jiang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 324
SHARP: Short-Window Streaming for Accurate and Robust Prediction in Motion Forecasting
Alexander Prutsch ⋅ Christian Fruhwirth-Reisinger ⋅ David Schinagl ⋅ Horst Possegger
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 325
DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving
Enhui Ma ⋅ Jiahuan Zhang ⋅ Guantian Zheng ⋅ Tao Tang ⋅ Shengbo Eben Li ⋅ Yuhang Lu ⋅ xia zhou ⋅ Xueyang Zhang ⋅ Yifei Zhan ⋅ Kun Zhan ⋅ Zhihui Hao ⋅ XianPeng Lang ⋅ Kaicheng Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 326
CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention
Jiacheng Tang ⋅ Zhiyuan Zhou ⋅ Zhuolin He ⋅ Jia Zhang ⋅ Kai Zhang ⋅ Jian Pu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 327
Reliable Policy Transfer for Safety-Aware End-to-End Driving with Deep Reinforcement Learning
Uddin Md. Borhan ⋅ Arif Raza ⋅ Zhiliang Lin ⋅ Lu Wang ⋅ Jianqiang Li ⋅ Jie Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 328
Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos
Matthew Strong ⋅ Wei-Jer Chang ⋅ Quentin HERAU ⋅ Jiezhi Yang ⋅ Yihan Hu ⋅ Chensheng Peng ⋅ Wei Zhan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 329
WhisperNet: A Scalable Solution for Bandwidth-Efficient Collaboration
Gong Chen ⋅ Chaokun Zhang ⋅ Xinyan Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 330
Efficient Equivariant Transformer for Self-Driving Agent Modeling
Scott Xu ⋅ Dian Chen ⋅ Kelvin Wong ⋅ Chris Zhang ⋅ Kion Fallah ⋅ Raquel Urtasun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 331
Generalizable Co-Salient Object Detection via Mixed Content-Style Modulation
Guanting Guo ⋅ Shenglong Hu ⋅ Kaihua Zhang ⋅ Guangcan Liu ⋅ Min Xia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 332
Saliency-Driven Token Merging for Vision Transformers
Weiying Xie ⋅ Xiaoyu Chen ⋅ Xin Zhang ⋅ Chenhe Hao ⋅ Jitao Ma ⋅ Yunsong Li ⋅ Leyuan Fang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 333
RISE: Single Static Radar-based Indoor Scene Understanding
Kaichen Zhou ⋅ Laura Dodds ⋅ Sayed Saad Afzal ⋅ Fadel Adib
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 334
Mixture-of-Experts based Feature Decoupling for Open Vocabulary Scene Graph Generation
Yiming Li ⋅ Sisi You ⋅ Bing-Kun Bao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 335
TF-SSD: A Strong Pipeline via Synergic Mask Filter for Training-free Co-salient Object Detection
Zhijin He ⋅ Shuo Jin ⋅ Siyue Yu ⋅ Shuwei Wu ⋅ Bingfeng Zhang ⋅ Li Yu ⋅ Jimin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 336
Denoise and Align: Towards Source-Free UDA for Robust Panoramic Semantic Segmentation
Yaowen Chang ⋅ Zhen Cao ⋅ Xu Zheng ⋅ Xiaoxin Mi ⋅ Zhen Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 337
SPOT: Spatiotemporal Prompt Optimization for Motion-Stabilized MLLM-Guided Video Segmentation
Jiayi Fan ⋅ Zheyun Qin ⋅ Xiaoming Xi ⋅ Xiushan Nie ⋅ Yilong Yin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 338
Changes in Real Time: Online Scene Change Detection with Multi-View Fusion
Chamuditha Jayanga Galappaththige ⋅ Jason Lai ⋅ Lloyd Windrim ⋅ Donald Dansereau ⋅ Niko Suenderhauf ⋅ Dimity Miller
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 339
Subspace Alignment for CLIP-based Continual Learning via Canonical Correlation Analysis
Huan Zhang ⋅ Shuyu Dong ⋅ Yujin Zheng ⋅ Dingwen Wang ⋅ Shenghua Fan ⋅ Fan Lyu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 340
DGS: Dual Gradient and Semantic-Shift Guided Low-Rank Adaptation for Class Incremental Learning
KAI LI ⋅ Jiafeng Li ⋅ Lianghua He ⋅ Ying Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 341
Dynamic Magic: Unleashing Restricted Knowledge for Lifelong Person Re-Identification
Jinjia Peng ⋅ Jican Tan ⋅ Jiazuo Yu ⋅ Zeze Tao ⋅ Huibing Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 342
Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models
Hyundong Jin ⋅ Dongyoon Han ⋅ Eunwoo Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 343
Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning
Jinge Ma ⋅ Fengqing Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 344
Forging a Dynamic Memory: Retrieval-Guided Continual Learning for Generalist Medical Foundation Models
Zizhi Chen ⋅ Yizhen Gao ⋅ Minghao Han ⋅ Yizhou Liu ⋅ Zhaoyu Chen ⋅ Dingkang Yang ⋅ Lihua Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 345
Dance Across Shifts: Forward-Facilitation Continual Test-Time Adaptation through Dynamic Style Bridging
Zhilin Zhu ⋅ Yabin Wang ⋅ Zhiheng Ma ⋅ Yaguang Song ⋅ Yaowei Wang ⋅ Xiaopeng Hong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 346
Few-Shot Hybrid Incremental Learning: Continually Learning under Data Scarcity and Task Uncertainty
Yan Li ⋅ Yuzhu Shi ⋅ Kan Zhou ⋅ Shu Zhang ⋅ Diqi He ⋅ Dingwen Zhang ⋅ Junwei Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 347
High-Fidelity Mobile Avatars with Pruned Local Blendshapes
Youyi Zhan ⋅ He Wang ⋅ Tianjia Shao ⋅ Kun Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 348
PhysSkin: Real-Time and Generalizable Physics-Based Animation via Self-Supervised Neural Skinning
Yuanhang Lei ⋅ Tao Cheng ⋅ Xingxuan Li ⋅ Boming Zhao ⋅ Siyuan Huang ⋅ Ruizhen Hu ⋅ Peter Yichen Chen ⋅ Hujun Bao ⋅ Zhaopeng Cui
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 349
Bridging Privacy and Provenance: Traceable Virtual Identity Generation
Xianhan Zeng ⋅ Xiaoxiao Hu ⋅ Sheng Li ⋅ Zhenxing Qian ⋅ Xinpeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 350
PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment
Chaonan Ji ⋅ Jinwei Qi ⋅ Sheng Xu ⋅ Peng Zhang ⋅ Bang Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 351
Dynamic Label Noise Suppression with Optimal Teacher Pool for Facial Expression Recognition
Yuzhuang Yang ⋅ Xiaolin Tian ⋅ Qigong Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 352
MimicTalker: A Multimodal Interactive and Memory-Enhanced Framework for Real-Time Dyadic 3D Head Generation
Yinuo Wang ⋅ Yanbo Fan ⋅ Xuan Wang ⋅ Boyao Zhou ⋅ Yu Guo ⋅ Yujun Shen ⋅ Fei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 353
DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation
zihao xin ⋅ Wentong Li ⋅ Yixuan Jiang ⋅ Bin Wang ⋅ Runmin Cong ⋅ Jie Qin ⋅ Shengjun Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 354
HybridDriveVLA: Vision-Language-Action Model with Visual CoT reasoning and ToT Evaluation for Autonomous Driving
Yipene Cedric Francois Bassole ⋅ Sungwoo Kim ⋅ Jiwoo Jung ⋅ Yunsick Sung
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 355
NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
Fei Liu ⋅ Shichao Xie ⋅ Minghua Luo ⋅ Zedong Chu ⋅ Junjun Hu ⋅ Xiaolong Wu ⋅ Mu Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 356
LookasideVLN: Direction-Aware Aerial Vision-and-Language Navigation
Yuwei Ning ⋅ Ganlong Zhao ⋅ Yipeng Qin ⋅ Si Liu ⋅ Yang Liu ⋅ Liang Lin ⋅ Guanbin Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 357
MAPS: Preserving Vision-Language Representations via Module-Wise Proximity Scheduling for Better Vision-Language-Action Generalization
Chengyue Huang ⋅ Mellon M. Zhang ⋅ Robert Azarcon ⋅ Glen Chou ⋅ Zsolt Kira
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 358
D3D-VLP: Dynamic 3D Vision-Language-Planning Model for Embodied Grounding and Navigation
Zihan Wang ⋅ Seungjun Lee ⋅ Guangzhao Dai ⋅ Gim Hee Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 359
FreeForm: Reduced-Order Deformable Simulation from Particle-Based Skinning Eigenmodes
Donglai Xiang ⋅ Vismay Modi ⋅ Rishit Dagli ⋅ Ty Trusty ⋅ Gilles Daviet ⋅ Anka Chen ⋅ Nicholas Sharp ⋅ David I. W. Levin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 360
GeoDiff4D: Geometry-Aware Diffusion for 4D Head Avatar Reconstruction
Chao Xu ⋅ Xiaochen Zhao ⋅ xiang deng ⋅ Jingxiang Sun ⋅ Donglin Di ⋅ Zhuo Su ⋅ Yebin Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 361
4DEquine: Disentangling Motion and Appearance for 4D Equine Reconstruction from Monocular Video
Jin Lyu ⋅ Liang An ⋅ Pujin Cheng ⋅ Yebin Liu ⋅ Xiaoying Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 362
PhysHO: Physics-Based Dynamic 3D Gaussian Human and Object from Monocular Video
Suyi Jiang ⋅ Gim Hee Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 363
ProgressiveAvatars: Progressive Animatable 3D Gaussian Avatars
Kaiwen Song ⋅ Jinkai Cui ⋅ Juyong Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 364
ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Yuiga Wada ⋅ Kazuki Matsuda ⋅ Komei Sugiura ⋅ Graham Neubig
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 365
Mitigating Multimodal Hallucinations via Gradient-based Self-Reflection
Shan Wang ⋅ Maying Shen ⋅ Nadine Chang ⋅ Chuong Nguyen ⋅ Hongdong Li ⋅ Jose M. Alvarez
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 366
HalluGen: Synthesizing Realistic and Controllable Hallucinations for Evaluating Image Restoration
Seunghoi Kim ⋅ Henry F. J. Tregidgo ⋅ Chen Jin ⋅ Matteo Figini ⋅ Daniel C. Alexander
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 367
KVSmooth: Mitigating Hallucination in Multi-modal Large Language Models through Key-Value Smoothing
Siyu Jiang ⋅ Feiyang Chen ⋅ Xiaojin Zhang ⋅ Kun He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 368
ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Video Understanding
Hao Lu ⋅ Jiahao Wang ⋅ Yaolun Zhang ⋅ Ruohui Wang ⋅ Xuanyu Zheng ⋅ Yepeng Tang ⋅ Dahua Lin ⋅ Lewei Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 369
Tell Model Where to Look: Mitigating Hallucinations in MLLMs by Vision-Guided Attention
Jianfei Zhao ⋅ Feng Zhang ⋅ Xin Sun ⋅ Chong Feng ⋅ Zhixing Tan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 370
Circular-DPO: Aligning Multi-Stage 3D Generative Models via Preference Feedback Loop
Zejian Li ⋅ Jiarui Ma ⋅ Han Xu ⋅ Weiting Zheng ⋅ Yangrui Zhu ⋅ Chenye Meng ⋅ Pei Chen ⋅ Ling Yang ⋅ Zhiyuan Yang ⋅ Changyuan Yang ⋅ Guang Yang ⋅ Immanuel Koh ⋅ Lingyun Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 371
Cloning Deterministic Worlds: The Critical Role of Latent Geometry in Long-Horizon World Models
Zaishuo Xia ⋅ Yukuan Lu ⋅ Xinyi Li ⋅ Yifan Xu ⋅ Yubei Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 372
PrITTI: Primitive-based Generation of Controllable and Editable 3D Semantic Urban Scenes
Christina Ourania Tze ⋅ Daniel Dauner ⋅ Yiyi Liao ⋅ Dzmitry Tsishkou ⋅ Andreas Geiger
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 373
CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
Lingen Li ⋅ Guangzhi Wang ⋅ Xiaoyu Li ⋅ Zhaoyang Zhang ⋅ Qi Dou ⋅ Jinwei Gu ⋅ Tianfan Xue ⋅ Ying Shan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 374
ExPose: Reinforcing Video Generation Models for Extreme Pose Estimation
Youngho Yoon ⋅ Wonjune Cho ⋅ Hyunho Ha ⋅ Sujung Kim ⋅ Kuk-Jin Yoon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 375
Choreographing a World of Dynamic Objects
Yanzhe Lyu ⋅ Chen Geng ⋅ Karthik Dharmarajan ⋅ Yunzhi Zhang ⋅ Hadi Alzayer ⋅ Shangzhe Wu ⋅ Jiajun Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 376
SounDiT: Geo-Contextual Soundscape-to-Landscape Generation
Junbo Wang ⋅ Haofeng Tan ⋅ Bowen Liao ⋅ Albert Jiang ⋅ Teng Fei ⋅ Qixing Huang ⋅ Bing Zhou ⋅ Zhengzhong Tu ⋅ Shan Ye ⋅ Yuhao Kang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 377
Vista4D: Video Reshooting with 4D Point Clouds
Kuan Heng Lin ⋅ Zhizheng Liu ⋅ Pablo Salamanca ⋅ Yash Kant ⋅ Ryan Burgert ⋅ Yuancheng Xu ⋅ Koichi Namekata ⋅ Yiwei Zhao ⋅ Bolei Zhou ⋅ Micah Goldblum ⋅ Paul Debevec ⋅ Ning Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 378
CamDirector: Towards Long-Term Coherent Video Trajectory Editing
Kejia Yin ⋅ Zhihao Shi ⋅ Weilin Wan ⋅ Yuhongze Zhou ⋅ YUANHAO YU ⋅ Xinxin Zuo ⋅ Qiang Sun ⋅ Juwei Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 379
Elastic3D: Controllable Stereo Video Conversion with Guided Latent Decoding
Nando Metzger ⋅ Prune Truong ⋅ Goutam Bhat ⋅ Konrad Schindler ⋅ Federico Tombari
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 380
Decoupling Bias, Aligning Distributions: Synergistic Fairness Optimization for Deepfake Detection
Feng Ding ⋅ Wenhui Yi ⋅ Yunpeng Zhou ⋅ Xinan He ⋅ Hong Rao ⋅ Shu Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 381
Target-Aware Invertible Encoder with Reconstruction Guidance for Infrared Small Target Detection
Shule Yan ⋅ Zetian Zhang ⋅ Xiao Ma ⋅ Zexuan Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 382
BDNet: Bio-Inspired Dual-Backbone Small Object Detection Network
Wenchao Guan ⋅ Chuan Lin ⋅ Sihan Huang ⋅ Xiongzhen Wang ⋅ Xintao Pang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 383
ElasticFormer: Detecting Objects in HRW Shots via Elastic Computing Vision Transformer
Wenxi Li ⋅ Jingchen Huang ⋅ Chenyang Lyu ⋅ Moran Liu ⋅ Haozhe Lin ⋅ Guiguang Ding ⋅ Yuchen Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 384
RGB-Event based Pedestrian Attribute Recognition: A Benchmark Dataset and An Asymmetric RWKV Fusion Framework
Xiao Wang ⋅ Haiyang Wang ⋅ Shiao Wang ⋅ Qiang Chen ⋅ Jiandong Jin ⋅ Haoyu Song ⋅ Bo Jiang ⋅ Chenglong Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 385
FusionAgent: A Multimodal Agent with Dynamic Model Selection for Human Recognition
Jie Zhu ⋅ Xiao Guo ⋅ Yiyang Su ⋅ Anil Kumar Jain ⋅ Xiaoming Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 386
Free-Grained Hierarchical Visual Recognition
Seulki Park ⋅ Zilin Wang ⋅ Stella X. Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 387
URICA: A Uniformity Region Affine Identifier Capture Algorithm for Arbitrary Region Retrieval in Pathology Images
Ri Su ⋅ Zhao CHEN ⋅ Caleb Chen Cao ⋅ Lei Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 388
Online Data Curation for Object Detection via Marginal Contributions to Dataset-level Average Precision
Zitang Sun ⋅ Masakazu Yoshimura ⋅ Junji Otsuka ⋅ Atsushi Irie ⋅ Takeshi Ohashi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 389
DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video
Jiawei Hou ⋅ Shenghao Zhang ⋅ Can Wang ⋅ Zheng Gu ⋅ Yonggen Ling ⋅ Taiping Zeng ⋅ Xiangyang Xue ⋅ Jingbo Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 390
Follow the Saliency: Supervised Saliency for Retrieval-augmented Dense Video Captioning
Seung hee Choi ⋅ minju Jeon ⋅ Hyunwoo Oh ⋅ Jihwan Lee ⋅ Dong-Jin Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 391
Video-CoE: Reinforcing Video Event Prediction via Chain of Events
Qile Su ⋅ Jing Tang ⋅ Rui Chen ⋅ Lei Sun ⋅ Xiangxiang Chu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 392
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
Shuming Liu ⋅ Mingchen Zhuge ⋅ Changsheng Zhao ⋅ Jun Chen ⋅ Lemeng Wu ⋅ Zechun Liu ⋅ Chenchen Zhu ⋅ zhipeng cai ⋅ Chong Zhou ⋅ Haozhe Liu ⋅ Ernie Chang ⋅ Saksham Suri ⋅ Hongyu Xu ⋅ Qi Qian ⋅ Wei Wen ⋅ Balakrishnan Varadarajan ⋅ Zhuang Liu ⋅ Hu Xu ⋅ Florian Bordes ⋅ Raghuraman Krishnamoorthi ⋅ Bernard Ghanem ⋅ Vikas Chandra ⋅ Yunyang Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 393
VRR-QA: Visual Relational Reasoning in Videos Beyond Explicit Cues
Sirnam Swetha ⋅ Rohit Gupta ⋅ Parth Parag Kulkarni ⋅ David G. ⋅ Jeffrey A. Chan-Santiago ⋅ Nyle Siddiqui ⋅ Joseph Fioresi ⋅ Mubarak Shah
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 394
Question-guided Visual Compression with Memory Feedback for Long-Term Video Understanding
Sosuke Yamao ⋅ Natsuki Miyahara ⋅ Yuankai Qi ⋅ Shun Takeuchi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 395
CURVE: A Benchmark for Cultural and Multilingual Long Video Reasoning
Darshan Singh S ⋅ Arsha Nagrani ⋅ Kawshik Manikantan ⋅ Harman Singh ⋅ Dinesh Tewari ⋅ Tobias Weyand ⋅ Cordelia Schmid ⋅ Anelia Angelova ⋅ Shachi Dave
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 396
SVBench: Evaluation of Video Generation Models on Social Reasoning
Wenshuo Peng ⋅ Gongxuan Wang ⋅ Tianmeng Yang ⋅ Chuanhao Li ⋅ Xiaojie Xu ⋅ Hui He ⋅ Kaipeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 397
Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search
Xinlei Yin ⋅ Xiulian Peng ⋅ Xiao Li ⋅ Zhiwei Xiong ⋅ Yan Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 398
LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks
Hengjian Gao ⋅ Kaiwei Zhang ⋅ Shibo Wang ⋅ Mingjie Chen ⋅ Qihang Cao ⋅ Xianfeng Wang ⋅ Yucheng Zhu ⋅ Xiongkuo Min ⋅ Wei Sun ⋅ Dandan Zhu ⋅ Guangtao Zhai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 399
Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning
Haoji Zhang ⋅ Xin Gu ⋅ Jiawen Li ⋅ Chixiang Ma ⋅ Sule Bai ⋅ Chubin Zhang ⋅ bowen zhang ⋅ zhichao zhou ⋅ Dongliang He ⋅ Yansong Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 400
Attention Surgery: An Efficient Recipe to Linearize Your Video Diffusion Transformer
Mohsen Ghafoorian ⋅ Denis Korzhenkov ⋅ Amir Habibian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 401
YOSE: You Only Select Essential Tokens for Efficient DiT-based Video Object Removal
wu chenyang ⋅ Lina Lei ⋅ Fan Li ⋅ Chunle Guo ⋅ Dehong Kong ⋅ Xinran Qin ⋅ Zhixin Wang ⋅ Mingming Cheng ⋅ Chongyi Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 402
CADC: Content Adaptive Diffusion-Based Generative Image Compression
Xihua Sheng ⋅ lingyu ZHU ⋅ Tianyu Zhang ⋅ Dong Liu ⋅ Shiqi Wang ⋅ Jing Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 403
FG-Portrait: 3D Flow Guided Editable Portrait Animation
Yating Xu ⋅ Yunqi Miao ⋅ Evangelos Ververas ⋅ Jiankang Deng ⋅ Jifei Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 404
ResCa: Residual Caching for Diffusion Transformers Acceleration
Haipeng Fang ⋅ Yu Li ⋅ Fan Tang ⋅ Yixing Lu ⋅ Juan Cao ⋅ Sheng Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 405
IP-Adapter Is All You Need: Towards Fine-Tuning-Free Diffusion-Based Talking Face Generation
Hao Wu ⋅ Xiangyang Luo ⋅ Hao Wang ⋅ Jiawei Zhang ⋅ Yi Zhang ⋅ Jinwei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 406
SRA 2: Variational Autoencoder Self-Representation Alignment for Efficient Diffusion Training
Mengmeng Wang ⋅ Dengyang Jiang ⋅ Liuzhuozheng Li ⋅ Yucheng Lin ⋅ Guojiang Shen ⋅ Xiangjie Kong ⋅ Yong Liu ⋅ Guang Dai ⋅ Jingdong Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 407
InnoAds-Composer: Efficient Condition Composition for E-Commerce Poster Generation
Yuxin Qin ⋅ Ke Cao ⋅ Haowei Liu ⋅ Ao Ma ⋅ Fengheng Li ⋅ Honghe Zhu ⋅ Zheng Zhang ⋅ Run Ling ⋅ Wei Feng ⋅ Xuanhua He ⋅ Zhanjie Zhang ⋅ Zhen Guo ⋅ Haoyi Bian ⋅ Jingjing Lv ⋅ Junjie Shen ⋅ Ching Law
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 408
Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
Minh Quan Dao ⋅ Dimitris Metaxas
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 409
SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion Transformer
Tong Shao ⋅ Yusen Fu ⋅ Guoying Sun ⋅ Jingde Kong ⋅ Zhuotao Tian ⋅ Jingyong Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 410
DSERT-RoLL: Robust Multi-Modal Perception for Diverse Driving Conditions with Stereo Event-RGB-Thermal Cameras, 4D Radar, and Dual-LiDAR
Hoonhee Cho ⋅ Jae-Young Kang ⋅ Yuhwan Jeong ⋅ Yunseo Yang ⋅ Wonyoung Lee ⋅ Youngho Kim ⋅ Kuk-Jin Yoon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 411
A Semantically Disentangled Unified Model for Multi-category 3D Anomaly Detection
SuYeon Kim ⋅ Wongyu Lee ⋅ MyeongAh Cho
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 412
ReManNet: A Riemannian Manifold Network for Monocular 3D Lane Detection
Chengzhi Hong ⋅ Bijun Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 413
PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving
Yining Pan ⋅ Shijie Li ⋅ Yuchen Wu ⋅ Xulei Yang ⋅ Na Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 414
STUR3D: Spatio-Temporal Unified Representation Learning for 3D Object Detection
Huijie Fan ⋅ Pengrui huang ⋅ Qiang Wang ⋅ Baojie Fan ⋅ Jiahua Dong ⋅ Liangqiong Qu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 415
Exploring 6D Object Pose Estimation with Deformation
Zhiqiang Liu ⋅ Rui Song ⋅ Duanmu Chuangqi ⋅ Jiaojiao Li ⋅ David Ferstl ⋅ Yinlin Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 416
SearchAD: Large-Scale Rare Image Retrieval Dataset for Autonomous Driving
Felix Embacher ⋅ Jonas Uhrig ⋅ Marius Cordts ⋅ Markus Enzweiler
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 417
Improving Vision-language Models with Perception-centric Process Reward Models
Yingqian Min ⋅ Kun Zhou ⋅ Yifan Li ⋅ Yuhuan Wu ⋅ Han Peng ⋅ Yifan Du ⋅ Wayne Xin Zhao ⋅ Min Yang ⋅ Ji-Rong Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 418
X-PCR: A Benchmark for Cross-modality Progressive Clinical Reasoning in Ophthalmic Diagnosis
Gui Wang ⋅ Zehao Zhong ⋅ YongSong Zhou ⋅ Yudong Li ⋅ Ende Wu ⋅ Wooi Ping Cheah ⋅ Rong Qu ⋅ Jianfeng Ren ⋅ Linlin Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 419
Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction
Jiazhen Liu ⋅ Mingkuan Feng ⋅ Long Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 420
PhysInOne: Visual Physics Learning and Reasoning in One Suite
Siyuan Zhou ⋅ Hejun Wang ⋅ Hu Cheng ⋅ Jinxi Li ⋅ Dongsheng Wang ⋅ Junwei Jiang ⋅ Yixiao Jin ⋅ Jiayue Huang ⋅ Shiwei Mao ⋅ Shangjia Liu ⋅ Yafei Yang ⋅ Hongkang Song ⋅ Shenxing Wei ⋅ Zihui Zhang ⋅ DataTeam vLAR ⋅ Bing Wang ⋅ Zhihua Wang ⋅ Chuhang Zou ⋅ Bo Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 421
AviaSafe: A Physics-Informed Data-Driven Model for Aviation Safety–Critical Cloud Forecasts
ZIJIAN ZHU ⋅ Huang Qiusheng ⋅ Anboyu Guo ⋅ Xiaohui Zhong ⋅ Hao li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 422
TTRV: Test-Time Reinforcement Learning for Vision Language Models
Akshit Singh ⋅ Shyam Marjit ⋅ Wei Lin ⋅ Paul Gavrikov ⋅ Serena Yeung ⋅ Hilde Kuehne ⋅ Rogerio Feris ⋅ Sivan Doveh ⋅ James Glass ⋅ M. Jehanzeb Mirza
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 423
Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR
Yufeng Zhong ⋅ Lei Chen ⋅ Zhixiong Zeng ⋅ Xuanle Zhao ⋅ Deyang Jiang ⋅ Liming Zheng ⋅ Jing Huang ⋅ Haibo Qiu ⋅ Peng Shi ⋅ Siqi Yang ⋅ Lin Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 424
QUANTIPHY: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models
Puyin Li ⋅ Tiange Xiang ⋅ Ella Mao ⋅ Shirley Wei ⋅ Xinye Chen ⋅ Adnan Masood ⋅ Li Fei-Fei ⋅ Ehsan Adeli
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 425
VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs
Brigitta Malagurski Törtei ⋅ Yasser Dahou ⋅ Ngoc Dung Huynh ⋅ Wamiq Reyaz Para ⋅ Phúc H. Lê Khắc ⋅ Ankit Singh ⋅ Sofian Chaybouti ⋅ Sanath Narayan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 426
TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
JUNYUAN ZHANG ⋅ Bin Wang ⋅ Qintong Zhang ⋅ Fan Wu ⋅ Zichen Wen ⋅ Jialin Lu ⋅ Junjie Shan ⋅ Ziqi Zhao ⋅ Shuya Yang ⋅ Ziling Wang ⋅ Ziyang Miao ⋅ Huaping Zhong ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Ka-Ho Chow ⋅ Conghui He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 427
Urban-GS: A Unified 3D Gaussian Splatting Framework for Compact and High-Fidelity Aerial-to-Street Reconstruction
Meng Wang ⋅ Changqun Xia ⋅ Yuze Wang ⋅ Junyi Wang ⋅ Wantong Duan ⋅ Xinxiong Xie ⋅ Yue Qi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 428
Generalizable Sparse-View 3D Reconstruction from Unconstrained Images
Vinayak Gupta ⋅ Chih-Hao Lin ⋅ Shenlong Wang ⋅ Anand Bhattad ⋅ Jia-Bin Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 429
RemedyGS: Defend 3D Gaussian Splatting Against Computation Cost Attacks
Yanping LI ⋅ Zhening Liu ⋅ Zijian Li ⋅ Zehong Lin ⋅ Jun Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 430
SparseCam4D: Spatio-Temporally Consistent 4D Reconstruction from Sparse Cameras
Weihong Pan ⋅ XiaoYu Zhang ⋅ Zhuang Zhang ⋅ Zhichao Ye ⋅ Nan Wang ⋅ Haomin Liu ⋅ Guofeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 431
IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting
Wei Long ⋅ Haifeng Wu ⋅ SHIYIN JIANG ⋅ Jinhua Zhang ⋅ Xinchun Ji ⋅ Shuhang Gu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 432
GS^2: Graph-based Spatial Distribution Optimization for Compact 3D Gaussian Splatting
Xianben Yang ⋅ Tao Wang ⋅ Yuxuan Li ⋅ Yi Jin ⋅ Haibin Ling
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 433
OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting
Hongjia Zhai ⋅ Qi Zhang ⋅ Xiaokun Pan ⋅ Xiyu Zhang ⋅ Yitong Dong ⋅ Huaqi Zhang ⋅ Dan Xu ⋅ Guofeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 434
Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images
Xiangyu Sun ⋅ Haoyi Jiang ⋅ Liu Liu ⋅ Seungtae Nam ⋅ Gyeongjin Kang ⋅ Xinjie wang ⋅ Wei Sui ⋅ Zhizhong Su ⋅ Wenyu Liu ⋅ Xinggang Wang ⋅ Eunbyung Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 435
Learning Explicit Continuous Motion Representation for Dynamic Gaussian Splatting from Monocular Videos
Xuankai Zhang ⋅ Junjin Xiao ⋅ Shangwei Huang ⋅ Wei-Shi Zheng ⋅ Qing Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 436
MLLMSplat: A 2D MLLM-Powered Framework for 3D Gaussian Splatting Understanding, Generation, and Editing
Jingqiao Xiu ⋅ Can Wang ⋅ Dong Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 437
Dropping Anchor and Spherical Harmonics for Sparse-view Gaussian Splatting
Shuangkang Fang ⋅ I-Chao Shen ⋅ Xuanyang Zhang ⋅ Zesheng Wang ⋅ Yufeng Wang ⋅ Wenrui Ding ⋅ Gang Yu ⋅ Takeo Igarashi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 438
RAP: Fast Feedforward Rendering-Free Attribute-Guided Primitive Importance Score Prediction for Efficient 3D Gaussian Splatting Processing
Kaifa Yang ⋅ Qi Yang ⋅ Yiling Xu ⋅ Zhu Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 439
Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction
Yifan Mo ⋅ Youcheng Cai ⋅ Ligang Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 440
PointGS: Semantic-Consistent Unsupervised 3D Point Cloud Segmentation with 3D Gaussian Splatting
Yixiao Song ⋅ Qingyong Li ⋅ Wen Wang ⋅ Zhicheng Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 441
Scene Grounding in the Wild
Tamir Cohen ⋅ Leo Segre ⋅ Shay Shomer-Chai ⋅ Shai Avidan ⋅ Hadar Averbuch-Elor
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 442
Flow4DGS-SLAM: Optical Flow-Guided 4D Gaussian Splatting SLAM
Yunsong Wang ⋅ Gim Hee Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 443
Revisiting 3D Reconstruction Kernels as Low-Pass Filters
Shengjun Zhang ⋅ Min Chen ⋅ Yibo Wei ⋅ Mingyu Dong ⋅ Yueqi Duan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 444
SR3R: Rethinking Super-Resolution 3D Reconstruction With Feed-Forward Gaussian Splatting
Xiang Feng ⋅ Xiangbo Wang ⋅ Tieshi Zhong ⋅ Chengkai Wang ⋅ Yiting Zhao ⋅ Tianxiang Xu ⋅ Zhenzhong Kuang ⋅ Feiwei Qin ⋅ Xuefei Yin ⋅ Yanming Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 445
GP-4DGS: Probabilistic 4D Gaussian Splatting from Monocular Video via Variational Gaussian Processes
Mijeong Kim ⋅ Jungtaek Kim ⋅ Bohyung Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 446
VisRef: Visual Refocusing while Thinking Improves Test-Time Scaling in Multi-Modal Large Reasoning Models
Soumya Suvra Ghosal ⋅ Youngeun Kim ⋅ Zhuowei Li ⋅ Ritwick Chaudhry ⋅ Linghan Xu ⋅ Hongjing Zhang ⋅ Jakub Zablocki ⋅ Yifan Xing ⋅ Qin ZHANG
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 447
IPR-1: Interactive Physical Reasoner
Mingyu Zhang ⋅ lifeng zhuo ⋅ Tianxi Tan ⋅ Guocan Xie ⋅ Xian Nie ⋅ Yan Li ⋅ Renjie Zhao ⋅ Zizhu He ⋅ Ziyu Wang ⋅ Jiting Cai ⋅ Yonglu Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 448
VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension
Hyejin Park ⋅ Junhyuk Kwon ⋅ Suha Kwak ⋅ Jungseul Ok
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 449
Fuel Gauge: Estimating Chain-of-Thought Length Ahead of Time in Large Multimodal Models
Yuedong Yang ⋅ Xiwen Wei ⋅ Mustafa Munir ⋅ Radu Marculescu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 450
Thinking in Dynamics: How Multimodal Large Language Models Perceive, Track, and Reason Dynamics in Physical 4D World
Yuzhi Huang ⋅ Kairun Wen ⋅ Rongxin Gao ⋅ Dongxuan Liu ⋅ Yibin Lou ⋅ Jie Wu ⋅ Jing Xu ⋅ Jian Zhang ⋅ Zheng Yang ⋅ yunlong lin ⋅ Chenxin Li ⋅ Panwang Pan ⋅ Junbin Lu ⋅ Jingyan Jiang ⋅ Xinghao Ding ⋅ Yue Huang ⋅ Zhi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 451
Latent Implicit Visual Reasoning
Kelvin Li ⋅ Chuyi Shang ⋅ Leonid Karlinsky ⋅ Rogerio Feris ⋅ Trevor Darrell ⋅ Roei Herzig
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 452
Thinking with Programming Vision: Towards a Unified View for Thinking with Images
Zirun Guo ⋅ Minjie Hong ⋅ Feng Zhang ⋅ Kai Jia ⋅ Tao Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 453
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs
Lidong Lu ⋅ Guo Chen ⋅ Wei Zhu ⋅ Zhiqi Li ⋅ Yicheng Liu ⋅ Tong Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 454
All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models
Xinyu Tian ⋅ Shu Zou ⋅ Zhaoyuan Yang ⋅ Mengqi He ⋅ Peter Henry Tu ⋅ Jing Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 455
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
Shuoshuo Zhang ⋅ Yizhen Zhang ⋅ JINGJING FU ⋅ Lei Song ⋅ Jiang Bian ⋅ Yujiu Yang ⋅ Rui Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 456
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
Zeyuan Yang ⋅ Xueyang Yu ⋅ Delin Chen ⋅ Maohao Shen ⋅ Chuang Gan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 457
ReaGEN: Adaptive Generation of Structured Chains-of-Thought for Efficient Multimodal Reasoning
Ruiqing Tian ⋅ Mohan Sai Singamsetti ⋅ Di Niu ⋅ Bahador Rashidi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 458
Breaking the Regional Perception Bottleneck of Multimodal Large Language Models via External Reasoning Framework
Jinrong Zhang ⋅ Zhaoyang Xu ⋅ Xusheng He ⋅ Xinrui Li ⋅ Na Zheng ⋅ Jianlong Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 459
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
Tongkun Guan ⋅ Zhibo Yang ⋅ Jianqiang Wan ⋅ Mingkun Yang ⋅ Zhentao Guo ⋅ Zijian Hu ⋅ Ruilin Luo ⋅ Ruizhe Chen ⋅ Sontao Jiang ⋅ Peng Wang ⋅ Wei Shen ⋅ Junyang Lin ⋅ Xiaokang Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 460
TableMix: Enhancing Multimodal Table Reasoning in MLLMs from a Data-Centric Perspective
Chaohu Liu ⋅ Shida Wang ⋅ Yubo Wang ⋅ Linli Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 461
Harnessing Chain-of-Thought Reasoning in Multimodal Large Language Models for Face Anti-Spoofing
Honglu Zhang ⋅ Zhiqin Fang ⋅ Ningning Zhao ⋅ Saihui Hou ⋅ Long Ma ⋅ Renwang Pei ⋅ Zhaofeng He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 462
Grounded Chain-of-Thought for Multimodal Large Language Models
Qiong Wu ⋅ Xiangcong Yang ⋅ Yiyi Zhou ⋅ Chenxin Fang ⋅ Baiyang Song ⋅ Xiaoshuai Sun ⋅ Rongrong Ji
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 463
LS-ViT: Least-Squares Hessian Based Block Reconstruction for Low-Bit Post-Training Quantization of Vision Transformers
Hyunha Hwang ⋅ Xuan Truong Nguyen ⋅ Hyuk-Jae Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 464
SegMo: Co-Designing Content-Aware Sparsity and Locally-Cohesive Segment Parallelism for Efficient VLM Inference
Haojuan Li ⋅ Ruohan Tang ⋅ Dongzhou Cheng ⋅ Zongpu Zhang ⋅ Jian Li ⋅ Jiaqi Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 465
Rethinking Asymmetric Quantization: Hidden Symmetry in Vision Model Weights
Masafumi Mori ⋅ Shinya Gongyo ⋅ Mitsuru Ambai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 466
Compressed-Domain-Aware Online Video Super-Resolution
Yuhang Wang ⋅ Hai Li ⋅ Shujuan Hou ⋅ Zhetao Dong ⋅ Xiaoyao Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 467
CAR-SAM: Cross-Attention Reconstruction for Post-Training Quantization of the Segment Anything Model
Houji Wen ⋅ Jiangyong Yu ⋅ Dawei Yang ⋅ Jun Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 468
Is Bin Generation Indispensable? A Bin-Generation-Free Dataset Quantization via Semantic Perspective
Maijie Deng ⋅ Yuhua Li ⋅ Yixiong Zou ⋅ Yao Wu ⋅ Chenru Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 469
High Resolution Neural Video Coding with Bi-directional Confidence-Guided Reference Information Modeling
Feng Ye ⋅ Kai Zhang ⋅ Li zhang ⋅ Chuanmin Jia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 470
Distributed Image Compression with Multimodal Side Information at Extremely Low Bitrates
Guojun Xu ⋅ Mingyang Zhang ⋅ Jianwen Xiang ⋅ Cheng Tan ⋅ Yanchao Yang ⋅ Junwei Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 471
Task-Aware Image Signal Processor for Advanced Visual Perception
CHEN KAI ⋅ Jin Xiao ⋅ Leheng Zhang ⋅ Kexuan Shi ⋅ Shuhang Gu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 472
Enhancing Video Vision Language Model with Hippocampal Sensing
Xu Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 473
VIRD: View-Invariant Representation through Dual-Axis Transformation for Cross-View Pose Estimation
Juhye Park ⋅ Wooju Lee ⋅ Dasol Hong ⋅ Changki Sung ⋅ Youngwoo Seo ⋅ DongWan Kang ⋅ Hyun Myung
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 474
WRIVINDER: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery
Chandrakanth Gudavalli ⋅ Tajuddin Manhar Mohammed ⋅ Abhay Yadav ⋅ Ananth Vishnu Bhaskar ⋅ Hardik Prajapati ⋅ Cheng Peng ⋅ Rama Chellappa ⋅ Shivkumar Chandrasekaran ⋅ B.S. Manjunath
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 475
SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs
Koonting Yip ⋅ Qiyan Zhao ⋅ Wenhao Yu ⋅ Liangyu Yuan ⋅ Mingkai LI ⋅ Xiaofeng Zhang ⋅ Jianmin Ji ⋅ Yanyong Zhang ⋅ Qing Jiang ⋅ Ka-Veng Yuen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 476
RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization
Junwei Zheng ⋅ Ruize Dai ⋅ Ruiping Liu ⋅ Zichao Zeng ⋅ Yufan Chen ⋅ Fangjinhua Wang ⋅ Kunyu Peng ⋅ Kailun Yang ⋅ Jiaming Zhang ⋅ Rainer Stiefelhagen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 477
EfficientVPR: Toward Efficient Visual Place Recognition via Scene-Aware Prompt Tuning and Adaptive Feature Enhancement
Wenjing Tang ⋅ Chuanguang Yang ⋅ Zhulin An ⋅ Libo Huang ⋅ boyu diao ⋅ Yongjun Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 478
Universal Guideline-Driven Image Clustering via a Hybrid LLM Agent
Wenliang Zhong ⋅ Rob Barton ⋅ Lucas Goncalves ⋅ Kushal Kumar ⋅ Feng Jiang ⋅ Hehuan Ma ⋅ Yuzhi Guo ⋅ Vidit Bansal ⋅ Karim Bouyarmane ⋅ Junzhou Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 479
ReLaX: Reasoning with Latent Exploration for Large Reasoning Models
Shimin Zhang ⋅ Xianwei Chen ⋅ Yufan Shen ⋅ Ziyuan Ye ⋅ Jibin Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 480
VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning
Boyu Chen ⋅ Zikang Wang ⋅ Zhengrong Yue ⋅ Kainan Yan ⋅ Chenyun Yu ⋅ Yi Huang ⋅ Zijun Liu ⋅ Yafei Wen ⋅ Xiaoxin Chen ⋅ Yang Liu ⋅ Peng Li ⋅ Yali Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 481
Think, Then Verify: A Hypothesis–Verification Multi-Agent Framework for Long Video Understanding
Zheng Wang ⋅ Haoran Chen ⋅ Haoxuan Qin ⋅ Zhipeng Wei ⋅ Tianwen Qian ⋅ Cong Bai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 482
Reinforce to Learn, Elect to Reason: A Dual Paradigm for Video Reasoning
Songyuan Yang ⋅ Weijiang Yu ⋅ Jilin Ma ⋅ Ziyu Liu ⋅ Guijian Tang ⋅ Wenjing Yang ⋅ Huibin Tan ⋅ Nong Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 483
Graph-to-Frame RAG: Visual-Space Knowledge Fusion for Training-Free and Auditable Video Reasoning
Songyuan Yang ⋅ Weijiang Yu ⋅ Ziyu Liu ⋅ Guijian Tang ⋅ Wenjing Yang ⋅ Huibin Tan ⋅ Nong Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 484
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
Zuhao Yang ⋅ Sudong Wang ⋅ Kaichen Zhang ⋅ Keming Wu ⋅ Sicong Leng ⋅ Yifan Zhang ⋅ Bo Li ⋅ Chengwei Qin ⋅ Shijian Lu ⋅ Xingxuan Li ⋅ Lidong Bing
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 485
Multi-Modal Image Fusion via Intervention-Stable Feature Learning
Xue Wang ⋅ Zheng Guan ⋅ Wenhua Qian ⋅ Chengchao Wang ⋅ Runzhuo MA
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 486
ReCoFuse: Ultra-Robust Image Fusion via Restorative Multi-Modal Diffusion Reciprocal Coupling
HAO ZHANG ⋅ Shuhan Yang ⋅ Linfeng Tang ⋅ Xunpeng Yi ⋅ Jiayi Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 487
Degradation-Robust Fusion: An Efficient Degradation-Aware Diffusion Framework for Multimodal Image Fusion in Arbitrary Degradation Scenarios
Yu Shi ⋅ Yu Liu ⋅ Zhong-Cheng Wu ⋅ Juan Cheng ⋅ Huafeng Li ⋅ Xun Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 488
DF^2-VB: Dual-level Fuzzy Fusion with View-specific Boosting for Multi-view Multi-label Classification
Yuena Lin ⋅ Haichun Cai ⋅ Yi Shan ⋅ Hao Wei ⋅ Yongjian Deng ⋅ Zhen Yang ⋅ Gengyu Lyu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 489
UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation
Xingyuan Li ⋅ Songcheng Du ⋅ Yang Zou ⋅ HaoYuan Xu ⋅ Zhiying Jiang ⋅ Jinyuan Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 490
Self-guided Semantic Inspection for Zero-Shot Composed Image Retrieval
Jingjing Zhang ⋅ Lei Zhang ⋅ Zheren Fu ⋅ Bo Hu ⋅ Zhendong Mao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 491
G-MIXER: Geodesic Mixup-based Implicit Semantic Expansion and Explicit Semantic Re-ranking for Zero-Shot Composed Image Retrieval
jiyoung lim ⋅ Heejae Yang ⋅ Jee-Hyong Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 492
No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degrading Zero-shot Capabilities of Contrastive Models
Hai X. Pham ⋅ David T. ⋅ Ricardo Guerrero ⋅ Brais Martinez
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 493
MUSE: Harnessing Precise and Diverse Semantics for Few-Shot Whole Slide Image Classification
Jiahao Xu ⋅ Sheng Huang ⋅ Xin Zhang ⋅ Zhixiong Nan ⋅ Jiajun Dong ⋅ Nankun Mu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 494
Pointing at Parts: Training-Free Few-Shot Grounding in Multimodal LLMs
Shiang-Feng Tsai ⋅ Yuan-Hong Liao ⋅ Jin-Cheng Jhang ⋅ Nan Qiao ⋅ Min Sun
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 495
Graph Attention Prototypical Network for Robust Few-Shot Classification
Tingyun Liu ⋅ Licheng Liu ⋅ Qibin Zhang ⋅ Qiying Feng ⋅ C.L.Philip Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 496
Mitigating The Distribution Shift of Diffusion-based Dataset Distillation
Yue Xu ⋅ Chenyu Hu ⋅ Pengyu An ⋅ Yonglu Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 497
EVLF: Early Vision-Language Fusion for Generative Dataset Distillation
WENQI CAI ⋅ Yawen Zou ⋅ Guang Li ⋅ Chunzhi Gu ⋅ Chao Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 498
Fixed Anchors Are Not Enough: Dynamic Retrieval and Persistent Homology for Dataset Distillation
Muquan Li ⋅ Hang Gou ⋅ Yingyi Ma ⋅ Rongzheng Wang ⋅ Ke Qin ⋅ Tao He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 499
Flow Map Distillation Without Data
Shangyuan Tong ⋅ Nanye Ma ⋅ Saining Xie ⋅ Tommi Jaakkola
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 500
F^2HDR: Two-Stage HDR Video Reconstruction via Flow Adapter and Physical Motion Modeling
Huanjing Yue ⋅ Dawei Li ⋅ Shaoxiong Tu ⋅ Jingyu Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 501
Learning Latent Transmission and Glare Maps for Lens Veiling Glare Removal
Xiaolong Qian ⋅ Qi Jiang ⋅ Lei Sun ⋅ Zongxi Yu ⋅ Kailun Yang ⋅ Peixuan Wu ⋅ Jiacheng Zhou ⋅ Yao Gao ⋅ Yaoguang Ma ⋅ Ming-Hsuan Yang ⋅ Kaiwei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 502
Inter-Photon-Limited Videography
Andrew Xie ⋅ Dongyu Du ⋅ Sotiris Nousias ⋅ David B. Lindell ⋅ Kiriakos N. Kutulakos
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 503
A Bit is All You Need! Efficient Video Capture via Single Bit Imaging
Kanchana Vaishnavi Gandikota ⋅ Michael Moeller ⋅ Andreas Kolb ⋅ Bhaskar Choubey ⋅ Paramanand Chandramouli
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 504
From Events to Clarity: The Event-Guided Diffusion Framework for Dehazing
Ling Wang ⋅ Yunfan Lu ⋅ Wenzong Ma ⋅ Huizai Yao ⋅ Pengteng Li ⋅ Hui Xiong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 505
Electromagnetic Inverse Scattering from a Single Transmitter
Yizhe Cheng ⋅ Chunxun Tian ⋅ Haoru Wang ⋅ Wentao Zhu ⋅ Xiaoxuan Ma ⋅ Yizhou Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 506
Statistical Characteristic-Guided Denoising for Rapid High-Resolution Transmission Electron Microscopy Imaging
Hesong Li ⋅ Ziqi Wu ⋅ Ruiwen Shao ⋅ Ying Fu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 507
Physics-Guided Multistep Deformation Reversal for Ancient Bamboo Slip Restoration
Qianqian Tang ⋅ Jinchi Zhu ⋅ Xiaolu Zhou ⋅ Yongchao Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 508
cryoSENSE: Compressive Sensing Enables High-throughput Microscopy with Sparse and Generative Priors on the Protein Cryo-EM Image Manifold
Zain Shabeeb ⋅ Daniel Saeedi ⋅ Darin Tsui ⋅ Vida Jamali ⋅ Amirali Aghazadeh
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 509
SGDE: Self-supervised Geometry Degradation Estimation Framework for Coded Aperture Compressive Spectral Imaging
Yuqiao He ⋅ Xiaoyan LIU ⋅ Jianxu Mao ⋅ Yaonan Wang ⋅ Hui Zhang ⋅ Lizhu Liu ⋅ Yurong Chen ⋅ Wenbin He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 510
Factorized Context Aggregation for Robust Cancer Risk Estimation via Soft Re-Ranked Retrieval and Hierarchical Anchors
Puria Azadi Moghadam ⋅ Ali Khajegili Mirabadi ⋅ Behnam Maneshgar ⋅ Hossein Farahani ⋅ Ali Bashashati
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 511
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Zhuangcheng Gu ⋅ Guang Liang ⋅ Bin Wang ⋅ Zhiyuan Zhao ⋅ Qintong Zhang ⋅ Weijia Li ⋅ Chao Xu ⋅ Bo Zhang ⋅ Botian Shi ⋅ Jiang Wu ⋅ Wentao Zhang ⋅ Conghui He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 512
GeneVAR: Causal MeanFlow for Autoregressive Gene-to-WSI Tile Synthesis
Jianwei Zhao ⋅ Fan Yang ⋅ XIN LI ⋅ Qiang Zhai ⋅ Ao Luo ⋅ Ziqi Ren ⋅ Zhicheng Jiao ⋅ Hong Cheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 513
Depth Any Endoscopy: Towards Self-Supervised Generalizable Depth Estimation in Monocular Endoscopy
Shuwei Shao ⋅ Kejin Zhu ⋅ Shixing Ma ⋅ Xinzhe Du ⋅ Baochang Zhang ⋅ Zhe Min
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 514
RoSAMDepth: Robust Self-supervised Depth Estimation Leveraging Segment Anything Model
Xuanang Gao ⋅ Ning Zhiwei ⋅ Gengming Zhang ⋅ Jiaxi Cao ⋅ Runze Yang ⋅ Zhonglong Zheng ⋅ JIE YANG ⋅ Rong Xiao ⋅ Wei Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 515
AdaSFormer: Adaptive Serialized Transformers for Monocular Semantic Scene Completion from Indoor Environments
xuzhi wang ⋅ Xinran Wu ⋅ Song Wang ⋅ Lingdong Kong ⋅ Ziping Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 516
Dark3R: Learning Structure from Motion in the Dark
Andrew Y. Guo ⋅ Anagh Malik ⋅ SaiKiran Tedla ⋅ Yutong Dai ⋅ Yiqian Qin ⋅ Zach Salehe ⋅ Benjamin Attal ⋅ Sotiris Nousias ⋅ Kiriakos N. Kutulakos ⋅ David B. Lindell
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 517
What Makes Good Synthetic Training Data for Zero-Shot Stereo Matching?
David Yan ⋅ Alexander Raistrick ⋅ Jia Deng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 518
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Dual-Level Scale-Oriented Contrast
Beilei Cui ⋅ Yiming Huang ⋅ Long Bai ⋅ Hongliang Ren
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 519
Iris: Integrating Language into Diffusion-based Monocular Depth Estimation
Ziyao Zeng ⋅ Jingcheng Ni ⋅ Daniel Wang ⋅ Patrick Rim ⋅ Younjoon Chung ⋅ Fengyu Yang ⋅ Byung-Woo Hong ⋅ Alex Wong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 520
Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos
ZIREN GONG ⋅ Xiaohan Li ⋅ Fabio Tosi ⋅ Jiawei Han ⋅ Stefano Mattoccia ⋅ Jianfei Cai ⋅ Matteo Poggi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 521
M3DLayout: A Multi-Source Dataset of 3D Indoor Layouts and Structured Descriptions for 3D Generation
Yiheng Zhang ⋅ Zhuojiang Cai ⋅ Mingdao Wang ⋅ Meitong Guo ⋅ Tianxiao Li ⋅ Li Lin ⋅ Yuwang Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 522
UniPart: Part-Level 3D Generation with Unified 3D Geom–Seg Latents
Xufan He ⋅ Yushuang Wu ⋅ Xiaoyang Guo ⋅ Chongjie Ye ⋅ Jiaqing Zhou ⋅ Tianlei Hu ⋅ Xiaoguang Han ⋅ Dong Du
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 523
Photo3D: Advancing Photorealistic 3D Generation through Structure-Aligned Detail Enhancement
Xinyue Liang ⋅ Zhiyuan Ma ⋅ Lingchen Sun ⋅ Yanjun Guo ⋅ Lei Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 524
Mesh-Pro: Asynchronous Advantage-guided Ranking Preference Optimization for Artist-style Quadrilateral Mesh Generation
Zhen Zhou ⋅ Jian Liu ⋅ Biwen Lei ⋅ Jing Xu ⋅ Haohan Weng ⋅ Yiling Zhu ⋅ Zhuo Chen ⋅ Junfeng Fan ⋅ Yunkai Ma ⋅ Dazhao Du ⋅ Song Guo ⋅ Fengshui Jing ⋅ Chunchao Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 525
Order Matters: 3D Shape Generation from Sequential VR Sketches
Yizi Chen ⋅ Sidi Wu ⋅ Tianyi Xiao ⋅ Nina Wiedemann ⋅ Loic Landrieu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 526
Think-Then-Generate: Structural Chain-of-Thought Reasoning for Consistent 3D Generation
Xinyue Liu ⋅ Jin Liu ⋅ Hongbo Wang ⋅ Ran He ⋅ Huaibo Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 527
ArtLLM: Generating Articulated Assets via 3D LLM
Penghao Wang ⋅ Siyuan Xie ⋅ Jiawei Zhou ⋅ Xianghui Yang ⋅ Jingwei Huang ⋅ Chunchao Guo ⋅ Jiayuan Gu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 528
PoseMaster: A Unified 3D Native Framework for Stylized Pose Generation
Hongyu Yan ⋅ Kunming Luo ⋅ Weiyu Li ⋅ Kaiyi Zhang ⋅ Yixun Liang ⋅ Jingwei Huang ⋅ Chunchao Guo ⋅ Ping Tan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 529
2D-LFM: Lifting Foundation Model without 3D Supervision
Mosam Dabhi ⋅ Irhas Gill ⋅ László A. Jeni ⋅ Simon Lucey
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 530
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion
Remy Sabathier ⋅ David Novotny ⋅ Niloy J. Mitra ⋅ Tom Monnier
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 531
4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models
Yiting Lu ⋅ Wei Luo ⋅ Peiyan Tu ⋅ Haoran Li ⋅ Hanxin Zhu ⋅ Zihao Yu ⋅ Xingrui Wang ⋅ Xinyi Chen ⋅ Xinge Peng ⋅ Xin Li ⋅ Zhibo Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 532
FabricGen: Microstructure-Aware Woven Fabric Generation
Yingjie Tang ⋅ Di Luo ⋅ Zixiong Wang ⋅ Xiaoli Ling ⋅ Jian Yang ⋅ Beibei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 533
Leveraging Verifier-Based Reinforcement Learning in Image Editing
Hanzhong Guo ⋅ Jie Wu ⋅ Jie Liu ⋅ Yu Gao ⋅ Zilyu Ye ⋅ Linxiao Yuan ⋅ Xionghui Wang ⋅ Yizhou Yu ⋅ Weilin Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 534
PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling
Bowen Ping ⋅ Chengyou Jia ⋅ Minnan Luo ⋅ Changliang Xia ⋅ Xin Shen ⋅ Zhuohang Dang ⋅ Hangwei Qian
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 535
VIVA: VLM-Guided Instruction-Based Video Editing with Reward Optimization
Xiaoyan Cong ⋅ Haotian Yang ⋅ Angtian Wang ⋅ Yizhi Wang ⋅ Yiding Yang ⋅ Canyu Zhang ⋅ Chongyang Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 536
MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models
Chieh-Yun Chen ⋅ Zhonghao Wang ⋅ Qi Chen ⋅ Zhifan Ye ⋅ Min Shi ⋅ Yue Zhao ⋅ Yinan Zhao ⋅ Hui Qu ⋅ Wei-An Lin ⋅ Yiru Shen ⋅ Ajinkya Kale ⋅ Irfan Essa ⋅ Humphrey Shi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 537
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
Yunhong Lu ⋅ Yanhong Zeng ⋅ Haobo Li ⋅ Hao Ouyang ⋅ Qiuyu Wang ⋅ Ka Leong Cheng ⋅ Jiapeng Zhu ⋅ Hengyuan Cao ⋅ Zhipeng Zhang ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Min Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 538
C^2FG: Control Classifier-Free Guidance via Score Discrepancy Analysis
Jiayang Gao ⋅ Tianyi Zheng ⋅ Jiayang Zou ⋅ Fengxiang Yang ⋅ Shice Liu ⋅ Luyao Fan ⋅ Zheyu Zhang ⋅ Hao Zhang ⋅ Jinwei Chen ⋅ Peng-Tao Jiang ⋅ Bo Li ⋅ Jia Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 539
Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generation
Ruiying Liu ⋅ Yuanzhi Liang ⋅ Haibin Huang ⋅ Tianshu Yu ⋅ Chi Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 540
Unified Customized Generation by Disentangled Reward Modeling
Shaojin Wu ⋅ Mengqi Huang ⋅ Yufeng Cheng ⋅ wenxu wu ⋅ Jiahe Tian ⋅ Yiming Luo ⋅ Fei Ding ⋅ Qian HE
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 541
Region-Aware Instance Consistency Learning for Micro-Expression Recognition
Yaomin Cai ⋅ C.L.Philip Chen ⋅ Shiting Xu ⋅ Haiqi Liu ⋅ Tong Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 542
MPL: Match-guided Prototype Learning for Few-shot Action Recognition
Feng Yang ⋅ Jie Zhao ⋅ Fulin Luo ⋅ Anyong Qin ⋅ Tiecheng Song ⋅ Yue Zhao ⋅ CHENQIANG GAO ⋅ Junwei Han
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 543
LaDy: Lagrangian-Dynamic Informed Network for Skeleton-based Action Segmentation via Spatial-Temporal Modulation
Haoyu Ji ⋅ Xueting Liu ⋅ Yu Gao ⋅ Wenze Huang ⋅ Zhihao Yang ⋅ Weihong Ren ⋅ Zhiyong Wang ⋅ Honghai LIU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 544
LA-Pose: Latent Action Pretraining Meets Pose Estimation
Zhengqing Wang ⋅ Saurabh Nair ⋅ Prajwal Chidananda ⋅ Pujith Kachana ⋅ Samuel Li ⋅ Matthew Brown ⋅ Yasutaka Furukawa
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 545
RAAS: LLM Agentic System Architecture Search with GRPO
Jiayi Yang ⋅ Guancheng Wan ⋅ Man Zhang ⋅ Mang Ye
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 546
Temporal Representation Enhancement (TRE): Learning to Forget Dominant Patterns for Enhanced Temporal Spiking Features
Wei Liu ⋅ Li Yang ⋅ Yufei Wang ⋅ Han Xiao ⋅ Boyu Cai ⋅ Weiming Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 547
Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models
Jiawei Fan ⋅ Shigeng Wang ⋅ Chao Li ⋅ Xiaolong Liu ⋅ Anbang Yao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 548
Unlocking Pre-trained Weights: Parameter Inheritance for Zero-Shot Initialization
Jiaze Xu ⋅ Shiyu Xia ⋅ Jiaqi Lv ⋅ Xin Geng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 549
Deconstructing the Failure of Ideal Noise Correction: A Three-Pillar Diagnosis
Chen Feng ⋅ Zhuo ZHI ⋅ Zhao Huang ⋅ Jiawei Ge ⋅ Ling Xiao ⋅ Nicu Sebe ⋅ Georgios Tzimiropoulos ⋅ Ioannis Patras
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 550
Progressive Neural Architecture Generation
Caiyang Yu ⋅ Chen Huang ⋅ Yun Liu ⋅ Chenwei Tang ⋅ Wei Ju ⋅ Jiancheng Lv
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 551
A Unified Framework for Knowledge Transfer in Bidirectional Model Scaling
Jianlu Shen ⋅ Fu Feng ⋅ Jiaze Xu ⋅ Yucheng Xie ⋅ Jiaqi Lv ⋅ Xin Geng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 552
When Do Models Actually Decide? Mapping the Layer-Wise Decision Timeline in Pretrained Neural Networks
Minhyeok Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 553
Temporal Interaction in Spiking Transformers with Multi-Delay Mixer
Kexin Shi ⋅ Hanwen Liu ⋅ Zeyang Song ⋅ Yang Liu ⋅ Jieyuan Zhang ⋅ Shuai Wang ⋅ Jibin Wu ⋅ Malu Zhang ⋅ Yang Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 554
Consensus vs. Controversy: Mapping the Decision Space Where Architectures Diverge
Minhyeok Lee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 555
Sparsely Timing the Change: A Spiking Temporal Framework for Remote Sensing Interpretation
Shilong Li ⋅ Xiurui Xie ⋅ Qiugang Zhan ⋅ Luochao Wang ⋅ Yong Deng ⋅ Guisong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 556
ProSoftArena: Benchmarking Hierarchical Capabilities of Multi-modal Agents in Professional Software Environments
Jiaxin Ai ⋅ Yukang Feng ⋅ Fanrui Zhang ⋅ Jianwen Sun ⋅ Zizhen Li ⋅ Chuanhao Li ⋅ Yifan Chang ⋅ Wenxiao Wu ⋅ Ruoxi Wang ⋅ Mingliang Zhai ⋅ Kaipeng Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 557
BAMI: Training-Free Bias Mitigation in GUI Grounding
Borui Zhang ⋅ Bo Zhang ⋅ Bo Wang ⋅ Wenzhao Zheng ⋅ Yuhao Cheng ⋅ Liang Tang ⋅ Yiqiang Yan ⋅ Jie Zhou ⋅ Jiwen Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 558
DRS-GUI: Dynamic Region Search for Training-Free GUI Grounding
Yichao Liu ⋅ Huawen Shen ⋅ Liu Yu ⋅ Shiyu Liu ⋅ Zeyu Chen ⋅ Yu ZHOU
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 559
Consistency Beyond Contrast: Enhancing Open-Vocabulary Object Detection Robustness via Contextual Consistency Learning
bozhao Li ⋅ Shaocong Wu ⋅ Tong Shao ⋅ Senqiao Yang ⋅ Qiben Shan ⋅ Zhuotao Tian ⋅ Jingyong Su
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 560
Thermal-Det: Language-Guided Cross-Modal Distillation for Open-Vocabulary Thermal Object Detection
Yasiru Ranasinghe ⋅ Elim Schenck ⋅ Florence Yellin ⋅ Shuowen Hu ⋅ Christopher Funk ⋅ Vishal M. Patel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 561
Geometry-driven OOD Detectors Are Class-Incremental Learners
Wangwang Jia ⋅ Zijian Gao ⋅ Tianjiao Wan ⋅ Yuan Cao ⋅ Yong Dou ⋅ Kele Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 562
Mind the Way You Select Negative Texts: Pursuing the Distance Consistency in OOD Detection with VLMs
Zhikang Xu ⋅ Qianqian Xu ⋅ Zitai Wang ⋅ Cong Hua ⋅ Sicong Li ⋅ Zhiyong Yang ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 563
Prompt-Free Unknown Label Generation for Open World Detection in Remote Sensing
Abdullah Azeem ⋅ Ruisheng Wang ⋅ Qingquan Li ⋅ Abubakar Siddique
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 564
Learning to Diversify and Focus: A Reinforcement Framework for Open-Vocabulary HOI Detection
Yongchao Xu ⋅ Jiawei Liu ⋅ Junfeng Wang ⋅ Sen Tao ⋅ Na Jiang ⋅ Zheng-Jun Zha
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 565
RINO: Rotation-Invariant Non-Rigid Correspondences
Maolin Gao ⋅ Shao Jie Hu-Chen ⋅ Congyue Deng ⋅ Riccardo Marin ⋅ Leonidas Guibas ⋅ Daniel Cremers
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 566
Hyperbolic Prototype Learning with Uncertainty-Aware Consistency for Continual Test-Time Segmentation
Siddhant Gole ⋅ Akash Pal ⋅ Amit Popat More ⋅ S Divakar Bhat ⋅ Subhasis Chaudhuri ⋅ Biplab Banerjee
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 567
DINO Eats CLIP: Adapting Beyond Knowns for Open-set 3D Object Retrieval
Xinwei He ⋅ Yansong Zheng ⋅ Qianru Han ⋅ Zhichuan Wang ⋅ Yuxuan Cai ⋅ Yang Zhou ⋅ Jingbo Xia ⋅ Yulong Wang ⋅ Jinhai Xiang ⋅ Xiang Bai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 568
Leveraging Class Distributions in CLIP for Weakly Supervised Semantic Segmentation
Ziqian Yang ⋅ Xinqiao Zhao ⋅ Xiaolei Wang ⋅ Quan Zhang ⋅ Jimin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 569
CompetitorFormer: Mitigating Query Conflicts for 3D Instance Segmentation via Competitive Strategy
wang duanchu ⋅ Junjie Yang ⋅ Haoran Gong ⋅ Jing Liu ⋅ Di Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 570
D2Dewarp: Dual Dimensions Geometric Representation Learning Based Document Image Dewarping
Heng Li ⋅ Xiangping Wu ⋅ Qingcai Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 571
Discover, Segment, and Select: A Progressive Mechanism for Zero-shot Camouflaged Object Segmentation
Yilong Yang ⋅ Jianxin Tian ⋅ Shengchuan Zhang ⋅ Liujuan Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 572
D-Convexity: A Unified Differentiable Convex Shape Prior via Quasi-Concavity for Data-driven Image Segmentation
Shengzhe Chen ⋅ Hao Yan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 573
Fast Reasoning Segmentation for Images and Videos
Yiqing Shen ⋅ Mathias Unberath
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 574
Structure-Aware Representation Distillation for Tiny-Dense Object Segmentation
Xuesong Liu ⋅ Anke Xu ⋅ Wenbo Cao ⋅ Emmett Ientilucci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 575
CRFT: Consistent–Recurrent Feature Flow Transformer for Cross-Modal Image Registration
Xuecong Liu ⋅ Mengzhu Ding ⋅ Zixuan Sun ⋅ Zhang Li ⋅ Xichao Teng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 576
FireScope: Wildfire Risk Raster Prediction With a Chain-of-Thought Oracle
Mario Markov ⋅ Stefan Ailuro ⋅ Luc Van Gool ⋅ Konrad Schindler ⋅ Danda Paudel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 577
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation
Henry Herzog ⋅ Favyen Bastani ⋅ Yawen Zhang ⋅ Gabriel Tseng ⋅ Joseph Redmon ⋅ Hadrien Sablon ⋅ Ryan Park ⋅ Jacob Morrison ⋅ Alexandra Buraczynski ⋅ Karen Farley ⋅ Josh Hansen ⋅ Andrew Howe ⋅ Patrick Alan Johnson ⋅ Mark Otterlee ⋅ Ted Schmitt ⋅ Hunter Pitelka ⋅ Stephen Daspit ⋅ Rachel Ratner ⋅ Christopher Wilhelm ⋅ Sebastian Wood ⋅ Mike Jacobi ⋅ Hannah Kerner ⋅ Evan Shelhamer ⋅ Ali Farhadi ⋅ Ranjay Krishna ⋅ Patrick Beukema
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 578
TESSERA: Temporal Embeddings of Surface Spectra for Earth Representation and Analysis
Zhengpeng Feng ⋅ Clement Atzberger ⋅ Sadiq Jaffer ⋅ Jovana Knezevic ⋅ Silja Sormunen ⋅ Robin Young ⋅ Madeline C. Lisaius ⋅ Markus Immitzer ⋅ Toby Jackson ⋅ James Ball ⋅ David A. Coomes ⋅ Anil Madhavapeddy ⋅ Andrew Blake ⋅ Srinivasan Keshav
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 579
Regulating Rather than Constraining: Adaptive Guidance for Complex Spectral Reconstruction in Pansharpening
Zhuwei Wen ⋅ Zimin Xia ⋅ He Chen ⋅ Linwei Yue ⋅ Xianwei Zheng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 580
GeoMMBench and GeoMMAgent: Toward Expert-Level Multimodal Intelligence in Geoscience and Remote Sensing
Aoran Xiao ⋅ Shihao Cheng ⋅ Yonghao Xu ⋅ Yexian Ren ⋅ Hongruixuan Chen ⋅ Naoto Yokoya
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 581
Revisiting the Necessity of Full Accuracy: Weakly Supervised Object-Level Offset Correction for Misaligned Building Labels
Junda Xu ⋅ Yanmeng Liu ⋅ Xiangqiang Zeng ⋅ Jinrong Wu ⋅ Ying Qu ⋅ Libao Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 582
UniGeoSeg: Towards Unified Open-World Segmentation for Geospatial Scenes
Shuo Ni ⋅ Di Wang ⋅ He Chen ⋅ Haonan Guo ⋅ Ning Zhang ⋅ Jing Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 583
ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks
Ruixun Liu ⋅ Bowen Fu ⋅ Jiayi Song ⋅ Kaiyu Li ⋅ Wanchen Li ⋅ Lanxuan Xue ⋅ Hui Qiao ⋅ Weizhan Zhang ⋅ Deyu Meng ⋅ Xiangyong Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 584
Unleashing Stealthy Backdoor Pandemic by Infecting a Single Diffusion Model
Mohaiminul Al Nahian ⋅ Abeer Matar Almalky ⋅ Sabbir Ahmed ⋅ Abdullah Al Arafat ⋅ Mamshad Nayeem Rizve ⋅ Adnan Rakin Rakin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 585
Taming the Long Tail: Rebalancing Adversarial Training via Adaptive Perturbation
Lilin Zhang ⋅ Yimo Guo ⋅ Yue Li ⋅ Jiancheng Shi ⋅ Xianggen Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 586
Robustness Under Data Scarcity: Few-Shot Continual Adversarial Training for Evolving Threats
Wenxuan Wang ⋅ Chenglei Wang ⋅ Chengzhi Yan ⋅ Xuelin Qian ⋅ Yanning Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 587
Logit-Margin Repulsion for Backdoor Defense
Zhiguo Yang ⋅ Dongsheng Xu ⋅ Ruizhi Zhong ⋅ Jiacheng Pi ⋅ Xingxing Huang ⋅ Wenjie Ruan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 588
Thermally Activated Dual-Modal Adversarial Clothing against AI Surveillance Systems
Jiahuan Long ⋅ Tingsong Jiang ⋅ Hanqing Liu ⋅ Chao Ma ⋅ Weien Zhou ⋅ Yang Yang ⋅ Wen Yao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 589
Immunizing Models Against Harmful Long-Horizon Fine-Tuning via Contractive Optimization Dynamics
Najibul Haque Sarker ⋅ Zaber Ibn Abdul Hakim ⋅ Ali Asgarov ⋅ Chia-Wei Tang ⋅ Alvi Md Ishmam ⋅ Chris Thomas
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 590
Towards Stealthy and Effective Backdoor Attacks on Lane Detection: A Naturalistic Data Poisoning Approach
YIFAN LIAO ⋅ Yuxin Cao ⋅ Yedi Zhang ⋅ Wentao He ⋅ Yan XIAO ⋅ Xianglong Du ⋅ Zhiyong Huang ⋅ Jin Song Dong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 591
Red-teaming Retrieval-Augmented Diffusion Models via Poisoning Knowledge Bases
Xinqi Lyu ⋅ Liu of second author ⋅ Dong Wang ⋅ Bin Xiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 592
Latent Diffusion Inversion Requires Understanding the Latent Space
Mingxing Rao ⋅ Bowen Qu ⋅ Daniel Moyer
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 593
Fractal Camouflage: A Bio-Inspired Approach for Multi-Scale Adversarial Attacks in the Infrared Domain
Chengyin Hu ⋅ Xin wang ⋅ Rui Qiu ⋅ Zhe Jia ⋅ Yingying Zhao ⋅ Kai Wang ⋅ Xu Kang ⋅ Yiwei Wei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 594
EgoRoC: Towards Egocentric Robotic Control via Task-Agnostic Visual Alignment
Wei Feng ⋅ Chi Zhang ⋅ Nan Li ⋅ Qian Zhang ⋅ Qi Zhang ⋅ Mingyan Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 595
Describe Anything Anywhere At Any Moment
Nicolas Gorlo ⋅ Lukas Schmid ⋅ Luca Carlone
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 596
StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation
Mingyu Liu ⋅ Jiuhe Shu ⋅ Hui Chen ⋅ Zeju Li ⋅ Canyu Zhao ⋅ Jiange Yang ⋅ Shenyuan Gao ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 597
VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling
weiqi li ⋅ Quande Zhang ⋅ ruifeng zhai ⋅ Liang Lin ⋅ Guangrun Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 598
Action–Geometry Prediction with 3D Geometric Prior for Bimanual Manipulation
Chongyang Xu ⋅ Li Haipeng ⋅ Shen Cheng ⋅ Haoqiang Fan ⋅ Ziliang Feng ⋅ Shuaicheng Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 599
Joint-Aligned Latent Action: Towards Scalable VLA Pretraining in the Wild
Hao Luo ⋅ Ye Wang ⋅ Wanpeng Zhang ⋅ Haoqi Yuan ⋅ Yicheng Feng ⋅ Haiweng Xu ⋅ Sipeng Zheng ⋅ Zongqing Lu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 600
Rethinking Camera Choice: An Empirical Study on Fisheye Camera Properties in Robotic Manipulation
Han Xue ⋅ Nan Min ⋅ Xiaotong Liu ⋅ Wendi Chen ⋅ Fang Yuan ⋅ Jun Lv ⋅ Cewu Lu ⋅ Chuan Wen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 601
INSIGHT Bench: Towards Grounded IN-SItu Guidance for Robotic ManipulaTion
Seonho Kim ⋅ Junhyeong Hong ⋅ Kyungjae Lee ⋅ Yoonseon Oh
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 602
MM-ACT: Learn from Multimodal Parallel Generation to Act
Haotian Liang ⋅ Xinyi Chen ⋅ Bin Wang ⋅ MingKang Chen ⋅ Yitian Liu ⋅ Yuhao Zhang ⋅ Zanxin Chen ⋅ Tianshuo Yang ⋅ Yilun Chen ⋅ Jiangmiao Pang ⋅ Dong Liu ⋅ Xiaokang Yang ⋅ Yao Mu ⋅ Wenqi Shao ⋅ Ping Luo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 603
HQC-NBV: A Hybrid Quantum-Classical View Planning Approach
Xiaotong Yu ⋅ Chang Wen Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 604
Motus: A Unified Latent Action World Model
Hongzhe Bi ⋅ Hengkai Tan ⋅ Shenghao Xie ⋅ Zeyuan Wang ⋅ Shuhe Huang ⋅ Haitian Liu ⋅ Ruowen Zhao ⋅ Yao Feng ⋅ Chendong Xiang ⋅ Yinze Rong ⋅ Hongyan Zhao ⋅ Hanyu Liu ⋅ Zhizhong Su ⋅ Lei Ma ⋅ Hang Su ⋅ Jun Zhu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 605
SE(3)-Equivariance with Geometric and Topological Guidance for Category-Level Object Pose Estimation
Sheng Yu ⋅ Di-Hua Zhai ⋅ Yuanqing Xia
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 606
SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding
Nikolay Nikolov ⋅ Giuliano Albanese ⋅ Sombit Dey ⋅ Aleksandar Yanev ⋅ Luc Van Gool ⋅ Jan-Nico Zaech ⋅ Danda Paudel
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 607
Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation
Zaijing Li ⋅ Bing Hu ⋅ Rui Shao ⋅ Gongwei Chen ⋅ Dongmei Jiang ⋅ Pengwei Xie ⋅ Jianye Hao ⋅ Liqiang Nie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 608
RoboTAG: End-to-end Robot Pose Estimation via Topological Alignment Graph
Yifan Liu ⋅ Fangneng Zhan ⋅ Wanhua Li ⋅ Haowen Sun ⋅ Katerina Fragkiadaki ⋅ Hanspeter Pfister
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 609
MVLM: Template-Free Tracking via Vision–Language Margin Confidence and Memory-Gated Tracking
Dae-Hyeon Park ⋅ Mina Baek ⋅ Jeong-Hun Ha ⋅ Chan-Seop Park ⋅ Jamshidjon Ganiev ⋅ Seung-Hwan Bae
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 610
Interactive Tracking: A Human-in-the-Loop Paradigm with Memory-Augmented Adaptation
Yuqing Huang ⋅ Guotian Zeng ⋅ Zhenqiao Yuan ⋅ Zhenyu He ⋅ Xin Li ⋅ Yaowei Wang ⋅ Ming-Hsuan Yang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 611
VidEoMT: Your ViT is Secretly Also a Video Segmentation Model
Narges Norouzi ⋅ Idil Esen Zulfikar ⋅ Niccolò Cavagnero ⋅ Tommie Kerssies ⋅ Bastian Leibe ⋅ Gijs Dubbelman ⋅ Daan de Geus
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 612
Matching Every Pair to Track Every Point: PairFormer for All-Pairs Tracking and Video Trajectory Fields
Guangyang Wu ⋅ Youran Ding ⋅ Xinyu Che ⋅ BENYUAN SUN ⋅ Yi Yang ⋅ Xiaohong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 613
Boosting Self-Supervised Tracking with Contextual Prompts and Noise Learning
Yaozong Zheng ⋅ Qihua Liang ⋅ Bineng Zhong ⋅ Shuimu Zeng ⋅ Yuanliang Xue ⋅ Ning Li ⋅ Shuxiang Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 614
Progressive Multi-cue Alignment for Unaligned RGBT Tracking
Jiandong Jin ⋅ Chenglong Li ⋅ Hao Feng ⋅ Andong Lu ⋅ Lili Huang ⋅ Jin Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 615
Real-Time Neural Video Compression with Unified Intra and Inter Coding
Hui Xiang ⋅ Yifan Bian ⋅ Li Li ⋅ Jingran Wu ⋅ Xianguo Zhang ⋅ Dong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 616
Adapting Lightweight Image-based Counting Models for Video Crowd Counting
Weibo Shu ⋅ Antoni B. Chan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 617
Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis
Pei Liu ⋅ xiangxiang Zeng ⋅ Tengfei Ma ⋅ Yucheng Xing ⋅ Xuanbai Ren ⋅ Yiping Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 618
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis
Yuting Zhang ⋅ Kaishen Yuan ⋅ Hao Lu ⋅ Yutao Yue ⋅ Jintai Chen ⋅ Kaishun Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 619
MedKCO: Medical Vision-Language Pretraining via Knowledge-Driven Cognitive Orchestration
Chenran Zhang ⋅ Ruiqi Wu ⋅ Tao Zhou ⋅ Yi Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 620
Toward Generalizable Whole Brain Representations with High-Resolution Light-Sheet Data
Minyoung E. Kim ⋅ Dae Hee Yun ⋅ Aditi V. Patel ⋅ Madeline Hon ⋅ Webster Guan ⋅ Taegeon Lee ⋅ Brian Nguyen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 621
CryoHype: Reconstructing a thousand cryo-EM structures with transformer-based hypernetworks
Jeffrey Gu ⋅ Minkyu Jeon ⋅ Ambri Ma ⋅ Serena Yeung ⋅ Ellen D. Zhong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 622
GenTract: Generative Global Tractography
Alec Sargood ⋅ Lemuel Puglisi ⋅ Elinor Thompson ⋅ Mirco Musolesi ⋅ Daniel C. Alexander
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 623
LUMINA: A Multi-Vendor Mammography Benchmark with Energy Harmonization Protocol
Hongyi Pan ⋅ Gorkem Durak ⋅ Halil Ertugrul Aktas ⋅ Andrea M. Bejar ⋅ Baver Tutun ⋅ Emre Uysal ⋅ Ezgi Bülbül ⋅ Mehmet Faith Dogan ⋅ Berrin Erok ⋅ Berna Yildirim ⋅ Sukru Mehmet Erturk ⋅ Ulas Bagci
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 624
Virtual Immunohistochemistry Staining with Dual-Aligned Multi-Task Feature Guidance
Shigeng Xie ⋅ Hongming Xu ⋅ Guiyang Jiang ⋅ Tuomo Rossi ⋅ Tommi Kärkkäinen ⋅ Fengyu Cong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 625
Can Natural Image Autoencoders Compactly Tokenize fMRI Volumes for Long-Range Dynamics Modeling?
Peter Yongho Kim ⋅ Juhyeon Park ⋅ Jungwoo Park ⋅ Jubin Choi ⋅ Jungwoo Seo ⋅ Jiook Cha ⋅ Taesup Moon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 626
IEBGL: An Interpretability-Enhanced Brain Graph Learning Framework with LLM-Instructed Topology and Literature-Augmented Semantics
Yihang Duan ⋅ Shuo Huang ⋅ Lizhang Lizhang ⋅ Meiling Wang ⋅ Li Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 627
F^2-Assist: Multi-Phase Fetal Growth Forecast and Report Generation from Ultrasound Examination
Bin Pu ⋅ XUSHENG LIANG ⋅ Xinpeng Ding ⋅ Jinlin Wu ⋅ Zhen Lei ⋅ Shengli Li ⋅ Kenli Li ⋅ Jiawei Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 628
Sparse Spectral LoRA: Routed Experts for Medical VLMs
Omid Nejatimanzari ⋅ Hojat Asgariandehkordi ⋅ Taha Koleilat ⋅ Yiming Xiao ⋅ Hassan Rivaz
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 629
SAT-RRG: LLM-Guided Self-Adaptive Training for Radiology Report Generation with Token-Level Push–Pull Optimization
YUNYI LIU ⋅ Yingshu Li ⋅ Tong Chen ⋅ Lingqiao Liu ⋅ Lei Wang ⋅ Luping Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 630
OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis
Yuxuan Fan ⋅ JING HAO ⋅ Hong Chen ⋅ Jiahao Bao ⋅ Yihua Shao ⋅ Yuci Liang ⋅ Kuo Feng Hung ⋅ Hao Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 632
Forensic-Friendly Image Manipulation via Controllable Latent Diffusion
Hanyu Chen ⋅ Haiwei Wu ⋅ Jinyu Tian ⋅ Jianqing Li ⋅ Jiantao Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 633
IncreFA: Breaking the Static Wall of Generative Model Attribution
Haotian Qin ⋅ Dongliang Chang ⋅ Yueying Gao ⋅ Yuexuan Tan ⋅ Lei Chen ⋅ Zhanyu Ma
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 634
AVFakeBench: A Comprehensive Audio-Video Forgery Detection Benchmark for AV-LMMs
Shuhan Xia ⋅ Peipei Li ⋅ Xuannan Liu ⋅ Dongsen Zhang ⋅ Xinyu Guo ⋅ Zekun Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 635
Detecting Compressed AI-Generated Images via Phase Spectrum Robustness
Kai Li ⋅ Wenqi Ren ⋅ Wei Wang ⋅ Xiaochun Cao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 636
Detect Any AI-Counterfeited Text Image
Chenfan Qu ⋅ Yiwu Zhong ⋅ Xuekang Zhu ⋅ Junchi Li ⋅ Changjiang Jiang ⋅ Jian liu ⋅ Lianwen Jin
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 637
DeepfakeImpact: A Two-Stage Benchmark with Real-World Impact in Deepfake Detection
Chaoyu Gong ⋅ Han Zhang ⋅ Siqiang Luo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 638
Enhancing the Security of Visual Speaker Authentication Based on Dynamic Lip-Print Analysis
Yi He ⋅ Lei Yang ⋅ Bofan Chen ⋅ Shilin Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 639
SimLBR: Learning to Detect Fake Images by Learning to Detect Real Images
Aayush Dhakal ⋅ Subash Khanal ⋅ Srikumar Sastry ⋅ Jacob Arndt ⋅ Philipe Ambrozio Dias ⋅ Dalton Lunga ⋅ Nathan Jacobs
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 640
Editprint: General Digital Image Forensics via Editing Fingerprint with Self-Augmentation Training
Haiwei Wu ⋅ Kemou Li ⋅ Yuanman Li ⋅ Jiantao Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 641
Detecting AI-Generated Forgeries via Iterative Manifold Deviation Amplification
Jiangling Zhang ⋅ Shuxuan Gao ⋅ Bofan Liu ⋅ Siqiang Feng ⋅ Jirui Huang ⋅ Yaxiong Chen ⋅ Ziyu Chen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 642
Goldilocks Test Sets for Face Verification
Haiyu Wu ⋅ Sicong Tian ⋅ Aman Bhatta ⋅ Jacob Gutierrez ⋅ Grace Bezold ⋅ Genesis Argueta ⋅ Karl Ricanek ⋅ Michael C. King ⋅ Kevin W. Bowyer
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 643
Fine-VAD: Towards Fine-Grained Video Anomaly Detection via Progressive Cross-Granularity Learning
Menghao Zhang ⋅ Yiyan Zhu ⋅ Pengfei Ren ⋅ Haifeng Sun ⋅ Qi Qi ⋅ Zirui Zhuang ⋅ Huazheng Wang ⋅ Lei Zhang ⋅ Jianxin Liao ⋅ Jingyu Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 644
DLVP-CLIP: Enhancing Fine-Grained Zero-Shot Anomaly Detection via Dynamic Local Visual Prompting
Gaowei Zhang ⋅ Lihe Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 645
MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection
Jun Yeong Park ⋅ JunYoung Seo ⋅ Minji Kang ⋅ Yu Rang Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 646
Alert-CLIP: Abnormality-aware Latent-Enhanced Representation Tuning of CLIP for Video Anomaly Detection
Yiyan Zhu ⋅ Menghao Zhang ⋅ Haifeng Sun ⋅ Pengfei Ren ⋅ Xianao Chu ⋅ Chenye Xu ⋅ Hong Tan ⋅ Jinghan Wang ⋅ Qi Qi ⋅ Jingyu Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 647
AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors
Matic Fučka ⋅ Vitjan Zavrtanik ⋅ Danijel Skočaj
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 648
LayoutAD: Exploring Semantic-Geometric Misalignment Reasoning for Scene Layout Anomaly Detection
Zhichao Zeng ⋅ Jiasheng Zhang ⋅ Jiyun Sun ⋅ Jiangtao Cui ⋅ Xiaotian Qiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 649
Bidirectional Multimodal Prompt Learning with Scale-Aware Training for Few-Shot Multi-Class Anomaly Detection
Yujin Lee ⋅ Sewon Kim ⋅ Daeun Moon ⋅ Seoyoon Jang ⋅ Hyunsoo Yoon
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 650
GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning
Zehao Deng ⋅ An Liu ⋅ Yan Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 651
TLMA: Mitigating the Impact of Weakly Labeled Information for Video Anomaly Detection
Rong Xu ⋅ Runqi Wang ⋅ Yingjun Zhang ⋅ Tao Tao ⋅ Xiaomeng Li ⋅ Liping Jing
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 652
Defect Cue-Preserved Structural Feature Refinement for Few-Shot Anomaly Detection
Le Jiang ⋅ Yan Huang ⋅ Zhen Xu ⋅ Yong Xu ⋅ Hau San Wong ⋅ Si Wu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 653
Anomaly-Related Residual Fields for Cross-domain Anomaly Detection
Kewei Gao ⋅ Jiayi Xie ⋅ Zhengda Shen ⋅ Weijun Qin ⋅ Lingxiang Jia ⋅ Kejia Chen ⋅ Zunlei Feng ⋅ Yijun Bei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 654
From Attraction to Equilibrium: Physics-Inspired Semantic Gravitons for Zero-Shot Anomaly Detection
Yuwen Pan ⋅ Yuan Wang ⋅ Shaohui Li ⋅ Zhi Li ⋅ Yu LIU ⋅ You He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 655
Joint Learning of General and Diverse Patterns with Mixture of Memory Experts for Weakly-Supervised Video Anomaly Detection
Bo Sun ⋅ Junxi Chen ⋅ Zhe Wu ⋅ Feng Gao ⋅ Fan Yang ⋅ Li Su ⋅ Yaowei Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 656
No Need For Real Anomaly: MLLM Empowered Zero-Shot Video Anomaly Detection
Zunkai Dai ⋅ Ke Li ⋅ JIAJIA LIU ⋅ Jie Yang ⋅ Yuanyuan Qiao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 657
FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement
Ming Hu ⋅ Yongsheng Huo ⋅ Mingyu Dou ⋅ Jianfu Yin ⋅ Peng Zhao ⋅ Yao Wang ⋅ Cong Hu ⋅ Bingliang Hu ⋅ Quan Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 658
DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving
Zhuolin He ⋅ Jing Li ⋅ Guanghao Li ⋅ Xiaolei Chen ⋅ Jiacheng Tang ⋅ Siyang Zhang ⋅ Zhounan Jin ⋅ Feipeng Cai ⋅ Bin Li ⋅ Jian Pu ⋅ Jia Cai ⋅ Xiangyang Xue
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 659
GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation
Zhenya Yang ⋅ Zhe Liu ⋅ Yuxiang Lu ⋅ Liping Hou ⋅ Chenxuan Miao ⋅ peng siyi ⋅ Bailan Feng ⋅ Xiang Bai ⋅ Hengshuang Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 660
Test-Time 3D Occupancy Prediction
Fengyi Zhang ⋅ Xiangyu Sun ⋅ Huitong Yang ⋅ Zheng Zhang ⋅ Zi Huang ⋅ Yadan Luo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 661
Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration
Sicheng Mo ⋅ Thao Nguyen ⋅ Richard Zhang ⋅ Nick Kolkin ⋅ Siddharth Srinivasan Iyer ⋅ Eli Shechtman ⋅ Krishna Kumar Singh ⋅ Yong Jae Lee ⋅ Bolei Zhou ⋅ Yuheng Li
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 662
Diffusion Mental Averages
Phonphrm Thawatdamrongkit ⋅ Sukit Seripanitkarn ⋅ Supasorn Suwajanakorn
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 663
dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Yi Xin ⋅ Siqi Luo ⋅ Tianxiang Xu ⋅ Qi Qin ⋅ Haoxing Chen ⋅ Kaiwen Zhu ⋅ Zhiwei Zhang ⋅ Yangfan He ⋅ Rongchao Zhang ⋅ Jinbin Bai ⋅ Shuo Cao ⋅ Bin Fu ⋅ Junjun He ⋅ Yihao Liu ⋅ Yuewen Cao ⋅ Xiaohong Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 664
RegionRoute: Regional Style Transfer with Diffusion Model
Bowen Chen ⋅ Jake Zuena ⋅ Alan C. ⋅ Divya Kothandaraman
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 665
Low-Rank Residual Diffusion Models
Junfu Tan ⋅ Jiang Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 666
RDF-MIG: A Robust Diffusion Framework for Masked Image Generation to Augment Semantic Segmentation and Change Detection
Zian Cao ⋅ Wei Wei ⋅ QINGSHAN GAO ⋅ Yuanyuan Fu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 667
TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration
Shaoxuan He ⋅ Benlei Cui ⋅ Bukun Huang ⋅ Zhizeng Ye ⋅ Yunyun Sun ⋅ Longtao Huang ⋅ Hui Xue ⋅ Yang Yang ⋅ Haiwen Hong ⋅ Jingqun Tang ⋅ Zhou Zhao
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 668
Bi-directional Autoregressive Diffusion for Large Complex Motion Interpolation
Yongrui Ma ⋅ Shijie Zhao ⋅ Mingde Yao ⋅ Junlin Li ⋅ Li zhang ⋅ Xiaohong Liu ⋅ Qi Dou ⋅ Jinwei Gu ⋅ Tianfan Xue
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 669
Guiding Token-Sparse Diffusion Models
Felix Krause ⋅ Stefan Andreas Baumann ⋅ Johannes Schusterbauer ⋅ Olga Grebenkova ⋅ Ming Gui ⋅ Vincent Tao Hu ⋅ Björn Ommer
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 670
Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep
Tianyi Liu ⋅ Ye Lu ⋅ Linfeng Zhang ⋅ Chen Cai ⋅ Jianjun Gao ⋅ Yi Wang ⋅ Kim-Hui Yap ⋅ Lap-Pui Chau
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 671
See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis
Jaehyun Park ⋅ Minyoung Ahn ⋅ Minkyu Kim ⋅ Jonghyun Lee ⋅ Jae-Gil Lee ⋅ Dongmin Park
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 672
High-Fidelity Virtual Try-On beyond Paired Data Scarcity via Diffusion-based Cycle-Consistent Learning
Jia Wu ⋅ Yijing Dai ⋅ Tingfeng Cao ⋅ Meiling Wu ⋅ Tao Luo ⋅ Jian Dong Zhang ⋅ Guangming Lu ⋅ Xiaoyi Zeng
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 673
Sampling-Aware Quantization for Diffusion Models
Qian Zeng ⋅ Jie Song ⋅ Yuanyu Wan ⋅ Huiqiong Wang ⋅ Mingli Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 674
CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think
Zening Sun ⋅ Zhengpeng Xie ⋅ Lichen Bai ⋅ Shitong Shao ⋅ Shuo Yang ⋅ Zeke Xie
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 675
Scale Space Diffusion
Soumik Mukhopadhyay ⋅ Prateksha Udhayanan ⋅ Abhinav Shrivastava
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 676
Making Training-Free Diffusion Segmentors Scale with the Generative Power
Benyuan Meng ⋅ Qianqian Xu ⋅ Zitai Wang ⋅ Xiaochun Cao ⋅ Longtao Huang ⋅ Qingming Huang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 677
Roots Beneath the Cut: Uncovering the Risk of Concept Recovery in Pruning-Based Unlearning for Diffusion Models
Ci Zhang ⋅ Zhaojun Ding ⋅ Chence Yang ⋅ Jun Liu ⋅ Xiaoming Zhai ⋅ Shaoyi Huang ⋅ Beiwen Li ⋅ Xiaolong Ma ⋅ Jin Lu ⋅ Geng Yuan
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 678
Few-Step Diffusion Sampling Through Instance-Aware Discretizations
Liangyu Yuan ⋅ Ruoyu Wang ⋅ Tong Zhao ⋅ Dingwen Fu ⋅ Mingkun Lei ⋅ Beier Zhu ⋅ Chi Zhang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 679
SpeeDiff: Scalable Pixel-Anchored End-to-End Latent Diffusion Model
Bingliang Zhang ⋅ Wenda Chu ⋅ Yizhuo Li ⋅ Linjie Yang ⋅ Yisong Yue ⋅ Katherine L. Bouman ⋅ Yang Song ⋅ Qiushan Guo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 680
Structure-to-Intensity Diffusion for Adverse-Weather LiDAR Generation
Peiyang Ni ⋅ Longyu Yang ⋅ Lu Zhang ⋅ Kuniaki Saito ⋅ Yap-Peng Tan ⋅ Fumin Shen ⋅ Heng Tao Shen ⋅ Xiaofeng Zhu ⋅ Ping Hu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 681
Focal–General Diffusion Model with Semantic Consistent Guidance for Sign Language Production
Yiheng Yu ⋅ Sheng Liu ⋅ Yuan Feng ⋅ Zhelun Jin ⋅ Yining Jiang ⋅ Min Xu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 682
Diffusion Probe: Generated Image Result Prediction Using CNN Probes
Bukun Huang ⋅ Benlei Cui ⋅ Zhizeng Ye ⋅ Xuemei Dong ⋅ Tuo Chen ⋅ Hui Xue ⋅ Dingkang Yang ⋅ Longtao Huang ⋅ Haiwen Hong ⋅ Jingqun Tang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 683
Content-Aware Dynamic Patchification for Efficient Video Diffusion
Sheng Li ⋅ Connelly Barnes ⋅ Mamshad Nayeem Rizve ⋅ Hongwu Peng ⋅ Zhengang Li ⋅ Ohi Dibua ⋅ Alireza Ganjdanesh ⋅ Xulong Tang ⋅ Yan Kang ⋅ Yifan Gong
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 684
PixelRush: Ultra-Fast, Training-Free High-Resolution Image Generation via One-step Diffusion
Hong-Phuc Lai ⋅ Phong Nguyen ⋅ Anh Tran
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 685
Diffusion-Based sRGB Real Noise Generation via Prompt-Driven Noise Representation Learning
Jaekyun Ko ⋅ Dongjin Kim ⋅ Soomin Lee ⋅ Guanghui Wang ⋅ Tae Hyun Kim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 686
Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation
Ziyue Lin ⋅ Jiahe Hou ⋅ Xia Hongyu ⋅ Xinrui Xie ⋅ Feifei Wang ⋅ Yuyin Zhou ⋅ Wei Wang ⋅ Jiawei Liu ⋅ Liangqiong Qu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 687
GROW: Watermark Generation with Progressive Guidance for Diffusion Models
Pengcheng Luo ⋅ Zexi Jia ⋅ Yijia Zhong ⋅ Jinchao Zhang ⋅ Jie Zhou
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 688
MotionV2V: Editing Motion in a Video
Ryan Burgert ⋅ Charles Herrmann ⋅ Forrester Cole ⋅ Michael Ryoo ⋅ Neal Wadhwa ⋅ Andrey Voynov ⋅ Nataniel Ruiz
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 689
Mind the Generative Details: Direct Localized Detail Preference Optimization for Video Diffusion Models
Zitong Huang ⋅ Kaidong Zhang ⋅ Yukang Ding ⋅ Chao Gao ⋅ Rui Ding ⋅ Ying Chen ⋅ Wangmeng Zuo
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 690
OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
Ali Aliev ⋅ Kamil Garifullin ⋅ Nikolay Yudin ⋅ Vera Soboleva ⋅ Alexander Molozhavenko ⋅ Ivan Oseledets ⋅ Aibek Alanov ⋅ Maxim Rakhuba
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 691
DreamStyle: A Unified Framework for Video Stylization
Mengtian Li ⋅ Jinshu Chen ⋅ Songtao Zhao ⋅ Wanquan Feng ⋅ Pengqi Tu ⋅ Qian He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 692
Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering
Sixian Wang ⋅ Zhiwei Tang ⋅ Tsung-Hui Chang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 693
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
Peiyu Yu ⋅ Suraj Kothawade ⋅ Sirui Xie ⋅ Ying Nian Wu ⋅ Hongliang Fei
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 694
Reward Sharpness-Aware Fine-Tuning for Diffusion Models
Kwanyoung Kim ⋅ Byeongsu Sim
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 695
DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation
Sankarshana Venugopal ⋅ Mohammad Mostafavi ⋅ Jonghyun Choi
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 696
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
Yuqing Wang ⋅ Chuofan Ma ⋅ Zhijie Lin ⋅ Yao Teng ⋅ Lijun Yu ⋅ Shuai Wang ⋅ Jiaming Han ⋅ Jiashi Feng ⋅ Yi Jiang ⋅ Xihui Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 697
TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration
Haowei Zhu ⋅ Tingxuan Huang ⋅ Xing Wang ⋅ Tianyu Zhao ⋅ Jiexi Wang ⋅ Weifeng Chen ⋅ Xurui Peng ⋅ Fangmin Chen ⋅ Junhai Yong ⋅ Bin Wang
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 698
Cross-modal Representation Learning for Diffusion-generated Image Detection
Tao Gong ⋅ Dayong Wang ⋅ Qi Chu ⋅ Bin Liu ⋅ Nenghai Yu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 699
Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models
Shufan Li ⋅ Jiuxiang Gu ⋅ Kangning Liu ⋅ Zhe Lin ⋅ Zijun Wei ⋅ Aditya Grover ⋅ Jason Kuen
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 700
Back to Basics: Let Denoising Generative Models Denoise
Tianhong Li ⋅ Kaiming He
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 701
CaricHarmony: Contrastive Diffusion Paths for Identity-Preserving Caricature Synthesis
Dongyu Wang ⋅ Dar-Yen Chen ⋅ Yi-Zhe Song
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 702
DiP: Taming Diffusion Models in Pixel Space
Zhennan Chen ⋅ Junwei Zhu ⋅ Xu Chen ⋅ Jiangning Zhang ⋅ Xiaobin Hu ⋅ Hanzhen Zhao ⋅ Chengjie Wang ⋅ Jian Yang ⋅ Ying Tai
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 703
RAPID: Reusing Attention Sparsity with Inter-step Adaptation for Efficient Video Diffusion
Shangran Lin ⋅ Lu Lu ⋅ Jian Chen ⋅ Qiang Liu
Poster
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F 704
Efficient and Training-Free Single-Image Diffusion Models
Haojun Qiu ⋅ Kiriakos N. Kutulakos ⋅ David B. Lindell
Poster Session
Sun Jun 07 10:45 AM -- 12:45 PM (PDT) @ ExHall F
Poster Session 5 & Exhibit Hall
Art Program
Sun Jun 07 10:45 AM -- 02:00 PM (PDT) @ ExHall F
Art Exhibition
Luba Elliott
Art Program
Sun Jun 07 10:45 AM -- 11:15 AM (PDT) @ ExHall F
Art Gallery Tour with Curator and Artists
Luba Elliott
Oral
Sun Jun 07 01:00 PM -- 01:15 PM (PDT) @ Mile High Ballroom 3A - 4A
Efficient Unrolled Networks for Large-Scale 3D Inverse Problems
Romain Vo ⋅ Julián Tachella
Oral
Sun Jun 07 01:00 PM -- 01:12 PM (PDT) @ Mile High Ballroom 1A - 2A
CURE: Curriculum-guided Multi-task Training for Reliable Anatomy Grounded Report Generation
Pablo Messina ⋅ Andrés Villa ⋅ Juan León Alcázar ⋅ Karen Sanchez ⋅ Carlos Hinojosa ⋅ Denis Parra ⋅ Alvaro Soto ⋅ Bernard Ghanem
Oral
Sun Jun 07 01:00 PM -- 01:15 PM (PDT) @ Bluebird Ballroom
Differentiable Laplacian Matrix Guided Superpixel Segmentation
Jeremy Juybari ⋅ Joshua Hamilton ⋅ Shuvra Das ⋅ Chaofan Chen ⋅ Andre Khalil ⋅ Yifeng Zhu
Oral
Sun Jun 07 01:00 PM -- 01:15 PM (PDT) @ Four Seasons Ballroom
CineBrain: A Large-Scale Multi-Modal Audiovisual Brain Dataset for Brain-Conditioned Video Generation
Jianxiong Gao ⋅ Yichang Liu ⋅ Baofeng Yang ⋅ Jianfeng Feng ⋅ Yanwei Fu
Oral Session
Sun Jun 07 01:00 PM -- 02:15 PM (PDT) @ Bluebird Ballroom
Oral Session 6A: Geometric Learning
Oral Session
Sun Jun 07 01:00 PM -- 02:15 PM (PDT) @ Mile High Ballroom 1A - 2A
Oral Session 6C: Medical Vision
Oral Session
Sun Jun 07 01:00 PM -- 02:15 PM (PDT) @ Four Seasons Ballroom
Oral Session 6B: Multimodal Reasoning
Oral Session
Sun Jun 07 01:00 PM -- 02:15 PM (PDT) @ Mile High Ballroom 3A - 4A
Oral Session 6D: Large-Scale Neural Modeling
Oral
Sun Jun 07 01:12 PM -- 01:25 PM (PDT) @ Mile High Ballroom 1A - 2A
DK-DDIL: Adaptive Knowledge Retention for Dynamic Domain-Incremental Learning in Medical Imaging
Yuxi Ma ⋅ Sujie Liu ⋅ Jing Yang ⋅ Jiacheng Wang ⋅ Yiping Chen ⋅ Baptiste Magnier ⋅ Liansheng Wang
Oral
Sun Jun 07 01:15 PM -- 01:30 PM (PDT) @ Bluebird Ballroom
FILTR: Extracting Topological Features from Pretrained 3D Models
Louis Martinez ⋅ Maks Ovsjanikov
Oral
Sun Jun 07 01:15 PM -- 01:30 PM (PDT) @ Mile High Ballroom 3A - 4A
FedAdamom: Adaptive Momentum for Improved Generalization in Federated Optimization
Wenjie Hou ⋅ Tianxiang Chen ⋅ Feng Wang ⋅ Tiantong Wu ⋅ Zhiming Zheng ⋅ Shaoting Tang ⋅ Wei Yang Bryan Lim
Oral
Sun Jun 07 01:15 PM -- 01:30 PM (PDT) @ Four Seasons Ballroom
Hearing the Room Through the Shape of the Drum: Modal-Guided Sound Recovery from Multi-Point Surface Vibrations
Shai Bagon ⋅ Matan Kichler ⋅ Mark Sheinin
Oral
Sun Jun 07 01:25 PM -- 01:37 PM (PDT) @ Mile High Ballroom 1A - 2A
Dual-level Adapter Boosting Prompt-free Curvilinear Structure Segmentation
Kai Zhu ⋅ Li Chen ⋅ Jun Cheng
Oral
Sun Jun 07 01:30 PM -- 01:45 PM (PDT) @ Bluebird Ballroom
Learning Convex Decomposition via Feature Fields
Yuezhi Yang ⋅ Qixing Huang ⋅ Mikaela Angelina Uy ⋅ Nicholas Sharp
Oral
Sun Jun 07 01:30 PM -- 01:45 PM (PDT) @ Four Seasons Ballroom
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan ⋅ Zhenbang Ren ⋅ Haodi Wu ⋅ Wenjie Wei ⋅ Rui-Jie Zhu ⋅ Shuai Wang ⋅ Dehao Zhang ⋅ Yichen Xiao ⋅ Jieyuan Zhang ⋅ Kexin Shi ⋅ Jingzhinan Wang ⋅ Jason K. Eshraghian ⋅ Haicheng Qu ⋅ Malu Zhang
Oral
Sun Jun 07 01:30 PM -- 01:45 PM (PDT) @ Mile High Ballroom 3A - 4A
SimScale: Learning to Drive via Real-World Simulation at Scale
Haochen Tian ⋅ Tianyu Li ⋅ Haochen Liu ⋅ Jiazhi Yang ⋅ Yihang Qiu ⋅ Guang Li ⋅ Junli Wang ⋅ Yinfeng Gao ⋅ Zhang Zhang ⋅ Liang Wang ⋅ Hangjun Ye ⋅ Long Chen ⋅ Hongyang Li
Oral
Sun Jun 07 01:37 PM -- 01:50 PM (PDT) @ Mile High Ballroom 1A - 2A
LATA: Laplacian-Assisted Transductive Adaptation for Conformal Uncertainty in Medical VLMs
Behzad Bozorgtabar ⋅ Dwarikanath Mahapatra ⋅ Sudipta Roy ⋅ Muzammal Naseer ⋅ Imran Razzak ⋅ Zongyuan Ge
Oral
Sun Jun 07 01:45 PM -- 02:00 PM (PDT) @ Mile High Ballroom 3A - 4A
Texvent: Asynchronous Event Data Simulation via Text Prompt
Ruofei Wang ⋅ Peiqi Duan ⋅ Ka Chun Cheung ⋅ Simon See ⋅ Boxin Shi ⋅ Renjie Wan
Oral
Sun Jun 07 01:45 PM -- 02:00 PM (PDT) @ Bluebird Ballroom
Learning Eigenstructures of Unstructured Data Manifolds
Roy Velich ⋅ Arkadi Piven ⋅ David Bensaid ⋅ Daniel Cremers ⋅ Thomas Dagès ⋅ Ron Kimmel
Oral
Sun Jun 07 01:45 PM -- 02:00 PM (PDT) @ Four Seasons Ballroom
Thinking with Drafts: Speculative Temporal Reasoning for Efficient Long Video Understanding
Pengfei Hu ⋅ Meng Cao ⋅ Yingyao Wang ⋅ Yi Wang ⋅ Jiahua Dong ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Xiaodan Liang
Oral
Sun Jun 07 01:50 PM -- 02:02 PM (PDT) @ Mile High Ballroom 1A - 2A
Medic-AD: Towards Medical Vision-Language Model's Clinical Intelligence
Woohyeon Park ⋅ Jaeik Kim ⋅ Sunghwan Steve Cho ⋅ Pa Hong ⋅ Wookyoung Jeong ⋅ Yoojin Nam ⋅ Namjoon Kim ⋅ Ginny Y. Wong ⋅ Ka Chun Cheung ⋅ Jaeyoung Do
Poster Setup
Sun Jun 07 02:00 PM -- 02:30 PM (PDT) @ ExHall A
Poster Setup
Oral
Sun Jun 07 02:00 PM -- 02:15 PM (PDT) @ Four Seasons Ballroom
Wan-Weaver: Interleaved Multi-modal Generation via Decoupled Training
Jinbo Xing ⋅ Zeyinzi Jiang ⋅ Yuxiang Tuo ⋅ Chaojie Mao ⋅ Xiaotang Gai ⋅ Xi Chen ⋅ Jingfeng Zhang ⋅ Yulin Pan ⋅ Zhen Han ⋅ Jie Xiao ⋅ Keyu Yan ⋅ Chenwei Xie ⋅ Chongyang Zhong ⋅ Kai Zhu ⋅ Tong Shen ⋅ Lianghua Huang ⋅ Yu Liu ⋅ Yujiu Yang
Oral
Sun Jun 07 02:00 PM -- 02:15 PM (PDT) @ Mile High Ballroom 3A - 4A
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Ao Liang ⋅ Lingdong Kong ⋅ Tianyi Yan ⋅ Hongsi Liu ⋅ Yu Yang ⋅ Ziqi Huang ⋅ Wei Yin ⋅ Jialong Zuo ⋅ Yixuan Hu ⋅ Dekai Zhu ⋅ Dongyue Lu ⋅ Youquan Liu ⋅ Guangfeng Jiang ⋅ Linfeng Li ⋅ Xiangtai Li ⋅ Long Zhuo ⋅ Lai Xing Ng ⋅ Benoit R. Cottereau ⋅ Changxin Gao ⋅ Liang Pan ⋅ Wei Tsang Ooi ⋅ Ziwei Liu
Oral
Sun Jun 07 02:00 PM -- 02:15 PM (PDT) @ Bluebird Ballroom
Mapping Networks
Lord Sen ⋅ Shyamapada Mukherjee
Oral
Sun Jun 07 02:02 PM -- 02:15 PM (PDT) @ Mile High Ballroom 1A - 2A
SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation
Yujie Lu ⋅ Jingwen Li ⋅ Sibo Ju ⋅ Yanzhou Su ⋅ He Yao ⋅ Yisong Liu ⋅ Min Zhu ⋅ Junlong Cheng
Break
Sun Jun 07 02:15 PM -- 02:30 PM (PDT)
Courtesy Break
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 1
Differentiable Laplacian Matrix Guided Superpixel Segmentation
Jeremy Juybari ⋅ Joshua Hamilton ⋅ Shuvra Das ⋅ Chaofan Chen ⋅ Andre Khalil ⋅ Yifeng Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 2
FILTR: Extracting Topological Features from Pretrained 3D Models
Louis Martinez ⋅ Maks Ovsjanikov
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 3
Learning Convex Decomposition via Feature Fields
Yuezhi Yang ⋅ Qixing Huang ⋅ Mikaela Angelina Uy ⋅ Nicholas Sharp
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 4
Learning Eigenstructures of Unstructured Data Manifolds
Roy Velich ⋅ Arkadi Piven ⋅ David Bensaid ⋅ Daniel Cremers ⋅ Thomas Dagès ⋅ Ron Kimmel
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 5
Mapping Networks
Lord Sen ⋅ Shyamapada Mukherjee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 6
CineBrain: A Large-Scale Multi-Modal Audiovisual Brain Dataset for Brain-Conditioned Video Generation
Jianxiong Gao ⋅ Yichang Liu ⋅ Baofeng Yang ⋅ Jianfeng Feng ⋅ Yanwei Fu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 7
Hearing the Room Through the Shape of the Drum: Modal-Guided Sound Recovery from Multi-Point Surface Vibrations
Shai Bagon ⋅ Matan Kichler ⋅ Mark Sheinin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 8
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan ⋅ Zhenbang Ren ⋅ Haodi Wu ⋅ Wenjie Wei ⋅ Rui-Jie Zhu ⋅ Shuai Wang ⋅ Dehao Zhang ⋅ Yichen Xiao ⋅ Jieyuan Zhang ⋅ Kexin Shi ⋅ Jingzhinan Wang ⋅ Jason K. Eshraghian ⋅ Haicheng Qu ⋅ Malu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 9
Thinking with Drafts: Speculative Temporal Reasoning for Efficient Long Video Understanding
Pengfei Hu ⋅ Meng Cao ⋅ Yingyao Wang ⋅ Yi Wang ⋅ Jiahua Dong ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Xiaodan Liang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 10
Wan-Weaver: Interleaved Multi-modal Generation via Decoupled Training
Jinbo Xing ⋅ Zeyinzi Jiang ⋅ Yuxiang Tuo ⋅ Chaojie Mao ⋅ Xiaotang Gai ⋅ Xi Chen ⋅ Jingfeng Zhang ⋅ Yulin Pan ⋅ Zhen Han ⋅ Jie Xiao ⋅ Keyu Yan ⋅ Chenwei Xie ⋅ Chongyang Zhong ⋅ Kai Zhu ⋅ Tong Shen ⋅ Lianghua Huang ⋅ Yu Liu ⋅ Yujiu Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 11
CURE: Curriculum-guided Multi-task Training for Reliable Anatomy Grounded Report Generation
Pablo Messina ⋅ Andrés Villa ⋅ Juan León Alcázar ⋅ Karen Sanchez ⋅ Carlos Hinojosa ⋅ Denis Parra ⋅ Alvaro Soto ⋅ Bernard Ghanem
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 12
DK-DDIL: Adaptive Knowledge Retention for Dynamic Domain-Incremental Learning in Medical Imaging
Yuxi Ma ⋅ Sujie Liu ⋅ Jing Yang ⋅ Jiacheng Wang ⋅ Yiping Chen ⋅ Baptiste Magnier ⋅ Liansheng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 13
Dual-level Adapter Boosting Prompt-free Curvilinear Structure Segmentation
Kai Zhu ⋅ Li Chen ⋅ Jun Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 14
LATA: Laplacian-Assisted Transductive Adaptation for Conformal Uncertainty in Medical VLMs
Behzad Bozorgtabar ⋅ Dwarikanath Mahapatra ⋅ Sudipta Roy ⋅ Muzammal Naseer ⋅ Imran Razzak ⋅ Zongyuan Ge
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 15
Medic-AD: Towards Medical Vision-Language Model's Clinical Intelligence
Woohyeon Park ⋅ Jaeik Kim ⋅ Sunghwan Steve Cho ⋅ Pa Hong ⋅ Wookyoung Jeong ⋅ Yoojin Nam ⋅ Namjoon Kim ⋅ Ginny Y. Wong ⋅ Ka Chun Cheung ⋅ Jaeyoung Do
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 16
SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation
Yujie Lu ⋅ Jingwen Li ⋅ Sibo Ju ⋅ Yanzhou Su ⋅ He Yao ⋅ Yisong Liu ⋅ Min Zhu ⋅ Junlong Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 17
Efficient Unrolled Networks for Large-Scale 3D Inverse Problems
Romain Vo ⋅ Julián Tachella
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 18
FedAdamom: Adaptive Momentum for Improved Generalization in Federated Optimization
Wenjie Hou ⋅ Tianxiang Chen ⋅ Feng Wang ⋅ Tiantong Wu ⋅ Zhiming Zheng ⋅ Shaoting Tang ⋅ Wei Yang Bryan Lim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 19
SimScale: Learning to Drive via Real-World Simulation at Scale
Haochen Tian ⋅ Tianyu Li ⋅ Haochen Liu ⋅ Jiazhi Yang ⋅ Yihang Qiu ⋅ Guang Li ⋅ Junli Wang ⋅ Yinfeng Gao ⋅ Zhang Zhang ⋅ Liang Wang ⋅ Hangjun Ye ⋅ Long Chen ⋅ Hongyang Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 20
Texvent: Asynchronous Event Data Simulation via Text Prompt
Ruofei Wang ⋅ Peiqi Duan ⋅ Ka Chun Cheung ⋅ Simon See ⋅ Boxin Shi ⋅ Renjie Wan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 21
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Ao Liang ⋅ Lingdong Kong ⋅ Tianyi Yan ⋅ Hongsi Liu ⋅ Yu Yang ⋅ Ziqi Huang ⋅ Wei Yin ⋅ Jialong Zuo ⋅ Yixuan Hu ⋅ Dekai Zhu ⋅ Dongyue Lu ⋅ Youquan Liu ⋅ Guangfeng Jiang ⋅ Linfeng Li ⋅ Xiangtai Li ⋅ Long Zhuo ⋅ Lai Xing Ng ⋅ Benoit R. Cottereau ⋅ Changxin Gao ⋅ Liang Pan ⋅ Wei Tsang Ooi ⋅ Ziwei Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 22
BuildingGPT: Auto-Regressive Building Wireframe Reconstruction Model with Reinforcement Learning
Yuzhou Liu ⋅ Lingjie Zhu ⋅ Hanqiao Ye ⋅ Yujun Liu ⋅ Shangfeng Huang ⋅ Xiang Gao ⋅ Ruisheng Wang ⋅ Shuhan Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 23
Emergent Extreme-View Geometry in 3D Foundation Models
Yiwen Zhang ⋅ Joseph Tung ⋅ Ruojin Cai ⋅ David Fouhey ⋅ Hadar Averbuch-Elor
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 24
LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
Zhijian Shu ⋅ Cheng Lin ⋅ Tao Xie ⋅ Wei Yin ⋅ Ben Li ⋅ Zhiyuan Pu ⋅ Weize Li ⋅ Yao Yao ⋅ Xun Cao ⋅ Xiaoyang Guo ⋅ Xiaoxiao Long
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 25
LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction
Tianye Ding ⋅ Yiming Xie ⋅ Yiqing Liang ⋅ Moitreya Chatterjee ⋅ Pedro Miraldo ⋅ Huaizu Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 26
PanoVGGT: Feed-Forward 3D Reconstruction from Panoramic Imagery
Yijing Guo ⋅ Mengjun Chao ⋅ Luo Wang ⋅ Tianyang Zhao ⋅ Haizhao Dai ⋅ Yingliang Zhang ⋅ Jingyi Yu ⋅ Yujiao Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 27
Rascene: High-Fidelity 3D Scene Imaging with mmWave Communication Signals
Kunzhe Song ⋅ Geo Jie Zhou ⋅ Xiaoming Liu ⋅ Huacheng Zeng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 28
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale
Sven Elflein ⋅ Ruilong Li ⋅ Sérgio Agostinho ⋅ Žan Gojčič ⋅ Laura Leal-Taixe ⋅ Qunjie Zhou ⋅ Aljoša Ošep
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 29
SEA-Flow3D: Simplified, Efficient, and Accurate Scene Flow via Spatial Vector Sampling and Multi-scale Refinement
Han Ling ⋅ Quansen Sun ⋅ Yinghua Yao ⋅ Ivor Tsang ⋅ Yinghui Sun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 30
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
Hao Li ⋅ Hao Li ⋅ Yalun Dai ⋅ Yushi Lan ⋅ Yihang Luo ⋅ Tianyu Qi ⋅ Zhengshen Zhang ⋅ Yufeng Zhan ⋅ Junfei Zhang ⋅ Wenchao Xu ⋅ Ziwei Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 31
DROID-SLAM in the Wild
Moyang Li ⋅ Zihan Zhu ⋅ Marc Pollefeys ⋅ Daniel Barath
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 32
HeSS: Head Sensitivity Score for Sparsity Redistribution in VGGT
Yongsung Kim ⋅ Wooseok Song ⋅ Jaihyun Lew ⋅ Hun Hwangbo ⋅ Jaehoon Lee ⋅ Sungroh Yoon
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 33
Dense Metric Depth Completion from Sparse Direct Time-of-Flight Sensors
Hakyeong Kim ⋅ Ruicheng Wang ⋅ Chengtang Yao ⋅ Jiaolong Yang ⋅ Min H. Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 34
Online3R: Online Learning for Consistent Sequential Reconstruction Based on Geometry Foundation Model
Shunkai Zhou ⋅ Zike Yan ⋅ Fei Xue ⋅ Dong Wu ⋅ Yuchen Deng ⋅ Hongbin Zha
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 35
Neu-PiG: Neural Preconditioned Grids for Fast Dynamic Surface Reconstruction on Long Sequences
Julian Kaltheuner ⋅ Hannah Dröge ⋅ Markus Plack ⋅ Patrick Stotko ⋅ Reinhard Klein
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 36
Learning 3D Reconstruction with Priors in Test Time
Lei Zhou ⋅ Haoyu Wu ⋅ Akshat Dave ⋅ Dimitris Samaras
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 37
ArchSym: Detecting 3D-Grounded Architectural Symmetries in the Wild
Hanyu Chen ⋅ Ruojin Cai ⋅ Steve Marschner ⋅ Noah Snavely
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 38
PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding
Siyuan Liu ⋅ Chaoqun Zheng ⋅ Xin Zhou ⋅ Tianrui Feng ⋅ Dingkang Liang ⋅ Xiang Bai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 39
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction
Chen Wang ⋅ Hao Tan ⋅ Wang Yifan ⋅ Zhiqin Chen ⋅ Yuheng Liu ⋅ Kalyan Sunkavalli ⋅ Sai Bi ⋅ Lingjie Liu ⋅ Yiwei Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 40
Hint2Gen: Bridging Understanding and Generation via Code-structured Hints
Yuanpeng Tu ⋅ Yunpeng Chen ⋅ Xi Chen ⋅ Liang Li ⋅ Hengshuang Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 41
Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization
Zhuohan Liu ⋅ Wujian Peng ⋅ Yitong Chen ⋅ Zuxuan Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 42
Learning by Analogy: A Causal Framework for Compositional Generalization
Lingjing Kong ⋅ Shaoan Xie ⋅ Yang Jiao ⋅ Yetian Chen ⋅ Yanhui Guo ⋅ Simone Shao ⋅ Yan Gao ⋅ Guangyi Chen ⋅ Kun Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 43
ID-Crafter: VLM-Grounded Online RL for Compositional Multi-Subject Video Generation
Panwang Pan ⋅ Jingjing Zhao ⋅ Yuchen Lin ⋅ Chenguo Lin ⋅ Chenxin Li ⋅ Hengyu Liu ⋅ Tingting Shen ⋅ Yadong Mu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 44
GenColorBench: A Color Evaluation Benchmark for Text-to-Image Generation
Muhammad Atif Butt ⋅ Alexandra Gomez-Villa ⋅ Tao Wu ⋅ Javier Vazquez-Corral ⋅ Joost van de Weijer ⋅ Kai Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 45
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
Chenxi Zhao ⋅ Chen Zhu ⋅ Xiaokun Feng ⋅ Aiming Hao ⋅ Jiashu Zhu ⋅ Jiachen Lei ⋅ Jiahong Wu ⋅ Xiangxiang Chu ⋅ Jufeng Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 46
When Pretty Isn’t Useful: Investigating Why Modern Text-to-Image Models Fail as Reliable Training Data Generators
Krzysztof Adamkiewicz ⋅ Brian B. Moser ⋅ Stanislav Frolov ⋅ Tobias Christian Nauen ⋅ Federico Raue ⋅ Andreas Dengel
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 47
TempoControl: Temporal Attention Guidance for Text-to-Video Models
Shira Schiber ⋅ Ofir Lindenbaum ⋅ Idan Schwartz
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 48
Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
Junwon Lee ⋅ Juhan Nam ⋅ Jiyoung Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 49
MultiCrafter: High-Fidelity Multi-Subject Generation via Disentangled Attention and Identity-Aware Preference Alignment
Tao Wu ⋅ Yibo Jiang ⋅ Yehao Lu ⋅ Zhizhong Wang ⋅ Zeyi Huang ⋅ Zequn Qin ⋅ Xi Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 50
Resolving the Identity Crisis in Text-to-Image Generation
Shubhankar Borse ⋅ Farzad Farhadzadeh ⋅ Munawar Hayat ⋅ Fatih Porikli
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 51
DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation
Zhuoling Li ⋅ Hossein Rahmani ⋅ Jiarui Zhang ⋅ Yu Xue ⋅ Majid Mirmehdi ⋅ Jason Kuen ⋅ Jiuxiang Gu ⋅ Jun Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 52
Gloria: Consistent Character Video Generation via Content Anchors
Yuhang Yang ⋅ Fan Zhang ⋅ Huaijin Pi ⋅ Ailing Zeng ⋅ Shuai Guo ⋅ Guowei Xu ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 53
DreamShot: Personalized Storyboard Synthesis with Video Diffusion Prior
Junjia Huang ⋅ Binbin Yang ⋅ Pengxiang Yan ⋅ Jiyang Liu ⋅ Bin Xia ⋅ Zhao Wang ⋅ Yitong Wang ⋅ Liang Lin ⋅ Guanbin Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 54
M4V: Multimodal Mamba for Efficient Text-to-Video Generation
Jiancheng Huang ⋅ Gengwei Zhang ⋅ Zequn Jie ⋅ Siyu Jiao ⋅ Yinlong Qian ⋅ Ling Chen ⋅ Yunchao Wei ⋅ Lin Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 55
Property-Informed Diffusion-Based Text-to-Microstructure Generation
Bingxuan Dai ⋅ Hongsong Wang ⋅ Jie Gui
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 56
DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
Patrick Kwon ⋅ Chen Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 57
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
Haozhe Liu ⋅ Ding Liu ⋅ Mingchen Zhuge ⋅ Zijian Zhou ⋅ Tian Xie ⋅ Sen He ⋅ Yukang Yang ⋅ Shuming Liu ⋅ Yuren Cong ⋅ Jiadong Guo ⋅ Hongyu Xu ⋅ Ke Xu ⋅ Kam-Woh Ng ⋅ Juan C. Perez ⋅ Juan-Manuel Pérez-Rúa ⋅ Tao Xiang ⋅ Wei Liu ⋅ Shikun Liu ⋅ Jürgen Schmidhuber
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 58
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning
Hongji Yang ⋅ Yucheng Zhou ⋅ Wencheng Han ⋅ Runzhou Tao ⋅ Zhongying Qiu ⋅ Jianfei Yang ⋅ Jianbing Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 59
TherA: Thermal-Aware Visual-Language Prompting for Controllable RGB-to-Thermal Infrared Translation
Dong-Guw Lee ⋅ Tai Hyoung Rhee ⋅ Hyunsoo Jang ⋅ Young-Sik Shin ⋅ Ukcheol Shin ⋅ Ayoung Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 60
See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding
Bo-Yuan Sun ⋅ Bowen Yin ⋅ Yuanming Li ⋅ Xihan Wei ⋅ Qibin Hou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 61
CoV-Align: Efficient Fine-grained Cross-Modal Alignment with Cohesive Visual Semantics Priority
Hengqi Liu ⋅ Wanting Zhou ⋅ Longteng Kong ⋅ Fangxiang Feng ⋅ Lei Ren ⋅ Wei Chen ⋅ Xiaojie Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 62
TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment
Qin Chunxia ⋅ Chenyu Liu ⋅ Pengcheng Xia ⋅ Jun Du ⋅ Baocai Yin ⋅ Bing Yin ⋅ Cong Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 63
A Mixed Diet Makes DINO An Omnivorous Vision Encoder
Rishabh Kabra ⋅ Maks Ovsjanikov ⋅ Drew A Hudson ⋅ Ye Xia ⋅ Skanda Koppula ⋅ André Araujo ⋅ Joao Carreira ⋅ Niloy J. Mitra
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 64
Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models
Hayeon Kim ⋅ Ji Ha Jang ⋅ Junghun James Kim ⋅ Se Young Chun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 65
TaskForce: Cooperative Multi-agent Reinforcement Learning for Multi-task Optimization
Wonhyeok Choi ⋅ Kyumin Hwang ⋅ Jihun Park ⋅ Kyoungmin Lee ⋅ Seunghun Lee ⋅ Jaeyeul Kim ⋅ Minwoo Choi ⋅ Sunghoon Im
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 66
PhyCritic: Multimodal Critic Models for Physical AI
Tianyi Xiong ⋅ Shihao Wang ⋅ Guilin Liu ⋅ Yi Dong ⋅ Ming Li ⋅ Heng Huang ⋅ Jan Kautz ⋅ Zhiding Yu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 67
R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning
Zirui Zhang ⋅ Haoyu Dong ⋅ Kexin Pei ⋅ Chengzhi Mao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 68
Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image
Yushi Hu ⋅ Reyhane Askari ⋅ Melissa Hall ⋅ Emily Dinan ⋅ Luke Zettlemoyer ⋅ Marjan Ghazvininejad
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 69
Unified Generation and Self-Verification for Vision-Language Models via Advantage Decoupled Preference Optimization
Xinyu Qiu ⋅ Heng Jia ⋅ Zhengwen Zeng ⋅ Shuheng Shen ⋅ Changhua Meng ⋅ Yi Yang ⋅ Linchao Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 70
Anchoring the Mind of Multimodal Reasoners: Cognitive Bias as a Vector for Jailbreak Attacks
Linhua Cong ⋅ Bingrui Sima ⋅ Kun He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 71
InsCal: Calibrated Multi-Source Fully Test-Time Prompt Tuning for Object Detection
Xiaofan Que ⋅ Dingrong Wang ⋅ Xumin Liu ⋅ Qi Yu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 72
Why Not Hyperparameter-Friendly Optimisation? A Monotonic Adaptive Norm Rescaling Approach For Long-Tailed Recognition
Shuo Zhang ⋅ Chenqi Li ⋅ Tingting Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 73
Decoupling Vision and Language: Codebook Anchored Visual Adaptation
Jason Wu ⋅ Tianchen Zhao ⋅ Chang Liu ⋅ Jiarui Cai ⋅ Zheng Zhang ⋅ Zhuowei Li ⋅ Aaditya Singh ⋅ Xiang Xu ⋅ Mani Srivastava ⋅ Jonathan Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 74
MemFlow: A Lightweight Forward Memorizing Framework for Quick Domain Adaptive Feature Mapping
Jianming Lv ⋅ Chengjun Wang ⋅ Depin Liang ⋅ Qianli Ma ⋅ Wei Chen ⋅ Xueqi Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 75
Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learning
Zhenyu Zhang ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li ⋅ Guangyao Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 76
Vision-Language Model Guided Source-Free Domain Adaptation via Optimal Transport
Shuo Han ⋅ Xu Tang ⋅ Jingjing Ma ⋅ Xiangrong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 77
Masked Representation Modeling for Domain-Adaptive Segmentation
Wenlve Zhou ⋅ Zhiheng Zhou ⋅ Tiantao Xian ⋅ Yikui Zhai ⋅ Weibin Wu ⋅ Biyun Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 78
TaskIT: Memory-Efficient Fine-Tuning of Multi-LoRA LLMs via Cross-Task Importance Transfer
Cheng Fang ⋅ Zimu Zhou ⋅ Ke Ma ⋅ Bin Guo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 79
ARES: Unifying Asymmetric RGB-Event Stereo for Probabilistic Scene Flow Estimation
Jie Long Lee ⋅ Gim Hee Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 80
MER-Tracker: Towards High-Speed 3D Point Tracking via Multi-View Event-RGB Hybrid Cameras
Yiqian Chang ⋅ Qinghong Ye ⋅ Haoran Xu ⋅ Jianing Li ⋅ Dongyang Ma ⋅ Xuan Wang ⋅ Wei Zhang ⋅ Yonghong Tian ⋅ Peixi Peng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 81
Moving Border Ownership for Event-based Motion Segmentation
Zhiyuan Hua ⋅ Cornelia Fermuller ⋅ Yiannis Aloimonos
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 82
TTAPFormer: Robust Arbitrary Point Tracking via Transient Asynchronous Fusion of Frames and Events
Jiaxiong Liu ⋅ Zhen Tan ⋅ Jinpu Zhang ⋅ Yi Zhou ⋅ Hui Shen ⋅ Xieyuanli Chen ⋅ Dewen Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 83
EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors
Luca Bartolomei ⋅ Fabio Tosi ⋅ Matteo Poggi ⋅ Stefano Mattoccia ⋅ Guillermo Gallego
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 84
Seeing Motion Through Polarity for Event-based Action Recognition
Meiqi Cao ⋅ Jiachao Zhang ⋅ Xin Jiang ⋅ Rui Yan ⋅ Yazhou Yao ⋅ Zechao Li ⋅ Xiangbo Shu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 85
Multi-Scale Gaussian-Language Map for Zero-shot Embodied Navigation and Reasoning
Sixian Zhang ⋅ Yiyao Wang ⋅ Xinhang Song ⋅ Keming Zhang ⋅ Zijian Xu ⋅ Shuqiang Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 86
Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration
sen wang ⋅ Bangwei Liu ⋅ Zhenkun Gao ⋅ Lizhuang Ma ⋅ Xuhong Wang ⋅ Yuan Xie ⋅ Xin Tan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 87
SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL
Siyi Chen ⋅ Mikaela Angelina Uy ⋅ Chan Hee Song ⋅ Faisal Ladhak ⋅ Adithya Murali ⋅ Qing Qu ⋅ Stan Birchfield ⋅ Valts Blukis ⋅ Jonathan Tremblay
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 88
TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size
Stefan Lionar ⋅ Gim Hee Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 89
AREA3D: Active Reconstruction Agent with Unified Feed-Forward 3D Perception and Vision-Language Guidance
Tianling Xu ⋅ Shengzhe GAN ⋅ Leslie Gu ⋅ Yuelei Li ⋅ Fangneng Zhan ⋅ Hanspeter Pfister
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 90
Experience Transfer for Multimodal LLM Agents in Minecraft Game
Chenghao Li ⋅ Jun Liu ⋅ Songbo Zhang ⋅ HuaDong Jian ⋅ Hao Ni ⋅ LIK-HANG LEE ⋅ SUNG BAE BAE ⋅ Guoqing Wang ⋅ Yang Yang ⋅ Chaoning Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 91
MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
Xun Huang ⋅ Shijia Zhao ⋅ Yunxiang Wang ⋅ Xin Lu ⋅ Wanfa Zhang ⋅ Rongsheng Qu ⋅ Weixin Li ⋅ Yunhong Wang ⋅ Chenglu Wen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 92
SaPaVe: Towards Active Perception and Manipulation in Vision-Language Action Models for Robotics
Mengzhen Liu ⋅ Enshen Zhou ⋅ Cheng Chi ⋅ Yi Han ⋅ Shanyu Rong ⋅ Liming Chen ⋅ Pengwei Wang ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 93
MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks
Lirong Che ⋅ Shuo Wen ⋅ Huang Shan ⋅ wang chuang ⋅ yuzhe yang ⋅ Gregory Dudek ⋅ Chuang Wang ⋅ Jian Su
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 94
RealAppiance: Let High-fidelity Appliance Assets Controllable and Workable as Aligned Real Manuals
Yuzheng Gao ⋅ Yuxing Long ⋅ Lei Kang ⋅ Yuchong Guo ⋅ Ziyan Yu ⋅ Shangqing Mao ⋅ Jiyao Zhang ⋅ Ruihai Wu ⋅ Dongjiang Li ⋅ Hui Shen ⋅ Hao Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 95
ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
Zhuoyang Zhang ⋅ Shang Yang ⋅ Qinghao Hu ⋅ Luke J. Huang ⋅ James Hou ⋅ Yufei Sun ⋅ Yao Lu ⋅ Song Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 96
Affordance Field Intervention: Enabling VLAs to Escape Memory Traps in Robotic Manipulation
Siyu Xu ⋅ Zijian Wang ⋅ Yunke Wang ⋅ Chenghao Xia ⋅ Tao Huang ⋅ Chang Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 97
MERIT: Multi-domain Efficient RAW Image Translation
Wenjun Huang ⋅ Shenghao Fu ⋅ Yian Jin ⋅ Yang Ni ⋅ Ziteng Cui ⋅ Hanning Chen ⋅ Yirui He ⋅ Yezi Liu ⋅ Sanggeon Yun ⋅ SungHeon Jeong ⋅ Ryozo Masukawa ⋅ William Youngwoo Chung ⋅ Mohsen Imani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 98
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
Yusu Qian ⋅ Eli Bocek-Rivele ⋅ Liangchen Song ⋅ Jialing Tong ⋅ Yinfei Yang ⋅ Jiasen Lu ⋅ Wenze Hu ⋅ Zhe Gan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 99
Probabilistic Prompt Adaptation for Unified Image Aesthetics and Quality Assessment
Takayuki Hara ⋅ Yuya Otsuka
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 100
EMMA: Concept Erasure Benchmark with Comprehensive Semantic Metrics and Diverse Categories
Lu Wei ⋅ Yuta Nakashima ⋅ Noa Garcia
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 101
Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity
Zhengyao Fang ⋅ Zexi Jia ⋅ Yijia Zhong ⋅ Pengcheng Luo ⋅ Jinchao Zhang ⋅ Guangming Lu ⋅ Jun Yu ⋅ Wenjie Pei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 102
WiseEdit: Benchmarking Cognition- and Creativity-Informed Image Editing
Kaihang Pan ⋅ Weile Chen ⋅ Haiyi Qiu ⋅ Qifan Yu ⋅ Wendong Bu ⋅ zehan wang ⋅ Yun Zhu ⋅ Juncheng Li ⋅ Siliang Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 103
UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits
Keming Ye ⋅ Zhipeng Huang ⋅ Canmiao Fu ⋅ Qingyang Liu ⋅ Jiani Cai ⋅ Zheqi Lv ⋅ Chen Li ⋅ Jing LYU ⋅ Zhou Zhao ⋅ Shengyu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 104
Inter-Edit: First Benchmark for Interactive Instruction-Based Image Editing
Delong Liu ⋅ Haotian Hou ⋅ Zhaohui Hou ⋅ Zhiyuan Huang ⋅ Shihao Han ⋅ Mingjie Zhan ⋅ Zhicheng Zhao ⋅ Fei Su
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 105
PR-IQA: Partial-Reference Image Quality Assessment for Diffusion-Based Novel View Synthesis
Inseong Choi ⋅ Siwoo Lee ⋅ Seung-Hun Nam ⋅ Soohwan Song
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 106
LumiMotion: Improving Gaussian Relighting with Scene Dynamics
Joanna Kaleta ⋅ Piotr Wójcik ⋅ Kacper Marzol ⋅ Tomasz Trzciński ⋅ Kacper Kania ⋅ Marek Kowalski
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 107
Let it Snow! Animating 3D Gaussian Scenes with Dynamic Weather Effects via Physics-Guided Score Distillation
Gal Fiebelman ⋅ Hadar Averbuch-Elor ⋅ Sagie Benaim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 108
iLRM: An Iterative Large 3D Reconstruction Model
Gyeongjin Kang ⋅ Seungtae Nam ⋅ Seung kwon Yang ⋅ Xiangyu Sun ⋅ Sameh Khamis ⋅ Abdelrahman Mohamed ⋅ Eunbyung Park
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 109
MVInverse: Feed-forward Multiview Inverse Rendering in Seconds
Xiangzuo Wu ⋅ Chengwei Ren ⋅ Jun Zhou ⋅ Xiu Li ⋅ Yuan Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 110
From None to All: Self-Supervised 3D Reconstruction via Novel View Synthesis
Ranran Huang ⋅ Weixun Luo ⋅ Ye Mao ⋅ Krystian Mikolajczyk
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 111
MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification
Sangwoon Kwak ⋅ WEEYOUN KWON ⋅ Jun Young Jeong ⋅ Geonho Kim ⋅ Won-Sik Cheong ⋅ Jihyong Oh
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 112
Multi-view Pyramid Transformer: Look Coarser to See Broader
Gyeongjin Kang ⋅ Seung kwon Yang ⋅ Seungtae Nam ⋅ Younggeun Lee ⋅ Jungwoo Kim ⋅ Eunbyung Park
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 113
CaT-GS: Efficient 3DGS Rendering for Large Scale Scenes via Inter-frame Caching and Tile Scheduling
TingJia Zhang ⋅ Bo Chen ⋅ Shengzhong Liu ⋅ Fan Wu ⋅ Guihai Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 114
RL‑ScanIQA: Reinforcement-Learned Scanpaths for Blind 360° Image Quality Assessment
yujia wang ⋅ Yuyan Li ⋅ Jiuming Liu ⋅ Fang-Lue Zhang ⋅ Xinhu Zheng ⋅ Neil A. Dodgson
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 115
Benchmarking Endoscopic Surgical Image Restoration and Beyond
Jialun Pei ⋅ Diandian Guo ⋅ Donghui Yang ⋅ Zhixi Li ⋅ Yuxin Feng ⋅ Long Ma ⋅ Bo Du ⋅ Pheng-Ann Heng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 116
SDUIE: Semi-Supervised Diffusion for Underwater Image Enhancement with Quant-Text Dual Control
Xiaofeng Cong ⋅ Yu-Xin Zhang ⋅ Hao Shen ⋅ Yeying Jin ⋅ Junming Hou ⋅ Jie Gui
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 117
HiDRA: Hierarchical Degradation Representation and Adaptation with Generative Priors for Enhancing Infrared Vision
Zihang Chen ⋅ Zhu Liu ⋅ Changbo Yan ⋅ Jinyuan Liu ⋅ Risheng Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 118
BluRef: Unsupervised Image Deblurring with Dense-Matching References
Bang-Dang Pham ⋅ Anh Tran ⋅ Cuong Pham ⋅ Minh Nguyen Nguyen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 119
Bi-Bridge: Bidirectional Diffusion Bridges for Low-Light Image Enhancement
Zeyu Hua ⋅ HUI LI ⋅ Yu Wang ⋅ Song Wang ⋅ Congchao Zhu ⋅ Caixia Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 120
UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration
Zihan Cheng ⋅ Liangtai Zhou ⋅ Dian Chen ⋅ Ni Tang ⋅ Xiaotong Luo ⋅ Yuan Xie ⋅ Yanyun Qu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 121
MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator
Peiqing Yang ⋅ Shangchen Zhou ⋅ Kai Hao ⋅ Qingyi Tao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 122
SelfHVD: Self-Supervised Handheld Video Deblurring
Honglei Xu ⋅ Zhilu Zhang ⋅ Junjie Fan ⋅ Xiaohe Wu ⋅ Wangmeng Zuo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 123
Spatio-Temporal Difference Guided Motion Deblurring with the Complementary Vision Sensor
Yapeng Meng ⋅ Lin Yang ⋅ Yuguo Chen ⋅ Xiangru Chen ⋅ Taoyi Wang ⋅ Lijian Wang ⋅ Zheyu Yang ⋅ Yihan Lin ⋅ Rong Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 124
Learning Where to Look and How to Judge: Resolution-agnostic Image Quality Assessment with Quality-aware Saliency
Hakan Emre Gedik ⋅ Shashank Gupta ⋅ Alan C.
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 125
Bridging RGB and Hematoxylin Components: An Interleaved Guidance and Fusion Framework for Point Supervised Nuclei Segmentation
Zihan Huan ⋅ Xipeng Pan ⋅ Hualong Zhang ⋅ Siyang Feng ⋅ Rushi Lan ⋅ Huadeng Wang ⋅ Haoxiang Lu ⋅ Zhenbing Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 126
Virtual Nodes Guided Dynamic Graph Neural Network for Brain Tumor Segmentation with Missing Modalities
Sha Tao ⋅ Jiao PAN ⋅ Yu Guo ⋅ Chao Yao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 127
VoxTell: Free-Text Promptable Universal 3D Medical Image Segmentation
Maximilian Rokuss ⋅ Moritz Langenberg ⋅ Yannick Kirchhoff ⋅ Fabian Isensee ⋅ Benjamin Hamm ⋅ Constantin Ulrich ⋅ Sebastian Regnery ⋅ Lukas Bauer ⋅ Efthimios Katsigiannopulos ⋅ Tobias Norajitra ⋅ Klaus Maier-Hein
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 128
Photo-Guided Tooth Segmentation on 3D Oral Scan Model
Shaojie Zhuang ⋅ Guangshun Wei ⋅ Jiangxin He ⋅ Yuanfeng Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 129
Breaking the Continuum: Discrete Distribution Learning for Structural MRI Reconstruction
Tianle Lyu ⋅ Mengjingcheng Mo ⋅ Ting Wen ⋅ Zhen Song ⋅ Zinan Xiong ⋅ Yanjie Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 130
Uni-Hema: Unified Model for Digital Hematopathology
Abdul Rehman ⋅ Iqra Rasool ⋅ Ayisha Imran ⋅ Mohsen Ali ⋅ Waqas Sultani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 131
Post-training Feature Pruning for Fundus Images Classification
Van-Nguyen Pham ⋅ Duc-Tai Le ⋅ Junghyun Bum ⋅ Hyunseung Choo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 132
Sketch2CT: Multimodal Diffusion for Structure-Aware 3D Medical Volume Generation
Delin An ⋅ Chaoli Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 133
SafeLogo: Turning Your Logos into Jailbreak Shields via Micro-Regional Adversarial Training
Zhiyi Duan ⋅ Xiaoyue Zhang ⋅ Tianxing Man
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 134
Anti-I2V: Safeguarding your Photos from Malicious Image-to-video Generation
Hong Duc Vu ⋅ Anh Nguyen ⋅ Chi Tran ⋅ Anh Tran
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 135
UniGame: Turning a Unified Multimodal Model Into Its Own Adversary
Zhaolong Su ⋅ Wang Lu ⋅ Hao Chen ⋅ Yixuan Li ⋅ Jindong Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 136
Hierarchically Robust Zero-shot Vision-language Models
Junhao Dong ⋅ Yifei Zhang ⋅ Hao Zhu ⋅ Yew-Soon Ong ⋅ Piotr Koniusz
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 137
Beyond Text Prompts: Precise Concept Erasure through Text–Image Collaboration
Jun Li ⋅ Lizhi Xiong ⋅ Ziqiang Li ⋅ Weiwei Jiang ⋅ Zhangjie Fu ⋅ Yong Li ⋅ Guo-Sen Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 138
AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions
Zonghao Ying ⋅ Le Wang ⋅ Yisong Xiao ⋅ Jiakai Wang ⋅ Yuqing Ma ⋅ Jinyang Guo ⋅ Zhenfei Yin ⋅ Mingchuan Zhang ⋅ Aishan Liu ⋅ Xianglong Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 139
ReMoE: Region-Mixture Experts for Adversarially-Robust Vision Transformers
Qinghao Zhong ⋅ Bingzhi Chen ⋅ Yishu Liu ⋅ Minhua Lu ⋅ Guangming Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 140
TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration
Chunxiao Li ⋅ Lijun Li ⋅ Jing Shao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 141
SO-Bench: A Structural Output Evaluation of Multimodal LLM
Di Feng ⋅ Kaixin Ma ⋅ Feng Nan ⋅ Haofeng Chen ⋅ Bohan Zhai ⋅ David Griffiths ⋅ Mingfei Gao ⋅ Zhe Gan ⋅ Eshan Verma ⋅ Yinfei Yang ⋅ Zhifeng Chen ⋅ Afshin Dehghan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 142
Chain-of-Thought Guided Multi-Modal Object Re-Identification
Ya Gao ⋅ Shihao Li ⋅ ZhaoJun Liu ⋅ AIHUA ZHENG ⋅ Chenglong Li ⋅ Jin Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 143
When Lines Meet Textures: Spatial-Frequency Aligned Diffusion Features for Cross-Sparsity Correspondence
Mingrui Zhu ⋅ Fengzhi Wang ⋅ Xin Wei ⋅ Jun Wang ⋅ Nannan Wang ⋅ Xinbo Gao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 144
CountGD++: Generalized Prompting for Open-World Counting
Niki Amini-Naieni ⋅ Andrew Zisserman
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 145
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
Yuxin Guo ⋅ Teng Wang ⋅ Yuying Ge ⋅ Shijie Ma ⋅ Yixiao Ge ⋅ Wei Zou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 146
Parameter-Efficient Adaptation for MLLMs via Implicit Modality Decomposition
Mingfang Zhang ⋅ Yunhong Wang ⋅ Lu Wang ⋅ Jiaxin Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 147
Hyperbolic Gramian Volumes for Multimodal Alignment
Saiyang Na ⋅ Feng Jiang ⋅ Qifeng Zhou ⋅ Wenliang Zhong ⋅ Thao M. Dang ⋅ Yuzhi Guo ⋅ Hehuan Ma ⋅ Chunyuan Li ⋅ Weizhi An ⋅ Junzhou Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 148
Venus: Benchmarking and Empowering Multimodal Large Language Models for Aesthetic Guidance and Cropping
Tianxiang Du ⋅ Hulingxiao He ⋅ Yuxin Peng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 149
AutoCut: End-to-end advertisement video editing based on multimodal discretization and controllable generation
Milton Zhou ⋅ Sizhong Qin ⋅ Yongzhi Li ⋅ Quan Chen ⋅ Peng Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 150
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
Anh Quan Cao ⋅ Ivan Lopes ⋅ Raoul de Charette
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 151
CaReFlow: Cyclic Adaptive Rectified Flow for Multimodal Fusion
Sijie Mai ⋅ Shiqin Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 152
Lenses: Toward Polysemous Vision–Language Understanding
Hani Alomari ⋅ Ali Asgarov ⋅ Chris Thomas
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 153
CoRiM: Conflict-driven Risk Minimization for Dynamic Multimodal Fusion
shihao Zou ⋅ Wei Wei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 154
Uncertainty-Aware Exploratory Direct Preference Optimization for Multimodal Large Language Models
Huatian Zhang ⋅ Zhendong Mao ⋅ Lei Zhang ⋅ Yongdong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 155
CICA: Coupling Confidence-Aware Pretraining with Confidence-Informed Attention for Robust Multimodal Sentiment Analysis
Haoyu Jiang ⋅ Xiaoliang Chen ⋅ Duoqian Miao ⋅ Xiaolin Qin ⋅ Xianyong Li ⋅ Yajun Du
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 156
SAMTok: Representing Any Mask with Two Words
yikang zhou ⋅ Tao Zhang ⋅ Dengxian Gong ⋅ Yuanzheng Wu ⋅ Ye Tian ⋅ Haochen Wang ⋅ Haobo Yuan ⋅ Jiacong Wang ⋅ Lu Qi ⋅ Hao Fei ⋅ Shunping Ji ⋅ Anran Wang ⋅ Zhuochen Wang ⋅ Yujing Wang ⋅ Cheng CHEN ⋅ Xiangtai Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 157
Multi-Metric Representation Learning Strategy Based on Clustering for Fine-Grained Multimodal Sentiment Analysis
Yidan Wang ⋅ Zongheng Wang ⋅ Hongjie Xing ⋅ Chunguo Li ⋅ Xiaoxiao Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 158
Cinematic Audio Source Separation Using Visual Cues
Kang Zhang ⋅ Suyeon Lee ⋅ Arda Senocak ⋅ Joon Chung
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 159
MMSD3.0: A Multi-Image Benchmark for Real-World Multimodal Sarcasm Detection
HAOCHEN ZHAO ⋅ Yuyao Kong ⋅ Yongxiu Xu ⋅ Gaopeng Gou ⋅ Hongbo Xu ⋅ Yubin Wang ⋅ Haoliang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 160
Anchor-Guided Gradient Alignment for Incomplete Multimodal Learning
Zhi-Hao Guan ⋅ Longfei Huang ⋅ Yang Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 161
PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation
Onkar Susladkar ⋅ Tushar Prakash ⋅ Adheesh Juvekar ⋅ Kiet A. Nguyen ⋅ Dong-Hwan Jang ⋅ Inderjit S Dhillon ⋅ Ismini Lourentzou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 162
VDE: Training-Free Accelerating Rectified Flow Model via Velocity Decomposition and Estimation
Junwen Tan ⋅ Jinglin Liang ⋅ Hongyuan Chen ⋅ Shuangping Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 163
Kontinuous Kontext: Continuous Strength Control for Instruction-based Image Editing
Rishubh Parihar ⋅ Or Patashnik ⋅ Daniil Ostashev ⋅ R. Venkatesh Babu ⋅ Daniel Cohen-Or ⋅ Kuan-Chieh Jackson Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 164
VideoCoF: Unified Video Editing with Temporal Reasoner
xiangpeng yang ⋅ Ji Xie ⋅ Yiyuan Yang ⋅ Yue Ma ⋅ Yan Huang ⋅ Min Xu ⋅ Qiang Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 165
Progressive Supernet Training for Efficient Visual Autoregressive Modeling
Xiaoyue Chen ⋅ Yuling Shi ⋅ kaiyuan Li ⋅ Huandong Wang ⋅ Yong Li ⋅ Xiaodong Gu ⋅ Xinlei Chen ⋅ Mingbao Lin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 166
CoT-Edit: Let CoT Guide Instruction Video Editing
Sen Liang ⋅ Fengbin Guan ⋅ Youliang Zhang ⋅ Xin Li ⋅ Zhibo Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 167
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Qingyan Bai ⋅ Qiuyu Wang ⋅ Hao Ouyang ⋅ Yue Yu ⋅ Hanlin Wang ⋅ Wen Wang ⋅ Ka Leong Cheng ⋅ Shuailei Ma ⋅ Yanhong Zeng ⋅ Zichen Liu ⋅ Yinghao Xu ⋅ Yujun Shen ⋅ Qifeng Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 168
Test-Time Instance-Specific Parameter Composition: A New Paradigm for Adaptive Generative Modeling
Minh-Tuan Tran ⋅ Xuan-May Le ⋅ Quan Hung Tran ⋅ Mehrtash Harandi ⋅ Dinh Phung ⋅ Trung Le
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 169
Understanding, Accelerating, and Improving MeanFlow Training
Jin-Young Kim ⋅ Hyojun Go ⋅ Lea Bogensperger ⋅ Julius Erbach ⋅ Nikolai Kalischek ⋅ Federico Tombari ⋅ Konrad Schindler ⋅ Dominik Narnhofer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 170
Meta-CoT: Enhancing Granularity and Generalization in Image Editing
Shiyi Zhang ⋅ YIJI CHENG ⋅ Tiankai Hang ⋅ Zijin Yin ⋅ Runze He ⋅ Yu Xu ⋅ Wenxun Dai ⋅ yunlong lin ⋅ Chunyu Wang ⋅ qinglin lu ⋅ Yansong Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 171
Dual-Granularity Memory for Efficient Video Generation
Hongjun Wang ⋅ Lin Liu ⋅ Jianguo Li ⋅ Tao Lin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 172
Unified Camera Positional Encoding for Controlled Video Generation
Cheng Zhang ⋅ Boying Li ⋅ Meng Wei ⋅ Yan-Pei Cao ⋅ Camilo Cruz Gambardella ⋅ Dinh Phung ⋅ Jianfei Cai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 173
EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing
Wei Chow ⋅ Linfeng Li ⋅ Lingdong Kong ⋅ Zefeng Li ⋅ Qi Xu ⋅ Hang Song ⋅ Tian Ye ⋅ Xian Wang ⋅ Jinbin Bai ⋅ Shilin Xu ⋅ Xiangtai Li ⋅ Junting Pan ⋅ Shaoteng Liu ⋅ Ran Zhou ⋅ Tianshu Yang ⋅ Songhua Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 174
MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene
wenjie mu ⋅ Zhan Li ⋅ Chuanzhou su ⋅ XUANYI SHEN ⋅ Ziniu Liu ⋅ Fan Lu ⋅ Yujian Mo ⋅ Junqiao Zhao ⋅ Tiantian Feng ⋅ chen ye ⋅ Guang Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 175
PLACID: Identity-Preserving Multi-Object Compositing via Video Diffusion with Synthetic Trajectories
Gemma Canet Tarrés ⋅ Manel Baradad ⋅ Francesc Moreno-Noguer ⋅ Yumeng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 176
Object-WIPER: Training-Free Object and Associated Effect Removal in Videos
Saksham Singh Kushwaha ⋅ Sayan Nag ⋅ Yapeng Tian ⋅ Kuldeep Kulkarni
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 177
Mobile-VTON: High-Fidelity On-Device Virtual Try-On
Zhenchen Wan ⋅ Ce Chen ⋅ Runqi Lin ⋅ Jiaxin Huang ⋅ Tianxi Chen ⋅ Yanwu Xu ⋅ Tongliang Liu ⋅ Mingming Gong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 178
Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
Joonhyung Park ⋅ Hyeongwon Jang ⋅ Joowon Kim ⋅ Eunho Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 179
Towards Robust Sequential Decomposition for Complex Image Editing
Zilai Zeng ⋅ Mingdeng Cao ⋅ Zijie Li ⋅ Xiaochen Lian ⋅ Yichun Shi ⋅ Peihao Zhu ⋅ Chen Sun ⋅ Peng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 180
Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection
Yawen Yang ⋅ Feng Li ⋅ Shuqi Kong ⋅ Yunfeng Diao ⋅ Xinjian Gao ⋅ Zenglin Shi ⋅ Meng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 181
Chain of Event-Centric Causal Thought for Physically Plausible Video Generation
Zixuan Wang ⋅ Yixin Hu ⋅ Haolan Wang ⋅ Feng Chen ⋅ Yan Liu ⋅ Wen Li ⋅ Yinjie Lei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 182
LoL: Longer than Longer, Scaling Video Generation to Hour
Jiaxing Cui ⋅ Jie Wu ⋅ Ming Li ⋅ Tao Yang ⋅ Xiaojie Li ⋅ Rui Wang ⋅ Andrew Bai ⋅ Yuanhao Ban ⋅ Cho-Jui Hsieh
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 183
FlowMotion: Training-Free Flow Guidance for Video Motion Transfer
Zhen Wang ⋅ Youcan Xu ⋅ Jun Xiao ⋅ Long Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 184
Learning Straight Flows: Variational Flow Matching for Efficient Generation
Chenrui Ma ⋅ Xi Xiao ⋅ Tianyang Wang ⋅ Xiao Wang ⋅ Yanning Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 185
SIGMA: Selective-Interleaved Generation with Multi-Attribute Tokens
Xiaoyan Zhang ⋅ Zechen Bai ⋅ Haofan Wang ⋅ Yiren Song
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 186
DNF-SR: Dual-Input and Negative-Aware Feature Fine-Tuning for Real-World Image Super-Resolution
Shuhao Han ⋅ Wenjie Liao ⋅ Hayden Vance ⋅ Hang Dong ⋅ Rui Zhang ⋅ Chunle Guo ⋅ Chongyi Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 187
IFCSR: Inference-Free Fidelity-Realism Control for One-Step Diffusion-based Real-World Image Super-Resolution
Jonghee Back ⋅ Jongju Kim ⋅ Jeong-Uk Kim ⋅ Eunjin Kim ⋅ Minyong Jeon
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 188
Edge-Focused Super-Resolution for Omnidirectional Images with Spherical Geometric Augmentation
Shaolin Wang ⋅ Yuying Li ⋅ Lei Zhong ⋅ Shigang Li ⋅ Jianfeng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 189
TUDSR: Twice Upsampling-Diffusion for Higher Super-Resolution
Zhiqiang Wu ⋅ Yitong Dong ⋅ Xian Wei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 190
PS-SR: Pseudo-Single-Step Video Super-Resolution via Speculative Diffusion
Aiqiu Wu ⋅ Zhaofan Qiu ⋅ Ting Yao ⋅ Tao Mei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 191
Disentangled Textual Priors for Diffusion-based Image Super-Resolution
Lei Jiang ⋅ Xin Liu ⋅ Xinze Tong ⋅ Zhiliang Li ⋅ Jie Liu ⋅ Jie Tang ⋅ Gangshan Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 192
Remote Sensing Image Super-Resolution for Imbalanced Textures: A Texture-Aware Diffusion Framework
Enzhuo Zhang ⋅ Sijie Zhao ⋅ Dilxat Muhtar ⋅ Zhenshi Li ⋅ Xueliang Zhang ⋅ Pengfeng Xiao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 193
Rethinking Diffusion Model-Based Video Super-Resolution: Leveraging Dense Guidance from Aligned Features
Jingyi Xu ⋅ Meisong Zheng ⋅ Ying Chen ⋅ Minglang Qiao ⋅ Xin Deng ⋅ Mai Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 194
DreamSR: Towards Ultra-High-Resolution Image Super-Resolution via a Receptive-Field Enhanced Diffusion Transformer
Qingji Dong ⋅ Hang Dong ⋅ Mingqin Chen ⋅ Rui Zhang ⋅ Yitong Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 195
FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution
Aro Kim ⋅ Myeongjin Jang ⋅ Chaewon Moon ⋅ Youngjin Shin ⋅ Jinwoo Jeong ⋅ Sang-hyo Park
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 196
STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution
Junyang Chen ⋅ Jiangxin Dong ⋅ Long Sun ⋅ Yixin Yang ⋅ Jinshan Pan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 197
Towards Highly-Constrained Human Motion Generation with Retrieval-Guided Diffusion Noise Optimization
Hanchao Liu ⋅ Fang-Lue Zhang ⋅ Shining Zhang ⋅ Tai-Jiang Mu ⋅ Shi-Min Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 198
Learning to Control Physically-simulated 3D Characters via Generating and Mimicking 2D Motions
Jianan Li ⋅ Xiao Chen ⋅ Tao Huang ⋅ Tien-Tsin Wong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 199
Human Geometry Distribution for 3D Animation Generation
Xiangjun Tang ⋅ Biao Zhang ⋅ Peter Wonka
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 200
A Temporal and Content Co-Awareness Latent Diffusion for Controllable Hand Image Generation
Shuang Hao ⋅ Pengfei Ren ⋅ Haifeng Sun ⋅ Ting Pan ⋅ Qi Qi ⋅ Lei Zhang ⋅ Cong Liu ⋅ Jianxin Liao ⋅ Jingyu Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 201
Superman: Unifying Skeleton and Vision for Human Motion Perception and Generation
Xinshun Wang ⋅ Peiming Li ⋅ Ziyi Wang ⋅ Zhongbin Fang ⋅ Zhichao Deng ⋅ Songtao Wu ⋅ Xiangtai Li ⋅ Mengyuan Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 202
Learning to Assist: Physics-Grounded Human-Human Control via Multi-Agent Reinforcement Learning
Yuto Shibata ⋅ Kashu Yamazaki ⋅ Lalit Jayanti ⋅ Yoshimitsu Aoki ⋅ Mariko Isogawa ⋅ Katerina Fragkiadaki
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 203
Stability-Driven Motion Generation for Object-Guided Human-Human Co-Manipulation
Jiahao Xu ⋅ Xiaohan Yuan ⋅ Xingchen Wu ⋅ Chongyang Xu ⋅ Kun Li ⋅ Buzhen Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 204
Causal Motion Diffusion Models for Autoregressive Motion Generation
Qing Yu ⋅ Akihisa Watanabe ⋅ Kent Fujiwara
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 205
Towards Storytelling Animations: Joint Synthesis of Human and Camera Motions
Boyuan Cheng ⋅ Yingjie Xi ⋅ Rui He ⋅ Jinhe Na ⋅ Ying Cao ⋅ Pengjie Wang ⋅ Jian Jun Zhang ⋅ Xiaosong Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 206
MoLingo: Motion–Language Alignment for Text-to-Human Motion Generation
Yannan He ⋅ Garvita Tiwari ⋅ Xiaohan Zhang ⋅ Pankaj Bora ⋅ Tolga Birdal ⋅ Jan Lenssen ⋅ Gerard Pons-Moll
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 207
End-to-End Language-Action Model for Humanoid Whole Body Control
Yuxuan Wang ⋅ Haobin Jiang ⋅ Shiqing Yao ⋅ Ziluo Ding ⋅ Zongqing Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 208
Toward Early Quality Assessment of Text-to-Image Diffusion Models
Huanlei Guo ⋅ Hongxin Wei ⋅ Bingyi Jing
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 209
CoD: A Diffusion Foundation Model for Image Compression
Zhaoyang Jia ⋅ Zihan Zheng ⋅ Naifu Xue ⋅ Jiahao Li ⋅ Bin Li ⋅ Zongyu Guo ⋅ Xiaoyi Zhang ⋅ Houqiang Li ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 210
Diffusion MRI Transformer with a Diffusion Space Rotary Positional Embedding (D-RoPE)
Gustavo Chau Loo Kung ⋅ Mohammad H. Abbasi ⋅ Camila Blank ⋅ Juze Zhang ⋅ Alan Q. Wang ⋅ Sophie Ostmeier ⋅ Akshay Chaudhari ⋅ Kilian Pohl ⋅ Ehsan Adeli
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 211
Language-Guided One-Step Diffusion Model for Nighttime Flare Removal
Aoxiang Ning ⋅ Kailong Yu ⋅ Minglong Xue ⋅ Liyuan Pan ⋅ Jinhong He ⋅ Wenchao Yan ⋅ Mingliang Zhou ⋅ Yirui Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 212
SpiralDiff: Spiral Diffusion with LoRA for RGB-to-RAW Conversion Across Cameras
Huanjing Yue ⋅ Shangbin Xie ⋅ Cong Cao ⋅ Qian Wu ⋅ Lei Zhang ⋅ Zhao Lei ⋅ Jingyu Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 213
PnP-CM: Consistency Models as Plug-and-Play Priors for Inverse Problems
Merve Gulle ⋅ junno yun ⋅ Yasar Utku Alcalar ⋅ Mehmet Akcakaya
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 214
Landscape-Awareness for Geometric View Diffusion Model
Yan-Ting Chen ⋅ Hao-Wei Chen ⋅ Tsu-Ching Hsiao ⋅ Chun-Yi Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 215
Otil: Accelerating Diffusion Model Inference via Communication-Efficient Multi-GPU Parallelism
Xin Li ⋅ Shujun Tian ⋅ Tao Lu ⋅ Han Bao ⋅ Zonghui Wang ⋅ Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 216
REACH: Explicit Recovery Behavior for Diffusion Policies
zundong Ke ⋅ Junlin Chen ⋅ Jiayi Zhu ⋅ Kuanhao Xia ⋅ Jiayuan Gu ⋅ boyi zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 217
OralGPT-Omni: A Versatile Dental Multimodal Large Language Model
JING HAO ⋅ Yuci Liang ⋅ Lizhuo Lin ⋅ Yuxuan Fan ⋅ Wenkai Zhou ⋅ Kaixin Guo ⋅ Zanting Ye ⋅ Yanpeng Sun ⋅ Xinyu Zhang ⋅ Yanqi Yang ⋅ Qiankun Li ⋅ Hao Tang ⋅ James Kit-Hon Tsoi ⋅ Linlin Shen ⋅ Kuo Feng Hung
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 218
CrossHOI-Bench: A Unified Benchmark for HOI Evaluation across Vision-Language Models and HOI-Specific Methods
Qinqian Lei ⋅ Bo Wang ⋅ Robby T. Tan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 219
The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition
Yuwen Tan ⋅ Yuan Qing ⋅ Boqing Gong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 220
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
Fenfen Lin ⋅ Yesheng Liu ⋅ Haiyu Xu ⋅ Yue Chen ⋅ Zheqi He ⋅ Mingxuan Zhao ⋅ Miguel Hu Chen ⋅ JG Yao ⋅ Xi Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 221
KαLOS finds Consensus: A Meta-Algorithm for Evaluating Inter-Annotator Agreement in Complex Vision Tasks
David Tschirschwitz ⋅ Volker Rodehorst
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 222
Beyond Single Images: A Comprehensive Benchmark for Album-Level Vision-Language Understanding
Shawn Huang ⋅ Brian Price ⋅ Yifei Fan ⋅ Bryan Morse
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 223
LIBERO-Plus: A Progressive Robustness Benchmark for Visual-Language-Action Models
Senyu Fei ⋅ Siyin Wang ⋅ Junhao Shi ⋅ Zihao Dai ⋅ Jikun Cai ⋅ Pengfang Qian ⋅ Li Ji ⋅ Xinzhe He ⋅ Shiduo Zhang ⋅ Zhaoye Fei ⋅ Jinlan Fu ⋅ Jingjing Gong ⋅ Xipeng Qiu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 224
Scenes as Tokens: Multi-Scale Normal Distributions Transform Tokenizer for General 3D Vision–Language Understanding
Yutao Tang ⋅ Cheng Zhao ⋅ Gaurav Mittal ⋅ Rohith Kukkala ⋅ Rama Chellappa ⋅ Cheng Peng ⋅ Mei Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 225
LangRef3DGS: Natural Language-Guided 3D Referential Segmentation from Partial Observations via 3D Gaussian Splatting
xulun ye ⋅ Qin Zhang ⋅ Kun Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 226
Hear you are: Teaching LLMs Spatial Reasoning with Vision and Spatial Sound
Hyeonggon Ryu ⋅ Joon Chung ⋅ David Harwath
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 227
EgoMind: Activating Spatial Cognition through Linguistic Reasoning in MLLMs
Zhenghao Chen ⋅ Huiqun Wang ⋅ Di Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 228
SAQN: Semantic-based Adaptive Query Network for 3D Referring Expression Segmentation
Jiale Huang ⋅ Shangfei Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 229
EagleVision: A Dual-Stage Framework with BEV-grounding-based Chain-of-Thought for Spatial Intelligence
Jiaxu Wan ⋅ Xu Wang ⋅ Mengwei Xie ⋅ Hang Zhang ⋅ Mu Xu ⋅ Yang Han ⋅ Ding Yuan ⋅ Hong Zhang ⋅ Yifan Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 230
Abstract 3D Perception for Spatial Intelligence in Vision-Language Models
Yifan Liu ⋅ Fangneng Zhan ⋅ Kaichen Zhou ⋅ Yilun Du ⋅ Paul Pu Liang ⋅ Hanspeter Pfister
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 231
PV-Ground: Text-Guided Point-Voxel Interaction for 3D Visual Grounding
Junpeng Shang ⋅ Feifei Shao ⋅ Jun Xiao ⋅ Lin Li ⋅ Hongwei Wang ⋅ Dongfang Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 232
Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding
Yerim Jeon ⋅ Miso Lee ⋅ WonJun Moon ⋅ Jae-Pil Heo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 233
SpatialStack: Layered Geometry-Language Fusion for 3D VLM Spatial Reasoning
Jian Zhang ⋅ Shijie Zhou ⋅ Bangya LIU ⋅ Achuta Kadambi ⋅ Zhiwen Fan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 234
Geometrically-Constrained Agent for Spatial Reasoning
Zeren Chen ⋅ Xiaoya Lu ⋅ Zhijie Zheng ⋅ Pengrui Li ⋅ Lehan He ⋅ Yijin Zhou ⋅ Jing Shao ⋅ Bohan Zhuang ⋅ Lu Sheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 235
PARSE: Part-Aware Relational Spatial Modeling
Yinuo Bai ⋅ Peijun Xu ⋅ Kuixiang Shao ⋅ Yuyang Jiao ⋅ Jingxuan Zhang ⋅ Kaixin Yao ⋅ Jiayuan Gu ⋅ Jingyi Yu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 236
R4: Retrieval-Augmented Reasoning for Vision-Language Models in 4D Spatio-Temporal Space
Tin Stribor Sohn ⋅ Maximilian Dillitzer ⋅ Jason J. Corso ⋅ Eric Sax
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 237
MCHDoc: A Comprehensive Benchmark for Reading Multi-Carrier Chinese Historical Documents
YiJun Sheng ⋅ Shipeng Zhu ⋅ Ruijia Zuo ⋅ Na Nie ⋅ Hui Xue
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 238
Cross-modal Fuzzy Alignment Network for Text-Aerial Person Retrieval and A Large-scale Benchmark
Yifei Deng ⋅ Chenglong Li ⋅ YUYANG ZHANG ⋅ Guyue Hu ⋅ Jin Tang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 239
CodeMMR: Bridging Natural Language, Code, and Image for Unified Retrieval
Jiahui Geng ⋅ Qing Li ⋅ Fengyu Cai ⋅ Fakhri Karray
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 240
DiT-Distill: Open-Set Fine-Grained Retrieval via Generative Curriculum Knowledge
Xin Jiang ⋅ Hao Tang ⋅ Meiqi Cao ⋅ Junyao Gao ⋅ Fei Shen ⋅ Zechao Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 241
ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval
tianyu yang ⋅ ChenWei He ⋅ xiangzhao hao ⋅ Tianyue Wang ⋅ Jiarui Guo ⋅ Haiyun Guo ⋅ Leigang Qu ⋅ Jinqiao Wang ⋅ Tat-seng Chua
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 242
Love Me, Love My Label: Rethinking the Role of Labels in Prompt Retrieval for Visual In-Context Learning
Tianci Luo ⋅ Haohao Pan ⋅ Jinpeng Wang ⋅ Niu Lian ⋅ Xinrui Chen ⋅ Bin Chen ⋅ Shu-Tao Xia ⋅ Chun Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 243
Rethinking BCE Loss for Multi-Label Image Recognition with Fine-Tuning
Ao Zhou ⋅ Zhiwei Jiang ⋅ Zifeng Cheng ⋅ Cong Wang ⋅ Yafeng Yin ⋅ Shufan Yang ⋅ Qing Gu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 244
CAST: Context-Aware Dynamic Latent Space Transformation for Interactive Text-to-Image Retrieval
Xuanzuo Lin ⋅ Min Zhang ⋅ Daizong Liu ⋅ Zhiwen Zuo ⋅ Xun Yang ⋅ Changting Lin ⋅ Xun Wang ⋅ Jianfeng Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 245
PriVi: Towards a General-Purpose Video Model for Primate Behavior in the Wild
Felix B. Mueller ⋅ Jan F. Meier ⋅ Timo Lüddecke ⋅ Richard Vogg ⋅ Roger L. Freixanet ⋅ Valentin Hassler ⋅ Tiffany Bosshard ⋅ Elif Karakoc ⋅ William O'Hearn ⋅ Sofia M. Pereira ⋅ Sandro Sehner ⋅ Kaja Wierucka ⋅ Judith Burkart ⋅ Claudia Fichtel ⋅ Julia Fischer ⋅ Alexander Gail ⋅ Catherine Hobaiter ⋅ Julia Ostner ⋅ Liran Samuni ⋅ Oliver Schülke ⋅ Neda Shahidi ⋅ Erin G. Wessling ⋅ Alexander S. Ecker
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 246
Seeing Conversations: Communication Context Identification in Egocentric Video
Tobias Dorszewski ⋅ Jens Hjortkjær
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 247
Interactive Episodic Memory with User Feedback
Nikesh Subedi ⋅ Loris Bazzani ⋅ Ziad Al-Halah
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 248
Seeing without Pixels: Perception from Camera Trajectories
Zihui Xue ⋅ Kristen Grauman ⋅ Dima Damen ⋅ Andrew Zisserman ⋅ Tengda Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 249
PFGNet: A Fully Convolutional Frequency-Guided Peripheral Gating Network for Efficient Spatiotemporal Predictive Learning
Xinyong Cai ⋅ Changbin Sun ⋅ Yong Wang ⋅ Hongyu Yang ⋅ Yuankai Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 250
Minerva-Ego: Spatiotemporal Hints for Egocentric Video Understanding
Arsha Nagrani ⋅ Jasper Uijlings ⋅ Shyamal Buch ⋅ Tobias Weyand ⋅ Sudheendra Vijayanarasimhan ⋅ Bo Hu ⋅ Ramin Mehran ⋅ David A. Ross ⋅ Cordelia Schmid
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 251
StreamRAG: Enhancing Real-Time Video Understanding with Retrieval Augmentation
Junlin Xie ⋅ Quanlong Zheng ⋅ Ruifei Zhang ⋅ Kuo Wang ⋅ Yanhao Zhang ⋅ Jinguo Luo ⋅ Haonan Lu ⋅ Xiang Wan ⋅ Guanbin Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 252
ViKey: Enhancing Temporal Understanding in Videos via Visual Prompting
Yeonkyung Lee ⋅ Dayun Ju ⋅ Youngmin Kim ⋅ seil kang ⋅ Seong Jae Hwang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 253
SkillSight: Efficient First-Person Skill Assessment with Gaze
Chi Hsuan Wu ⋅ Kumar Ashutosh ⋅ Kristen Grauman
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 254
BriMA: Bridged Modality Adaptation for Multi-Modal Continual Action Quality Assessment
Kanglei Zhou ⋅ Chang Li ⋅ Qingyi Pan ⋅ Liyuan Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 255
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
JUNHAO CHENG ⋅ Liang Hou ⋅ Xin Tao ⋅ Jing Liao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 256
MedLIME: A Distribution-Aligned and Evidence-Supported Framework for Medical Saliency Explanations
Raghav Magazine ⋅ Xingjian Li ⋅ Min Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 257
Inside-Out: Measuring Generalization in Vision Transformers Through Inner Workings
Yunxiang Peng ⋅ Mengmeng Ma ⋅ Ziyu Yao ⋅ Xi Peng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 258
Language Models Can Explain Visual Features via Steering
Javier Ferrando ⋅ Enrique Lopez-Cuena ⋅ Pablo Agustin Martin-Torres ⋅ Daniel Hinjos ⋅ Anna Arias Duart ⋅ Dario Garcia-Gasulla
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 259
Making the Classification Explanation Faithful to the Confidence Score
Jian-Xun Mi ⋅ Lu Pan ⋅ Weisheng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 260
Intrinsic Concept Extraction Based on Compositional Interpretability
Hanyu Shi ⋅ Hong Tao ⋅ Guoheng Huang ⋅ Jianbin Jiang ⋅ Xuhang Chen ⋅ Chi-Man Pun ⋅ Shanhu Wang ⋅ Pan Pan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 261
Attribution-Guided Model Rectification of Unreliable Neural Network Behaviors
Peiyu Yang ⋅ Naveed Akhtar ⋅ Jiantong Jiang ⋅ Ajmal Mian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 262
Measuring the (Un)Faithfulness of Concept-Based Explanations
Shubham Kumar ⋅ Narendra Ahuja
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 263
Deformation-based In-Context Learning for Point Cloud Understanding
Chengxing Lin ⋅ Jinhong Deng ⋅ Yinjie Lei ⋅ Wen Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 264
ELiC: Efficient LiDAR Geometry Compression via Cross-Bit-depth Feature Propagation and Bag-of-Encoders
Junsik Kim ⋅ Gun Bang ⋅ Soowoong Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 265
ESAM++: Efficient Online 3D Perception on the Edge
Qin Liu ⋅ Lavisha Aggarwal ⋅ Saptarashmi Bandyopadhyay ⋅ Vikas Bahirwani ⋅ Marc Niethammer ⋅ Ehsan Adeli ⋅ Andrea Colaco
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 266
DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration
Jiayi Li ⋅ Yuxin Yao ⋅ Qiuhang Lu ⋅ Juyong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 267
Hg-I2P: Bridging Modalities for Generalizable Image-to-Point-Cloud Registration via Heterogeneous Graphs
Pei An ⋅ Junfeng Ding ⋅ Jiaqi Yang ⋅ Yulong Wang ⋅ Jie Ma ⋅ Liangliang Nan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 268
Rethinking 2D-3D Registration: A Novel Network for High-Value Zone Selection and Representation Consistency Alignment
Zhixin Cheng ⋅ Bohao Liao ⋅ Jiacheng Deng ⋅ Xiaotian Yin ⋅ Xinjun Li ⋅ Yujia Chen ⋅ Baoqun Yin ⋅ Tianzhu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 269
Adaptive 3D Perception for Small Aerial Targets Under Sparse Sampling via Reinforcement Learning
Shenghai Yuan ⋅ Yihan Wei ⋅ Jason Yee ⋅ Zhuoran Qiao ⋅ boyang lou ⋅ Enwen Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 270
3D sans 3D Scans: Scalable Pre-training from Video-Generated Point Clouds
Ryousuke Yamada ⋅ Kohsuke Ide ⋅ Yoshihiro Fukuhara ⋅ Hirokatsu Kataoka ⋅ Gilles Puy ⋅ Andrei Bursuc ⋅ Yuki M Asano
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 271
StreamVLO: Streaming Visual–LiDAR Odometry with Cumulative Drift Compensation
Mengmeng Liu ⋅ Jiuming Liu ⋅ Michael Ying Yang ⋅ Chaokang Jiang ⋅ Jiangtao Li ⋅ Yunpeng Zhang ⋅ Hesheng Wang ⋅ Francesco Nex ⋅ Hao Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 272
Mamba Learns in Context: Structure-Aware Domain Generalization for Multi-Task Point Cloud Understanding
Jincen Jiang ⋅ Qianyu Zhou ⋅ Yuhang Li ⋅ Kui Su ⋅ Meili Wang ⋅ Jian Chang ⋅ Jian Jun Zhang ⋅ Xuequan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 273
Routing on Demand: DSNet for Efficient Progressive Point Cloud Denoising
Xiaoqian Cheng ⋅ Dong Xiao ⋅ Husen Li ⋅ Zheng Liu ⋅ Renjie Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 274
Hyper-PCN: Hypergraph-Based Point Cloud Completion via High-Order Correlation Modeling
Linfei Li ⋅ Pei Tan ⋅ Siqi Li ⋅ Changqing Zou ⋅ Yue Gao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 275
Towards Calibrating Prompt Tuning of Vision-Language Models
Ashshak Sharifdeen ⋅ Fahad Shamshad ⋅ Muhammad Akhtar Munir ⋅ Abhishek Basu ⋅ Mohamed Ismithdeen ⋅ Jeyapriyan Jeyamohan ⋅ Chathurika Silva ⋅ Karthik Nandakumar ⋅ Muhammad Haris Khan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 276
DEVA: Fine-tuning Multimodal Large Language Models for Visual Perception Tasks
Debasmit Das ⋅ Munawar Hayat ⋅ Fatih Porikli
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 277
LOREAL: Mitigating Low-Resolution Challenges in Vision-Language Models with Attribute-driven Prompt Self-Distillation
Xucong Wang ⋅ Pengkun Wang ⋅ Zhe Zhao ⋅ Liheng Yu ⋅ Rui Mao ⋅ Yang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 278
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning
Yanqing Liu ⋅ Xianhang li ⋅ Letian Zhang ⋅ Zirui Wang ⋅ Zeyu Zheng ⋅ Yuyin Zhou ⋅ Cihang Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 279
Language-guided Frequency Modulation for Large Vision-Language Models
Shuyi Ouyang ⋅ Gongfan Fang ⋅ Xinyin Ma ⋅ Yen-Wei Chen ⋅ Lanfen Lin ⋅ Xinchao Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 280
TANGO: Text-Anchored Guided Optimization for Robust Fine-tuning Vision-Language Models under Label Noise
Tengfei Ma ⋅ Weiran Pan ⋅ Wei Wei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 281
Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining
Weijun Zhuang ⋅ Yuqing Huang ⋅ Weikang Meng ⋅ Xin Li ⋅ Ming Liu ⋅ Xiaopeng Hong ⋅ Yaowei Wang ⋅ Wangmeng Zuo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 282
Reconstructing CLIP for Open-Vocabulary Dense Perception
Yajie Liu ⋅ Jinjin Zhang ⋅ Qingjie Liu ⋅ Di Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 283
DPL: Decoupled Prototype Learning for Enhancing Robustness of Vision–Language Transformers to Missing Modalities
Jueqing Lu ⋅ Yuanyuan Qi ⋅ Xiaohao Yang ⋅ Shuaicheng Niu ⋅ Fucai Ke ⋅ Shujie Zhou ⋅ Wei Tan ⋅ Jionghao Lin ⋅ Wray Buntine ⋅ Hamid Rezatofighi ⋅ Lan Du
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 284
BrepVGAE: Variational Graph Autoencoder with Unified Latent Representation for B-rep
Hao Guo ⋅ Liyuan Deng ⋅ Yongkang Dai ⋅ Ruohan Wang ⋅ Jiahao Li ⋅ Yunpeng Bai ⋅ Yilei Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 285
NeuROK: Generative 4D Neural Object Kinematics
Chen Geng ⋅ Guangzhao He ⋅ Yue Gao ⋅ Yunzhi Zhang ⋅ Shangzhe Wu ⋅ Jiajun Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 286
BrickNet: Graph-Backed Generative Brick Assembly
Peter Kulits ⋅ Cordelia Schmid
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 287
Unified Vector Floorplan Generation via Markup Representation
Kaede Shiohara ⋅ Toshihiko Yamasaki
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 288
CME-CAD: Heterogeneous Collaborative Multi-Expert Reinforcement Learning for CAD Code Generation
Ke Niu ⋅ Haiyang Yu ⋅ Zhuofan Chen ⋅ Zhengtao Yao ⋅ Weitao Jia ⋅ Xiaodong Ge ⋅ Jingqun Tang ⋅ Benlei Cui ⋅ Bin Li ⋅ Xiangyang Xue
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 289
Robo-SGG: Exploiting Layout-Oriented Normalization and Restitution Can Improve Robust Scene Graph Generation
Changsheng Lv ⋅ Zijian Fu ⋅ Mengshi Qi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 290
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens
Yiying Yang ⋅ Wei Cheng ⋅ Sijin Chen ⋅ Honghao Fu ⋅ Xianfang Zeng ⋅ Yujun Cai ⋅ Gang Yu ⋅ Xingjun Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 291
EpiAgent: An Agent-Centric System for Ancient Inscription Restoration
Shipeng Zhu ⋅ Ang Chen ⋅ Na Nie ⋅ Pengfei Fang ⋅ Min-Ling Zhang ⋅ Hui Xue
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 292
The Invisible Gorilla Effect in Out-of-distribution Detection
Harry Anthony ⋅ Ziyun Liang ⋅ Hermione Warr ⋅ Konstantinos Kamnitsas
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 293
Interpretable Debiasing of Vision-Language Models for Social Fairness
Na Min An ⋅ Yoonna Jang ⋅ Yusuke Hirota ⋅ Ryo Hachiuma ⋅ Isabelle Augenstein ⋅ Hyunjung Shim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 294
Image-based Outlier Synthesis With Training Data
Sudarshan Regmi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 295
SALMUBench: A Benchmark for Sensitive Association-Level Multimodal Unlearning
Cai Selvas-Sala ⋅ Lei Kang ⋅ Lluis Gomez
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 296
Scaling Test-Time Robustness of Vision-Language Models via Self-Critical Inference Framework
Kaihua Tang ⋅ JIAXIN QI ⋅ Jinli Ou ⋅ Yuhua Zheng ⋅ Jianqiang Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 297
When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm
Ye Leng ⋅ Junjie Chu ⋅ Mingjie Li ⋅ Chenhao Lin ⋅ Chao Shen ⋅ Michael Backes ⋅ Yun Shen ⋅ Yang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 298
IrisFP: Adversarial-Example-based Model Fingerprinting with Enhanced Uniqueness and Robustness
Ziye Geng ⋅ Guang Yang ⋅ Yihang Chen ⋅ Changqing Luo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 299
Mark4D: Temporally-Consistent Watermarking for 4D Gaussian Splatting
Jaejin Lee ⋅ Minjae Jeong ⋅ Joonhyuk Park ⋅ Yechan Hwang ⋅ Seunghun Baek ⋅ Won Hwa Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 300
Machine Unlearning via Adaptive Gradient Reweighting and Multi-stage Objective Optimization
Juxin Lu ⋅ Haoyu Shi ⋅ Mengyao Wang ⋅ Huaiwen Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 301
Taming Noise-Induced Prototype Degradation for Privacy-Preserving Personalized Federated Fine-Tuning
Yuhua Wang ⋅ Qinnan Zhang ⋅ Xiaodong Li ⋅ Huan Zhang ⋅ Yifan Sun ⋅ Wangjie Qiu ⋅ Hainan Zhang ⋅ Yongxin Tong ⋅ Zhiming Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 302
FedMOP: Achieving Enhanced Privacy and Performance in Federated Learning via Momentum Orthogonal Projection
Yunlong Zhao ⋅ Xiaoheng Deng ⋅ Hongyan Xu ⋅ Zhuohua Qiu ⋅ Xiaowen Hu ⋅ Shan You ⋅ Yi Chen ⋅ Chang Xu ⋅ Xiu Su
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 303
HFedATM: Hierarchical Federated Domain Generalization via Optimal Transport and Regularized Mean Aggregation
Thinh Nguyen ⋅ Le Trung Phan ⋅ Binh Nguyen ⋅ Khoa D Doan ⋅ KOK SENG WONG
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 304
Single-Round Scalable Analytic Federated Learning
Alan T. L. Bacellar ⋅ Mustafa Munir ⋅ Felipe M.G. França ⋅ Priscila Machado Vieira Lima ⋅ Radu Marculescu ⋅ Lizy Kurian John
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 305
Controllable Federated Prompt Learning at Test Time
Rui Zhu ⋅ Liang Bai ⋅ Yanming Guo ⋅ Yirun Ruan ⋅ Tianyuan Yu ⋅ Zhihe Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 306
FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning
Yuan Yao ⋅ Lixu Wang ⋅ Jiaqi Wu ⋅ Jin Song ⋅ Simin Chen ⋅ Zehua Wang ⋅ Zijian Tian ⋅ Wei Chen ⋅ Huixia Li ⋅ Xiaoxiao Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 307
Conversational Image Segmentation: Grounding Abstract Concepts with Scalable Supervision
Aadarsh Sahoo ⋅ Georgia Gkioxari
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 308
Spatial Matters: Position-Guided 3D Referring Expression Segmentation
Yabing Wang ⋅ Zhuotao Tian ⋅ Le Wang ⋅ Zheng Qin ⋅ Sanping Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 309
Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
Tianming Liang ⋅ Haichao Jiang ⋅ Yuting Yang ⋅ Chaolei Tan ⋅ Shuai Li ⋅ Wei-Shi Zheng ⋅ Jian-Fang Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 310
Refer-Agent: A Collaborative Multi-Agent System with Reasoning and Reflection for Referring Video Object Segmentation
Haichao Jiang ⋅ Tianming Liang ⋅ Wei-Shi Zheng ⋅ Jian-Fang Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 311
CaptionFormer: Unified Segmentation, Tracking, and Captioning for Spatio-Temporal Objects
Gabriel Fiastre ⋅ Antoine Yang ⋅ Cordelia Schmid
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 312
TransPrune: Token Transition Pruning for Efficient Large Vision-Language Model
Ao Li ⋅ Yuxiang Duan ⋅ Jinghui Zhang ⋅ Congbo Ma ⋅ Yutong Xie ⋅ Gustavo Carneiro ⋅ Mohammad Yaqub ⋅ Hu Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 313
QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models
Jingxuan Zhang ⋅ Yun-Ta Hsieh ⋅ Zhongwei Wan ⋅ Haokun Lin ⋅ Xin Wang ⋅ Ziqi Wang ⋅ Yingtie Lei ⋅ Mi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 314
Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach
Yaoxin Yang ⋅ Peng Ye ⋅ Xudong Tan ⋅ Chongjun Tu ⋅ Maosen Zhao ⋅ Jia Hao ⋅ Tao Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 315
Collaborative Multi-Mode Pruning for Vision-Language Models
Zimeng Wu ⋅ Yunhong Wang ⋅ Donghao Wang ⋅ Jiaxin Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 316
ZOO-Prune: Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models
Youngeun Kim ⋅ Youjia Zhang ⋅ Huiling Liu ⋅ Aecheon Jung ⋅ Sunwoo Lee ⋅ Sungeun Hong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 317
HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models
Qihui Zhu ⋅ Tao Zhang ⋅ yuchen wang ⋅ Shuangwu chen ⋅ Xiaobin Tan ⋅ Jian Yang ⋅ Yang Liu ⋅ Yinfei Pan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 318
CORE: Compact Object-centric REpresentations as a New Paradigm for Token Merging in LVLMs
Jingyu Lei ⋅ Gaoang Wang ⋅ Der-Horng Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 319
Imbalanced View Contribution Evaluation and Refinement for Deep Incomplete Multi-View Clustering
Taichun Zhou ⋅ Zhibin Dong ⋅ Hao Tan ⋅ Siwei Wang ⋅ Xinwang Liu ⋅ En Zhu ⋅ Di Hu ⋅ Tianrui Liu ⋅ chuankun Li ⋅ Kunlun He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 320
Multi-Hierarchical Contrastive Spectral Fusion for Multi-View Clustering
Bing Cai ⋅ Xiaoli Wang ⋅ Gui-Fu Lu ⋅ Zechao Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 321
SECOS: Semantic Capture for Rigorous Classification in Open-World Semi-Supervised Learning
Hezhao Liu ⋅ jiacheng yang ⋅ Junlong Gao ⋅ Mengke Li ⋅ Yiqun Zhang ⋅ Shreyank Gowda Gowda ⋅ Yang Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 322
Multi-Modal Representation Learning via Semi-Supervised Rate Reduction for Generalized Category Discovery
Wei He ⋅ Xianghan Meng ⋅ Zhiyuan Huang ⋅ Xianbiao Qi ⋅ Rong Xiao ⋅ CHUNGUANG LI
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 323
TimeBridge: Self-Supervised Video Representation Learning via Start-End Joint Embedding and In-Between Frame Prediction
Qin Wang ⋅ Abigail Morrison ⋅ Hanno Scharr ⋅ Kai Krajsek
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 324
Mitigating Instance Entanglement in Instance-Dependent Partial Label Learning
Rui Zhao ⋅ Bin Shi ⋅ Kai Sun ⋅ Bo Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 325
Residual Connections Harm Generative Representation Learning
Xiao Zhang ⋅ Ruoxi Jiang ⋅ William Gao ⋅ Rebecca Willet ⋅ Michael Maire
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 326
Neural Mixture Density Processes
yi ding ⋅ Qi Tao ⋅ Xingxing Liang ⋅ Longfei Zhang ⋅ Yiqin Lv ⋅ weitao song ⋅ Fangjie Yang ⋅ Qi Wang ⋅ Guangquan Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 327
Large-scale Robust Enhanced Ensemble Clustering via Outlier Decoupling
Jiaxuan Xu ⋅ Lei Duan ⋅ Xinye Wang ⋅ Liang Du
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 328
DriveLaW: Unifying Planning and Video Generation in a Latent Driving World
Tianze Xia ⋅ Yongkang Li ⋅ Lijun Zhou ⋅ Jingfeng Yao ⋅ Kaixin Xiong ⋅ Haiyang Sun ⋅ Bing Wang ⋅ Kun Ma ⋅ Guang Chen ⋅ Hangjun Ye ⋅ Wenyu Liu ⋅ Xinggang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 329
DLWM: Dual Latent World Models enable Holistic Gaussian-centric Pre-training in Autonomous Driving
Yiyao Zhu ⋅ Ying Xue ⋅ Haiming Zhang ⋅ Guangfeng Jiang ⋅ Wending Zhou ⋅ Xu Yan ⋅ Jiantao Gao ⋅ Yingjie CAI ⋅ Bingbing Liu ⋅ Zhen Li ⋅ Shaojie Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 330
Latent Chain-of-Thought World Modeling for End-to-End Driving
Shuhan Tan ⋅ Kashyap Chitta ⋅ Yuxiao Chen ⋅ Thomas Tian ⋅ Yurong You ⋅ Yan Wang ⋅ Wenjie Luo ⋅ Yulong Cao ⋅ Philipp Krähenbühl ⋅ Marco Pavone ⋅ Boris Ivanovic
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 331
RLFTSim: Realistic and Controllable Multi-Agent Traffic Simulation via Reinforcement Learning Fine-Tuning
Ehsan Ahmadi ⋅ Hunter Schofield ⋅ Behzad Khamidehi ⋅ Fazel Arasteh ⋅ Jinjun Shan ⋅ Lili Mou ⋅ Dongfeng Bai ⋅ Kasra Rezaee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 332
TrafficAlign: Aligning Large Language Models for Traffic Scenario Generation
Zhi Tu ⋅ Liangkun Niu ⋅ Tianyi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 333
Failure Modes for Deep Learning–Based Online Mapping: How to Measure and Address Them
Michael Hubbertz ⋅ Qi Han ⋅ Tobias Meisen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 334
Linking Modality Isolation in Heterogeneous Collaborative Perception
Changxing Liu ⋅ Zichen Chao ⋅ Siheng Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 335
LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving
Long Nguyen ⋅ Micha Fauth ⋅ Bernhard Jaeger ⋅ Daniel Dauner ⋅ Maximilian Igl ⋅ Andreas Geiger ⋅ Kashyap Chitta
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 336
DriverGaze360: OmniDirectional Driver Attention with Object-Level Guidance
Shreedhar Govil ⋅ Didier Stricker ⋅ Jason Rambach
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 337
Diffusion Forcing Planner: History-Annealed Planning with Time-Dependent Guidance for Autonomous Driving
Zehan Zhang ⋅ Yaoyi Li ⋅ Neng Zhang ⋅ Jia Cai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 338
DIMOS: Disentangling Instance-level Moving Object Segmentation
Hongxiang HUANG ⋅ Hongwei Ren ⋅ Xiaopeng LIN ⋅ Yulong Huang ⋅ Zeke Xie ⋅ Bojun Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 339
EvObj: Learning Evolving Object-centric Representations for 3D Instance Segmentation without Scene Supervision
Jiahao Chen ⋅ Zihui Zhang ⋅ Yafei Yang ⋅ Jinxi Li ⋅ Shenxing Wei ⋅ Zhixuan Sun ⋅ Bo Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 340
Live Interactive Training for Video Segmentation
Xinyu Yang ⋅ Haozheng Yu ⋅ Yihong Sun ⋅ Bharath Hariharan ⋅ Jennifer J. Sun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 341
Robust Promptable Video Object Segmentation
Sohyun Lee ⋅ Yeho Gwon ⋅ Lukas Hoyer ⋅ Konrad Schindler ⋅ Christos Sakaridis ⋅ Suha Kwak
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 342
Scene-VLM: Multimodal Video Scene Segmentation via Vision-Language Models
Nimrod Berman ⋅ Adam Botach ⋅ Emanuel Ben-Baruch ⋅ Shunit Haviv Hakimi ⋅ Asaf Gendler ⋅ Ilan Naiman ⋅ Erez Yosef ⋅ Igor Kviatkovsky
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 343
Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation
Minho Park ⋅ Sunghyun Park ⋅ Jungsoo Lee ⋅ Hyojin Park ⋅ Kyuwoong Hwang ⋅ Fatih Porikli ⋅ Jaegul Choo ⋅ Sungha Choi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 344
BEV-CAR: Enhancing Monocular Bird’s Eye View Segmentation with Context-Aware Rasterization
Yixin Xiong ⋅ Ke Wang ⋅ Tongtong Cheng ⋅ Chunhui Liu ⋅ Kai Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 345
Exploring the Underwater World Segmentation without Extra Training
Bingyu Li ⋅ Tao Huo ⋅ Da Zhang ⋅ Zhiyuan Zhao ⋅ Junyu Gao ⋅ Xuelong Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 346
Learning from Oblivion: Predicting Knowledge-Overflowed Weights via Retrodiction of Forgetting
Jinhyeok Jang ⋅ Jaehong Kim ⋅ Jung Uk Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 347
Cross-Architecture Adaptation: Cloud-Edge Continual Test-Time Adaptation with Dynamic Sampling and Heterogeneous Distillation
Zirui Xu ⋅ Xianhang Chu ⋅ Jiahao Li ⋅ Xu Yang ⋅ Cheng Deng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 348
Towards Dynamic Modality Alignment in Multimodal Continual Learning
Jiayao Tan ⋅ Fan Lyu ⋅ Tianle Liu ⋅ Fuyuan Hu ⋅ Wei Feng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 349
ϕ-DPO: Fairness Direct Preference Optimization Approach to Continual Learning in Large Multimodal Models
Thanh-Dat Truong ⋅ Huu-Thien Tran ⋅ Jackson Cothren ⋅ Bhiksha Raj ⋅ Khoa Luu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 350
Incremental Object Detection via Future-Aware Decoupled Cross-Head Distillation
Chenfeng Yin ⋅ De Cheng ⋅ Wenlong Luo ⋅ Mingyue Zeng ⋅ Shizhou Zhang ⋅ Nannan Wang ⋅ Xinbo Gao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 351
Smart Replay: Adaptive Scheduling of Memory Rehearsal for Computational Resource-Aware Incremental Learning
Jianting CHEN ⋅ Dianzhi Yu ⋅ Irwin King
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 352
ReBaPL: Repulsive Bayesian Prompt Learning
Yassir Bendou ⋅ Omar Ezzahir ⋅ Remove middle name Fernandes ⋅ Gabriel Mahuas ⋅ Victoria Shevchenko ⋅ Mike Gartrell
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 353
Spectral Mixture-of-Experts for Continual Learning
Chen Yin ⋅ Xingbo Dong ⋅ Xuelin Shen ⋅ Zhe Jin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 354
ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars
Ziqiao Peng ⋅ Yi Chen ⋅ Yifeng Ma ⋅ Guozhen Zhang ⋅ Zhiyao Sun ⋅ Zixiang Zhou ⋅ Youliang Zhang ⋅ zhengguang zhou ⋅ Zhaoxin Fan ⋅ Hongyan Liu ⋅ Yuan Zhou ⋅ qinglin lu ⋅ Jun He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 355
ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body
Juze Zhang ⋅ Changan Chen ⋅ Xin Chen ⋅ Heng Yu ⋅ Tiange Xiang ⋅ Ali Khan ⋅ Shrinidhi K. Lakshmikanth ⋅ Ehsan Adeli
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 356
DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations
Yuxiang Shi ⋅ Zhe Li ⋅ Yanwen Wang ⋅ Hao Zhu ⋅ Xun Cao ⋅ Ligang Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 357
SketchFaceGS: Real-Time Sketch-Driven Face Editing and Generation with Gaussian Splatting
Bo Li ⋅ Jiahao Kang ⋅ Yubo Ma ⋅ Feng-Lin Liu ⋅ Bin Liu ⋅ Fang-Lue Zhang ⋅ Lin Gao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 358
MIBURI: Towards Expressive Interactive Gesture Synthesis
M. Hamza Mughal ⋅ Rishabh Dabral ⋅ Vera Demberg ⋅ Christian Theobalt
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 359
Personalized Image Descriptions from Attention Sequences
Ruoyu Xue ⋅ Hieu Le ⋅ Jingyi Xu ⋅ Sounak Mondal ⋅ Abe Leite ⋅ Gregory Zelinsky ⋅ Minh Nguyen Nguyen ⋅ Dimitris Samaras
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 360
GA-VLN: Geometry-Aware BEV Representation for Efficient Vision-Language Navigation
Jiahao Yang ⋅ Zihan Wang ⋅ Xiangyang Li ⋅ Xing Zhu ⋅ Yujun Shen ⋅ Yinghao Xu ⋅ Shuqiang Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 361
IMAIA: Interactive Maps AI Assistant for Travel Planning and Geo-Spatial Intelligence
Jieren Deng ⋅ Zhizhang Hu ⋅ Ziyan He ⋅ Aleksandar Cvetkovic ⋅ Pak Kiu Chung ⋅ Dragomir Yankov ⋅ Chiqun Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 362
OctoNav: Towards Generalist Embodied Navigation
Chen Gao ⋅ Liankai Jin ⋅ Xingyu Peng ⋅ Jiazhao Zhang ⋅ Yue Deng ⋅ Annan Li ⋅ He Wang ⋅ Si Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 363
WalkGPT: Grounded Vision–Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation
Rafi Ibn Sultan ⋅ Hui Zhu ⋅ Xiangyu Zhou ⋅ Chengyin Li ⋅ Prashant Khanduri ⋅ Marco Brocanelli ⋅ Dongxiao Zhu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 364
SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving
Peizheng Li ⋅ Zhenghao Zhang ⋅ David Holtz ⋅ Hang Yu ⋅ Yutong Yang ⋅ Yuzhi Lai ⋅ Rui Song ⋅ Andreas Geiger ⋅ Andreas Zell
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 365
SMAP: Semantic Route Planning with Map-Grounded Multimodal Alignment
Wenjie Zhang ⋅ Chen Yang ⋅ Xin Lu ⋅ Zhen Wang ⋅ Yue Liu ⋅ Bobo Xi ⋅ Pengbo Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 366
IDperturb: Enhancing Variation in Synthetic Face Generation via Angular Perturbations
Fadi Boutros ⋅ Eduarda Caldeira ⋅ Tahar Chettaoui ⋅ Naser Damer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 367
Fresco: Frequency–Spatial Consistent Optimization for Fine-Grained Head Avatar Modeling
Shikun Zhang ⋅ Yong Li ⋅ Yiqun Wang ⋅ Qiuhong Ke ⋅ Cunjian Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 368
Motion-Aware Animatable Gaussian Avatars Deblurring
Muyao Niu ⋅ Yifan Zhan ⋅ Qingtian Zhu ⋅ Zhuoxiao Li ⋅ Wei Wang ⋅ Zhihang Zhong ⋅ Xiao Sun ⋅ Yinqiang Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 369
ELITE: Efficient Gaussian Head Avatar from a Monocular Video via Learned Initialization and Test-time Generative Adaptation
Kim Youwang ⋅ Lee Hyoseok ⋅ Park Subin ⋅ Gerard Pons-Moll ⋅ Tae-Hyun Oh
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 370
Multi-view Consistent 3D Gaussian Head Avatars 'without' Multi-view Generation
Aviral Chharia ⋅ Fernando De la Torre
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 371
MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models
Sang Yun Chung ⋅ Se Yeon Kim ⋅ Youngchae Chee ⋅ Yong Man Ro
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 372
Cross-Modal Attention Calibration for LVLM Hallucination Mitigation
Jiaming Li ⋅ Jiacheng Zhang ⋅ Zequn Jie ⋅ Lin Ma ⋅ Ming Li ⋅ Xiaonan Luo ⋅ Guanbin Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 373
3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding
Makanjuola Ogunleye ⋅ Eman Abdelrahman ⋅ Ismini Lourentzou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 374
Exposing and Evaluating Hallucinations for GUI Grounding
Zicheng Zhang ⋅ Hongyi Jing ⋅ Rui Lv ⋅ Shuo Fang ⋅ Shiai Zhu ⋅ Junying Wang ⋅ Chunyi Li ⋅ Xiaohong Liu ⋅ Chenguang Ma ⋅ Guangtao Zhai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 375
Understanding and Mitigating Hallucinations in Multimodal Chain-of-Thought Models
Ji Ma ⋅ Wei Suo ⋅ Peng Wang ⋅ Yanning Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 376
Beyond the Global Scores: Fine-Grained Token Grounding as a Robust Detector of LVLM Hallucinations
Tuan Dung Nguyen ⋅ Minh Khoi Ho ⋅ Qi Chen ⋅ Yutong Xie ⋅ Cam-Tu Nguyen ⋅ Minh Khoi Nguyen ⋅ Dang Huy Pham Nguyen ⋅ Anton van den Hengel ⋅ Johan Verjans ⋅ Le Nguyen ⋅ Vu Minh Hieu Phan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 377
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Ke Xing ⋅ Longfei Li ⋅ Yuyang Yin ⋅ Hanwen Liang ⋅ Guixun Luo ⋅ Chen Fang ⋅ Jue Wang ⋅ Konstantinos N. Plataniotis ⋅ Xiaojie Jin ⋅ Yao Zhao ⋅ Yunchao Wei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 378
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout
Hidir Yesiltepe ⋅ Tuna Han Salih Meral ⋅ Adil Kaan Akan ⋅ Kaan Oktay ⋅ Pinar Yanardag
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 379
AniMimic: Imitating 3D Animation from Video Priors
Tianyi Xie ⋅ Yunuo Chen ⋅ Yaowei Guo ⋅ Yin Yang ⋅ Bolei Zhou ⋅ Demetri Terzopoulos ⋅ Ying Jiang ⋅ Chenfanfu Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 380
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Sixiao Zheng ⋅ Minghao Yin ⋅ Wenbo Hu ⋅ Xiaoyu Li ⋅ Ying Shan ⋅ Yanwei Fu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 381
ScenDi: 3D-to-2D Scene Diffusion Cascades for Urban Generation
Hanlei Guo ⋅ Jiahao Shao ⋅ Xinya Chen ⋅ Xiyang Tan ⋅ Sheng Miao ⋅ Yujun Shen ⋅ Yiyi Liao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 382
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
Ruijie Zhu ⋅ Jiahao Lu ⋅ Wenbo Hu ⋅ Xiaoguang Han ⋅ Jianfei Cai ⋅ Ying Shan ⋅ Chuanxia Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 383
GeodesicNVS: Probability Density Geodesic Flow Matching for Novel View Synthesis
Xuqin Wang ⋅ Tao Wu ⋅ Yanfeng Zhang ⋅ Lu Liu ⋅ Mingwei Sun ⋅ Yongliang Wang ⋅ Niclas Zeller ⋅ Daniel Cremers
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 384
WorldStereo: Bridging Controllable Video Generation and Scene Reconstruction via 3D Geometric Memories
Yisu Zhang ⋅ Chenjie Cao ⋅ Tengfei Wang ⋅ Xuhui Zuo ⋅ Junta Wu ⋅ Jianke Zhu ⋅ Chunchao Guo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 385
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Yuxue Yang ⋅ Lue Fan ⋅ Ziqi Shi ⋅ Junran Peng ⋅ Feng Wang ⋅ Zhaoxiang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 386
Taming Video Models for 3D and 4D Generation via Zero-Shot Camera Control
Chenxi Song ⋅ Yanming Yang ⋅ Tong Zhao ⋅ Ruibo Li ⋅ Chi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 387
Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance
William June Suk Choi ⋅ Kyungmin Lee ⋅ Sihyun Yu ⋅ Yisol Choi ⋅ Jinwoo Shin ⋅ Kimin Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 388
SANER: Switchable Adapter with Non-parametric Enhanced Routing for Person De-Reidentification
Yimin Liu ⋅ Nan Pu ⋅ Fengxiang Yang ⋅ Wenjing Li ⋅ Zhihui Li ⋅ Zhun Zhong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 389
BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification
Haoxuan Xu ⋅ Guanglin Niu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 390
Vision-Language Attribute Disentanglement and Reinforcement for Lifelong Person Re-Identification
Kunlun Xu ⋅ Haotong Cheng ⋅ Jiangmeng Li ⋅ Xu Zou ⋅ Jiahuan Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 391
Diversity over Uniformity: Rethinking Representation in Generated Image Detection
Qinghui He ⋅ Haifeng Zhang ⋅ Qiao Qin ⋅ Bo Liu ⋅ Xiuli Bi ⋅ Bin Xiao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 392
Mining Instance-Centric Vision–Language Contexts for Human–Object Interaction Detection
Soo Won Seo ⋅ Kyungchae Lee ⋅ Hyungchan Cho ⋅ Taein Son ⋅ Nam Ik Cho ⋅ Jun Won Choi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 393
FSLoRA: Harmonizing Detection and Re-Identification via Freq-Spatial Low-Rank Adapter for One-Stage Person Search
Yanling TIAN ⋅ Shanshan Zhang ⋅ Di Chen ⋅ Jian Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 394
EEGiT: Teaching Vision Transformers to Understand the EEG signal
Jiahao Zhou ⋅ Chenghao Xu ⋅ Wei Wang ⋅ Erkun Yang ⋅ Cheng Deng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 395
FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual Prompts
Xin Xu ⋅ Weilong Li ⋅ Wei Liu ⋅ Wenke Huang ⋅ Zhixi Yu ⋅ Bin Yang ⋅ Xiaoying Liao ⋅ Kui Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 396
Pose-guided Enriched Feature Learning for Federated-by-camera Person Re-identification
JooHyung Oh ⋅ Minyoung Oh ⋅ Sung Whan Yoon ⋅ Jae-Young Sim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 397
UAV-CB: A Complex-Background RGB–T Dataset and Local Frequency Bridge Network for UAV Detection
Shenghui Huang ⋅ Menghao Hu ⋅ Longkun Zou ⋅ Hongyu Chi ⋅ Zekai Li ⋅ Feng Gao ⋅ Fan Yang ⋅ Qingyao Wu ⋅ Ke Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 398
TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding
Boshen Xu ⋅ Zihan Xiao ⋅ Jiaze Li ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Jian Luan ⋅ Qin Jin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 399
StreamReady: Learning What to Answer and When in Long Streaming Videos
Shehreen Azad ⋅ Vibhav Vineet ⋅ Yogesh Rawat
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 400
LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
Jihao Qiu ⋅ Lingxi Xie ⋅ Xinyue Huo ⋅ Qi Tian ⋅ Qixiang Ye
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 401
Agentic Video Summarization via Self-Reflecting Multimodal Understanding
Miaotian Guo ⋅ Shuguang Dou ⋅ Yin Li ⋅ Aidong Men ⋅ Dongsheng Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 402
Self-Critical Distillation Network for Video-based Commonsense Captioning
Mengqi Yuan ⋅ Gengyun Jia ⋅ Bing-Kun Bao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 403
Ego-Grounding for Personalized Question-Answering in Egocentric Videos
Junbin Xiao ⋅ Shenglang Zhang ⋅ Pengxiang Zhu ⋅ Angela Yao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 404
AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding
Handong Li ⋅ Zikang Liu ⋅ Longteng Guo ⋅ Tongtian Yue ⋅ Yepeng Tang ⋅ Xinxin Zhu ⋅ Chuanyang Zheng ⋅ Ziming Wang ⋅ Zhibin Wang ⋅ Jun Song ⋅ Cheng Yu ⋅ Bo Zheng ⋅ Jing Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 405
EarlyTom: Early Token Compression Completes Fast Video Understanding
Hesong Wang ⋅ Xin Jin ⋅ Lu Lu ⋅ Chenhaowen Li ⋅ Jian Chen ⋅ Qiang Liu ⋅ Huan Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 406
VideoWorld 2: Learning Transferable Knowledge from Real-world Videos
Zhongwei Ren ⋅ Yunchao Wei ⋅ Xiao Yu ⋅ Guixun Luo ⋅ Yao Zhao ⋅ Bingyi Kang ⋅ Jiashi Feng ⋅ Xiaojie Jin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 407
VirtueBench: Evaluating Trustworthiness under Uncertainty in Long Video Understanding
Xueqing Yu ⋅ Bohan Li ⋅ Yan Li ⋅ Zhenheng Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 408
DiverseDiT: Towards Diverse Representation Learning in Diffusion Transformers
Mengping Yang ⋅ Stewart Tan ⋅ Binglei Li ⋅ Xiaomeng Yang ⋅ Hesen Chen ⋅ Hao Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 409
RenderFlow: Single-Step Neural Rendering via Flow Matching
Shenghao Zhang ⋅ Runtao Liu ⋅ Christopher Schroers ⋅ Yang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 410
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
Yiyang Ma ⋅ Feng Zhou ⋅ Xuedan Yin ⋅ Pu Cao ⋅ Yonghao Dang ⋅ Jianqin Yin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 411
Masked Region Transformer for Layered Image Generation and Editing at Scale
Zhicong Tang ⋅ Jingye Chen ⋅ Zhao Zhang ⋅ Mohan Zhou ⋅ Yuchi Liu ⋅ Yifan Pu ⋅ Yalong Bai ⋅ Ethan Smith ⋅ Yuhui Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 412
DDT: Decoupled Diffusion Transformer
Shuai Wang ⋅ Zhi Tian ⋅ Weilin Huang ⋅ Limin Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 413
Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers
Wenhao Sun ⋅ Ji Li ⋅ Zhaoqiang Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 414
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
Zekai Luo ⋅ Zongze Du ⋅ Zhouhang Zhu ⋅ Hao Zhong ⋅ Muzhi Zhu ⋅ Wen Wang ⋅ Yuling Xi ⋅ Chenchen Jing ⋅ Hao Chen ⋅ Chunhua Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 415
ShapeAR: Generating Editable Shape Layers via Autoregressive Diffusion
Souymodip Chakraborty ⋅ Ankur Singh ⋅ Amit Vikram Singh ⋅ Vineet Batra ⋅ Ankit Phogat
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 416
ReHyAt: Recurrent Hybrid Attention for Video Diffusion Transformers
Mohsen Ghafoorian ⋅ Amir Habibian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 417
RecTok: Reconstruction Distillation along Rectified Flow
Qingyu Shi ⋅ Size Wu ⋅ Jinbin Bai ⋅ Kaidong Yu ⋅ Yujing Wang ⋅ Yunhai Tong ⋅ Xiangtai Li ⋅ Xuelong Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 418
EgoXtreme: A Dataset for Robust Object Pose Estimation in Egocentric Views under Extreme Conditions
Taegyoon Yoon ⋅ Yegyu Han ⋅ Seojin Ji ⋅ Jaewoo Park ⋅ Sojeong Kim ⋅ Taein Kwon ⋅ Hyung-Sin Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 419
CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection
Zhaonian Kuang ⋅ Rui Ding ⋅ Haotian Wang ⋅ Xinhu Zheng ⋅ Meng Yang ⋅ Gang Hua
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 420
H^2A^2: Homogeneity-Aware and Heterogeneity-Aware Feature Perception for Unified Indoor 3D Object Detection
Tao Xie ⋅ Tao An ⋅ Feng Liu ⋅ Jin Wensheng ⋅ Zhengyu Li ⋅ Lijun Zhao ⋅ Ruifeng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 421
Cov2Pose: Leveraging Spatial Covariance for Direct Manifold-aware 6-DoF Object Pose Estimation
Nassim Ali Ousalah ⋅ Peyman Rostami ⋅ Vincent Gaudillière ⋅ Emmanuel Koumandakis ⋅ Anis Kacem ⋅ Enjie Ghorbel ⋅ Djamila Aouada
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 422
Towards Intrinsic-Aware Monocular 3D Object Detection
Zhihao Zhang ⋅ Abhinav Kumar ⋅ Xiaoming Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 423
SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection
Sandro Papais ⋅ Lezhou Feng ⋅ Charles Cossette ⋅ Lingting Ge
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 424
SPAN: Spatial-Projection Alignment for Monocular 3D Object Detection
Yifan Wang ⋅ Yian Zhao ⋅ Fanqi Pu ⋅ Xiaochen Yang ⋅ YANG TANG ⋅ Xi Chen ⋅ Wenming Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 425
DSCA: Dynamic Subspace Concept Alignment for Lifelong VLM Editing
Gyanendra Das ⋅ Sai Jena
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 426
FailureAtlas: Mapping the Failure Landscape of T2I Models via Active Exploration
Muxi Chen ⋅ Zhaohua Zhang ⋅ Chenchen Zhao ⋅ Mingyang Chen ⋅ Wenyu Jiang ⋅ Tianwen Jiang ⋅ Jianhuan Zhuo ⋅ Yu Tang ⋅ Qiuyong Xiao ⋅ Jihong Zhang ⋅ Qiang Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 427
HDR-VLM: HDR-Domain Adaptation of VLMs and Preference-Aligned Quality Assessment for HDR Video Color Grading
Hao Yuan ⋅ Jiabin Zhang ⋅ Yajing Wu ⋅ Ruixuan Pang ⋅ Jing Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 428
RobustVisRAG: Causality-Aware Vision-Based Retrieval-Augmented Generation under Visual Degradations
I-Hsiang (Aaron) Chen ⋅ Yu-Wei Liu ⋅ Tse-Yu Wu ⋅ Yu-Chien Chiang ⋅ Jen-Chieh Yang ⋅ Wei-Ting Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 429
BiomedCCPL: Causal Conditional Prompt Learning for Biomedical Vision-Language Models
Xueliang Cui ⋅ Juncai Zhang ⋅ Jiacheng Hou ⋅ Dan Lu ⋅ Hao Zhang ⋅ Ruxin Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 430
DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs
Yanbin Wei ⋅ Jiangyue Yan ⋅ Chun Kang ⋅ Yang Chen ⋅ Hua Liu ⋅ James Kwok ⋅ Yu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 431
VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes
Paul Gavrikov ⋅ Wei Lin ⋅ M. Jehanzeb Mirza ⋅ Soumya Jahagirdar ⋅ Muhammad Huzaifa ⋅ Sivan Doveh ⋅ James Glass ⋅ Serena Yeung ⋅ Hilde Kuehne
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 432
Revisiting Visual Corruptions in LVLMs: A Shape–Texture Perspective on Model Failures
Xinkuan Qiu ⋅ Meina Kan ⋅ Zhenliang He ⋅ Yongbin Zhou ⋅ Shiguang Shan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 433
From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing
Haoyuan Zhang ⋅ Keyao Wang ⋅ Guosheng Zhang ⋅ Haixiao Yue ⋅ Zhiwen Tan ⋅ Siran Peng ⋅ Tianshuo Zhang ⋅ Xiao Tan ⋅ Kunbin Chen ⋅ Wei He ⋅ Jingdong Wang ⋅ Ajian Liu ⋅ Xiangyu Zhu ⋅ Zhen Lei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 434
Trust-calibrated Collaborative Learning for Long-Tailed Visual Recognition
Hao Zhou ⋅ Tingjin Luo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 435
SunFaded: Illumination-Aware Gaussian Splatting for Dark Scenes with Camera-Mounted Active Lighting
Wenjie Chang ⋅ Tianle Ding ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 436
TokenSplat: Token-aligned 3D Gaussian Splatting for Feed-forward Pose-free Reconstruction
Yihui Li ⋅ Chengxin Lv ⋅ Zichen Tang ⋅ Hongyu Yang ⋅ Di Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 437
GOR-IS: 3D Gaussian Object Removal In the Intrinsic Space
Yonghao Zhao ⋅ Yupeng Gao ⋅ Jian Yang ⋅ Jin Xie ⋅ Beibei Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 438
AeroGS: Scale-Aware Gaussian Splatting for Pose-Free Dynamic UAV Scene Reconstruction
Tingyun Li ⋅ Xinyi Liu ⋅ Yongjun Zhang ⋅ Yi Wan ⋅ Xiaoan Liu ⋅ Weiwei Fan ⋅ Jiahao Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 439
Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting
Kaiqiang Xiong ⋅ Rui Peng ⋅ Jiahao Wu ⋅ Zhanke Wang ⋅ Jie Liang ⋅ Xiaoyun Zheng ⋅ Feng Gao ⋅ Ronggang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 440
AERGS-SLAM: Auto-Exposure-Robust Stereo 3D Gaussian Splatting SLAM
Zhiyu Zhou ⋅ Feng Hui ⋅ Yu Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 441
Learning Differentiable Hierarchies in 3D Gaussian Splatting
Youqi Pan ⋅ Wugen Zhou ⋅ Hongbin Zha
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 442
WeatherCity: Urban Scene Reconstruction with Controllable Multi-Weather Transformation
Wenhua Wu ⋅ Huai Guan ⋅ Zhe Liu ⋅ Hesheng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 443
Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images
Matias Turkulainen ⋅ Akshay Krishnan ⋅ Filippo Aleotti ⋅ Mohamed Sayed ⋅ Guillermo Garcia-Hernando ⋅ Juho Kannala ⋅ Arno Solin ⋅ Gabriel Brostow ⋅ Daniyar Turmukhambetov
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 444
TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking
Hanzhi Guo ⋅ Dongdong Weng ⋅ Mo Su ⋅ Yixiao Chen ⋅ Xiaonuo Dongye ⋅ Chenyu Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 445
Hierarchical Visual Relocalization with Nearest View Synthesis from Feature Gaussian Splatting
Huaqi Tao ⋅ Bingxi Liu ⋅ Guangcheng Chen ⋅ Fulin Tang ⋅ Li He ⋅ Hong Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 446
Tracking-Guided 4D Generation: Foundation-Tracker Motion Priors for 3D Model Animation
Su Sun ⋅ Cheng Zhao ⋅ Himangi Mittal ⋅ Gaurav Mittal ⋅ Rohith Kukkala ⋅ Yingjie Chen ⋅ Mei Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 447
3D Gaussian Splatting from Unposed Spike Stream
Yijia Guo ⋅ Tong Hu ⋅ Liwen Hu ⋅ Lei Ma ⋅ Tiejun Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 448
SparseOIT: Improving Order-Independent Transparency 3DGS via Active Set Method
Wentao Yang ⋅ FanZhen KONG ⋅ Zejian Kang ⋅ Xiangru Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 449
ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction
Jie Liang ⋅ Jiahao Wu ⋅ Chao Wang ⋅ Jiayu Yang ⋅ Xiaoyun Zheng ⋅ Kaiqiang Xiong ⋅ Zhanke Wang ⋅ Jinbo Yan ⋅ Feng Gao ⋅ Ronggang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 450
Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping
Junmyeong Lee ⋅ Hoseung Choi ⋅ Minsu Cho
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 451
MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes
Wonjoon Lee ⋅ Sungmin Woo ⋅ Donghyeong Kim ⋅ Jungho Lee ⋅ Sangheon Park ⋅ Sangyoun Lee
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 452
BEA-GS: BEyond RAdiance Supervision in 3DGS for Precise Object Extraction
Alessio Mazzucchelli ⋅ María Naranjo Almeida ⋅ Jorge Bustos Sanchez ⋅ Mariella Dimiccoli ⋅ Francesc Moreno-Noguer ⋅ Jordi Sanchez-Riera ⋅ Adrian Penate-Sanchez
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 453
EDGS: Eliminating Densification for Efficient Convergence of 3DGS
Dmytro Kotovenko ⋅ Olga Grebenkova ⋅ Björn Ommer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 454
ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
Sicheng Feng ⋅ Song Wang ⋅ Shuyi Ouyang ⋅ Lingdong Kong ⋅ Zikai Song ⋅ Jianke Zhu ⋅ Huan Wang ⋅ Xinchao Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 455
Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence
Kun Ouyang ⋅ Yuanxin Liu ⋅ Linli Yao ⋅ Yishuo Cai ⋅ Hao Zhou ⋅ Fandong Meng ⋅ Jie Zhou ⋅ Xu Sun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 456
DialogueVPR: Towards Conversational Visual Place Recognition
Yukun Song ⋅ Changwei Wang ⋅ Xingtian Pei ⋅ Shibiao Xu ⋅ Wenhao Xu ⋅ Shunpeng Chen ⋅ Yu Zhang ⋅ Ke Zhang ⋅ Rongtao Xu ⋅ Xuxiang Feng ⋅ Pengyang Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 457
Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning
Chi Zhang ⋅ Haibo Qiu ⋅ Qiming Zhang ⋅ Yufei Xu ⋅ Zhixiong Zeng ⋅ Siqi Yang ⋅ Peng Shi ⋅ Lin Ma ⋅ Jing Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 458
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
Jingqi Tong ⋅ Yurong Mou ⋅ Hangcheng Li ⋅ Mingzhe Li ⋅ Yongzhuo Yang ⋅ Ming Zhang ⋅ Qiguang Chen ⋅ Tianyi Liang ⋅ Xiaomeng Hu ⋅ Yining Zheng ⋅ Xinchi Chen ⋅ Jun Zhao ⋅ Xuanjing Huang ⋅ Xipeng Qiu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 459
VinQA: Visual Elements Interleaved Long-form Answer Generation for Real-World Multimodal Document QA
Young Rok Jang ⋅ Hyesoo Kong ⋅ Kyunghwan An ⋅ Jae Sub Huh ⋅ Gyeonghun KIM ⋅ Stanley Jungkyu Choi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 460
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
Hao Yan ⋅ Yuliang Liu ⋅ Xingchen Liu ⋅ Yuyi Zhang ⋅ Minghui Liao ⋅ Jihao Wu ⋅ Wei Chen ⋅ Xiang Bai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 461
Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress
Yuelin Zhang ⋅ Sijie Cheng ⋅ Chen Li ⋅ Zongzhao Li ⋅ Yuxin Huang ⋅ Yang Liu ⋅ Wenbing Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 462
VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Weitai Kang ⋅ Jason Kuen ⋅ Mengwei Ren ⋅ Zijun Wei ⋅ Yan Yan ⋅ Kangning Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 463
Grounding Everything in Tokens for Multimodal Large Language Models
Xiangxuan Ren ⋅ Zhongdao Wang ⋅ Liping Hou ⋅ Pin Tang ⋅ Guoqing Wang ⋅ Chao Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 464
Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
Ce Zhang ⋅ Jinxi He ⋅ Junyi He ⋅ Katia Sycara ⋅ Yaqi Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 465
ChartR: Evaluating Reasoning Accuracy and Robustness in Chart Question Answering
Xiaojun Chen ⋅ Sixiao Luo ⋅ Ziqi Liu ⋅ Min Yang ⋅ Qin Zhang ⋅ Liang-Jie Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 466
Think Visually, Reason Textually: Vision-Language Synergy in Abstract Reasoning
Beichen Zhang ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Yuhang Cao ⋅ Haodong Duan ⋅ Dahua Lin ⋅ Jiaqi Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 467
VKG-QA: Visual Knowledge Graph-based Question Answer for Large Multimodal Models
Yuntao Du ⋅ Yiming Wang ⋅ Renshuo Yuan ⋅ Jincheng Yue ⋅ Yijing Chen ⋅ Yue Fan ⋅ Bo Zhang ⋅ Qian Li ⋅ Lizhen Cui
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 468
Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning
Haozhen Gong ⋅ Xiaozhong Ji ⋅ Yuansen Liu ⋅ Wenbin Wu ⋅ Xiaoxiao Yan ⋅ Jingjing Liu ⋅ Kai WU ⋅ Jiazhen Pan ⋅ Bailiang Jian ⋅ Jiangning Zhang ⋅ Xiaobin Hu ⋅ Hongwei Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 469
Human-like Abstract Visual Reasoning via Understanding and Solving Reasoning Loop
Xinwang Chen ⋅ Xiuxing Li ⋅ Qing Li ⋅ Ziyue Zhuang ⋅ Yutong Wu ⋅ Ziyu Li ⋅ Zhuo Wang ⋅ Kai Li ⋅ Jianye Hao ⋅ Xia Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 470
VITAL: Vision-Encoder-centered Pre-training for LMMs in Visual Quality Assessment
Ziheng Jia ⋅ Linhan Cao ⋅ Jinliang Han ⋅ Zicheng Zhang ⋅ Jiaying Qian ⋅ Wang Jiarui ⋅ Zijian Chen ⋅ Guangtao Zhai ⋅ Xiongkuo Min
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 471
Generative Video Compression with One-Dimensional Latent Representation
Zihan Zheng ⋅ Zhaoyang Jia ⋅ Naifu Xue ⋅ Jiahao Li ⋅ Bin Li ⋅ Zongyu Guo ⋅ Xiaoyi Zhang ⋅ Zhenghao Chen ⋅ Houqiang Li ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 472
Markovian Scale Prediction: A New Era of Visual Autoregressive Generation
Yu Zhang ⋅ Jingyi Liu ⋅ Yiwei Shi ⋅ Qi Zhang ⋅ Duoqian Miao ⋅ Changwei Wang ⋅ Longbing Cao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 473
Learned Image Compression via Sparse Attention and Adaptive Frequency
Huidong Ma ⋅ Xinyan Shi ⋅ Hui Sun ⋅ Xiaofei Yue ⋅ Xiaoguang Liu ⋅ Gang Wang ⋅ Wentong Cai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 474
UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders
Matthew Walmer ⋅ Saksham Suri ⋅ Anirud Aggarwal ⋅ Abhinav Shrivastava
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 475
VecAttention: Vector-wise Sparse Attention for Accelerating Long Context Inference
Anmin Liu ⋅ Ruixuan Yang ⋅ Huiqiang Jiang ⋅ Bin Lin ⋅ Minmin Sun ⋅ Yong Li ⋅ CHEN ZHANG ⋅ Tao Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 476
Ultra-Fast Neural Video Compression
Jiahao Li ⋅ Wenxuan Xie ⋅ Zhaoyang Jia ⋅ Bin Li ⋅ Zongyu Guo ⋅ Xiaoyi Zhang ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 477
Parallax to Align Them All: An OmniParallax Attention Mechanism for Distributed Multi-View Image Compression
Haotian Zhang ⋅ Feiyue Long ⋅ Yixin Yu ⋅ Jian Xue ⋅ Haocheng Tang ⋅ Tongda Xu ⋅ Zhenning Shi ⋅ Yan Wang ⋅ Siwei Ma ⋅ Jiaqi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 478
Scaling Parallel Sequence Models to Vision Foundation Models
Yitong Jiang ⋅ Collin McCarthy ⋅ Hongjun Wang ⋅ Hanrong Ye ⋅ Qi Dou ⋅ Tianfan Xue ⋅ Jinwei Gu ⋅ Jan Kautz ⋅ Danny Yin ⋅ Pavlo Molchanov ⋅ Sifei Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 479
Revisiting Model Stitching In the Foundation Model Era
Zheda Mai ⋅ Ke Zhang ⋅ Fu-En Wang ⋅ Zixiao Ken Wang ⋅ Albert Chen ⋅ Lu Xia ⋅ Min Sun ⋅ Wei-Lun Chao ⋅ Cheng-Hao Kuo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 480
GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics
Modi Jin ⋅ Yiming Zhang ⋅ Bo-Yuan Sun ⋅ Dingwen Zhang ⋅ Mingming Cheng ⋅ Qibin Hou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 481
VLM-Loc: Localization in Point Cloud Maps via Vision-Language Models
Shuhao Kang ⋅ Youqi Liao ⋅ Peijie Wang ⋅ Wenlong Liao ⋅ Qilin Zhang ⋅ Benjamin Busam ⋅ Xieyuanli Chen ⋅ Yun Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 482
HOLO: Homography-Guided Pose Estimator Network for Fine-Grained Visual Localization on SD Maps
Xuchang Zhong ⋅ Xu Cao ⋅ Jinke Feng ⋅ Hao Fang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 483
TriLite: Efficient Weakly Supervised Object Localization with Universal Visual Features and Tri-Region Disentanglement
Arian Sabaghi ⋅ Jose Oramas
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 484
GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings
Angel Daruna ⋅ Nicholas Meegan ⋅ Han-Pang Chiu ⋅ Supun Samarasekera ⋅ Rakesh “Teddy” Kumar
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 485
Towards Visual Query Localization in the 3D World
Liang Peng ⋅ Bohan Tan ⋅ Zhipeng Zhang ⋅ Haobo Li ⋅ Yifan Jiao ⋅ Xingping Dong ⋅ Libo Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 486
OVOD-Agent: A Markov–Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection
Chujie Wang ⋅ Jianyu Lu ⋅ Zhiyuan Luo ⋅ Xi Chen ⋅ Chu He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 487
Pixel2Phys: Distilling Governing Laws from Visual Dynamics
Ruikun Li ⋅ Jun Yao ⋅ Yingfan Hua ⋅ SHIXIANG TANG ⋅ Biqing Qi ⋅ Bin Liu ⋅ Wanli Ouyang ⋅ Yan Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 488
Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection
Zhanhe Lei ⋅ Zhongyuan Wang ⋅ Jikang Cheng ⋅ Baojin Huang ⋅ Yuhong Yang ⋅ Zhen Han ⋅ Chao Liang ⋅ Dengpan Ye
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 489
Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding
Junhan Chen ⋅ Zilu Zhou ⋅ Yujun Tong ⋅ Dongliang Chang ⋅ Yitao Luo ⋅ Zhanyu Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 490
Dynamic Important Example Mining for Reinforcement Finetuning
Haoru Tan ⋅ WU Sitong ⋅ Yanfeng Chen ⋅ Shizhen Zhao ⋅ Yangtian Sun ⋅ Tianjia Liu ⋅ Chirui Chang ⋅ Shaofeng Zhang ⋅ Xingwu Sun ⋅ Xiuzhe Wu ⋅ Ruobing Xie ⋅ Xiaojuan Qi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 491
Specificity-aware reinforcement learning for fine-grained open-world classification
Samuele Angheben ⋅ Davide Berasi ⋅ Alessandro Conti ⋅ Elisa Ricci ⋅ Yiming Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 492
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
Jitesh Jain ⋅ Jialuo Li ⋅ Zixian Ma ⋅ Jieyu Zhang ⋅ Chris Dongjoo Kim ⋅ Sangho Lee ⋅ Rohun Tripathi ⋅ Tanmay Gupta ⋅ Christopher Clark ⋅ Humphrey Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 493
Uncertainty-Aware Modality Fusion for Unaligned RGB-T Salient Object Detection
Mianzhao Wang ⋅ Fan Shi ⋅ Xu Cheng ⋅ Chen Jia ⋅ Shengyong Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 494
Fusion in Your Way: Aligning Image Fusion with Heterogeneous Demands via Direct Preference Optimization
Weijian Su ⋅ Songqian Zhang ⋅ Yuqi Han ⋅ Jian Zhuang ⋅ Yongdong Huang ⋅ Qiang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 495
More Than Meets the Eye: A Unified Image Fusion Framework via Semantic-Pixel Entropy Trade-off for Zero-Shot Generalization
Xiaowen Liu ⋅ Jing Li ⋅ Hongtao Huo ⋅ Haozhe Cao ⋅ Renhua Wang ⋅ Xu Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 496
Beyond Sequential Tools: A Unified VLM Agent System for Photographic Post-Processing via Dynamic Multi-Expert Fusion
Honglin Xiong ⋅ Chenjie Zhu ⋅ Jianbiao Ding ⋅ Zixuan Ni ⋅ Wei Li ⋅ Zhenpeng Mi ⋅ Qian Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 497
Multi-modal Frequency Decomposition Network for Semantic Scene Completion
Die Zuo ⋅ Lubo Wang ⋅ Ruonan Liu ⋅ Qing Guo ⋅ Chong Wang ⋅ Dongdong Wu ⋅ Wei Feng ⋅ Kairui Yang ⋅ Di Lin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 498
BiEvLight: Bi-level Learning of Task-Aware Event Refinement for Low-Light Image Enhancement
Zishu Yao ⋅ Xiang-Xiang Su ⋅ Shengning Zhou ⋅ Guang-Yong Chen ⋅ Guodong Fan ⋅ Xing Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 499
FusionRegister: Every Infrared and Visible Image Fusion Deserves Registration
Congcong Bian ⋅ HaoLong Ma ⋅ Hui Li ⋅ Zhongwei Shen ⋅ Xiaoqing Luo ⋅ Xiaoning Song ⋅ Xiao-Jun Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 500
OmniFood8K: Single-Image Nutrition Estimation via Hierarchical Frequency-Aligned Fusion
Dongjian Yu ⋅ Weiqing Min ⋅ Qian Jiang ⋅ Xing Lin ⋅ Xin Jin ⋅ Shuqiang Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 501
Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning
Yingkai Zhang ⋅ Tao Zhang ⋅ Jing Nie ⋅ Ying Fu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 502
LRHDR: Learning Representation-enhanced HDR Video Reconstruction
Chenzhuo Liao ⋅ Xin Chen ⋅ Bingchen Li ⋅ Yu Meng ⋅ Tao Yue ⋅ Xuemei Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 503
Cross-Domain Few-Shot Segmentation via Multi-view Progressive Adaptation
Jiahao Nie ⋅ Guanqiao Fu ⋅ Wenbin An ⋅ Yap-Peng Tan ⋅ Alex C. Kot ⋅ Shijian Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 504
Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment
Yaze Zhao ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 505
PP-Brep: Few-Shot B-rep Classification with Hybrid Graph Representation
Jiacheng Hao ⋅ Chunying Liu ⋅ Hao Guo ⋅ Ruohan Wang ⋅ Hongping Gan ⋅ Yilei Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 506
AgentDet: A Shared-Blackboard Multi-Agent Framework for Zero-/Few-Shot Object Detection
Haolin Li ⋅ Yaohua Wang ⋅ Ze Yan ⋅ Lijie Wen ⋅ Biqing Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 507
SFR-Net: Steering-Fusion-Refining Network in Multi-label Zero-Shot Sewer Defect Detection
Zhao-Min Chen ⋅ Xinjian Huang ⋅ Yisu Ge ⋅ Yu Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 508
Noise-Aware Few-Shot Learning through Bi-directional Multi-View Prompt Alignment
Lu Niu ⋅ Cheng Xue
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 509
Learnability-Guided Diffusion for Dataset Distillation
Jeffrey A. Chan-Santiago ⋅ Mubarak Shah
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 510
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Xiangyu Fan ⋅ Zesong Qiu ⋅ Zhuguanyu Wu ⋅ Fanzhou Wang ⋅ Zhiqian Lin ⋅ Tianxiang Ren ⋅ Dahua Lin ⋅ RUIHAO GONG ⋅ Lei Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 511
Progressive Mask Distillation for Self-supervised Video Representation
Kewei Wu ⋅ Chong Liang ⋅ Zhao Xie ⋅ Dan Guo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 512
HierAmp: Coarse-to-Fine Autoregressive Amplification for Generative Dataset Distillation
Lin Zhao ⋅ Xinru Jiang ⋅ Xi Xiao ⋅ Qihui Fan ⋅ Lei Lu ⋅ Yanzhi Wang ⋅ Xue Lin ⋅ OCTAVIA CAMPS ⋅ Pu Zhao ⋅ Jianyang Gu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 513
SpiderCam: Low-Power Snapshot Depth from Differential Defocus
Marcos A. Ferreira ⋅ Tianao Li ⋅ John Mamish ⋅ Josiah Hester ⋅ Yaman Sangar ⋅ Qi Guo ⋅ Emma Alexander
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 514
Computational Speckle Pattern Interferometry
Shengxi Wu ⋅ Sophia Yang ⋅ Dorian Chan ⋅ Matthew O’Toole
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 515
DetectSCI: Toward Object-Guided ROI Reconstruction for High-Resolution Video Snapshot Compressive Imaging
Xingjian Jiang ⋅ Lishun Wang ⋅ Ping Wang ⋅ Xin Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 516
Solving a Nonlinear Blind Inverse Problem for Tagged MRI with Physics and Deep Generative Priors
Zhangxing Bian ⋅ Shuwen Wei ⋅ Samuel W. Remedios ⋅ Junyu Chen ⋅ Aaron Carass ⋅ Blake E. Dewey ⋅ Jerry L Prince
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 517
Nonlinear Color Transfer via Learnable Bezier Flows
Junhyoung Lee ⋅ Seongwoon Jo ⋅ JeongHun Park ⋅ Yeonji Ryou ⋅ Jeongha Yang ⋅ Jangho Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 518
VT-Intrinsic: Physics-Based Decomposition of Reflectance and Shading using a Single Visible-Thermal Image Pair
Zeqing Yuan ⋅ Mani Ramanagopal ⋅ Aswin C. Sankaranarayanan ⋅ Srinivasa G. Narasimhan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 519
GH-NAF: Grid-Adaptive Hash-Level–Attended Neural Attenuation Fields for Discrepancy-Aware CBCT
Seong Je Oh ⋅ Ju Hwan Lee ⋅ Chae Yeon Lim ⋅ Donghwan Lee ⋅ Myung Jin Ching ⋅ Kyungsu Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 520
Computer Vision with a Superpixelation Camera
Sasidharan Mahalingam ⋅ Rachel Brown ⋅ Atul Ingle
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 521
Color-Encoded Illumination for High-Speed Volumetric Scene Reconstruction
David Novikov ⋅ Eilon Vaknin ⋅ Narek Tumanyan ⋅ Mark Sheinin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 522
Multi-Scale Gradient-Guided Unrolling Architecture with Adaptive Mamba for Compressive Sensing
Le Yang ⋅ Hongping Gan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 523
Deciphering Genotype-Phenotype Mechanisms from High-Content Profiling via Knowledge-Guided Multi-modal Graph Learning
Hanjing Lin ⋅ Jiahua Rao ⋅ Youhan Sun ⋅ Jiancong Xie ⋅ Yuedong Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 524
Bulk RNA-seq Guided Multi-modal Detection of Anomalous Regions in Human Cancer via Spatial Transcriptomics
Hang Shi ⋅ Ruocheng Yang ⋅ Wenjie You ⋅ Zhilin Huang ⋅ Daoqiang Zhang ⋅ WEI SHAO
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 525
Intervention-Aware Multiscale Representation Learning from Imaging Phenomics and Perturbation Transcriptomics
Jiayuan Chen ⋅ Ruoqi Liu ⋅ Zishan Gu ⋅ Ping Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 526
ParaUni: Enhance Generation in Unified Multimodal Model with Reinforcement-driven Hierarchical Parallel Information Interaction
Jiangtong Tan ⋅ Lin Liu ⋅ Jie Huang ⋅ Xiaopeng Zhang ⋅ Qi Tian ⋅ Feng Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 527
PhysVid: Physics Aware Local Conditioning for Generative Video Models
Saurabh Pathak ⋅ Elahe Arani ⋅ Mykola Pechenizkiy ⋅ Bahram Zonooz
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 528
PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment
Suhyeon Lee ⋅ Jong Chul
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 529
EvoID: Reinforced Evolution for Identity-Preserving Video Generation
Yiheng Zhang ⋅ Zhaofan Qiu ⋅ Zunxu Liu ⋅ Yingwei Pan ⋅ Ting Yao ⋅ Tao Mei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 530
Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning
Yuxuan Gu ⋅ Weimin Bai ⋅ Yifei Wang ⋅ Weijian Luo ⋅ He Sun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 531
PhyCo: Learning Controllable Physical Priors for Generative Motion
Sriram Narayanan ⋅ Ziyu Jiang ⋅ Srinivasa G. Narasimhan ⋅ Manmohan Chandraker
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 532
Unified Multimodal Models as Auto-Encoders
Zhiyuan Yan ⋅ Kaiqing Lin ⋅ Hao Li ⋅ Junyan Ye ⋅ Hui Han ⋅ Haochen Wang ⋅ Zhendong Wang ⋅ Bin Lin ⋅ Li Hao ⋅ Xinyan Xiao ⋅ Jingdong Wang ⋅ Haifeng Wang ⋅ Li Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 533
Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models
Shiran Ge ⋅ Chenyi Huang ⋅ Yuang Ai ⋅ Qihang Fan ⋅ Huaibo Huang ⋅ Ran He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 534
ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference
Ali Hojjat ⋅ Janek Haberer ⋅ Sören Pirk ⋅ Olaf Landsiedel
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 535
Drainage: A Unifying Framework for Addressing Class Uncertainty
Yasser Taha ⋅ Grégoire Montavon ⋅ Nils Körber
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 536
Neural Differentiation in Deep Networks: A Theoretical Framework for Expressivity and Representational Diversity
Boyuan Wang ⋅ Richard Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 537
DuetMerging: Synergizing Dynamic and Static Strategies for Mitigating Task Interference in Model Merging
Yan Li ⋅ Guiping Cao ⋅ Yaguang Song ⋅ Ming Tao ⋅ Haoran Gong ⋅ Junhui Liu ⋅ Yaowei Wang ⋅ Dongmei Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 538
SASNet: Spatially-Adaptive Sinusoidal Networks for INRs
Haoan Feng ⋅ Diana Aldana ⋅ Tiago Novello ⋅ Leila De Floriani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 539
Generative Modeling of Weights: Generalization or Memorization?
Boya Zeng ⋅ Yida Yin ⋅ Zhiqiu Xu ⋅ Zhuang Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 540
Vision-Oriented Lightweight Neural Architecture Search with Budget-Adaptive Evaluation
Yi Fan ⋅ Yu-Bin Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 541
Improving Sparse Autoencoder with Dynamic Attention
Dongsheng Wang ⋅ Jinsen Zhang ⋅ Dawei Su ⋅ Hui Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 542
Stepwise Credit Assignment for GRPO on Flow-Matching Models
Yash Savani ⋅ Branislav Kveton ⋅ Yuchen Liu ⋅ Yilin Wang ⋅ Jing Shi ⋅ Subhojyoti Mukherjee ⋅ Nikos Vlassis ⋅ Krishna Kumar Singh
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 543
FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models
Yucheng Xie ⋅ Fu Feng ⋅ Ruixiao Shi ⋅ Jianlu Shen ⋅ Jing Wang ⋅ Yong Rui ⋅ Xin Geng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 544
Hyperbolic Busemann Neural Networks
Ziheng Chen ⋅ Bernhard Schölkopf ⋅ Nicu Sebe
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 545
FlowDIS: Language-Guided Dichotomous Image Segmentation with Flow Matching
Andranik Sargsyan ⋅ Shant Navasardyan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 546
Image-to-Point Cloud Feature Back-Projection for Multimodal Training of 3D Semantic Segmentation
Jiawei Han ⋅ Matteo Poggi ⋅ HUAN LI ⋅ Changshuo Wang ⋅ Kaiqi Liu ⋅ Wei Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 547
NG-GS: NeRF-guided 3D Gaussian Splatting Segmentation
Yi He ⋅ Tao Wang ⋅ Yi Jin ⋅ Congyan Lang ⋅ Yidong Li ⋅ Haibin Ling
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 548
Teaching DINOv3 About Partial 3D Geometry: A Self-Supervised Geometry-Aware Approach
Viktoria Ehm ⋅ Dongliang Cao ⋅ Riccardo Marin ⋅ Daniel Scholz ⋅ Weikang Wang ⋅ Florian Bernard ⋅ Daniel Cremers
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 549
SemLayer: Semantic-aware Generative Segmentation and Layer Construction for Abstract Icons
Haiyang Xu ⋅ Ronghuan Wu ⋅ Li-Yi Wei ⋅ Nanxuan Zhao ⋅ Chenxi Liu ⋅ Cuong Nguyen ⋅ Zhuowen Tu ⋅ Zhaowen Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 550
MatchED: Crisp Edge Detection Using End-to-End, Matching-based Supervision
bedrettin cetinkaya ⋅ Sinan Kalkan ⋅ Emre Akbas
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 551
SegGBC: Justifiable Coarse-to-Fine Granular-Ball Computing for Enhancing Clustering Image Segmentation
Qianpeng Chong ⋅ Wenyi Zeng ⋅ Xiuxuan Shen ⋅ Jiajie Li ⋅ Qian Yin ⋅ Xin Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 552
Seeing Beyond: Extrapolative Domain Adaptive Panoramic Segmentation
Yuanfan Zheng ⋅ Kunyu Peng ⋅ Xu Zheng ⋅ Kailun Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 553
MatchMask: Mask-Centric Generative Data Augmentation for Label-Scarce Semantic Segmentation
Yuqi Lin ⋅ Hao Zhang ⋅ Wenqi Shao ⋅ Shiqu Liu ⋅ Zhihong Gu ⋅ Wenxiao Wang ⋅ Xiaofei He ⋅ Kaipeng Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 554
Boundary-Responsive Differentiable Gating for Superpixel-Based Segmentation
Fatmaelzahraa Ali Ahmed ⋅ Zhihe Lu ⋅ Gianni Di ⋅ Diram Tabaa ⋅ Mohamed Hamdy ⋅ Muraam Abdel-Ghani ⋅ Abdulaziz Al-Ali ⋅ Muhammad Arsalan ⋅ Shidin Balakrishnan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 555
Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation
Yunkai Yang ⋅ Yudong Zhang ⋅ Kunquan Zhang ⋅ Jinxiao Zhang ⋅ Xinying Chen ⋅ Haohuan Fu ⋅ Runmin Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 556
FUSAR-GPT: A Spatiotemporal Feature-Embedded and Two-Stage Decoupled Visual Language Model for SAR Imagery
Xiaokun Zhang ⋅ Yi Yang ⋅ Ziqi Ye ⋅ Baiyun Baiyun ⋅ Xiaorong Guo ⋅ Qingchen Fang ⋅ Ry Zhang ⋅ Xinpeng Zhou ⋅ Haipeng Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 557
UniChange: Unifying Change Detection with Multimodal Large Language Model
Xu Zhang ⋅ Danyang Li ⋅ Xiaohang Dong ⋅ Tianhao Wu ⋅ Hualong Yu ⋅ Jianye Wang ⋅ Qicheng Li ⋅ Xiang Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 558
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Jeremy A. Irvin ⋅ Jiaqi Han ⋅ Zikui Wang ⋅ Abdulaziz Alharbi ⋅ Yufei Zhao ⋅ Nomin-Erdene Bayarsaikhan ⋅ Daniele Visioni ⋅ Andrew Y. Ng ⋅ Duncan Watson-Parris
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 559
See What We Cannot See: A Geo-guided Reasoning Benchmark for Object Counting under Adverse Earth Observation Conditions
Jiayi Wang ⋅ Zhihong Tan ⋅ Hongchen Wei ⋅ Daiqing Yang ⋅ Zhenzhong Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 560
MM-OVSeg: Multimodal Optical–SAR Fusion for Open-Vocabulary Segmentation in Remote Sensing
YIMIN WEI ⋅ Aoran Xiao ⋅ Hongruixuan Chen ⋅ Junshi Xia ⋅ Naoto Yokoya
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 561
RECS4R: Bridging Semantics and Geometry for Referring Remote Sensing Interpretation
Jinming Chai ⋅ Lingling Li ⋅ Licheng Jiao ⋅ Xiaoqiang Lu ⋅ Long Sun ⋅ Xu Liu ⋅ Wenping Ma ⋅ Weibin Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 562
Fourier Angle Alignment for Oriented Object Detection in Remote Sensing
Changyu Gu ⋅ Linwei Chen ⋅ Lin Gu ⋅ Ying Fu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 563
Learning to Infer Parameterized Representations of Plants from 3D Scans
Samara Ghrer ⋅ Christophe Godin ⋅ Stefanie Wuhrer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 564
Good Can Sometimes be Bad: A Unified Attack against 3D Point Cloud Classifier by a Flexible Isotropic Resampling
linkun fan ⋅ Jiahao Zhang ⋅ JunTao Zhang ⋅ Lei Zhang ⋅ Fazhi He ⋅ Daojun Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 565
V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs
Sen Nie ⋅ Jie Zhang ⋅ Jianxin Yan ⋅ Shiguang Shan ⋅ Xilin Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 566
FeatureFool: Zero-Query Fooling of Video Models via Feature Map
Duoxun Tang ⋅ Xi Xiao ⋅ Guangwu Hu ⋅ Kangkang Sun ⋅ Xiao Yang ⋅ Dongyang Chen ⋅ Qing Li ⋅ Yongjie Yin ⋅ Jiyao Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 567
RankOOD - Class Ranking-based Out-of-Distribution Detection
Dishanika Denipitiyage ⋅ Naveen Karunanayake ⋅ Suranga Seneviratne ⋅ Sanjay Chawla
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 568
AdvFM: Lookahead Flow-Matching Velocity-Field Attacks for Imperceptible and Transferable Adversarial Examples
Runze Liu ⋅ Zeyue Wang ⋅ Fanghui Sun ⋅ Rui Liu ⋅ Yihan Yan ⋅ Shen Wang ⋅ Zhaoyang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 569
The Power of Decaying Steps: Enhancing Attack Stability and Transferability for Sign-based Optimizers
Wei Tao ⋅ Yang Dai ⋅ Jincai Huang ⋅ Qing Tao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 570
Your Classifier Can Do More: Towards Balancing the Gaps in Classification, Robustness, and Generation
kaichao jiang ⋅ He Wang ⋅ Xiaoshuai Hao ⋅ Xiulong Yang ⋅ Ajian Liu ⋅ Qi Chu ⋅ Yunfeng Diao ⋅ Richang Hong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 571
Learning Mutual View Information Graph for Adaptive Adversarial Collaborative Perception
Yihang Tao ⋅ Senkang Hu ⋅ Haonan An ⋅ Zhengru Fang ⋅ Hangcheng Cao ⋅ Yuguang Fang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 572
Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning
Hao Zhou ⋅ Tiru Wu ⋅ yan jiang ⋅ Wanqi Zhou ⋅ Junxing Hu ⋅ Ai Han
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 573
Omni-Attack: Adversarial Attacks on Open-Ended VQA in Black-Box Multimodal LLMs
Kai Hu ⋅ Weichen Yu ⋅ Li Zhang ⋅ Alexander Robey ⋅ Andy Zou ⋅ Haoqi Hu ⋅ Chengming Xu ⋅ Matt Fredrikson
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 574
CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning
Jiange Yang ⋅ tom tomlinson ⋅ Haoyi Zhu ⋅ Mingyu Liu ⋅ Kaijing Ma ⋅ Yating Wang ⋅ Gangshan Wu ⋅ Tong He ⋅ Limin Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 575
Δynamics: Language-Based Representation for Inferring Rigid-Body Dynamics From Videos
Chia-Hsiang Kao ⋅ Cong Phuoc Huynh ⋅ Chien-Yi Wang ⋅ Noranart Vesdapunt ⋅ Stefan Stojanov ⋅ Bharath Hariharan ⋅ Oleksandr Obiednikov ⋅ Ning Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 576
PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations
Mingqi Yuan ⋅ Tao Yu ⋅ Haolin Song ⋅ Bo Li ⋅ Xin Jin ⋅ Hua Chen ⋅ Wenjun Zeng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 577
Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols
Xianchao Zeng ⋅ Xinyu Zhou ⋅ Youcheng Li ⋅ Jiayou Shi ⋅ Tianle Li ⋅ Liangming Chen ⋅ Lei Ren ⋅ Yonglu Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 578
RealVLG-R1: A Large-Scale Real-World Visual-Language Grounding Benchmark for Robotic Perception and Manipulation
Linfei Li ⋅ Lin Zhang ⋅ Ying Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 579
GeCo-SRT: Geometry-aware Continual Adaptation for Cross-Task Sim-to-Real Transfer
Wenbo Yu ⋅ Wenke Xia ⋅ Weitao Zhang ⋅ Di Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 580
ActiveGrasp: Information-Guided Active Grasping with Calibrated Energy-based Model
Boshu Lei ⋅ Wen Jiang ⋅ Kostas Daniilidis
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 581
BiPreManip: Learning Affordance-Based Bimanual Pre-Manipulation through Anticipatory Collaboration
Yan Shen ⋅ Feng Jiang ⋅ Zichen He ⋅ Xiaoqi Li ⋅ Yuchen Liu ⋅ Zhiyu Li ⋅ Ruihai Wu ⋅ Hao Dong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 582
Learning Surgical Robotic Manipulation with 3D Spatial Priors
Yu Sheng ⋅ Lidian Wang ⋅ Xiaomeng Chu ⋅ Jiajun Deng ⋅ Min Cheng ⋅ Yanyong Zhang ⋅ Bei Hua ⋅ Houqiang Li ⋅ Jianmin Ji
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 583
SimRecon: SimReady Compositional Scene Reconstruction from Real Videos
Chong Xia ⋅ Kai Zhu ⋅ Zizhuo Wang ⋅ Fangfu Liu ⋅ Zhizheng Zhang ⋅ Yueqi Duan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 584
STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation
Hao Ren ⋅ Zetong Bi ⋅ Yiming Zeng ⋅ Zhaoliang Wan ⋅ Lu Qi ⋅ Hui Cheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 585
RaUF: Learning the Spatial Uncertainty Field of Radar
Shengpeng Wang ⋅ Kuangyu Wang ⋅ Wei Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 586
SIR: Structured Image Representations for Explainable Robot Learning
Paul Mattes ⋅ Jan Schwab ⋅ Jens Bosch ⋅ Maximilian Li ⋅ Nils Blank ⋅ Minh-Trung Tang ⋅ Moritz Haberland ⋅ Rudolf Lioutikov
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 587
Instance-level Visual Active Tracking with Occlusion-Aware Planning
Haowei Sun ⋅ Kai Zhou ⋅ Hao Gao ⋅ Shiteng Zhang ⋅ Jinwu Hu ⋅ Xutao Wen ⋅ Qixiang Ye ⋅ Mingkui Tan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 588
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
Yi Yang ⋅ Xueqi Li ⋅ Yiyang Chen ⋅ Jin Song ⋅ Yihan Wang ⋅ Zipeng Xiao ⋅ Jiadi Su ⋅ You Qiaoben ⋅ Pengfei Liu ⋅ Zhijie Deng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 589
AnthroTAP: Learning Point Tracking with Real-World Motion
Inès Hyeonsu Kim ⋅ Seokju Cho ⋅ Jahyeok Koo ⋅ Junghyun Park ⋅ Gabriel Huang ⋅ Honglak Lee ⋅ Joon-Young Lee ⋅ Seungryong Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 590
Tracking by Predicting 3-D Gaussians Over Time
Tanish Baranwal ⋅ Himanshu Singh Singh ⋅ Jathushan Rajasegaran ⋅ Jitendra Malik
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 591
Toward Low-Cost yet Effective Temporal Learning for UAV Tracking
chaocan xue ⋅ Qihua Liang ⋅ Bineng Zhong ⋅ Yanting Zu ⋅ Yuanliang Xue ⋅ Haiying Xia ⋅ Shuxiang Song
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 592
Rethinking Two-Stage Referring-by-Tracking in Referring Multi-Object Tracking: Make it Strong Again
Weize Li ⋅ Yunhao Du ⋅ Qixiang Yin ⋅ Zhicheng Zhao ⋅ Fei Su
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 593
Occlusion-Aware SORT: Observing Occlusion for Robust Multi-Object Tracking
Chunjiang Li ⋅ Jianbo Ma ⋅ Li Shen ⋅ Yanru Chen ⋅ Liangyin Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 594
CoWTracker: Tracking by Warping instead of Correlation
Zihang Lai ⋅ Eldar Insafutdinov ⋅ Edgar Sucar ⋅ Andrea Vedaldi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 595
Learning Long-term Motion Embeddings for Efficient Kinematics Generation
Nick Stracke ⋅ Kolja Bauer ⋅ Stefan Andreas Baumann ⋅ Miguel Ángel Bautista ⋅ Joshua Susskind ⋅ Björn Ommer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 596
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
Jiahao Wang ⋅ Yufeng Yuan ⋅ Rujie Zheng ⋅ Youtian Lin ⋅ Jian Gao ⋅ Lin-Zhuo Chen ⋅ Yajie Bao ⋅ Chang Zeng ⋅ Yanxi Zhou ⋅ Xiaoxiao Long ⋅ Hao Zhu ⋅ Zhaoxiang Zhang ⋅ Xun Cao ⋅ Yao Yao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 597
Beyond Explicit Language: Plug-and-Play Visual-to-Linguistic Modeling Toward General Object Tracking
Kaiyang Lan ⋅ Ying Cui ⋅ Chenchen Jing ⋅ Jianwei Zheng ⋅ Dongyan Guo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 598
FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants
Mahesh Bhosale ⋅ Abdul Wasi Lone ⋅ Shantam Srivastava ⋅ Shifa Latif ⋅ Tianyu Luan ⋅ Mingchen Gao ⋅ David Doermann ⋅ Xuan Gong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 599
InvCoSS: Inversion-driven Continual Self-supervised Learning in Medical Multi-modal Image Pre-training
Zihao Luo ⋅ Shaohao Rui ⋅ Zhenyu Tang ⋅ Guotai Wang ⋅ Xiaosong Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 600
PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting
Danyal Maqbool ⋅ Changhee Lee ⋅ Zachary Huemann ⋅ Samuel D. Church ⋅ Matthew E. Larson ⋅ Scott B. Perlman ⋅ Tomas A. Romero ⋅ Joshua D. Warner ⋅ Meghan Lubner ⋅ Xin Tie ⋅ Jameson Merkow ⋅ Junjie Hu ⋅ Steve Y. Cho ⋅ Tyler J. Bradshaw
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 601
From Panel to Pixel: Zoom-In Vision–Language Pretraining from Biomedical Scientific Literature
Kun yuan ⋅ Min Woo ⋅ Zhen Chen ⋅ Alejandro Lozano ⋅ Xiangteng He ⋅ Shi Li ⋅ Nassir Navab ⋅ Xiaoxiao Sun ⋅ Nicolas Padoy ⋅ Serena Yeung
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 602
LEMON: A Large Endoscopic MONocular Dataset and Foundation Model for Perception in Surgical Settings
chengan che ⋅ Chao Wang ⋅ Tom Vercauteren ⋅ Sophia Tsoka ⋅ Luis Carlos Garcia Peraza Herrera
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 603
D2T2 - Multimodal Automated Planning for Brachytherapy
Lance C. Moore ⋅ Aranyo Mitra ⋅ Ryan Truong ⋅ Karoline Kallis ⋅ Kelly Kisling ⋅ Sandra M. Meyers ⋅ Nuno Vasconcelos
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 604
TopoCL: Topological Contrastive Learning for Medical Imaging
Guangyu Meng ⋅ Pengfei Gu ⋅ Peixian Liang ⋅ John P. Lalor ⋅ Erin Wolf Chambers ⋅ Danny Z. Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 605
Diffusion with a Linguistic Compass: Steering the Generation of Clinically Plausible Future sMRI Representations for Early MCI Conversion Prediction
Zhihao Tang ⋅ Chaozhuo Li ⋅ Litian Zhang ⋅ Xi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 606
Personalized Longitudinal Medical Report Generation via Temporally-Aware Federated Adaptation
He Zhu ⋅ Ren Togo ⋅ Takahiro Ogawa ⋅ Kenji Hirata ⋅ Minghui Tang ⋅ Takaaki Yoshimura ⋅ Hiroyuki Sugimori ⋅ Noriko Nishioka ⋅ Yukie Shimizu ⋅ Kohsuke Kudo ⋅ Miki Haseyama
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 607
Decoding 3D Perception via BrainSSD: Synergistic Fusion of EEG Representations from Static and Dynamic Visual Streams
Yincheng Yao ⋅ Enze Shi ⋅ Shu Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 608
Duala: Dual-Level Alignment of Subjects and Stimuli for Cross-Subject fMRI Decoding
Shumeng Li ⋅ Jintao Guo ⋅ Jian Zhang ⋅ Yulin Zhou ⋅ Luyang Cao ⋅ Yinghuan Shi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 609
OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks
Zhihao Peng ⋅ Cheng Wang ⋅ Shengyuan Liu ⋅ Zhiying Liang ⋅ Zanting Ye ⋅ Min Jie Ju ⋅ Peter YM Woo ⋅ Yixuan Yuan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 610
Beyond Pixel Simulation: Pathology Image Generation via Diagnostic Semantic Tokens and Prototype Control
Minghao Han ⋅ Yichen Liu ⋅ Yizhou Liu ⋅ Zizhi Chen ⋅ Jingqun Tang ⋅ Xuecheng Wu ⋅ Dingkang Yang ⋅ Lihua Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 611
MedFG-VQA: Low-Frequency Memory and Graph Attention for Lightweight Medical VQA
haowen gu ⋅ Gensheng Pei ⋅ Zeren Sun ⋅ Mingwu Ren ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Fumin Shen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 612
FISHuman: Fine-grained Single-image 3D Human Reconstruction via Multi-view 4D Remeshing
Hanxi Liu ⋅ Yifang Men ⋅ Zhouhui Lian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 613
DuoMo: Dual Motion Diffusion for World-Space Human Reconstruction
Yufu Wang ⋅ Evonne Ng ⋅ Soyong Shin ⋅ Rawal Khirodkar ⋅ Yuan Dong ⋅ Zhaoen Su ⋅ Jinhyung Park ⋅ Kris Kitani ⋅ Alexander Richard ⋅ Fabian Prada ⋅ Michael Zollhoefer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 614
RAM: Recover Any 3D Human Motion in-the-Wild
Sen Jia ⋅ Ning Zhu ⋅ Jinqin Zhong ⋅ Jiale Zhou ⋅ Huaping Zhang ⋅ Jenq-Neng Hwang ⋅ Lei Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 615
From 2D Alignment to 3D Plausibility: Unifying Heterogeneous 2D Priors and Penetration-Free Diffusion for Occlusion-Robust Two-Hand Reconstruction
Gaoge Han ⋅ Yongkang Cheng ⋅ Zhe Chen ⋅ Shaoli Huang ⋅ Tongliang Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 616
MV-Fashion: Towards Enabling Virtual Try-On and Size Estimation with Multi-View Paired Data
Hunor Laczko ⋅ Libang Jia ⋅ Loc-Phat Truong ⋅ Diego Hernández ⋅ Sergio Escalera ⋅ Jordi Gonzàlez ⋅ Meysam Madadi
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 617
Forecasting 3D Scanpaths in Egocentric Video
Fiona Ryan ⋅ Ishwarya Ananthabhotla ⋅ Yijun Qian ⋅ Judy Hoffman ⋅ James M. ⋅ Vamsi Krishna Ithapu ⋅ Calvin Murdock
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 618
M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
Fan Junqiao ⋅ Yunjiao Zhou ⋅ Yizhuo Yang ⋅ Xinyuan Cui ⋅ Jiarui Zhang ⋅ Lihua Xie ⋅ Jianfei Yang ⋅ Chris Xiaoxuan Lu ⋅ Fangqiang Ding
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 619
ReGenHOI: Unifying Reconstruction and Generation for 3D Human–Object Interaction Understanding
miao xu ⋅ Xiangyu Zhu ⋅ Zidu Wang ⋅ XUSHENG LIANG ⋅ Bao Li ⋅ Jinlin Wu ⋅ Zelin Zang ⋅ Zhen Lei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 620
Through the Frequency Lens: Cross-Domain Generalisable Gaze Estimation with Adaptive Modulation
Yang Xu ⋅ Yiwei Bao ⋅ Feng Lu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 621
Mocap-2-to-3: Multi-view Lifting for Monocular Motion Recovery with 2D Pretraining
Zhumei Wang ⋅ Zechen Hu ⋅ Ruoxi Guo ⋅ Huaijin Pi ⋅ Ziyong Feng ⋅ Liang Zhang ⋅ Mingtao Pei ⋅ Siyuan Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 622
SHands: A Multi-View Dataset and Benchmark for Surgical Hand-Gesture and Error Recognition Toward Medical Training
Le Ma ⋅ Thiago Freitas dos Santos ⋅ Nadia Magnenat-Thalmann ⋅ Katarzyna Wac
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 623
Beyond Static Frames: Temporal Aggregate-and-Restore Vision Transformer for Human Pose Estimation
Hongwei Fang ⋅ Jiahang Cai ⋅ Xun Wang ⋅ Wenwu Yang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 624
IMU-HOI: A Symbiotic Framework for Coherent Human-Object Interaction and Motion Capture via Contact-Conscious Inertial Fusion
Lizhou Lin ⋅ Songpengcheng Xia ⋅ Zengyuan Lai ⋅ Lan Sun ⋅ Jiarui Yang ⋅ Ling Pei
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 625
Learning Forgery-Aware Lip Representations Without Forgery Priors
Bofan Chen ⋅ Hongyu Zhu ⋅ Yi He ⋅ Sichu Liang ⋅ Shilin Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 626
Beyond [CLS] Token: Query-Driven Token-Level Forgery Purification for Generalizable Deepfake Detection
Wang Changshuo ⋅ Jiangming Wang ⋅ Ke-Yue Zhang ⋅ Taiping Yao ⋅ Shouhong Ding ⋅ Shunli Wang ⋅ Ran Yi ⋅ Lizhuang Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 627
GEM-TFL: Bridging Weak and Full Supervision for Forgery Localization through EM-Guided Decomposition and Temporal Refinement
Xiaodong Zhu ⋅ Yuanming Zheng ⋅ Suting Wang ⋅ Junqi Yang ⋅ Yuhong Yang ⋅ Weiping Tu ⋅ Zhongyuan Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 628
TokenTrace: Multi-Concept Attribution through Watermarked Token Recovery
Li Zhang ⋅ Shruti Agarwal ⋅ John Collomosse ⋅ Pengtao Xie ⋅ Vishal Asnani
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 629
Unleashing Vision-Language Semantics for Deepfake Video Detection
Jiawen Zhu ⋅ Yunqi Miao ⋅ Xueyi Zhang ⋅ Jiankang Deng ⋅ Guansong Pang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 630
A Difference-in-Difference Approach to Detecting AI-Generated Images
Xinyi Qi ⋅ Kai Ye ⋅ Chengchun Shi ⋅ Ying Yang ⋅ Jin Zhu ⋅ Hongyi Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 631
RDFace: A Benchmark Dataset for Rare Disease Facial Image Analysis under Extreme Data Scarcity and Phenotype-Aware Synthetic Generation
Ganlin Feng ⋅ Yuxi Long ⋅ Hafsa Moontari Ali ⋅ Erin Lou ⋅ Fahad Butt ⋅ Qian Liu ⋅ Yang Wang ⋅ Pingzhao Hu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 632
ActivityForensics: A Comprehensive Benchmark for Localizing Manipulated Activity in Videos
Peijun Bao ⋅ Anwei Luo ⋅ Gang Pan ⋅ Alex C. Kot ⋅ Xudong Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 633
Zero-shot Detection of AI-Generated Image via RAW-RGB Alignment
Haiwei Wu ⋅ Fengpeng Li ⋅ Zhilin Tu ⋅ Yuanman Li ⋅ Xiong Li ⋅ Jiantao Zhou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 634
Scaling Up AI-Generated Image Detection with Generator-Aware Prototypes
Ziheng Qin ⋅ Yuheng Ji ⋅ Renshuai Tao ⋅ Yuxuan Tian ⋅ Yuyang Liu ⋅ Yipu Wang ⋅ Xiaolong Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 635
Investigating Self-Supervised Representations for Audio-Visual Deepfake Detection
Dragos-Alexandru Boldisor ⋅ Stefan Smeu ⋅ Dan Oneata ⋅ Elisabeta Oneata
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 636
TIACam: Text-Anchored Invariant Feature Learning with Auto-Augmentation for Camera-Robust Zero-Watermarking
Abdullah All Tanvir ⋅ Agnibh Dasgupta ⋅ Xin Zhong
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 637
FastRef: Fast Prototype Refinement for Few-shot Industrial Anomaly Detection
Yufei Li ⋅ Long Tian ⋅ Yuyang Dai ⋅ Wenchao Chen ⋅ Liang Bao ⋅ Xiyang Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 638
RC-NF: Robot-Conditioned Normalizing Flow for Real-Time Anomaly Detection in Robotic Manipulation
Shijie Zhou ⋅ Bin Zhu ⋅ Jiarui Yang ⋅ Xiangyu Zhao ⋅ Jingjing Chen ⋅ Yu-Gang Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 639
Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision
yizhou jin ⋅ Yuezhu Feng ⋅ Jinjin Zhang ⋅ Peng Wang ⋅ Qingjie Liu ⋅ Yunhong Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 640
MMR-AD: A Large-Scale Multimodal Dataset for Benchmarking General Anomaly Detection with Multimodal Large Language Models
Xincheng Yao ⋅ Zefeng Qian ⋅ Chao Shi ⋅ Jiayang Song ⋅ Chongyang Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 641
Wavelet-Driven 3D Anomaly Detection under Pose-Agnostic and Sparse-View
Mingwen Shao ⋅ Qiao Zhang ⋅ Xinyuan Chen ⋅ Xiang Lv ⋅ Lingzhuang Meng ⋅ Chang Liu ⋅ Qinglin Zhan ⋅ Ling Jian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 642
Hunting Normality from Query Sample via Residual Learning for Generalist Anomaly Detection
Xiaolei Wang ⋅ Yuexin Wang ⋅ Tianhong Dai ⋅ Huihui Bai ⋅ Yao Zhao ⋅ Jimin Xiao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 643
GPFlow: Gaussian Prototype Probability Flow for Unsupervised Multi-Modal Anomaly Detection
YITING LI ⋅ Xulei Yang ⋅ Jingyi Liao ⋅ Jing Zhang ⋅ Fayao Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 644
HP-Edit: A Human-Preference Post-Training Framework for Image Editing
Fan Li ⋅ Chonghuinan Wang ⋅ Lina Lei ⋅ Yuping Qiu ⋅ Jiaqi Xu ⋅ Jiaxiu Jiang ⋅ Xinran Qin ⋅ Zhikai Chen ⋅ Fenglong Song ⋅ Zhixin Wang ⋅ Renjing Pei ⋅ Wangmeng Zuo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 645
It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models
Anne Harrington ⋅ A. Koepke ⋅ Shyamgopal Karthik ⋅ Trevor Darrell ⋅ Alexei A. Efros
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 646
RebRL: Reinforcing Discrete Visual Diffusion Models with Rebalanced Timestep Credits
Mu Zhang ⋅ Tianren Ma ⋅ Yunfan Liu ⋅ Kun Hu ⋅ Qixiang Ye
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 647
Ego-InBetween: Generating Object State Transitions in Ego-Centric Videos
Mengmeng Ge ⋅ Takashi Isobe ⋅ Xu Jia ⋅ Yanan Sun ⋅ Zetong Yang ⋅ Weinong Wang ⋅ Dong Zhou ⋅ Dong Li ⋅ Huchuan Lu ⋅ Emad Barsoum
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 648
Towards Fine-Grained Attribution: Instance-Aware Preference Optimization for Aligning Diffusion Models
Jiayang Sun ⋅ Pin Wang ⋅ Hongbo Wang ⋅ Xinyue Liu ⋅ Huaibo Huang ⋅ Ran He
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 649
SketchRevive: Fine-Grained Pixel-to-Vector Sketch Completion with Diffusion-Prior-Guided Multimodal LLMs
Ran Zuo ⋅ Haoxiang Hu ⋅ Chenxi Pei ⋅ Yanxuan Liu ⋅ Wenwen Qiang ⋅ Fang Liu ⋅ Xiaoming Deng ⋅ Cuixia Ma ⋅ Yong-Jin Liu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 650
UniPercept: A Unified Diffusion Model for Generalizable Visual Perception
Zuyan Zhao ⋅ Zhenliang He ⋅ Meina Kan ⋅ Shiguang Shan ⋅ Xilin Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 651
Visual Diffusion Models are Geometric Solvers
Nir Goren ⋅ Shai Yehezkel ⋅ Omer Dahary ⋅ Andrey Voynov ⋅ Or Patashnik ⋅ Daniel Cohen-Or
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 652
You Only Erase Once: Erasing Anything without Bringing Unexpected Content
Yixing Zhu ⋅ Qing Zhang ⋅ Wenju Xu ⋅ Wei-Shi Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 653
Smoothing the Score Function to Enhance Generalization in Diffusion Models
Xinyu Zhou ⋅ Jiawei Zhang ⋅ Stephen J. Wright
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 654
NS-Diff: Fluid Navier–Stokes Guided Video Diffusion via Reinforcement Learning
Zijun Deng ⋅ Yuxin Peng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 655
PropFly: Learning to Propagate via On-the-Fly Supervision from Pre-trained Video Diffusion Models
Wonyong Seo ⋅ Jaeho Moon ⋅ Jaehyup Lee ⋅ Soo Ye Kim ⋅ Munchurl Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 656
Generative Neural Video Compression via Video Diffusion Prior
Qi Mao ⋅ Hao Cheng ⋅ Tinghan Yang ⋅ Libiao Jin ⋅ Siwei Ma
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 657
AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation
Haoyue Tan ⋅ Shengnan Wang ⋅ Yulin Qiao ⋅ juncheng zhang ⋅ Youhui Bai ⋅ Ping Gong ⋅ Zewen Jin ⋅ Cheng Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 658
Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
Johannes Schusterbauer ⋅ Ming Gui ⋅ Yusong Li ⋅ Pingchuan Ma ⋅ Felix Krause ⋅ Björn Ommer
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 659
Image Diffusion Preview with Consistency Solver
Fu-Yun Wang ⋅ Hao Zhou ⋅ Liangzhe Yuan ⋅ Sanghyun Woo ⋅ Boqing Gong ⋅ Bohyung Han ⋅ Ming-Hsuan Yang ⋅ Han Zhang ⋅ Yukun Zhu ⋅ Ting Liu ⋅ Long Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 660
The Drift Kernel: Why Diffusion Models Change Even When Told Not To
Gokul Srinath Seetha Ram ⋅ Rashmi Elavazhagan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 661
Interpretable Prompts made Edit-Friendly: Token-to-Token Similarity Reduction in dLLMs for Edit-Friendly Hard Prompt Inversion
Naresh Kumar Devulapally ⋅ Shruti Agarwal ⋅ Vishal Asnani ⋅ Vishnu Suresh Lokhande
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 662
LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration
Peiliang Cai ⋅ Jiacheng Liu ⋅ Haowen Xu ⋅ Xinyu Wang ⋅ Chang Zou ⋅ Linfeng Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 663
Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
Tianci Bi ⋅ Xiaoyi Zhang ⋅ Yan Lu ⋅ Nanning Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 664
Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration
Jiaqi Han ⋅ Juntong Shi ⋅ Puheng Li ⋅ Haotian Ye ⋅ Qiushan Guo ⋅ Stefano Ermon
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 665
Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation
Yi Wu ⋅ Shengju Qian ⋅ Lingting Zhu ⋅ Lei Liu ⋅ Wandi Qiao ⋅ Ziqiang Li ⋅ Lequan Yu ⋅ Bin Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 666
EasyOmnimatte: Taming Pretrained Inpainting Diffusion Models for End-to-End Video Layered Decomposition
Yihan Hu ⋅ Xuelin Chen ⋅ Xiaodong Cun
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 667
Hierarchical Codec Diffusion for Video-to-Speech Generation
Jiaxin Ye ⋅ Gaoxiang Cong ⋅ Chenhui Wang ⋅ Xin-Cheng Wen ⋅ Zhaoyang Li ⋅ Boyuan Cao ⋅ Hongming Shan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 668
Semantic Alignment for Pose-Invariant Identity Preserving Diffusion
Jiwon Kim ⋅ SeonHwa Kim ⋅ Soobin Park ⋅ Eunju Cha ⋅ Kyong Hwan Jin
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 669
Causality in Video Diffusers is Separable from Denoising
Xingjian Bai ⋅ Guande He ⋅ Zhengqi Li ⋅ Eli Shechtman ⋅ Xun Huang ⋅ Zongze Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 670
2ndMatch: Finetuning Pruned Diffusion Models via Second-Order Jacobian Matching
Caleb Zheng ⋅ Eli Shlizerman
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 671
Hear What You See: Video-to-Audio Generation with Diffusion Transformer and Semantic-Temporal Alignment-Ranked Direct Preference Optimization
Kai Wang ⋅ Tao Zhou ⋅ jiayi lei ⋅ Jing Wang ⋅ Jinman Zhao ⋅ Weiguo Pian ⋅ Yuan Cheng ⋅ Yapeng Tian ⋅ Peng Gao ⋅ Bin Fu ⋅ Yihao Liu ⋅ Dimitrios Hatzinakos ⋅ Yuewen Cao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 672
MacTok: Robust Continuous Tokenization for Image Generation
Hengyu Zeng ⋅ Xin Gao ⋅ Guanghao Li ⋅ Yuxiang Yan ⋅ Jiaoyang Ruan ⋅ Ma Junpeng ⋅ Haoyu Albert Wang ⋅ Jian Pu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 673
Group Editing: Edit Multiple Images in One Go
Yue Ma ⋅ Xinyu Wang ⋅ Qianli Ma ⋅ Qinghe Wang ⋅ Mingzhe Zheng ⋅ xiangpeng yang ⋅ Hao Li ⋅ Chongbo Zhao ⋅ Jixuan Ying ⋅ Harry Yang ⋅ Hongyu Liu ⋅ Qifeng Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 674
Adaptive Video Distillation: Mitigating Oversaturation and Temporal Collapse in Few-Step Generation
Yuyang You ⋅ Yongzhi Li ⋅ Jiahui Li ⋅ Yadong Mu ⋅ Quan Chen ⋅ Peng Jiang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 675
Beyond the Golden Data: Resolving the Motion-Vision Quality Dilemma via Timestep Selective Training
Xiangyang Luo ⋅ Qingyu Li ⋅ Yuming Li ⋅ Guanbo Huang ⋅ Yongjie Zhu ⋅ Wenyu Qin ⋅ Meng Wang ⋅ Pengfei Wan ⋅ Shao-Lun Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 676
Toward Diffusible High-Dimensional Latent Spaces: A Frequency Perspective
Bolin Lai ⋅ XuDong Wang ⋅ Saketh Rambhatla ⋅ James M. ⋅ Zsolt Kira ⋅ Rohit Girdhar ⋅ Ishan Misra
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 677
Elucidating the SNR-t Bias of Diffusion Probabilistic Models
Meng Yu ⋅ Lei Sun ⋅ Jianhao Zeng ⋅ Xiangxiang Chu ⋅ Kun Zhan
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 678
What Is It Like to Be a Noise? An Entropy-based Gaussian Noise Regularization for Diffusion Models
Pascal Chang ⋅ Kai Lascheit ⋅ Jingwei Tang ⋅ Markus Gross ⋅ Vinicius Azevedo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 679
FlashVSR: Towards Real-time Diffusion-Based Streaming Video Super Resolution
Junhao Zhuang ⋅ Shi Guo ⋅ Xin Cai ⋅ Xiaohui Li ⋅ Yihao Liu ⋅ Chun Yuan ⋅ Tianfan Xue
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 680
DiffusionHarmonizer: Bridging Neural Reconstruction and Photorealistic Simulation with Online Diffusion Enhancer
Yuxuan Zhang ⋅ Katarina Tothova ⋅ Zian Wang ⋅ Kangxue Yin ⋅ Haithem Turki ⋅ Riccardo de Lutio ⋅ Yen-Yu Chang ⋅ Or Litany ⋅ Sanja Fidler ⋅ Žan Gojčič
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 681
GDRO: Group-level Reward Post-training Suitable for Diffusion Models
Yiyang Wang ⋅ Xi Chen ⋅ Xiaogang Xu ⋅ Yu Liu ⋅ Hengshuang Zhao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 682
RFDM: Residual Flow Diffusion Models for Video Editing
Mohammadreza Salehi ⋅ Mehdi Noroozi ⋅ Luca Morreale ⋅ Ruchika Chavhan ⋅ Malcolm Chadwick ⋅ Alberto Gil Couto Pimentel Ramos ⋅ Abhinav Mehrotra
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 683
FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing
Yucheng Liao ⋅ Jiajun Liang ⋅ Kaiqian Cui ⋅ Baoquan Zhao ⋅ Haoran Xie ⋅ Wei Liu ⋅ Qing Li ⋅ Xudong Mao
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 684
Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models
Ning Han ⋅ Zhenyu Ge ⋅ Feng Han ⋅ Yuhua Sun ⋅ Chengqing Li ⋅ Jingjing Chen
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 685
HierEdit: Region-Aware Hierarchical Diffusion for Efficient High-Resolution Editing
Yuyao Zhang ⋅ Alexander Huang-Menders ⋅ Yu-Wing Tai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 686
CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration
Xiefan Guo ⋅ Xinzhu Ma ⋅ Haiyu Zhang ⋅ Di Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 687
Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers
Yiqing Shi ⋅ Yiren Song ⋅ Mike Zheng Shou
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 688
DeltaQuant: 4-bit Video Diffusion Models with Spatiotemporal Delta Smoothing
Xingyang Li ⋅ Samuel Tesfai ⋅ Zhekai Zhang ⋅ Haocheng Xi ⋅ Shuo Yang ⋅ Lvmin Zhang ⋅ Yufei Sun ⋅ Kelly Peng ⋅ Maneesh Agrawala ⋅ Ion Stoica ⋅ Kurt Keutzer ⋅ Jun-Yan Zhu ⋅ Song Han ⋅ Yujun Lin ⋅ Muyang Li
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 689
D2Cache: Second-Order Delta Caching for Higher Video Diffusion Acceleration
Enhuai Liu ⋅ Yunke Wang ⋅ Changming Sun ⋅ Chang Xu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 690
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
Zehong Ma ⋅ Longhui Wei ⋅ Shuai Wang ⋅ Shiliang Zhang ⋅ Qi Tian
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 691
Test-Time Alignment of Text-to-Image Diffusion Models via Null-Text Embedding Optimisation
Taehoon Kim ⋅ Henry Gouk ⋅ Timothy Hospedales
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 692
Accelerating Diffusion Model Training under Minimal Budgets: A Condensation-Based Perspective
Rui Huang ⋅ Shitong Shao ⋅ zikai zhou ⋅ Pukun Zhao ⋅ Hangyu Guo ⋅ Tian Ye ⋅ Lichen Bai ⋅ Shuo Yang ⋅ Zeke Xie
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 693
Denoising as Path Planning: Training-Free Acceleration of Diffusion Models with DPCache
Bowen Cui ⋅ Yuanbin Wang ⋅ Huajiang Xu ⋅ Biaolong Chen ⋅ Aixi Zhang ⋅ Hao Jiang ⋅ Zhengzheng Jin ⋅ Xu Liu ⋅ Pipei Huang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 694
Taming Sampling Perturbations with Variance Expansion Loss for Latent Diffusion Models
Qifan Li ⋅ Xingyu Zhou ⋅ Jinhua Zhang ⋅ Weiyi You ⋅ Shuhang Gu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 695
Guiding Diffusion Models with Semantically Degraded Conditions
shilong han ⋅ Yuming Zhang ⋅ Hongxia Wang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 696
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion
Yueming Pan ⋅ Ruoyu Feng ⋅ Qi Dai ⋅ Yuqi Wang ⋅ Wenfeng LIN ⋅ MINGYU GUO ⋅ Chong Luo ⋅ Nanning Zheng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 697
Reviving ConvNeXt for Efficient Convolutional Diffusion Models
Taesung Kwon ⋅ Lorenzo Bianchi ⋅ Lennart Wittke ⋅ Felix Watine ⋅ Fabio Carrara ⋅ Jong Chul ⋅ Romann Weber ⋅ Vinicius Azevedo
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 698
Coupled Diffusion Sampling for Training-Free Multi-View Image Editing
Hadi Alzayer ⋅ Yunzhi Zhang ⋅ Chen Geng ⋅ Jia-Bin Huang ⋅ Jiajun Wu
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 699
Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance
Liangyu Yuan ⋅ Yufei Huang ⋅ Mingkun Lei ⋅ Tong Zhao ⋅ Ruoyu Wang ⋅ Chi Changxi ⋅ Yiwei Wang ⋅ Chi Zhang
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 700
Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation
Kwanyoung Lee ⋅ SeungJu Cha ⋅ Yebin Ahn ⋅ Hyunwoo Oh ⋅ Sungho Koh ⋅ Dong-Jin Kim
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 701
SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models
Jiaji Zhang ⋅ Ruichao Sun ⋅ Hailiang Zhao ⋅ Jiaju Wu ⋅ Peng Chen ⋅ Hao Li ⋅ Yuying Liu ⋅ Kingsum Chow ⋅ GANG XIONG ⋅ Shuiguang Deng
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 702
BAgger: Backwards Aggregation for Mitigating Drift in Autoregressive Video Diffusion Models
Ryan Po ⋅ Eric Ryan Chan ⋅ Changan Chen ⋅ Gordon Wetzstein
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 703
Accelerating Autoregressive Video Diffusion via History-Guided Cache and Residual Correction
Kepan Nan ⋅ Wangbo Zhao ⋅ Penghao Zhou ⋅ Jun Li ⋅ Zhenheng Yang ⋅ Jian Yang ⋅ Ying Tai
Poster
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A 704
MusicInfuser: Making Video Diffusion Listen and Dance
Susung Hong ⋅ Ira Kemelmacher-Shlizerman ⋅ Brian Curless ⋅ Steve M. Seitz
Poster Session
Sun Jun 07 02:30 PM -- 04:30 PM (PDT) @ ExHall A None
Poster Session 6